BLASTX nr result

ID: Lithospermum22_contig00015696 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00015696
         (1397 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   435   e-119
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   382   e-103
ref|XP_002312938.1| predicted protein [Populus trichocarpa] gi|2...   380   e-103
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           373   e-101
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   373   e-101

>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  435 bits (1119), Expect = e-119
 Identities = 217/418 (51%), Positives = 296/418 (70%)
 Frame = +3

Query: 120  VQCALTKKGHRLLTSITTTSEPLHNQRXXXXXXXXXXXXXXXXXXXXILSTTTTHPHISS 299
            +QCAL+K+G   L+S+    +P  + R                    +LS TTTHP++SS
Sbjct: 21   IQCALSKQGQLFLSSVAR--DPSASNRLICKFIASSSKSIALNALSHLLSPTTTHPYLSS 78

Query: 300  LSLPLYRLISEASWFDWNAKLVAQLIASLYKQERFDEAECLYSEAISRLIGRERDLCNFY 479
            L+LPLY  ISEASWF WN KL+A +IA LYKQ +  EAE L SE + +L  RERDL +FY
Sbjct: 79   LALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLVSFY 138

Query: 480  CHSVEFSSKNGSTKGVLEFCEKLKRLVSDTSSIYVQKRAYESMINGLCVTGLVLEAEDFM 659
            C+ ++  SK+ S +GV +   +L R+VS++SS+YV++RAY+SMI+ LC  GL LEAE+ +
Sbjct: 139  CNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAENLI 198

Query: 660  VEMKDLGLKASIFEYRSIVYAYGKAGLFEDMKKIISNIESEGFVMDTVCCNMVLSSLGSH 839
             EM+  GLK S+FE+RS+VY YG+ GL EDM++I+  + +EGF +DTV  NMVLSS G++
Sbjct: 199  EEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSYGAY 258

Query: 840  NELQEMILWLRRMKDLDIPLSIRSFNTVLNSCPNLMSMLENPKNIPLSIEELVENLSGSE 1019
            N+  EM+ WL+RMK+  IP SIR++N+VLNSCP +MS+L++ K  P +I+EL+E L G E
Sbjct: 259  NKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLKGDE 318

Query: 1020 TKLVEELIMSSSSVIEEAMEWGSSELKLDLHGMHLGSAYLVFLKWLGELRHICLAKDCIF 1199
              LV+ELI   S V+ E MEW  SE KLDLHGMHLGSAYL+ L+W  ELR+   A + + 
Sbjct: 319  ALLVKELI--GSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAAEYVM 376

Query: 1200 PTQITVVCGLGKHSSIRGESSVKRLIREIIEQMNCPLRIDRKNVGCFIVKGSVFKFWL 1373
            P +ITVVCG GKHSS+RGES VKR++RE++ +   P++IDRKN+GCF+ K  V K WL
Sbjct: 377  PVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNWL 434


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  382 bits (980), Expect = e-103
 Identities = 191/418 (45%), Positives = 278/418 (66%), Gaps = 3/418 (0%)
 Frame = +3

Query: 129  ALTKKGHRLLTSI---TTTSEPLHNQRXXXXXXXXXXXXXXXXXXXXILSTTTTHPHISS 299
            AL+K+G R L+S+   TT  + +   R                    +L+  ++H H+SS
Sbjct: 44   ALSKQGQRFLSSLAIATTKGDTVATNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSS 103

Query: 300  LSLPLYRLISEASWFDWNAKLVAQLIASLYKQERFDEAECLYSEAISRLIGRERDLCNFY 479
            L+  LY  I+EA WF WN KLVA ++A L KQ R+DE+  L S++IS+L  +ERDL  FY
Sbjct: 104  LAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKERDLARFY 163

Query: 480  CHSVEFSSKNGSTKGVLEFCEKLKRLVSDTSSIYVQKRAYESMINGLCVTGLVLEAEDFM 659
            C+ VE  SK  S +G       L +LV +++S+YV+++ Y+SM+NGLC  G   EAE  +
Sbjct: 164  CNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPREAETLI 223

Query: 660  VEMKDLGLKASIFEYRSIVYAYGKAGLFEDMKKIISNIESEGFVMDTVCCNMVLSSLGSH 839
             EM   G++ S+FE++ +VYAYG  G FE+M K +  +E  GF +DTVC NM+L+S G+H
Sbjct: 224  EEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGAH 283

Query: 840  NELQEMILWLRRMKDLDIPLSIRSFNTVLNSCPNLMSMLENPKNIPLSIEELVENLSGSE 1019
            N L EM+LWL++MKDL IP S+R+ N+ LNSCP +MSM++N  + P+SI +L++ LS  E
Sbjct: 284  NALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILSEDE 343

Query: 1020 TKLVEELIMSSSSVIEEAMEWGSSELKLDLHGMHLGSAYLVFLKWLGELRHICLAKDCIF 1199
              LV+E++  +SSV++EAM+W  +E KLDLHG HL SAYL+ L W+ E+R    + + + 
Sbjct: 344  ALLVKEIV--TSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVN 401

Query: 1200 PTQITVVCGLGKHSSIRGESSVKRLIREIIEQMNCPLRIDRKNVGCFIVKGSVFKFWL 1373
            PT+ITVVCG G HS +RGES VK ++++ + +   P+RIDR+N+GCFI KG V + WL
Sbjct: 402  PTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVEEWL 459


>ref|XP_002312938.1| predicted protein [Populus trichocarpa] gi|222849346|gb|EEE86893.1|
            predicted protein [Populus trichocarpa]
          Length = 473

 Score =  380 bits (977), Expect = e-103
 Identities = 190/368 (51%), Positives = 265/368 (72%), Gaps = 1/368 (0%)
 Frame = +3

Query: 273  TTTHPHISSLSLPLYRLISEASWFDWNAKLVAQLIASLYKQERFDEAECLYSEAISRLIG 452
            +T HP +  L+LPLY  ISEASWF WN KLVAQ++  L KQ    E + L SE +SRL  
Sbjct: 107  STHHPLLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQF 166

Query: 453  RERDLCNFYCHSVEFSSKNGSTKGVLEFCEKLKRLVSDTSSIYVQKRAYESMINGLCVTG 632
            +ER+L  FYC+ + F+SK+   +G  +   +L + VSD++S+YV+K+ Y++MI+GLC  G
Sbjct: 167  KERELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSDSNSVYVKKQGYKAMISGLCEMG 226

Query: 633  LVLEAEDFMVEMKDLGLKASIFEYRSIVYAYGKAGLFEDMKKIISNIESEGFVMDTVCCN 812
               EAED + EM++ GLK  +FE+R ++Y YG+ GLF+DM++I+  +ES    +DTVC N
Sbjct: 227  RAREAEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCAN 286

Query: 813  MVLSSLGSHNELQEMILWLRRMKDLDIPLSIRSFNTVLNSCPNLMSMLEN-PKNIPLSIE 989
            MVL+S G+HN L EM LWLR+MK L IPLSIR+ N+VLNSCP +M+++ N   + P+SI+
Sbjct: 287  MVLASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQ 346

Query: 990  ELVENLSGSETKLVEELIMSSSSVIEEAMEWGSSELKLDLHGMHLGSAYLVFLKWLGELR 1169
            EL++ LS  E  LV+ELI   SSV++EA +W +SE KLDLHGMHLGSAY++ L+W+ E R
Sbjct: 347  ELLKILSEEEAMLVKELI--ESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETR 404

Query: 1170 HICLAKDCIFPTQITVVCGLGKHSSIRGESSVKRLIREIIEQMNCPLRIDRKNVGCFIVK 1349
            +     + + P +ITVVCG G HS++RGES VK +I +I+ Q   P+RIDRKN+GCF+ K
Sbjct: 405  NRLSDGEHVIPAEITVVCGSGNHSTVRGESPVKSMITQIMAQTRSPMRIDRKNIGCFVAK 464

Query: 1350 GSVFKFWL 1373
            G+V K WL
Sbjct: 465  GNVVKKWL 472


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  373 bits (957), Expect = e-101
 Identities = 191/417 (45%), Positives = 275/417 (65%), Gaps = 3/417 (0%)
 Frame = +3

Query: 132  LTKKGHRLLTSITTTS---EPLHNQRXXXXXXXXXXXXXXXXXXXXILSTTTTHPHISSL 302
            L K G R L+S+++ +   +P    R                    +LS  T+HPH+S  
Sbjct: 85   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144

Query: 303  SLPLYRLISEASWFDWNAKLVAQLIASLYKQERFDEAECLYSEAISRLIGRERDLCNFYC 482
            +L LY  I+EASWFDWN KL+A+LIA L KQERFDE+E L S A+SRL   ERD   F C
Sbjct: 145  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204

Query: 483  HSVEFSSKNGSTKGVLEFCEKLKRLVSDTSSIYVQKRAYESMINGLCVTGLVLEAEDFMV 662
            + VE +SK GS +G  E   +L+ ++  +SS+YV+ +AY+SM++GLC      +AE  + 
Sbjct: 205  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 264

Query: 663  EMKDLGLKASIFEYRSIVYAYGKAGLFEDMKKIISNIESEGFVMDTVCCNMVLSSLGSHN 842
            EM+   +K  +FEY+S++Y YG+ GLF+DM +++  + +EG  +DTVC NMVLSS G+H+
Sbjct: 265  EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 324

Query: 843  ELQEMILWLRRMKDLDIPLSIRSFNTVLNSCPNLMSMLENPKNIPLSIEELVENLSGSET 1022
             L +M  WL+++K  ++P SIR++N+VLNSCP ++SML++  + P+S+ EL   L+  E 
Sbjct: 325  ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEA 384

Query: 1023 KLVEELIMSSSSVIEEAMEWGSSELKLDLHGMHLGSAYLVFLKWLGELRHICLAKDCIFP 1202
             LV EL  + SSV++EA+EW + E KLDLHGMHL S+YL+ L+W+ E R     + C+ P
Sbjct: 385  LLVHEL--TQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 442

Query: 1203 TQITVVCGLGKHSSIRGESSVKRLIREIIEQMNCPLRIDRKNVGCFIVKGSVFKFWL 1373
             +I VV G GKHS++RGES VK L+++I+ +   P+RIDRKNVG FI KG   K WL
Sbjct: 443  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWL 499


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  373 bits (957), Expect = e-101
 Identities = 191/417 (45%), Positives = 275/417 (65%), Gaps = 3/417 (0%)
 Frame = +3

Query: 132  LTKKGHRLLTSITTTS---EPLHNQRXXXXXXXXXXXXXXXXXXXXILSTTTTHPHISSL 302
            L K G R L+S+++ +   +P    R                    +LS  T+HPH+S  
Sbjct: 88   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147

Query: 303  SLPLYRLISEASWFDWNAKLVAQLIASLYKQERFDEAECLYSEAISRLIGRERDLCNFYC 482
            +L LY  I+EASWFDWN KL+A+LIA L KQERFDE+E L S A+SRL   ERD   F C
Sbjct: 148  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207

Query: 483  HSVEFSSKNGSTKGVLEFCEKLKRLVSDTSSIYVQKRAYESMINGLCVTGLVLEAEDFMV 662
            + VE +SK GS +G  E   +L+ ++  +SS+YV+ +AY+SM++GLC      +AE  + 
Sbjct: 208  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 267

Query: 663  EMKDLGLKASIFEYRSIVYAYGKAGLFEDMKKIISNIESEGFVMDTVCCNMVLSSLGSHN 842
            EM+   +K  +FEY+S++Y YG+ GLF+DM +++  + +EG  +DTVC NMVLSS G+H+
Sbjct: 268  EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 327

Query: 843  ELQEMILWLRRMKDLDIPLSIRSFNTVLNSCPNLMSMLENPKNIPLSIEELVENLSGSET 1022
             L +M  WL+++K  ++P SIR++N+VLNSCP ++SML++  + P+S+ EL   L+  E 
Sbjct: 328  ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEA 387

Query: 1023 KLVEELIMSSSSVIEEAMEWGSSELKLDLHGMHLGSAYLVFLKWLGELRHICLAKDCIFP 1202
             LV EL  + SSV++EA+EW + E KLDLHGMHL S+YL+ L+W+ E R     + C+ P
Sbjct: 388  LLVHEL--TQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 445

Query: 1203 TQITVVCGLGKHSSIRGESSVKRLIREIIEQMNCPLRIDRKNVGCFIVKGSVFKFWL 1373
             +I VV G GKHS++RGES VK L+++I+ +   P+RIDRKNVG FI KG   K WL
Sbjct: 446  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWL 502