BLASTX nr result

ID: Akebia24_contig00040537 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00040537
         (780 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278241.1| PREDICTED: pentatricopeptide repeat-containi...   164   3e-38
ref|XP_007041747.1| Tetratricopeptide repeat (TPR)-like superfam...   163   6e-38
ref|XP_002306200.1| hypothetical protein POPTR_0004s18470g [Popu...   151   3e-34
ref|XP_006486706.1| PREDICTED: pentatricopeptide repeat-containi...   144   4e-32
ref|XP_006422555.1| hypothetical protein CICLE_v10030410mg [Citr...   143   6e-32
ref|NP_195239.1| pentatricopeptide repeat-containing protein [Ar...   139   9e-31
ref|XP_002867090.1| pentatricopeptide repeat-containing protein ...   137   4e-30
ref|XP_004290750.1| PREDICTED: pentatricopeptide repeat-containi...   136   1e-29
ref|XP_006412144.1| hypothetical protein EUTSA_v10024444mg [Eutr...   135   2e-29
gb|EYU37981.1| hypothetical protein MIMGU_mgv1a023065mg, partial...   132   1e-28
ref|XP_006283134.1| hypothetical protein CARUB_v10004161mg [Caps...   132   1e-28
ref|XP_007198997.1| hypothetical protein PRUPE_ppa002025mg [Prun...   128   2e-27
gb|EXB24037.1| hypothetical protein L484_006069 [Morus notabilis]     126   8e-27
ref|XP_006845697.1| hypothetical protein AMTR_s00019p00238380 [A...   115   1e-23
ref|XP_004154387.1| PREDICTED: pentatricopeptide repeat-containi...   107   4e-21
ref|XP_003597735.1| Pentatricopeptide repeat-containing protein ...   103   9e-20
ref|XP_002302824.2| hypothetical protein POPTR_0002s22590g [Popu...   102   2e-19
ref|XP_007139896.1| hypothetical protein PHAVU_008G067700g [Phas...   101   3e-19
ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containi...   101   3e-19
ref|XP_002265522.1| PREDICTED: putative pentatricopeptide repeat...   101   3e-19

>ref|XP_002278241.1| PREDICTED: pentatricopeptide repeat-containing protein At4g35130,
           chloroplastic [Vitis vinifera]
           gi|297744563|emb|CBI37825.3| unnamed protein product
           [Vitis vinifera]
          Length = 802

 Score =  164 bits (416), Expect = 3e-38
 Identities = 83/173 (47%), Positives = 117/173 (67%), Gaps = 1/173 (0%)
 Frame = +2

Query: 263 SNAPEDISPNRR-EGPTKDRNPDLLLKQRISKTSPFYRNRSKIVKSVTRPHENLSLTRTL 439
           S  P + S  +R   P  + + DL+LK RI KT+   RN+S +V+       ++SLTR L
Sbjct: 13  SKRPRNASREKRARTPQTNPDTDLILKPRIFKTARSKRNQSFLVE-----RNSVSLTRAL 67

Query: 440 FSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNF 619
            SYV  G +  AL LF+ +   D F+WN++IRGF +NG + +A++ Y+ M+F G+RGDNF
Sbjct: 68  SSYVERGYMKNALDLFENMRQCDTFIWNVMIRGFVDNGLFWDAVDFYHRMEFGGVRGDNF 127

Query: 620 TYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           TYPFVIK+C G   L EG ++H K+IK GL+LDI+I NS+IIMY K+GC+E A
Sbjct: 128 TYPFVIKACGGLYDLAEGERVHGKVIKSGLDLDIYIGNSLIIMYAKIGCIESA 180



 Score = 65.9 bits (159), Expect = 2e-08
 Identities = 30/114 (26%), Positives = 64/114 (56%)
 Frame = +2

Query: 437 LFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDN 616
           +  Y   GC++ A  +F+++   D+  WN +I G+ + G    ++  +  M+  G++ D 
Sbjct: 168 IIMYAKIGCIESAEMVFREMPVRDLVSWNSMISGYVSVGDGWRSLSCFREMQASGIKLDR 227

Query: 617 FTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           F+   ++ +C+    L  G +IH ++++  LELD+ +  S++ MY K G +++A
Sbjct: 228 FSVIGILGACSLEGFLRNGKEIHCQMMRSRLELDVMVQTSLVDMYAKCGRMDYA 281



 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 39/119 (32%), Positives = 63/119 (52%)
 Frame = +2

Query: 407 PHENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNL 586
           PH  L  T  +  Y   G L  A  LF Q+N+ ++  WN +I  +T NG   +A+ L+  
Sbjct: 362 PHLVLE-TALVDMYGECGKLKPAECLFGQMNERNLISWNAMIASYTKNGENRKAMTLFQD 420

Query: 587 MKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLG 763
           +  + L+ D  T   ++ +    +SL E  +IH  + K+ L+ + F+ NSI+ MYGK G
Sbjct: 421 LCNKTLKPDATTIASILPAYAELASLREAEQIHGYVTKLKLDSNTFVSNSIVFMYGKCG 479


>ref|XP_007041747.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           [Theobroma cacao] gi|508705682|gb|EOX97578.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative [Theobroma cacao]
          Length = 810

 Score =  163 bits (413), Expect = 6e-38
 Identities = 78/170 (45%), Positives = 119/170 (70%)
 Frame = +2

Query: 263 SNAPEDISPNRREGPTKDRNPDLLLKQRISKTSPFYRNRSKIVKSVTRPHENLSLTRTLF 442
           S AP  +S N+ + P  +RNP  + K R SK +   R +S+  +++  P +NL LTR L 
Sbjct: 16  SPAPIIVSQNQFDAPETNRNPYAVTKPRFSKPTQLRRTQSRTSQTLIEP-KNLKLTRALP 74

Query: 443 SYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFT 622
           ++V+SG ++ AL+LF+++N  D + WNI+I+   +NG +++AI  ++ M+FEG R D FT
Sbjct: 75  AFVDSGSMENALSLFEEMNHWDSYTWNIIIKDLVDNGLFKQAINFFHRMEFEGARPDKFT 134

Query: 623 YPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVE 772
           YPFVIK+C G  SL  G K+H+KL+K+GL+LD++ CNS+I MY K+GCVE
Sbjct: 135 YPFVIKACAGVLSLKGGEKVHAKLVKVGLDLDVYNCNSLISMYMKVGCVE 184



 Score = 72.4 bits (176), Expect = 2e-10
 Identities = 42/119 (35%), Positives = 64/119 (53%)
 Frame = +2

Query: 407 PHENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNL 586
           PH  L  T  +  Y   G L  A  +F QIN  ++  WN ++  +  NG Y EA+EL+  
Sbjct: 368 PHLVLE-TALVDMYGRCGKLKLAEHVFVQINGKNLASWNAMLAAYVQNGQYTEALELFQN 426

Query: 587 MKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLG 763
           + +E L+ D  T   V+ +    +SL EG +IH+ +IK+GL  +  + NSI  +Y K G
Sbjct: 427 IWYESLQPDAITIASVLPAYADLTSLSEGRQIHAFIIKLGLNSNTIVSNSITYLYAKCG 485



 Score = 63.5 bits (153), Expect = 8e-08
 Identities = 32/111 (28%), Positives = 60/111 (54%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y+  GC++    +F+++   D+  WN L+ G+   G    ++     M   G+R D F++
Sbjct: 177 YMKVGCVELGQNVFREMAVRDLVSWNSLLSGYQQVGDGLSSLVSLREMVLVGIRPDRFSF 236

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
              + +C+       G +IH ++I+ G E+D+ +  S+I MYGK G V++A
Sbjct: 237 ISGLGACSIEGCRRSGKEIHCQVIRGGFEMDLMVETSLIDMYGKCGSVDYA 287


>ref|XP_002306200.1| hypothetical protein POPTR_0004s18470g [Populus trichocarpa]
           gi|222849164|gb|EEE86711.1| hypothetical protein
           POPTR_0004s18470g [Populus trichocarpa]
          Length = 784

 Score =  151 bits (381), Expect = 3e-34
 Identities = 76/173 (43%), Positives = 118/173 (68%), Gaps = 2/173 (1%)
 Frame = +2

Query: 266 NAPEDISPNRREGPTKDRNPDLLLKQRISKTSPFY-RNRSKI-VKSVTRPHENLSLTRTL 439
           NA ++ SP + + P   +      K++ ++ SPF  R +SK   K + RP++ L++TR L
Sbjct: 12  NAYKNASPEQNKPPKAAQ-----FKRKTTRKSPFIKRAQSKTSFKPLARPND-LNITRDL 65

Query: 440 FSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNF 619
             +V SG +  AL +F+++N  D F+WN++IRG+TNNG ++EAI+ Y  M+ EG+R DNF
Sbjct: 66  CGFVESGLMGNALDMFEKMNHSDTFIWNVIIRGYTNNGLFQEAIDFYYRMECEGIRSDNF 125

Query: 620 TYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           T+PFVIK+C    +L+ G K+H KLIKIG +LD+++CN +I MY K+G +E A
Sbjct: 126 TFPFVIKACGELLALMVGQKVHGKLIKIGFDLDVYVCNFLIDMYLKIGFIELA 178



 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 36/111 (32%), Positives = 63/111 (56%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y   G L  A  +F Q+N+ ++  WN ++  +  N  Y+EA++++  +  E L+ D  T 
Sbjct: 356 YGKCGELKLAEHVFNQMNEKNMVSWNTMVAAYVQNEQYKEALKMFQHILNEPLKPDAITI 415

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
             V+ +    +S  EG +IHS ++K+GL  + FI N+I+ MY K G ++ A
Sbjct: 416 ASVLPAVAELASRSEGKQIHSYIMKLGLGSNTFISNAIVYMYAKCGDLQTA 466



 Score = 60.5 bits (145), Expect = 7e-07
 Identities = 33/111 (29%), Positives = 60/111 (54%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y+  G ++ A  +F ++   D+  WN ++ G+  +G    ++  +  M   G + D F  
Sbjct: 169 YLKIGFIELAEKVFDEMPVRDLVSWNSMVSGYQIDGDGLSSLMCFKEMLRLGNKADRFGM 228

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
              + +C+    L  G++IH ++I+  LELDI +  S+I MYGK G V++A
Sbjct: 229 ISALGACSIEHCLRSGMEIHCQVIRSELELDIMVQTSLIDMYGKCGKVDYA 279


>ref|XP_006486706.1| PREDICTED: pentatricopeptide repeat-containing protein At4g35130,
           chloroplastic-like [Citrus sinensis]
          Length = 810

 Score =  144 bits (363), Expect = 4e-32
 Identities = 71/174 (40%), Positives = 111/174 (63%), Gaps = 2/174 (1%)
 Frame = +2

Query: 263 SNAPEDISPNRREGPTKDRNP--DLLLKQRISKTSPFYRNRSKIVKSVTRPHENLSLTRT 436
           SN+P   +P++++    + NP        R SK++  ++N++   K    P  N++ TR 
Sbjct: 16  SNSPTRRNPSQKQFKIPETNPTPSFETNARSSKSTHIHKNQTITSKKSIGPR-NITKTRA 74

Query: 437 LFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDN 616
           L   V+SG ++ A  LF++++  D ++WN++IRGF +NG ++EA+E ++ M  EG + D 
Sbjct: 75  LQELVSSGSMESACYLFEKMSYLDTYIWNVVIRGFVDNGLFQEAVEFHHRMVCEGFKADY 134

Query: 617 FTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           FTYPFVIK+C G   L EG K+H  L K GL  D+++CNS+I+MY KLGCVE A
Sbjct: 135 FTYPFVIKACAGLLYLSEGEKVHGSLFKSGLNSDVYVCNSLIVMYMKLGCVECA 188



 Score = 70.9 bits (172), Expect = 5e-10
 Identities = 39/122 (31%), Positives = 67/122 (54%), Gaps = 1/122 (0%)
 Frame = +2

Query: 416 NLSLTRTLFS-YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMK 592
           N++L   L   Y  SG L     LF  + + ++  WN +I  +  NG   EA+EL+  + 
Sbjct: 371 NVALETALIDMYAGSGALKMTEKLFGSMIEKNLVSWNAMIAAYVRNGQNREAMELFQDLW 430

Query: 593 FEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVE 772
            E L+ D  T+  ++ +    ++L + ++IHS + K+GL  +I+I NSI+ MY K G ++
Sbjct: 431 SEPLKPDAMTFASILPAYAEIATLSDSMQIHSLITKLGLVSNIYISNSIVYMYAKCGDLQ 490

Query: 773 FA 778
            A
Sbjct: 491 TA 492



 Score = 67.0 bits (162), Expect = 7e-09
 Identities = 35/111 (31%), Positives = 62/111 (55%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y+  GC++ A  +F ++   D   WN +I G+ + G    ++  +  M+  GLR D F+ 
Sbjct: 179 YMKLGCVECAERMFDEMPVRDTVSWNSMIGGYCSVGDGVSSLVFFKEMQNCGLRYDRFSL 238

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
              + + +    L  G +IH ++IK GLE+D+ +  S++ MYGK G V++A
Sbjct: 239 ISALGAISIEGCLKIGKEIHCQVIKSGLEMDVMVQTSLVDMYGKCGVVDYA 289


>ref|XP_006422555.1| hypothetical protein CICLE_v10030410mg [Citrus clementina]
           gi|557524489|gb|ESR35795.1| hypothetical protein
           CICLE_v10030410mg [Citrus clementina]
          Length = 810

 Score =  143 bits (361), Expect = 6e-32
 Identities = 71/174 (40%), Positives = 110/174 (63%), Gaps = 2/174 (1%)
 Frame = +2

Query: 263 SNAPEDISPNRREGPTKDRNP--DLLLKQRISKTSPFYRNRSKIVKSVTRPHENLSLTRT 436
           SN+P   +P++++    + NP        R SK++  ++N++   K    P  N++ TR 
Sbjct: 16  SNSPTRRNPSQKQFKIPETNPTPSFETNARSSKSTHIHKNQTITSKKSIGPR-NITKTRA 74

Query: 437 LFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDN 616
           L   V+SG ++ A  LF +++  D ++WN++IRGF +NG ++EA+E ++ M  EG + D 
Sbjct: 75  LQELVSSGSMESACYLFDKMSYLDTYIWNVVIRGFVDNGLFQEAVEFHHRMVCEGFKADY 134

Query: 617 FTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           FTYPFVIK+C G   L EG K+H  L K GL  D+++CNS+I+MY KLGCVE A
Sbjct: 135 FTYPFVIKACAGLLYLSEGEKVHGSLFKSGLNSDVYVCNSLIVMYMKLGCVECA 188



 Score = 68.6 bits (166), Expect = 2e-09
 Identities = 38/122 (31%), Positives = 66/122 (54%), Gaps = 1/122 (0%)
 Frame = +2

Query: 416 NLSLTRTLFS-YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMK 592
           N++L   L   Y  SG L     LF  + + ++  WN +I  +  NG   EA+EL+  + 
Sbjct: 371 NVALETALIDMYAGSGALKMTEKLFGSMIEKNLVSWNAMIAAYVRNGQNREAMELFQDLW 430

Query: 593 FEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVE 772
            E L+ D  T+  ++ +    ++L + ++IHS + K+GL  +I+I NSI+  Y K G ++
Sbjct: 431 SEPLKPDAMTFASILPAYAEIATLSDSMQIHSLITKLGLVSNIYISNSIVYTYAKCGDLQ 490

Query: 773 FA 778
            A
Sbjct: 491 TA 492



 Score = 66.6 bits (161), Expect = 9e-09
 Identities = 35/111 (31%), Positives = 62/111 (55%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y+  GC++ A  +F ++   D   WN +I G+ + G    ++  +  M+  GLR D F+ 
Sbjct: 179 YMKLGCVECAERVFDEMPVRDTVSWNSMIGGYCSVGDGVSSLVFFKEMQNCGLRYDRFSL 238

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
              + + +    L  G +IH ++IK GLE+D+ +  S++ MYGK G V++A
Sbjct: 239 ISALGAISIEGCLKIGKEIHCQVIKSGLEMDVMVQTSLVDMYGKCGVVDYA 289


>ref|NP_195239.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75098809|sp|O49619.1|PP350_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g35130, chloroplastic; Flags: Precursor
           gi|2924523|emb|CAA17777.1| putative protein [Arabidopsis
           thaliana] gi|7270464|emb|CAB80230.1| putative protein
           [Arabidopsis thaliana] gi|332661071|gb|AEE86471.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 804

 Score =  139 bits (351), Expect = 9e-31
 Identities = 72/162 (44%), Positives = 103/162 (63%), Gaps = 1/162 (0%)
 Frame = +2

Query: 284 SPNRREGPTKDRNPDLLLKQRISKTSPFY-RNRSKIVKSVTRPHENLSLTRTLFSYVNSG 460
           S N +    ++ N +L     ISK +    R+R K+ K V  P    +LTR L  + +S 
Sbjct: 23  SENHQTTGKRNGNRNLEFDSGISKPARLVLRDRYKVTKQVNDP----ALTRALRGFADSR 78

Query: 461 CLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVIK 640
            ++ AL LF ++N  D FLWN++I+GFT+ G Y EA++ Y+ M F G++ D FTYPFVIK
Sbjct: 79  LMEDALQLFDEMNKADAFLWNVMIKGFTSCGLYIEAVQFYSRMVFAGVKADTFTYPFVIK 138

Query: 641 SCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGC 766
           S  G SSL EG KIH+ +IK+G   D+++CNS+I +Y KLGC
Sbjct: 139 SVAGISSLEEGKKIHAMVIKLGFVSDVYVCNSLISLYMKLGC 180


>ref|XP_002867090.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297312926|gb|EFH43349.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 803

 Score =  137 bits (345), Expect = 4e-30
 Identities = 65/132 (49%), Positives = 93/132 (70%)
 Frame = +2

Query: 371 RNRSKIVKSVTRPHENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNN 550
           R+R K+ K +  P    +LTR L  + +SG ++ AL LF ++N  D F+WN++I+GFT+ 
Sbjct: 49  RDRYKVTKQLNDP----ALTRALRGFADSGLMEDALQLFDEMNKADTFVWNVMIKGFTSC 104

Query: 551 GFYEEAIELYNLMKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFIC 730
           G Y EA++LY  M F G++ D+FTYPFVIKS TG SSL EG KIH+ +IK+    D+++C
Sbjct: 105 GLYFEALQLYCRMVFSGVKADSFTYPFVIKSVTGISSLEEGKKIHAMVIKLRFVSDVYVC 164

Query: 731 NSIIIMYGKLGC 766
           NS+I +Y KLGC
Sbjct: 165 NSLISLYMKLGC 176


>ref|XP_004290750.1| PREDICTED: pentatricopeptide repeat-containing protein At4g35130,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 803

 Score =  136 bits (342), Expect = 1e-29
 Identities = 71/170 (41%), Positives = 104/170 (61%), Gaps = 1/170 (0%)
 Frame = +2

Query: 263 SNAPEDISPNRREGPTKDRNPDLLLKQRISKTSPFYRNRSKIVKSVTRPHENLS-LTRTL 439
           S AP ++S  R   P  ++  D     R  K     + R   +K  T   ++ S L + L
Sbjct: 14  STAPPNLSRKRAAEPKANQGAD---PPRFPKW--VRKGRKPPMKPTTMADQHSSSLKQAL 68

Query: 440 FSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNF 619
             +V SG ++ AL +F ++N  D + WNI+IRGF +NG + EAIE Y  M+ EG++ DN+
Sbjct: 69  HDHVQSGSMEDALWVFDKMNKLDAYNWNIVIRGFVDNGMFREAIEFYQRMEMEGVKEDNY 128

Query: 620 TYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCV 769
           TYPFVIK+C GS SLVE  ++H KL K+GL  D++ICN++  +Y KLGC+
Sbjct: 129 TYPFVIKACGGSLSLVEVRRVHGKLFKVGLVSDVYICNALCAVYAKLGCI 178



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 32/111 (28%), Positives = 56/111 (50%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y   GC+  A  +F+++   D+  WN +I G+   G     +  +  M   G+  D F+ 
Sbjct: 172 YAKLGCIGDAEKVFEEMPVKDLVSWNSMIGGYVAVGDGWSGVICFRDMIVVGIMPDRFSM 231

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
             V+ +C     L  G +IH +++K  +E D+ +  S+I MY K G V++A
Sbjct: 232 IGVLNACAIEGLLQTGKEIHCQVMKCMVESDVMVQTSLIDMYHKCGRVDYA 282


>ref|XP_006412144.1| hypothetical protein EUTSA_v10024444mg [Eutrema salsugineum]
           gi|557113314|gb|ESQ53597.1| hypothetical protein
           EUTSA_v10024444mg [Eutrema salsugineum]
          Length = 804

 Score =  135 bits (340), Expect = 2e-29
 Identities = 67/162 (41%), Positives = 103/162 (63%), Gaps = 1/162 (0%)
 Frame = +2

Query: 284 SPNRREGPTKDRNPDLLLKQRISKTSPF-YRNRSKIVKSVTRPHENLSLTRTLFSYVNSG 460
           S N +    ++ N +L+  +R S  +    R RS++ K       +L+L R L  + +SG
Sbjct: 23  SENHQTTGKRNWNRNLVFDRRFSNPARIALRERSRLTKL-----NDLALKRALREFADSG 77

Query: 461 CLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVIK 640
            ++ AL LF ++N  D F+WN++IRGF + G Y E+++ Y  M F G++ D+FTYPFVIK
Sbjct: 78  LMEDALHLFDEMNKADAFVWNVMIRGFASCGLYHESVQFYCRMVFAGIKADSFTYPFVIK 137

Query: 641 SCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGC 766
           S  G SSL EG K+H+ +IK+G   D+++CNS+I +Y KLGC
Sbjct: 138 SVAGISSLKEGKKVHAMVIKLGFYSDVYVCNSLISLYMKLGC 179


>gb|EYU37981.1| hypothetical protein MIMGU_mgv1a023065mg, partial [Mimulus
           guttatus]
          Length = 715

 Score =  132 bits (333), Expect = 1e-28
 Identities = 59/112 (52%), Positives = 81/112 (72%)
 Frame = +2

Query: 437 LFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDN 616
           L S VNSG LD AL +F+ +     F+WN++IRG  ++G +E+AIE Y  M+FEG + D 
Sbjct: 2   LLSLVNSGSLDNALQMFETMIKSSTFVWNVIIRGLVDSGLFEKAIEFYRRMQFEGTKPDK 61

Query: 617 FTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVE 772
           FT+PFVIK+C G   L  G  +HS +IK+GL LDI+ICN++IIMY K+GC+E
Sbjct: 62  FTFPFVIKACAGFFCLNAGRNVHSIIIKLGLNLDIYICNALIIMYAKVGCIE 113



 Score = 70.9 bits (172), Expect = 5e-10
 Identities = 36/114 (31%), Positives = 64/114 (56%)
 Frame = +2

Query: 437 LFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDN 616
           +  Y   GC++ +  +F+ +   D+  WN +I G+ ++G   E++  +  M+  G+  D 
Sbjct: 103 IIMYAKVGCIEDSEKIFEHMLIRDIVSWNSMISGYISSGNGWESLMCFRRMQTLGVEIDR 162

Query: 617 FTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           F+Y     +C     L+ G +I S ++K G+ELD  I +SII M+GK G V++A
Sbjct: 163 FSYISAFNACALEGCLLHGKEIFSHVLKNGVELDSMIQSSIIDMFGKCGEVDYA 216


>ref|XP_006283134.1| hypothetical protein CARUB_v10004161mg [Capsella rubella]
           gi|482551839|gb|EOA16032.1| hypothetical protein
           CARUB_v10004161mg [Capsella rubella]
          Length = 807

 Score =  132 bits (332), Expect = 1e-28
 Identities = 63/132 (47%), Positives = 91/132 (68%)
 Frame = +2

Query: 371 RNRSKIVKSVTRPHENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNN 550
           R RS++ K +  P    +LTR L  + +SG +D AL LF ++N  DV++WN++IRG+T+ 
Sbjct: 53  RKRSQLTKQLNDP----ALTRALRGFADSGLMDDALQLFDEMNKADVYVWNVIIRGYTSC 108

Query: 551 GFYEEAIELYNLMKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFIC 730
           GFY EA++ Y  M   G++ D+FTYPFVIKS  G SSL +G KIH+ +IK+    D+++ 
Sbjct: 109 GFYIEAVQFYCRMVLAGIKADSFTYPFVIKSVAGISSLEDGKKIHAMVIKLRFVSDVYVS 168

Query: 731 NSIIIMYGKLGC 766
           NS+I MY KLGC
Sbjct: 169 NSLISMYMKLGC 180


>ref|XP_007198997.1| hypothetical protein PRUPE_ppa002025mg [Prunus persica]
           gi|462394397|gb|EMJ00196.1| hypothetical protein
           PRUPE_ppa002025mg [Prunus persica]
          Length = 727

 Score =  128 bits (322), Expect = 2e-27
 Identities = 53/105 (50%), Positives = 80/105 (76%)
 Frame = +2

Query: 464 LDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVIKS 643
           ++ AL +F+++N  D + WN++IRG T+NG + EAI+ Y+ M+ E +R DNFTYPFVIK+
Sbjct: 1   MEDALWVFEKMNHLDTYYWNVMIRGLTDNGLFREAIDFYHRMQSEAVRADNFTYPFVIKA 60

Query: 644 CTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           C G SSL EG K+H KL K+GL+ D+++ N++  +Y KLGC+E+A
Sbjct: 61  CGGLSSLAEGQKVHGKLFKVGLDSDVYVGNALCAVYAKLGCIEYA 105



 Score = 62.4 bits (150), Expect = 2e-07
 Identities = 33/111 (29%), Positives = 60/111 (54%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y   GC++ A  +F+++   D+  WN +I G+ + G    ++     M+  G++ D F+ 
Sbjct: 96  YAKLGCIEYAERVFEEMPVKDMVSWNSMIGGYVSVGDGWSSLVCLKEMQVLGMKPDRFST 155

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
              + +C     L  G +IH +++K  LELDI +  S+I MY K G V+++
Sbjct: 156 IGALNACAIECFLQTGKEIHCQVLKCMLELDIMVQTSLIDMYHKCGRVDYS 206


>gb|EXB24037.1| hypothetical protein L484_006069 [Morus notabilis]
          Length = 797

 Score =  126 bits (317), Expect = 8e-27
 Identities = 62/147 (42%), Positives = 97/147 (65%)
 Frame = +2

Query: 338 KQRISKTSPFYRNRSKIVKSVTRPHENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFL 517
           K R S   P  R + K   S+    ++ SL R + ++V+SG + +AL +F++++  + ++
Sbjct: 30  KPRTSIPIPSCRRKPK--NSIENRRDS-SLRRAIRNHVDSGQMREALEIFEKMDCSETYV 86

Query: 518 WNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLI 697
           WN++IRGFT+NG + EAI  Y  M+ +G++ DNFTY FVIK+C  S S  EG K+H KL 
Sbjct: 87  WNLMIRGFTDNGLFFEAINFYRRMENQGIQADNFTYLFVIKACGASLSFFEGQKVHGKLF 146

Query: 698 KIGLELDIFICNSIIIMYGKLGCVEFA 778
           K+GL  D+ +CNS++ MYGK G ++ A
Sbjct: 147 KVGLNSDVCVCNSLVSMYGKSGFIKLA 173


>ref|XP_006845697.1| hypothetical protein AMTR_s00019p00238380 [Amborella trichopoda]
           gi|548848269|gb|ERN07372.1| hypothetical protein
           AMTR_s00019p00238380 [Amborella trichopoda]
          Length = 235

 Score =  115 bits (289), Expect = 1e-23
 Identities = 54/122 (44%), Positives = 80/122 (65%)
 Frame = +2

Query: 392 KSVTRPHENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAI 571
           K+  +P    S+  +L +++NSG  + AL LF  +   D  LWNI+I+G+  NGF+ EAI
Sbjct: 39  KNSRKPCNYFSVPDSLHAFLNSGQTETALHLFHSMKTLDPLLWNIMIKGYVQNGFFHEAI 98

Query: 572 ELYNLMKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMY 751
           E Y  M+  G+  D+FTYPFV+K+C   S++ EG K+H KL+K GL+  +F+ NS+I MY
Sbjct: 99  EFYYQMQSNGVIPDHFTYPFVLKACARLSNIAEGKKVHCKLVKTGLDTALFVANSLITMY 158

Query: 752 GK 757
            K
Sbjct: 159 CK 160


>ref|XP_004154387.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790,
           chloroplastic-like [Cucumis sativus]
           gi|449522468|ref|XP_004168248.1| PREDICTED:
           pentatricopeptide repeat-containing protein At3g46790,
           chloroplastic-like [Cucumis sativus]
          Length = 574

 Score =  107 bits (268), Expect = 4e-21
 Identities = 55/114 (48%), Positives = 77/114 (67%), Gaps = 4/114 (3%)
 Frame = +2

Query: 449 VNSGCLDKALT----LFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDN 616
           VN  C+  +LT    LF +I+  ++FLWN++IRG+  NG YE AI LY  M+  GL  D 
Sbjct: 43  VNLYCICNSLTNAHLLFDRISKRNLFLWNVMIRGYAWNGPYELAISLYYQMRDYGLVPDK 102

Query: 617 FTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           FT+PFV+K+C+  S++ EG KIH  +I+ GLE D+F+  ++I MY K GCVE A
Sbjct: 103 FTFPFVLKACSALSAMEEGKKIHKDVIRSGLESDVFVGAALIDMYAKCGCVESA 156



 Score = 61.6 bits (148), Expect = 3e-07
 Identities = 31/111 (27%), Positives = 58/111 (52%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y   GC++ A  +F +I++ DV  WN ++  ++ NG  +E++ L  +M F GL+    T+
Sbjct: 147 YAKCGCVESARQVFDKIDERDVVCWNSMLATYSQNGQPDESLALCRVMAFNGLKPTEGTF 206

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
              I +   +  L +G ++H    + G E +  +  +++ MY K G V  A
Sbjct: 207 VISIAASADNGLLPQGKELHGYSWRHGFESNDKVKTALMDMYAKSGSVNVA 257


>ref|XP_003597735.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|87240430|gb|ABD32288.1| Tetratricopeptide-like
           helical [Medicago truncatula]
           gi|355486783|gb|AES67986.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 620

 Score =  103 bits (256), Expect = 9e-20
 Identities = 53/114 (46%), Positives = 73/114 (64%)
 Frame = +2

Query: 428 TRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLR 607
           T+ +  Y  S  L  A  LF +I   ++FLWN+LIRG+  NG ++ AI LY+ M   GLR
Sbjct: 86  TKLVHLYAVSNSLLNARNLFDKIPKQNLFLWNVLIRGYAWNGPHDNAIILYHKMLDYGLR 145

Query: 608 GDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCV 769
            DNFT PFV+K+C+  S++ EG  IH  +IK G E D+F+  ++I MY K GCV
Sbjct: 146 PDNFTLPFVLKACSALSAIGEGRSIHEYVIKSGWERDLFVGAALIDMYAKCGCV 199


>ref|XP_002302824.2| hypothetical protein POPTR_0002s22590g [Populus trichocarpa]
           gi|550345610|gb|EEE82097.2| hypothetical protein
           POPTR_0002s22590g [Populus trichocarpa]
          Length = 647

 Score =  102 bits (253), Expect = 2e-19
 Identities = 49/117 (41%), Positives = 70/117 (59%), Gaps = 1/117 (0%)
 Frame = +2

Query: 410 HENLS-LTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNL 586
           H NL  LT  +  Y + G +  A +LF      D+FLWN++IRG  +N  Y  AI LY  
Sbjct: 7   HRNLHFLTNLIAQYASLGSVSYAYSLFSSTPSADLFLWNVMIRGLVDNSHYHHAILLYKQ 66

Query: 587 MKFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGK 757
           M   G++ DNFT+PF+IK+C+       G++IH  ++K G +  +FI NS+I MYGK
Sbjct: 67  MLRLGIQPDNFTFPFIIKACSCLRHFEFGIRIHQDVVKFGYQSQVFISNSLITMYGK 123


>ref|XP_007139896.1| hypothetical protein PHAVU_008G067700g [Phaseolus vulgaris]
           gi|561013029|gb|ESW11890.1| hypothetical protein
           PHAVU_008G067700g [Phaseolus vulgaris]
          Length = 577

 Score =  101 bits (252), Expect = 3e-19
 Identities = 47/103 (45%), Positives = 72/103 (69%)
 Frame = +2

Query: 470 KALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVIKSCT 649
           KA  LFQQ++ P +  WN++IRG++ +    EAI LYNLM ++GL GDN TYPF++K+C+
Sbjct: 29  KAHHLFQQVHRPTLPFWNLMIRGWSLSDQPSEAIRLYNLMYYQGLLGDNLTYPFLLKACS 88

Query: 650 GSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
             S +  G  +H +++K+G E  +FI N++I MYG  G ++FA
Sbjct: 89  RVSHVSCGTSLHGRVLKLGFEPHLFISNALINMYGSCGHLDFA 131



 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 41/148 (27%), Positives = 77/148 (52%), Gaps = 3/148 (2%)
 Frame = +2

Query: 344 RISKTSPFYRNRSKIVKSVTRPHENLSLTRTLFS-YVNSGCLDKALTLFQQINDPDVFLW 520
           R+S  S       +++K    PH  L ++  L + Y + G LD A  +F Q+ + D+  W
Sbjct: 89  RVSHVSCGTSLHGRVLKLGFEPH--LFISNALINMYGSCGHLDFARKVFDQMPERDLVSW 146

Query: 521 NILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVIKSCT--GSSSLVEGLKIHSKL 694
           N LI G+     + E + ++  M+   ++GD  T   V+ +C+  G  S+ + +  + + 
Sbjct: 147 NSLICGYGQCKKFREVLGVFEAMRVADVKGDAVTMVKVVLACSVLGEWSVADAMVDYIEE 206

Query: 695 IKIGLELDIFICNSIIIMYGKLGCVEFA 778
            K+  E+D+++ N++I MYG+ G V  A
Sbjct: 207 NKV--EMDVYLGNTLIDMYGRRGLVHLA 232


>ref|XP_006572946.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g31920-like [Glycine max]
          Length = 605

 Score =  101 bits (252), Expect = 3e-19
 Identities = 49/107 (45%), Positives = 72/107 (67%)
 Frame = +2

Query: 458 GCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTYPFVI 637
           G ++ A ++F QI +P  F +N +IRG  N+   EEA+ LY  M   G+  DNFTYPFV+
Sbjct: 79  GSMEYACSIFSQIEEPGSFEYNTMIRGNVNSMDLEEALLLYVEMLERGIEPDNFTYPFVL 138

Query: 638 KSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
           K+C+   +L EG++IH+ + K GLE+D+F+ N +I MYGK G +E A
Sbjct: 139 KACSLLVALKEGVQIHAHVFKAGLEVDVFVQNGLISMYGKCGAIEHA 185


>ref|XP_002265522.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At3g08820-like [Vitis vinifera]
          Length = 686

 Score =  101 bits (252), Expect = 3e-19
 Identities = 48/123 (39%), Positives = 74/123 (60%)
 Frame = +2

Query: 410 HENLSLTRTLFSYVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLM 589
           H+N  L   L    +    +    LF QI  P++FLWN +IRG  +N  +++AIE Y LM
Sbjct: 44  HDNYLLNMILRCSFDFSDTNYTRFLFHQIKQPNIFLWNTMIRGLVSNDCFDDAIEFYGLM 103

Query: 590 KFEGLRGDNFTYPFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCV 769
           + EG   +NFT+PFV+K+C     L  G+KIH+ ++K G + D+F+  S++ +Y K G +
Sbjct: 104 RSEGFLPNNFTFPFVLKACARLLDLQLGVKIHTLVVKGGFDCDVFVKTSLVCLYAKCGYL 163

Query: 770 EFA 778
           E A
Sbjct: 164 EDA 166



 Score = 66.6 bits (161), Expect = 9e-09
 Identities = 35/111 (31%), Positives = 59/111 (53%)
 Frame = +2

Query: 446 YVNSGCLDKALTLFQQINDPDVFLWNILIRGFTNNGFYEEAIELYNLMKFEGLRGDNFTY 625
           Y   G L+ A  +F  I D +V  W  +I G+   G + EAI+++  +    L  D+FT 
Sbjct: 157 YAKCGYLEDAHKVFDDIPDKNVVSWTAIISGYIGVGKFREAIDMFRRLLEMNLAPDSFTI 216

Query: 626 PFVIKSCTGSSSLVEGLKIHSKLIKIGLELDIFICNSIIIMYGKLGCVEFA 778
             V+ +CT    L  G  IH  ++++G+  ++F+  S++ MY K G +E A
Sbjct: 217 VRVLSACTQLGDLNSGEWIHKCIMEMGMVRNVFVGTSLVDMYAKCGNMEKA 267

