BLASTX nr result

ID: Ephedra26_contig00019203 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00019203
         (2057 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]     316   2e-83
ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citr...   311   5e-82
ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat...   310   2e-81
ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat...   309   3e-81
emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]   308   5e-81
ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containi...   305   7e-80
ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containi...   305   7e-80
ref|XP_006840374.1| hypothetical protein AMTR_s00045p00130470 [A...   300   2e-78
ref|XP_002530608.1| pentatricopeptide repeat-containing protein,...   300   2e-78
ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containi...   293   3e-76
ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containi...   289   4e-75
ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat...   288   5e-75
ref|NP_190542.4| pentatricopeptide repeat-containing protein [Ar...   286   3e-74
emb|CAB66911.1| putative protein [Arabidopsis thaliana]               286   3e-74
ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutr...   282   5e-73
ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat...   281   6e-73
ref|XP_002877696.1| pentatricopeptide repeat-containing protein ...   280   1e-72
ref|XP_002302657.2| hypothetical protein POPTR_0002s17660g [Popu...   278   7e-72
ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Caps...   278   9e-72
gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protei...   276   2e-71

>gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]
          Length = 638

 Score =  316 bits (810), Expect = 2e-83
 Identities = 171/526 (32%), Positives = 289/526 (54%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++I+R+L+K+    ++      E+ +  LR  +T               G + YRF+VWA
Sbjct: 75   EKIYRILRKFHSRVSKLELALQESGVV-LRSGLTERVLGRCGDA-----GSLGYRFFVWA 128

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            S+QP ++P+ +   A++R + ++R+   V+ALL+EMR+  P+  T   F VL  R+    
Sbjct: 129  SKQPGYRPSYEVYKAMIRALGKMRQFGAVWALLEEMRKENPQLITPEIFVVLMRRFASAR 188

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  AV     + K GC PD  +F CLL   C+     +A +LF +M VKF PS +++  
Sbjct: 189  MVKKAVEVFDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEEMRVKFTPSLKHFTS 248

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    R G+L  A+ +  ++ ++G    + VY+ +L GY++ GK  ++  ++ EMR  G
Sbjct: 249  LLYGWCREGKLMEAKFVLVQMKEAGFEPDVVVYNNLLGGYAQAGKMADAYDLMKEMRGKG 308

Query: 821  YGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAY 1000
               NA + T+ + ALC   +++EA  +  +M++SGC  D  TY TLI   C   KI++ Y
Sbjct: 309  CSPNAASYTVLIQALCKREKMEEAMRVFVEMQRSGCDADVMTYTTLISGFCKWGKIERGY 368

Query: 1001 EVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRIC 1180
            E++D+M+++G SPN  TYL +M AH+ K+ F   ++L+ +M K GC P+L +Y  +IR+ 
Sbjct: 369  EILDSMIQRGFSPNETTYLHIMLAHEKKEEFEECVELIGEMRKIGCVPDLKIYNTVIRLA 428

Query: 1181 DKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKAK 1360
             K R   + +++W E+  + + P L T+ +++        L EAC+Y  EMVE+ LL   
Sbjct: 429  CKLREVKEGVRLWNEIEASGLSPGLDTFVVMIHGFLGQGCLIEACQYFKEMVERGLLSGP 488

Query: 1361 QYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCETGFVLKASGYC 1540
            QYG    +               +W C+      +NV+A++ WI  L + G V +A  YC
Sbjct: 489  QYGTLKELLNALLRADKLEMAKDVWTCIVNKGCEINVYAWTIWIHALFKNGHVKEACSYC 548

Query: 1541 YDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKR 1678
             DM++  ++P P  +  LM  L++ Y R+   EI  K+R +MAE R
Sbjct: 549  LDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEITEKVR-KMAEDR 593


>ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citrus clementina]
            gi|557528135|gb|ESR39385.1| hypothetical protein
            CICLE_v10025134mg [Citrus clementina]
          Length = 638

 Score =  311 bits (798), Expect = 5e-82
 Identities = 174/541 (32%), Positives = 297/541 (54%), Gaps = 3/541 (0%)
 Frame = +2

Query: 77   GTSNHITAD--EIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYG 250
            G+ N  + D  +IFR+L+K+     +     +E AL++  V + P              G
Sbjct: 65   GSHNEFSHDVEKIFRILKKFHSRLPK-----LELALQHSGVVLRPGLTERVINRCGDA-G 118

Query: 251  GISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFK 430
             + YR+Y+WAS+QP++  + D   AL++ + ++R+   V+AL++EMR+ +P+  T   F 
Sbjct: 119  NLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMRKFGAVWALMEEMRKEKPQLITTEVFV 178

Query: 431  VLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVK 610
            +L  R+    ++  A+  L  + K GC PD  +F CLL   C+ +   +A  LF +M  +
Sbjct: 179  ILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVFGCLLDALCKNSSVKEAAKLFDEMRER 238

Query: 611  FEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESV 790
            F+PS R++  L+    + G+L  A+++  ++ D+G    + VY+ +L+GY++ GK T++ 
Sbjct: 239  FKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDAGFEPDIVVYNNLLSGYAQMGKMTDAF 298

Query: 791  KMLNEMRDMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETL 970
            ++L EMR  G   NA + T+ + ALC   +++EA     +M++SGC  D  TY TLI   
Sbjct: 299  ELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEANRAFVEMERSGCEADVVTYTTLISGF 358

Query: 971  CDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNL 1150
            C  +KI + YE++D+M+++GI PN +TYL +M AH+ K+     ++LM +M K GC P++
Sbjct: 359  CKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLAHEKKEELEECVELMGEMRKIGCVPDV 418

Query: 1151 AVYKVIIRI-CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHN 1327
            + Y V+IR+ C  G L + V  +W EM    + P   ++ +++        L EAC+Y  
Sbjct: 419  SNYNVVIRLACKLGELKEAV-NVWNEMEAASLSPGTDSFVVMVHGFLGQGCLIEACEYFK 477

Query: 1328 EMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCE 1507
            EMV + LL A QYG    +               +W C+      +NV+A++ WI  L  
Sbjct: 478  EMVGRGLLSAPQYGTLKELLNSLLRAQKVEMAKDVWSCIVTKGCELNVYAWTIWIHSLFS 537

Query: 1508 TGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPI 1687
             G V +A  YC DM++  ++P P  +  LM  L++ Y R+   EI  K+R   AE+++  
Sbjct: 538  NGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEITEKVRKMAAERQITF 597

Query: 1688 K 1690
            K
Sbjct: 598  K 598


>ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like [Citrus sinensis]
          Length = 638

 Score =  310 bits (793), Expect = 2e-81
 Identities = 173/541 (31%), Positives = 297/541 (54%), Gaps = 3/541 (0%)
 Frame = +2

Query: 77   GTSNHITAD--EIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYG 250
            G+ N  + D  +IFR+L+K+     +     +E AL++  V + P              G
Sbjct: 65   GSHNEFSHDVEKIFRILKKFHSRLPK-----LELALQHSGVVLRPGLTERVINRCGDA-G 118

Query: 251  GISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFK 430
             + YR+Y+WAS+QP++  + D   AL++ + ++R+   V+AL++EMR+ +P+  T   F 
Sbjct: 119  NLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMRKFGAVWALMEEMRKEKPQLITTEVFV 178

Query: 431  VLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVK 610
            +L  R+    ++  A+  L  + K GC PD  +F CLL   C+ +   +A  LF ++  +
Sbjct: 179  ILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVFGCLLDALCKNSSVKEAAKLFDEIRER 238

Query: 611  FEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESV 790
            F+PS R++  L+    + G+L  A+++  ++ D+G    + VY+ +L+GY++ GK T++ 
Sbjct: 239  FKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDAGFEPDIVVYNNLLSGYAQMGKMTDAF 298

Query: 791  KMLNEMRDMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETL 970
            ++L EMR  G   NA + T+ + ALC   +++EA     +M++SGC  D  TY TLI   
Sbjct: 299  ELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEANRAFVEMERSGCEADVVTYTTLISGF 358

Query: 971  CDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNL 1150
            C  +KI + YE++D+M+++GI PN +TYL +M AH+ K+     ++LM +M K GC P++
Sbjct: 359  CKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLAHEKKEELEECVELMGEMRKIGCVPDV 418

Query: 1151 AVYKVIIRI-CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHN 1327
            + Y V+IR+ C  G L + V  +W EM    + P   ++ +++        L EAC+Y  
Sbjct: 419  SNYNVVIRLACKLGELKEAV-NVWNEMEAASLSPGTDSFVVMVHGFLGQGCLIEACEYFK 477

Query: 1328 EMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCE 1507
            EMV + LL A QYG    +               +W C+      +NV+A++ WI  L  
Sbjct: 478  EMVGRGLLSAPQYGTLKALLNSLLRAQKVEMAKDVWSCIVTKGCELNVYAWTIWIHSLFS 537

Query: 1508 TGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPI 1687
             G V +A  YC DM++  ++P P  +  LM  L++ Y R+   EI  K+R   AE+++  
Sbjct: 538  NGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEITEKVRKMAAERQITF 597

Query: 1688 K 1690
            K
Sbjct: 598  K 598


>ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like [Vitis vinifera]
          Length = 622

 Score =  309 bits (791), Expect = 3e-81
 Identities = 169/531 (31%), Positives = 286/531 (53%), Gaps = 1/531 (0%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++++R+L+K+     +     +E AL+   V V                G + YRF+VWA
Sbjct: 59   EKVYRILRKFHSRVPK-----LELALQESGVAVRSGLTERVLNRCGDA-GNLGYRFFVWA 112

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            S+QP ++ + +   A+++++ ++R+   V+AL++EMRR  P+F +   F VL  R+    
Sbjct: 113  SKQPGYRHSYEVYKAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASAR 172

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  A+  L  + K GC PD  +F CLL   C+     +A +LF DM ++F P+ +++  
Sbjct: 173  MVKKAIEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTS 232

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    R G+L  A+++  ++ ++G    + VY+ +LTGY+  GK  ++  +L EMR   
Sbjct: 233  LLYGWCREGKLMEAKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKE 292

Query: 821  YGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAY 1000
               N  + T  + ALC++ +++EA  +  +M+  GC  DA TY TLI   C   KI + Y
Sbjct: 293  CEPNVMSFTTLIQALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGY 352

Query: 1001 EVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRI- 1177
            E++D M++QG  PN MTYL +M AH+ K+     ++LME+M K GC P+L +Y ++IR+ 
Sbjct: 353  ELLDNMIQQGHIPNPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLA 412

Query: 1178 CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKA 1357
            C  G + + V ++W EM    + P L T+ +++        L EAC++  EMV + LL A
Sbjct: 413  CKLGEIKEGV-RVWNEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSA 471

Query: 1358 KQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCETGFVLKASGY 1537
             QYG    +               +W C+      +NV+A++ WI  L   G V +A  Y
Sbjct: 472  PQYGTLKELLNSLLRAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSY 531

Query: 1538 CYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
            C DM++  ++P P  +  LM  L + Y R+   EI  K+R   AE+ +  K
Sbjct: 532  CLDMMDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMTFK 582


>emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]
          Length = 655

 Score =  308 bits (790), Expect = 5e-81
 Identities = 161/482 (33%), Positives = 268/482 (55%), Gaps = 1/482 (0%)
 Frame = +2

Query: 248  GGISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAF 427
            G + YRF+VWAS+QP ++ + +   A+++++ ++R+   V+AL++EMRR  P+F +   F
Sbjct: 135  GNLGYRFFVWASKQPGYRHSYEVYKAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVF 194

Query: 428  KVLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMV 607
             VL  R+    ++  A+  L  + K GC PD  +F CLL   C+     +A +LF DM +
Sbjct: 195  VVLMRRFASARMVKKAIEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRI 254

Query: 608  KFEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATES 787
            +F P+ +++  L+    R G+L  A+++  ++ ++G    + VY+ +LTGY+  GK  ++
Sbjct: 255  RFTPTLKHFTSLLYGWCREGKLMEAKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDA 314

Query: 788  VKMLNEMRDMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIET 967
              +L EMR      N  + T  + ALC++ +++EA  +  +M+  GC  DA TY TLI  
Sbjct: 315  YDLLKEMRRKECEPNVMSFTTLIQALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISG 374

Query: 968  LCDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPN 1147
             C   KI + YE++D M++QG  PN MTYL +M AH+ K+     ++LME+M K GC P+
Sbjct: 375  FCKWGKISKGYELLDNMIQQGHIPNPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPD 434

Query: 1148 LAVYKVIIRI-CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYH 1324
            L +Y ++IR+ C  G + + V ++W EM    + P L T+ +++        L EAC++ 
Sbjct: 435  LNIYNIVIRLACKLGEIKEGV-RVWNEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFF 493

Query: 1325 NEMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLC 1504
             EMV + LL A QYG    +               +W C+      +NV+A++ WI  L 
Sbjct: 494  KEMVGRGLLSAPQYGTLKELLNSLLRAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALF 553

Query: 1505 ETGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLP 1684
              G V +A  YC DM++  ++P P  +  LM  L + Y R+   EI  K+R   AE+ + 
Sbjct: 554  SNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMT 613

Query: 1685 IK 1690
             K
Sbjct: 614  FK 615


>ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Cucumis sativus]
          Length = 664

 Score =  305 bits (780), Expect = 7e-80
 Identities = 170/534 (31%), Positives = 286/534 (53%), Gaps = 1/534 (0%)
 Frame = +2

Query: 92   ITADEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFY 271
            +  ++++R+L+K+      T    +E AL+   V +                G + YRF+
Sbjct: 98   VDVEKVYRILRKFH-----TRVPKLELALQESGV-IMRSGLPERVLSRCGDAGNLGYRFF 151

Query: 272  VWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYN 451
            VWAS+QP ++ + +   A+++ + ++R+   V+AL++EMR+  P   T   F VL  R+ 
Sbjct: 152  VWASKQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMRKENPYMLTPEVFIVLMRRFA 211

Query: 452  HGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRN 631
               ++  AV  L  + K GC PD  +F CLL   C+     +A +LF DM V+F P+ R+
Sbjct: 212  SVRMVKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFNPNLRH 271

Query: 632  YAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMR 811
            +  L+    R G++  A+H+  ++ ++G    + VY+ +L GY++ GK  ++  +L EM+
Sbjct: 272  FTSLLYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLLGGYAQAGKMRDAFDLLAEMK 331

Query: 812  DMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIK 991
             +  G NA + TI + + C   ++DEA  +  +M+ SGC  D  TY TLI   C      
Sbjct: 332  KVNCGPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCEADVVTYTTLISGFCKWGNTD 391

Query: 992  QAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVII 1171
            +AYE++D M+++G  P+ ++YL +M AH+ K+     M+L+E+M K GC P+L +Y  +I
Sbjct: 392  KAYEILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMELIEEMRKIGCVPDLNIYNTMI 451

Query: 1172 R-ICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDL 1348
            R +C  G L + V ++W EM    + P L TY +++        L EAC Y  EMVE+ L
Sbjct: 452  RLVCKLGDLKEAV-RLWGEMQAGGLNPGLDTYILMVHGFLSQGCLVEACDYFKEMVERGL 510

Query: 1349 LKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCETGFVLKA 1528
            L A QYG    +              +MW C+      +NV A++ WI  L   G V +A
Sbjct: 511  LSAPQYGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNVSAWTIWIHALFSNGHVKEA 570

Query: 1529 SGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
              YC DM++  ++P P  +  LM  L++ + R+  VEI  K+R   A++++  K
Sbjct: 571  CSYCLDMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEKVRKMAADRQITFK 624


>ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Cucumis sativus]
          Length = 641

 Score =  305 bits (780), Expect = 7e-80
 Identities = 170/534 (31%), Positives = 286/534 (53%), Gaps = 1/534 (0%)
 Frame = +2

Query: 92   ITADEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFY 271
            +  ++++R+L+K+      T    +E AL+   V +                G + YRF+
Sbjct: 75   VDVEKVYRILRKFH-----TRVPKLELALQESGV-IMRSGLPERVLSRCGDAGNLGYRFF 128

Query: 272  VWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYN 451
            VWAS+QP ++ + +   A+++ + ++R+   V+AL++EMR+  P   T   F VL  R+ 
Sbjct: 129  VWASKQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMRKENPYMLTPEVFIVLMRRFA 188

Query: 452  HGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRN 631
               ++  AV  L  + K GC PD  +F CLL   C+     +A +LF DM V+F P+ R+
Sbjct: 189  SVRMVKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFNPNLRH 248

Query: 632  YAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMR 811
            +  L+    R G++  A+H+  ++ ++G    + VY+ +L GY++ GK  ++  +L EM+
Sbjct: 249  FTSLLYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLLGGYAQAGKMRDAFDLLAEMK 308

Query: 812  DMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIK 991
             +  G NA + TI + + C   ++DEA  +  +M+ SGC  D  TY TLI   C      
Sbjct: 309  KVNCGPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCEADVVTYTTLISGFCKWGNTD 368

Query: 992  QAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVII 1171
            +AYE++D M+++G  P+ ++YL +M AH+ K+     M+L+E+M K GC P+L +Y  +I
Sbjct: 369  KAYEILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMELIEEMRKIGCVPDLNIYNTMI 428

Query: 1172 R-ICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDL 1348
            R +C  G L + V ++W EM    + P L TY +++        L EAC Y  EMVE+ L
Sbjct: 429  RLVCKLGDLKEAV-RLWGEMQAGGLNPGLDTYILMVHGFLSQGCLVEACDYFKEMVERGL 487

Query: 1349 LKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCETGFVLKA 1528
            L A QYG    +              +MW C+      +NV A++ WI  L   G V +A
Sbjct: 488  LSAPQYGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNVSAWTIWIHALFSNGHVKEA 547

Query: 1529 SGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
              YC DM++  ++P P  +  LM  L++ + R+  VEI  K+R   A++++  K
Sbjct: 548  CSYCLDMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEKVRKMAADRQITFK 601


>ref|XP_006840374.1| hypothetical protein AMTR_s00045p00130470 [Amborella trichopoda]
            gi|548842092|gb|ERN02049.1| hypothetical protein
            AMTR_s00045p00130470 [Amborella trichopoda]
          Length = 735

 Score =  300 bits (767), Expect = 2e-78
 Identities = 156/482 (32%), Positives = 265/482 (54%), Gaps = 1/482 (0%)
 Frame = +2

Query: 248  GGISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAF 427
            G +S+RF++W+S+QP +  + D    +++ + ++R+   V+ALL+EMR+  P+  T   F
Sbjct: 213  GNLSFRFFIWSSKQPGYCHSYDCYKLMIKQLGKMRQFGTVWALLEEMRKDNPEHITPETF 272

Query: 428  KVLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMV 607
             +L  R+    ++  A+  L  + K GC PD   F CLL   C+ N   +A +LF DM  
Sbjct: 273  VILLRRFAASRMVGKAIEVLDEMPKFGCEPDEHTFGCLLDALCKNNAVKEAASLFEDMKY 332

Query: 608  KFEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATES 787
            KF P+ ++Y  L+    R G++  A+H+  ++ ++G    + VY+ +L+G++   K  + 
Sbjct: 333  KFSPNLKHYTSLLYGWCREGKIIEAKHILVQMKEAGFEPDIVVYNNLLSGFAIAEKMEDG 392

Query: 788  VKMLNEMRDMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIET 967
              +L EM+  GY  NA + TI + ALCS+GR++EA  +  +MK++GC  D  TY TLI  
Sbjct: 393  YDLLIEMKHKGYPPNATSYTILIQALCSKGRMEEALRLFVEMKRNGCLADVVTYTTLISG 452

Query: 968  LCDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPN 1147
             C   K+  AYE++++MVKQG  PN MTYL ++ AH+ K+     ++L+ +MSK G +P+
Sbjct: 453  FCKVGKLDNAYELLESMVKQGCRPNQMTYLCILGAHEKKEELEECLELVREMSKTGIKPD 512

Query: 1148 LAVYKVIIRICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLL-CCLHDANYLTEACKYH 1324
              +Y  +IR+  K     +   +W EM        + ++ +++   L   N L +ACKY 
Sbjct: 513  ANIYNTLIRLACKLGQVKEAFDVWNEMEAKGFSAGIDSFTIMIHGLLLQGNSLIDACKYF 572

Query: 1325 NEMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLC 1504
             EMV + LL   QYG    +               +W+ +R    H+NV+A++ WI  L 
Sbjct: 573  KEMVGRGLLATPQYGTFKELLNGLMRAGKLEMGKDIWDTIRSGGCHLNVYAYTIWIHSLF 632

Query: 1505 ETGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLP 1684
            + G V +A GYC +M++  I+P    +  LM  L + Y R+   EI  ++R   +++ + 
Sbjct: 633  DHGHVKEACGYCLEMLDGGIMPQADTFAKLMRGLRKLYNRQIAAEITERVRQMASDRNMS 692

Query: 1685 IK 1690
             K
Sbjct: 693  FK 694


>ref|XP_002530608.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529856|gb|EEF31788.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 596

 Score =  300 bits (767), Expect = 2e-78
 Identities = 158/470 (33%), Positives = 257/470 (54%)
 Frame = +2

Query: 248  GGISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAF 427
            G + YRF+VWAS+QP ++ + ++  A++++  ++R+   V+ALL+EMR+      T   F
Sbjct: 120  GNLGYRFFVWASKQPGYRHSYENYKAMVKIFSKMRQFGAVWALLEEMRKDNSVLITSELF 179

Query: 428  KVLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMV 607
             VL  R+    L+  A+  L  + K GC PD  +F CLL   C+     +A +LF DM V
Sbjct: 180  IVLIRRFASARLVEKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKQAASLFEDMRV 239

Query: 608  KFEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATES 787
            +F PS R++  L+    R G+L  A+H+  ++ ++G    + V++ +L+ YS  GK T++
Sbjct: 240  RFSPSLRHFTSLLYGWCREGKLIEAKHVLVQMREAGFEPDIVVFNNLLSAYSMAGKMTDA 299

Query: 788  VKMLNEMRDMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIET 967
              +L EM   G   NA + TI + A CS+ ++DEA  +  +M+++GC  D  TY  LI  
Sbjct: 300  FDLLKEMVRKGCEPNANSYTIMIQAFCSQEKMDEAMRVFVEMERTGCEADVVTYTALISG 359

Query: 968  LCDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPN 1147
             C   KI + Y+++DAM ++G  PN +TYL ++ AH+ K+     ++L+E M   GC P+
Sbjct: 360  FCKWGKINRGYQILDAMKQKGHMPNQLTYLRILLAHEKKEELEECLELIESMRMVGCVPD 419

Query: 1148 LAVYKVIIRICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHN 1327
            L++Y V+IR+  K     Q +++W EM  +D  P+L T+ +++        L EAC+Y  
Sbjct: 420  LSIYNVVIRLACKLGEVKQGVQIWNEMEASDFSPELDTFVIMIHGFLGQGCLVEACEYFK 479

Query: 1328 EMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCE 1507
            EM+ + LL   QYGI   +               +W C+      +N  A++ WI  L  
Sbjct: 480  EMIGRGLLTTPQYGILKELLNALLRGEKLGMAKDVWSCIVTKGCELNADAWTIWIHSLFS 539

Query: 1508 TGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLR 1657
             G V +A  YC DM+E  I+P P  +  LM  L + Y R    EI  K++
Sbjct: 540  NGHVKEACSYCLDMMEADIMPKPETFAKLMRGLRKLYNREFAAEITEKIK 589


>ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            isoform X1 [Glycine max] gi|571514894|ref|XP_006597171.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g49730-like isoform X2 [Glycine max]
            gi|571514897|ref|XP_006597172.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g49730-like isoform X3 [Glycine max]
          Length = 654

 Score =  293 bits (749), Expect = 3e-76
 Identities = 163/531 (30%), Positives = 283/531 (53%), Gaps = 1/531 (0%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++++R+L+KY     +     +E AL    V V P              G ++YRFY WA
Sbjct: 93   EKVYRILRKYHSRVPK-----LELALRESGVVVRPGLTERVLSRCGDA-GNLAYRFYSWA 146

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            S+Q   +   D+  A+++V+ R+R+   V+AL++EMR+  P   T + F +L  R+    
Sbjct: 147  SKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFVILMRRFASAR 206

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  AV  L  + K GC PD  +F CLL   C+     +A +LF DM  +++PS +++  
Sbjct: 207  MVHKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRYRWKPSVKHFTS 266

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    + G+L  A+H+  ++ D G    + VY+ +L GY++ GK  ++  +L EMR   
Sbjct: 267  LLYGWCKEGKLMEAKHVLVQMKDMGIEPDIVVYNNLLGGYAQAGKMGDAYDLLKEMRRKR 326

Query: 821  YGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAY 1000
               NA + T+ + +LC   R++EA  +  +M+ +GC  D  TY+TLI   C   KIK+ Y
Sbjct: 327  CEPNATSYTVLIQSLCKHERLEEATRLFVEMQTNGCQADVVTYSTLISGFCKWGKIKRGY 386

Query: 1001 EVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRIC 1180
            E++D M++QG  PN + Y  +M AH+ K+      +L+ +M K GC P+L++Y  +IR+ 
Sbjct: 387  ELLDEMIQQGHFPNQVIYQHIMLAHEKKEELEECKELVNEMQKIGCAPDLSIYNTVIRLA 446

Query: 1181 DKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKAK 1360
             K     + +++W EM ++ + P + T+ +++    +   L EAC+Y  EMV + L  A 
Sbjct: 447  CKLGEVKEGIQLWNEMESSGLSPGMDTFVIMINGFLEQGCLVEACEYFKEMVGRGLFTAP 506

Query: 1361 QYGITYRIWXXXXXXXXXXXXISMWECLRKNER-HVNVFAFSSWIEVLCETGFVLKASGY 1537
            QYG    +                W C+  ++   +NV A++ WI  L   G V +A  +
Sbjct: 507  QYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFSKGHVKEACSF 566

Query: 1538 CYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
            C DM++  ++P P  +  LM+ L++ Y R+   EI  K+R   A++++  K
Sbjct: 567  CIDMMDKDLMPNPDTFAKLMHGLKKLYNRQFAAEITEKVRKMAADRQITFK 617


>ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Solanum tuberosum]
          Length = 625

 Score =  289 bits (739), Expect = 4e-75
 Identities = 158/526 (30%), Positives = 279/526 (53%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++++R+L+K+     +     +E+ +      V                G + YRF+VW 
Sbjct: 62   EKVYRILRKFHSRVPKLELALLESGV------VARSGLTERVLNRCGDAGNLGYRFFVWV 115

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            S+QP ++ + D+  A+++++ ++R+   V+AL++EMR   P+F T   F VL  R+  G 
Sbjct: 116  SKQPGYRHSHDAYKAMIKILGKMRQFGTVWALVEEMRIENPQFLTPEVFIVLMRRFASGR 175

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  A+  L  + K G  PD  +F CLL   C+     +A ALF +M  +F P+ +++  
Sbjct: 176  MVKKAIEVLDEMPKYGVEPDEYVFGCLLDALCKNGSVKEAAALFDEMRFRFSPTIKHFTS 235

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    + G+L  A+ +  K+ ++G    + VY+ +L GY+   K  ++  +L EMR  G
Sbjct: 236  LLYGWCKEGKLIEAKVVLVKMREAGFEPDIVVYNNLLNGYAVSRKMADAFDLLQEMRRKG 295

Query: 821  YGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAY 1000
               N  + TI + ALC + +++EA  +  DM++SGC  D  TY TLI   C   KI++ Y
Sbjct: 296  CNPNETSFTIVIQALCLQDKMEEAMRVFLDMERSGCEGDVVTYTTLISGFCKWGKIEKGY 355

Query: 1001 EVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRIC 1180
            E++D M+++G +PN  TYL +M AH+ K+     ++L+++M K G  P+ ++Y ++IR+ 
Sbjct: 356  ELVDTMLQKGYNPNQTTYLHIMLAHEKKEELEECLELVKEMGKIGIPPDHSIYNIVIRLA 415

Query: 1181 DKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKAK 1360
             K    D+ +++W ++  N + P + T+ +++    +   L EAC +  EM+ + LL A 
Sbjct: 416  CKLGEIDEGVRVWNQIEANGISPGVDTFIIMINGFVEQGRLIEACDHFKEMIGRGLLSAP 475

Query: 1361 QYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCETGFVLKASGYC 1540
            QYG    +               +W C+      +NV A++ WI  L   G V +A  YC
Sbjct: 476  QYGTLKDLLNSLLRAEKLELCKDVWSCIMTKGCELNVSAWTIWIHALFSNGHVKEACAYC 535

Query: 1541 YDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKR 1678
             DM++  ++P P  +  LM  L + Y R    EI  K R +MAE+R
Sbjct: 536  LDMMDAGLMPQPDTFAKLMKGLRKLYNREIAAEITEKAR-KMAEQR 580


>ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like isoform X1 [Cicer arietinum]
            gi|502165084|ref|XP_004513408.1| PREDICTED: putative
            pentatricopeptide repeat-containing protein
            At5g65820-like isoform X2 [Cicer arietinum]
          Length = 655

 Score =  288 bits (738), Expect = 5e-75
 Identities = 159/531 (29%), Positives = 282/531 (53%), Gaps = 1/531 (0%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++++R+L+KY     +     +E AL+   V V+               G ++YRF+ WA
Sbjct: 93   EKVYRILRKYHSRVPK-----LELALKESGVVVSSGLTERVLNRCGNS-GNLAYRFFSWA 146

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            S+Q  ++ + +   A+++V+ ++R+   V+AL+ EMR   P+  +   F +L  R+    
Sbjct: 147  SKQSGYRHSEEVYKAMIKVLSKMRQFGAVWALIDEMRLENPQLISPHVFVILMRRFASAR 206

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  A+  L  + K GC PD  +F CLL   C+     +A +LF DM  +F P+ +++  
Sbjct: 207  MVHKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSIKEAASLFEDMRYRFPPTVKHFTS 266

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    + G+L  A+H+  ++ D+G    + V++ +L GY++ GK  ++  +L EM+  G
Sbjct: 267  LLYGWCKEGKLVEAKHVLVQMKDAGIEPDIVVFNNLLGGYAQGGKMADAYDLLKEMKRKG 326

Query: 821  YGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAY 1000
               NA + TI + +LC   +++EA  +  +M+++ C  D  TY TLI   C   KIK+ Y
Sbjct: 327  CEPNAASYTILIQSLCKHEKLEEAMRIFVEMQRNDCQMDVITYTTLISGFCKWGKIKRGY 386

Query: 1001 EVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRIC 1180
            E++D M+++G SPN +TYL +M AH+ K+     M+L+ +M K GC PNL +Y  +IR+ 
Sbjct: 387  ELLDQMIQEGHSPNQLTYLHIMLAHEKKEELEECMELVNEMKKIGCVPNLNIYNTVIRLA 446

Query: 1181 DKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKAK 1360
             K     Q +++W EM  + + P   T+ +++    + + L EAC+Y  EMV + L  A 
Sbjct: 447  CKFGEVKQGVRLWNEMEASGLSPGTDTFVVMINGFLEQDCLIEACEYFKEMVGRGLFAAP 506

Query: 1361 QYGITYRIWXXXXXXXXXXXXISMWECLRKNER-HVNVFAFSSWIEVLCETGFVLKASGY 1537
            QYG    +                W C+  ++   +NV A++ WI  L   G V +A  +
Sbjct: 507  QYGTLKELMNSLLRAEKLEMAKDTWNCITASKSCEMNVAAWTIWIHALFSKGHVKEACSF 566

Query: 1538 CYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
            C DM++  ++P P  +  L+  L++ Y R    EI  K+R   A++ +  K
Sbjct: 567  CIDMMDNDLMPQPDTFAKLIRGLKKLYNREFAAEITEKVRKMAADRHITFK 617


>ref|NP_190542.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546755|sp|P0C8A0.1|PP275_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g49730 gi|332645062|gb|AEE78583.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 638

 Score =  286 bits (731), Expect = 3e-74
 Identities = 157/534 (29%), Positives = 277/534 (51%), Gaps = 3/534 (0%)
 Frame = +2

Query: 98   ADEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVW 277
            A E+ ++ +  + H +R  +  +E AL    + + P              G + YRF++W
Sbjct: 64   AGEVEKIYRILRNHHSRVPK--LELALNESGIDLRPGLIIRVLSRCGDA-GNLGYRFFLW 120

Query: 278  ASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHG 457
            A++QP +  + +   +++ ++ ++R+   V+ L++EMR+T P+      F VL  R+   
Sbjct: 121  ATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASA 180

Query: 458  HLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYA 637
            +++  AV  L  + K G  PD  +F CLL   C+     +A  +F DM  KF P+ R + 
Sbjct: 181  NMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPPNLRYFT 240

Query: 638  ILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDM 817
             L+    R G+L  A+ +  ++ ++G    + V+  +L+GY+  GK  ++  ++N+MR  
Sbjct: 241  SLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKR 300

Query: 818  GYGVNAKAITIGVHALC-SEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQ 994
            G+  N    T+ + ALC +E R+DEA  +  +M++ GC  D  TY  LI   C    I +
Sbjct: 301  GFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDK 360

Query: 995  AYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIR 1174
             Y V+D M K+G+ P+ +TY+ +M AH+ K+ F   ++L+EKM ++GC P+L +Y V+IR
Sbjct: 361  GYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIR 420

Query: 1175 ICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLK 1354
            +  K     + +++W EM  N + P + T+ +++       +L EAC +  EMV + +  
Sbjct: 421  LACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFS 480

Query: 1355 AKQYGITYRIWXXXXXXXXXXXXISMWECL--RKNERHVNVFAFSSWIEVLCETGFVLKA 1528
            A QYG    +               +W C+  + +   +NV A++ WI  L   G V +A
Sbjct: 481  APQYGTLKSLLNNLVRDDKLEMAKDVWSCISNKTSSCELNVSAWTIWIHALYAKGHVKEA 540

Query: 1529 SGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
              YC DM+E+ ++P P  Y  LM  L + Y R    EI  K+    +E+ +  K
Sbjct: 541  CSYCLDMMEMDLMPQPNTYAKLMKGLNKLYNRTIAAEITEKVVKMASEREMSFK 594


>emb|CAB66911.1| putative protein [Arabidopsis thaliana]
          Length = 1184

 Score =  286 bits (731), Expect = 3e-74
 Identities = 157/534 (29%), Positives = 277/534 (51%), Gaps = 3/534 (0%)
 Frame = +2

Query: 98   ADEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVW 277
            A E+ ++ +  + H +R  +  +E AL    + + P              G + YRF++W
Sbjct: 64   AGEVEKIYRILRNHHSRVPK--LELALNESGIDLRPGLIIRVLSRCGDA-GNLGYRFFLW 120

Query: 278  ASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHG 457
            A++QP +  + +   +++ ++ ++R+   V+ L++EMR+T P+      F VL  R+   
Sbjct: 121  ATKQPGYFHSYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASA 180

Query: 458  HLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYA 637
            +++  AV  L  + K G  PD  +F CLL   C+     +A  +F DM  KF P+ R + 
Sbjct: 181  NMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPPNLRYFT 240

Query: 638  ILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDM 817
             L+    R G+L  A+ +  ++ ++G    + V+  +L+GY+  GK  ++  ++N+MR  
Sbjct: 241  SLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKR 300

Query: 818  GYGVNAKAITIGVHALC-SEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQ 994
            G+  N    T+ + ALC +E R+DEA  +  +M++ GC  D  TY  LI   C    I +
Sbjct: 301  GFEPNVNCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDK 360

Query: 995  AYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIR 1174
             Y V+D M K+G+ P+ +TY+ +M AH+ K+ F   ++L+EKM ++GC P+L +Y V+IR
Sbjct: 361  GYSVLDDMRKKGVMPSQVTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIR 420

Query: 1175 ICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLK 1354
            +  K     + +++W EM  N + P + T+ +++       +L EAC +  EMV + +  
Sbjct: 421  LACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFS 480

Query: 1355 AKQYGITYRIWXXXXXXXXXXXXISMWECL--RKNERHVNVFAFSSWIEVLCETGFVLKA 1528
            A QYG    +               +W C+  + +   +NV A++ WI  L   G V +A
Sbjct: 481  APQYGTLKSLLNNLVRDDKLEMAKDVWSCISNKTSSCELNVSAWTIWIHALYAKGHVKEA 540

Query: 1529 SGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
              YC DM+E+ ++P P  Y  LM  L + Y R    EI  K+    +E+ +  K
Sbjct: 541  CSYCLDMMEMDLMPQPNTYAKLMKGLNKLYNRTIAAEITEKVVKMASEREMSFK 594


>ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum]
            gi|557105226|gb|ESQ45560.1| hypothetical protein
            EUTSA_v10010190mg [Eutrema salsugineum]
          Length = 645

 Score =  282 bits (721), Expect = 5e-73
 Identities = 153/484 (31%), Positives = 255/484 (52%), Gaps = 3/484 (0%)
 Frame = +2

Query: 248  GGISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAF 427
            G + YRF++WA++QP +  + +   ++++++ ++R+   V+AL++EMR+  P+      F
Sbjct: 117  GNLGYRFFLWAAKQPGYCHSYEVCKSMVKILSKMRQFGAVWALIEEMRKENPQLIEPELF 176

Query: 428  KVLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMV 607
             VL  R+   +++  AV  L  + K G  PD  +F CLL   C+      A  LF DM  
Sbjct: 177  VVLMRRFASANMVKKAVEVLDEMPKYGIEPDEYIFGCLLDALCKNGSVKDASKLFEDMRD 236

Query: 608  KFEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATES 787
            KF P+ R +  L+    R G+L  A+H+  ++ ++G    + V+  +L+GY+  GK  ++
Sbjct: 237  KFPPNLRYFTSLLYGWCREGKLIEAKHVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADA 296

Query: 788  VKMLNEMRDMGYGVNAKAITIGVHALCS-EGRIDEAFGMIEDMKKSGCFPDATTYNTLIE 964
              ++ +MR  GY  NA   T+ + ALC  E R+DEA  +  +M++ GC  D  TY  LI 
Sbjct: 297  YDLMKDMRRRGYEPNANCYTVLIQALCKMEKRMDEAMRVFVEMERYGCEADIVTYTALIS 356

Query: 965  TLCDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRP 1144
              C    I + Y V+D M K+G+ P  +TY+ +M AH+ K+ F   + L+EKM + GC P
Sbjct: 357  GFCKWGMIDKGYSVLDDMRKKGVMPLQVTYMQIMVAHEKKEQFEECLDLIEKMKQNGCLP 416

Query: 1145 NLAVYKVIIRICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYH 1324
            +L +Y V+IR+  K     + +++W EM  N + P + T+ +++        L EAC + 
Sbjct: 417  DLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFASQGCLIEACDHF 476

Query: 1325 NEMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECL--RKNERHVNVFAFSSWIEV 1498
             EMV + +  A  YG    +               +W CL  + +   +NV A++ WI  
Sbjct: 477  KEMVSRGIFSAPHYGTLKILLNTLVRDDKLEMAKDVWSCLSNKSSSCELNVSAWTIWIHA 536

Query: 1499 LCETGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKR 1678
            L   G V +A  YC DM+E+ ++P P  Y  LM  L + Y R    EI  K+R   +E+ 
Sbjct: 537  LFARGHVKEACSYCLDMMEMDLMPQPDTYAKLMKGLNKLYNRTIAAEITEKVRKMASERE 596

Query: 1679 LPIK 1690
            +  K
Sbjct: 597  MSFK 600


>ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like, partial [Glycine max]
          Length = 656

 Score =  281 bits (720), Expect = 6e-73
 Identities = 160/532 (30%), Positives = 282/532 (53%), Gaps = 2/532 (0%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++++R+L+KY     +     +E AL    V V P              G ++YRFY WA
Sbjct: 95   EKVYRILRKYHSRVPK-----LELALRESGVVVRPGLTERVLNRCGDA-GNLAYRFYSWA 148

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            S+Q   +   D+  A+++V+ R+R+   V+AL++EMR+  P   T + F +L  R+    
Sbjct: 149  SKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFVILMRRFASAR 208

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  AV  L  +   GC PD  +F CLL    +     +A +LF ++  +++PS +++  
Sbjct: 209  MVHKAVQVLDEMPNYGCEPDEYVFGCLLDALRKNGSVKEAASLFEELRYRWKPSVKHFTS 268

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    + G+L  A+H+  ++ D+G    + VY+ +L GY++  K  ++  +L EMR  G
Sbjct: 269  LLYGWCKEGKLMEAKHVLVQMKDAGIEPDIVVYNNLLGGYAQADKMGDAYDLLKEMRRKG 328

Query: 821  YGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAY 1000
               NA + T+ + +LC   R++EA  +  +M+++GC  D  TY+TLI   C   KIK+ Y
Sbjct: 329  CEPNATSYTVLIQSLCKHERLEEATRVFVEMQRNGCQADLVTYSTLISGFCKWGKIKRGY 388

Query: 1001 EVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRI- 1177
            E++D M++QG  PN + Y  +M AH+ K+      +L+ +M K GC P+L++Y  +IR+ 
Sbjct: 389  ELLDEMIQQGHFPNQVIYQHIMVAHEKKEELEECKELVNEMQKIGCAPDLSIYNTVIRLA 448

Query: 1178 CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKA 1357
            C  G + + V ++W EM ++ + P + T+ +++    +   L EAC+Y  EMV + L  A
Sbjct: 449  CKLGEVKEGV-RLWNEMESSGLSPSIDTFVIMINGFLEQGCLVEACEYFKEMVGRGLFAA 507

Query: 1358 KQYGITYRIWXXXXXXXXXXXXISMWECLRKNER-HVNVFAFSSWIEVLCETGFVLKASG 1534
             QYG    +                W C+  ++   +NV A++ WI  L   G V +A  
Sbjct: 508  PQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFSKGHVKEACS 567

Query: 1535 YCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
            +C  M++  ++P P  +  LM  L++ Y R    EI  K+R   A++++  K
Sbjct: 568  FCIAMMDKDLMPQPDTFAKLMRGLKKLYNREFAAEITEKVRKMAADRKITFK 619


>ref|XP_002877696.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297323534|gb|EFH53955.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 1188

 Score =  280 bits (717), Expect = 1e-72
 Identities = 156/533 (29%), Positives = 272/533 (51%), Gaps = 3/533 (0%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            ++I+R+L+ Y     +     +E +L    + + P              G + YRF++WA
Sbjct: 71   EKIYRILRNYHSRVPK-----LELSLNESGIDLRPGLIVRVLSRCGDA-GNLGYRFFLWA 124

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            ++QP +  + +   ++++++ ++R+   V+ L++EMR+  P+      F VL  R+   +
Sbjct: 125  TKQPGYCHSYEVCKSMVKILSKMRQFGAVWGLIEEMRKENPELIEPELFVVLIRRFASAN 184

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  AV  L  + K G  PD  +F CLL   C+      A  +F DM  K  P+ R +  
Sbjct: 185  MVKKAVEVLDEMPKYGFEPDEYVFGCLLDALCKNGSVKDASKVFEDMREKIPPNLRYFTS 244

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    R G+L  A+ +  ++ ++G    + V+  +L+GY+  GK  ++  +LN+MR  G
Sbjct: 245  LLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLLNDMRKRG 304

Query: 821  YGVNAKAITIGVHALC-SEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQA 997
            Y  NA   T+ + ALC +E R+DEA  +  +M++ GC  D  TY  LI   C    I + 
Sbjct: 305  YEPNANCYTVLIQALCRTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKG 364

Query: 998  YEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRI 1177
            Y V+D M K+G+ P+ +TY+ ++ AH+ K+ F   ++L+EKM + GC P+L +Y V+IR+
Sbjct: 365  YSVLDDMRKKGVMPSQVTYMQILVAHEKKEQFEECLELIEKMKQIGCHPDLLIYNVVIRL 424

Query: 1178 CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKA 1357
                R   + +++W EM  N + P    + +++       YL EAC +  EMV + +  A
Sbjct: 425  ACNFREVKEAVRLWNEMEANGLSPGADMFVIMINGFTSQGYLIEACSHFKEMVSRGIFSA 484

Query: 1358 KQYGITYRIWXXXXXXXXXXXXISMWECL--RKNERHVNVFAFSSWIEVLCETGFVLKAS 1531
             QYG    +               +W C+  + +   +NV A++ WI  L   G V +A 
Sbjct: 485  PQYGTLKSLLNTLLRDDKLEMAKDVWSCISNKTSSCELNVSAWTIWIHALFAKGHVKEAC 544

Query: 1532 GYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
             YC DM+E+ ++P P  Y  LM  L + Y R    EI  K+    +E+ +  K
Sbjct: 545  SYCLDMMEMDLMPQPNTYVKLMKGLNKLYNRTIAAEITEKVMKMASEREMSFK 597


>ref|XP_002302657.2| hypothetical protein POPTR_0002s17660g [Populus trichocarpa]
            gi|550345236|gb|EEE81930.2| hypothetical protein
            POPTR_0002s17660g [Populus trichocarpa]
          Length = 495

 Score =  278 bits (711), Expect = 7e-72
 Identities = 146/455 (32%), Positives = 248/455 (54%)
 Frame = +2

Query: 326  LLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGHLLTIAVNALYNLEKV 505
            +++V+ ++++   V+ALL+EMRR      T   F V+  R+    ++  A+  L  + K 
Sbjct: 1    MIKVLSKMKQFGAVWALLEEMRRDNSVLITSEVFVVVMRRFASSRMVNKAIEVLDEMPKY 60

Query: 506  GCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAILISCLARNGRLEHAR 685
            GC PD  +F CLL   C+     +A +LF DM V+F PS +++  L+    + G+L  A+
Sbjct: 61   GCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFSPSLKHFTCLLYGWCKEGKLLEAK 120

Query: 686  HLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMGYGVNAKAITIGVHAL 865
            H+  ++ ++G    + VY+ +L+GY+  GK  ++  +L E+R  G   NA + TI + AL
Sbjct: 121  HVLVQMREAGFEPDIVVYNNLLSGYATAGKMGDAFDLLKEIRRKGCDPNATSYTILIQAL 180

Query: 866  CSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQAYEVMDAMVKQGISPNH 1045
            C + ++DEA  +  +M++SGC  D  TY  L+   C  + I + Y+++ +M+++G  PN 
Sbjct: 181  CGQEKMDEAMRVFVEMERSGCDADVVTYTALVSGFCKWRMIDKGYQILQSMIQKGHMPNQ 240

Query: 1046 MTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRICDKGRLPDQVLKMWKE 1225
            +TYL LM AH+ K+      +LM +M K GC P+L++Y V+IR+  K    +  +  W E
Sbjct: 241  LTYLHLMLAHEKKEELEECKELMGEMQKIGCIPDLSIYNVVIRLACKLGEVNAGVDAWNE 300

Query: 1226 MVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKAKQYGITYRIWXXXXXX 1405
            M  + + P L T+ +++       YL EAC+Y  EMVE+ LL ++QYGI   +       
Sbjct: 301  MEVSGLSPGLDTFVIMINGFLGHGYLVEACQYFKEMVERGLLSSRQYGILKDLLNALLRG 360

Query: 1406 XXXXXXISMWECLRKNERHVNVFAFSSWIEVLCETGFVLKASGYCYDMVELVILPPPRIY 1585
                    +W C+      +NV +++ WI  L   G V +A  YC DM++  ++P P  +
Sbjct: 361  EKLELAKDLWSCIVTKGCELNVDSWTIWIHALFSNGHVKEACSYCLDMMDADLMPKPETF 420

Query: 1586 QLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
              LM  L + Y R+   EI  K+R   A++ +  K
Sbjct: 421  AKLMRGLRKLYNRQFAAEITEKVRKMAADRHVTFK 455


>ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Capsella rubella]
            gi|482561089|gb|EOA25280.1| hypothetical protein
            CARUB_v10018595mg [Capsella rubella]
          Length = 639

 Score =  278 bits (710), Expect = 9e-72
 Identities = 156/533 (29%), Positives = 273/533 (51%), Gaps = 3/533 (0%)
 Frame = +2

Query: 101  DEIFRVLQKYQRHRARTARGAMENALENLRVQVTPXXXXXXXXXXXXXYGGISYRFYVWA 280
            D+I+R+L+ Y     +     +E AL    + + P              G + YRF++WA
Sbjct: 69   DKIYRILRNYHSRVPK-----LELALNESSIDLRPGLIVRVLSRCGDA-GNLGYRFFLWA 122

Query: 281  SRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAFKVLAERYNHGH 460
            ++QP +  + +   ++++V+ ++R+   V+ L++EMR+  P+      F +L  R+   +
Sbjct: 123  AKQPGYCHSYEVCKSMVKVLSKMRQFGAVWGLIEEMRKENPELIEPELFVILMRRFASAN 182

Query: 461  LLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMVKFEPSYRNYAI 640
            ++  AV  L  + K G  PD  +F CLL   C+      A  LF DM  K+ P+ R +  
Sbjct: 183  MVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKDASKLFEDMKEKYPPNLRYFTS 242

Query: 641  LISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATESVKMLNEMRDMG 820
            L+    R G+L  A+ +  ++ ++G    + V+  +L+GY+  GK  ++  ++ +MR  G
Sbjct: 243  LLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDMRKRG 302

Query: 821  YGVNAKAITIGVHALC-SEGRIDEAFGMIEDMKKSGCFPDATTYNTLIETLCDGKKIKQA 997
            Y  NA   T+ + ALC +E R+DEA  +  +M++ GC  D  TY  LI   C  + I + 
Sbjct: 303  YEPNANCYTVLIQALCKTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWEMIDKG 362

Query: 998  YEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPNLAVYKVIIRI 1177
            Y V+D M K+G+ P+ +TY+ +M AH+ K+ F   + L+EKM + GC+ +L +Y V+IR+
Sbjct: 363  YSVLDDMRKKGVIPSQVTYMQIMVAHEKKEQFEECLDLIEKMKQIGCQLDLLIYNVVIRL 422

Query: 1178 CDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHNEMVEKDLLKA 1357
              K     + +++W EM  N + P + T+ +++        L EAC +  EMV + +  A
Sbjct: 423  ACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGCLVEACNHFKEMVSRGIFSA 482

Query: 1358 KQYGITYRIWXXXXXXXXXXXXISMWECL--RKNERHVNVFAFSSWIEVLCETGFVLKAS 1531
             QYG    +               +W C+  + +   +NV A++ WI  L   G V +A 
Sbjct: 483  PQYGTLKLLLNNLVRDEKLEMAKDVWSCISNKSSSCELNVSAWTIWIHALLAKGHVKEAC 542

Query: 1532 GYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPIK 1690
             YC DM+++ ++P P  Y  LM  L + Y R    EI  K+    +E+ +  K
Sbjct: 543  SYCLDMMKMDLMPQPDTYVKLMKGLNKLYNRTIAAEITEKVMKMASEREMSFK 595


>gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 647

 Score =  276 bits (706), Expect = 2e-71
 Identities = 141/481 (29%), Positives = 253/481 (52%)
 Frame = +2

Query: 248  GGISYRFYVWASRQPDFKPTVDSEPALLRVICRIRESLWVFALLQEMRRTEPKFGTYRAF 427
            G + Y+F+ WAS+QP + P+ +   A+++++ ++R+   V+AL++E++R  P F T   F
Sbjct: 127  GNLGYKFFTWASKQPGYHPSYEIYKAMIKILGKMRQFGAVWALIEEIKRENPHFITAELF 186

Query: 428  KVLAERYNHGHLLTIAVNALYNLEKVGCPPDVGLFSCLLGMYCEENKSNKAMALFWDMMV 607
             +L  R+    ++  A+     + K GC  D  +F  LL   C+     +A  +F +M V
Sbjct: 187  ILLIRRFASSRMVKKAIEVFDEMPKYGCLQDDAVFGSLLDALCKNGNVKEAALVFEEMRV 246

Query: 608  KFEPSYRNYAILISCLARNGRLEHARHLHRKLLDSGACLTMDVYHEILTGYSKEGKATES 787
            +F P+ +++  L+    + GR+  A+H+  ++ ++G    + V++ +L+GY    K  ++
Sbjct: 247  RFLPNLKHFTSLLYGWCKEGRILEAKHVLVQMKEAGFEPDIVVFNNLLSGYVLGNKMGDA 306

Query: 788  VKMLNEMRDMGYGVNAKAITIGVHALCSEGRIDEAFGMIEDMKKSGCFPDATTYNTLIET 967
              +L EMR  G   NA + TI +  LC   R++EA  +  DM+++GC  D   Y TLI  
Sbjct: 307  FDLLKEMRKKGIDPNANSYTIVIQGLCKADRMEEAMRVFVDMERNGCRGDVVVYTTLISG 366

Query: 968  LCDGKKIKQAYEVMDAMVKQGISPNHMTYLVLMRAHQNKQGFWPAMQLMEKMSKQGCRPN 1147
             C   ++++ YEV+D M+ +G+ PN +TYL +M AH+ K      ++LME+M K GC P+
Sbjct: 367  FCKWGRVEKGYEVLDRMISEGLMPNSLTYLHIMLAHEKKDELEECLELMEEMRKIGCVPD 426

Query: 1148 LAVYKVIIRICDKGRLPDQVLKMWKEMVTNDVVPDLITYKMLLCCLHDANYLTEACKYHN 1327
              +Y V++R+  K     +  ++W EM      P +  + +++        L EAC+Y  
Sbjct: 427  GGIYNVVVRLACKLEEVKEAARVWNEMEGRGFSPGVDNFIVMIHGFIGQGCLVEACEYFK 486

Query: 1328 EMVEKDLLKAKQYGITYRIWXXXXXXXXXXXXISMWECLRKNERHVNVFAFSSWIEVLCE 1507
            EM  + L    QYGI   +              ++W C+      +NV A++ W+  L  
Sbjct: 487  EMAGRGLFCVPQYGILKDLLNSLLRAEKLEMAKNVWSCIVSKGCELNVSAWTIWVHALFS 546

Query: 1508 TGFVLKASGYCYDMVELVILPPPRIYQLLMYKLEEHYGRRAKVEIRIKLRTRMAEKRLPI 1687
             G V +A  YC +M+++ ++P P  +  LM  L + Y R+   EI  K+R   A++ +  
Sbjct: 547  KGHVKEACSYCLEMMDVDVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMAADREITF 606

Query: 1688 K 1690
            K
Sbjct: 607  K 607


Top