BLASTX nr result

ID: Catharanthus23_contig00006126 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00006126
         (2387 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat...   890   0.0  
emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]   890   0.0  
ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containi...   882   0.0  
gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]     870   0.0  
ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containi...   841   0.0  
ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containi...   841   0.0  
ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citr...   834   0.0  
ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat...   833   0.0  
ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat...   831   0.0  
ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containi...   826   0.0  
ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat...   818   0.0  
ref|XP_002530608.1| pentatricopeptide repeat-containing protein,...   799   0.0  
gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protei...   799   0.0  
gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlise...   783   0.0  
ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutr...   769   0.0  
ref|XP_002866691.1| pentatricopeptide repeat-containing protein ...   757   0.0  
ref|NP_201383.1| pentatricopeptide repeat-containing protein [Ar...   756   0.0  
ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutr...   755   0.0  
ref|NP_190542.4| pentatricopeptide repeat-containing protein [Ar...   752   0.0  
emb|CAB66911.1| putative protein [Arabidopsis thaliana]               752   0.0  

>ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like [Vitis vinifera]
          Length = 622

 Score =  890 bits (2300), Expect = 0.0
 Identities = 427/528 (80%), Positives = 471/528 (89%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELALQESG  V SGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHS+ VY+
Sbjct: 67   KFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYEVYK 126

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIKILGKMRQFGA+WALIEEMR+ENP  +S  VFVVLMRRFASARMVKKAIEVLDEMPK
Sbjct: 127  AMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKKAIEVLDEMPK 186

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDEHVFGCLLDALCKNGSVKEAA LFEDM +RFTPT+KHFTSLLYGWC+EGKLMEA
Sbjct: 187  YGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYGWCREGKLMEA 246

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K+VLV++REAGFEPDIVVYNNLL GY+ AGKM DA+DLL+EM+RK CEPN  SFT ++QA
Sbjct: 247  KYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPNVMSFTTLIQA 306

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LCA+ KMEEAMRVF EM+  GC AD VTYTTLISGFCKWGKI +GY+LL++MIQ+GH PN
Sbjct: 307  LCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLDNMIQQGHIPN 366

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +YL+I+ AH               M+KIG  PDL I+N VIRLACKLGEIKE +R W 
Sbjct: 367  PMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLGEIKEGVRVWN 426

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E  G+SPG+DTFVI+I+G + Q CLVEAC++FKEMVGRGLLS+PQYGTLK+LLNSLLR
Sbjct: 427  EMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGTLKELLNSLLR 486

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A+KLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT
Sbjct: 487  AEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 546

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGLRKLYNRQ AAEITEKVRKMAAER+MTFKMYKRRGER+LKE
Sbjct: 547  FAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMTFKMYKRRGERNLKE 594


>emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]
          Length = 655

 Score =  890 bits (2300), Expect = 0.0
 Identities = 427/528 (80%), Positives = 471/528 (89%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELALQESG  V SGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHS+ VY+
Sbjct: 100  KFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYEVYK 159

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIKILGKMRQFGA+WALIEEMR+ENP  +S  VFVVLMRRFASARMVKKAIEVLDEMPK
Sbjct: 160  AMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKKAIEVLDEMPK 219

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDEHVFGCLLDALCKNGSVKEAA LFEDM +RFTPT+KHFTSLLYGWC+EGKLMEA
Sbjct: 220  YGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYGWCREGKLMEA 279

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K+VLV++REAGFEPDIVVYNNLL GY+ AGKM DA+DLL+EM+RK CEPN  SFT ++QA
Sbjct: 280  KYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPNVMSFTTLIQA 339

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LCA+ KMEEAMRVF EM+  GC AD VTYTTLISGFCKWGKI +GY+LL++MIQ+GH PN
Sbjct: 340  LCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLDNMIQQGHIPN 399

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +YL+I+ AH               M+KIG  PDL I+N VIRLACKLGEIKE +R W 
Sbjct: 400  PMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLGEIKEGVRVWN 459

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E  G+SPG+DTFVI+I+G + Q CLVEAC++FKEMVGRGLLS+PQYGTLK+LLNSLLR
Sbjct: 460  EMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGTLKELLNSLLR 519

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A+KLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT
Sbjct: 520  AEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 579

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGLRKLYNRQ AAEITEKVRKMAAER+MTFKMYKRRGER+LKE
Sbjct: 580  FAKLMRGLRKLYNRQIAAEITEKVRKMAAEREMTFKMYKRRGERNLKE 627


>ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Solanum tuberosum]
          Length = 625

 Score =  882 bits (2279), Expect = 0.0
 Identities = 421/528 (79%), Positives = 468/528 (88%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL ESG    SGLTERVLNRCGDAGNLGYRFFVW SKQPGYRHSH  Y+
Sbjct: 70   KFHSRVPKLELALLESGVVARSGLTERVLNRCGDAGNLGYRFFVWVSKQPGYRHSHDAYK 129

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIKILGKMRQFG +WAL+EEMR ENP  L+ EVF+VLMRRFAS RMVKKAIEVLDEMPK
Sbjct: 130  AMIKILGKMRQFGTVWALVEEMRIENPQFLTPEVFIVLMRRFASGRMVKKAIEVLDEMPK 189

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YG EPDE+VFGCLLDALCKNGSVKEAA LF++M  RF+PTIKHFTSLLYGWCKEGKL+EA
Sbjct: 190  YGVEPDEYVFGCLLDALCKNGSVKEAAALFDEMRFRFSPTIKHFTSLLYGWCKEGKLIEA 249

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLVKMREAGFEPDIVVYNNLLNGY+V+ KMADAFDLLQEM+RKGC PN TSFTI++QA
Sbjct: 250  KVVLVKMREAGFEPDIVVYNNLLNGYAVSRKMADAFDLLQEMRRKGCNPNETSFTIVIQA 309

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC Q+KMEEAMRVF +MERSGCE DVVTYTTLISGFCKWGKI++GY+L+++M+QKG+ PN
Sbjct: 310  LCLQDKMEEAMRVFLDMERSGCEGDVVTYTTLISGFCKWGKIEKGYELVDTMLQKGYNPN 369

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
             T+YL+I+LAH               M KIG+ PD +I+N VIRLACKLGEI E +R W 
Sbjct: 370  QTTYLHIMLAHEKKEELEECLELVKEMGKIGIPPDHSIYNIVIRLACKLGEIDEGVRVWN 429

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            QIE NG+SPGVDTF+I+ING VEQG L+EACD+FKEM+GRGLLS+PQYGTLKDLLNSLLR
Sbjct: 430  QIEANGISPGVDTFIIMINGFVEQGRLIEACDHFKEMIGRGLLSAPQYGTLKDLLNSLLR 489

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A+KLE+ KDVWSCIMTKGC+LNV AWTIWIHALFSNGHVKEAC+YCLDMMDAG+MPQPDT
Sbjct: 490  AEKLELCKDVWSCIMTKGCELNVSAWTIWIHALFSNGHVKEACAYCLDMMDAGLMPQPDT 549

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLM+GLRKLYNR+ AAEITEK RKMA +R MTFKMYKRRGERDLKE
Sbjct: 550  FAKLMKGLRKLYNREIAAEITEKARKMAEQRNMTFKMYKRRGERDLKE 597


>gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]
          Length = 638

 Score =  870 bits (2249), Expect = 0.0
 Identities = 408/528 (77%), Positives = 472/528 (89%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRV KLELALQESG  + SGLTERVL RCGDAG+LGYRFFVWASKQPGYR S+ VY+
Sbjct: 83   KFHSRVSKLELALQESGVVLRSGLTERVLGRCGDAGSLGYRFFVWASKQPGYRPSYEVYK 142

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMI+ LGKMRQFGA+WAL+EEMRKENP L++ E+FVVLMRRFASARMVKKA+EV DEMPK
Sbjct: 143  AMIRALGKMRQFGAVWALLEEMRKENPQLITPEIFVVLMRRFASARMVKKAVEVFDEMPK 202

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDEHVFGCLLDALCKNGSVKEAA LFE+M ++FTP++KHFTSLLYGWC+EGKLMEA
Sbjct: 203  YGCEPDEHVFGCLLDALCKNGSVKEAASLFEEMRVKFTPSLKHFTSLLYGWCREGKLMEA 262

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            KFVLV+M+EAGFEPD+VVYNNLL GY+ AGKMADA+DL++EM+ KGC PNA S+T+++QA
Sbjct: 263  KFVLVQMKEAGFEPDVVVYNNLLGGYAQAGKMADAYDLMKEMRGKGCSPNAASYTVLIQA 322

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC + KMEEAMRVF EM+RSGC+ADV+TYTTLISGFCKWGKI+RGY++L+SMIQ+G +PN
Sbjct: 323  LCKREKMEEAMRVFVEMQRSGCDADVMTYTTLISGFCKWGKIERGYEILDSMIQRGFSPN 382

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
             T+YL+I+LAH               M+KIG VPDL I+NTVIRLACKL E+KE +R W 
Sbjct: 383  ETTYLHIMLAHEKKEEFEECVELIGEMRKIGCVPDLKIYNTVIRLACKLREVKEGVRLWN 442

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            +IE +G+SPG+DTFV++I+G + QGCL+EAC YFKEMV RGLLS PQYGTLK+LLN+LLR
Sbjct: 443  EIEASGLSPGLDTFVVMIHGFLGQGCLIEACQYFKEMVERGLLSGPQYGTLKELLNALLR 502

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            ADKLEM+KDVW+CI+ KGC++NVYAWTIWIHALF NGHVKEACSYCLDMMDA VMPQPDT
Sbjct: 503  ADKLEMAKDVWTCIVNKGCEINVYAWTIWIHALFKNGHVKEACSYCLDMMDADVMPQPDT 562

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGL+KLYNRQ AAEITEKVRKMA +RQMTFKMYKRRGERDLKE
Sbjct: 563  FAKLMRGLKKLYNRQIAAEITEKVRKMAEDRQMTFKMYKRRGERDLKE 610


>ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Cucumis sativus]
          Length = 664

 Score =  841 bits (2172), Expect = 0.0
 Identities = 392/528 (74%), Positives = 462/528 (87%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K H+RVPKLELALQESG  + SGL ERVL+RCGDAGNLGYRFFVWASKQPGYRHS+ VY+
Sbjct: 109  KFHTRVPKLELALQESGVIMRSGLPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYK 168

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIK LGKMRQFGA+WALIEEMRKENP++L+ EVF+VLMRRFAS RMVKKA+EVLDEMPK
Sbjct: 169  AMIKTLGKMRQFGAVWALIEEMRKENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPK 228

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE+VFGCLLDALCKNGSVKEAA LFEDM +RF P ++HFTSLLYGWC+EGK+MEA
Sbjct: 229  YGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEA 288

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLV+++EAGFEPDIVVYNNLL GY+ AGKM DAFDLL EMK+  C PNA SFTI++Q+
Sbjct: 289  KHVLVQIKEAGFEPDIVVYNNLLGGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQS 348

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
             C   KM+EAMR+F+EM+ SGCEADVVTYTTLISGFCKWG  D+ Y++L+ MIQKGH P+
Sbjct: 349  FCKTEKMDEAMRIFTEMQGSGCEADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPS 408

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              SYL I++AH               M+KIG VPDL I+NT+IRL CKLG++KEA+R W 
Sbjct: 409  QLSYLCIMMAHEKKEELEECMELIEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWG 468

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            +++  G++PG+DT++++++G + QGCLVEACDYFKEMV RGLLS+PQYGTLK+L N+LLR
Sbjct: 469  EMQAGGLNPGLDTYILMVHGFLSQGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLR 528

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A+KLEM+K++WSC+ TKGC+LNV AWTIWIHALFSNGHVKEACSYCLDMMDA +MPQPDT
Sbjct: 529  AEKLEMAKNMWSCMTTKGCELNVSAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDT 588

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGL+KL++RQ A EITEKVRKMAA+RQ+TFKMYKRRGERDLKE
Sbjct: 589  FAKLMRGLKKLFHRQLAVEITEKVRKMAADRQITFKMYKRRGERDLKE 636


>ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Cucumis sativus]
          Length = 641

 Score =  841 bits (2172), Expect = 0.0
 Identities = 392/528 (74%), Positives = 462/528 (87%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K H+RVPKLELALQESG  + SGL ERVL+RCGDAGNLGYRFFVWASKQPGYRHS+ VY+
Sbjct: 86   KFHTRVPKLELALQESGVIMRSGLPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYK 145

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIK LGKMRQFGA+WALIEEMRKENP++L+ EVF+VLMRRFAS RMVKKA+EVLDEMPK
Sbjct: 146  AMIKTLGKMRQFGAVWALIEEMRKENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPK 205

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE+VFGCLLDALCKNGSVKEAA LFEDM +RF P ++HFTSLLYGWC+EGK+MEA
Sbjct: 206  YGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEA 265

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLV+++EAGFEPDIVVYNNLL GY+ AGKM DAFDLL EMK+  C PNA SFTI++Q+
Sbjct: 266  KHVLVQIKEAGFEPDIVVYNNLLGGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQS 325

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
             C   KM+EAMR+F+EM+ SGCEADVVTYTTLISGFCKWG  D+ Y++L+ MIQKGH P+
Sbjct: 326  FCKTEKMDEAMRIFTEMQGSGCEADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPS 385

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              SYL I++AH               M+KIG VPDL I+NT+IRL CKLG++KEA+R W 
Sbjct: 386  QLSYLCIMMAHEKKEELEECMELIEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWG 445

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            +++  G++PG+DT++++++G + QGCLVEACDYFKEMV RGLLS+PQYGTLK+L N+LLR
Sbjct: 446  EMQAGGLNPGLDTYILMVHGFLSQGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLR 505

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A+KLEM+K++WSC+ TKGC+LNV AWTIWIHALFSNGHVKEACSYCLDMMDA +MPQPDT
Sbjct: 506  AEKLEMAKNMWSCMTTKGCELNVSAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDT 565

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGL+KL++RQ A EITEKVRKMAA+RQ+TFKMYKRRGERDLKE
Sbjct: 566  FAKLMRGLKKLFHRQLAVEITEKVRKMAADRQITFKMYKRRGERDLKE 613


>ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citrus clementina]
            gi|557528135|gb|ESR39385.1| hypothetical protein
            CICLE_v10025134mg [Citrus clementina]
          Length = 638

 Score =  834 bits (2155), Expect = 0.0
 Identities = 394/528 (74%), Positives = 461/528 (87%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSR+PKLELALQ SG  +  GLTERV+NRCGDAGNLGYR+++WASKQP Y HS+ VYR
Sbjct: 83   KFHSRLPKLELALQHSGVVLRPGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYR 142

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            A+IK L KMR+FGA+WAL+EEMRKE P L++ EVFV+LMRRFASARMVKKAIEVLDEMPK
Sbjct: 143  ALIKSLSKMRKFGAVWALMEEMRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPK 202

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE VFGCLLDALCKN SVKEAA LF++M  RF P+++HFTSLLYGWCKEGKL+EA
Sbjct: 203  YGCEPDEFVFGCLLDALCKNSSVKEAAKLFDEMRERFKPSLRHFTSLLYGWCKEGKLVEA 262

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K+VLV+M++AGFEPDIVVYNNLL+GY+  GKM DAF+LL+EM+RKGC+PNA S+T+++QA
Sbjct: 263  KYVLVQMKDAGFEPDIVVYNNLLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQA 322

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC   KMEEA R F EMERSGCEADVVTYTTLISGFCK  KIDR Y++L+SMIQ+G  PN
Sbjct: 323  LCRMEKMEEANRAFVEMERSGCEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPN 382

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +YL+I+LAH               M+KIG VPD++ +N VIRLACKLGE+KEA+  W 
Sbjct: 383  QLTYLHIMLAHEKKEELEECVELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWN 442

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E   +SPG D+FV++++G + QGCL+EAC+YFKEMVGRGLLS+PQYGTLK+LLNSLLR
Sbjct: 443  EMEAASLSPGTDSFVVMVHGFLGQGCLIEACEYFKEMVGRGLLSAPQYGTLKELLNSLLR 502

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A K+EM+KDVWSCI+TKGC+LNVYAWTIWIH+LFSNGHVKEACSYCLDMMDA VMPQPDT
Sbjct: 503  AQKVEMAKDVWSCIVTKGCELNVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDT 562

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGL+KLYNRQ AAEITEKVRKMAAERQ+TFKMYKRRGERDLKE
Sbjct: 563  FAKLMRGLKKLYNRQIAAEITEKVRKMAAERQITFKMYKRRGERDLKE 610


>ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like isoform X1 [Cicer arietinum]
            gi|502165084|ref|XP_004513408.1| PREDICTED: putative
            pentatricopeptide repeat-containing protein
            At5g65820-like isoform X2 [Cicer arietinum]
          Length = 655

 Score =  833 bits (2153), Expect = 0.0
 Identities = 390/529 (73%), Positives = 460/529 (86%), Gaps = 1/529 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL+ESG  VSSGLTERVLNRCG++GNL YRFF WASKQ GYRHS  VY+
Sbjct: 101  KYHSRVPKLELALKESGVVVSSGLTERVLNRCGNSGNLAYRFFSWASKQSGYRHSEEVYK 160

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIK+L KMRQFGA+WALI+EMR ENP L+S  VFV+LMRRFASARMV KAIEVLDEMPK
Sbjct: 161  AMIKVLSKMRQFGAVWALIDEMRLENPQLISPHVFVILMRRFASARMVHKAIEVLDEMPK 220

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE+VFGCLLDALCKNGS+KEAA LFEDM  RF PT+KHFTSLLYGWCKEGKL+EA
Sbjct: 221  YGCEPDEYVFGCLLDALCKNGSIKEAASLFEDMRYRFPPTVKHFTSLLYGWCKEGKLVEA 280

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLV+M++AG EPDIVV+NNLL GY+  GKMADA+DLL+EMKRKGCEPNA S+TI++Q+
Sbjct: 281  KHVLVQMKDAGIEPDIVVFNNLLGGYAQGGKMADAYDLLKEMKRKGCEPNAASYTILIQS 340

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC   K+EEAMR+F EM+R+ C+ DV+TYTTLISGFCKWGKI RGY+LL+ MIQ+GH+PN
Sbjct: 341  LCKHEKLEEAMRIFVEMQRNDCQMDVITYTTLISGFCKWGKIKRGYELLDQMIQEGHSPN 400

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +YL+I+LAH               M+KIG VP+L I+NTVIRLACK GE+K+ +R W 
Sbjct: 401  QLTYLHIMLAHEKKEELEECMELVNEMKKIGCVPNLNIYNTVIRLACKFGEVKQGVRLWN 460

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E +G+SPG DTFV++ING +EQ CL+EAC+YFKEMVGRGL ++PQYGTLK+L+NSLLR
Sbjct: 461  EMEASGLSPGTDTFVVMINGFLEQDCLIEACEYFKEMVGRGLFAAPQYGTLKELMNSLLR 520

Query: 1263 ADKLEMSKDVWSCI-MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
            A+KLEM+KD W+CI  +K C++NV AWTIWIHALFS GHVKEACS+C+DMMD  +MPQPD
Sbjct: 521  AEKLEMAKDTWNCITASKSCEMNVAAWTIWIHALFSKGHVKEACSFCIDMMDNDLMPQPD 580

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            TFAKL+RGL+KLYNR+FAAEITEKVRKMAA+R +TFKMYKRRGERDLKE
Sbjct: 581  TFAKLIRGLKKLYNREFAAEITEKVRKMAADRHITFKMYKRRGERDLKE 629


>ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like [Citrus sinensis]
          Length = 638

 Score =  831 bits (2147), Expect = 0.0
 Identities = 393/528 (74%), Positives = 460/528 (87%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSR+PKLELALQ SG  +  GLTERV+NRCGDAGNLGYR+++WASKQP Y HS+ VYR
Sbjct: 83   KFHSRLPKLELALQHSGVVLRPGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYR 142

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            A+IK L KMR+FGA+WAL+EEMRKE P L++ EVFV+LMRRFASARMVKKAIEVLDEMPK
Sbjct: 143  ALIKSLSKMRKFGAVWALMEEMRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPK 202

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE VFGCLLDALCKN SVKEAA LF+++  RF P+++HFTSLLYGWCKEGKL+EA
Sbjct: 203  YGCEPDEFVFGCLLDALCKNSSVKEAAKLFDEIRERFKPSLRHFTSLLYGWCKEGKLVEA 262

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K+VLV+M++AGFEPDIVVYNNLL+GY+  GKM DAF+LL+EM+RKGC+PNA S+T+++QA
Sbjct: 263  KYVLVQMKDAGFEPDIVVYNNLLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQA 322

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC   KMEEA R F EMERSGCEADVVTYTTLISGFCK  KIDR Y++L+SMIQ+G  PN
Sbjct: 323  LCRMEKMEEANRAFVEMERSGCEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPN 382

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +YL+I+LAH               M+KIG VPD++ +N VIRLACKLGE+KEA+  W 
Sbjct: 383  QLTYLHIMLAHEKKEELEECVELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWN 442

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E   +SPG D+FV++++G + QGCL+EAC+YFKEMVGRGLLS+PQYGTLK LLNSLLR
Sbjct: 443  EMEAASLSPGTDSFVVMVHGFLGQGCLIEACEYFKEMVGRGLLSAPQYGTLKALLNSLLR 502

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A K+EM+KDVWSCI+TKGC+LNVYAWTIWIH+LFSNGHVKEACSYCLDMMDA VMPQPDT
Sbjct: 503  AQKVEMAKDVWSCIVTKGCELNVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDT 562

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGL+KLYNRQ AAEITEKVRKMAAERQ+TFKMYKRRGERDLKE
Sbjct: 563  FAKLMRGLKKLYNRQIAAEITEKVRKMAAERQITFKMYKRRGERDLKE 610


>ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            isoform X1 [Glycine max] gi|571514894|ref|XP_006597171.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g49730-like isoform X2 [Glycine max]
            gi|571514897|ref|XP_006597172.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g49730-like isoform X3 [Glycine max]
          Length = 654

 Score =  826 bits (2133), Expect = 0.0
 Identities = 389/529 (73%), Positives = 457/529 (86%), Gaps = 1/529 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL+ESG  V  GLTERVL+RCGDAGNL YRF+ WASKQ G+R  H  Y+
Sbjct: 101  KYHSRVPKLELALRESGVVVRPGLTERVLSRCGDAGNLAYRFYSWASKQSGHRLDHDAYK 160

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIK+L +MRQFGA+WALIEEMR+ENPHL++ +VFV+LMRRFASARMV KA+EVLDEMPK
Sbjct: 161  AMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFVILMRRFASARMVHKAVEVLDEMPK 220

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE+VFGCLLDALCKNGSVKEAA LFEDM  R+ P++KHFTSLLYGWCKEGKLMEA
Sbjct: 221  YGCEPDEYVFGCLLDALCKNGSVKEAASLFEDMRYRWKPSVKHFTSLLYGWCKEGKLMEA 280

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLV+M++ G EPDIVVYNNLL GY+ AGKM DA+DLL+EM+RK CEPNATS+T+++Q+
Sbjct: 281  KHVLVQMKDMGIEPDIVVYNNLLGGYAQAGKMGDAYDLLKEMRRKRCEPNATSYTVLIQS 340

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC   ++EEA R+F EM+ +GC+ADVVTY+TLISGFCKWGKI RGY+LL+ MIQ+GH PN
Sbjct: 341  LCKHERLEEATRLFVEMQTNGCQADVVTYSTLISGFCKWGKIKRGYELLDEMIQQGHFPN 400

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
               Y +I+LAH               MQKIG  PDL+I+NTVIRLACKLGE+KE I+ W 
Sbjct: 401  QVIYQHIMLAHEKKEELEECKELVNEMQKIGCAPDLSIYNTVIRLACKLGEVKEGIQLWN 460

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E +G+SPG+DTFVI+ING +EQGCLVEAC+YFKEMVGRGL ++PQYGTLK+L+NSLLR
Sbjct: 461  EMESSGLSPGMDTFVIMINGFLEQGCLVEACEYFKEMVGRGLFTAPQYGTLKELMNSLLR 520

Query: 1263 ADKLEMSKDVWSCI-MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
            A+KLEM+KD W+CI  +KGC LNV AWTIWIHALFS GHVKEACS+C+DMMD  +MP PD
Sbjct: 521  AEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFSKGHVKEACSFCIDMMDKDLMPNPD 580

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            TFAKLM GL+KLYNRQFAAEITEKVRKMAA+RQ+TFKMYKRRGERDLKE
Sbjct: 581  TFAKLMHGLKKLYNRQFAAEITEKVRKMAADRQITFKMYKRRGERDLKE 629


>ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like, partial [Glycine max]
          Length = 656

 Score =  818 bits (2114), Expect = 0.0
 Identities = 384/529 (72%), Positives = 457/529 (86%), Gaps = 1/529 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL+ESG  V  GLTERVLNRCGDAGNL YRF+ WASKQ G+R  H  Y+
Sbjct: 103  KYHSRVPKLELALRESGVVVRPGLTERVLNRCGDAGNLAYRFYSWASKQSGHRLDHDAYK 162

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIK+L +MRQFGA+WALIEEMR+ENPHL++ +VFV+LMRRFASARMV KA++VLDEMP 
Sbjct: 163  AMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFVILMRRFASARMVHKAVQVLDEMPN 222

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE+VFGCLLDAL KNGSVKEAA LFE++  R+ P++KHFTSLLYGWCKEGKLMEA
Sbjct: 223  YGCEPDEYVFGCLLDALRKNGSVKEAASLFEELRYRWKPSVKHFTSLLYGWCKEGKLMEA 282

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLV+M++AG EPDIVVYNNLL GY+ A KM DA+DLL+EM+RKGCEPNATS+T+++Q+
Sbjct: 283  KHVLVQMKDAGIEPDIVVYNNLLGGYAQADKMGDAYDLLKEMRRKGCEPNATSYTVLIQS 342

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC   ++EEA RVF EM+R+GC+AD+VTY+TLISGFCKWGKI RGY+LL+ MIQ+GH PN
Sbjct: 343  LCKHERLEEATRVFVEMQRNGCQADLVTYSTLISGFCKWGKIKRGYELLDEMIQQGHFPN 402

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
               Y +I++AH               MQKIG  PDL+I+NTVIRLACKLGE+KE +R W 
Sbjct: 403  QVIYQHIMVAHEKKEELEECKELVNEMQKIGCAPDLSIYNTVIRLACKLGEVKEGVRLWN 462

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E +G+SP +DTFVI+ING +EQGCLVEAC+YFKEMVGRGL ++PQYGTLK+L+NSLLR
Sbjct: 463  EMESSGLSPSIDTFVIMINGFLEQGCLVEACEYFKEMVGRGLFAAPQYGTLKELMNSLLR 522

Query: 1263 ADKLEMSKDVWSCI-MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
            A+KLEM+KD W+CI  +KGC LNV AWTIWIHALFS GHVKEACS+C+ MMD  +MPQPD
Sbjct: 523  AEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFSKGHVKEACSFCIAMMDKDLMPQPD 582

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            TFAKLMRGL+KLYNR+FAAEITEKVRKMAA+R++TFKMYKRRGERDLKE
Sbjct: 583  TFAKLMRGLKKLYNREFAAEITEKVRKMAADRKITFKMYKRRGERDLKE 631


>ref|XP_002530608.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529856|gb|EEF31788.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 596

 Score =  799 bits (2064), Expect = 0.0
 Identities = 372/503 (73%), Positives = 446/503 (88%)
 Frame = +3

Query: 9    HSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYRAM 188
            HSRVPKLELALQESG T+ +GLTERVLNRCGDAGNLGYRFFVWASKQPGYRHS+  Y+AM
Sbjct: 87   HSRVPKLELALQESGVTMRAGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYENYKAM 146

Query: 189  IKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPKYG 368
            +KI  KMRQFGA+WAL+EEMRK+N  L+++E+F+VL+RRFASAR+V+KAIEVLDEMPKYG
Sbjct: 147  VKIFSKMRQFGAVWALLEEMRKDNSVLITSELFIVLIRRFASARLVEKAIEVLDEMPKYG 206

Query: 369  CEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEAKF 548
            CEPDE+VFGCLLDALCKNGSVK+AA LFEDM +RF+P+++HFTSLLYGWC+EGKL+EAK 
Sbjct: 207  CEPDEYVFGCLLDALCKNGSVKQAASLFEDMRVRFSPSLRHFTSLLYGWCREGKLIEAKH 266

Query: 549  VLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQALC 728
            VLV+MREAGFEPDIVV+NNLL+ YS+AGKM DAFDLL+EM RKGCEPNA S+TI++QA C
Sbjct: 267  VLVQMREAGFEPDIVVFNNLLSAYSMAGKMTDAFDLLKEMVRKGCEPNANSYTIMIQAFC 326

Query: 729  AQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPNHT 908
            +Q KM+EAMRVF EMER+GCEADVVTYT LISGFCKWGKI+RGYQ+L++M QKGH PN  
Sbjct: 327  SQEKMDEAMRVFVEMERTGCEADVVTYTALISGFCKWGKINRGYQILDAMKQKGHMPNQL 386

Query: 909  SYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWRQI 1088
            +YL ILLAH               M+ +G VPDL+I+N VIRLACKLGE+K+ ++ W ++
Sbjct: 387  TYLRILLAHEKKEELEECLELIESMRMVGCVPDLSIYNVVIRLACKLGEVKQGVQIWNEM 446

Query: 1089 EVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLRAD 1268
            E +  SP +DTFVI+I+G + QGCLVEAC+YFKEM+GRGLL++PQYG LK+LLN+LLR +
Sbjct: 447  EASDFSPELDTFVIMIHGFLGQGCLVEACEYFKEMIGRGLLTTPQYGILKELLNALLRGE 506

Query: 1269 KLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFA 1448
            KL M+KDVWSCI+TKGC+LN  AWTIWIH+LFSNGHVKEACSYCLDMM+A +MP+P+TFA
Sbjct: 507  KLGMAKDVWSCIVTKGCELNADAWTIWIHSLFSNGHVKEACSYCLDMMEADIMPKPETFA 566

Query: 1449 KLMRGLRKLYNRQFAAEITEKVR 1517
            KLMRGLRKLYNR+FAAEITEK++
Sbjct: 567  KLMRGLRKLYNREFAAEITEKIK 589


>gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 647

 Score =  799 bits (2063), Expect = 0.0
 Identities = 372/528 (70%), Positives = 449/528 (85%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K H+RVPKL LALQ+SG     GLTERVLNRCGDAGNLGY+FF WASKQPGY  S+ +Y+
Sbjct: 92   KFHTRVPKLNLALQQSGVVFRPGLTERVLNRCGDAGNLGYKFFTWASKQPGYHPSYEIYK 151

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMIKILGKMRQFGA+WALIEE+++ENPH ++AE+F++L+RRFAS+RMVKKAIEV DEMPK
Sbjct: 152  AMIKILGKMRQFGAVWALIEEIKRENPHFITAELFILLIRRFASSRMVKKAIEVFDEMPK 211

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGC  D+ VFG LLDALCKNG+VKEAAL+FE+M +RF P +KHFTSLLYGWCKEG+++EA
Sbjct: 212  YGCLQDDAVFGSLLDALCKNGNVKEAALVFEEMRVRFLPNLKHFTSLLYGWCKEGRILEA 271

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VLV+M+EAGFEPDIVV+NNLL+GY +  KM DAFDLL+EM++KG +PNA S+TI++Q 
Sbjct: 272  KHVLVQMKEAGFEPDIVVFNNLLSGYVLGNKMGDAFDLLKEMRKKGIDPNANSYTIVIQG 331

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC  ++MEEAMRVF +MER+GC  DVV YTTLISGFCKWG++++GY++L+ MI +G  PN
Sbjct: 332  LCKADRMEEAMRVFVDMERNGCRGDVVVYTTLISGFCKWGRVEKGYEVLDRMISEGLMPN 391

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +YL+I+LAH               M+KIG VPD  I+N V+RLACKL E+KEA R W 
Sbjct: 392  SLTYLHIMLAHEKKDELEECLELMEEMRKIGCVPDGGIYNVVVRLACKLEEVKEAARVWN 451

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E  G SPGVD F+++I+G + QGCLVEAC+YFKEM GRGL   PQYG LKDLLNSLLR
Sbjct: 452  EMEGRGFSPGVDNFIVMIHGFIGQGCLVEACEYFKEMAGRGLFCVPQYGILKDLLNSLLR 511

Query: 1263 ADKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDT 1442
            A+KLEM+K+VWSCI++KGC+LNV AWTIW+HALFS GHVKEACSYCL+MMD  VMPQPDT
Sbjct: 512  AEKLEMAKNVWSCIVSKGCELNVSAWTIWVHALFSKGHVKEACSYCLEMMDVDVMPQPDT 571

Query: 1443 FAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            FAKLMRGLRKLYNRQ AAEITEKVRKMAA+R++TFKMYKRRG+RDLKE
Sbjct: 572  FAKLMRGLRKLYNRQIAAEITEKVRKMAADREITFKMYKRRGQRDLKE 619


>gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlisea aurea]
          Length = 593

 Score =  783 bits (2021), Expect = 0.0
 Identities = 382/530 (72%), Positives = 451/530 (85%), Gaps = 2/530 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K +S+VPKLELALQ SG +V SGLTERVLNRCGDAGNLGYRFFVWASKQPGY HSH VY+
Sbjct: 44   KFNSKVPKLELALQHSGVSVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYNHSHDVYK 103

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            AMI+ILGKMRQFGA+WALIEEMRKENP LL+ EVF+VLMRRFASARMVKKA+EVLDEMP 
Sbjct: 104  AMIRILGKMRQFGAVWALIEEMRKENPQLLTPEVFIVLMRRFASARMVKKAVEVLDEMPS 163

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            YGCEPDE+VFGCLLDALCKNGSVKEA+LL EDM MRF PT+KHFTSLL+GWC+EGKL+EA
Sbjct: 164  YGCEPDEYVFGCLLDALCKNGSVKEASLLMEDMQMRFKPTMKHFTSLLHGWCREGKLIEA 223

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K VL KMREAGF PDIVVYN LL GY+ AGK+ADA  LL EM+R  C P ATS+T ++++
Sbjct: 224  KTVLQKMREAGFLPDIVVYNTLLAGYAAAGKIADARHLLLEMRRNSCRPTATSYTAVIRS 283

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LCA+ KM EA+++FSEME  GCEADVV YTTLISGFCK GK  +GY+LL++MI+KG TPN
Sbjct: 284  LCAREKMAEAVQLFSEMEADGCEADVVAYTTLISGFCKRGKTGKGYELLDAMIRKGITPN 343

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
            +T+Y Y++ AH               M++IG+ PD  ++N VIRL+CKLGE+++ IR   
Sbjct: 344  NTTYSYLISAHEKEEELEECLGLAKSMRQIGVTPDSAVYNPVIRLSCKLGEVEDGIRLMN 403

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E +G+SPGVDTFVILINGL+  G L EAC  F+EMVGRGL+++PQYG LKDLLNSLLR
Sbjct: 404  EMEEDGISPGVDTFVILINGLILHGHLDEACLRFEEMVGRGLVAAPQYGLLKDLLNSLLR 463

Query: 1263 ADKLEMSKDVWSCIMT-KG-CDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQP 1436
              KL++SKDVWS ++T KG CD+NVYAWTIWIHAL S G+VKEAC Y L+MM+AG+MPQP
Sbjct: 464  CGKLQLSKDVWSKMVTSKGCCDVNVYAWTIWIHALLSKGYVKEACFYGLEMMEAGLMPQP 523

Query: 1437 DTFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            DTFAKL+RGLRKLYNR+ AAEITEKV++MAAER +TFKMYKRRGERDLK+
Sbjct: 524  DTFAKLIRGLRKLYNREIAAEITEKVKRMAAERHITFKMYKRRGERDLKD 573


>ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum]
            gi|557105226|gb|ESQ45560.1| hypothetical protein
            EUTSA_v10010190mg [Eutrema salsugineum]
          Length = 645

 Score =  769 bits (1986), Expect = 0.0
 Identities = 365/529 (68%), Positives = 439/529 (82%), Gaps = 3/529 (0%)
 Frame = +3

Query: 9    HSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYRAM 188
            HSRVPKLEL L ESG  +  GL  RVL+RCGDAGNLGYRFF+WA+KQPGY HS+ V ++M
Sbjct: 84   HSRVPKLELVLHESGINLRPGLIVRVLSRCGDAGNLGYRFFLWAAKQPGYCHSYEVCKSM 143

Query: 189  IKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPKYG 368
            +KIL KMRQFGA+WALIEEMRKENP L+  E+FVVLMRRFASA MVKKA+EVLDEMPKYG
Sbjct: 144  VKILSKMRQFGAVWALIEEMRKENPQLIEPELFVVLMRRFASANMVKKAVEVLDEMPKYG 203

Query: 369  CEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEAKF 548
             EPDE++FGCLLDALCKNGSVK+A+ LFEDM  +F P +++FTSLLYGWC+EGKL+EAK 
Sbjct: 204  IEPDEYIFGCLLDALCKNGSVKDASKLFEDMRDKFPPNLRYFTSLLYGWCREGKLIEAKH 263

Query: 549  VLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQALC 728
            VLV+M+EAG EPDIVV+ NLL+GY+ AGKMADA+DL+++M+R+G EPNA  +T+++QALC
Sbjct: 264  VLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDMRRRGYEPNANCYTVLIQALC 323

Query: 729  AQNK-MEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPNH 905
               K M+EAMRVF EMER GCEAD+VTYT LISGFCKWG ID+GY +L+ M +KG  P  
Sbjct: 324  KMEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPLQ 383

Query: 906  TSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWRQ 1085
             +Y+ I++AH               M++ G +PDL I+N VIRLACKLGE+KEA+R W +
Sbjct: 384  VTYMQIMVAHEKKEQFEECLDLIEKMKQNGCLPDLLIYNVVIRLACKLGEVKEAVRLWNE 443

Query: 1086 IEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLRA 1265
            +E NG+SPGVDTFVI+ING   QGCL+EACD+FKEMV RG+ S+P YGTLK LLN+L+R 
Sbjct: 444  MEANGLSPGVDTFVIMINGFASQGCLIEACDHFKEMVSRGIFSAPHYGTLKILLNTLVRD 503

Query: 1266 DKLEMSKDVWSCIMTK--GCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
            DKLEM+KDVWSC+  K   C+LNV AWTIWIHALF+ GHVKEACSYCLDMM+  +MPQPD
Sbjct: 504  DKLEMAKDVWSCLSNKSSSCELNVSAWTIWIHALFARGHVKEACSYCLDMMEMDLMPQPD 563

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            T+AKLM+GL KLYNR  AAEITEKVRKMA+ER+M+FKMYKRRGE DL E
Sbjct: 564  TYAKLMKGLNKLYNRTIAAEITEKVRKMASEREMSFKMYKRRGEEDLIE 612


>ref|XP_002866691.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297312526|gb|EFH42950.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 638

 Score =  757 bits (1954), Expect = 0.0
 Identities = 363/529 (68%), Positives = 441/529 (83%), Gaps = 1/529 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL ESG  +  GL ERVLNRCGDAGNLGYRFFVWA+KQP Y HS  VY+
Sbjct: 93   KFHSRVPKLELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYK 152

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            +M+KIL KMRQFGA+W LIEEMRKENP L+  E+FVVL++RFASA MVKKAIEVLDEMP 
Sbjct: 153  SMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPT 212

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            +G EPDE+VFGCLLDALCK+GSVK+AA LFEDM +RF   +++FTSLLYGWC+E K+MEA
Sbjct: 213  FGLEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRLRFPVNLRYFTSLLYGWCREEKMMEA 272

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K+VLV+M+EAGFEPDIV Y NLL+GY+ AGKMADA+DLL++M+R+G EPNAT +T+++QA
Sbjct: 273  KYVLVQMKEAGFEPDIVDYTNLLSGYANAGKMADAYDLLKDMRRRGFEPNATCYTVLIQA 332

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC  ++MEEAM+VF EMER  CEADVVTYT L+SGFCKWGKID+ Y +L+ MI+KG  P+
Sbjct: 333  LCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYLVLDDMIKKGLMPS 392

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +Y++I+ AH               M++I   PD+ I+N VIRLACKLGE+KEA+R W 
Sbjct: 393  QLTYMHIMAAHEKKEKLIECLELMEKMKQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWN 452

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E NG+SPG DTFVI+INGL  QGCL+EACD+FKEMV RGL S PQYGTLK LLN+LL+
Sbjct: 453  EMEGNGLSPGADTFVIIINGLTSQGCLLEACDHFKEMVARGLFSVPQYGTLKLLLNTLLK 512

Query: 1263 ADKLEMSKDVWSCIMTKG-CDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
              KLEM+KDVWSCI +KG C+L+V +WTIWIHALFS G+ KEACSYCL+M++   MPQPD
Sbjct: 513  DKKLEMAKDVWSCITSKGSCELSVSSWTIWIHALFSKGYEKEACSYCLEMIELEFMPQPD 572

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            TFAKLM+GL+KLY+R+FA EITEKVR MAAE++M+FKMYKRRG +DL E
Sbjct: 573  TFAKLMKGLKKLYHREFAVEITEKVRNMAAEKEMSFKMYKRRGVQDLTE 621


>ref|NP_201383.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75170571|sp|Q9FH87.1|PP447_ARATH RecName:
            Full=Putative pentatricopeptide repeat-containing protein
            At5g65820 gi|9758569|dbj|BAB09050.1| unnamed protein
            product [Arabidopsis thaliana]
            gi|332010728|gb|AED98111.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 637

 Score =  756 bits (1952), Expect = 0.0
 Identities = 364/529 (68%), Positives = 440/529 (83%), Gaps = 1/529 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL ESG  +  GL ERVLNRCGDAGNLGYRFFVWA+KQP Y HS  VY+
Sbjct: 92   KFHSRVPKLELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPRYCHSIEVYK 151

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            +M+KIL KMRQFGA+W LIEEMRKENP L+  E+FVVL++RFASA MVKKAIEVLDEMPK
Sbjct: 152  SMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPK 211

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            +G EPDE+VFGCLLDALCK+GSVK+AA LFEDM MRF   +++FTSLLYGWC+ GK+MEA
Sbjct: 212  FGFEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLRYFTSLLYGWCRVGKMMEA 271

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            K+VLV+M EAGFEPDIV Y NLL+GY+ AGKMADA+DLL++M+R+G EPNA  +T+++QA
Sbjct: 272  KYVLVQMNEAGFEPDIVDYTNLLSGYANAGKMADAYDLLRDMRRRGFEPNANCYTVLIQA 331

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC  ++MEEAM+VF EMER  CEADVVTYT L+SGFCKWGKID+ Y +L+ MI+KG  P+
Sbjct: 332  LCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGLMPS 391

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +Y++I++AH               M++I   PD+ I+N VIRLACKLGE+KEA+R W 
Sbjct: 392  ELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWN 451

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E NG+SPGVDTFVI+INGL  QGCL+EA D+FKEMV RGL S  QYGTLK LLN++L+
Sbjct: 452  EMEENGLSPGVDTFVIMINGLASQGCLLEASDHFKEMVTRGLFSVSQYGTLKLLLNTVLK 511

Query: 1263 ADKLEMSKDVWSCIMTKG-CDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
              KLEM+KDVWSCI +KG C+LNV +WTIWIHALFS G+ KEACSYC++M++   MPQPD
Sbjct: 512  DKKLEMAKDVWSCITSKGACELNVLSWTIWIHALFSKGYEKEACSYCIEMIEMDFMPQPD 571

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            TFAKLM+GL+KLYNR+FA EITEKVR MAAER+M+FKMYKRRG +DL E
Sbjct: 572  TFAKLMKGLKKLYNREFAGEITEKVRNMAAEREMSFKMYKRRGVQDLTE 620


>ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutrema salsugineum]
            gi|557090621|gb|ESQ31268.1| hypothetical protein
            EUTSA_v10003830mg [Eutrema salsugineum]
          Length = 620

 Score =  755 bits (1950), Expect = 0.0
 Identities = 358/527 (67%), Positives = 439/527 (83%), Gaps = 1/527 (0%)
 Frame = +3

Query: 3    KIHSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYR 182
            K HSRVPKLELAL ESG  +  GL ERVLNRCGDAGNLGYRFFVWA+KQPGY HS+ VY+
Sbjct: 85   KFHSRVPKLELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVWAAKQPGYCHSYQVYK 144

Query: 183  AMIKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPK 362
            +M+KIL KMR F A+WALIEEMRKENP L+  E+FVVL+RRFAS+ MVKKAIEVLDEMPK
Sbjct: 145  SMVKILSKMRHFEAVWALIEEMRKENPQLIEPELFVVLVRRFASSNMVKKAIEVLDEMPK 204

Query: 363  YGCEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEA 542
            +G EPDE+VFGCLLDALCKNGSVK+AA LFE+M +RF P +++FTSLLYGWC+EGK+MEA
Sbjct: 205  FGLEPDEYVFGCLLDALCKNGSVKDAAKLFEEMRLRFPPNLRYFTSLLYGWCREGKMMEA 264

Query: 543  KFVLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQA 722
            + VLV+M+EA FEPD+VVY NLL+GY+ AGKMA+A+DLL++M+R+G EPNA  +T+++QA
Sbjct: 265  EHVLVEMKEARFEPDVVVYTNLLSGYAHAGKMAEAYDLLKDMRRRGFEPNANCYTVLIQA 324

Query: 723  LCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPN 902
            LC  ++MEEAMRVF EMER  CEAD+VTY  L+SGFCKWGKID+ Y +L+ MI+K   P+
Sbjct: 325  LCKVDRMEEAMRVFVEMERYECEADIVTYNALVSGFCKWGKIDKCYSVLDDMIKKCLMPS 384

Query: 903  HTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWR 1082
              +Y++I+ AH               M++IG   DL ++N VIRLACKLGE+KEA+R W 
Sbjct: 385  QLTYMHIMAAHEKKEKFEECLELMEKMKEIGYHLDLGVYNVVIRLACKLGEVKEAVRLWN 444

Query: 1083 QIEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLR 1262
            ++E +G+SPGVDTFVI+I+GL  QGCL+EACD+FK MV RGL S PQYGTLK LLN+LLR
Sbjct: 445  EMEASGLSPGVDTFVIMIDGLTNQGCLLEACDHFKVMVSRGLFSVPQYGTLKSLLNALLR 504

Query: 1263 ADKLEMSKDVWSCIMTKG-CDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
              KLE +KD+WSCIM++G C+LNV +WTIWIHALFS G+VK+ACSYCL+MM+   M QPD
Sbjct: 505  DGKLETAKDIWSCIMSEGSCELNVSSWTIWIHALFSKGYVKDACSYCLEMMEMDFMLQPD 564

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDL 1580
            TFAKLM+GL+KLYNR+FA EITEKVR MAAER+++FKMYKRRG  DL
Sbjct: 565  TFAKLMKGLKKLYNREFAVEITEKVRNMAAERELSFKMYKRRGVEDL 611


>ref|NP_190542.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546755|sp|P0C8A0.1|PP275_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g49730 gi|332645062|gb|AEE78583.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 638

 Score =  752 bits (1941), Expect = 0.0
 Identities = 357/529 (67%), Positives = 435/529 (82%), Gaps = 3/529 (0%)
 Frame = +3

Query: 9    HSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYRAM 188
            HSRVPKLELAL ESG  +  GL  RVL+RCGDAGNLGYRFF+WA+KQPGY HS+ V ++M
Sbjct: 78   HSRVPKLELALNESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSM 137

Query: 189  IKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPKYG 368
            + IL KMRQFGA+W LIEEMRK NP L+  E+FVVLMRRFASA MVKKA+EVLDEMPKYG
Sbjct: 138  VMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYG 197

Query: 369  CEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEAKF 548
             EPDE+VFGCLLDALCKNGSVKEA+ +FEDM  +F P +++FTSLLYGWC+EGKLMEAK 
Sbjct: 198  LEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKE 257

Query: 549  VLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQALC 728
            VLV+M+EAG EPDIVV+ NLL+GY+ AGKMADA+DL+ +M+++G EPN   +T+++QALC
Sbjct: 258  VLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALC 317

Query: 729  -AQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPNH 905
              + +M+EAMRVF EMER GCEAD+VTYT LISGFCKWG ID+GY +L+ M +KG  P+ 
Sbjct: 318  RTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQ 377

Query: 906  TSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWRQ 1085
             +Y+ I++AH               M++ G  PDL I+N VIRLACKLGE+KEA+R W +
Sbjct: 378  VTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNE 437

Query: 1086 IEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLRA 1265
            +E NG+SPGVDTFVI+ING   QG L+EAC++FKEMV RG+ S+PQYGTLK LLN+L+R 
Sbjct: 438  MEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRD 497

Query: 1266 DKLEMSKDVWSCI--MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
            DKLEM+KDVWSCI   T  C+LNV AWTIWIHAL++ GHVKEACSYCLDMM+  +MPQP+
Sbjct: 498  DKLEMAKDVWSCISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPN 557

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            T+AKLM+GL KLYNR  AAEITEKV KMA+ER+M+FKMYK++GE DL E
Sbjct: 558  TYAKLMKGLNKLYNRTIAAEITEKVVKMASEREMSFKMYKKKGEEDLIE 606


>emb|CAB66911.1| putative protein [Arabidopsis thaliana]
          Length = 1184

 Score =  752 bits (1941), Expect = 0.0
 Identities = 357/529 (67%), Positives = 435/529 (82%), Gaps = 3/529 (0%)
 Frame = +3

Query: 9    HSRVPKLELALQESGFTVSSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSHGVYRAM 188
            HSRVPKLELAL ESG  +  GL  RVL+RCGDAGNLGYRFF+WA+KQPGY HS+ V ++M
Sbjct: 78   HSRVPKLELALNESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSM 137

Query: 189  IKILGKMRQFGAIWALIEEMRKENPHLLSAEVFVVLMRRFASARMVKKAIEVLDEMPKYG 368
            + IL KMRQFGA+W LIEEMRK NP L+  E+FVVLMRRFASA MVKKA+EVLDEMPKYG
Sbjct: 138  VMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYG 197

Query: 369  CEPDEHVFGCLLDALCKNGSVKEAALLFEDMSMRFTPTIKHFTSLLYGWCKEGKLMEAKF 548
             EPDE+VFGCLLDALCKNGSVKEA+ +FEDM  +F P +++FTSLLYGWC+EGKLMEAK 
Sbjct: 198  LEPDEYVFGCLLDALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKE 257

Query: 549  VLVKMREAGFEPDIVVYNNLLNGYSVAGKMADAFDLLQEMKRKGCEPNATSFTIIVQALC 728
            VLV+M+EAG EPDIVV+ NLL+GY+ AGKMADA+DL+ +M+++G EPN   +T+++QALC
Sbjct: 258  VLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALC 317

Query: 729  -AQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGKIDRGYQLLNSMIQKGHTPNH 905
              + +M+EAMRVF EMER GCEAD+VTYT LISGFCKWG ID+GY +L+ M +KG  P+ 
Sbjct: 318  RTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQ 377

Query: 906  TSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIFNTVIRLACKLGEIKEAIRFWRQ 1085
             +Y+ I++AH               M++ G  PDL I+N VIRLACKLGE+KEA+R W +
Sbjct: 378  VTYMQIMVAHEKKEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNE 437

Query: 1086 IEVNGMSPGVDTFVILINGLVEQGCLVEACDYFKEMVGRGLLSSPQYGTLKDLLNSLLRA 1265
            +E NG+SPGVDTFVI+ING   QG L+EAC++FKEMV RG+ S+PQYGTLK LLN+L+R 
Sbjct: 438  MEANGLSPGVDTFVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRD 497

Query: 1266 DKLEMSKDVWSCI--MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPD 1439
            DKLEM+KDVWSCI   T  C+LNV AWTIWIHAL++ GHVKEACSYCLDMM+  +MPQP+
Sbjct: 498  DKLEMAKDVWSCISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPN 557

Query: 1440 TFAKLMRGLRKLYNRQFAAEITEKVRKMAAERQMTFKMYKRRGERDLKE 1586
            T+AKLM+GL KLYNR  AAEITEKV KMA+ER+M+FKMYK++GE DL E
Sbjct: 558  TYAKLMKGLNKLYNRTIAAEITEKVVKMASEREMSFKMYKKKGEEDLIE 606


Top