BLASTX nr result

ID: Rauwolfia21_contig00017729 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00017729
         (1927 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat...   897   0.0  
ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containi...   879   0.0  
gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]     877   0.0  
emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]   868   0.0  
ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0  
ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containi...   855   0.0  
ref|XP_002530608.1| pentatricopeptide repeat-containing protein,...   832   0.0  
ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat...   832   0.0  
ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containi...   825   0.0  
ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citr...   822   0.0  
ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat...   821   0.0  
ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat...   818   0.0  
gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protei...   803   0.0  
gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlise...   781   0.0  
ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutr...   774   0.0  
ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutr...   760   0.0  
ref|XP_002866691.1| pentatricopeptide repeat-containing protein ...   757   0.0  
ref|NP_190542.4| pentatricopeptide repeat-containing protein [Ar...   756   0.0  
emb|CAB66911.1| putative protein [Arabidopsis thaliana]               756   0.0  
ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Caps...   756   0.0  

>ref|XP_003635394.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like [Vitis vinifera]
          Length = 622

 Score =  897 bits (2318), Expect = 0.0
 Identities = 434/554 (78%), Positives = 484/554 (87%)
 Frame = +1

Query: 265  NCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQ 444
            NC   I+E R G  L+R+ +  E  +  Q  DEF+ADVEKVYRILRKFHSR+PKLELALQ
Sbjct: 23   NC--TISERRGGFGLVRLESNRENCTYDQNYDEFSADVEKVYRILRKFHSRVPKLELALQ 80

Query: 445  ESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGA 624
            ESG+ VRSGLTERVLNRCGDAGNLGYRFF+WASKQPGYRHSY+VYKAMIKILGKMRQFGA
Sbjct: 81   ESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKILGKMRQFGA 140

Query: 625  VWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLL 804
            VWALIEEMR+ENP  +SP VFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDE+VFGCLL
Sbjct: 141  VWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEHVFGCLL 200

Query: 805  DALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEP 984
            DALCKNGSVKEAA LFEDMR++F PT+KHFTSLLYGWC+EGKLMEAK+VLV++REAGFEP
Sbjct: 201  DALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYGWCREGKLMEAKYVLVQIREAGFEP 260

Query: 985  DIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVF 1164
            DIVVYNNLL GYA AGKMVDA+ LL+EM+ K CEPN  SFT ++QALCA+ KMEEAMRVF
Sbjct: 261  DIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPNVMSFTTLIQALCAKKKMEEAMRVF 320

Query: 1165 SEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXX 1344
             EM+  GC AD VTYTTLISGFCKWG+I++GYELLD+MIQ+GH PN  +YL+I+ AH   
Sbjct: 321  FEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLDNMIQQGHIPNPMTYLHIMAAHEKK 380

Query: 1345 XXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTH 1524
                        M+KIG  PDL IYN VIRLACKLGEIKE +R W ++E  G+SPG+DT 
Sbjct: 381  EELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLGEIKEGVRVWNEMEATGLSPGLDTF 440

Query: 1525 VILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI 1704
            VI+I+G + Q CLVEAC++FKEMV RGLLSAPQYGTLK+LLNSLLR++KLEMSK+VWSCI
Sbjct: 441  VIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGTLKELLNSLLRAEKLEMSKDVWSCI 500

Query: 1705 MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNR 1884
            MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNR
Sbjct: 501  MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNR 560

Query: 1885 QFAAEITEKVRKMA 1926
            Q AAEITEKVRKMA
Sbjct: 561  QIAAEITEKVRKMA 574


>ref|XP_006348483.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Solanum tuberosum]
          Length = 625

 Score =  879 bits (2270), Expect = 0.0
 Identities = 421/527 (79%), Positives = 468/527 (88%)
 Frame = +1

Query: 346  SQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYR 525
            ++  DEF+ADVEKVYRILRKFHSR+PKLELAL ESG+V RSGLTERVLNRCGDAGNLGYR
Sbjct: 51   NKNHDEFSADVEKVYRILRKFHSRVPKLELALLESGVVARSGLTERVLNRCGDAGNLGYR 110

Query: 526  FFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRR 705
            FF+W SKQPGYRHS+D YKAMIKILGKMRQFG VWAL+EEMR ENP  L+PEVF+VLMRR
Sbjct: 111  FFVWVSKQPGYRHSHDAYKAMIKILGKMRQFGTVWALVEEMRIENPQFLTPEVFIVLMRR 170

Query: 706  FASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTI 885
            FAS RMVKKAIEVLDEMPKYG EPDEYVFGCLLDALCKNGSVKEAA LF++MR +F PTI
Sbjct: 171  FASGRMVKKAIEVLDEMPKYGVEPDEYVFGCLLDALCKNGSVKEAAALFDEMRFRFSPTI 230

Query: 886  KHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQE 1065
            KHFTSLLYGWCKEGKL+EAK VLVKMREAGFEPDIVVYNNLLNGYAV+ KM DAF LLQE
Sbjct: 231  KHFTSLLYGWCKEGKLIEAKVVLVKMREAGFEPDIVVYNNLLNGYAVSRKMADAFDLLQE 290

Query: 1066 MKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGE 1245
            M+ KGC PN TSFTIV+QALC Q+KMEEAMRVF +MERSGCE DVVTYTTLISGFCKWG+
Sbjct: 291  MRRKGCNPNETSFTIVIQALCLQDKMEEAMRVFLDMERSGCEGDVVTYTTLISGFCKWGK 350

Query: 1246 INRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNT 1425
            I +GYEL+D+M+QKG+ PN+T+YL+I+LAH               M KIG+ PD +IYN 
Sbjct: 351  IEKGYELVDTMLQKGYNPNQTTYLHIMLAHEKKEELEECLELVKEMGKIGIPPDHSIYNI 410

Query: 1426 VIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRG 1605
            VIRLACKLGEI E +R W QIE NG+SPGVDT +I+ING VEQG L+EACD+FKEM+ RG
Sbjct: 411  VIRLACKLGEIDEGVRVWNQIEANGISPGVDTFIIMINGFVEQGRLIEACDHFKEMIGRG 470

Query: 1606 LLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKE 1785
            LLSAPQYGTLKDLLNSLLR++KLE+ K+VWSCIMTKGC+LNV AWTIWIHALFSNGHVKE
Sbjct: 471  LLSAPQYGTLKDLLNSLLRAEKLELCKDVWSCIMTKGCELNVSAWTIWIHALFSNGHVKE 530

Query: 1786 ACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
            AC+YCLDMMDAG+MPQPDTFAKLM+GLRKLYNR+ AAEITEK RKMA
Sbjct: 531  ACAYCLDMMDAGLMPQPDTFAKLMKGLRKLYNREIAAEITEKARKMA 577


>gb|EXC12605.1| hypothetical protein L484_012982 [Morus notabilis]
          Length = 638

 Score =  877 bits (2267), Expect = 0.0
 Identities = 416/570 (72%), Positives = 484/570 (84%)
 Frame = +1

Query: 217  LSSYSHLGLHQNPLNMNCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRI 396
            LS  +     QNP N            G   + +   P  S D +T DEF+ DVEK+YRI
Sbjct: 30   LSPQTQFSSTQNPHNR---------ATGFSPVHLEQNPVVSDDDETHDEFSGDVEKIYRI 80

Query: 397  LRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDV 576
            LRKFHSR+ KLELALQESG+V+RSGLTERVL RCGDAG+LGYRFF+WASKQPGYR SY+V
Sbjct: 81   LRKFHSRVSKLELALQESGVVLRSGLTERVLGRCGDAGSLGYRFFVWASKQPGYRPSYEV 140

Query: 577  YKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEM 756
            YKAMI+ LGKMRQFGAVWAL+EEMRKENP L++PE+FVVLMRRFASARMVKKA+EV DEM
Sbjct: 141  YKAMIRALGKMRQFGAVWALLEEMRKENPQLITPEIFVVLMRRFASARMVKKAVEVFDEM 200

Query: 757  PKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLM 936
            PKYGCEPDE+VFGCLLDALCKNGSVKEAA LFE+MR+KF P++KHFTSLLYGWC+EGKLM
Sbjct: 201  PKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEEMRVKFTPSLKHFTSLLYGWCREGKLM 260

Query: 937  EAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVV 1116
            EAKFVLV+M+EAGFEPD+VVYNNLL GYA AGKM DA+ L++EM+ KGC PNA S+T+++
Sbjct: 261  EAKFVLVQMKEAGFEPDVVVYNNLLGGYAQAGKMADAYDLMKEMRGKGCSPNAASYTVLI 320

Query: 1117 QALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHT 1296
            QALC + KMEEAMRVF EM+RSGC+ADV+TYTTLISGFCKWG+I RGYE+LDSMIQ+G +
Sbjct: 321  QALCKREKMEEAMRVFVEMQRSGCDADVMTYTTLISGFCKWGKIERGYEILDSMIQRGFS 380

Query: 1297 PNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRF 1476
            PN T+YL+I+LAH               M+KIG VPDL IYNTVIRLACKL E+KE +R 
Sbjct: 381  PNETTYLHIMLAHEKKEEFEECVELIGEMRKIGCVPDLKIYNTVIRLACKLREVKEGVRL 440

Query: 1477 WTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSL 1656
            W +IE +G+SPG+DT V++I+G + QGCL+EAC YFKEMV RGLLS PQYGTLK+LLN+L
Sbjct: 441  WNEIEASGLSPGLDTFVVMIHGFLGQGCLIEACQYFKEMVERGLLSGPQYGTLKELLNAL 500

Query: 1657 LRSDKLEMSKEVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQP 1836
            LR+DKLEM+K+VW+CI+ KGC++NVYAWTIWIHALF NGHVKEACSYCLDMMDA VMPQP
Sbjct: 501  LRADKLEMAKDVWTCIVNKGCEINVYAWTIWIHALFKNGHVKEACSYCLDMMDADVMPQP 560

Query: 1837 DTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
            DTFAKLMRGL+KLYNRQ AAEITEKVRKMA
Sbjct: 561  DTFAKLMRGLKKLYNRQIAAEITEKVRKMA 590


>emb|CAN71515.1| hypothetical protein VITISV_021787 [Vitis vinifera]
          Length = 655

 Score =  868 bits (2242), Expect = 0.0
 Identities = 418/518 (80%), Positives = 463/518 (89%), Gaps = 1/518 (0%)
 Frame = +1

Query: 376  VEK-VYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQP 552
            +EK VYRILRKFHSR+PKLELALQESG+ VRSGLTERVLNRCGDAGNLGYRFF+WASKQP
Sbjct: 90   IEKTVYRILRKFHSRVPKLELALQESGVAVRSGLTERVLNRCGDAGNLGYRFFVWASKQP 149

Query: 553  GYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKK 732
            GYRHSY+VYKAMIKILGKMRQFGAVWALIEEMR+ENP  +SP VFVVLMRRFASARMVKK
Sbjct: 150  GYRHSYEVYKAMIKILGKMRQFGAVWALIEEMRRENPQFVSPYVFVVLMRRFASARMVKK 209

Query: 733  AIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYG 912
            AIEVLDEMPKYGCEPDE+VFGCLLDALCKNGSVKEAA LFEDMR++F PT+KHFTSLLYG
Sbjct: 210  AIEVLDEMPKYGCEPDEHVFGCLLDALCKNGSVKEAASLFEDMRIRFTPTLKHFTSLLYG 269

Query: 913  WCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPN 1092
            WC+EGKLMEAK+VLV++REAGFEPDIVVYNNLL GYA AGKMVDA+ LL+EM+ K CEPN
Sbjct: 270  WCREGKLMEAKYVLVQIREAGFEPDIVVYNNLLTGYAAAGKMVDAYDLLKEMRRKECEPN 329

Query: 1093 ATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLD 1272
              SFT ++QALCA+ KMEEAMRVF EM+  GC AD VTYTTLISGFCKWG+I++GYELLD
Sbjct: 330  VMSFTTLIQALCAKKKMEEAMRVFFEMQSCGCPADAVTYTTLISGFCKWGKISKGYELLD 389

Query: 1273 SMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLG 1452
            +MIQ+GH PN  +YL+I+ AH               M+KIG  PDL IYN VIRLACKLG
Sbjct: 390  NMIQQGHIPNPMTYLHIMAAHEKKEELEECIELMEEMRKIGCTPDLNIYNIVIRLACKLG 449

Query: 1453 EIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGT 1632
            EIKE +R W ++E  G+SPG+DT VI+I+G + Q CLVEAC++FKEMV RGLLSAPQYGT
Sbjct: 450  EIKEGVRVWNEMEATGLSPGLDTFVIMIHGFLSQRCLVEACEFFKEMVGRGLLSAPQYGT 509

Query: 1633 LKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMM 1812
            LK+LLNSLLR++KLEMSK+VWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMM
Sbjct: 510  LKELLNSLLRAEKLEMSKDVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMM 569

Query: 1813 DAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
            DAGVMPQPDTFAKLMRGLRKLYNRQ AAEITEKVRKMA
Sbjct: 570  DAGVMPQPDTFAKLMRGLRKLYNRQIAAEITEKVRKMA 607


>ref|XP_004159605.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Cucumis sativus]
          Length = 664

 Score =  855 bits (2208), Expect = 0.0
 Identities = 400/545 (73%), Positives = 471/545 (86%)
 Frame = +1

Query: 292  RRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSG 471
            R G   I ++T P  S+  +  DEF+ DVEKVYRILRKFH+R+PKLELALQESG+++RSG
Sbjct: 72   RGGFGPIHLKTTPHESAHDRDADEFSVDVEKVYRILRKFHTRVPKLELALQESGVIMRSG 131

Query: 472  LTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMR 651
            L ERVL+RCGDAGNLGYRFF+WASKQPGYRHSY+VYKAMIK LGKMRQFGAVWALIEEMR
Sbjct: 132  LPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMR 191

Query: 652  KENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 831
            KENP++L+PEVF+VLMRRFAS RMVKKA+EVLDEMPKYGCEPDEYVFGCLLDALCKNGSV
Sbjct: 192  KENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 251

Query: 832  KEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLL 1011
            KEAA LFEDMR++F P ++HFTSLLYGWC+EGK+MEAK VLV+++EAGFEPDIVVYNNLL
Sbjct: 252  KEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLL 311

Query: 1012 NGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCE 1191
             GYA AGKM DAF LL EMK   C PNA SFTI++Q+ C   KM+EAMR+F+EM+ SGCE
Sbjct: 312  GGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCE 371

Query: 1192 ADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXX 1371
            ADVVTYTTLISGFCKWG  ++ YE+LD MIQKGH P++ SYL I++AH            
Sbjct: 372  ADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMEL 431

Query: 1372 XXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVE 1551
               M+KIG VPDL IYNT+IRL CKLG++KEA+R W +++  G++PG+DT++++++G + 
Sbjct: 432  IEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWGEMQAGGLNPGLDTYILMVHGFLS 491

Query: 1552 QGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNV 1731
            QGCLVEACDYFKEMV RGLLSAPQYGTLK+L N+LLR++KLEM+K +WSC+ TKGC+LNV
Sbjct: 492  QGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNV 551

Query: 1732 YAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEK 1911
             AWTIWIHALFSNGHVKEACSYCLDMMDA +MPQPDTFAKLMRGL+KL++RQ A EITEK
Sbjct: 552  SAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEK 611

Query: 1912 VRKMA 1926
            VRKMA
Sbjct: 612  VRKMA 616


>ref|XP_004149630.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            [Cucumis sativus]
          Length = 641

 Score =  855 bits (2208), Expect = 0.0
 Identities = 400/545 (73%), Positives = 471/545 (86%)
 Frame = +1

Query: 292  RRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSG 471
            R G   I ++T P  S+  +  DEF+ DVEKVYRILRKFH+R+PKLELALQESG+++RSG
Sbjct: 49   RGGFGPIHLKTTPHESAHDRDADEFSVDVEKVYRILRKFHTRVPKLELALQESGVIMRSG 108

Query: 472  LTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMR 651
            L ERVL+RCGDAGNLGYRFF+WASKQPGYRHSY+VYKAMIK LGKMRQFGAVWALIEEMR
Sbjct: 109  LPERVLSRCGDAGNLGYRFFVWASKQPGYRHSYEVYKAMIKTLGKMRQFGAVWALIEEMR 168

Query: 652  KENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 831
            KENP++L+PEVF+VLMRRFAS RMVKKA+EVLDEMPKYGCEPDEYVFGCLLDALCKNGSV
Sbjct: 169  KENPYMLTPEVFIVLMRRFASVRMVKKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSV 228

Query: 832  KEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLL 1011
            KEAA LFEDMR++F P ++HFTSLLYGWC+EGK+MEAK VLV+++EAGFEPDIVVYNNLL
Sbjct: 229  KEAASLFEDMRVRFNPNLRHFTSLLYGWCREGKIMEAKHVLVQIKEAGFEPDIVVYNNLL 288

Query: 1012 NGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCE 1191
             GYA AGKM DAF LL EMK   C PNA SFTI++Q+ C   KM+EAMR+F+EM+ SGCE
Sbjct: 289  GGYAQAGKMRDAFDLLAEMKKVNCGPNAASFTILIQSFCKTEKMDEAMRIFTEMQGSGCE 348

Query: 1192 ADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXX 1371
            ADVVTYTTLISGFCKWG  ++ YE+LD MIQKGH P++ SYL I++AH            
Sbjct: 349  ADVVTYTTLISGFCKWGNTDKAYEILDDMIQKGHDPSQLSYLCIMMAHEKKEELEECMEL 408

Query: 1372 XXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVE 1551
               M+KIG VPDL IYNT+IRL CKLG++KEA+R W +++  G++PG+DT++++++G + 
Sbjct: 409  IEEMRKIGCVPDLNIYNTMIRLVCKLGDLKEAVRLWGEMQAGGLNPGLDTYILMVHGFLS 468

Query: 1552 QGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDLNV 1731
            QGCLVEACDYFKEMV RGLLSAPQYGTLK+L N+LLR++KLEM+K +WSC+ TKGC+LNV
Sbjct: 469  QGCLVEACDYFKEMVERGLLSAPQYGTLKELTNALLRAEKLEMAKNMWSCMTTKGCELNV 528

Query: 1732 YAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEK 1911
             AWTIWIHALFSNGHVKEACSYCLDMMDA +MPQPDTFAKLMRGL+KL++RQ A EITEK
Sbjct: 529  SAWTIWIHALFSNGHVKEACSYCLDMMDADLMPQPDTFAKLMRGLKKLFHRQLAVEITEK 588

Query: 1912 VRKMA 1926
            VRKMA
Sbjct: 589  VRKMA 593


>ref|XP_002530608.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529856|gb|EEF31788.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 596

 Score =  832 bits (2150), Expect = 0.0
 Identities = 405/617 (65%), Positives = 496/617 (80%), Gaps = 4/617 (0%)
 Frame = +1

Query: 79   MQTLSSKKSLVLCGKYAPLFSSAKRNTPRKEILHLVLYNESSNNRCLSSYSHLGLHQNPL 258
            MQ LSSK ++ L  K+   F+          ++H+ LY +              + +NPL
Sbjct: 1    MQRLSSK-TISLLNKHCCRFN----------LIHVQLYQKGQEP----------IDRNPL 39

Query: 259  NMNCFERIAEARRGLDLIRIRTEPEPSSD----SQTQDEFTADVEKVYRILRKFHSRIPK 426
            + N        R G  ++ ++T+   +SD    S   DEF  DVEKVYRILR FHSR+PK
Sbjct: 40   SNNL-------RNGFGVVCLKTQENNTSDRDNSSSKVDEFAKDVEKVYRILRNFHSRVPK 92

Query: 427  LELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGK 606
            LELALQESG+ +R+GLTERVLNRCGDAGNLGYRFF+WASKQPGYRHSY+ YKAM+KI  K
Sbjct: 93   LELALQESGVTMRAGLTERVLNRCGDAGNLGYRFFVWASKQPGYRHSYENYKAMVKIFSK 152

Query: 607  MRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEY 786
            MRQFGAVWAL+EEMRK+N  L++ E+F+VL+RRFASAR+V+KAIEVLDEMPKYGCEPDEY
Sbjct: 153  MRQFGAVWALLEEMRKDNSVLITSELFIVLIRRFASARLVEKAIEVLDEMPKYGCEPDEY 212

Query: 787  VFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMR 966
            VFGCLLDALCKNGSVK+AA LFEDMR++F P+++HFTSLLYGWC+EGKL+EAK VLV+MR
Sbjct: 213  VFGCLLDALCKNGSVKQAASLFEDMRVRFSPSLRHFTSLLYGWCREGKLIEAKHVLVQMR 272

Query: 967  EAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKME 1146
            EAGFEPDIVV+NNLL+ Y++AGKM DAF LL+EM  KGCEPNA S+TI++QA C+Q KM+
Sbjct: 273  EAGFEPDIVVFNNLLSAYSMAGKMTDAFDLLKEMVRKGCEPNANSYTIMIQAFCSQEKMD 332

Query: 1147 EAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYIL 1326
            EAMRVF EMER+GCEADVVTYT LISGFCKWG+INRGY++LD+M QKGH PN+ +YL IL
Sbjct: 333  EAMRVFVEMERTGCEADVVTYTALISGFCKWGKINRGYQILDAMKQKGHMPNQLTYLRIL 392

Query: 1327 LAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMS 1506
            LAH               M+ +G VPDL+IYN VIRLACKLGE+K+ ++ W ++E +  S
Sbjct: 393  LAHEKKEELEECLELIESMRMVGCVPDLSIYNVVIRLACKLGEVKQGVQIWNEMEASDFS 452

Query: 1507 PGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSK 1686
            P +DT VI+I+G + QGCLVEAC+YFKEM+ RGLL+ PQYG LK+LLN+LLR +KL M+K
Sbjct: 453  PELDTFVIMIHGFLGQGCLVEACEYFKEMIGRGLLTTPQYGILKELLNALLRGEKLGMAK 512

Query: 1687 EVWSCIMTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGL 1866
            +VWSCI+TKGC+LN  AWTIWIH+LFSNGHVKEACSYCLDMM+A +MP+P+TFAKLMRGL
Sbjct: 513  DVWSCIVTKGCELNADAWTIWIHSLFSNGHVKEACSYCLDMMEADIMPKPETFAKLMRGL 572

Query: 1867 RKLYNRQFAAEITEKVR 1917
            RKLYNR+FAAEITEK++
Sbjct: 573  RKLYNREFAAEITEKIK 589


>ref|XP_004513407.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like isoform X1 [Cicer arietinum]
            gi|502165084|ref|XP_004513408.1| PREDICTED: putative
            pentatricopeptide repeat-containing protein
            At5g65820-like isoform X2 [Cicer arietinum]
          Length = 655

 Score =  832 bits (2148), Expect = 0.0
 Identities = 390/541 (72%), Positives = 465/541 (85%), Gaps = 1/541 (0%)
 Frame = +1

Query: 307  LIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERV 486
            LI +++     +D  + DEFT+DVEKVYRILRK+HSR+PKLELAL+ESG+VV SGLTERV
Sbjct: 69   LIHLQSNANHFNDQNSDDEFTSDVEKVYRILRKYHSRVPKLELALKESGVVVSSGLTERV 128

Query: 487  LNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPH 666
            LNRCG++GNL YRFF WASKQ GYRHS +VYKAMIK+L KMRQFGAVWALI+EMR ENP 
Sbjct: 129  LNRCGNSGNLAYRFFSWASKQSGYRHSEEVYKAMIKVLSKMRQFGAVWALIDEMRLENPQ 188

Query: 667  LLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAAL 846
            L+SP VFV+LMRRFASARMV KAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGS+KEAA 
Sbjct: 189  LISPHVFVILMRRFASARMVHKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSIKEAAS 248

Query: 847  LFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAV 1026
            LFEDMR +F PT+KHFTSLLYGWCKEGKL+EAK VLV+M++AG EPDIVV+NNLL GYA 
Sbjct: 249  LFEDMRYRFPPTVKHFTSLLYGWCKEGKLVEAKHVLVQMKDAGIEPDIVVFNNLLGGYAQ 308

Query: 1027 AGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVT 1206
             GKM DA+ LL+EMK KGCEPNA S+TI++Q+LC   K+EEAMR+F EM+R+ C+ DV+T
Sbjct: 309  GGKMADAYDLLKEMKRKGCEPNAASYTILIQSLCKHEKLEEAMRIFVEMQRNDCQMDVIT 368

Query: 1207 YTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQ 1386
            YTTLISGFCKWG+I RGYELLD MIQ+GH+PN+ +YL+I+LAH               M+
Sbjct: 369  YTTLISGFCKWGKIKRGYELLDQMIQEGHSPNQLTYLHIMLAHEKKEELEECMELVNEMK 428

Query: 1387 KIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLV 1566
            KIG VP+L IYNTVIRLACK GE+K+ +R W ++E +G+SPG DT V++ING +EQ CL+
Sbjct: 429  KIGCVPNLNIYNTVIRLACKFGEVKQGVRLWNEMEASGLSPGTDTFVVMINGFLEQDCLI 488

Query: 1567 EACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI-MTKGCDLNVYAWT 1743
            EAC+YFKEMV RGL +APQYGTLK+L+NSLLR++KLEM+K+ W+CI  +K C++NV AWT
Sbjct: 489  EACEYFKEMVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDTWNCITASKSCEMNVAAWT 548

Query: 1744 IWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKM 1923
            IWIHALFS GHVKEACS+C+DMMD  +MPQPDTFAKL+RGL+KLYNR+FAAEITEKVRKM
Sbjct: 549  IWIHALFSKGHVKEACSFCIDMMDNDLMPQPDTFAKLIRGLKKLYNREFAAEITEKVRKM 608

Query: 1924 A 1926
            A
Sbjct: 609  A 609


>ref|XP_003546958.1| PREDICTED: pentatricopeptide repeat-containing protein At3g49730-like
            isoform X1 [Glycine max] gi|571514894|ref|XP_006597171.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At3g49730-like isoform X2 [Glycine max]
            gi|571514897|ref|XP_006597172.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At3g49730-like isoform X3 [Glycine max]
          Length = 654

 Score =  825 bits (2131), Expect(2) = 0.0
 Identities = 392/541 (72%), Positives = 463/541 (85%), Gaps = 1/541 (0%)
 Frame = +1

Query: 307  LIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERV 486
            LIR++      +D  T DEF +DVEKVYRILRK+HSR+PKLELAL+ESG+VVR GLTERV
Sbjct: 69   LIRLQEISINHTDDHTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERV 128

Query: 487  LNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPH 666
            L+RCGDAGNL YRF+ WASKQ G+R  +D YKAMIK+L +MRQFGAVWALIEEMR+ENPH
Sbjct: 129  LSRCGDAGNLAYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPH 188

Query: 667  LLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAAL 846
            L++P+VFV+LMRRFASARMV KA+EVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAA 
Sbjct: 189  LITPQVFVILMRRFASARMVHKAVEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAAS 248

Query: 847  LFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAV 1026
            LFEDMR ++ P++KHFTSLLYGWCKEGKLMEAK VLV+M++ G EPDIVVYNNLL GYA 
Sbjct: 249  LFEDMRYRWKPSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDMGIEPDIVVYNNLLGGYAQ 308

Query: 1027 AGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVT 1206
            AGKM DA+ LL+EM+ K CEPNATS+T+++Q+LC   ++EEA R+F EM+ +GC+ADVVT
Sbjct: 309  AGKMGDAYDLLKEMRRKRCEPNATSYTVLIQSLCKHERLEEATRLFVEMQTNGCQADVVT 368

Query: 1207 YTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQ 1386
            Y+TLISGFCKWG+I RGYELLD MIQ+GH PN+  Y +I+LAH               MQ
Sbjct: 369  YSTLISGFCKWGKIKRGYELLDEMIQQGHFPNQVIYQHIMLAHEKKEELEECKELVNEMQ 428

Query: 1387 KIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLV 1566
            KIG  PDL+IYNTVIRLACKLGE+KE I+ W ++E +G+SPG+DT VI+ING +EQGCLV
Sbjct: 429  KIGCAPDLSIYNTVIRLACKLGEVKEGIQLWNEMESSGLSPGMDTFVIMINGFLEQGCLV 488

Query: 1567 EACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI-MTKGCDLNVYAWT 1743
            EAC+YFKEMV RGL +APQYGTLK+L+NSLLR++KLEM+K+ W+CI  +KGC LNV AWT
Sbjct: 489  EACEYFKEMVGRGLFTAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWT 548

Query: 1744 IWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKM 1923
            IWIHALFS GHVKEACS+C+DMMD  +MP PDTFAKLM GL+KLYNRQFAAEITEKVRKM
Sbjct: 549  IWIHALFSKGHVKEACSFCIDMMDKDLMPNPDTFAKLMHGLKKLYNRQFAAEITEKVRKM 608

Query: 1924 A 1926
            A
Sbjct: 609  A 609



 Score = 25.0 bits (53), Expect(2) = 0.0
 Identities = 12/37 (32%), Positives = 21/37 (56%)
 Frame = +2

Query: 116 AVSTLLYFHLPNETHRGRKYCTSSSTMNHQTIVVFPP 226
           A+S+LL   + +ET     +CT+S   + +T  + PP
Sbjct: 13  AISSLLSLVIRHETTVCHFFCTTSEVSSSRTSSLLPP 49


>ref|XP_006426145.1| hypothetical protein CICLE_v10025134mg [Citrus clementina]
            gi|557528135|gb|ESR39385.1| hypothetical protein
            CICLE_v10025134mg [Citrus clementina]
          Length = 638

 Score =  822 bits (2122), Expect = 0.0
 Identities = 391/547 (71%), Positives = 469/547 (85%), Gaps = 6/547 (1%)
 Frame = +1

Query: 304  DLIRIRTEPEPSSDSQTQD------EFTADVEKVYRILRKFHSRIPKLELALQESGIVVR 465
            +L+ ++T+ +    + T D      EF+ DVEK++RIL+KFHSR+PKLELALQ SG+V+R
Sbjct: 44   NLVCLKTKEDDCKCNNTTDTHGSHNEFSHDVEKIFRILKKFHSRLPKLELALQHSGVVLR 103

Query: 466  SGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEE 645
             GLTERV+NRCGDAGNLGYR+++WASKQP Y HSYDVY+A+IK L KMR+FGAVWAL+EE
Sbjct: 104  PGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMRKFGAVWALMEE 163

Query: 646  MRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNG 825
            MRKE P L++ EVFV+LMRRFASARMVKKAIEVLDEMPKYGCEPDE+VFGCLLDALCKN 
Sbjct: 164  MRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVFGCLLDALCKNS 223

Query: 826  SVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNN 1005
            SVKEAA LF++MR +F P+++HFTSLLYGWCKEGKL+EAK+VLV+M++AGFEPDIVVYNN
Sbjct: 224  SVKEAAKLFDEMRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDAGFEPDIVVYNN 283

Query: 1006 LLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSG 1185
            LL+GYA  GKM DAF LL+EM+ KGC+PNA S+T+++QALC   KMEEA R F EMERSG
Sbjct: 284  LLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEANRAFVEMERSG 343

Query: 1186 CEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXX 1365
            CEADVVTYTTLISGFCK  +I+R YE+LDSMIQ+G  PN+ +YL+I+LAH          
Sbjct: 344  CEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLAHEKKEELEECV 403

Query: 1366 XXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGL 1545
                 M+KIG VPD++ YN VIRLACKLGE+KEA+  W ++E   +SPG D+ V++++G 
Sbjct: 404  ELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSPGTDSFVVMVHGF 463

Query: 1546 VEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDL 1725
            + QGCL+EAC+YFKEMV RGLLSAPQYGTLK+LLNSLLR+ K+EM+K+VWSCI+TKGC+L
Sbjct: 464  LGQGCLIEACEYFKEMVGRGLLSAPQYGTLKELLNSLLRAQKVEMAKDVWSCIVTKGCEL 523

Query: 1726 NVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEIT 1905
            NVYAWTIWIH+LFSNGHVKEACSYCLDMMDA VMPQPDTFAKLMRGL+KLYNRQ AAEIT
Sbjct: 524  NVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEIT 583

Query: 1906 EKVRKMA 1926
            EKVRKMA
Sbjct: 584  EKVRKMA 590


>ref|XP_006595472.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like, partial [Glycine max]
          Length = 656

 Score =  821 bits (2120), Expect = 0.0
 Identities = 398/593 (67%), Positives = 483/593 (81%), Gaps = 8/593 (1%)
 Frame = +1

Query: 172  ILHLVLYNESSNNRCLSSYSHLGLHQNPLNM-------NCFERIAEARRGLDLIRIRTEP 330
            +L LV+ +E++      + S L   Q P +        + F+  A   +    IR++   
Sbjct: 20   LLSLVIRHENTLCHFFCTTSELSSSQTPSSQLPPPHFKSTFDNNALTNQ-FGFIRLQEIS 78

Query: 331  EPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAG 510
               +D QT DEF +DVEKVYRILRK+HSR+PKLELAL+ESG+VVR GLTERVLNRCGDAG
Sbjct: 79   INHTDDQTHDEFASDVEKVYRILRKYHSRVPKLELALRESGVVVRPGLTERVLNRCGDAG 138

Query: 511  NLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFV 690
            NL YRF+ WASKQ G+R  +D YKAMIK+L +MRQFGAVWALIEEMR+ENPHL++P+VFV
Sbjct: 139  NLAYRFYSWASKQSGHRLDHDAYKAMIKVLSRMRQFGAVWALIEEMRQENPHLITPQVFV 198

Query: 691  VLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLK 870
            +LMRRFASARMV KA++VLDEMP YGCEPDEYVFGCLLDAL KNGSVKEAA LFE++R +
Sbjct: 199  ILMRRFASARMVHKAVQVLDEMPNYGCEPDEYVFGCLLDALRKNGSVKEAASLFEELRYR 258

Query: 871  FIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAF 1050
            + P++KHFTSLLYGWCKEGKLMEAK VLV+M++AG EPDIVVYNNLL GYA A KM DA+
Sbjct: 259  WKPSVKHFTSLLYGWCKEGKLMEAKHVLVQMKDAGIEPDIVVYNNLLGGYAQADKMGDAY 318

Query: 1051 VLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGF 1230
             LL+EM+ KGCEPNATS+T+++Q+LC   ++EEA RVF EM+R+GC+AD+VTY+TLISGF
Sbjct: 319  DLLKEMRRKGCEPNATSYTVLIQSLCKHERLEEATRVFVEMQRNGCQADLVTYSTLISGF 378

Query: 1231 CKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDL 1410
            CKWG+I RGYELLD MIQ+GH PN+  Y +I++AH               MQKIG  PDL
Sbjct: 379  CKWGKIKRGYELLDEMIQQGHFPNQVIYQHIMVAHEKKEELEECKELVNEMQKIGCAPDL 438

Query: 1411 TIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKE 1590
            +IYNTVIRLACKLGE+KE +R W ++E +G+SP +DT VI+ING +EQGCLVEAC+YFKE
Sbjct: 439  SIYNTVIRLACKLGEVKEGVRLWNEMESSGLSPSIDTFVIMINGFLEQGCLVEACEYFKE 498

Query: 1591 MVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCI-MTKGCDLNVYAWTIWIHALFS 1767
            MV RGL +APQYGTLK+L+NSLLR++KLEM+K+ W+CI  +KGC LNV AWTIWIHALFS
Sbjct: 499  MVGRGLFAAPQYGTLKELMNSLLRAEKLEMAKDAWNCITASKGCQLNVSAWTIWIHALFS 558

Query: 1768 NGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
             GHVKEACS+C+ MMD  +MPQPDTFAKLMRGL+KLYNR+FAAEITEKVRKMA
Sbjct: 559  KGHVKEACSFCIAMMDKDLMPQPDTFAKLMRGLKKLYNREFAAEITEKVRKMA 611


>ref|XP_006466418.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At5g65820-like [Citrus sinensis]
          Length = 638

 Score =  818 bits (2113), Expect = 0.0
 Identities = 390/547 (71%), Positives = 467/547 (85%), Gaps = 6/547 (1%)
 Frame = +1

Query: 304  DLIRIRTEPEPSSDSQTQD------EFTADVEKVYRILRKFHSRIPKLELALQESGIVVR 465
            +L+ ++T+ +      T D      EF+ DVEK++RIL+KFHSR+PKLELALQ SG+V+R
Sbjct: 44   NLVCLKTKEDDCKCDNTTDTHGSHNEFSHDVEKIFRILKKFHSRLPKLELALQHSGVVLR 103

Query: 466  SGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEE 645
             GLTERV+NRCGDAGNLGYR+++WASKQP Y HSYDVY+A+IK L KMR+FGAVWAL+EE
Sbjct: 104  PGLTERVINRCGDAGNLGYRYYMWASKQPNYVHSYDVYRALIKSLSKMRKFGAVWALMEE 163

Query: 646  MRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNG 825
            MRKE P L++ EVFV+LMRRFASARMVKKAIEVLDEMPKYGCEPDE+VFGCLLDALCKN 
Sbjct: 164  MRKEKPQLITTEVFVILMRRFASARMVKKAIEVLDEMPKYGCEPDEFVFGCLLDALCKNS 223

Query: 826  SVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNN 1005
            SVKEAA LF+++R +F P+++HFTSLLYGWCKEGKL+EAK+VLV+M++AGFEPDIVVYNN
Sbjct: 224  SVKEAAKLFDEIRERFKPSLRHFTSLLYGWCKEGKLVEAKYVLVQMKDAGFEPDIVVYNN 283

Query: 1006 LLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSG 1185
            LL+GYA  GKM DAF LL+EM+ KGC+PNA S+T+++QALC   KMEEA R F EMERSG
Sbjct: 284  LLSGYAQMGKMTDAFELLKEMRRKGCDPNANSYTVLIQALCRMEKMEEANRAFVEMERSG 343

Query: 1186 CEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXX 1365
            CEADVVTYTTLISGFCK  +I+R YE+LDSMIQ+G  PN+ +YL+I+LAH          
Sbjct: 344  CEADVVTYTTLISGFCKSRKIDRCYEILDSMIQRGILPNQLTYLHIMLAHEKKEELEECV 403

Query: 1366 XXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGL 1545
                 M+KIG VPD++ YN VIRLACKLGE+KEA+  W ++E   +SPG D+ V++++G 
Sbjct: 404  ELMGEMRKIGCVPDVSNYNVVIRLACKLGELKEAVNVWNEMEAASLSPGTDSFVVMVHGF 463

Query: 1546 VEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKGCDL 1725
            + QGCL+EAC+YFKEMV RGLLSAPQYGTLK LLNSLLR+ K+EM+K+VWSCI+TKGC+L
Sbjct: 464  LGQGCLIEACEYFKEMVGRGLLSAPQYGTLKALLNSLLRAQKVEMAKDVWSCIVTKGCEL 523

Query: 1726 NVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEIT 1905
            NVYAWTIWIH+LFSNGHVKEACSYCLDMMDA VMPQPDTFAKLMRGL+KLYNRQ AAEIT
Sbjct: 524  NVYAWTIWIHSLFSNGHVKEACSYCLDMMDADVMPQPDTFAKLMRGLKKLYNRQIAAEIT 583

Query: 1906 EKVRKMA 1926
            EKVRKMA
Sbjct: 584  EKVRKMA 590


>gb|EOX91773.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 647

 Score =  803 bits (2075), Expect = 0.0
 Identities = 393/613 (64%), Positives = 480/613 (78%), Gaps = 2/613 (0%)
 Frame = +1

Query: 94   SKKSLVLCGKYAPLFSSAKRNTPRKEILHLVLYNESSNNRCLSSYSHLGLHQNPLNMNCF 273
            S K+L L  +   L  S+  NT      H++  N ++NN           + N LN+   
Sbjct: 5    SSKTLCLIARQRHLSLSSYPNTYH---FHILPDNNNNNN-----------NSNSLNLLS- 49

Query: 274  ERIAEARRGLDLIRIRT-EPEPSSDSQTQ-DEFTADVEKVYRILRKFHSRIPKLELALQE 447
               + ++ G  L+ + T +P   SD+  Q D+F +DVEK+YRILRKFH+R+PKL LALQ+
Sbjct: 50   ---SNSKSGFGLVTLETKQPTLKSDNDQQTDDFASDVEKIYRILRKFHTRVPKLNLALQQ 106

Query: 448  SGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAV 627
            SG+V R GLTERVLNRCGDAGNLGY+FF WASKQPGY  SY++YKAMIKILGKMRQFGAV
Sbjct: 107  SGVVFRPGLTERVLNRCGDAGNLGYKFFTWASKQPGYHPSYEIYKAMIKILGKMRQFGAV 166

Query: 628  WALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLD 807
            WALIEE+++ENPH ++ E+F++L+RRFAS+RMVKKAIEV DEMPKYGC  D+ VFG LLD
Sbjct: 167  WALIEEIKRENPHFITAELFILLIRRFASSRMVKKAIEVFDEMPKYGCLQDDAVFGSLLD 226

Query: 808  ALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPD 987
            ALCKNG+VKEAAL+FE+MR++F+P +KHFTSLLYGWCKEG+++EAK VLV+M+EAGFEPD
Sbjct: 227  ALCKNGNVKEAALVFEEMRVRFLPNLKHFTSLLYGWCKEGRILEAKHVLVQMKEAGFEPD 286

Query: 988  IVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFS 1167
            IVV+NNLL+GY +  KM DAF LL+EM+ KG +PNA S+TIV+Q LC  ++MEEAMRVF 
Sbjct: 287  IVVFNNLLSGYVLGNKMGDAFDLLKEMRKKGIDPNANSYTIVIQGLCKADRMEEAMRVFV 346

Query: 1168 EMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXX 1347
            +MER+GC  DVV YTTLISGFCKWG + +GYE+LD MI +G  PN  +YL+I+LAH    
Sbjct: 347  DMERNGCRGDVVVYTTLISGFCKWGRVEKGYEVLDRMISEGLMPNSLTYLHIMLAHEKKD 406

Query: 1348 XXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHV 1527
                       M+KIG VPD  IYN V+RLACKL E+KEA R W ++E  G SPGVD  +
Sbjct: 407  ELEECLELMEEMRKIGCVPDGGIYNVVVRLACKLEEVKEAARVWNEMEGRGFSPGVDNFI 466

Query: 1528 ILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIM 1707
            ++I+G + QGCLVEAC+YFKEM  RGL   PQYG LKDLLNSLLR++KLEM+K VWSCI+
Sbjct: 467  VMIHGFIGQGCLVEACEYFKEMAGRGLFCVPQYGILKDLLNSLLRAEKLEMAKNVWSCIV 526

Query: 1708 TKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQ 1887
            +KGC+LNV AWTIW+HALFS GHVKEACSYCL+MMD  VMPQPDTFAKLMRGLRKLYNRQ
Sbjct: 527  SKGCELNVSAWTIWVHALFSKGHVKEACSYCLEMMDVDVMPQPDTFAKLMRGLRKLYNRQ 586

Query: 1888 FAAEITEKVRKMA 1926
             AAEITEKVRKMA
Sbjct: 587  IAAEITEKVRKMA 599


>gb|EPS62602.1| hypothetical protein M569_12187, partial [Genlisea aurea]
          Length = 593

 Score =  781 bits (2018), Expect = 0.0
 Identities = 387/551 (70%), Positives = 457/551 (82%), Gaps = 7/551 (1%)
 Frame = +1

Query: 295  RGLDLIRIRTEPEPSSDS-----QTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIV 459
            RG DLIRI  + +    S        D+F+ADVEKVY+ILRKF+S++PKLELALQ SG+ 
Sbjct: 3    RGFDLIRIEEDEQQQDCSVGRRNNISDDFSADVEKVYKILRKFNSKVPKLELALQHSGVS 62

Query: 460  VRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALI 639
            VRSGLTERVLNRCGDAGNLGYRFF+WASKQPGY HS+DVYKAMI+ILGKMRQFGAVWALI
Sbjct: 63   VRSGLTERVLNRCGDAGNLGYRFFVWASKQPGYNHSHDVYKAMIRILGKMRQFGAVWALI 122

Query: 640  EEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCK 819
            EEMRKENP LL+PEVF+VLMRRFASARMVKKA+EVLDEMP YGCEPDEYVFGCLLDALCK
Sbjct: 123  EEMRKENPQLLTPEVFIVLMRRFASARMVKKAVEVLDEMPSYGCEPDEYVFGCLLDALCK 182

Query: 820  NGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVY 999
            NGSVKEA+LL EDM+++F PT+KHFTSLL+GWC+EGKL+EAK VL KMREAGF PDIVVY
Sbjct: 183  NGSVKEASLLMEDMQMRFKPTMKHFTSLLHGWCREGKLIEAKTVLQKMREAGFLPDIVVY 242

Query: 1000 NNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMER 1179
            N LL GYA AGK+ DA  LL EM+   C P ATS+T V+++LCA+ KM EA+++FSEME 
Sbjct: 243  NTLLAGYAAAGKIADARHLLLEMRRNSCRPTATSYTAVIRSLCAREKMAEAVQLFSEMEA 302

Query: 1180 SGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXX 1359
             GCEADVV YTTLISGFCK G+  +GYELLD+MI+KG TPN T+Y Y++ AH        
Sbjct: 303  DGCEADVVAYTTLISGFCKRGKTGKGYELLDAMIRKGITPNNTTYSYLISAHEKEEELEE 362

Query: 1360 XXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILIN 1539
                   M++IG+ PD  +YN VIRL+CKLGE+++ IR   ++E +G+SPGVDT VILIN
Sbjct: 363  CLGLAKSMRQIGVTPDSAVYNPVIRLSCKLGEVEDGIRLMNEMEEDGISPGVDTFVILIN 422

Query: 1540 GLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMT-KG 1716
            GL+  G L EAC  F+EMV RGL++APQYG LKDLLNSLLR  KL++SK+VWS ++T KG
Sbjct: 423  GLILHGHLDEACLRFEEMVGRGLVAAPQYGLLKDLLNSLLRCGKLQLSKDVWSKMVTSKG 482

Query: 1717 -CDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFA 1893
             CD+NVYAWTIWIHAL S G+VKEAC Y L+MM+AG+MPQPDTFAKL+RGLRKLYNR+ A
Sbjct: 483  CCDVNVYAWTIWIHALLSKGYVKEACFYGLEMMEAGLMPQPDTFAKLIRGLRKLYNREIA 542

Query: 1894 AEITEKVRKMA 1926
            AEITEKV++MA
Sbjct: 543  AEITEKVKRMA 553


>ref|XP_006404107.1| hypothetical protein EUTSA_v10010190mg [Eutrema salsugineum]
            gi|557105226|gb|ESQ45560.1| hypothetical protein
            EUTSA_v10010190mg [Eutrema salsugineum]
          Length = 645

 Score =  774 bits (1998), Expect = 0.0
 Identities = 369/529 (69%), Positives = 439/529 (82%), Gaps = 3/529 (0%)
 Frame = +1

Query: 349  QTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRF 528
            Q +DEF  DVEK+YRILR +HSR+PKLEL L ESGI +R GL  RVL+RCGDAGNLGYRF
Sbjct: 64   QQEDEFAGDVEKIYRILRNYHSRVPKLELVLHESGINLRPGLIVRVLSRCGDAGNLGYRF 123

Query: 529  FIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRF 708
            F+WA+KQPGY HSY+V K+M+KIL KMRQFGAVWALIEEMRKENP L+ PE+FVVLMRRF
Sbjct: 124  FLWAAKQPGYCHSYEVCKSMVKILSKMRQFGAVWALIEEMRKENPQLIEPELFVVLMRRF 183

Query: 709  ASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIK 888
            ASA MVKKA+EVLDEMPKYG EPDEY+FGCLLDALCKNGSVK+A+ LFEDMR KF P ++
Sbjct: 184  ASANMVKKAVEVLDEMPKYGIEPDEYIFGCLLDALCKNGSVKDASKLFEDMRDKFPPNLR 243

Query: 889  HFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEM 1068
            +FTSLLYGWC+EGKL+EAK VLV+M+EAG EPDIVV+ NLL+GYA AGKM DA+ L+++M
Sbjct: 244  YFTSLLYGWCREGKLIEAKHVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDM 303

Query: 1069 KSKGCEPNATSFTIVVQALCAQNK-MEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGE 1245
            + +G EPNA  +T+++QALC   K M+EAMRVF EMER GCEAD+VTYT LISGFCKWG 
Sbjct: 304  RRRGYEPNANCYTVLIQALCKMEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWGM 363

Query: 1246 INRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNT 1425
            I++GY +LD M +KG  P + +Y+ I++AH               M++ G +PDL IYN 
Sbjct: 364  IDKGYSVLDDMRKKGVMPLQVTYMQIMVAHEKKEQFEECLDLIEKMKQNGCLPDLLIYNV 423

Query: 1426 VIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRG 1605
            VIRLACKLGE+KEA+R W ++E NG+SPGVDT VI+ING   QGCL+EACD+FKEMV RG
Sbjct: 424  VIRLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFASQGCLIEACDHFKEMVSRG 483

Query: 1606 LLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTK--GCDLNVYAWTIWIHALFSNGHV 1779
            + SAP YGTLK LLN+L+R DKLEM+K+VWSC+  K   C+LNV AWTIWIHALF+ GHV
Sbjct: 484  IFSAPHYGTLKILLNTLVRDDKLEMAKDVWSCLSNKSSSCELNVSAWTIWIHALFARGHV 543

Query: 1780 KEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
            KEACSYCLDMM+  +MPQPDT+AKLM+GL KLYNR  AAEITEKVRKMA
Sbjct: 544  KEACSYCLDMMEMDLMPQPDTYAKLMKGLNKLYNRTIAAEITEKVRKMA 592


>ref|XP_006393982.1| hypothetical protein EUTSA_v10003830mg [Eutrema salsugineum]
            gi|557090621|gb|ESQ31268.1| hypothetical protein
            EUTSA_v10003830mg [Eutrema salsugineum]
          Length = 620

 Score =  760 bits (1962), Expect = 0.0
 Identities = 361/544 (66%), Positives = 445/544 (81%), Gaps = 1/544 (0%)
 Frame = +1

Query: 298  GLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLT 477
            G  L+ +    +  + +   DEF +DVEK YRILRKFHSR+PKLELAL ESG+ +R GL 
Sbjct: 50   GTGLVCLDKSHKERTKNSNHDEFASDVEKAYRILRKFHSRVPKLELALNESGVELRPGLI 109

Query: 478  ERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKE 657
            ERVLNRCGDAGNLGYRFF+WA+KQPGY HSY VYK+M+KIL KMR F AVWALIEEMRKE
Sbjct: 110  ERVLNRCGDAGNLGYRFFVWAAKQPGYCHSYQVYKSMVKILSKMRHFEAVWALIEEMRKE 169

Query: 658  NPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKE 837
            NP L+ PE+FVVL+RRFAS+ MVKKAIEVLDEMPK+G EPDEYVFGCLLDALCKNGSVK+
Sbjct: 170  NPQLIEPELFVVLVRRFASSNMVKKAIEVLDEMPKFGLEPDEYVFGCLLDALCKNGSVKD 229

Query: 838  AALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNG 1017
            AA LFE+MRL+F P +++FTSLLYGWC+EGK+MEA+ VLV+M+EA FEPD+VVY NLL+G
Sbjct: 230  AAKLFEEMRLRFPPNLRYFTSLLYGWCREGKMMEAEHVLVEMKEARFEPDVVVYTNLLSG 289

Query: 1018 YAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEAD 1197
            YA AGKM +A+ LL++M+ +G EPNA  +T+++QALC  ++MEEAMRVF EMER  CEAD
Sbjct: 290  YAHAGKMAEAYDLLKDMRRRGFEPNANCYTVLIQALCKVDRMEEAMRVFVEMERYECEAD 349

Query: 1198 VVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXX 1377
            +VTY  L+SGFCKWG+I++ Y +LD MI+K   P++ +Y++I+ AH              
Sbjct: 350  IVTYNALVSGFCKWGKIDKCYSVLDDMIKKCLMPSQLTYMHIMAAHEKKEKFEECLELME 409

Query: 1378 XMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQG 1557
             M++IG   DL +YN VIRLACKLGE+KEA+R W ++E +G+SPGVDT VI+I+GL  QG
Sbjct: 410  KMKEIGYHLDLGVYNVVIRLACKLGEVKEAVRLWNEMEASGLSPGVDTFVIMIDGLTNQG 469

Query: 1558 CLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKG-CDLNVY 1734
            CL+EACD+FK MV RGL S PQYGTLK LLN+LLR  KLE +K++WSCIM++G C+LNV 
Sbjct: 470  CLLEACDHFKVMVSRGLFSVPQYGTLKSLLNALLRDGKLETAKDIWSCIMSEGSCELNVS 529

Query: 1735 AWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKV 1914
            +WTIWIHALFS G+VK+ACSYCL+MM+   M QPDTFAKLM+GL+KLYNR+FA EITEKV
Sbjct: 530  SWTIWIHALFSKGYVKDACSYCLEMMEMDFMLQPDTFAKLMKGLKKLYNREFAVEITEKV 589

Query: 1915 RKMA 1926
            R MA
Sbjct: 590  RNMA 593


>ref|XP_002866691.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297312526|gb|EFH42950.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 638

 Score =  757 bits (1954), Expect = 0.0
 Identities = 362/524 (69%), Positives = 439/524 (83%), Gaps = 1/524 (0%)
 Frame = +1

Query: 358  DEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFIW 537
            DEF +DVEK YRILRKFHSR+PKLELAL ESG+ +R GL ERVLNRCGDAGNLGYRFF+W
Sbjct: 78   DEFASDVEKAYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCGDAGNLGYRFFVW 137

Query: 538  ASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFASA 717
            A+KQP Y HS +VYK+M+KIL KMRQFGAVW LIEEMRKENP L+ PE+FVVL++RFASA
Sbjct: 138  AAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASA 197

Query: 718  RMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHFT 897
             MVKKAIEVLDEMP +G EPDEYVFGCLLDALCK+GSVK+AA LFEDMRL+F   +++FT
Sbjct: 198  DMVKKAIEVLDEMPTFGLEPDEYVFGCLLDALCKHGSVKDAAKLFEDMRLRFPVNLRYFT 257

Query: 898  SLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKSK 1077
            SLLYGWC+E K+MEAK+VLV+M+EAGFEPDIV Y NLL+GYA AGKM DA+ LL++M+ +
Sbjct: 258  SLLYGWCREEKMMEAKYVLVQMKEAGFEPDIVDYTNLLSGYANAGKMADAYDLLKDMRRR 317

Query: 1078 GCEPNATSFTIVVQALCAQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEINRG 1257
            G EPNAT +T+++QALC  ++MEEAM+VF EMER  CEADVVTYT L+SGFCKWG+I++ 
Sbjct: 318  GFEPNATCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKC 377

Query: 1258 YELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRL 1437
            Y +LD MI+KG  P++ +Y++I+ AH               M++I   PD+ IYN VIRL
Sbjct: 378  YLVLDDMIKKGLMPSQLTYMHIMAAHEKKEKLIECLELMEKMKQIEYHPDIGIYNVVIRL 437

Query: 1438 ACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLLSA 1617
            ACKLGE+KEA+R W ++E NG+SPG DT VI+INGL  QGCL+EACD+FKEMV RGL S 
Sbjct: 438  ACKLGEVKEAVRLWNEMEGNGLSPGADTFVIIINGLTSQGCLLEACDHFKEMVARGLFSV 497

Query: 1618 PQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTKG-CDLNVYAWTIWIHALFSNGHVKEACS 1794
            PQYGTLK LLN+LL+  KLEM+K+VWSCI +KG C+L+V +WTIWIHALFS G+ KEACS
Sbjct: 498  PQYGTLKLLLNTLLKDKKLEMAKDVWSCITSKGSCELSVSSWTIWIHALFSKGYEKEACS 557

Query: 1795 YCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
            YCL+M++   MPQPDTFAKLM+GL+KLY+R+FA EITEKVR MA
Sbjct: 558  YCLEMIELEFMPQPDTFAKLMKGLKKLYHREFAVEITEKVRNMA 601


>ref|NP_190542.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546755|sp|P0C8A0.1|PP275_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g49730 gi|332645062|gb|AEE78583.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 638

 Score =  756 bits (1953), Expect = 0.0
 Identities = 369/557 (66%), Positives = 444/557 (79%), Gaps = 3/557 (0%)
 Frame = +1

Query: 265  NCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQ 444
            N F    E + G+ L+     PE     + +DEF  +VEK+YRILR  HSR+PKLELAL 
Sbjct: 39   NDFVESTERKNGVGLVC----PE-----KHEDEFAGEVEKIYRILRNHHSRVPKLELALN 89

Query: 445  ESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGA 624
            ESGI +R GL  RVL+RCGDAGNLGYRFF+WA+KQPGY HSY+V K+M+ IL KMRQFGA
Sbjct: 90   ESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGA 149

Query: 625  VWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLL 804
            VW LIEEMRK NP L+ PE+FVVLMRRFASA MVKKA+EVLDEMPKYG EPDEYVFGCLL
Sbjct: 150  VWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLL 209

Query: 805  DALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEP 984
            DALCKNGSVKEA+ +FEDMR KF P +++FTSLLYGWC+EGKLMEAK VLV+M+EAG EP
Sbjct: 210  DALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEP 269

Query: 985  DIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALC-AQNKMEEAMRV 1161
            DIVV+ NLL+GYA AGKM DA+ L+ +M+ +G EPN   +T+++QALC  + +M+EAMRV
Sbjct: 270  DIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRV 329

Query: 1162 FSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXX 1341
            F EMER GCEAD+VTYT LISGFCKWG I++GY +LD M +KG  P++ +Y+ I++AH  
Sbjct: 330  FVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEK 389

Query: 1342 XXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDT 1521
                         M++ G  PDL IYN VIRLACKLGE+KEA+R W ++E NG+SPGVDT
Sbjct: 390  KEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDT 449

Query: 1522 HVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSC 1701
             VI+ING   QG L+EAC++FKEMV RG+ SAPQYGTLK LLN+L+R DKLEM+K+VWSC
Sbjct: 450  FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSC 509

Query: 1702 I--MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKL 1875
            I   T  C+LNV AWTIWIHAL++ GHVKEACSYCLDMM+  +MPQP+T+AKLM+GL KL
Sbjct: 510  ISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKL 569

Query: 1876 YNRQFAAEITEKVRKMA 1926
            YNR  AAEITEKV KMA
Sbjct: 570  YNRTIAAEITEKVVKMA 586


>emb|CAB66911.1| putative protein [Arabidopsis thaliana]
          Length = 1184

 Score =  756 bits (1953), Expect = 0.0
 Identities = 369/557 (66%), Positives = 444/557 (79%), Gaps = 3/557 (0%)
 Frame = +1

Query: 265  NCFERIAEARRGLDLIRIRTEPEPSSDSQTQDEFTADVEKVYRILRKFHSRIPKLELALQ 444
            N F    E + G+ L+     PE     + +DEF  +VEK+YRILR  HSR+PKLELAL 
Sbjct: 39   NDFVESTERKNGVGLVC----PE-----KHEDEFAGEVEKIYRILRNHHSRVPKLELALN 89

Query: 445  ESGIVVRSGLTERVLNRCGDAGNLGYRFFIWASKQPGYRHSYDVYKAMIKILGKMRQFGA 624
            ESGI +R GL  RVL+RCGDAGNLGYRFF+WA+KQPGY HSY+V K+M+ IL KMRQFGA
Sbjct: 90   ESGIDLRPGLIIRVLSRCGDAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKMRQFGA 149

Query: 625  VWALIEEMRKENPHLLSPEVFVVLMRRFASARMVKKAIEVLDEMPKYGCEPDEYVFGCLL 804
            VW LIEEMRK NP L+ PE+FVVLMRRFASA MVKKA+EVLDEMPKYG EPDEYVFGCLL
Sbjct: 150  VWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGLEPDEYVFGCLL 209

Query: 805  DALCKNGSVKEAALLFEDMRLKFIPTIKHFTSLLYGWCKEGKLMEAKFVLVKMREAGFEP 984
            DALCKNGSVKEA+ +FEDMR KF P +++FTSLLYGWC+EGKLMEAK VLV+M+EAG EP
Sbjct: 210  DALCKNGSVKEASKVFEDMREKFPPNLRYFTSLLYGWCREGKLMEAKEVLVQMKEAGLEP 269

Query: 985  DIVVYNNLLNGYAVAGKMVDAFVLLQEMKSKGCEPNATSFTIVVQALC-AQNKMEEAMRV 1161
            DIVV+ NLL+GYA AGKM DA+ L+ +M+ +G EPN   +T+++QALC  + +M+EAMRV
Sbjct: 270  DIVVFTNLLSGYAHAGKMADAYDLMNDMRKRGFEPNVNCYTVLIQALCRTEKRMDEAMRV 329

Query: 1162 FSEMERSGCEADVVTYTTLISGFCKWGEINRGYELLDSMIQKGHTPNRTSYLYILLAHXX 1341
            F EMER GCEAD+VTYT LISGFCKWG I++GY +LD M +KG  P++ +Y+ I++AH  
Sbjct: 330  FVEMERYGCEADIVTYTALISGFCKWGMIDKGYSVLDDMRKKGVMPSQVTYMQIMVAHEK 389

Query: 1342 XXXXXXXXXXXXXMQKIGLVPDLTIYNTVIRLACKLGEIKEAIRFWTQIEVNGMSPGVDT 1521
                         M++ G  PDL IYN VIRLACKLGE+KEA+R W ++E NG+SPGVDT
Sbjct: 390  KEQFEECLELIEKMKRRGCHPDLLIYNVVIRLACKLGEVKEAVRLWNEMEANGLSPGVDT 449

Query: 1522 HVILINGLVEQGCLVEACDYFKEMVYRGLLSAPQYGTLKDLLNSLLRSDKLEMSKEVWSC 1701
             VI+ING   QG L+EAC++FKEMV RG+ SAPQYGTLK LLN+L+R DKLEM+K+VWSC
Sbjct: 450  FVIMINGFTSQGFLIEACNHFKEMVSRGIFSAPQYGTLKSLLNNLVRDDKLEMAKDVWSC 509

Query: 1702 I--MTKGCDLNVYAWTIWIHALFSNGHVKEACSYCLDMMDAGVMPQPDTFAKLMRGLRKL 1875
            I   T  C+LNV AWTIWIHAL++ GHVKEACSYCLDMM+  +MPQP+T+AKLM+GL KL
Sbjct: 510  ISNKTSSCELNVSAWTIWIHALYAKGHVKEACSYCLDMMEMDLMPQPNTYAKLMKGLNKL 569

Query: 1876 YNRQFAAEITEKVRKMA 1926
            YNR  AAEITEKV KMA
Sbjct: 570  YNRTIAAEITEKVVKMA 586


>ref|XP_006292382.1| hypothetical protein CARUB_v10018595mg [Capsella rubella]
            gi|482561089|gb|EOA25280.1| hypothetical protein
            CARUB_v10018595mg [Capsella rubella]
          Length = 639

 Score =  756 bits (1951), Expect = 0.0
 Identities = 360/527 (68%), Positives = 434/527 (82%), Gaps = 3/527 (0%)
 Frame = +1

Query: 355  QDEFTADVEKVYRILRKFHSRIPKLELALQESGIVVRSGLTERVLNRCGDAGNLGYRFFI 534
            +DEF  DV+K+YRILR +HSR+PKLELAL ES I +R GL  RVL+RCGDAGNLGYRFF+
Sbjct: 61   EDEFAGDVDKIYRILRNYHSRVPKLELALNESSIDLRPGLIVRVLSRCGDAGNLGYRFFL 120

Query: 535  WASKQPGYRHSYDVYKAMIKILGKMRQFGAVWALIEEMRKENPHLLSPEVFVVLMRRFAS 714
            WA+KQPGY HSY+V K+M+K+L KMRQFGAVW LIEEMRKENP L+ PE+FV+LMRRFAS
Sbjct: 121  WAAKQPGYCHSYEVCKSMVKVLSKMRQFGAVWGLIEEMRKENPELIEPELFVILMRRFAS 180

Query: 715  ARMVKKAIEVLDEMPKYGCEPDEYVFGCLLDALCKNGSVKEAALLFEDMRLKFIPTIKHF 894
            A MVKKA+EVLDEMPKYG EPDEYVFGCLLDALCKNGSVK+A+ LFEDM+ K+ P +++F
Sbjct: 181  ANMVKKAVEVLDEMPKYGLEPDEYVFGCLLDALCKNGSVKDASKLFEDMKEKYPPNLRYF 240

Query: 895  TSLLYGWCKEGKLMEAKFVLVKMREAGFEPDIVVYNNLLNGYAVAGKMVDAFVLLQEMKS 1074
            TSLLYGWC+EGKLMEAK VLV+M+EAG EPDIVV+ NLL+GYA AGKM DA+ L+++M+ 
Sbjct: 241  TSLLYGWCREGKLMEAKEVLVQMKEAGLEPDIVVFTNLLSGYAHAGKMADAYDLMKDMRK 300

Query: 1075 KGCEPNATSFTIVVQALC-AQNKMEEAMRVFSEMERSGCEADVVTYTTLISGFCKWGEIN 1251
            +G EPNA  +T+++QALC  + +M+EAMRVF EMER GCEAD+VTYT LISGFCKW  I+
Sbjct: 301  RGYEPNANCYTVLIQALCKTEKRMDEAMRVFVEMERYGCEADIVTYTALISGFCKWEMID 360

Query: 1252 RGYELLDSMIQKGHTPNRTSYLYILLAHXXXXXXXXXXXXXXXMQKIGLVPDLTIYNTVI 1431
            +GY +LD M +KG  P++ +Y+ I++AH               M++IG   DL IYN VI
Sbjct: 361  KGYSVLDDMRKKGVIPSQVTYMQIMVAHEKKEQFEECLDLIEKMKQIGCQLDLLIYNVVI 420

Query: 1432 RLACKLGEIKEAIRFWTQIEVNGMSPGVDTHVILINGLVEQGCLVEACDYFKEMVYRGLL 1611
            RLACKLGE+KEA+R W ++E NG+SPGVDT VI+ING   QGCLVEAC++FKEMV RG+ 
Sbjct: 421  RLACKLGEVKEAVRLWNEMEANGLSPGVDTFVIMINGFTSQGCLVEACNHFKEMVSRGIF 480

Query: 1612 SAPQYGTLKDLLNSLLRSDKLEMSKEVWSCIMTK--GCDLNVYAWTIWIHALFSNGHVKE 1785
            SAPQYGTLK LLN+L+R +KLEM+K+VWSCI  K   C+LNV AWTIWIHAL + GHVKE
Sbjct: 481  SAPQYGTLKLLLNNLVRDEKLEMAKDVWSCISNKSSSCELNVSAWTIWIHALLAKGHVKE 540

Query: 1786 ACSYCLDMMDAGVMPQPDTFAKLMRGLRKLYNRQFAAEITEKVRKMA 1926
            ACSYCLDMM   +MPQPDT+ KLM+GL KLYNR  AAEITEKV KMA
Sbjct: 541  ACSYCLDMMKMDLMPQPDTYVKLMKGLNKLYNRTIAAEITEKVMKMA 587

