BLASTX nr result

ID: Forsythia22_contig00014784 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00014784
         (2779 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011094657.1| PREDICTED: pentatricopeptide repeat-containi...  1160   0.0  
ref|XP_012840925.1| PREDICTED: pentatricopeptide repeat-containi...  1103   0.0  
emb|CDO97701.1| unnamed protein product [Coffea canephora]           1087   0.0  
emb|CBI32743.3| unnamed protein product [Vitis vinifera]             1053   0.0  
ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containi...  1053   0.0  
ref|XP_009630943.1| PREDICTED: pentatricopeptide repeat-containi...  1043   0.0  
ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containi...  1040   0.0  
ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containi...  1038   0.0  
ref|XP_010111755.1| hypothetical protein L484_008414 [Morus nota...  1033   0.0  
ref|XP_009782844.1| PREDICTED: pentatricopeptide repeat-containi...  1031   0.0  
ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfam...  1023   0.0  
ref|XP_010061750.1| PREDICTED: pentatricopeptide repeat-containi...  1010   0.0  
ref|XP_012481297.1| PREDICTED: pentatricopeptide repeat-containi...  1006   0.0  
ref|XP_012082370.1| PREDICTED: pentatricopeptide repeat-containi...   999   0.0  
ref|XP_002530985.1| pentatricopeptide repeat-containing protein,...   998   0.0  
ref|XP_008378595.1| PREDICTED: pentatricopeptide repeat-containi...   995   0.0  
ref|XP_009376212.1| PREDICTED: pentatricopeptide repeat-containi...   994   0.0  
ref|XP_004299746.2| PREDICTED: pentatricopeptide repeat-containi...   993   0.0  
ref|XP_008338950.1| PREDICTED: pentatricopeptide repeat-containi...   993   0.0  
ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containi...   987   0.0  

>ref|XP_011094657.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Sesamum indicum]
          Length = 756

 Score = 1160 bits (3000), Expect = 0.0
 Identities = 595/740 (80%), Positives = 648/740 (87%), Gaps = 4/740 (0%)
 Frame = -3

Query: 2639 MAYLAASRQPHXXXXXXXXXXXXSI--FFFYSSSPVDSPNPESNPVEHLIPESSCADSSA 2466
            MA LAA RQ H            SI    F SS+P     P SN  E  +  +  ADSS+
Sbjct: 1    MASLAARRQTHLNANLTNLSSPLSIKSLSFCSSAP---SKPSSNE-EPSVGANQNADSSS 56

Query: 2465 TVTAAANPSRN--AGGKRQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYN 2292
            T T AA+ S    + GKRQKNPEKIEDII RMMANRAWTTRLQNSIRNLVPSFDHELVYN
Sbjct: 57   TKTTAASASATFRSEGKRQKNPEKIEDIICRMMANRAWTTRLQNSIRNLVPSFDHELVYN 116

Query: 2291 VLHGAKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVE 2112
            VLHGAK SE+ALQFFRWVERS LFQHNRETHLKIIEILGRASKLNHARCILLDMPKKG+E
Sbjct: 117  VLHGAKKSEHALQFFRWVERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLE 176

Query: 2111 WDEDLWVLLIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRY 1932
            WDEDLWVL+IDSYGKAGIVQESVKLFQKMEELGV R+IKSYD LF+VI+RRGRYMMAKRY
Sbjct: 177  WDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERSIKSYDALFKVILRRGRYMMAKRY 236

Query: 1931 FNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNR 1752
            FNKMLSEGIEPTRHTFN+MIWGFFLSGKVETANRFFEDMK+REI+PDVVTYNTLINGY R
Sbjct: 237  FNKMLSEGIEPTRHTFNIMIWGFFLSGKVETANRFFEDMKNREIMPDVVTYNTLINGYYR 296

Query: 1751 VKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAIT 1572
            VKKM+EAEKY+VEMKGRNIEPTV+TYTTLIKGY+SV +VDDAL+L+E+MK F IKPNAIT
Sbjct: 297  VKKMEEAEKYFVEMKGRNIEPTVVTYTTLIKGYVSVDRVDDALRLMEEMKGFGIKPNAIT 356

Query: 1571 YSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAM 1392
            YSTLLPG+CDAEKMSEA+ ILKEMVEKYIAPKDN+IF RLISGQCK G+LDAA DVLKAM
Sbjct: 357  YSTLLPGLCDAEKMSEAQTILKEMVEKYIAPKDNSIFMRLISGQCKVGNLDAAADVLKAM 416

Query: 1391 IRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIE 1212
            IRLSVPTEAGHYG+LIEN CKAG+YD  V            LRPQSTLHMEPSAYNP+IE
Sbjct: 417  IRLSVPTEAGHYGVLIENCCKAGEYDRGVKLLDKLIEKDIILRPQSTLHMEPSAYNPLIE 476

Query: 1211 YLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSE 1032
            YLC+NGQ+AKAETLLRQLMK+GVQDPIA N LICG ++EGNPD A E+LKIM+RRNV SE
Sbjct: 477  YLCNNGQSAKAETLLRQLMKLGVQDPIALNTLICGRSQEGNPDSAFEILKIMLRRNVRSE 536

Query: 1031 KIAYESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMK 852
            K AY+SL++SYL+KS PADAK  LDSM+ENGHLPDSSLYRSVMESL EDGRVQTASRVMK
Sbjct: 537  KSAYDSLVKSYLRKSNPADAKAALDSMVENGHLPDSSLYRSVMESLFEDGRVQTASRVMK 596

Query: 851  TMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTI 672
             MLEKGV DHEDLIAKILEAL MRGHVEEA GRIELLM +G+APDFDSLLS LCEKGKTI
Sbjct: 597  MMLEKGVTDHEDLIAKILEALFMRGHVEEACGRIELLMQSGIAPDFDSLLSVLCEKGKTI 656

Query: 671  SALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLI 492
            +ALKLLD+GLERD  ID SSY KVLDALLAAGKTLNAYS+L KIMEKGG+TDWSSCK+LI
Sbjct: 657  AALKLLDYGLERDYKIDVSSYEKVLDALLAAGKTLNAYSILLKIMEKGGVTDWSSCKELI 716

Query: 491  QNLNEEGNTKQADILSRMIL 432
            ++LNEEGNTKQADIL+RMI+
Sbjct: 717  KSLNEEGNTKQADILARMIM 736


>ref|XP_012840925.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Erythranthe guttatus]
          Length = 763

 Score = 1103 bits (2854), Expect = 0.0
 Identities = 558/742 (75%), Positives = 630/742 (84%), Gaps = 7/742 (0%)
 Frame = -3

Query: 2639 MAYLAASRQPHXXXXXXXXXXXXSI--FFFYSSSPVDSPNPESNPVEHL-IPESSCADSS 2469
            MA+LAAS+QPH            SI    F S++P   PNP  NP E L + E   ADSS
Sbjct: 1    MAFLAASKQPHFNSNITKLSSPFSIKSLLFCSAAPSPPPNPNPNPNEELPVSEIPIADSS 60

Query: 2468 ATVTAAANPSRNAGGKRQ----KNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHEL 2301
            +    AA P      +RQ    KNPEKIEDII RMMANRAWTTRLQNSIR LVP+FDHEL
Sbjct: 61   SANATAAEPPSPPTFRRQLRRPKNPEKIEDIICRMMANRAWTTRLQNSIRKLVPAFDHEL 120

Query: 2300 VYNVLHGAKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKK 2121
            VYNVLH ++NSE+ALQFFRWVERS LFQHNRETH KIIEILGRASKLNHARCILLDMPKK
Sbjct: 121  VYNVLHASRNSEHALQFFRWVERSSLFQHNRETHHKIIEILGRASKLNHARCILLDMPKK 180

Query: 2120 GVEWDEDLWVLLIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMA 1941
            G+EWDEDLWV++IDSYGKAGIVQESVKLFQKMEELGV R IKSY+TLF+VI RRGRYMMA
Sbjct: 181  GLEWDEDLWVMMIDSYGKAGIVQESVKLFQKMEELGVERGIKSYNTLFKVISRRGRYMMA 240

Query: 1940 KRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLING 1761
            KRYFNKMLSEGIEP RHTFN++IWGFFLSGKVETANRFFEDMK+REI PDVVTYNTLING
Sbjct: 241  KRYFNKMLSEGIEPNRHTFNLLIWGFFLSGKVETANRFFEDMKTREITPDVVTYNTLING 300

Query: 1760 YNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPN 1581
            Y RVKKMDEA KY+ EMKGRNIEP V+TYTTLIKGY+SV QVDDAL+L+E+MK F IKPN
Sbjct: 301  YYRVKKMDEAVKYFTEMKGRNIEPNVVTYTTLIKGYVSVEQVDDALRLVEEMKGFGIKPN 360

Query: 1580 AITYSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVL 1401
            AITYSTLLPG+CDAEKMSEA+ IL+EMV+K+I P +N+IF RL+ GQCK GDLDAA DVL
Sbjct: 361  AITYSTLLPGLCDAEKMSEAKTILREMVDKHIGPLENSIFMRLLYGQCKVGDLDAAADVL 420

Query: 1400 KAMIRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNP 1221
            KAMI+LSVPTEAGHYGILIENFCKAGQYD AV            LR +STLHMEP++YNP
Sbjct: 421  KAMIKLSVPTEAGHYGILIENFCKAGQYDRAVKLLDKLVEKDIILRSESTLHMEPTSYNP 480

Query: 1220 IIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNV 1041
            II+YLC NGQTAKAE L+RQLMK+GV+D +A N LI GHA+EG+PD A ELLKIM+RRNV
Sbjct: 481  IIDYLCQNGQTAKAEALVRQLMKLGVRDSVALNTLIRGHAQEGSPDSAFELLKIMLRRNV 540

Query: 1040 LSEKIAYESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASR 861
             ++K AY+SL++SYL K++PA+AK  +DSM+ENGH+PDSSL+RSVMESL EDGRVQTASR
Sbjct: 541  PTDKTAYDSLVQSYLTKNDPAEAKAVIDSMVENGHIPDSSLFRSVMESLFEDGRVQTASR 600

Query: 860  VMKTMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKG 681
            VM  MLEKGV +HEDLI KILEALLMRGHVEEA+GRI+LLM +G+ PD D LLS LCEKG
Sbjct: 601  VMNMMLEKGVNEHEDLIFKILEALLMRGHVEEALGRIDLLMQSGIEPDLDGLLSVLCEKG 660

Query: 680  KTISALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCK 501
            KTI+ALK++D+GLERD  ID SSY KVLDA LAAGKTLNAYS+LCKIM KGG++DWSSCK
Sbjct: 661  KTIAALKVVDYGLERDYTIDVSSYEKVLDAQLAAGKTLNAYSILCKIMGKGGVSDWSSCK 720

Query: 500  DLIQNLNEEGNTKQADILSRMI 435
            DLI++LN+EGNTKQADIL RMI
Sbjct: 721  DLIKSLNDEGNTKQADILRRMI 742


>emb|CDO97701.1| unnamed protein product [Coffea canephora]
          Length = 753

 Score = 1087 bits (2810), Expect = 0.0
 Identities = 557/742 (75%), Positives = 626/742 (84%), Gaps = 6/742 (0%)
 Frame = -3

Query: 2639 MAYLAASRQPHXXXXXXXXXXXXSIF---FFYSSSPVDSPNPESNPVEHLIPESSCADSS 2469
            MAYL+AS+  H                  FF+ SSP  +P+ E+  +    P  S    +
Sbjct: 1    MAYLSASKPSHFHPRNLSNVSTPLSLKSLFFFCSSPGGNPDQETAAIS---PTESTGPET 57

Query: 2468 ATVTAAANPSRNAGGKR---QKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELV 2298
              V  AA PSR+  GK    QKNPEK+EDII RMMANRAWTTRLQNSIRNLVPSFDHELV
Sbjct: 58   RNVDPAATPSRSPRGKHRRLQKNPEKLEDIICRMMANRAWTTRLQNSIRNLVPSFDHELV 117

Query: 2297 YNVLHGAKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKG 2118
            YNVLHGAKNSE+ALQFFRWVER+GLFQH RETHLKIIEILGRASKLNHARCILLD+P+KG
Sbjct: 118  YNVLHGAKNSEHALQFFRWVERAGLFQHTRETHLKIIEILGRASKLNHARCILLDLPQKG 177

Query: 2117 VEWDEDLWVLLIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAK 1938
            VEWDED+WVLLI+SYG AGIVQESV+LFQKMEELGV RTIK+YD LF+VIMRRGRY MAK
Sbjct: 178  VEWDEDMWVLLIESYGSAGIVQESVQLFQKMEELGVQRTIKTYDALFKVIMRRGRYGMAK 237

Query: 1937 RYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGY 1758
            RYFNKML EGIEPTRHT+N+MIWGFFLS KVE+A RFFE+MKSREI PDVVTYNT+INGY
Sbjct: 238  RYFNKMLKEGIEPTRHTYNLMIWGFFLSSKVESAVRFFEEMKSREISPDVVTYNTMINGY 297

Query: 1757 NRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNA 1578
             RVK M+EAEKY+VEMKGRN+EP+VITYTTLIKG++S G VD+AL+ LE+MK F IKPNA
Sbjct: 298  CRVKNMEEAEKYFVEMKGRNLEPSVITYTTLIKGFVSAGGVDNALRFLEEMKKFGIKPNA 357

Query: 1577 ITYSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLK 1398
            ITYSTLLPG+CDA+KMSEA  ILKEMVE++IAP D++IFTRL+SGQCKAG LDAA DVLK
Sbjct: 358  ITYSTLLPGLCDADKMSEADKILKEMVERHIAPNDDSIFTRLLSGQCKAGHLDAAADVLK 417

Query: 1397 AMIRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPI 1218
            AMIRLS+PTEAGHYGILIENFCKAG Y  AV            LRPQ+TL MEPSAYNP+
Sbjct: 418  AMIRLSIPTEAGHYGILIENFCKAGAYQRAVQLLDKLIEKEIILRPQTTLQMEPSAYNPV 477

Query: 1217 IEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVL 1038
            IE+LC+NGQT KAE LLRQLMKMGVQDP+AFN LI GH+KEG P+ A ELL IM+RR V+
Sbjct: 478  IEHLCNNGQTRKAEMLLRQLMKMGVQDPVAFNYLIRGHSKEGTPESASELLTIMVRRKVV 537

Query: 1037 SEKIAYESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRV 858
            SE  A+ SLIESYL K +PADAK+ LD+MIENGHLPDSSLYRSVMESL  DGRVQTASRV
Sbjct: 538  SEASAHVSLIESYLTKGDPADAKSALDTMIENGHLPDSSLYRSVMESLFADGRVQTASRV 597

Query: 857  MKTMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGK 678
            M TMLEKGVK+H DLIAKILEALL+RGHVEEA+GRIELLM NGLAP+FD+LLS LCEKGK
Sbjct: 598  MMTMLEKGVKEHMDLIAKILEALLLRGHVEEALGRIELLMQNGLAPNFDNLLSILCEKGK 657

Query: 677  TISALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKD 498
            TI+ALKLLD  L+RD ++DFSSY KVLD LLAAGKTLNAYS+LCKI EKGG+TD +S +D
Sbjct: 658  TIAALKLLDHCLQRDYSVDFSSYDKVLDGLLAAGKTLNAYSILCKITEKGGVTDKNSYED 717

Query: 497  LIQNLNEEGNTKQADILSRMIL 432
            LI+ LN EGNTKQADILSRMI+
Sbjct: 718  LIKTLNAEGNTKQADILSRMIM 739


>emb|CBI32743.3| unnamed protein product [Vitis vinifera]
          Length = 772

 Score = 1053 bits (2722), Expect = 0.0
 Identities = 534/717 (74%), Positives = 609/717 (84%), Gaps = 8/717 (1%)
 Frame = -3

Query: 2561 FFYSSSPVDSPNPESNPVEHLIPESSCADSSA---TVTAA-----ANPSRNAGGKRQKNP 2406
            F  S S VD      +     IPE+  + S +    +TAA     A+P    G  + +NP
Sbjct: 29   FIQSFSSVDESISAGDLTSSPIPETPVSGSPSEPGNLTAAEAGEKASPRTPRG--KLRNP 86

Query: 2405 EKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSG 2226
            EKIEDII RMMANRAWTTRLQNSIR+LVP FDH LV+NVLHG++NS++ALQFFRWVER+G
Sbjct: 87   EKIEDIICRMMANRAWTTRLQNSIRSLVPQFDHSLVWNVLHGSRNSDHALQFFRWVERAG 146

Query: 2225 LFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQES 2046
            LF+H+R+THLKIIEILGRASKLNHARCILLDMPKKGVEWDEDL+VLLIDSYGKAGIVQES
Sbjct: 147  LFRHDRDTHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLFVLLIDSYGKAGIVQES 206

Query: 2045 VKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWG 1866
            VK+FQKM+ELGV RTIKSYD LF+VI+RRGRYMMAKRYFN ML+EG+ PT HT+N+MIWG
Sbjct: 207  VKVFQKMKELGVERTIKSYDALFKVILRRGRYMMAKRYFNAMLNEGVMPTCHTYNIMIWG 266

Query: 1865 FFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPT 1686
            FFLS KVETANRFFE+MK R I PDVVTYNT+INGY R+KKM+EAEK++VEMKGRNIEPT
Sbjct: 267  FFLSLKVETANRFFEEMKERRISPDVVTYNTMINGYYRIKKMEEAEKFFVEMKGRNIEPT 326

Query: 1685 VITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILK 1506
            VI+YTT+IKGY+SVG+VDD L+L E+MKSF IKPNA+TYSTLLPG+CD EKM EA++++K
Sbjct: 327  VISYTTMIKGYVSVGRVDDGLRLFEEMKSFGIKPNAVTYSTLLPGLCDGEKMLEAQNVVK 386

Query: 1505 EMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKA 1326
            EMVE+YIAPKDN+IF RLI+ QCKAG LDAA DVLKAMIRLS+PTEAGHYG+LIENFCK+
Sbjct: 387  EMVERYIAPKDNSIFMRLITCQCKAGQLDAAADVLKAMIRLSIPTEAGHYGVLIENFCKS 446

Query: 1325 GQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMG 1146
            G YD AV            LRPQ++L ME S YN IIEYLC++GQT+KAETL RQLMK G
Sbjct: 447  GVYDRAVKLLDKLIEKEIILRPQNSLEMESSGYNLIIEYLCNSGQTSKAETLFRQLMKKG 506

Query: 1145 VQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKT 966
            VQDPIAFNNLI GH+KEG P+ A E+LKIM RR V  E  AY  LIES+LKK EPADAKT
Sbjct: 507  VQDPIAFNNLIRGHSKEGAPESAFEILKIMGRREVPREADAYRLLIESFLKKGEPADAKT 566

Query: 965  TLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALL 786
             LD MIENGH+PDSSL+RSVMESL EDGR+QTASRVM  M+EKGVK++ DL+AKILEALL
Sbjct: 567  ALDGMIENGHIPDSSLFRSVMESLFEDGRIQTASRVMNNMVEKGVKENMDLVAKILEALL 626

Query: 785  MRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYY 606
            +RGHVEEA+GRI+LLM NG  PDFD LLS LC KGKTI+ALKLLDFGLERD NI FSSY 
Sbjct: 627  LRGHVEEALGRIDLLMNNGCEPDFDGLLSVLCAKGKTIAALKLLDFGLERDYNISFSSYE 686

Query: 605  KVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
             VLDALL AGKTLNAYS+LCKIM+KGG TDWSSCKDLI++LNEEGNTKQADILSRMI
Sbjct: 687  NVLDALLTAGKTLNAYSILCKIMQKGGATDWSSCKDLIRSLNEEGNTKQADILSRMI 743


>ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Vitis vinifera]
          Length = 763

 Score = 1053 bits (2722), Expect = 0.0
 Identities = 534/717 (74%), Positives = 609/717 (84%), Gaps = 8/717 (1%)
 Frame = -3

Query: 2561 FFYSSSPVDSPNPESNPVEHLIPESSCADSSA---TVTAA-----ANPSRNAGGKRQKNP 2406
            F  S S VD      +     IPE+  + S +    +TAA     A+P    G  + +NP
Sbjct: 29   FIQSFSSVDESISAGDLTSSPIPETPVSGSPSEPGNLTAAEAGEKASPRTPRG--KLRNP 86

Query: 2405 EKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSG 2226
            EKIEDII RMMANRAWTTRLQNSIR+LVP FDH LV+NVLHG++NS++ALQFFRWVER+G
Sbjct: 87   EKIEDIICRMMANRAWTTRLQNSIRSLVPQFDHSLVWNVLHGSRNSDHALQFFRWVERAG 146

Query: 2225 LFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQES 2046
            LF+H+R+THLKIIEILGRASKLNHARCILLDMPKKGVEWDEDL+VLLIDSYGKAGIVQES
Sbjct: 147  LFRHDRDTHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLFVLLIDSYGKAGIVQES 206

Query: 2045 VKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWG 1866
            VK+FQKM+ELGV RTIKSYD LF+VI+RRGRYMMAKRYFN ML+EG+ PT HT+N+MIWG
Sbjct: 207  VKVFQKMKELGVERTIKSYDALFKVILRRGRYMMAKRYFNAMLNEGVMPTCHTYNIMIWG 266

Query: 1865 FFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPT 1686
            FFLS KVETANRFFE+MK R I PDVVTYNT+INGY R+KKM+EAEK++VEMKGRNIEPT
Sbjct: 267  FFLSLKVETANRFFEEMKERRISPDVVTYNTMINGYYRIKKMEEAEKFFVEMKGRNIEPT 326

Query: 1685 VITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILK 1506
            VI+YTT+IKGY+SVG+VDD L+L E+MKSF IKPNA+TYSTLLPG+CD EKM EA++++K
Sbjct: 327  VISYTTMIKGYVSVGRVDDGLRLFEEMKSFGIKPNAVTYSTLLPGLCDGEKMLEAQNVVK 386

Query: 1505 EMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKA 1326
            EMVE+YIAPKDN+IF RLI+ QCKAG LDAA DVLKAMIRLS+PTEAGHYG+LIENFCK+
Sbjct: 387  EMVERYIAPKDNSIFMRLITCQCKAGQLDAAADVLKAMIRLSIPTEAGHYGVLIENFCKS 446

Query: 1325 GQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMG 1146
            G YD AV            LRPQ++L ME S YN IIEYLC++GQT+KAETL RQLMK G
Sbjct: 447  GVYDRAVKLLDKLIEKEIILRPQNSLEMESSGYNLIIEYLCNSGQTSKAETLFRQLMKKG 506

Query: 1145 VQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKT 966
            VQDPIAFNNLI GH+KEG P+ A E+LKIM RR V  E  AY  LIES+LKK EPADAKT
Sbjct: 507  VQDPIAFNNLIRGHSKEGAPESAFEILKIMGRREVPREADAYRLLIESFLKKGEPADAKT 566

Query: 965  TLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALL 786
             LD MIENGH+PDSSL+RSVMESL EDGR+QTASRVM  M+EKGVK++ DL+AKILEALL
Sbjct: 567  ALDGMIENGHIPDSSLFRSVMESLFEDGRIQTASRVMNNMVEKGVKENMDLVAKILEALL 626

Query: 785  MRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYY 606
            +RGHVEEA+GRI+LLM NG  PDFD LLS LC KGKTI+ALKLLDFGLERD NI FSSY 
Sbjct: 627  LRGHVEEALGRIDLLMNNGCEPDFDGLLSVLCAKGKTIAALKLLDFGLERDYNISFSSYE 686

Query: 605  KVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
             VLDALL AGKTLNAYS+LCKIM+KGG TDWSSCKDLI++LNEEGNTKQADILSRMI
Sbjct: 687  NVLDALLTAGKTLNAYSILCKIMQKGGATDWSSCKDLIRSLNEEGNTKQADILSRMI 743


>ref|XP_009630943.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Nicotiana tomentosiformis]
          Length = 721

 Score = 1043 bits (2696), Expect = 0.0
 Identities = 524/710 (73%), Positives = 601/710 (84%)
 Frame = -3

Query: 2561 FFYSSSPVDSPNPESNPVEHLIPESSCADSSATVTAAANPSRNAGGKRQKNPEKIEDIIN 2382
            FFY S  +++P+P +      IP +  A                     K PEK+ED+I 
Sbjct: 22   FFYCSESLNNPDPSTR-----IPTTHNA---------------------KTPEKVEDLIC 55

Query: 2381 RMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLFQHNRET 2202
            RMM+ R WTTRLQNSIRNLVPSFDHELVYNVLH AKNSE+ALQFFRWVERSGLF+H+RET
Sbjct: 56   RMMSTRVWTTRLQNSIRNLVPSFDHELVYNVLHNAKNSEHALQFFRWVERSGLFRHDRET 115

Query: 2201 HLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESVKLFQKME 2022
            H KII+ILGR+ KLNHARCILLDMP KGV+WDEDLWVL+IDSYGKAGIVQESVKLFQKME
Sbjct: 116  HFKIIQILGRSEKLNHARCILLDMPNKGVDWDEDLWVLMIDSYGKAGIVQESVKLFQKME 175

Query: 2021 ELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVE 1842
            ELGV RT+KSY+ LF VI RRGRYMMAKRYFNKM+SEGIEPTRHT+N++IWGFFLS K++
Sbjct: 176  ELGVERTVKSYNALFNVITRRGRYMMAKRYFNKMVSEGIEPTRHTYNLLIWGFFLSSKLD 235

Query: 1841 TANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLI 1662
            TA RFFE MKSREI+PDVVTYNT+INGYNR KK++EAEKY+VEMK RNI P VI+YTTLI
Sbjct: 236  TAIRFFEAMKSREIIPDVVTYNTMINGYNRAKKIEEAEKYFVEMKARNIAPNVISYTTLI 295

Query: 1661 KGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIA 1482
            KGY  VG+VDDAL+L E+MKSF IKPNAITYSTLLP +C+  KMSEAR ILKEM EKYIA
Sbjct: 296  KGYSLVGRVDDALRLFEEMKSFGIKPNAITYSTLLPSLCEGRKMSEARMILKEMEEKYIA 355

Query: 1481 PKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVX 1302
            PKDN++F RLISGQC+AGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAG YD AV 
Sbjct: 356  PKDNSVFIRLISGQCEAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGVYDRAVK 415

Query: 1301 XXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFN 1122
                       L   S+L MEPSAYN II+YLC+NGQT+KAETL RQLMK+G+QDP+AFN
Sbjct: 416  LLDKLIEKEIIL---SSLPMEPSAYNLIIDYLCNNGQTSKAETLFRQLMKIGIQDPVAFN 472

Query: 1121 NLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDSMIEN 942
            NL+CGH++EG PD A ELLKIM RR VLS+ IA+++L+ESYLKK EPADAKT LD+M+E 
Sbjct: 473  NLVCGHSREGAPDSAFELLKIMGRRKVLSDGIAHKALVESYLKKGEPADAKTALDNMLEQ 532

Query: 941  GHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEA 762
            GH PDS LYRSVMESL+ DGRVQTASRVMK ML+KGVK H DLI+ +LEALLMRGHVEEA
Sbjct: 533  GHEPDSLLYRSVMESLMGDGRVQTASRVMKIMLDKGVKQHMDLISTMLEALLMRGHVEEA 592

Query: 761  IGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLDALLA 582
            +GRIELL+ +GL+PD D+LLS LC+KGKT +ALKLLDFGLERDC IDFSSY KVLD+L+A
Sbjct: 593  LGRIELLLHSGLSPDIDALLSALCDKGKTAAALKLLDFGLERDCTIDFSSYDKVLDSLVA 652

Query: 581  AGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMIL 432
            AGKTLNAYS+LCK+MEKGG+ D  SC++LI++LN+EGNTKQADIL RMI+
Sbjct: 653  AGKTLNAYSMLCKMMEKGGVKDHKSCEELIKSLNQEGNTKQADILKRMII 702


>ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform X1 [Solanum tuberosum]
          Length = 731

 Score = 1040 bits (2690), Expect = 0.0
 Identities = 519/672 (77%), Positives = 587/672 (87%)
 Frame = -3

Query: 2447 NPSRNAGGKRQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNS 2268
            N  R   G   K  EK+ED+I RMM+ RAWTTRLQNSIRN+VPSFDHELVYNVLH AKNS
Sbjct: 41   NHDRIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAKNS 100

Query: 2267 EYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVL 2088
            E+ALQFFRWVERSGLF+H+RETH KII+ILGRA KLNHARCILLDMP KGV+WDEDLWVL
Sbjct: 101  EHALQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLWVL 160

Query: 2087 LIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEG 1908
            +IDSYGKAGIVQESVKLFQKMEELGV RT+KSY+ LF VI RRGRYMMAKRYFNKM+++G
Sbjct: 161  MIDSYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNKMVNQG 220

Query: 1907 IEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAE 1728
            IEPT HT+N++IWGFFLS KV+TA RFFEDMKS+ I+PDVVTYNT+INGY RVKK++EAE
Sbjct: 221  IEPTGHTYNLLIWGFFLSSKVDTAIRFFEDMKSKGIMPDVVTYNTMINGYIRVKKIEEAE 280

Query: 1727 KYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGM 1548
            KY+VEMK RNIEPTVI+YTTLIKGY +V ++DDA++L E+MKSF IKPNAITYSTLLPG+
Sbjct: 281  KYFVEMKARNIEPTVISYTTLIKGYSAVERIDDAVRLFEEMKSFGIKPNAITYSTLLPGL 340

Query: 1547 CDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTE 1368
            CDA+KMSEA  ILKEM +KYIAPKDN+IF RLISGQC+AGDLDAA DVLK MIRLSVPTE
Sbjct: 341  CDAQKMSEAGAILKEMEDKYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTMIRLSVPTE 400

Query: 1367 AGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQT 1188
            AGHYG+LIENFCKAG YD AV            LRPQS+  MEPSAYN II+YLC+NGQT
Sbjct: 401  AGHYGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMEPSAYNLIIDYLCNNGQT 460

Query: 1187 AKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLI 1008
             KAET  RQLMK GVQDPIAFNNL+CGH++EG PD A ELLKIM RR VLS+ IA++SL+
Sbjct: 461  GKAETFFRQLMKTGVQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSDGIAHKSLV 520

Query: 1007 ESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVK 828
            ESYLKK EPADAK  LD+M+E+GH PDS LYRSVMESL+ DGRVQTASRVMK MLEKGVK
Sbjct: 521  ESYLKKREPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMKIMLEKGVK 580

Query: 827  DHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDF 648
            +H DLI+ ILEALLMRGHVEEA+GRIELL+ N L+PD D LLS LCEKGKT +ALKLLDF
Sbjct: 581  EHMDLISTILEALLMRGHVEEALGRIELLLHNSLSPDLDGLLSVLCEKGKTSAALKLLDF 640

Query: 647  GLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGN 468
             LER+CNIDFSSY KVLD+LLAAGKTLNAYS+LCK+ME GG+ D  SC++LI++LN+EGN
Sbjct: 641  ILERNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKDHKSCEELIKSLNDEGN 700

Query: 467  TKQADILSRMIL 432
            TKQADIL RMIL
Sbjct: 701  TKQADILRRMIL 712


>ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Solanum lycopersicum]
          Length = 731

 Score = 1038 bits (2683), Expect = 0.0
 Identities = 517/672 (76%), Positives = 584/672 (86%)
 Frame = -3

Query: 2447 NPSRNAGGKRQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNS 2268
            N  R   G   K  EK+ED+I RMM+ RAWTTRLQNSIRN+VPSFDHELVYNVLH AKNS
Sbjct: 41   NHERIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAKNS 100

Query: 2267 EYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVL 2088
            E+ALQFFRWVERSGLF+H+RETH KII+ILGRA KLNHARCILLDMP KGV+WDEDLWVL
Sbjct: 101  EHALQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLWVL 160

Query: 2087 LIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEG 1908
            +IDSYGKAGIVQESVKLFQKMEELGV RT+KSY+ LF VI RRGRYMMAKRYFN+M+++G
Sbjct: 161  MIDSYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNRMVNQG 220

Query: 1907 IEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAE 1728
            IEPT HT+N++IWGFFLS KV+TA RFFEDMK + I+PDVVTYNT+INGYN VKK++EAE
Sbjct: 221  IEPTGHTYNLLIWGFFLSSKVDTAIRFFEDMKGKGIMPDVVTYNTMINGYNCVKKIEEAE 280

Query: 1727 KYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGM 1548
            KY+VEMK RNIEP VI+YTTLIKGY +V ++DDALKL E+MKSF IKPNAITYSTLLPG+
Sbjct: 281  KYFVEMKARNIEPNVISYTTLIKGYSAVERIDDALKLFEEMKSFGIKPNAITYSTLLPGL 340

Query: 1547 CDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTE 1368
            CDA+KMSEA  ILKEM E+YIAPKDN+IF RLISGQC+AGDLDAA DVLK MIRLSVPTE
Sbjct: 341  CDAQKMSEAGTILKEMEERYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTMIRLSVPTE 400

Query: 1367 AGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQT 1188
            AGHYG+LIENFCKAG YD AV            LRPQS+  ME SAYN II+YLC+NGQT
Sbjct: 401  AGHYGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMETSAYNLIIDYLCNNGQT 460

Query: 1187 AKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLI 1008
             KAETL RQLMK G+QDPIAFNNL+CGH++EG PD A ELLKIM RR VLS+ IA++SL+
Sbjct: 461  GKAETLFRQLMKTGIQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSDSIAHKSLV 520

Query: 1007 ESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVK 828
            ESYLKK EPADAK  LD+M+E+GH PDS LYRSVMESL+ DGRVQTASRVMK MLEKGVK
Sbjct: 521  ESYLKKGEPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMKIMLEKGVK 580

Query: 827  DHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDF 648
            +H DLI+ ILEALLMRGHVEEA GRIELL+ N L+PD D LLS LCEKGKT +ALKLLDF
Sbjct: 581  EHMDLISTILEALLMRGHVEEAFGRIELLLHNSLSPDLDGLLSVLCEKGKTTAALKLLDF 640

Query: 647  GLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGN 468
             LER+CNIDFSSY KVLD+LLAAGKTLNAYS+LCK+ME GG+ D  SC++LI++LN+EGN
Sbjct: 641  ILERNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKDHKSCEELIKSLNDEGN 700

Query: 467  TKQADILSRMIL 432
            TKQADIL RMIL
Sbjct: 701  TKQADILRRMIL 712


>ref|XP_010111755.1| hypothetical protein L484_008414 [Morus notabilis]
            gi|587945196|gb|EXC31617.1| hypothetical protein
            L484_008414 [Morus notabilis]
          Length = 768

 Score = 1033 bits (2671), Expect = 0.0
 Identities = 516/704 (73%), Positives = 597/704 (84%)
 Frame = -3

Query: 2546 SPVDSPNPESNPVEHLIPESSCADSSATVTAAANPSRNAGGKRQKNPEKIEDIINRMMAN 2367
            SP   PNP+  P E   P  S  +++A         R   GK  +NPEKIEDII RMMAN
Sbjct: 53   SPDPVPNPDCPPSESPNPPKSRPENTAI-------QRTPRGK-SRNPEKIEDIICRMMAN 104

Query: 2366 RAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLFQHNRETHLKII 2187
            RAWTTRLQNSIR LVP FDH LV+NVLHGA+NS++ALQFFRWVERSGLF H+RETHLKII
Sbjct: 105  RAWTTRLQNSIRRLVPQFDHSLVWNVLHGARNSDHALQFFRWVERSGLFNHDRETHLKII 164

Query: 2186 EILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESVKLFQKMEELGVG 2007
            EIL RASKLNHARCILLDMPKK V+WDEDL+VL ID YGKAGIVQESV++F KM+ELGV 
Sbjct: 165  EILTRASKLNHARCILLDMPKKSVQWDEDLFVLFIDGYGKAGIVQESVRMFNKMKELGVE 224

Query: 2006 RTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRF 1827
            R++KSYD LF+VI+RRGRYMMAKRYFN M++EGIEPT+HT+N+M+WGFFLS ++ETA RF
Sbjct: 225  RSVKSYDALFKVILRRGRYMMAKRYFNAMINEGIEPTKHTYNIMLWGFFLSLRLETAKRF 284

Query: 1826 FEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLS 1647
            +EDMK+R + PDVVTYNT+INGYNR K MDEAEK +VEMKGRNI PTVI+YTT+IKGY+S
Sbjct: 285  YEDMKNRGVWPDVVTYNTMINGYNRFKMMDEAEKMFVEMKGRNIAPTVISYTTMIKGYVS 344

Query: 1646 VGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNA 1467
            +G+VDD L+L E+MKSF IKPNA+TY+TLLPG+CDAEKMSEAR +LKEMV++YIAPKDN+
Sbjct: 345  IGRVDDGLRLFEEMKSFGIKPNAVTYTTLLPGLCDAEKMSEARTMLKEMVDRYIAPKDNS 404

Query: 1466 IFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXX 1287
            IF RL+S QCK GDLDAA DVLKAMIRLS+PTEAGHYGILIENFCKA  YD AV      
Sbjct: 405  IFLRLLSSQCKVGDLDAAADVLKAMIRLSIPTEAGHYGILIENFCKAAVYDRAVKLLDKL 464

Query: 1286 XXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICG 1107
                  LRPQS+  ME SAYN +I++LC++GQT KAE   RQLMK GVQDP+AFNNLI G
Sbjct: 465  IEKEIVLRPQSSTEMEASAYNAMIQFLCNHGQTGKAEIFFRQLMKKGVQDPVAFNNLIRG 524

Query: 1106 HAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDSMIENGHLPD 927
            H+KEGNPD A E+LKIM RR V  +  +Y  LI+SYL K EPADAKT LDSMIEN HLP+
Sbjct: 525  HSKEGNPDSAFEILKIMGRRGVARDADSYRLLIKSYLSKGEPADAKTALDSMIENDHLPE 584

Query: 926  SSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIE 747
            SSL+RSVMESL EDGR QTASRVMK+M+EKGVK++ DL+AKILEALL+RGHVEEA+GRI+
Sbjct: 585  SSLFRSVMESLYEDGRAQTASRVMKSMIEKGVKENMDLVAKILEALLVRGHVEEALGRID 644

Query: 746  LLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTL 567
            LLM +G AP+FDSLLS LCEKGKTI+ALKLLDF LERD  +DFSSY KVLDALLAAGKTL
Sbjct: 645  LLMQSGCAPNFDSLLSVLCEKGKTIAALKLLDFCLERDYVVDFSSYDKVLDALLAAGKTL 704

Query: 566  NAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            NAYS+LCKIM KGG+TDWS C+DLI++LN+EGNTKQADI+SRMI
Sbjct: 705  NAYSILCKIMGKGGVTDWSGCEDLIKSLNKEGNTKQADIISRMI 748


>ref|XP_009782844.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Nicotiana sylvestris] gi|698423785|ref|XP_009782850.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g37230 [Nicotiana sylvestris]
            gi|698423808|ref|XP_009782863.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g37230
            [Nicotiana sylvestris]
          Length = 722

 Score = 1031 bits (2666), Expect = 0.0
 Identities = 521/710 (73%), Positives = 594/710 (83%)
 Frame = -3

Query: 2561 FFYSSSPVDSPNPESNPVEHLIPESSCADSSATVTAAANPSRNAGGKRQKNPEKIEDIIN 2382
            FFYSS  +++P+P                 S  +    NP         K PEK+ED+I 
Sbjct: 22   FFYSSESLNNPDP-----------------STRIPTTHNP---------KTPEKVEDLIC 55

Query: 2381 RMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLFQHNRET 2202
            RMM+ R WTTRLQNSIRNLVPSFDHELVYNVLH AKNSE ALQFFRWVERSGLF+H+RET
Sbjct: 56   RMMSTRVWTTRLQNSIRNLVPSFDHELVYNVLHNAKNSEQALQFFRWVERSGLFRHDRET 115

Query: 2201 HLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESVKLFQKME 2022
            H KII+ILGR+ KLNHARCILLDMP KGV+WDEDLWVL+IDSYGKAGIVQESVKLFQKME
Sbjct: 116  HFKIIQILGRSEKLNHARCILLDMPNKGVDWDEDLWVLMIDSYGKAGIVQESVKLFQKME 175

Query: 2021 ELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVE 1842
            ELGV RTIKSY+ LF VI RRGRYMMAKRYFNKM++EGIEPT HT+N++IWGFFLS K +
Sbjct: 176  ELGVERTIKSYNALFNVITRRGRYMMAKRYFNKMVNEGIEPTTHTYNLLIWGFFLSSKPD 235

Query: 1841 TANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLI 1662
            TA RFFE MKSREI PDVVTYNT+INGYNR KK++EAEKY+VEMK RNIEP VI+YTTLI
Sbjct: 236  TAIRFFEAMKSREISPDVVTYNTMINGYNRAKKIEEAEKYFVEMKARNIEPNVISYTTLI 295

Query: 1661 KGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIA 1482
            KGY  V +VDDAL+L E+MKSF IKPNAITYSTLLP +C+ +KMSEAR ILKEM EKYIA
Sbjct: 296  KGYSLVVRVDDALRLFEEMKSFGIKPNAITYSTLLPSLCEGQKMSEARMILKEMEEKYIA 355

Query: 1481 PKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVX 1302
            PKDN++F RLISGQC+AGDLDAA DVLKAMIRLSVPTEAGHYG+LIENFCKAG  D AV 
Sbjct: 356  PKDNSVFIRLISGQCEAGDLDAAADVLKAMIRLSVPTEAGHYGVLIENFCKAGVCDRAVK 415

Query: 1301 XXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFN 1122
                       L   S+L ME SAYN II+YLC+NGQT+KAETL RQLMK+G+QDP+AFN
Sbjct: 416  LLDKLIEKEIIL--SSSLPMEQSAYNLIIDYLCNNGQTSKAETLFRQLMKIGIQDPVAFN 473

Query: 1121 NLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDSMIEN 942
            NL+CGH++EG PD A E+LKIM RR VLS+ IA++SL+ESYLKK EPADAK  LD+M+E 
Sbjct: 474  NLVCGHSREGAPDSAFEILKIMGRRKVLSDGIAHKSLVESYLKKGEPADAKAALDNMLEQ 533

Query: 941  GHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEA 762
            GH PDS LYRSVMESL+ DGRVQTASRVMK MLEKGVK+H DLI+ ILEALLMRGHVEEA
Sbjct: 534  GHEPDSLLYRSVMESLMGDGRVQTASRVMKIMLEKGVKEHMDLISTILEALLMRGHVEEA 593

Query: 761  IGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLDALLA 582
            +GRIELL+ +GL+PD D LLS LC+KGKT +ALKLLDFGLERDC IDFSSY KVLD+L+A
Sbjct: 594  LGRIELLLHSGLSPDIDGLLSALCDKGKTAAALKLLDFGLERDCTIDFSSYDKVLDSLVA 653

Query: 581  AGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMIL 432
            AGKTLNAYS+LCK+MEKGG+ D  SC++LI++LN+EGNTKQADIL RMI+
Sbjct: 654  AGKTLNAYSMLCKMMEKGGVKDHKSCEELIKSLNQEGNTKQADILRRMII 703


>ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao] gi|508712488|gb|EOY04385.1| Tetratricopeptide
            repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 743

 Score = 1023 bits (2644), Expect = 0.0
 Identities = 514/708 (72%), Positives = 599/708 (84%)
 Frame = -3

Query: 2558 FYSSSPVDSPNPESNPVEHLIPESSCADSSATVTAAANPSRNAGGKRQKNPEKIEDIINR 2379
            F+++S    P+  S  + +  P+    +    VT   +P       + +NPEK+ED+I R
Sbjct: 26   FFTTS--QDPSTASQELNNAPPQQ---EGEKVVTQRTSPRG-----KTRNPEKVEDVICR 75

Query: 2378 MMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLFQHNRETH 2199
            MM NRAWTTRLQNSIR LVP FDH LVYNVLHGAKNSE ALQFFRWVER+GL +H+RE H
Sbjct: 76   MMENRAWTTRLQNSIRALVPEFDHALVYNVLHGAKNSEQALQFFRWVERAGLIRHDREAH 135

Query: 2198 LKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESVKLFQKMEE 2019
            +KII+ILGRASKLNHARCILLDMPKKGVEWDEDL+V+LIDSYGKAGIVQE+VK+FQKM E
Sbjct: 136  MKIIQILGRASKLNHARCILLDMPKKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMNE 195

Query: 2018 LGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVET 1839
            LGV RTIKSYD  F+VI+RRGRYMMAKRYFNKMLSEGI PTRHT+N+M+WGFFLS +++T
Sbjct: 196  LGVERTIKSYDAFFKVILRRGRYMMAKRYFNKMLSEGIVPTRHTYNIMLWGFFLSLRLDT 255

Query: 1838 ANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIK 1659
            ANRF+EDMK+R I PDVVTYNT+INGY+R KKM+EAEK +VEMKG+N+ PTVI+YTT+IK
Sbjct: 256  ANRFYEDMKTRGISPDVVTYNTMINGYSRFKKMEEAEKLFVEMKGKNLAPTVISYTTMIK 315

Query: 1658 GYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIAP 1479
            GY++V QVDD L+LLE+MKSF IKPNA TYSTLLPG+CDA KM+EA+ ILKEMVE YIAP
Sbjct: 316  GYVAVEQVDDGLRLLEEMKSFGIKPNATTYSTLLPGLCDAGKMTEAKSILKEMVEWYIAP 375

Query: 1478 KDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVXX 1299
            KDN+IF  L++ QCK+GDLDAA DVLKAMIRLS+PTEAGHYG+LIENFCKA  +D A+  
Sbjct: 376  KDNSIFINLLNSQCKSGDLDAAADVLKAMIRLSIPTEAGHYGVLIENFCKANLFDRAIKL 435

Query: 1298 XXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNN 1119
                      LRPQ++L ME SAYN +I+YLC +GQT KAE   RQLMK GV DP AFNN
Sbjct: 436  LDKLVEKEIILRPQNSLDMEASAYNAMIQYLCHHGQTGKAEVFFRQLMKKGVLDPTAFNN 495

Query: 1118 LICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDSMIENG 939
            LI GHAKEGNP  A E+LKIM RR V  +  AY+ LIESYL+K EPADAKT+LDSMIE+G
Sbjct: 496  LIRGHAKEGNPGLAFEILKIMGRRGVPKDADAYKLLIESYLRKGEPADAKTSLDSMIEDG 555

Query: 938  HLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEAI 759
             LP+S +++SVMESL EDGR+QTASRVMK+M+EKGVK+H DL+AKILEALLMRGHVEEA+
Sbjct: 556  LLPESGIFKSVMESLFEDGRIQTASRVMKSMVEKGVKEHMDLVAKILEALLMRGHVEEAL 615

Query: 758  GRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLDALLAA 579
            GRIELLM NG AP+ DSLLS L EKGKTI+ALKLLDFGLERDC+IDFSSY KVLDALLAA
Sbjct: 616  GRIELLMQNGCAPNLDSLLSVLSEKGKTIAALKLLDFGLERDCSIDFSSYEKVLDALLAA 675

Query: 578  GKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            GKTLNAYS+LCKIMEKGGIT+WSS +DLI++LN+EGNTKQADILSRMI
Sbjct: 676  GKTLNAYSILCKIMEKGGITNWSSLEDLIKSLNQEGNTKQADILSRMI 723


>ref|XP_010061750.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Eucalyptus grandis] gi|629103276|gb|KCW68745.1|
            hypothetical protein EUGRSUZ_F02345 [Eucalyptus grandis]
          Length = 764

 Score = 1010 bits (2612), Expect = 0.0
 Identities = 496/713 (69%), Positives = 598/713 (83%), Gaps = 5/713 (0%)
 Frame = -3

Query: 2558 FYSSSPVDSPNPESNPVEHLIPES-----SCADSSATVTAAANPSRNAGGKRQKNPEKIE 2394
            F SS+    P  +  P E+  P S     + +  S+   A   P + +   + +NPEK+E
Sbjct: 32   FCSSTEQPIPGVDHKPSENASPHSQPEPTTQSGPSSAAEARERPRQRSPRGKARNPEKVE 91

Query: 2393 DIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLFQH 2214
            DII RMMANRAWTTRLQNSIR LVP FDH LVYNVLHGA+NSE+ALQFFRWVER+GLF+H
Sbjct: 92   DIICRMMANRAWTTRLQNSIRALVPEFDHSLVYNVLHGARNSEHALQFFRWVERAGLFRH 151

Query: 2213 NRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESVKLF 2034
            +RETHLKIIE LGRASKLNHARCILLDMPKKGVEWDEDL++++I+SYGKAGIVQE+VK+F
Sbjct: 152  DRETHLKIIETLGRASKLNHARCILLDMPKKGVEWDEDLFIVMIESYGKAGIVQEAVKMF 211

Query: 2033 QKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLS 1854
             KM+ELGV RT+ SYD +F+VI+R GRYMMAKR FN ML+EGIEP RHT+N+MIWGFFLS
Sbjct: 212  MKMKELGVSRTVNSYDAVFKVILRCGRYMMAKRLFNAMLNEGIEPARHTYNIMIWGFFLS 271

Query: 1853 GKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITY 1674
             ++ TA RFFEDM SR I PDVVTYNT+INGY R KKMDEAEK +VEMKG+NI PTVI+Y
Sbjct: 272  MRLRTALRFFEDMSSRGISPDVVTYNTMINGYYRFKKMDEAEKLFVEMKGKNIAPTVISY 331

Query: 1673 TTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVE 1494
            TT+IKGY+S+G+VDD L+LL++MKS+ IKPN +TYSTLLPG+C+AEKM+EAR ILKE+VE
Sbjct: 332  TTMIKGYVSLGRVDDGLRLLDEMKSYGIKPNDVTYSTLLPGLCEAEKMAEARSILKEIVE 391

Query: 1493 KYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYD 1314
            +Y+APKDN+IF RL++ QC +GD+DAA DVLKAMIRLS+PTEAGHYG+LIENFCK   YD
Sbjct: 392  RYMAPKDNSIFLRLLTCQCTSGDMDAAVDVLKAMIRLSIPTEAGHYGVLIENFCKNNAYD 451

Query: 1313 SAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDP 1134
             A+            LRPQ+TL M P AYNP+I+YLC++GQT KAE   RQL+K GVQD 
Sbjct: 452  RAIKLLDKLIEKEIILRPQNTLEMGPEAYNPMIQYLCNHGQTGKAEIFFRQLLKKGVQDS 511

Query: 1133 IAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDS 954
            +AFN++ICGH+KEGNP+ A E+LKIM RR V  +  +Y+ LIESYL+K EPADAKT LD+
Sbjct: 512  VAFNSIICGHSKEGNPNAAFEILKIMDRRGVPRDAHSYKLLIESYLRKGEPADAKTALDN 571

Query: 953  MIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGH 774
            MIE+G++PDSS+YRSVM+SL EDGRVQTASR MK+M+EKGV ++ DL+AKILEALLMRGH
Sbjct: 572  MIESGYVPDSSVYRSVMQSLFEDGRVQTASRAMKSMVEKGVHENMDLVAKILEALLMRGH 631

Query: 773  VEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLD 594
            VEEAIGR++LLM +G +PDFD+LLS L EKGKTI+ALKLLDF L+RDC IDFSSY KVLD
Sbjct: 632  VEEAIGRMDLLMQSGCSPDFDNLLSVLSEKGKTIAALKLLDFALDRDCTIDFSSYDKVLD 691

Query: 593  ALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            ALL +GKTLNAYS+LCKIMEKGG++DW SC DLI++LN+EG TKQAD+LSRMI
Sbjct: 692  ALLGSGKTLNAYSILCKIMEKGGVSDWRSCGDLIKSLNQEGYTKQADVLSRMI 744


>ref|XP_012481297.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Gossypium raimondii] gi|763760358|gb|KJB27612.1|
            hypothetical protein B456_005G002200 [Gossypium
            raimondii]
          Length = 739

 Score = 1006 bits (2600), Expect = 0.0
 Identities = 498/671 (74%), Positives = 579/671 (86%)
 Frame = -3

Query: 2447 NPSRNAGGKRQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNS 2268
            N  R   GK  +NPEK+EDII RMM NRAWTTRLQNSIR LVP FDH LVYNVLHGAKNS
Sbjct: 50   NNRRTPRGKT-RNPEKVEDIICRMMENRAWTTRLQNSIRALVPEFDHALVYNVLHGAKNS 108

Query: 2267 EYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVL 2088
            ++ALQFFRWVER+GL  H+RE HLKII+ILGRASKLNHARCILLDMPKKGVEWDEDL+V+
Sbjct: 109  DHALQFFRWVERAGLIHHDREAHLKIIQILGRASKLNHARCILLDMPKKGVEWDEDLFVV 168

Query: 2087 LIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEG 1908
            LIDSYGKAGIVQE+VK+FQKMEELGV RTIKSYD  F+VI+RRGRYMMAKRYFNKMLSEG
Sbjct: 169  LIDSYGKAGIVQEAVKIFQKMEELGVDRTIKSYDAFFKVILRRGRYMMAKRYFNKMLSEG 228

Query: 1907 IEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAE 1728
            I+PTRHT+N+M+WGFFLS +++TANRF+EDMK+R I PD VTYNT+INGY R K+M+EAE
Sbjct: 229  IQPTRHTYNIMLWGFFLSLRLDTANRFYEDMKTRGISPDAVTYNTMINGYTRFKRMEEAE 288

Query: 1727 KYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGM 1548
            K +VEMK +N+ PTVI+YTT+IKGY++V QVDD L+L E+MKS  IKPNA TYSTLLPG+
Sbjct: 289  KLFVEMKAKNLAPTVISYTTMIKGYVAVEQVDDGLRLFEEMKSSGIKPNATTYSTLLPGL 348

Query: 1547 CDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTE 1368
            CDA K +EA+ ILKEMVE+Y APKDN+IF +L++ QCK+GDL+AA DVLKAMIRLS+PTE
Sbjct: 349  CDAGKTTEAKTILKEMVERYTAPKDNSIFIKLLNSQCKSGDLNAAADVLKAMIRLSIPTE 408

Query: 1367 AGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQT 1188
            AGHYG+LIENFCKA ++D A+            LRP+++L +E +AYNP+I+YLC +GQT
Sbjct: 409  AGHYGVLIENFCKANEFDRAIKLLDKLVEKEIVLRPENSLDIEANAYNPLIQYLCHHGQT 468

Query: 1187 AKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLI 1008
             KAE   RQLMK GV DP AFNNLI GHAKEGNP    E+LKIM RR V  +  AY+ LI
Sbjct: 469  GKAEVFFRQLMKKGVLDPTAFNNLIRGHAKEGNPGLGFEILKIMGRRGVPKDADAYKLLI 528

Query: 1007 ESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVK 828
            ESYL+K EPADAKT LDSMIE+G LPDS +++SVMESL EDGR+QTASRVMK+M+EKGVK
Sbjct: 529  ESYLRKGEPADAKTALDSMIEDGLLPDSGIFKSVMESLFEDGRIQTASRVMKSMVEKGVK 588

Query: 827  DHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDF 648
            +H DL++KILEALLMRGHVEEA+GRIELLM NG A + DSLLS L EKGKTI+ALKLLDF
Sbjct: 589  EHMDLVSKILEALLMRGHVEEALGRIELLMQNGCATNLDSLLSILSEKGKTIAALKLLDF 648

Query: 647  GLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGN 468
            GLERDC+ID SSY KVLDALL AGKTLNAYS+LCKIMEKGGIT+WSS +DLI++LN+EGN
Sbjct: 649  GLERDCSIDVSSYEKVLDALLTAGKTLNAYSILCKIMEKGGITNWSSLEDLIKSLNQEGN 708

Query: 467  TKQADILSRMI 435
            TKQADILSRMI
Sbjct: 709  TKQADILSRMI 719


>ref|XP_012082370.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Jatropha curcas] gi|643717679|gb|KDP29122.1|
            hypothetical protein JCGZ_16511 [Jatropha curcas]
          Length = 760

 Score =  999 bits (2584), Expect = 0.0
 Identities = 497/664 (74%), Positives = 580/664 (87%)
 Frame = -3

Query: 2426 GKRQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFF 2247
            GKR + PEK+EDII +MMA+R WTTRLQNSIR+LVP FDH LVYNVLHGA+N E+ALQFF
Sbjct: 78   GKRPE-PEKLEDIICKMMASRPWTTRLQNSIRDLVPEFDHSLVYNVLHGARNYEHALQFF 136

Query: 2246 RWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGK 2067
            RWVER+GLF+H+RETH+KIIEILGRASKLNHARCILLDMPKKGVEWDED++V+LI+SYGK
Sbjct: 137  RWVERAGLFRHDRETHMKIIEILGRASKLNHARCILLDMPKKGVEWDEDMFVVLIESYGK 196

Query: 2066 AGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHT 1887
            AGIVQE+VK+FQKM ELGVGR+IKSYD +F+VI+RRGRYMMAKR+FNKMLSEGIEPTRHT
Sbjct: 197  AGIVQEAVKIFQKMNELGVGRSIKSYDAVFKVILRRGRYMMAKRFFNKMLSEGIEPTRHT 256

Query: 1886 FNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMK 1707
            +N+M+WGFFLS ++ETA RF+EDMKSR I PDVVTYNT+INGY R KKMD+AEK +VEMK
Sbjct: 257  YNIMLWGFFLSLRLETAMRFYEDMKSRGISPDVVTYNTMINGYYRFKKMDDAEKLFVEMK 316

Query: 1706 GRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMS 1527
            G NI PTVI+YTT+IKGY +V +VDD L+LLE+MK F I+PNA TYSTLLP +CDA KM+
Sbjct: 317  GSNIAPTVISYTTMIKGYFAVDRVDDGLRLLEEMKEFGIQPNAYTYSTLLPALCDAGKMT 376

Query: 1526 EARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGIL 1347
            EA+DILKEMV +++APKDNAIF +L+S QCKAGDL AA DVLKAMIRLS+PTEAGHYG+L
Sbjct: 377  EAKDILKEMVGRHLAPKDNAIFMKLLSSQCKAGDLRAAEDVLKAMIRLSIPTEAGHYGVL 436

Query: 1346 IENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLL 1167
            IENFCKA +YD AV            LRPQSTL +E +AYNP+I+YLC +GQT KAE   
Sbjct: 437  IENFCKAEEYDLAVKFLDKLIEKEIILRPQSTLEIESNAYNPMIQYLCSHGQTGKAEIFF 496

Query: 1166 RQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKS 987
            RQLMK GVQDP AFNNLI GHAKEG+PD A E+LKIM RR V  +  AY  LIESYL+K 
Sbjct: 497  RQLMKKGVQDPDAFNNLIRGHAKEGSPDSAFEILKIMGRRGVPRDADAYRLLIESYLRKG 556

Query: 986  EPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIA 807
            EPADAKT LD MIE+GH+PDSS++RSVM+SL +DGRVQTASRVMK+M+EKGVK++ DL A
Sbjct: 557  EPADAKTALDGMIEDGHVPDSSVFRSVMQSLFDDGRVQTASRVMKSMIEKGVKENIDLTA 616

Query: 806  KILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCN 627
            KILEALLMRGHVEEA+GRIELLM +G + +FD+LLS L EK KTI+A+KLLDF LERD N
Sbjct: 617  KILEALLMRGHVEEALGRIELLMHSGCSVNFDALLSVLSEKSKTIAAVKLLDFALERDFN 676

Query: 626  IDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADIL 447
            +DF SY KVLD+LLAAGKTLNAYS+LCKI+EKGG TDWSS  +LI++LN+EGNTKQADIL
Sbjct: 677  VDFKSYDKVLDSLLAAGKTLNAYSILCKILEKGGATDWSSSDNLIKSLNQEGNTKQADIL 736

Query: 446  SRMI 435
            SRMI
Sbjct: 737  SRMI 740


>ref|XP_002530985.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529437|gb|EEF31397.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 753

 Score =  998 bits (2579), Expect = 0.0
 Identities = 501/735 (68%), Positives = 600/735 (81%)
 Frame = -3

Query: 2639 MAYLAASRQPHXXXXXXXXXXXXSIFFFYSSSPVDSPNPESNPVEHLIPESSCADSSATV 2460
            MAYL+ S+ P+              F   +  P+ S    SNP    + +++ A ++   
Sbjct: 1    MAYLSLSK-PYKSRVYHTIPRLSLHFCTLTQDPIPSVTQISNPQSETLNDAAAAAAATQE 59

Query: 2459 TAAANPSRNAGGKRQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHG 2280
                   R   GKR  +PEK+ED I+RMMANR WTTRLQNSIRNLVP FDH LVYNVLH 
Sbjct: 60   NQTQTYQRIPRGKRP-DPEKVEDTISRMMANRPWTTRLQNSIRNLVPHFDHSLVYNVLHA 118

Query: 2279 AKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDED 2100
            A+NSE+ALQFFRWVER+GLF+++R+TH+KIIEILGRASKLNHARCILLDMPKKGVEWDE 
Sbjct: 119  ARNSEHALQFFRWVERAGLFKNDRDTHMKIIEILGRASKLNHARCILLDMPKKGVEWDEY 178

Query: 2099 LWVLLIDSYGKAGIVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKM 1920
            ++V+LI+SYGKAGIVQE+VK+F KM ELGV R+IKSYD LF+VI+RRGRYMMAKR FNKM
Sbjct: 179  MFVVLIESYGKAGIVQEAVKIFNKMNELGVERSIKSYDALFKVILRRGRYMMAKRVFNKM 238

Query: 1919 LSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKM 1740
            L++GI+PTRHT+N+M+WGFFLS ++ETA RF++DMK+R I PDVVTYNT+ING+ R KKM
Sbjct: 239  LNDGIQPTRHTYNIMLWGFFLSLRLETAMRFYDDMKNRGISPDVVTYNTMINGFYRFKKM 298

Query: 1739 DEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTL 1560
            +EAEK +VEMKG+NI PTVI+YTT+IKGY++V +VDD L+LLE+MKSF IKPN  TYSTL
Sbjct: 299  EEAEKLFVEMKGKNIAPTVISYTTMIKGYVAVDRVDDGLRLLEEMKSFNIKPNVHTYSTL 358

Query: 1559 LPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLS 1380
            LPG+CDA KM+EA+DIL EMV +++APKDN+IF RL+S QCKAGDL AA DVL  M+RL 
Sbjct: 359  LPGLCDAWKMTEAKDILIEMVARHLAPKDNSIFLRLLSCQCKAGDLRAAEDVLNTMMRLH 418

Query: 1379 VPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCD 1200
            +PTEAGHYG+LIENFCKA +YD AV            LRPQSTL +E +AYNP+I+YLC 
Sbjct: 419  IPTEAGHYGVLIENFCKAEEYDRAVKYLDKLIEKEIILRPQSTLEIESNAYNPMIQYLCS 478

Query: 1199 NGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAY 1020
            +GQT KAE   RQLMK GVQDP+AFNNLICGHAKEG PD A E+ KIM +R V  +  AY
Sbjct: 479  HGQTGKAEIFFRQLMKKGVQDPLAFNNLICGHAKEGYPDSAFEIFKIMGKRGVPRDADAY 538

Query: 1019 ESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLE 840
              +IESYL+K EPADAKT LD M+E+GH+PD S++RSVMESL EDGRVQTASRVMK+M+E
Sbjct: 539  RLIIESYLRKGEPADAKTALDGMLEDGHVPDPSVFRSVMESLFEDGRVQTASRVMKSMVE 598

Query: 839  KGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALK 660
            KGVK++ DL+ KILEALLMRGHVEEA+GRIELLM +G   +FD LLS L EKGKTI+ALK
Sbjct: 599  KGVKENMDLVGKILEALLMRGHVEEALGRIELLMQSGFHVNFDDLLSVLSEKGKTIAALK 658

Query: 659  LLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLN 480
            LLDF LERD N+DF SY KVLDALLAAGKTLNAYS+LCKIM+KGG++DWSS KDLI++LN
Sbjct: 659  LLDFALERDFNLDFKSYDKVLDALLAAGKTLNAYSILCKIMQKGGVSDWSSSKDLIKSLN 718

Query: 479  EEGNTKQADILSRMI 435
            +EGNTKQADILSRMI
Sbjct: 719  QEGNTKQADILSRMI 733


>ref|XP_008378595.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Malus domestica]
          Length = 760

 Score =  995 bits (2573), Expect = 0.0
 Identities = 498/695 (71%), Positives = 583/695 (83%), Gaps = 9/695 (1%)
 Frame = -3

Query: 2492 ESSCADSSATVTAAANPSRNAGGK----RQKNPEKIEDIINRMMANRAWTTRLQNSIRNL 2325
            E+  A  +A  T AA+P++    K    R +NPEK EDII RMMANRAWTTRLQNSIRNL
Sbjct: 46   EAVAATEAAAPTEAASPTQIHVPKPKQYRPRNPEKTEDIICRMMANRAWTTRLQNSIRNL 105

Query: 2324 VPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARC 2145
            VP FDH LV+NVLHGA+N E+ALQFFRWVERSG F+H+RETHLKIIEILGR  KLNHARC
Sbjct: 106  VPEFDHNLVWNVLHGARNWEHALQFFRWVERSGFFKHDRETHLKIIEILGRNLKLNHARC 165

Query: 2144 ILLDMPKKGVEWDEDLWVLLIDSYGKAG-----IVQESVKLFQKMEELGVGRTIKSYDTL 1980
            ILLDMPKKG + DEDL++ LID YGKAG     I+QESVKLF  M+ELGV R++KSY+ L
Sbjct: 166  ILLDMPKKGEQLDEDLFLALIDGYGKAGYGKAGIIQESVKLFDSMKELGVQRSLKSYEAL 225

Query: 1979 FRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREI 1800
            F+ IMRRGRYMMAKRYFN MLSEGIEP RHT+NVMIWGFF+S ++ETA RF+EDMKSR I
Sbjct: 226  FKAIMRRGRYMMAKRYFNAMLSEGIEPNRHTYNVMIWGFFMSKRLETAKRFYEDMKSRGI 285

Query: 1799 LPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALK 1620
            LPD+VTYNT+I+GYNR K MDEAE+ +VE+KGRN+EP VI YTT+IKGY+ VG+VDDAL+
Sbjct: 286  LPDLVTYNTMIHGYNRFKMMDEAEQLFVELKGRNLEPNVICYTTMIKGYVDVGRVDDALR 345

Query: 1619 LLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQ 1440
            L ++MKSF IKPNA+T+STLLPG+C+AEK +EA ++LKEMV++YIAPKDN+IF +L+   
Sbjct: 346  LFQEMKSFGIKPNAVTFSTLLPGLCEAEKKNEAVNMLKEMVQRYIAPKDNSIFEKLLYLM 405

Query: 1439 CKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRP 1260
            CK+GDLDAA DVLKAMIRLS+PTE GHYGILIENFCKAG YD A+            LRP
Sbjct: 406  CKSGDLDAAADVLKAMIRLSIPTEPGHYGILIENFCKAGVYDRAIKLLDKLIEKEIILRP 465

Query: 1259 QSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDP 1080
            QS++ +E SAYNP+IEYLC++GQT KAE   RQLMK GVQD +AFNNL+CGHAKEGN D 
Sbjct: 466  QSSIELEASAYNPMIEYLCNHGQTEKAEVFFRQLMKKGVQDSVAFNNLMCGHAKEGNSDS 525

Query: 1079 AIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVME 900
            A E+L+IM RR V  E  +Y  LI SYL K EPADAKT LDSM+E GH+P++SL+ SVME
Sbjct: 526  AFEILRIMGRRGVPGEADSYRLLINSYLSKGEPADAKTALDSMLEGGHIPEASLFGSVME 585

Query: 899  SLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAP 720
            SL EDGRVQTASRVMK+M+EKGVK++ DL+AKILE L MRGHVEEA+GRI+LLM +G  P
Sbjct: 586  SLFEDGRVQTASRVMKSMVEKGVKENMDLVAKILETLFMRGHVEEALGRIDLLMQSGCTP 645

Query: 719  DFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKI 540
             FDSLLS L EKGKTI+ALKLLDF LERDCN+DFSSY KVLDALL AGKTLNAYS+LCKI
Sbjct: 646  QFDSLLSVLAEKGKTIAALKLLDFCLERDCNVDFSSYDKVLDALLEAGKTLNAYSILCKI 705

Query: 539  MEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            MEKGG++DWSS KDLI++LN+EGNTKQADILSRMI
Sbjct: 706  MEKGGLSDWSSTKDLIKSLNQEGNTKQADILSRMI 740


>ref|XP_009376212.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Pyrus x bretschneideri]
          Length = 757

 Score =  994 bits (2570), Expect = 0.0
 Identities = 494/691 (71%), Positives = 580/691 (83%), Gaps = 9/691 (1%)
 Frame = -3

Query: 2480 ADSSATVTAAANPSRNAGGK----RQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSF 2313
            A  +A  T AA+P++    K    R +NPEK EDII RMMANRAWTTRLQNSIRNLVP F
Sbjct: 47   ATEAAAPTEAASPTQTHVPKPKQHRPRNPEKTEDIICRMMANRAWTTRLQNSIRNLVPEF 106

Query: 2312 DHELVYNVLHGAKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLD 2133
            DH LV+NVLHG +N E+ALQFFRWVERSGLF H+RETHLKII++LGR  KLNHARCILLD
Sbjct: 107  DHNLVWNVLHGTRNWEHALQFFRWVERSGLFNHDRETHLKIIDVLGRNLKLNHARCILLD 166

Query: 2132 MPKKGVEWDEDLWVLLIDSYGKAG-----IVQESVKLFQKMEELGVGRTIKSYDTLFRVI 1968
            MPKKG + DEDL++ LID YGKAG     I+QESVKLF  M+ELGV R++KSY+ LF+ I
Sbjct: 167  MPKKGEQLDEDLFLALIDGYGKAGYGKAGIIQESVKLFDSMKELGVQRSLKSYEALFKAI 226

Query: 1967 MRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDV 1788
            MRRGRY MAKRYFN MLSEGIEP RHT+NVMIWGFFLS ++ETA RF+EDMK+R ILPD+
Sbjct: 227  MRRGRYTMAKRYFNAMLSEGIEPNRHTYNVMIWGFFLSKRLETAKRFYEDMKNRGILPDL 286

Query: 1787 VTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLED 1608
            VTYNT+I+GYNR K MDEAE+ +VE+KGRN+EP VI YTT+IKGY+ VG+VDDAL+L ++
Sbjct: 287  VTYNTMIHGYNRFKMMDEAEQLFVELKGRNLEPNVICYTTMIKGYVDVGKVDDALRLFQE 346

Query: 1607 MKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAG 1428
            MKSF IKPNA+T+STLLPG+C+AEK  EA ++LKEMV++YIAPKDN+IF +L+S  CK+G
Sbjct: 347  MKSFGIKPNAVTFSTLLPGLCEAEKKDEAVNMLKEMVQRYIAPKDNSIFEKLLSLMCKSG 406

Query: 1427 DLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTL 1248
            DLDAA DVLKAMIRLS+PTE GHYGILIENFCKAG YD A+            +RPQS++
Sbjct: 407  DLDAAADVLKAMIRLSIPTEPGHYGILIENFCKAGVYDRAIKLLDKLIEKEIIMRPQSSI 466

Query: 1247 HMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIEL 1068
             +E SAYNP+IEYLC++GQT KAE   RQLMK GVQD +AFNNL+CGHAKEGN D A E+
Sbjct: 467  ELEASAYNPMIEYLCNHGQTEKAEVFFRQLMKKGVQDSVAFNNLMCGHAKEGNSDSAFEI 526

Query: 1067 LKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLE 888
            L+IM RR V  E  +Y  LI SYL K EPADAKT LDSM+E GH+P++SL+RSVMESL +
Sbjct: 527  LRIMGRRGVPGEADSYRLLINSYLSKGEPADAKTALDSMLEGGHIPEASLFRSVMESLFQ 586

Query: 887  DGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDS 708
            DGRVQTASRVMK+M+EKGVK++ DL+AKILE L MRGHVEEA+GRI+LLM +G  P FDS
Sbjct: 587  DGRVQTASRVMKSMVEKGVKENMDLVAKILETLFMRGHVEEALGRIDLLMQSGCTPQFDS 646

Query: 707  LLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKG 528
            LLS L EKGKTI+ALKLLDF LERDCN+DFSSY KVLDALL AGKTLNAYS+LCKIMEKG
Sbjct: 647  LLSVLAEKGKTIAALKLLDFCLERDCNVDFSSYDKVLDALLEAGKTLNAYSILCKIMEKG 706

Query: 527  GITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            G++DWSS KDLI++LN+EGNTKQADILSRMI
Sbjct: 707  GVSDWSSTKDLIKSLNQEGNTKQADILSRMI 737


>ref|XP_004299746.2| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Fragaria vesca subsp. vesca]
          Length = 766

 Score =  993 bits (2567), Expect = 0.0
 Identities = 497/715 (69%), Positives = 589/715 (82%), Gaps = 8/715 (1%)
 Frame = -3

Query: 2555 YSSSPVDSPNP----ESNPVEHLIPESSCADSSATVTAAANPSRNAGGKRQ----KNPEK 2400
            + S+   SP P    ++ P E          + +   A+A P       RQ    +NPEK
Sbjct: 31   FCSTETPSPQPGSASDAPPAETPTGSPPDPQNGSAAAASAPPPPQTPKPRQLRRARNPEK 90

Query: 2399 IEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGLF 2220
             EDII RMMANRAWTTRLQNSIR+LVP FDH LV+NVLHGAK S+ ALQFFRWVERS LF
Sbjct: 91   TEDIICRMMANRAWTTRLQNSIRDLVPEFDHNLVWNVLHGAKTSDQALQFFRWVERSRLF 150

Query: 2219 QHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESVK 2040
            QH+RETHLKIIEILGRASKLNHARCILLDMPKKGV+WDEDL++ LIDSYGKAGIVQESVK
Sbjct: 151  QHDRETHLKIIEILGRASKLNHARCILLDMPKKGVQWDEDLFIHLIDSYGKAGIVQESVK 210

Query: 2039 LFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFF 1860
            LF +M+ELGV R++KSY+ LF+ I+RRGRYMM KRYFN ML+EGIEPTRHT+N+MIWGFF
Sbjct: 211  LFNQMKELGVERSLKSYEALFKSILRRGRYMMGKRYFNHMLAEGIEPTRHTYNIMIWGFF 270

Query: 1859 LSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTVI 1680
            LS ++ETA RFFEDMK+R + PDVVTYNT+INGYNR K MDEAE+ +VE+KG+NI+P VI
Sbjct: 271  LSLRLETAKRFFEDMKTRGLSPDVVTYNTMINGYNRFKMMDEAEQLFVELKGKNIQPNVI 330

Query: 1679 TYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKEM 1500
            +YTT+IKGY+SVG+VDD  +L ++MKSF IKPN +T+STLLPG+CDAEK  EA+++L EM
Sbjct: 331  SYTTMIKGYVSVGKVDDGYRLFQEMKSFGIKPNDVTFSTLLPGLCDAEKKDEAQNLLSEM 390

Query: 1499 VEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAGQ 1320
            VE++IAPKDN++F +L+  QCK+GDLDAA +VLKAMIRL +PTEAGHYGILIENFCKAG 
Sbjct: 391  VERHIAPKDNSVFEKLLYCQCKSGDLDAAANVLKAMIRLHIPTEAGHYGILIENFCKAGV 450

Query: 1319 YDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGVQ 1140
            YD AV            +R QS++ +E SAYNP+IEYLCD+GQT KAE L RQLMK GVQ
Sbjct: 451  YDRAVHLLDRLIEKEIIMRSQSSMELEASAYNPMIEYLCDHGQTDKAEVLFRQLMKKGVQ 510

Query: 1139 DPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTTL 960
            D +AFNNLI GHAKEGN D A E+LKIM RR V  E  +Y+ LI+SYL K EPADAKT L
Sbjct: 511  DSVAFNNLIRGHAKEGNSDSAFEILKIMGRRGVPREADSYKLLIKSYLSKGEPADAKTAL 570

Query: 959  DSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLMR 780
            DSMIENGH+P+SSL+RSVMESL EDGRVQTASR+MK+M+EKGV ++ DL+AKILEAL +R
Sbjct: 571  DSMIENGHVPESSLFRSVMESLFEDGRVQTASRIMKSMVEKGVNENMDLVAKILEALFIR 630

Query: 779  GHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYKV 600
            GHVEEA+GRI+LLM +G AP+FDSLLS L EKGKTI+A+KLLDF LERDC +DF SY KV
Sbjct: 631  GHVEEALGRIDLLMQSGCAPEFDSLLSVLAEKGKTIAAVKLLDFCLERDCMVDFKSYDKV 690

Query: 599  LDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            LDALL +GKTLNAYS+LCKIM+KGG+TDW S  DLI++LN EGNTKQAD+LSR I
Sbjct: 691  LDALLESGKTLNAYSILCKIMDKGGVTDWRSTDDLIKSLNLEGNTKQADVLSRKI 745


>ref|XP_008338950.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Malus domestica]
          Length = 763

 Score =  993 bits (2567), Expect = 0.0
 Identities = 506/744 (68%), Positives = 602/744 (80%), Gaps = 9/744 (1%)
 Frame = -3

Query: 2639 MAYLAASRQPHXXXXXXXXXXXXSIFFFYSSSPVDSPNPESNPVEHLIPESSCADSSATV 2460
            MAY++ S+ P+            ++   +SS+  +SP   +   E     ++  ++ A  
Sbjct: 4    MAYISLSK-PYRWRPRPSNPQSLTLLRLFSST--ESPTGATAATESATEATAATEAVAAA 60

Query: 2459 TAAANPSRNAGGK----RQKNPEKIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYN 2292
             AAA P++    K    R +NPEK EDII RMMANRAWTTRLQNSIRNLVP FDH LV+N
Sbjct: 61   EAAA-PAQTRVPKPKQHRHRNPEKTEDIICRMMANRAWTTRLQNSIRNLVPEFDHNLVWN 119

Query: 2291 VLHGAKNSEYALQFFRWVERSGLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVE 2112
            VLHGA+N E+ALQFFRWVERSGLF+H+RETHLKIIEILGR  KLNHARCILLDMPKKGV+
Sbjct: 120  VLHGARNWEHALQFFRWVERSGLFKHDRETHLKIIEILGRNLKLNHARCILLDMPKKGVQ 179

Query: 2111 WDEDLWVLLIDSYGKAG-----IVQESVKLFQKMEELGVGRTIKSYDTLFRVIMRRGRYM 1947
             DEDL++ LID YGKAG     I+QESVKLF  M+ELGV R++KSY+ LF+ IMRRGRYM
Sbjct: 180  LDEDLFLALIDGYGKAGYGKAGIIQESVKLFDSMKELGVQRSLKSYEALFKAIMRRGRYM 239

Query: 1946 MAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETANRFFEDMKSREILPDVVTYNTLI 1767
            MAKRYFN MLSEGIEP RHT+NVMIWGFF+S ++ETA RF+EDMKSR I PD+VTYNT+I
Sbjct: 240  MAKRYFNAMLSEGIEPDRHTYNVMIWGFFMSKRLETAKRFYEDMKSRGISPDLVTYNTMI 299

Query: 1766 NGYNRVKKMDEAEKYYVEMKGRNIEPTVITYTTLIKGYLSVGQVDDALKLLEDMKSFAIK 1587
            +GYNR K MDEAE+ +VE+KGRNIEP VI YTT+IKGY+ VG+VDDAL+L ++MKSF IK
Sbjct: 300  HGYNRFKMMDEAEQLFVELKGRNIEPNVICYTTMIKGYVDVGRVDDALRLFQEMKSFGIK 359

Query: 1586 PNAITYSTLLPGMCDAEKMSEARDILKEMVEKYIAPKDNAIFTRLISGQCKAGDLDAATD 1407
            PNA+T+STLLPG+C+AEK  EA ++LKEMV++YIAPKDNAIF +L++  CK+GDLD+A D
Sbjct: 360  PNAVTFSTLLPGLCEAEKKDEAVNMLKEMVQRYIAPKDNAIFEKLLTLMCKSGDLDSAAD 419

Query: 1406 VLKAMIRLSVPTEAGHYGILIENFCKAGQYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAY 1227
            VLKAMIRLS+PTE GHYGILIENFCKAG YD A+            LRPQS++ +E SAY
Sbjct: 420  VLKAMIRLSIPTEPGHYGILIENFCKAGVYDRAIKLLDKLIEKEIILRPQSSIELEASAY 479

Query: 1226 NPIIEYLCDNGQTAKAETLLRQLMKMGVQDPIAFNNLICGHAKEGNPDPAIELLKIMIRR 1047
            NP+IE+LC++GQT KAE   RQLMK GVQD +AFNNL+CGHAKEGN D A E+L+IM RR
Sbjct: 480  NPMIEHLCNHGQTEKAEVFFRQLMKKGVQDSVAFNNLMCGHAKEGNSDSAFEILRIMGRR 539

Query: 1046 NVLSEKIAYESLIESYLKKSEPADAKTTLDSMIENGHLPDSSLYRSVMESLLEDGRVQTA 867
             V  E  +Y  LI SYL K EPADAKT LDSM+E+GH+P+S L+RSV+ESL EDGRVQTA
Sbjct: 540  GVPGEADSYRLLINSYLSKGEPADAKTALDSMLESGHIPESPLFRSVLESLFEDGRVQTA 599

Query: 866  SRVMKTMLEKGVKDHEDLIAKILEALLMRGHVEEAIGRIELLMVNGLAPDFDSLLSGLCE 687
            SRVMK+M+EKGVK++ DL+AKILEAL MRGHVEEA+GRI+LLM +G  P FDSLLS L E
Sbjct: 600  SRVMKSMVEKGVKENMDLVAKILEALFMRGHVEEALGRIDLLMQSGCTPQFDSLLSVLAE 659

Query: 686  KGKTISALKLLDFGLERDCNIDFSSYYKVLDALLAAGKTLNAYSVLCKIMEKGGITDWSS 507
            KGKTI ALKLLDF LERDC++DFSSY KVLDALL AGKTLNAYS+LCKIMEKG  +DWSS
Sbjct: 660  KGKTIGALKLLDFCLERDCSVDFSSYDKVLDALLEAGKTLNAYSILCKIMEKGEASDWSS 719

Query: 506  CKDLIQNLNEEGNTKQADILSRMI 435
             KDLI++LN+EGNTKQADILSRMI
Sbjct: 720  TKDLIKSLNQEGNTKQADILSRMI 743


>ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Cucumis sativus] gi|700210174|gb|KGN65270.1|
            hypothetical protein Csa_1G293020 [Cucumis sativus]
          Length = 760

 Score =  987 bits (2552), Expect = 0.0
 Identities = 500/716 (69%), Positives = 579/716 (80%), Gaps = 6/716 (0%)
 Frame = -3

Query: 2564 FFFYSSSPVD------SPNPESNPVEHLIPESSCADSSATVTAAANPSRNAGGKRQKNPE 2403
            FF  +  P+       SPN  S   +  +P++     SA V             R ++PE
Sbjct: 33   FFSSTQEPISTATQNGSPNDPSASSDAALPQTG---ESAAVNGVQQVKGRIPRGRPRDPE 89

Query: 2402 KIEDIINRMMANRAWTTRLQNSIRNLVPSFDHELVYNVLHGAKNSEYALQFFRWVERSGL 2223
            K+E II +MMANR WTTRLQNSIR+LVP FDH LVYNVLH AK SE+AL FFRWVER+GL
Sbjct: 90   KLEKIICKMMANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGL 149

Query: 2222 FQHNRETHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLWVLLIDSYGKAGIVQESV 2043
            FQH+RETH KIIEILGRASKLNHARCILLDMP KGV+WDEDL+V+LI+SYGKAGIVQE+V
Sbjct: 150  FQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAV 209

Query: 2042 KLFQKMEELGVGRTIKSYDTLFRVIMRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGF 1863
            K+FQKM+ELGV R++KSYD LF+ IMRRGRYMMAKRYFN ML+EGIEP RHT+NVM+WGF
Sbjct: 210  KIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGF 269

Query: 1862 FLSGKVETANRFFEDMKSREILPDVVTYNTLINGYNRVKKMDEAEKYYVEMKGRNIEPTV 1683
            FLS ++ETA RF+EDMKSR I PDVVTYNT+INGY R K M+EAE+++ EMKG+NI PTV
Sbjct: 270  FLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEMKGKNIAPTV 329

Query: 1682 ITYTTLIKGYLSVGQVDDALKLLEDMKSFAIKPNAITYSTLLPGMCDAEKMSEARDILKE 1503
            I+YTT+IKGY+SV + DDAL+L E+MK+   KPN ITYSTLLPG+CDAEK+ EAR IL E
Sbjct: 330  ISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTE 389

Query: 1502 MVEKYIAPKDNAIFTRLISGQCKAGDLDAATDVLKAMIRLSVPTEAGHYGILIENFCKAG 1323
            MV ++ APKDN+IF RL+S QCK GDLDAA  VLKAMIRLS+PTEAGHYGILIEN CKAG
Sbjct: 390  MVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAG 449

Query: 1322 QYDSAVXXXXXXXXXXXXLRPQSTLHMEPSAYNPIIEYLCDNGQTAKAETLLRQLMKMGV 1143
             YD AV            LRPQSTL ME SAYN II+YLC++GQT KA+T  RQL+K G+
Sbjct: 450  MYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKGI 509

Query: 1142 QDPIAFNNLICGHAKEGNPDPAIELLKIMIRRNVLSEKIAYESLIESYLKKSEPADAKTT 963
            QD +AFNNLI GHAKEGNPD A E+LKIM RR V  +  +Y+ LI+SYL K EPADAKT 
Sbjct: 510  QDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTA 569

Query: 962  LDSMIENGHLPDSSLYRSVMESLLEDGRVQTASRVMKTMLEKGVKDHEDLIAKILEALLM 783
            LDSMIENGH PDS+L+RSVMESL  DGRVQTASRVM +ML+KG+ ++ DL+AKILEAL M
Sbjct: 570  LDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFM 629

Query: 782  RGHVEEAIGRIELLMVNGLAPDFDSLLSGLCEKGKTISALKLLDFGLERDCNIDFSSYYK 603
            RGH EEA+GRI LLM     PDF+SLLS LCEKGKT SA KLLDFGLER+CNI+FSSY K
Sbjct: 630  RGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEK 689

Query: 602  VLDALLAAGKTLNAYSVLCKIMEKGGITDWSSCKDLIQNLNEEGNTKQADILSRMI 435
            VLDALL AGKTLNAY++LCKIMEKGG  DWSSC DLI++LN+EGNTKQADILSRMI
Sbjct: 690  VLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMI 745


Top