BLASTX nr result

ID: Akebia24_contig00021077 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00021077
         (2382 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI32743.3| unnamed protein product [Vitis vinifera]             1088   0.0  
ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containi...  1088   0.0  
gb|EXC31617.1| hypothetical protein L484_008414 [Morus notabilis]    1077   0.0  
ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfam...  1047   0.0  
ref|XP_002530985.1| pentatricopeptide repeat-containing protein,...  1039   0.0  
ref|XP_004163187.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...  1035   0.0  
ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containi...  1035   0.0  
ref|XP_004299746.1| PREDICTED: pentatricopeptide repeat-containi...  1020   0.0  
ref|XP_006481496.1| PREDICTED: pentatricopeptide repeat-containi...  1005   0.0  
ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citr...  1000   0.0  
ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containi...   998   0.0  
ref|XP_002315730.1| pentatricopeptide repeat-containing family p...   994   0.0  
ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containi...   990   0.0  
ref|XP_007225233.1| hypothetical protein PRUPE_ppa001877mg [Prun...   982   0.0  
ref|XP_003524868.1| PREDICTED: pentatricopeptide repeat-containi...   974   0.0  
ref|XP_003532699.1| PREDICTED: pentatricopeptide repeat-containi...   971   0.0  
ref|XP_006410903.1| hypothetical protein EUTSA_v10017966mg [Eutr...   970   0.0  
ref|XP_002881498.1| pentatricopeptide repeat-containing protein ...   957   0.0  
ref|XP_006296196.1| hypothetical protein CARUB_v10025361mg [Caps...   957   0.0  
ref|NP_181260.1| pentatricopeptide repeat-containing protein [Ar...   951   0.0  

>emb|CBI32743.3| unnamed protein product [Vitis vinifera]
          Length = 772

 Score = 1088 bits (2813), Expect = 0.0
 Identities = 550/705 (78%), Positives = 610/705 (86%), Gaps = 20/705 (2%)
 Frame = +1

Query: 181  ISADDQTSDPNKE-------------------ENVVRRTPRGKPPNPEKLEDIICRMMAN 303
            ISA D TS P  E                   E    RTPRGK  NPEK+EDIICRMMAN
Sbjct: 40   ISAGDLTSSPIPETPVSGSPSEPGNLTAAEAGEKASPRTPRGKLRNPEKIEDIICRMMAN 99

Query: 304  RAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHALQFFRWVEKTG-YRHDRSTHLKII 480
            RAWTTRLQNSIR+LVPQFDHSLV+NVLHG+RNS+HALQFFRWVE+ G +RHDR THLKII
Sbjct: 100  RAWTTRLQNSIRSLVPQFDHSLVWNVLHGSRNSDHALQFFRWVERAGLFRHDRDTHLKII 159

Query: 481  EILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDSYGKAGIVQESVKIFQKMKELGVK 660
            EILGRASKLNHARCIL DMPKKGVEWDE++FVLLIDSYGKAGIVQESVK+FQKMKELGV+
Sbjct: 160  EILGRASKLNHARCILLDMPKKGVEWDEDLFVLLIDSYGKAGIVQESVKVFQKMKELGVE 219

Query: 661  RSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRF 840
            R+IKSYD LFKVI+RRGR  MAKRYFN+ML+EGV+PT HTYN+M+WGFFLSLKVETANRF
Sbjct: 220  RTIKSYDALFKVILRRGRYMMAKRYFNAMLNEGVMPTCHTYNIMIWGFFLSLKVETANRF 279

Query: 841  FEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVS 1020
            FE+MK R I PDVVTYNTMING+ R+K+M+EAEKFFVEMKG+N+ PTVISYTTMIKGYVS
Sbjct: 280  FEEMKERRISPDVVTYNTMINGYYRIKKMEEAEKFFVEMKGRNIEPTVISYTTMIKGYVS 339

Query: 1021 VSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNS 1200
            V RVDD LRLF+EM  FGIK N  TYSTLLPGLCD EKM EA+ ++KEMVER+I PKDNS
Sbjct: 340  VGRVDDGLRLFEEMKSFGIKPNAVTYSTLLPGLCDGEKMLEAQNVVKEMVERYIAPKDNS 399

Query: 1201 IFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKV 1380
            IF+RLI+CQCK+G LD AADVLK MIRLSIPTE  HYGVLIENFCK G YDRA+ LLDK+
Sbjct: 400  IFMRLITCQCKAGQLDAAADVLKAMIRLSIPTEAGHYGVLIENFCKSGVYDRAVKLLDKL 459

Query: 1381 CEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCG 1560
             EKEI+L P++S EME + YN +I+YLC  GQT KAE  FRQLMKKGVQDP AFNNL+ G
Sbjct: 460  IEKEIILRPQNSLEMESSGYNLIIEYLCNSGQTSKAETLFRQLMKKGVQDPIAFNNLIRG 519

Query: 1561 HSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPD 1740
            HSKEG PESAFEILKIMGRR V  +ADAY LL++SFLKKGEPADAKTALDGMIE+GH+PD
Sbjct: 520  HSKEGAPESAFEILKIMGRREVPREADAYRLLIESFLKKGEPADAKTALDGMIENGHIPD 579

Query: 1741 SSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIE 1920
            SSLFRSVM+SLF+DGR+QTASRVM +M+EKGVKENMDLVA+ILEALL+RGHVEEALGRI+
Sbjct: 580  SSLFRSVMESLFEDGRIQTASRVMNNMVEKGVKENMDLVAKILEALLLRGHVEEALGRID 639

Query: 1921 LLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTL 2100
            LLM N C PDFD LL+VLC K KTIAALKLLDF LERD N+SFSSY+ VLDALL AGKTL
Sbjct: 640  LLMNNGCEPDFDGLLSVLCAKGKTIAALKLLDFGLERDYNISFSSYENVLDALLTAGKTL 699

Query: 2101 NAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTKQADILKRMI 2235
            NAYSILCKIM+KGG TDW SSC+DLIRSLN EGNTKQADIL RMI
Sbjct: 700  NAYSILCKIMQKGGATDW-SSCKDLIRSLNEEGNTKQADILSRMI 743


>ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Vitis vinifera]
          Length = 763

 Score = 1088 bits (2813), Expect = 0.0
 Identities = 550/705 (78%), Positives = 610/705 (86%), Gaps = 20/705 (2%)
 Frame = +1

Query: 181  ISADDQTSDPNKE-------------------ENVVRRTPRGKPPNPEKLEDIICRMMAN 303
            ISA D TS P  E                   E    RTPRGK  NPEK+EDIICRMMAN
Sbjct: 40   ISAGDLTSSPIPETPVSGSPSEPGNLTAAEAGEKASPRTPRGKLRNPEKIEDIICRMMAN 99

Query: 304  RAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHALQFFRWVEKTG-YRHDRSTHLKII 480
            RAWTTRLQNSIR+LVPQFDHSLV+NVLHG+RNS+HALQFFRWVE+ G +RHDR THLKII
Sbjct: 100  RAWTTRLQNSIRSLVPQFDHSLVWNVLHGSRNSDHALQFFRWVERAGLFRHDRDTHLKII 159

Query: 481  EILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDSYGKAGIVQESVKIFQKMKELGVK 660
            EILGRASKLNHARCIL DMPKKGVEWDE++FVLLIDSYGKAGIVQESVK+FQKMKELGV+
Sbjct: 160  EILGRASKLNHARCILLDMPKKGVEWDEDLFVLLIDSYGKAGIVQESVKVFQKMKELGVE 219

Query: 661  RSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRF 840
            R+IKSYD LFKVI+RRGR  MAKRYFN+ML+EGV+PT HTYN+M+WGFFLSLKVETANRF
Sbjct: 220  RTIKSYDALFKVILRRGRYMMAKRYFNAMLNEGVMPTCHTYNIMIWGFFLSLKVETANRF 279

Query: 841  FEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVS 1020
            FE+MK R I PDVVTYNTMING+ R+K+M+EAEKFFVEMKG+N+ PTVISYTTMIKGYVS
Sbjct: 280  FEEMKERRISPDVVTYNTMINGYYRIKKMEEAEKFFVEMKGRNIEPTVISYTTMIKGYVS 339

Query: 1021 VSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNS 1200
            V RVDD LRLF+EM  FGIK N  TYSTLLPGLCD EKM EA+ ++KEMVER+I PKDNS
Sbjct: 340  VGRVDDGLRLFEEMKSFGIKPNAVTYSTLLPGLCDGEKMLEAQNVVKEMVERYIAPKDNS 399

Query: 1201 IFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKV 1380
            IF+RLI+CQCK+G LD AADVLK MIRLSIPTE  HYGVLIENFCK G YDRA+ LLDK+
Sbjct: 400  IFMRLITCQCKAGQLDAAADVLKAMIRLSIPTEAGHYGVLIENFCKSGVYDRAVKLLDKL 459

Query: 1381 CEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCG 1560
             EKEI+L P++S EME + YN +I+YLC  GQT KAE  FRQLMKKGVQDP AFNNL+ G
Sbjct: 460  IEKEIILRPQNSLEMESSGYNLIIEYLCNSGQTSKAETLFRQLMKKGVQDPIAFNNLIRG 519

Query: 1561 HSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPD 1740
            HSKEG PESAFEILKIMGRR V  +ADAY LL++SFLKKGEPADAKTALDGMIE+GH+PD
Sbjct: 520  HSKEGAPESAFEILKIMGRREVPREADAYRLLIESFLKKGEPADAKTALDGMIENGHIPD 579

Query: 1741 SSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIE 1920
            SSLFRSVM+SLF+DGR+QTASRVM +M+EKGVKENMDLVA+ILEALL+RGHVEEALGRI+
Sbjct: 580  SSLFRSVMESLFEDGRIQTASRVMNNMVEKGVKENMDLVAKILEALLLRGHVEEALGRID 639

Query: 1921 LLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTL 2100
            LLM N C PDFD LL+VLC K KTIAALKLLDF LERD N+SFSSY+ VLDALL AGKTL
Sbjct: 640  LLMNNGCEPDFDGLLSVLCAKGKTIAALKLLDFGLERDYNISFSSYENVLDALLTAGKTL 699

Query: 2101 NAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTKQADILKRMI 2235
            NAYSILCKIM+KGG TDW SSC+DLIRSLN EGNTKQADIL RMI
Sbjct: 700  NAYSILCKIMQKGGATDW-SSCKDLIRSLNEEGNTKQADILSRMI 743


>gb|EXC31617.1| hypothetical protein L484_008414 [Morus notabilis]
          Length = 768

 Score = 1077 bits (2784), Expect = 0.0
 Identities = 532/674 (78%), Positives = 597/674 (88%), Gaps = 1/674 (0%)
 Frame = +1

Query: 217  EENVVRRTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGAR 396
            E   ++RTPRGK  NPEK+EDIICRMMANRAWTTRLQNSIR LVPQFDHSLV+NVLHGAR
Sbjct: 76   ENTAIQRTPRGKSRNPEKIEDIICRMMANRAWTTRLQNSIRRLVPQFDHSLVWNVLHGAR 135

Query: 397  NSEHALQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMF 573
            NS+HALQFFRWVE++G + HDR THLKIIEIL RASKLNHARCIL DMPKK V+WDE++F
Sbjct: 136  NSDHALQFFRWVERSGLFNHDRETHLKIIEILTRASKLNHARCILLDMPKKSVQWDEDLF 195

Query: 574  VLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLS 753
            VL ID YGKAGIVQESV++F KMKELGV+RS+KSYD LFKVI+RRGR  MAKRYFN+M++
Sbjct: 196  VLFIDGYGKAGIVQESVRMFNKMKELGVERSVKSYDALFKVILRRGRYMMAKRYFNAMIN 255

Query: 754  EGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDE 933
            EG+ PT+HTYN+MLWGFFLSL++ETA RF+EDMKNRG+ PDVVTYNTMING+ R K MDE
Sbjct: 256  EGIEPTKHTYNIMLWGFFLSLRLETAKRFYEDMKNRGVWPDVVTYNTMINGYNRFKMMDE 315

Query: 934  AEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLP 1113
            AEK FVEMKG+N+ PTVISYTTMIKGYVS+ RVDD LRLF+EM  FGIK N  TY+TLLP
Sbjct: 316  AEKMFVEMKGRNIAPTVISYTTMIKGYVSIGRVDDGLRLFEEMKSFGIKPNAVTYTTLLP 375

Query: 1114 GLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIP 1293
            GLCD EKM EAR +LKEMV+R+I PKDNSIFLRL+S QCK G LD AADVLK MIRLSIP
Sbjct: 376  GLCDAEKMSEARTMLKEMVDRYIAPKDNSIFLRLLSSQCKVGDLDAAADVLKAMIRLSIP 435

Query: 1294 TEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHG 1473
            TE  HYG+LIENFCK   YDRA+ LLDK+ EKEI+L P+SS+EME +AYN MI++LC HG
Sbjct: 436  TEAGHYGILIENFCKAAVYDRAVKLLDKLIEKEIVLRPQSSTEMEASAYNAMIQFLCNHG 495

Query: 1474 QTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYEL 1653
            QT KAE+FFRQLMKKGVQDP AFNNL+ GHSKEG P+SAFEILKIMGRRGV  DAD+Y L
Sbjct: 496  QTGKAEIFFRQLMKKGVQDPVAFNNLIRGHSKEGNPDSAFEILKIMGRRGVARDADSYRL 555

Query: 1654 LVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKG 1833
            L+KS+L KGEPADAKTALD MIE+ HLP+SSLFRSVM+SL++DGR QTASRVMKSM+EKG
Sbjct: 556  LIKSYLSKGEPADAKTALDSMIENDHLPESSLFRSVMESLYEDGRAQTASRVMKSMIEKG 615

Query: 1834 VKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLL 2013
            VKENMDLVA+ILEALL+RGHVEEALGRI+LLM + C P+FDSLL+VLCEK KTIAALKLL
Sbjct: 616  VKENMDLVAKILEALLVRGHVEEALGRIDLLMQSGCAPNFDSLLSVLCEKGKTIAALKLL 675

Query: 2014 DFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNA 2193
            DF LERD  V FSSYDKVLDALLAAGKTLNAYSILCKIM KGGVTDW S CEDLI+SLN 
Sbjct: 676  DFCLERDYVVDFSSYDKVLDALLAAGKTLNAYSILCKIMGKGGVTDW-SGCEDLIKSLNK 734

Query: 2194 EGNTKQADILKRMI 2235
            EGNTKQADI+ RMI
Sbjct: 735  EGNTKQADIISRMI 748


>ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao] gi|508712488|gb|EOY04385.1| Tetratricopeptide
            repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 743

 Score = 1047 bits (2707), Expect = 0.0
 Identities = 525/688 (76%), Positives = 599/688 (87%), Gaps = 4/688 (0%)
 Frame = +1

Query: 184  SADDQTSDPNKE-ENVV--RRTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQ 354
            S +   + P +E E VV  R +PRGK  NPEK+ED+ICRMM NRAWTTRLQNSIR LVP+
Sbjct: 37   SQELNNAPPQQEGEKVVTQRTSPRGKTRNPEKVEDVICRMMENRAWTTRLQNSIRALVPE 96

Query: 355  FDHSLVYNVLHGARNSEHALQFFRWVEKTGY-RHDRSTHLKIIEILGRASKLNHARCILF 531
            FDH+LVYNVLHGA+NSE ALQFFRWVE+ G  RHDR  H+KII+ILGRASKLNHARCIL 
Sbjct: 97   FDHALVYNVLHGAKNSEQALQFFRWVERAGLIRHDREAHMKIIQILGRASKLNHARCILL 156

Query: 532  DMPKKGVEWDEEMFVLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRG 711
            DMPKKGVEWDE++FV+LIDSYGKAGIVQE+VKIFQKM ELGV+R+IKSYD  FKVI+RRG
Sbjct: 157  DMPKKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMNELGVERTIKSYDAFFKVILRRG 216

Query: 712  RVQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYN 891
            R  MAKRYFN MLSEG++PTRHTYN+MLWGFFLSL+++TANRF+EDMK RGI PDVVTYN
Sbjct: 217  RYMMAKRYFNKMLSEGIVPTRHTYNIMLWGFFLSLRLDTANRFYEDMKTRGISPDVVTYN 276

Query: 892  TMINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLF 1071
            TMING+ R K+M+EAEK FVEMKGKNL PTVISYTTMIKGYV+V +VDD LRL +EM  F
Sbjct: 277  TMINGYSRFKKMEEAEKLFVEMKGKNLAPTVISYTTMIKGYVAVEQVDDGLRLLEEMKSF 336

Query: 1072 GIKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDW 1251
            GIK N  TYSTLLPGLCD  KM EA+ +LKEMVE +I PKDNSIF+ L++ QCKSG LD 
Sbjct: 337  GIKPNATTYSTLLPGLCDAGKMTEAKSILKEMVEWYIAPKDNSIFINLLNSQCKSGDLDA 396

Query: 1252 AADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEP 1431
            AADVLK MIRLSIPTE  HYGVLIENFCK   +DRAI LLDK+ EKEI+L P++S +ME 
Sbjct: 397  AADVLKAMIRLSIPTEAGHYGVLIENFCKANLFDRAIKLLDKLVEKEIILRPQNSLDMEA 456

Query: 1432 NAYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIM 1611
            +AYN MI+YLC HGQT KAE+FFRQLMKKGV DP+AFNNL+ GH+KEG P  AFEILKIM
Sbjct: 457  SAYNAMIQYLCHHGQTGKAEVFFRQLMKKGVLDPTAFNNLIRGHAKEGNPGLAFEILKIM 516

Query: 1612 GRRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRV 1791
            GRRGV  DADAY+LL++S+L+KGEPADAKT+LD MIE G LP+S +F+SVM+SLF+DGR+
Sbjct: 517  GRRGVPKDADAYKLLIESYLRKGEPADAKTSLDSMIEDGLLPESGIFKSVMESLFEDGRI 576

Query: 1792 QTASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAV 1971
            QTASRVMKSM+EKGVKE+MDLVA+ILEALL+RGHVEEALGRIELLM N C P+ DSLL+V
Sbjct: 577  QTASRVMKSMVEKGVKEHMDLVAKILEALLMRGHVEEALGRIELLMQNGCAPNLDSLLSV 636

Query: 1972 LCEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTD 2151
            L EK KTIAALKLLDF LERDC++ FSSY+KVLDALLAAGKTLNAYSILCKIMEKGG+T+
Sbjct: 637  LSEKGKTIAALKLLDFGLERDCSIDFSSYEKVLDALLAAGKTLNAYSILCKIMEKGGITN 696

Query: 2152 WFSSCEDLIRSLNAEGNTKQADILKRMI 2235
            W SS EDLI+SLN EGNTKQADIL RMI
Sbjct: 697  W-SSLEDLIKSLNQEGNTKQADILSRMI 723


>ref|XP_002530985.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223529437|gb|EEF31397.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 753

 Score = 1039 bits (2687), Expect = 0.0
 Identities = 519/669 (77%), Positives = 584/669 (87%), Gaps = 1/669 (0%)
 Frame = +1

Query: 232  RRTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHA 411
            +R PRGK P+PEK+ED I RMMANR WTTRLQNSIRNLVP FDHSLVYNVLH ARNSEHA
Sbjct: 66   QRIPRGKRPDPEKVEDTISRMMANRPWTTRLQNSIRNLVPHFDHSLVYNVLHAARNSEHA 125

Query: 412  LQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLID 588
            LQFFRWVE+ G +++DR TH+KIIEILGRASKLNHARCIL DMPKKGVEWDE MFV+LI+
Sbjct: 126  LQFFRWVERAGLFKNDRDTHMKIIEILGRASKLNHARCILLDMPKKGVEWDEYMFVVLIE 185

Query: 589  SYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIP 768
            SYGKAGIVQE+VKIF KM ELGV+RSIKSYD LFKVI+RRGR  MAKR FN ML++G+ P
Sbjct: 186  SYGKAGIVQEAVKIFNKMNELGVERSIKSYDALFKVILRRGRYMMAKRVFNKMLNDGIQP 245

Query: 769  TRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFF 948
            TRHTYN+MLWGFFLSL++ETA RF++DMKNRGI PDVVTYNTMINGF R K+M+EAEK F
Sbjct: 246  TRHTYNIMLWGFFLSLRLETAMRFYDDMKNRGISPDVVTYNTMINGFYRFKKMEEAEKLF 305

Query: 949  VEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDT 1128
            VEMKGKN+ PTVISYTTMIKGYV+V RVDD LRL +EM  F IK N  TYSTLLPGLCD 
Sbjct: 306  VEMKGKNIAPTVISYTTMIKGYVAVDRVDDGLRLLEEMKSFNIKPNVHTYSTLLPGLCDA 365

Query: 1129 EKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISH 1308
             KM EA+ +L EMV RH+ PKDNSIFLRL+SCQCK+G L  A DVL  M+RL IPTE  H
Sbjct: 366  WKMTEAKDILIEMVARHLAPKDNSIFLRLLSCQCKAGDLRAAEDVLNTMMRLHIPTEAGH 425

Query: 1309 YGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKA 1488
            YGVLIENFCK  EYDRA+  LDK+ EKEI+L P+S+ E+E NAYNPMI+YLC HGQT KA
Sbjct: 426  YGVLIENFCKAEEYDRAVKYLDKLIEKEIILRPQSTLEIESNAYNPMIQYLCSHGQTGKA 485

Query: 1489 EMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSF 1668
            E+FFRQLMKKGVQDP AFNNL+CGH+KEG P+SAFEI KIMG+RGV  DADAY L+++S+
Sbjct: 486  EIFFRQLMKKGVQDPLAFNNLICGHAKEGYPDSAFEIFKIMGKRGVPRDADAYRLIIESY 545

Query: 1669 LKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENM 1848
            L+KGEPADAKTALDGM+E GH+PD S+FRSVM+SLF+DGRVQTASRVMKSM+EKGVKENM
Sbjct: 546  LRKGEPADAKTALDGMLEDGHVPDPSVFRSVMESLFEDGRVQTASRVMKSMVEKGVKENM 605

Query: 1849 DLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALE 2028
            DLV +ILEALL+RGHVEEALGRIELLM +    +FD LL+VL EK KTIAALKLLDFALE
Sbjct: 606  DLVGKILEALLMRGHVEEALGRIELLMQSGFHVNFDDLLSVLSEKGKTIAALKLLDFALE 665

Query: 2029 RDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTK 2208
            RD N+ F SYDKVLDALLAAGKTLNAYSILCKIM+KGGV+DW SS +DLI+SLN EGNTK
Sbjct: 666  RDFNLDFKSYDKVLDALLAAGKTLNAYSILCKIMQKGGVSDW-SSSKDLIKSLNQEGNTK 724

Query: 2209 QADILKRMI 2235
            QADIL RMI
Sbjct: 725  QADILSRMI 733


>ref|XP_004163187.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g37230-like [Cucumis sativus]
          Length = 760

 Score = 1035 bits (2677), Expect = 0.0
 Identities = 517/722 (71%), Positives = 594/722 (82%), Gaps = 9/722 (1%)
 Frame = +1

Query: 97   PLFLYSLHGFCSNLSXXXXXXXXXXXXXISADDQTSDP--------NKEENVVRRTPRGK 252
            P  L SLH F S                 SA    + P        N  + V  R PRG+
Sbjct: 25   PTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGR 84

Query: 253  PPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHALQFFRWV 432
            P +PEKLE IIC+MMANR WTTRLQNSIR+LVPQFDH+LVYNVLH A+ SEHAL FFRWV
Sbjct: 85   PRDPEKLEXIICKMMANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWV 144

Query: 433  EKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDSYGKAGI 609
            E+ G ++HDR TH KIIEILGRASKLNHARCIL DMP KGV+WDE++FV+LI+SYGKAGI
Sbjct: 145  ERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGI 204

Query: 610  VQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPTRHTYNL 789
            VQE+VKIFQKMKELGV+RS+KSYD LFK IMRRGR  MAKRYFN+ML+EG+ P RHTYN+
Sbjct: 205  VQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNV 264

Query: 790  MLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFVEMKGKN 969
            MLWGFFLSL++ETA RF+EDMK+RGI PDVVTYNTMING+CR K M+EAE+FF EMKGKN
Sbjct: 265  MLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEMKGKN 324

Query: 970  LIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTEKMPEAR 1149
            + PTVISYTTMIKGYVSVSR DDALRLF+EM   G K ND TYSTLLPGLCD EK+PEAR
Sbjct: 325  IAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEAR 384

Query: 1150 KLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHYGVLIEN 1329
            K+L EMV RH  PKDNSIF+RL+SCQCK G LD A  VLK MIRLSIPTE  HYG+LIEN
Sbjct: 385  KILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIEN 444

Query: 1330 FCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAEMFFRQL 1509
             CK G YD+A+ LL+ + EKEI+L P+S+ EME +AYN +I+YLC HGQT KA+ FFRQL
Sbjct: 445  CCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQL 504

Query: 1510 MKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFLKKGEPA 1689
            +KKG+QD  AFNNL+ GH+KEG P+ AFE+LKIMGRRGV  DA++Y+LL+KS+L KGEPA
Sbjct: 505  LKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPA 564

Query: 1690 DAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENMDLVARIL 1869
            DAKTALD MIE+GH PDS+LFRSVM+SLF DGRVQTASRVM SML+KG+ EN+DLVA+IL
Sbjct: 565  DAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKIL 624

Query: 1870 EALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALERDCNVSF 2049
            EAL +RGH EEALGRI LLM  +C PDF+SLL+VLCEK KT +A KLLDF LER+CN+ F
Sbjct: 625  EALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEF 684

Query: 2050 SSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTKQADILKR 2229
            SSY+KVLDALL AGKTLNAY+ILCKIMEKGG  DW SSC+DLI+SLN EGNTKQADIL R
Sbjct: 685  SSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDW-SSCDDLIKSLNQEGNTKQADILSR 743

Query: 2230 MI 2235
            MI
Sbjct: 744  MI 745


>ref|XP_004149878.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Cucumis sativus]
          Length = 760

 Score = 1035 bits (2677), Expect = 0.0
 Identities = 517/722 (71%), Positives = 594/722 (82%), Gaps = 9/722 (1%)
 Frame = +1

Query: 97   PLFLYSLHGFCSNLSXXXXXXXXXXXXXISADDQTSDP--------NKEENVVRRTPRGK 252
            P  L SLH F S                 SA    + P        N  + V  R PRG+
Sbjct: 25   PTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGR 84

Query: 253  PPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHALQFFRWV 432
            P +PEKLE IIC+MMANR WTTRLQNSIR+LVPQFDH+LVYNVLH A+ SEHAL FFRWV
Sbjct: 85   PRDPEKLEKIICKMMANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWV 144

Query: 433  EKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDSYGKAGI 609
            E+ G ++HDR TH KIIEILGRASKLNHARCIL DMP KGV+WDE++FV+LI+SYGKAGI
Sbjct: 145  ERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGI 204

Query: 610  VQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPTRHTYNL 789
            VQE+VKIFQKMKELGV+RS+KSYD LFK IMRRGR  MAKRYFN+ML+EG+ P RHTYN+
Sbjct: 205  VQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNV 264

Query: 790  MLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFVEMKGKN 969
            MLWGFFLSL++ETA RF+EDMK+RGI PDVVTYNTMING+CR K M+EAE+FF EMKGKN
Sbjct: 265  MLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEMKGKN 324

Query: 970  LIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTEKMPEAR 1149
            + PTVISYTTMIKGYVSVSR DDALRLF+EM   G K ND TYSTLLPGLCD EK+PEAR
Sbjct: 325  IAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEAR 384

Query: 1150 KLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHYGVLIEN 1329
            K+L EMV RH  PKDNSIF+RL+SCQCK G LD A  VLK MIRLSIPTE  HYG+LIEN
Sbjct: 385  KILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIEN 444

Query: 1330 FCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAEMFFRQL 1509
             CK G YD+A+ LL+ + EKEI+L P+S+ EME +AYN +I+YLC HGQT KA+ FFRQL
Sbjct: 445  CCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQL 504

Query: 1510 MKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFLKKGEPA 1689
            +KKG+QD  AFNNL+ GH+KEG P+ AFE+LKIMGRRGV  DA++Y+LL+KS+L KGEPA
Sbjct: 505  LKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPA 564

Query: 1690 DAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENMDLVARIL 1869
            DAKTALD MIE+GH PDS+LFRSVM+SLF DGRVQTASRVM SML+KG+ EN+DLVA+IL
Sbjct: 565  DAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKIL 624

Query: 1870 EALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALERDCNVSF 2049
            EAL +RGH EEALGRI LLM  +C PDF+SLL+VLCEK KT +A KLLDF LER+CN+ F
Sbjct: 625  EALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEF 684

Query: 2050 SSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTKQADILKR 2229
            SSY+KVLDALL AGKTLNAY+ILCKIMEKGG  DW SSC+DLI+SLN EGNTKQADIL R
Sbjct: 685  SSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDW-SSCDDLIKSLNQEGNTKQADILSR 743

Query: 2230 MI 2235
            MI
Sbjct: 744  MI 745


>ref|XP_004299746.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Fragaria vesca subsp. vesca]
          Length = 763

 Score = 1020 bits (2638), Expect = 0.0
 Identities = 521/754 (69%), Positives = 603/754 (79%), Gaps = 14/754 (1%)
 Frame = +1

Query: 16   MAYISSSATKPLLWKXXXXXXXXXXXNPLFLYSLHGFCSNLSXXXXXXXXXXXXXISADD 195
            MA+IS S  KP  W+           NP  L  L  FCS  +               A+ 
Sbjct: 1    MAFISLS--KPSQWRPRLS-------NPQSLPLLRLFCSTETPSPQPGSASDAPP--AET 49

Query: 196  QTSDPNKEENVVRRTPRGKPP-------------NPEKLEDIICRMMANRAWTTRLQNSI 336
             T  P   +N         PP             NPEK EDIICRMMANRAWTTRLQNSI
Sbjct: 50   PTGSPPDPQNGSAAAASAPPPPQTPKPRQLRRARNPEKTEDIICRMMANRAWTTRLQNSI 109

Query: 337  RNLVPQFDHSLVYNVLHGARNSEHALQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNH 513
            R+LVP+FDH+LV+NVLHGA+ S+ ALQFFRWVE++  ++HDR THLKIIEILGRASKLNH
Sbjct: 110  RDLVPEFDHNLVWNVLHGAKTSDQALQFFRWVERSRLFQHDRETHLKIIEILGRASKLNH 169

Query: 514  ARCILFDMPKKGVEWDEEMFVLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFK 693
            ARCIL DMPKKGV+WDE++F+ LIDSYGKAGIVQESVK+F +MKELGV+RS+KSY+ LFK
Sbjct: 170  ARCILLDMPKKGVQWDEDLFIHLIDSYGKAGIVQESVKLFNQMKELGVERSLKSYEALFK 229

Query: 694  VIMRRGRVQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVP 873
             I+RRGR  M KRYFN ML+EG+ PTRHTYN+M+WGFFLSL++ETA RFFEDMK RG+ P
Sbjct: 230  SILRRGRYMMGKRYFNHMLAEGIEPTRHTYNIMIWGFFLSLRLETAKRFFEDMKTRGLSP 289

Query: 874  DVVTYNTMINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLF 1053
            DVVTYNTMING+ R K MDEAE+ FVE+KGKN+ P VISYTTMIKGYVSV +VDD  RLF
Sbjct: 290  DVVTYNTMINGYNRFKMMDEAEQLFVELKGKNIQPNVISYTTMIKGYVSVGKVDDGYRLF 349

Query: 1054 DEMGLFGIKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCK 1233
             EM  FGIK ND T+STLLPGLCD EK  EA+ LL EMVERHI PKDNS+F +L+ CQCK
Sbjct: 350  QEMKSFGIKPNDVTFSTLLPGLCDAEKKDEAQNLLSEMVERHIAPKDNSVFEKLLYCQCK 409

Query: 1234 SGHLDWAADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRS 1413
            SG LD AA+VLK MIRL IPTE  HYG+LIENFCK G YDRA++LLD++ EKEI++  +S
Sbjct: 410  SGDLDAAANVLKAMIRLHIPTEAGHYGILIENFCKAGVYDRAVHLLDRLIEKEIIMRSQS 469

Query: 1414 SSEMEPNAYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAF 1593
            S E+E +AYNPMI+YLC+HGQT KAE+ FRQLMKKGVQD  AFNNL+ GH+KEG  +SAF
Sbjct: 470  SMELEASAYNPMIEYLCDHGQTDKAEVLFRQLMKKGVQDSVAFNNLIRGHAKEGNSDSAF 529

Query: 1594 EILKIMGRRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSL 1773
            EILKIMGRRGV  +AD+Y+LL+KS+L KGEPADAKTALD MIE+GH+P+SSLFRSVM+SL
Sbjct: 530  EILKIMGRRGVPREADSYKLLIKSYLSKGEPADAKTALDSMIENGHVPESSLFRSVMESL 589

Query: 1774 FDDGRVQTASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDF 1953
            F+DGRVQTASR+MKSM+EKGV ENMDLVA+ILEAL IRGHVEEALGRI+LLM + C P+F
Sbjct: 590  FEDGRVQTASRIMKSMVEKGVNENMDLVAKILEALFIRGHVEEALGRIDLLMQSGCAPEF 649

Query: 1954 DSLLAVLCEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIME 2133
            DSLL+VL EK KTIAA+KLLDF LERDC V F SYDKVLDALL +GKTLNAYSILCKIM+
Sbjct: 650  DSLLSVLAEKGKTIAAVKLLDFCLERDCMVDFKSYDKVLDALLESGKTLNAYSILCKIMD 709

Query: 2134 KGGVTDWFSSCEDLIRSLNAEGNTKQADILKRMI 2235
            KGGVTDW  S +DLI+SLN EGNTKQAD+L R I
Sbjct: 710  KGGVTDW-RSTDDLIKSLNLEGNTKQADVLSRKI 742


>ref|XP_006481496.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Citrus sinensis]
          Length = 751

 Score = 1005 bits (2599), Expect = 0.0
 Identities = 504/676 (74%), Positives = 580/676 (85%), Gaps = 1/676 (0%)
 Frame = +1

Query: 211  NKEENVVRRTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHG 390
            ++E +  +R PRG   +P KLED IC++MA RAWTTRLQN IR LVPQFDH+LVYNVLHG
Sbjct: 58   DEEPSQRQRIPRGNHRSPVKLEDTICKLMAERAWTTRLQNKIRALVPQFDHNLVYNVLHG 117

Query: 391  ARNSEHALQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEE 567
            A+NSEHALQFFRWVE+ G + HDR THLK+IEILGR  KLNHARCIL DMPKKGV+WDE+
Sbjct: 118  AKNSEHALQFFRWVERAGLFNHDRETHLKMIEILGRVGKLNHARCILLDMPKKGVQWDED 177

Query: 568  MFVLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSM 747
            MF +LI+SYGK GIVQESVKIF  MK+LGV+RS+KSYD LFK+I+RRGR  MAKRYFN M
Sbjct: 178  MFEVLIESYGKKGIVQESVKIFDIMKQLGVERSVKSYDALFKLILRRGRYMMAKRYFNKM 237

Query: 748  LSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQM 927
            LSEG+ PTRHTYN+MLWGFFLSLK+ETA RFFEDMK+RGI PDVVTYNTMING+ R K+M
Sbjct: 238  LSEGIEPTRHTYNVMLWGFFLSLKLETAIRFFEDMKSRGISPDVVTYNTMINGYNRFKKM 297

Query: 928  DEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTL 1107
            DEAEK F EMK KN+ PTVISYTTMIKGYV+V R DDALR+FDEM  F +K N  TY+ L
Sbjct: 298  DEAEKLFAEMKEKNIEPTVISYTTMIKGYVAVERADDALRIFDEMKSFDVKPNAVTYTAL 357

Query: 1108 LPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLS 1287
            LPGLCD  KM E +K+L+EMVER+I PKDNS+F++L+  QCKSGHL+ AADVLK MIRLS
Sbjct: 358  LPGLCDAGKMVEVQKVLREMVERYIPPKDNSVFMKLLGVQCKSGHLNAAADVLKAMIRLS 417

Query: 1288 IPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCE 1467
            IPTE  HYG+LIENFCK   YDRAI LLDK+ EKEI+L P+S+ +ME ++YNPMI++LC 
Sbjct: 418  IPTEAGHYGILIENFCKAEMYDRAIKLLDKLVEKEIILRPQSTLDMEASSYNPMIQHLCH 477

Query: 1468 HGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAY 1647
            +GQT KAE+FFRQLMKKGV DP AFNNL+ GHSKEG P+SAFEI+KIMGRRGV  DADAY
Sbjct: 478  NGQTGKAEIFFRQLMKKGVLDPVAFNNLIRGHSKEGNPDSAFEIVKIMGRRGVPRDADAY 537

Query: 1648 ELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLE 1827
              L++S+L+KGEPADAKTALD MIE GH P SSLFRSVM+SLF+DGRVQTASRVMKSM+E
Sbjct: 538  ICLIESYLRKGEPADAKTALDSMIEDGHSPASSLFRSVMESLFEDGRVQTASRVMKSMVE 597

Query: 1828 KGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALK 2007
            KGVKEN+DLVA+ILEALL+RGHVEEALGRI+L+M +  VP+FDSLL+VL EK KTIAA+K
Sbjct: 598  KGVKENLDLVAKILEALLMRGHVEEALGRIDLMMQSGSVPNFDSLLSVLSEKGKTIAAVK 657

Query: 2008 LLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSL 2187
            LLDF L RDC +  +SY+KVLDALLAAGKTLNAYSIL KIMEKGGVTDW SS + LI  L
Sbjct: 658  LLDFCLGRDCIIDLASYEKVLDALLAAGKTLNAYSILFKIMEKGGVTDWKSS-DKLIAGL 716

Query: 2188 NAEGNTKQADILKRMI 2235
            N EGNTKQADIL RMI
Sbjct: 717  NQEGNTKQADILSRMI 732


>ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citrus clementina]
            gi|557530823|gb|ESR42006.1| hypothetical protein
            CICLE_v10011107mg [Citrus clementina]
          Length = 787

 Score = 1000 bits (2586), Expect = 0.0
 Identities = 502/676 (74%), Positives = 579/676 (85%), Gaps = 1/676 (0%)
 Frame = +1

Query: 211  NKEENVVRRTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHG 390
            ++E +  +R PRG   +P KLED IC++MA RAWTTRLQN IR LVPQFDH+LVYNVLHG
Sbjct: 94   DEEPSQRQRIPRGNHRSPVKLEDTICKLMAERAWTTRLQNKIRALVPQFDHNLVYNVLHG 153

Query: 391  ARNSEHALQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEE 567
            A+NSEHALQFFRWVE+ G + HDR THLK+IEILGR  KLNHARCIL DMPKKGV+WDE+
Sbjct: 154  AKNSEHALQFFRWVERAGLFNHDRETHLKMIEILGRVGKLNHARCILLDMPKKGVQWDED 213

Query: 568  MFVLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSM 747
            +F +LI+SYGK GIVQESVKIF  MK+LGV+RS+KSYD LFK+I+RRGR  MAKRYFN M
Sbjct: 214  LFEVLIESYGKKGIVQESVKIFDIMKQLGVERSVKSYDALFKLILRRGRYMMAKRYFNKM 273

Query: 748  LSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQM 927
            LSEG+ PTRHTYN+MLWGFFLSLK+ETA RFFEDMK+RGI PDVVTYNTMING+ R K+M
Sbjct: 274  LSEGIEPTRHTYNVMLWGFFLSLKLETAIRFFEDMKSRGISPDVVTYNTMINGYNRFKKM 333

Query: 928  DEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTL 1107
            DEAEK F EMK KN+ PTVISYTTMIKGYV+V R DDALR+FDEM  F +K N  TY+ L
Sbjct: 334  DEAEKLFAEMKEKNIEPTVISYTTMIKGYVAVERADDALRIFDEMKSFDVKPNAVTYTAL 393

Query: 1108 LPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLS 1287
            LPGLCD  KM E +K+L+EMVER+I PKDNS+F++L+  QCKSGHL+ AADVLK MIRLS
Sbjct: 394  LPGLCDAGKMVEVQKVLREMVERYIPPKDNSVFMKLLDVQCKSGHLNAAADVLKAMIRLS 453

Query: 1288 IPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCE 1467
            IPTE  HYG+LIENFCK   YDRAI LLDK+ EKEI+L P+S+ +ME ++YN MI++LC 
Sbjct: 454  IPTEAGHYGILIENFCKAEMYDRAIKLLDKLVEKEIILRPQSTLDMEASSYNLMIQHLCH 513

Query: 1468 HGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAY 1647
            +GQT KAE+FFRQLMKKGV DP AFNNL+ GHSKEG P+SAFEI+KIMGRRGV  DADAY
Sbjct: 514  NGQTGKAEIFFRQLMKKGVLDPVAFNNLIRGHSKEGNPDSAFEIVKIMGRRGVPRDADAY 573

Query: 1648 ELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLE 1827
              L++S+L+KGEPADAKTALD MIE GH P SSLFRSVM+SLF+DGRVQTASRVMKSM+E
Sbjct: 574  ICLIESYLRKGEPADAKTALDSMIEDGHSPASSLFRSVMESLFEDGRVQTASRVMKSMVE 633

Query: 1828 KGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALK 2007
            KGVKEN+DLVA+ILEALL+RGHVEEALGRI+L+M +  VP+FDSLL+VL EK KTIAA+K
Sbjct: 634  KGVKENLDLVAKILEALLMRGHVEEALGRIDLMMQSGSVPNFDSLLSVLSEKGKTIAAVK 693

Query: 2008 LLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSL 2187
            LLDF L RDC +  +SY+KVLDALLAAGKTLNAYSIL KIMEKGGVTDW SS + LI  L
Sbjct: 694  LLDFCLGRDCIIDLASYEKVLDALLAAGKTLNAYSILFKIMEKGGVTDWKSS-DKLIAGL 752

Query: 2188 NAEGNTKQADILKRMI 2235
            N EGNTKQADIL RMI
Sbjct: 753  NQEGNTKQADILSRMI 768


>ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform X1 [Solanum tuberosum]
          Length = 731

 Score =  998 bits (2579), Expect = 0.0
 Identities = 493/670 (73%), Positives = 577/670 (86%), Gaps = 2/670 (0%)
 Frame = +1

Query: 235  RTPRGKPPNP-EKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHA 411
            R P+G  P P EKLED+ICRMM+ RAWTTRLQNSIRN+VP FDH LVYNVLH A+NSEHA
Sbjct: 44   RIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAKNSEHA 103

Query: 412  LQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLID 588
            LQFFRWVE++G +RHDR TH KII+ILGRA KLNHARCIL DMP KGV+WDE+++VL+ID
Sbjct: 104  LQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLWVLMID 163

Query: 589  SYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIP 768
            SYGKAGIVQESVK+FQKM+ELGV+R++KSY+ LF VI RRGR  MAKRYFN M+++G+ P
Sbjct: 164  SYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNKMVNQGIEP 223

Query: 769  TRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFF 948
            T HTYNL++WGFFLS KV+TA RFFEDMK++GI+PDVVTYNTMING+ RVK+++EAEK+F
Sbjct: 224  TGHTYNLLIWGFFLSSKVDTAIRFFEDMKSKGIMPDVVTYNTMINGYIRVKKIEEAEKYF 283

Query: 949  VEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDT 1128
            VEMK +N+ PTVISYTT+IKGY +V R+DDA+RLF+EM  FGIK N  TYSTLLPGLCD 
Sbjct: 284  VEMKARNIEPTVISYTTLIKGYSAVERIDDAVRLFEEMKSFGIKPNAITYSTLLPGLCDA 343

Query: 1129 EKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISH 1308
            +KM EA  +LKEM +++I PKDNSIF+RLIS QC++G LD AADVLK MIRLS+PTE  H
Sbjct: 344  QKMSEAGAILKEMEDKYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTMIRLSVPTEAGH 403

Query: 1309 YGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKA 1488
            YGVLIENFCK G YDRA+  LDK+ EKEI+L P+SSS MEP+AYN +I YLC +GQT KA
Sbjct: 404  YGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMEPSAYNLIIDYLCNNGQTGKA 463

Query: 1489 EMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSF 1668
            E FFRQLMK GVQDP AFNNL+CGHS+EG+P+SAFE+LKIMGRR V  D  A++ LV+S+
Sbjct: 464  ETFFRQLMKTGVQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSDGIAHKSLVESY 523

Query: 1669 LKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENM 1848
            LKK EPADAK ALD M+E GH PDS L+RSVM+SL  DGRVQTASRVMK MLEKGVKE+M
Sbjct: 524  LKKREPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMKIMLEKGVKEHM 583

Query: 1849 DLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALE 2028
            DL++ ILEALL+RGHVEEALGRIELL+ N   PD D LL+VLCEK KT AALKLLDF LE
Sbjct: 584  DLISTILEALLMRGHVEEALGRIELLLHNSLSPDLDGLLSVLCEKGKTSAALKLLDFILE 643

Query: 2029 RDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTK 2208
            R+CN+ FSSYDKVLD+LLAAGKTLNAYSILCK+ME GGV D   SCE+LI+SLN EGNTK
Sbjct: 644  RNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKD-HKSCEELIKSLNDEGNTK 702

Query: 2209 QADILKRMIM 2238
            QADIL+RMI+
Sbjct: 703  QADILRRMIL 712


>ref|XP_002315730.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222864770|gb|EEF01901.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 760

 Score =  994 bits (2569), Expect = 0.0
 Identities = 497/687 (72%), Positives = 581/687 (84%), Gaps = 5/687 (0%)
 Frame = +1

Query: 190  DDQTSDPN-KEENVVRRTPRGKPPN--PEKLEDIICRMMANRAWTTRLQNSIRNLVPQFD 360
            D +T  PN  +E   +R PR K  +  PEKLEDIICRMMANR WTTRLQNSIR LVP+FD
Sbjct: 55   DPKTETPNVAQEKQYQRIPRAKQQHRSPEKLEDIICRMMANRDWTTRLQNSIRALVPEFD 114

Query: 361  HSLVYNVLHGARNSEHALQFFRWVEKTGY-RHDRSTHLKIIEILGRASKLNHARCILF-D 534
            HSLVYNVLHGAR  +HALQFFRWVE+ G  +HDR TH+KII+ILGR S LNHARCI+  D
Sbjct: 115  HSLVYNVLHGARKPDHALQFFRWVERAGLIQHDRETHMKIIQILGRYSMLNHARCIVLED 174

Query: 535  MPKKGVEWDEEMFVLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGR 714
            MPKKG E DE+MFVLLIDSYGKAGIVQESVK+F KMKELGV+RS+KSY+ LFKVI+R+GR
Sbjct: 175  MPKKGFELDEDMFVLLIDSYGKAGIVQESVKMFSKMKELGVERSVKSYNALFKVIVRKGR 234

Query: 715  VQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNT 894
              MAKR+FN ML EG+ PTRHTYN+++WGFFLS+++ TA RF+EDMK RGI PDVVTYNT
Sbjct: 235  YMMAKRFFNKMLDEGIGPTRHTYNVLIWGFFLSMRLRTAVRFYEDMKVRGISPDVVTYNT 294

Query: 895  MINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFG 1074
            MING+ R K+M+EAEK F EMK K++ PTVISYTTMIKGY +V R++D LRL +EM   G
Sbjct: 295  MINGYYRHKRMEEAEKLFAEMKAKDIAPTVISYTTMIKGYFAVDRINDGLRLLEEMKSVG 354

Query: 1075 IKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWA 1254
            IK N+ TY+TLLP LCD  KM EA+ +LKEMV R I PKDNSIFL+L++ QCK+G L  A
Sbjct: 355  IKPNNVTYTTLLPDLCDAGKMTEAKDILKEMVRRRIAPKDNSIFLKLLNSQCKAGDLKAA 414

Query: 1255 ADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPN 1434
             DVL GMI+LSIP+E  HYGVLIENFCK  EYD+A+  +DK+ E +I+L P+S+ EME  
Sbjct: 415  VDVLDGMIKLSIPSEAGHYGVLIENFCKAEEYDQAVKFVDKLIENDIILRPQSTLEMESG 474

Query: 1435 AYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMG 1614
            AYNP+I+YLC HGQT KAE+ FRQL+KKGV+DP AFNNL+CGH+KEG P+SAFEILKIMG
Sbjct: 475  AYNPVIQYLCSHGQTGKAEILFRQLLKKGVEDPLAFNNLICGHAKEGTPDSAFEILKIMG 534

Query: 1615 RRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQ 1794
            R+G+  DADAY LL++S+L+KGEPADAKTALD MIE GHLPDSS+FRSVM+SL++DGRVQ
Sbjct: 535  RKGIPRDADAYRLLIESYLRKGEPADAKTALDSMIEDGHLPDSSVFRSVMESLYEDGRVQ 594

Query: 1795 TASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVL 1974
            TASRVMKSM+EKGVKENMDLVA+ILEALL+RGH EEALGRI+LLM + C  +FDSLL++L
Sbjct: 595  TASRVMKSMVEKGVKENMDLVAKILEALLMRGHEEEALGRIDLLMSSQCNVNFDSLLSIL 654

Query: 1975 CEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDW 2154
             EK KTIAALKLLDF L+RDC++ F SYDKVLDALLAAGKTLNAYSILCKIMEKGGVT W
Sbjct: 655  SEKGKTIAALKLLDFGLQRDCDIDFKSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTSW 714

Query: 2155 FSSCEDLIRSLNAEGNTKQADILKRMI 2235
              S EDLI+SLN EGNTKQADIL RMI
Sbjct: 715  -RSYEDLIKSLNQEGNTKQADILSRMI 740


>ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            isoform 1 [Solanum lycopersicum]
            gi|460413221|ref|XP_004251993.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g37230-like isoform 2 [Solanum lycopersicum]
          Length = 731

 Score =  990 bits (2559), Expect = 0.0
 Identities = 490/670 (73%), Positives = 573/670 (85%), Gaps = 2/670 (0%)
 Frame = +1

Query: 235  RTPRGKPPNP-EKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHA 411
            R P+G  P P EKLED+ICRMM+ RAWTTRLQNSIRN+VP FDH LVYNVLH A+NSEHA
Sbjct: 44   RIPKGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAKNSEHA 103

Query: 412  LQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLID 588
            LQFFRWVE++G +RHDR TH KII+ILGRA KLNHARCIL DMP KGV+WDE+++VL+ID
Sbjct: 104  LQFFRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLWVLMID 163

Query: 589  SYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIP 768
            SYGKAGIVQESVK+FQKM+ELGV+R++KSY+ LF VI RRGR  MAKRYFN M+++G+ P
Sbjct: 164  SYGKAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNRMVNQGIEP 223

Query: 769  TRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFF 948
            T HTYNL++WGFFLS KV+TA RFFEDMK +GI+PDVVTYNTMING+  VK+++EAEK+F
Sbjct: 224  TGHTYNLLIWGFFLSSKVDTAIRFFEDMKGKGIMPDVVTYNTMINGYNCVKKIEEAEKYF 283

Query: 949  VEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDT 1128
            VEMK +N+ P VISYTT+IKGY +V R+DDAL+LF+EM  FGIK N  TYSTLLPGLCD 
Sbjct: 284  VEMKARNIEPNVISYTTLIKGYSAVERIDDALKLFEEMKSFGIKPNAITYSTLLPGLCDA 343

Query: 1129 EKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISH 1308
            +KM EA  +LKEM ER+I PKDNSIF+RLIS QC++G LD AADVLK MIRLS+PTE  H
Sbjct: 344  QKMSEAGTILKEMEERYIAPKDNSIFIRLISGQCEAGDLDAAADVLKTMIRLSVPTEAGH 403

Query: 1309 YGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKA 1488
            YGVLIENFCK G YDRA+  LDK+ EKEI+L P+SSS ME +AYN +I YLC +GQT KA
Sbjct: 404  YGVLIENFCKAGIYDRAVKFLDKLIEKEIVLRPQSSSSMETSAYNLIIDYLCNNGQTGKA 463

Query: 1489 EMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSF 1668
            E  FRQLMK G+QDP AFNNL+CGHS+EG+P+SAFE+LKIMGRR V  D+ A++ LV+S+
Sbjct: 464  ETLFRQLMKTGIQDPIAFNNLVCGHSREGVPDSAFELLKIMGRRKVLSDSIAHKSLVESY 523

Query: 1669 LKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENM 1848
            LKKGEPADAK ALD M+E GH PDS L+RSVM+SL  DGRVQTASRVMK MLEKGVKE+M
Sbjct: 524  LKKGEPADAKAALDNMLEHGHDPDSLLYRSVMESLMGDGRVQTASRVMKIMLEKGVKEHM 583

Query: 1849 DLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALE 2028
            DL++ ILEALL+RGHVEEA GRIELL+ N   PD D LL+VLCEK KT AALKLLDF LE
Sbjct: 584  DLISTILEALLMRGHVEEAFGRIELLLHNSLSPDLDGLLSVLCEKGKTTAALKLLDFILE 643

Query: 2029 RDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTK 2208
            R+CN+ FSSYDKVLD+LLAAGKTLNAYSILCK+ME GGV D   SCE+LI+SLN EGNTK
Sbjct: 644  RNCNIDFSSYDKVLDSLLAAGKTLNAYSILCKMMENGGVKD-HKSCEELIKSLNDEGNTK 702

Query: 2209 QADILKRMIM 2238
            QADIL+RMI+
Sbjct: 703  QADILRRMIL 712


>ref|XP_007225233.1| hypothetical protein PRUPE_ppa001877mg [Prunus persica]
            gi|462422169|gb|EMJ26432.1| hypothetical protein
            PRUPE_ppa001877mg [Prunus persica]
          Length = 749

 Score =  982 bits (2539), Expect = 0.0
 Identities = 510/749 (68%), Positives = 597/749 (79%), Gaps = 9/749 (1%)
 Frame = +1

Query: 16   MAYISSSATKPLLWKXXXXXXXXXXXNPLFLYSLHGFCSNLSXXXXXXXXXXXXXISADD 195
            MAYIS S  KP  W+           NP  L     F S  +              S + 
Sbjct: 1    MAYISLS--KPFQWRPRPS-------NPQTLTLFRLFSSTEAATGA----------STEA 41

Query: 196  QTSDPNKEENVVRRT--PRGKPP---NPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFD 360
             T  PN ++  V  T  P+ +     N EK+EDIICRMMANR WTTRLQNSIRNLVP+FD
Sbjct: 42   PTETPNPQDGSVTPTHVPKARQHRTRNAEKIEDIICRMMANRVWTTRLQNSIRNLVPEFD 101

Query: 361  HSLVYNVLHGARNSEHALQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDM 537
            H+LV+NVLHGAR+ EHALQFFRWVE++G ++HDR THLKIIEIL R SKLNHARCIL DM
Sbjct: 102  HNLVWNVLHGARSWEHALQFFRWVERSGLFKHDRETHLKIIEILSRNSKLNHARCILLDM 161

Query: 538  PKKGVEWDEEMFVLLIDSYGKAG---IVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRR 708
            PKKGV+ DE++F+ LID YGK+    I+QESVK+F KMKELGV+RS+KSY+ L+K I+R 
Sbjct: 162  PKKGVQLDEDLFIGLIDGYGKSDKGCIIQESVKLFIKMKELGVERSLKSYEALYKAILRW 221

Query: 709  GRVQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTY 888
            GR  MAKRYFN+MLSEG+ PTRHTYN+M+WGF  S K+ETA RFFEDMK+RGI PD+VTY
Sbjct: 222  GRCMMAKRYFNAMLSEGIEPTRHTYNVMIWGFLKSRKLETAKRFFEDMKSRGISPDLVTY 281

Query: 889  NTMINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGL 1068
            NTMI+G+ RV +MDE+E+ FVE+KG+N+ P VISYTTMIKGYVSV RVDD LRLF EM  
Sbjct: 282  NTMIHGYIRVDKMDESEQLFVELKGRNIEPNVISYTTMIKGYVSVGRVDDGLRLFGEMKS 341

Query: 1069 FGIKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLD 1248
            FGI+ N  T+STLLPGLCD EK   A K+L EMV ++I P DNSIF RL+S QCKSG +D
Sbjct: 342  FGIRPNAVTFSTLLPGLCDAEKKDAAHKVLMEMVSKYIAPIDNSIFERLLSLQCKSGDMD 401

Query: 1249 WAADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEME 1428
             AA VLK MIRL IPTE  HYG+LIENFCK G YD+A+ LLDK+ EKEI+L P++S E+E
Sbjct: 402  AAAYVLKAMIRLRIPTEAGHYGILIENFCKAGVYDQAVKLLDKLIEKEIILRPQNSIELE 461

Query: 1429 PNAYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKI 1608
            P+A+NPMI+YLC HGQT KAE FFRQLMKKGV+D  AFNNLL GH+KEG  +SAFEIL+I
Sbjct: 462  PSAFNPMIEYLCNHGQTGKAEAFFRQLMKKGVEDSVAFNNLLRGHAKEGNSDSAFEILRI 521

Query: 1609 MGRRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGR 1788
            M RRG+  +AD+Y LL+KS+L KGEPADAKTALD MIE GH+P+SSLFRSV++SLF+DGR
Sbjct: 522  MNRRGIPGEADSYILLIKSYLSKGEPADAKTALDSMIEGGHIPESSLFRSVIESLFEDGR 581

Query: 1789 VQTASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLA 1968
            VQTASRVMKSM+EKGV ENMDLVA+ILEAL +RGHVEEALGRI+LLM + C   FDSLL+
Sbjct: 582  VQTASRVMKSMVEKGVMENMDLVAKILEALFMRGHVEEALGRIDLLMQSGCALQFDSLLS 641

Query: 1969 VLCEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVT 2148
            VL +K KTIAALKLLDF LERDC+V FSSYDKVLDALLA+GKTLNAYSILCK+MEKGG+T
Sbjct: 642  VLADKGKTIAALKLLDFCLERDCSVDFSSYDKVLDALLASGKTLNAYSILCKLMEKGGIT 701

Query: 2149 DWFSSCEDLIRSLNAEGNTKQADILKRMI 2235
            DW SS EDLI+SLN EGNTKQADIL RMI
Sbjct: 702  DW-SSTEDLIKSLNQEGNTKQADILSRMI 729


>ref|XP_003524868.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max]
          Length = 733

 Score =  974 bits (2517), Expect = 0.0
 Identities = 482/665 (72%), Positives = 572/665 (86%), Gaps = 4/665 (0%)
 Frame = +1

Query: 253  PPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHALQFFRWV 432
            PP    LE  IC+MM+NRAWTTRLQNSIR+LVP+FD SLVYNVLHGA + EHALQF+RWV
Sbjct: 50   PPREHNLELTICKMMSNRAWTTRLQNSIRSLVPEFDPSLVYNVLHGAASPEHALQFYRWV 109

Query: 433  EKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEW---DEEMFVLLIDSYGK 600
            E+ G + H   T LKI++ILGR SKLNHARCILF+  + GV      E+ FV LIDSYG+
Sbjct: 110  ERAGLFTHTPETTLKIVQILGRYSKLNHARCILFNDTRGGVSRAAVTEDAFVSLIDSYGR 169

Query: 601  AGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPTRHT 780
            AGIVQESVK+F+KMKELG+ R++KSYD LFKVI+RRGR  MAKRY+N+ML EGV PTRHT
Sbjct: 170  AGIVQESVKLFKKMKELGLDRTVKSYDALFKVILRRGRYMMAKRYYNAMLLEGVDPTRHT 229

Query: 781  YNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFVEMK 960
            +N++LWG FLSL+++TA RF+EDMK+RGI+PDVVTYNT+ING+ R K++DEAEK FVEMK
Sbjct: 230  FNILLWGMFLSLRLDTAVRFYEDMKSRGILPDVVTYNTLINGYFRFKKVDEAEKLFVEMK 289

Query: 961  GKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTEKMP 1140
            G++++P VIS+TTM+KGYV+  R+DDAL++F+EM   G+K N  T+STLLPGLCD EKM 
Sbjct: 290  GRDIVPNVISFTTMLKGYVAAGRIDDALKVFEEMKGCGVKPNVVTFSTLLPGLCDAEKMA 349

Query: 1141 EARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHYGVL 1320
            EAR +L EMVER+I PKDN++F++++SCQCK+G LD AADVLK M+RLSIPTE  HYGVL
Sbjct: 350  EARDVLGEMVERYIAPKDNALFMKMMSCQCKAGDLDAAADVLKAMVRLSIPTEAGHYGVL 409

Query: 1321 IENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAEMFF 1500
            IE+FCK   YD+A  LLDK+ EKEI+L P++ SEMEP+AYN MI YLCEHG+T KAE FF
Sbjct: 410  IESFCKANVYDKAEKLLDKLIEKEIVLRPQNDSEMEPSAYNLMIGYLCEHGRTGKAETFF 469

Query: 1501 RQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFLKKG 1680
            RQL+KKGVQD  AFNNL+ GHSKEG P+SAFEI+KIMGRRGV  D D+Y LL++S+L+KG
Sbjct: 470  RQLLKKGVQDSVAFNNLIRGHSKEGNPDSAFEIMKIMGRRGVARDVDSYRLLIESYLRKG 529

Query: 1681 EPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEKGVKENMDLVA 1860
            EPADAKTALDGM+ESGHLP+SSL+RSVM+SLFDDGRVQTASRVMKSM+EKG KENMDLV 
Sbjct: 530  EPADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSMVEKGAKENMDLVL 589

Query: 1861 RILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFALERDCN 2040
            +ILEALL+RGHVEEALGRI+LLM N C PDFD LL+VLCEKEKTIAALKLLDF LERDC 
Sbjct: 590  KILEALLLRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLDFVLERDCI 649

Query: 2041 VSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNTKQADI 2220
            + FS YDKVLDALLAAGKTLNAYSILCKI+EKGG TDW SS ++LI+SLN EGNTKQAD+
Sbjct: 650  IDFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDW-SSRDELIKSLNQEGNTKQADV 708

Query: 2221 LKRMI 2235
            L RMI
Sbjct: 709  LSRMI 713


>ref|XP_003532699.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max]
          Length = 738

 Score =  971 bits (2510), Expect = 0.0
 Identities = 488/694 (70%), Positives = 581/694 (83%), Gaps = 9/694 (1%)
 Frame = +1

Query: 181  ISADDQTSDPNKEENVVRRTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFD 360
            +S  D  S P+ +     + P   PP    LE  IC+MM+NRAWTTRLQNSIR+LVP+FD
Sbjct: 30   LSETDHPSPPSPQP----QPPPIIPPRENNLELTICKMMSNRAWTTRLQNSIRSLVPEFD 85

Query: 361  HSLVYNVLHGARNSEHALQFFRWVEKTG-YRHDRSTHLKIIEILGRASKLNHARCILFDM 537
             SLVYNVLHGA + EHALQF+RWVE+ G + H   T LKI++ILGR SKLNHARCILFD 
Sbjct: 86   PSLVYNVLHGAASPEHALQFYRWVERAGLFTHTPETTLKIVQILGRYSKLNHARCILFDD 145

Query: 538  PKKGVEW---DEEMFVLLIDSYGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRR 708
             + G       E+ FV LIDSYG+AGIVQESVK+F+KMKELGV R++KSYD LFKVI+RR
Sbjct: 146  TRGGASRATVTEDAFVSLIDSYGRAGIVQESVKLFKKMKELGVDRTVKSYDALFKVILRR 205

Query: 709  GRVQMAKRYFNSMLSEGVIPTRHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTY 888
            GR  MAKRY+N+ML+E V PTRHTYN++LWG FLSL+++TA RF+EDMK+RGI+PDVVTY
Sbjct: 206  GRYMMAKRYYNAMLNESVEPTRHTYNILLWGMFLSLRLDTAVRFYEDMKSRGILPDVVTY 265

Query: 889  NTMINGFCRVKQMDEAEKFFVEMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGL 1068
            NT+ING+ R K+++EAEK FVEMKG++++P VIS+TTM+KGYV+  ++DDAL++F+EM  
Sbjct: 266  NTLINGYFRFKKVEEAEKLFVEMKGRDIVPNVISFTTMLKGYVAAGQIDDALKVFEEMKG 325

Query: 1069 FGIKANDFTYSTLLPGLCDTEKMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLD 1248
             G+K N  T+STLLPGLCD EKM EAR +L EMVER+I PKDN++F++L+SCQCK+G LD
Sbjct: 326  CGVKPNAVTFSTLLPGLCDAEKMAEARDVLGEMVERYIAPKDNAVFMKLMSCQCKAGDLD 385

Query: 1249 WAADVLKGMIRLSIPTEISHYGVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSS---- 1416
             A DVLK MIRLSIPTE  HYGVLIENFCK   YD+A  LLDK+ EKEI+L  +++    
Sbjct: 386  AAGDVLKAMIRLSIPTEAGHYGVLIENFCKANLYDKAEKLLDKMIEKEIVLRQKNAYETE 445

Query: 1417 -SEMEPNAYNPMIKYLCEHGQTLKAEMFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAF 1593
              EMEP+AYN MI YLCEHG+T KAE FFRQLMKKGVQD  +FNNL+CGHSKEG P+SAF
Sbjct: 446  LFEMEPSAYNLMIGYLCEHGRTGKAETFFRQLMKKGVQDSVSFNNLICGHSKEGNPDSAF 505

Query: 1594 EILKIMGRRGVCPDADAYELLVKSFLKKGEPADAKTALDGMIESGHLPDSSLFRSVMQSL 1773
            EI+KIMGRRGV  DAD+Y LL++S+L+KGEPADAKTALDGM+ESGHLP+SSL+RSVM+SL
Sbjct: 506  EIIKIMGRRGVARDADSYRLLIESYLRKGEPADAKTALDGMLESGHLPESSLYRSVMESL 565

Query: 1774 FDDGRVQTASRVMKSMLEKGVKENMDLVARILEALLIRGHVEEALGRIELLMGNDCVPDF 1953
            FDDGRVQTASRVMKSM+EKGVKENMDLV+++LEALL+RGHVEEALGRI LLM N C PDF
Sbjct: 566  FDDGRVQTASRVMKSMVEKGVKENMDLVSKVLEALLMRGHVEEALGRIHLLMLNGCEPDF 625

Query: 1954 DSLLAVLCEKEKTIAALKLLDFALERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIME 2133
            D LL+VLCEKEKTIAALKLLDF LERDC + FS YDKVLDALLAAGKTLNAYSILCKI+E
Sbjct: 626  DHLLSVLCEKEKTIAALKLLDFVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILE 685

Query: 2134 KGGVTDWFSSCEDLIRSLNAEGNTKQADILKRMI 2235
            KGG TDW SS ++LI+SLN EGNTKQAD+L RMI
Sbjct: 686  KGGSTDW-SSRDELIKSLNQEGNTKQADVLSRMI 718


>ref|XP_006410903.1| hypothetical protein EUTSA_v10017966mg [Eutrema salsugineum]
            gi|557112072|gb|ESQ52356.1| hypothetical protein
            EUTSA_v10017966mg [Eutrema salsugineum]
          Length = 761

 Score =  970 bits (2508), Expect = 0.0
 Identities = 483/670 (72%), Positives = 568/670 (84%), Gaps = 3/670 (0%)
 Frame = +1

Query: 235  RTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHAL 414
            R  RGK  N EKLED ICRMM NR WTTRLQNSIR+LVP++DHSLVYNVLHGAR  +HAL
Sbjct: 76   RFQRGKRQNHEKLEDTICRMMDNREWTTRLQNSIRDLVPEWDHSLVYNVLHGARKLDHAL 135

Query: 415  QFFRWVEKTGY-RHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDS 591
            QFFRW E++G  RHDR TH+K+IE+LG+ASKLNHARCIL DMP+KG+ WDE+MFV+LI+S
Sbjct: 136  QFFRWSERSGLIRHDRDTHMKMIEMLGQASKLNHARCILLDMPEKGIPWDEDMFVVLIES 195

Query: 592  YGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPT 771
            YGKAGIVQESVKIFQKMK+LGV+R+IKSYDTLFKVI+RRGR  MAKRYFN M+SEG+ PT
Sbjct: 196  YGKAGIVQESVKIFQKMKDLGVERTIKSYDTLFKVILRRGRYMMAKRYFNKMVSEGIEPT 255

Query: 772  RHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFV 951
            RHTYNLMLWGFFLSL++ETA RF+EDM +RGI PDVVTYNTMING+CR K+MDEAEK FV
Sbjct: 256  RHTYNLMLWGFFLSLRLETALRFYEDMISRGISPDVVTYNTMINGYCRFKKMDEAEKVFV 315

Query: 952  EMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTE 1131
            EMKGKN+ P+V+SYTTMIKGY++V RVDD LR+FDEM  FGI+ N  TYSTLLPGLCD  
Sbjct: 316  EMKGKNIEPSVVSYTTMIKGYLAVERVDDGLRIFDEMRSFGIEPNATTYSTLLPGLCDAG 375

Query: 1132 KMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHY 1311
            KM EA+ +LK M+ +HI PKDNSIFL+L+  Q K+G +  A +VLK M  L++P E  HY
Sbjct: 376  KMVEAKSILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVPAEAGHY 435

Query: 1312 GVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAE 1491
            GVLIEN CK   ++RAI LLD + EKEI+L  + + EMEPNAYNP+I+YLC +GQT KAE
Sbjct: 436  GVLIENQCKANAHNRAIKLLDILVEKEIILRHQDTLEMEPNAYNPIIEYLCNNGQTSKAE 495

Query: 1492 MFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFL 1671
            + FRQLMK+GVQD  A NNL+ GH+KEG P+S++EILKIM RRGV  DA+AYELL+KS++
Sbjct: 496  VLFRQLMKRGVQDQEALNNLIRGHAKEGNPDSSYEILKIMSRRGVPRDANAYELLIKSYM 555

Query: 1672 KKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEK--GVKEN 1845
             KGEP DAKTALD M+E GH+PDSSLFRSV++SLF+DGRVQTASRVM  M++K  G+++N
Sbjct: 556  SKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQTASRVMMIMIDKNVGIEDN 615

Query: 1846 MDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFAL 2025
            MDLVA+ILEALL+RGHVEEALGRI+LL  N    D DSLL+VL EK KTIAALKLLDF L
Sbjct: 616  MDLVAKILEALLMRGHVEEALGRIDLLNQNGHSADLDSLLSVLSEKGKTIAALKLLDFGL 675

Query: 2026 ERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNT 2205
            ERD ++ FSSYDKVLDALL AGKTLNAYS+LCKIM KG VTDW  SC+DLI+SLN EGNT
Sbjct: 676  ERDLSLDFSSYDKVLDALLGAGKTLNAYSVLCKIMAKGSVTDW-KSCDDLIKSLNQEGNT 734

Query: 2206 KQADILKRMI 2235
            KQAD+L RMI
Sbjct: 735  KQADVLSRMI 744


>ref|XP_002881498.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297327337|gb|EFH57757.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 756

 Score =  957 bits (2474), Expect = 0.0
 Identities = 476/670 (71%), Positives = 563/670 (84%), Gaps = 3/670 (0%)
 Frame = +1

Query: 235  RTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHAL 414
            R  RGK  N EKLED ICRMM NRAWTTRLQNSIR+LVP++DHSLVYNVLHGA+  EHAL
Sbjct: 74   RFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEWDHSLVYNVLHGAKKLEHAL 133

Query: 415  QFFRWVEKTGY-RHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDS 591
            QFFRW E++G  RHDR TH+K+I++LG   KLNHARCIL DMP+KGV WDE+MFV+LI+S
Sbjct: 134  QFFRWTERSGLIRHDRDTHMKMIKMLGEVQKLNHARCILLDMPEKGVPWDEDMFVVLIES 193

Query: 592  YGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPT 771
            YGKAGIVQESVKIFQKMK+LGV+R+IKSY+TLFKVI+RRGR  MAKRYFN M+SEGV PT
Sbjct: 194  YGKAGIVQESVKIFQKMKDLGVERTIKSYNTLFKVILRRGRYMMAKRYFNKMVSEGVEPT 253

Query: 772  RHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFV 951
            RHTYNLMLWGFFLSL++ETA RFF+DMK RGI PD VTYNT+ING+CR K+MDEAEK FV
Sbjct: 254  RHTYNLMLWGFFLSLRLETALRFFDDMKTRGISPDAVTYNTIINGYCRFKKMDEAEKLFV 313

Query: 952  EMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTE 1131
            EMKG N  P+V++YTTMIKGY+SV RVDD LR+F+EM  FGI+ N  TYSTLLPGLCD  
Sbjct: 314  EMKGNNSEPSVVTYTTMIKGYLSVDRVDDGLRIFEEMRSFGIEPNATTYSTLLPGLCDVG 373

Query: 1132 KMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHY 1311
            KM EA+ +LK M+ +HI PKDNSIFL+L+  Q K+G +  A +VLK M  L++P E  HY
Sbjct: 374  KMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVPAEAGHY 433

Query: 1312 GVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAE 1491
            GVLIEN CK   Y+RAI LLD + EKEI+L  + + EMEP+AYNP+I+YLC +GQT KAE
Sbjct: 434  GVLIENQCKASAYNRAIKLLDTLIEKEIILRHQDTLEMEPSAYNPIIEYLCNNGQTAKAE 493

Query: 1492 MFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFL 1671
            + FRQLMK+GVQD  A NNL+ GH+KEG PES++EILKIM RRGV  +A+AYELL+KS++
Sbjct: 494  VLFRQLMKRGVQDQDALNNLIRGHAKEGNPESSYEILKIMSRRGVPREANAYELLIKSYM 553

Query: 1672 KKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEK--GVKEN 1845
             KGEP DAKTALD M+E GH+PDS+LFRSV++SLF+DGRVQTASRVM  M++K  G+++N
Sbjct: 554  SKGEPGDAKTALDSMVEDGHVPDSALFRSVIESLFEDGRVQTASRVMMIMIDKNVGIEDN 613

Query: 1846 MDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFAL 2025
            MDL+A+ILEALL+RGHVEEALGRI+LL  N    D DSLL+VL EK KTIAALKLLDF L
Sbjct: 614  MDLIAKILEALLMRGHVEEALGRIDLLNQNGHTADLDSLLSVLSEKGKTIAALKLLDFGL 673

Query: 2026 ERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNT 2205
            ERD ++ FSSYDKVLDALL AGKTLNAYS+LCKIMEKG  TDW SS ++LI+SLN EGNT
Sbjct: 674  ERDLSLDFSSYDKVLDALLGAGKTLNAYSVLCKIMEKGSSTDWKSS-DELIKSLNQEGNT 732

Query: 2206 KQADILKRMI 2235
            KQAD+L RMI
Sbjct: 733  KQADVLSRMI 742


>ref|XP_006296196.1| hypothetical protein CARUB_v10025361mg [Capsella rubella]
            gi|482564904|gb|EOA29094.1| hypothetical protein
            CARUB_v10025361mg [Capsella rubella]
          Length = 757

 Score =  957 bits (2473), Expect = 0.0
 Identities = 476/670 (71%), Positives = 563/670 (84%), Gaps = 3/670 (0%)
 Frame = +1

Query: 235  RTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHAL 414
            R  RGK  N EKLED ICRMM NRAWTTRLQNSIR+LVP++DHSLVYNVLHGA+  EHAL
Sbjct: 75   RFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEWDHSLVYNVLHGAKKLEHAL 134

Query: 415  QFFRWVEKTGY-RHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDS 591
            QFFRW E++G  RHDR TH+K+I++LG   K+N+ARCIL DMP+KGV WDE+MFV+LI+S
Sbjct: 135  QFFRWTERSGLIRHDRDTHMKMIKMLGEVQKVNYARCILLDMPEKGVPWDEDMFVVLIES 194

Query: 592  YGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPT 771
            YGKAGIVQESVKIFQKMK+LGV+R+IKSY+TLFKVIMRRGR  MAKRYFN M+SEGV PT
Sbjct: 195  YGKAGIVQESVKIFQKMKDLGVERTIKSYNTLFKVIMRRGRYMMAKRYFNKMVSEGVEPT 254

Query: 772  RHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFV 951
            RHTYNLMLWGFFLSL++ETA RFFEDMK RGI PD VTYNTMING+CR K+MDEAEK FV
Sbjct: 255  RHTYNLMLWGFFLSLRLETALRFFEDMKTRGISPDAVTYNTMINGYCRFKKMDEAEKLFV 314

Query: 952  EMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTE 1131
            EMKG N+ P+V+SYTTMIKGY+SV RVDD LR+F+EM   GI+ N  TYST+LPGLCD  
Sbjct: 315  EMKGNNIEPSVVSYTTMIKGYLSVDRVDDGLRIFEEMRSSGIEPNATTYSTVLPGLCDAG 374

Query: 1132 KMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHY 1311
            KM EA+ +LK M+ +HI PKDNSIFL+L+  Q K+G +  A +VLK M  L++P E  HY
Sbjct: 375  KMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVPAEAGHY 434

Query: 1312 GVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAE 1491
            GVLIEN CK   Y+RAI LLD + EKEI+L  + + EMEP+AYNP+I+YLC +GQT KAE
Sbjct: 435  GVLIENQCKANAYNRAIKLLDTLLEKEIILRHQDTLEMEPSAYNPIIEYLCNNGQTSKAE 494

Query: 1492 MFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFL 1671
            + FRQLMK+GVQD  A NNL+ GH+KEG P+S++EILKIM RRGV  +A+AYELL+KS++
Sbjct: 495  VLFRQLMKRGVQDQDALNNLISGHAKEGNPDSSYEILKIMSRRGVPREANAYELLIKSYM 554

Query: 1672 KKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEK--GVKEN 1845
             KGEP DAKTALD M+E GH+PDSSLFRSV++SLF+DGRVQTASRVM  M++K  G++EN
Sbjct: 555  SKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQTASRVMMIMIDKNVGIEEN 614

Query: 1846 MDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFAL 2025
            MDL+A+ILEALL+RGHVEEALGRI+LL  N    D DSLL+VL EK KTIAALKLLDF L
Sbjct: 615  MDLIAKILEALLMRGHVEEALGRIDLLNQNGHAADLDSLLSVLSEKGKTIAALKLLDFGL 674

Query: 2026 ERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNT 2205
            ERD ++ FSSY+KVLDALL AGKTLNAYS+LCKIMEKG  TDW SS ++LI+SLN EGNT
Sbjct: 675  ERDLSLDFSSYEKVLDALLGAGKTLNAYSVLCKIMEKGSATDWKSS-DELIKSLNQEGNT 733

Query: 2206 KQADILKRMI 2235
            KQAD+L RMI
Sbjct: 734  KQADVLSRMI 743


>ref|NP_181260.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75216851|sp|Q9ZUU3.1|PP190_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g37230 gi|4056478|gb|AAC98044.1| unknown protein
            [Arabidopsis thaliana] gi|28973644|gb|AAO64144.1| unknown
            protein [Arabidopsis thaliana]
            gi|110736716|dbj|BAF00321.1| hypothetical protein
            [Arabidopsis thaliana] gi|330254276|gb|AEC09370.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 757

 Score =  951 bits (2459), Expect = 0.0
 Identities = 474/670 (70%), Positives = 562/670 (83%), Gaps = 3/670 (0%)
 Frame = +1

Query: 235  RTPRGKPPNPEKLEDIICRMMANRAWTTRLQNSIRNLVPQFDHSLVYNVLHGARNSEHAL 414
            R  RGK  N EKLED ICRMM NRAWTTRLQNSIR+LVP++DHSLVYNVLHGA+  EHAL
Sbjct: 75   RFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEWDHSLVYNVLHGAKKLEHAL 134

Query: 415  QFFRWVEKTGY-RHDRSTHLKIIEILGRASKLNHARCILFDMPKKGVEWDEEMFVLLIDS 591
            QFFRW E++G  RHDR TH+K+I++LG  SKLNHARCIL DMP+KGV WDE+MFV+LI+S
Sbjct: 135  QFFRWTERSGLIRHDRDTHMKMIKMLGEVSKLNHARCILLDMPEKGVPWDEDMFVVLIES 194

Query: 592  YGKAGIVQESVKIFQKMKELGVKRSIKSYDTLFKVIMRRGRVQMAKRYFNSMLSEGVIPT 771
            YGKAGIVQESVKIFQKMK+LGV+R+IKSY++LFKVI+RRGR  MAKRYFN M+SEGV PT
Sbjct: 195  YGKAGIVQESVKIFQKMKDLGVERTIKSYNSLFKVILRRGRYMMAKRYFNKMVSEGVEPT 254

Query: 772  RHTYNLMLWGFFLSLKVETANRFFEDMKNRGIVPDVVTYNTMINGFCRVKQMDEAEKFFV 951
            RHTYNLMLWGFFLSL++ETA RFFEDMK RGI PD  T+NTMINGFCR K+MDEAEK FV
Sbjct: 255  RHTYNLMLWGFFLSLRLETALRFFEDMKTRGISPDDATFNTMINGFCRFKKMDEAEKLFV 314

Query: 952  EMKGKNLIPTVISYTTMIKGYVSVSRVDDALRLFDEMGLFGIKANDFTYSTLLPGLCDTE 1131
            EMKG  + P+V+SYTTMIKGY++V RVDD LR+F+EM   GI+ N  TYSTLLPGLCD  
Sbjct: 315  EMKGNKIGPSVVSYTTMIKGYLAVDRVDDGLRIFEEMRSSGIEPNATTYSTLLPGLCDAG 374

Query: 1132 KMPEARKLLKEMVERHIVPKDNSIFLRLISCQCKSGHLDWAADVLKGMIRLSIPTEISHY 1311
            KM EA+ +LK M+ +HI PKDNSIFL+L+  Q K+G +  A +VLK M  L++P E  HY
Sbjct: 375  KMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAATEVLKAMATLNVPAEAGHY 434

Query: 1312 GVLIENFCKVGEYDRAINLLDKVCEKEILLNPRSSSEMEPNAYNPMIKYLCEHGQTLKAE 1491
            GVLIEN CK   Y+RAI LLD + EKEI+L  + + EMEP+AYNP+I+YLC +GQT KAE
Sbjct: 435  GVLIENQCKASAYNRAIKLLDTLIEKEIILRHQDTLEMEPSAYNPIIEYLCNNGQTAKAE 494

Query: 1492 MFFRQLMKKGVQDPSAFNNLLCGHSKEGMPESAFEILKIMGRRGVCPDADAYELLVKSFL 1671
            + FRQLMK+GVQD  A NNL+ GH+KEG P+S++EILKIM RRGV  +++AYELL+KS++
Sbjct: 495  VLFRQLMKRGVQDQDALNNLIRGHAKEGNPDSSYEILKIMSRRGVPRESNAYELLIKSYM 554

Query: 1672 KKGEPADAKTALDGMIESGHLPDSSLFRSVMQSLFDDGRVQTASRVMKSMLEK--GVKEN 1845
             KGEP DAKTALD M+E GH+PDSSLFRSV++SLF+DGRVQTASRVM  M++K  G+++N
Sbjct: 555  SKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQTASRVMMIMIDKNVGIEDN 614

Query: 1846 MDLVARILEALLIRGHVEEALGRIELLMGNDCVPDFDSLLAVLCEKEKTIAALKLLDFAL 2025
            MDL+A+ILEALL+RGHVEEALGRI+LL  N    D DSLL+VL EK KTIAALKLLDF L
Sbjct: 615  MDLIAKILEALLMRGHVEEALGRIDLLNQNGHTADLDSLLSVLSEKGKTIAALKLLDFGL 674

Query: 2026 ERDCNVSFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGVTDWFSSCEDLIRSLNAEGNT 2205
            ERD ++ FSSYDKVLDALL AGKTLNAYS+LCKIMEKG  TDW SS ++LI+SLN EGNT
Sbjct: 675  ERDLSLEFSSYDKVLDALLGAGKTLNAYSVLCKIMEKGSSTDWKSS-DELIKSLNQEGNT 733

Query: 2206 KQADILKRMI 2235
            KQAD+L RMI
Sbjct: 734  KQADVLSRMI 743

