BLASTX nr result

ID: Glycyrrhiza28_contig00016731 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza28_contig00016731
         (1616 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KYP41091.1 Pentatricopeptide repeat-containing protein At2g37230...   736   0.0  
KHN46063.1 Pentatricopeptide repeat-containing protein [Glycine ...   730   0.0  
KHN12442.1 Pentatricopeptide repeat-containing protein [Glycine ...   726   0.0  
XP_004488235.1 PREDICTED: pentatricopeptide repeat-containing pr...   730   0.0  
XP_003524868.1 PREDICTED: pentatricopeptide repeat-containing pr...   729   0.0  
XP_003532699.1 PREDICTED: pentatricopeptide repeat-containing pr...   728   0.0  
XP_007158766.1 hypothetical protein PHAVU_002G180100g [Phaseolus...   709   0.0  
XP_016172946.1 PREDICTED: pentatricopeptide repeat-containing pr...   709   0.0  
XP_014510338.1 PREDICTED: pentatricopeptide repeat-containing pr...   709   0.0  
XP_017425821.1 PREDICTED: pentatricopeptide repeat-containing pr...   707   0.0  
XP_015931857.1 PREDICTED: pentatricopeptide repeat-containing pr...   706   0.0  
XP_015939760.1 PREDICTED: pentatricopeptide repeat-containing pr...   703   0.0  
XP_019464106.1 PREDICTED: pentatricopeptide repeat-containing pr...   701   0.0  
XP_013463303.1 pentatricopeptide (PPR) repeat protein [Medicago ...   699   0.0  
XP_018837769.1 PREDICTED: pentatricopeptide repeat-containing pr...   671   0.0  
GAU37748.1 hypothetical protein TSUD_102640 [Trifolium subterran...   666   0.0  
XP_010111755.1 hypothetical protein L484_008414 [Morus notabilis...   668   0.0  
EOY04385.1 Tetratricopeptide repeat (TPR)-like superfamily prote...   659   0.0  
XP_007033459.2 PREDICTED: pentatricopeptide repeat-containing pr...   658   0.0  
XP_008338950.1 PREDICTED: pentatricopeptide repeat-containing pr...   657   0.0  

>KYP41091.1 Pentatricopeptide repeat-containing protein At2g37230 family [Cajanus
            cajan]
          Length = 740

 Score =  736 bits (1900), Expect = 0.0
 Identities = 378/457 (82%), Positives = 409/457 (89%), Gaps = 5/457 (1%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            ENLF EMKG+ IAPNVIS+TTMLKG VAAGRID A+EVF+EMK+CGIKPNAVTFSTLLPG
Sbjct: 284  ENLFAEMKGKEIAPNVISFTTMLKGYVAAGRIDGAMEVFQEMKSCGIKPNAVTFSTLLPG 343

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K  EAR VL EMVER++APKDNAVFMKLL+  CK+GDLD AADVLK MIRLSIPT
Sbjct: 344  LCDAEKTAEAREVLAEMVERFIAPKDNAVFMKLLSCHCKAGDLDAAADVLKGMIRLSIPT 403

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIE+FCKA+V+D+A              RPQ+++E+EPSAYN MI+YLCDHG+
Sbjct: 404  EAGHYGVLIESFCKASVYDKAEKLLDKLIEKEIVLRPQNAFEMEPSAYNLMIEYLCDHGR 463

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAET FRQLMKKGV DS AFN+LIRGH+KEGNPDSA EI+KIMGRRGVPRDADSYKLL
Sbjct: 464  TGKAETFFRQLMKKGVQDSAAFNSLIRGHSKEGNPDSAFEIIKIMGRRGVPRDADSYKLL 523

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSM----V 902
            IESYL KGEPADAKT LDSMLESGH PESSLYRS+MESLF+DGRVQTASRVMKSM    V
Sbjct: 524  IESYLRKGEPADAKTALDSMLESGHQPESSLYRSIMESLFDDGRVQTASRVMKSMVEKGV 583

Query: 903  EKGVKENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAAL 1082
            EKGVKENMDLVSKILEALLMRGHVEEALGRIDLL +NGCE D DHLLSVLCEKEKTIAAL
Sbjct: 584  EKGVKENMDLVSKILEALLMRGHVEEALGRIDLLTVNGCEPDFDHLLSVLCEKEKTIAAL 643

Query: 1083 KLLDFVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSL 1262
            KLLDFVLERD II FSIYDKVLDALLAAGKTLNA+SILCKILEKRG+TDWSSRDELIKSL
Sbjct: 644  KLLDFVLERDCIIEFSIYDKVLDALLAAGKTLNAYSILCKILEKRGSTDWSSRDELIKSL 703

Query: 1263 NQEGNTKQADVLSRMIKEKEGSPL-KREGKKKATHAT 1370
            NQEGNTKQADVLSRMIK  +G P+ KREGK+KAT AT
Sbjct: 704  NQEGNTKQADVLSRMIKGTDGGPVKKREGKRKATLAT 740


>KHN46063.1 Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 671

 Score =  730 bits (1885), Expect = 0.0
 Identities = 371/452 (82%), Positives = 404/452 (89%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKGR+I PNVIS+TTMLKG VAAGRID AL+VFEEMK CG+KPN VTFSTLLPG
Sbjct: 220  EKLFVEMKGRDIVPNVISFTTMLKGYVAAGRIDDALKVFEEMKGCGVKPNVVTFSTLLPG 279

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K+ EAR+VLGEMVERY+APKDNA+FMK+++ QCK+GDLD AADVLKAM+RLSIPT
Sbjct: 280  LCDAEKMAEARDVLGEMVERYIAPKDNALFMKMMSCQCKAGDLDAAADVLKAMVRLSIPT 339

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIE+FCKANV+D+A              RPQ+  E+EPSAYN MI YLC+HG+
Sbjct: 340  EAGHYGVLIESFCKANVYDKAEKLLDKLIEKEIVLRPQNDSEMEPSAYNLMIGYLCEHGR 399

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAET FRQL+KKGV DSVAFNNLIRGH+KEGNPDSA EI+KIMGRRGV RD DSY+LL
Sbjct: 400  TGKAETFFRQLLKKGVQDSVAFNNLIRGHSKEGNPDSAFEIMKIMGRRGVARDVDSYRLL 459

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LD MLESGHLPESSLYRSVMESLF+DGRVQTASRVMKSMVEKGV
Sbjct: 460  IESYLRKGEPADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSMVEKGV 519

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KENMDLV KILEALL+RGHVEEALGRIDLLM NGCE D DHLLSVLCEKEKTIAALKLLD
Sbjct: 520  KENMDLVLKILEALLLRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLD 579

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERD II FSIYDKVLDALLAAGKTLNA+SILCKILEK G+TDWSSRDELIKSLNQEG
Sbjct: 580  FVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKSLNQEG 639

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQADVLSRMIK  +G  L+R GK+KAT +T
Sbjct: 640  NTKQADVLSRMIKGTDGRTLRRGGKRKATVST 671


>KHN12442.1 Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 556

 Score =  726 bits (1873), Expect = 0.0
 Identities = 372/457 (81%), Positives = 405/457 (88%), Gaps = 5/457 (1%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKGR+I PNVIS+TTMLKG VAAG+ID AL+VFEEMK CG+KPNAVTFSTLLPG
Sbjct: 100  EKLFVEMKGRDIVPNVISFTTMLKGYVAAGQIDDALKVFEEMKGCGVKPNAVTFSTLLPG 159

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K+ EAR+VLGEMVERY+APKDNAVFMKL++ QCK+GDLD A DVLKAMIRLSIPT
Sbjct: 160  LCDAEKMAEARDVLGEMVERYIAPKDNAVFMKLMSYQCKAGDLDAAGDVLKAMIRLSIPT 219

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYE-----LEPSAYNPMIKYL 539
            EAGHYGVLIENFCKAN++D+A              R +++YE     +EPSAYN MI YL
Sbjct: 220  EAGHYGVLIENFCKANLYDKAEKLLDKMIEKEIVLRQKNAYETELFEMEPSAYNLMIGYL 279

Query: 540  CDHGQTGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDAD 719
            C+HG+TGKAET FRQLMKKGV DSV+FNNLI GH+KEGNPDSA+EI+KIMGRRGV RDAD
Sbjct: 280  CEHGRTGKAETFFRQLMKKGVQDSVSFNNLICGHSKEGNPDSAIEIIKIMGRRGVARDAD 339

Query: 720  SYKLLIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSM 899
            SY+LLIESYL KGEPADAKT LD MLESGHLPESSLYRSVMESLF+DGRVQTASRVMKSM
Sbjct: 340  SYRLLIESYLRKGEPADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSM 399

Query: 900  VEKGVKENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAA 1079
            VEKGVKEN DLVSK+LEALLMRGHVEEALGRI LLMLNGCE D DHLLSVLCEKEKTIAA
Sbjct: 400  VEKGVKENTDLVSKVLEALLMRGHVEEALGRIHLLMLNGCEPDFDHLLSVLCEKEKTIAA 459

Query: 1080 LKLLDFVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKS 1259
            LKLLDFVLERD II FSIYDKVLDALLAAGKTLNA+SILCKILEK G+TDWSSRDELIKS
Sbjct: 460  LKLLDFVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKS 519

Query: 1260 LNQEGNTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            LNQEGNTKQADVLSRMIK  +G P KR GK+K T +T
Sbjct: 520  LNQEGNTKQADVLSRMIKGTDGGPPKRGGKRKTTVST 556


>XP_004488235.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Cicer arietinum]
          Length = 739

 Score =  730 bits (1885), Expect = 0.0
 Identities = 370/452 (81%), Positives = 400/452 (88%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            ENLF EMK RNI PNVISYTTMLKG +  G++DRA+E FEEMK+CGIKPNAVTF+TLLPG
Sbjct: 288  ENLFAEMKERNIVPNVISYTTMLKGCIDVGKVDRAIEFFEEMKSCGIKPNAVTFTTLLPG 347

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCD DK+VEA  VLGEMVE YVAPKDN+VFMKL+  QCK+G+LD AADVLKAMIRLSIPT
Sbjct: 348  LCDGDKMVEAGKVLGEMVESYVAPKDNSVFMKLMNCQCKAGNLDAAADVLKAMIRLSIPT 407

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIENFCK N +DRA              RP++S+E+EPSAYNPMI+YLCD+G+
Sbjct: 408  EAGHYGVLIENFCKVNGYDRAEKLLDKLIEKEIVLRPENSFEIEPSAYNPMIEYLCDNGR 467

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAET FRQLMKKGVLDSVAFNNLIRGH+KEGNPDSALEI KIM RR VPRD DSYKLL
Sbjct: 468  TGKAETFFRQLMKKGVLDSVAFNNLIRGHSKEGNPDSALEIAKIMSRREVPRDEDSYKLL 527

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            +ESYL KGEPADAKT  D MLE GH P+SSLYRSVMESLFEDGRVQTASRVMKSMVEKGV
Sbjct: 528  VESYLRKGEPADAKTAFDHMLEGGHQPDSSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 587

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KENMDLVSKILEALLMRGHVEEALGRI LLM NG E D DHLLS+LCEKEKTIAAL+LLD
Sbjct: 588  KENMDLVSKILEALLMRGHVEEALGRIALLMQNGFEPDFDHLLSILCEKEKTIAALRLLD 647

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLE+DIII FSIYDKVLDAL AAGKTLNA+SILCKILEKRGATDWSSRDELIKSLNQ+G
Sbjct: 648  FVLEKDIIINFSIYDKVLDALFAAGKTLNAYSILCKILEKRGATDWSSRDELIKSLNQDG 707

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQAD+LSRMIKEK  SP KR+GKKKA+ AT
Sbjct: 708  NTKQADILSRMIKEKVESPPKRDGKKKASRAT 739


>XP_003524868.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max] KRH58681.1 hypothetical protein
            GLYMA_05G142300 [Glycine max]
          Length = 733

 Score =  729 bits (1881), Expect = 0.0
 Identities = 370/452 (81%), Positives = 403/452 (89%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKGR+I PNVIS+TTMLKG VAAGRID AL+VFEEMK CG+KPN VTFSTLLPG
Sbjct: 282  EKLFVEMKGRDIVPNVISFTTMLKGYVAAGRIDDALKVFEEMKGCGVKPNVVTFSTLLPG 341

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K+ EAR+VLGEMVERY+APKDNA+FMK+++ QCK+GDLD AADVLKAM+RLSIPT
Sbjct: 342  LCDAEKMAEARDVLGEMVERYIAPKDNALFMKMMSCQCKAGDLDAAADVLKAMVRLSIPT 401

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIE+FCKANV+D+A              RPQ+  E+EPSAYN MI YLC+HG+
Sbjct: 402  EAGHYGVLIESFCKANVYDKAEKLLDKLIEKEIVLRPQNDSEMEPSAYNLMIGYLCEHGR 461

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAET FRQL+KKGV DSVAFNNLIRGH+KEGNPDSA EI+KIMGRRGV RD DSY+LL
Sbjct: 462  TGKAETFFRQLLKKGVQDSVAFNNLIRGHSKEGNPDSAFEIMKIMGRRGVARDVDSYRLL 521

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LD MLESGHLPESSLYRSVMESLF+DGRVQTASRVMKSMVEKG 
Sbjct: 522  IESYLRKGEPADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSMVEKGA 581

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KENMDLV KILEALL+RGHVEEALGRIDLLM NGCE D DHLLSVLCEKEKTIAALKLLD
Sbjct: 582  KENMDLVLKILEALLLRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLD 641

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERD II FSIYDKVLDALLAAGKTLNA+SILCKILEK G+TDWSSRDELIKSLNQEG
Sbjct: 642  FVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKSLNQEG 701

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQADVLSRMIK  +G  L+R GK+KAT +T
Sbjct: 702  NTKQADVLSRMIKGTDGRTLRRGGKRKATVST 733


>XP_003532699.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Glycine max] XP_014634299.1 PREDICTED: pentatricopeptide
            repeat-containing protein At2g37230-like [Glycine max]
            XP_014634300.1 PREDICTED: pentatricopeptide
            repeat-containing protein At2g37230-like [Glycine max]
            XP_014634301.1 PREDICTED: pentatricopeptide
            repeat-containing protein At2g37230-like [Glycine max]
            KRH42571.1 hypothetical protein GLYMA_08G098100 [Glycine
            max] KRH42572.1 hypothetical protein GLYMA_08G098100
            [Glycine max]
          Length = 738

 Score =  728 bits (1878), Expect = 0.0
 Identities = 373/457 (81%), Positives = 405/457 (88%), Gaps = 5/457 (1%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKGR+I PNVIS+TTMLKG VAAG+ID AL+VFEEMK CG+KPNAVTFSTLLPG
Sbjct: 282  EKLFVEMKGRDIVPNVISFTTMLKGYVAAGQIDDALKVFEEMKGCGVKPNAVTFSTLLPG 341

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K+ EAR+VLGEMVERY+APKDNAVFMKL++ QCK+GDLD A DVLKAMIRLSIPT
Sbjct: 342  LCDAEKMAEARDVLGEMVERYIAPKDNAVFMKLMSCQCKAGDLDAAGDVLKAMIRLSIPT 401

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYE-----LEPSAYNPMIKYL 539
            EAGHYGVLIENFCKAN++D+A              R +++YE     +EPSAYN MI YL
Sbjct: 402  EAGHYGVLIENFCKANLYDKAEKLLDKMIEKEIVLRQKNAYETELFEMEPSAYNLMIGYL 461

Query: 540  CDHGQTGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDAD 719
            C+HG+TGKAET FRQLMKKGV DSV+FNNLI GH+KEGNPDSA EI+KIMGRRGV RDAD
Sbjct: 462  CEHGRTGKAETFFRQLMKKGVQDSVSFNNLICGHSKEGNPDSAFEIIKIMGRRGVARDAD 521

Query: 720  SYKLLIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSM 899
            SY+LLIESYL KGEPADAKT LD MLESGHLPESSLYRSVMESLF+DGRVQTASRVMKSM
Sbjct: 522  SYRLLIESYLRKGEPADAKTALDGMLESGHLPESSLYRSVMESLFDDGRVQTASRVMKSM 581

Query: 900  VEKGVKENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAA 1079
            VEKGVKENMDLVSK+LEALLMRGHVEEALGRI LLMLNGCE D DHLLSVLCEKEKTIAA
Sbjct: 582  VEKGVKENMDLVSKVLEALLMRGHVEEALGRIHLLMLNGCEPDFDHLLSVLCEKEKTIAA 641

Query: 1080 LKLLDFVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKS 1259
            LKLLDFVLERD II FSIYDKVLDALLAAGKTLNA+SILCKILEK G+TDWSSRDELIKS
Sbjct: 642  LKLLDFVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILEKGGSTDWSSRDELIKS 701

Query: 1260 LNQEGNTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            LNQEGNTKQADVLSRMIK  +G P KR GK+K T +T
Sbjct: 702  LNQEGNTKQADVLSRMIKGTDGGPPKRGGKRKTTVST 738


>XP_007158766.1 hypothetical protein PHAVU_002G180100g [Phaseolus vulgaris]
            ESW30760.1 hypothetical protein PHAVU_002G180100g
            [Phaseolus vulgaris]
          Length = 728

 Score =  709 bits (1831), Expect = 0.0
 Identities = 363/452 (80%), Positives = 396/452 (87%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKGR+I PNVIS+TTMLKG VAAGRID A++VFE+MK CGIKPNAVTFSTLLPG
Sbjct: 277  EKLFVEMKGRDIVPNVISFTTMLKGYVAAGRIDDAMKVFEDMKNCGIKPNAVTFSTLLPG 336

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K VEAR+VL EMVERY+APKDN+VFMKLL+ Q KSGDLD AADVLKAMIRLSIPT
Sbjct: 337  LCDAEKTVEARDVLREMVERYIAPKDNSVFMKLLSVQSKSGDLDAAADVLKAMIRLSIPT 396

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIE+FCKAN  D+A              RPQ+++E+E S+YN MI+YLCDHG+
Sbjct: 397  EAGHYGVLIESFCKANEHDKAEKLLDKLIEKEIVSRPQNAFEMEASSYNLMIEYLCDHGR 456

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            T KAE  FRQL+KKGV DSVAFN+LIRGH+KEGNPDSA EI+KIMGRR VPRDADSY+LL
Sbjct: 457  TSKAERFFRQLLKKGVQDSVAFNSLIRGHSKEGNPDSAFEIIKIMGRRAVPRDADSYRLL 516

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LDSMLESGHLPESSLYR VMESLF DGRVQTASRVMKSMVEKGV
Sbjct: 517  IESYLRKGEPADAKTALDSMLESGHLPESSLYRLVMESLFNDGRVQTASRVMKSMVEKGV 576

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KE+MDLVSKILEALLMRGHVEEALGRIDLLM NGCE D DHLLS+LCEKEKTIAALKLLD
Sbjct: 577  KEHMDLVSKILEALLMRGHVEEALGRIDLLMHNGCEPDFDHLLSILCEKEKTIAALKLLD 636

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERD II FS+YDKVLD LLA GKTLNA+SILCKILEKRG+TDW SR+ELIKSLN EG
Sbjct: 637  FVLERDCIIDFSLYDKVLDTLLAVGKTLNAYSILCKILEKRGSTDWRSREELIKSLNHEG 696

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQADVLSRM K  +G  +  EGK+K T AT
Sbjct: 697  NTKQADVLSRMFKGTDGGRVNSEGKRKVTVAT 728



 Score = 63.2 bits (152), Expect = 8e-07
 Identities = 70/307 (22%), Positives = 127/307 (41%), Gaps = 48/307 (15%)
 Frame = +3

Query: 513  AYNPMIKYLCDHGQTGKAETLFRQLMKKGVLDSV-AFNNLIRGHAKEGNPDSALEIVKIM 689
            +Y+ + K +   G+   A+  +  ++++GV  +   +N L+ G       D+A+   + M
Sbjct: 189  SYDALFKVILRRGRYMMAKRYYNAMLREGVEPTRHTYNILLWGMFLSLRLDTAVRFYEEM 248

Query: 690  GRRGVPRDADSYKLLIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRV 869
              RGV  D  +Y  LI  Y    +  DA+ +   M     +P    + ++++     GR+
Sbjct: 249  NSRGVLPDVVTYNTLINGYFRFKKVEDAEKLFVEMKGRDIVPNVISFTTMLKGYVAAGRI 308

Query: 870  QTASRVMKSMVEKGVKENMDLVSKILEAL-----------LMRGHVEEALGRID------ 998
              A +V + M   G+K N    S +L  L           ++R  VE  +   D      
Sbjct: 309  DDAMKVFEDMKNCGIKPNAVTFSTLLPGLCDAEKTVEARDVLREMVERYIAPKDNSVFMK 368

Query: 999  LLMLNGCELDLDHLLSVL----------------------CEKEKTIAALKLLDFVLERD 1112
            LL +     DLD    VL                      C+  +   A KLLD ++E++
Sbjct: 369  LLSVQSKSGDLDAAADVLKAMIRLSIPTEAGHYGVLIESFCKANEHDKAEKLLDKLIEKE 428

Query: 1113 II--------IGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQ 1268
            I+        +  S Y+ +++ L   G+T  A     ++L K+G  D  + + LI+  ++
Sbjct: 429  IVSRPQNAFEMEASSYNLMIEYLCDHGRTSKAERFFRQLL-KKGVQDSVAFNSLIRGHSK 487

Query: 1269 EGNTKQA 1289
            EGN   A
Sbjct: 488  EGNPDSA 494


>XP_016172946.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Arachis ipaensis]
          Length = 736

 Score =  709 bits (1831), Expect = 0.0
 Identities = 361/453 (79%), Positives = 394/453 (86%), Gaps = 1/453 (0%)
 Frame = +3

Query: 15   ENLFVEMKGR-NIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLP 191
            E LF EMKGR +  PNVISYTTMLKG V AGR+D A  +FEEMK+CG+KPNAVTF+TLLP
Sbjct: 284  EKLFAEMKGRGDTVPNVISYTTMLKGYVGAGRVDDAARIFEEMKSCGVKPNAVTFTTLLP 343

Query: 192  GLCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIP 371
             LCDA K  EA+NVL EMV+RY+APKDN++FMKLL+ QC+ GDLD A DVLK MIRLSIP
Sbjct: 344  ALCDAGKAAEAKNVLREMVDRYIAPKDNSIFMKLLSCQCECGDLDAAGDVLKGMIRLSIP 403

Query: 372  TEAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHG 551
            TEAGHYGVLIENFCKANV+D+A              RPQSS+++EPSAYN MIKYLC++G
Sbjct: 404  TEAGHYGVLIENFCKANVYDKAVKLLDKLIEKEIILRPQSSFDMEPSAYNLMIKYLCENG 463

Query: 552  QTGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKL 731
            QT KAET FRQLMKKGV DSV+FNNLI GH+KEGNPDSA EI+KIMGRRGV RDADSY+L
Sbjct: 464  QTAKAETFFRQLMKKGVQDSVSFNNLIHGHSKEGNPDSAFEILKIMGRRGVARDADSYRL 523

Query: 732  LIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKG 911
            LIESYL  GEPADAKT LDSMLESGH+PES+L+RSVMESLFEDGRVQTASRVMK MVEK 
Sbjct: 524  LIESYLRNGEPADAKTALDSMLESGHVPESTLFRSVMESLFEDGRVQTASRVMKCMVEKE 583

Query: 912  VKENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLL 1091
            +KENMDLVSKILEALLMRGHVEEALGRI+LL  NG E DLDHLLSVLCEKEKTIAALKL 
Sbjct: 584  IKENMDLVSKILEALLMRGHVEEALGRIELLNQNGFEPDLDHLLSVLCEKEKTIAALKLF 643

Query: 1092 DFVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQE 1271
            DF LERD I+GFSIYDKVLDALLAAGKTLNA+SILCKILEKRG TDWSSRDELIKSLN+E
Sbjct: 644  DFALERDYIVGFSIYDKVLDALLAAGKTLNAYSILCKILEKRGGTDWSSRDELIKSLNRE 703

Query: 1272 GNTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            GNTKQAD+LSRMIK  EGSPLKREGKKK   AT
Sbjct: 704  GNTKQADILSRMIKGTEGSPLKREGKKKFAPAT 736


>XP_014510338.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Vigna radiata var. radiata]
          Length = 728

 Score =  709 bits (1829), Expect = 0.0
 Identities = 362/452 (80%), Positives = 399/452 (88%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMK R+IAPNVIS+TTMLKG VAAGRID A++VFE+MK CGIKPN+VTFSTLLPG
Sbjct: 277  EKLFVEMKERDIAPNVISFTTMLKGYVAAGRIDDAMKVFEDMKDCGIKPNSVTFSTLLPG 336

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K  EAR+VLGEMV+RY+ PKDN+VFMKLL  Q KSG+LD AADVLKAMIRLSIPT
Sbjct: 337  LCDAEKTEEARDVLGEMVDRYITPKDNSVFMKLLGVQSKSGNLDAAADVLKAMIRLSIPT 396

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIE+FCKAN +D+A              RPQ+++E+E  AYN MI+YLCDHG+
Sbjct: 397  EAGHYGVLIESFCKANEYDKAEKLLDKLIEKEIVLRPQNAFEMEAGAYNLMIEYLCDHGR 456

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            T KAE  FRQL+KKGV DSVAFN+LIRGH+KEGNPDSA EI+KIMGR+GVPRDADSY+LL
Sbjct: 457  TNKAEMFFRQLLKKGVQDSVAFNSLIRGHSKEGNPDSAFEIIKIMGRKGVPRDADSYRLL 516

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LDSMLESGH PESS+YR VMESLF+DGRVQTASRVMKSMVEKGV
Sbjct: 517  IESYLRKGEPADAKTALDSMLESGHHPESSVYRLVMESLFDDGRVQTASRVMKSMVEKGV 576

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KE+MDLVSKILEALLMRGHVEEALGRIDLLM NGCE D DHLLSVLCEKEKTIAALKLLD
Sbjct: 577  KEHMDLVSKILEALLMRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLD 636

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERD II FSIYDKVLDALLAAGKTLNA+SILCKILEKRG+TDW SR+ELIKSLNQEG
Sbjct: 637  FVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILEKRGSTDWRSREELIKSLNQEG 696

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQAD+LSRM K  +G  + REGK+K T AT
Sbjct: 697  NTKQADILSRMFKGTDGGLVNREGKRKGTVAT 728



 Score = 62.8 bits (151), Expect = 1e-06
 Identities = 71/317 (22%), Positives = 133/317 (41%), Gaps = 48/317 (15%)
 Frame = +3

Query: 513  AYNPMIKYLCDHGQTGKAETLFRQLMKKGVLDSV-AFNNLIRGHAKEGNPDSALEIVKIM 689
            +Y+ + K +   G+   A+  +  ++++GV  +   +N L+ G       D+A+   + M
Sbjct: 189  SYDALFKVILRRGRYMMAKRYYNAMLREGVEPTRHTYNILLWGMFLSLRLDTAVRFYEEM 248

Query: 690  GRRGVPRDADSYKLLIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRV 869
              RG+  D  +Y  LI  Y    +  DA+ +   M E    P    + ++++     GR+
Sbjct: 249  KSRGILPDVVTYNTLINGYFRFKKVEDAEKLFVEMKERDIAPNVISFTTMLKGYVAAGRI 308

Query: 870  QTASRVMKSMVEKGVKENMDLVSKILEALLMRGHVEEA---LGR--------------ID 998
              A +V + M + G+K N    S +L  L      EEA   LG               + 
Sbjct: 309  DDAMKVFEDMKDCGIKPNSVTFSTLLPGLCDAEKTEEARDVLGEMVDRYITPKDNSVFMK 368

Query: 999  LLMLNGCELDLDHLLSVL----------------------CEKEKTIAALKLLDFVLERD 1112
            LL +     +LD    VL                      C+  +   A KLLD ++E++
Sbjct: 369  LLGVQSKSGNLDAAADVLKAMIRLSIPTEAGHYGVLIESFCKANEYDKAEKLLDKLIEKE 428

Query: 1113 III----GFSI----YDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQ 1268
            I++     F +    Y+ +++ L   G+T  A     ++L K+G  D  + + LI+  ++
Sbjct: 429  IVLRPQNAFEMEAGAYNLMIEYLCDHGRTNKAEMFFRQLL-KKGVQDSVAFNSLIRGHSK 487

Query: 1269 EGNTKQADVLSRMIKEK 1319
            EGN   A  + +++  K
Sbjct: 488  EGNPDSAFEIIKIMGRK 504


>XP_017425821.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Vigna angularis] KOM43328.1 hypothetical protein
            LR48_Vigan05g093200 [Vigna angularis] BAT74296.1
            hypothetical protein VIGAN_01193600 [Vigna angularis var.
            angularis]
          Length = 728

 Score =  707 bits (1824), Expect = 0.0
 Identities = 362/452 (80%), Positives = 399/452 (88%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMK R+IAPNVIS+TTMLKG VAAGRID A++VF +MK CGIKPNAVTFSTLLPG
Sbjct: 277  EKLFVEMKERDIAPNVISFTTMLKGYVAAGRIDDAMKVFVDMKDCGIKPNAVTFSTLLPG 336

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K  EAR+VLGEMVERY+APKDN+VFMKLL  Q KSG+LD AADVLKAMIRLSIPT
Sbjct: 337  LCDAEKTEEARDVLGEMVERYIAPKDNSVFMKLLGVQSKSGNLDAAADVLKAMIRLSIPT 396

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIE+FCKAN +D+A              RPQ+++ +E SAYN MI+YLCDHG+
Sbjct: 397  EAGHYGVLIESFCKANEYDKAEKLLDKLIEKEIVLRPQNAFAMEASAYNLMIEYLCDHGR 456

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            T KAE  FRQL+KKGV DSVAFN+LIRGH+KEGNPDSA EI+KIMGR+GVPRDADSY+LL
Sbjct: 457  TNKAEIFFRQLLKKGVQDSVAFNSLIRGHSKEGNPDSAFEIIKIMGRKGVPRDADSYRLL 516

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LDSMLESGH PESS+Y+ VMESLF+DGRVQTASRVMKSMVEKGV
Sbjct: 517  IESYLRKGEPADAKTALDSMLESGHHPESSVYKLVMESLFDDGRVQTASRVMKSMVEKGV 576

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KE+MDLVSKILEALLMRGHVEEALGRIDLLM NGCE D DHLLSVLCEKEKTIAALKLLD
Sbjct: 577  KEHMDLVSKILEALLMRGHVEEALGRIDLLMHNGCEPDFDHLLSVLCEKEKTIAALKLLD 636

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERD II FSIYDKVLDALLAAGKTLNA+SILCKIL+KRG+TDW SR+ELIKSLNQEG
Sbjct: 637  FVLERDCIIDFSIYDKVLDALLAAGKTLNAYSILCKILDKRGSTDWRSREELIKSLNQEG 696

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQAD+LSRM K  +G  + REGK+K T AT
Sbjct: 697  NTKQADILSRMFKGTDGGLVNREGKRKVTVAT 728


>XP_015931857.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Arachis duranensis]
          Length = 759

 Score =  706 bits (1822), Expect = 0.0
 Identities = 360/453 (79%), Positives = 394/453 (86%), Gaps = 1/453 (0%)
 Frame = +3

Query: 15   ENLFVEMKGR-NIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLP 191
            E LF EMKGR +  PNVISYTTMLKG V AGR+D A  +FEEMK+CG+KPNAVTF+TLLP
Sbjct: 307  EKLFAEMKGRGDTVPNVISYTTMLKGYVGAGRVDDAARIFEEMKSCGVKPNAVTFTTLLP 366

Query: 192  GLCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIP 371
             L DA K  EA+NVL EMV+RY+APKDN++FMKLL+ QC+ GDLD A DVLK MIRLSIP
Sbjct: 367  ALSDAGKAAEAKNVLREMVDRYIAPKDNSIFMKLLSCQCECGDLDAAGDVLKGMIRLSIP 426

Query: 372  TEAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHG 551
            TEAGHYGVLIENFCKANV+D+A              RPQSS+++EPSAYN MIKYLC++G
Sbjct: 427  TEAGHYGVLIENFCKANVYDKAVKLLDKLIEKEIILRPQSSFDMEPSAYNLMIKYLCENG 486

Query: 552  QTGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKL 731
            QT KAET FRQLMKKGV DSV+FNNLI GH+KEGNPDSA EI+KIMGRRGV RDADSY+L
Sbjct: 487  QTAKAETFFRQLMKKGVQDSVSFNNLIHGHSKEGNPDSAFEILKIMGRRGVARDADSYRL 546

Query: 732  LIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKG 911
            LIESYL KGEPADAKT LDSMLESGH+PES+L+RSVMESLFEDGRVQTASRVMK MVEK 
Sbjct: 547  LIESYLRKGEPADAKTALDSMLESGHVPESTLFRSVMESLFEDGRVQTASRVMKCMVEKE 606

Query: 912  VKENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLL 1091
            +KENMDLVSKILEALLMRGHVEEALGRI+LL  NG E DLDHLLSVLCEKEKTIAALKL 
Sbjct: 607  IKENMDLVSKILEALLMRGHVEEALGRIELLNQNGFEPDLDHLLSVLCEKEKTIAALKLF 666

Query: 1092 DFVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQE 1271
            DF LERD I+GFSIYDKVLDALLAAGKTLNA+SILCKILEKRG TDWSSRDELIKSLN+E
Sbjct: 667  DFALERDYIVGFSIYDKVLDALLAAGKTLNAYSILCKILEKRGRTDWSSRDELIKSLNRE 726

Query: 1272 GNTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            GNTKQAD+LSRMIK  EG+PLKREGKKK   AT
Sbjct: 727  GNTKQADILSRMIKGTEGTPLKREGKKKFAPAT 759


>XP_015939760.1 PREDICTED: pentatricopeptide repeat-containing protein
            At2g37230-like, partial [Arachis duranensis]
          Length = 718

 Score =  703 bits (1815), Expect = 0.0
 Identities = 358/453 (79%), Positives = 392/453 (86%), Gaps = 1/453 (0%)
 Frame = +3

Query: 15   ENLFVEMKGR-NIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLP 191
            E LF EMKGR +  PNVISYTTMLKG V AGR+D A  +FEEMK+CG+KPNAVTF+TLLP
Sbjct: 266  EKLFAEMKGRGDTVPNVISYTTMLKGYVGAGRVDDAARIFEEMKSCGVKPNAVTFTTLLP 325

Query: 192  GLCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIP 371
             LCDA K  EA+NVL EMV+RY+APKDN++FMKLL+ QC+  DLD A DVLK MIRLSIP
Sbjct: 326  ALCDAGKAAEAKNVLQEMVDRYIAPKDNSIFMKLLSCQCECSDLDAAGDVLKGMIRLSIP 385

Query: 372  TEAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHG 551
            TEAGHYGVLIENFCKANV+D+A              RPQSS+++EPSAYN MIKYLC++G
Sbjct: 386  TEAGHYGVLIENFCKANVYDKAVKLLDKLIEKEIILRPQSSFDMEPSAYNLMIKYLCENG 445

Query: 552  QTGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKL 731
            QT KAET FRQLMKKGV DSV+FNNLI GH+KEGNPDSA EI+KIMGRRGV RDAD Y+L
Sbjct: 446  QTAKAETFFRQLMKKGVQDSVSFNNLIHGHSKEGNPDSAFEILKIMGRRGVARDADFYRL 505

Query: 732  LIESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKG 911
            LIESYL KGEPADAKT LDSMLESGH+PES+L+RSVMESLFEDGRVQTASRVMK MVEK 
Sbjct: 506  LIESYLRKGEPADAKTALDSMLESGHVPESTLFRSVMESLFEDGRVQTASRVMKCMVEKE 565

Query: 912  VKENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLL 1091
            +KENMDLVSKILEALLMRGHVEEALGRI+LL  NG E DLDHLLSVLCEKEKTIAALKL 
Sbjct: 566  IKENMDLVSKILEALLMRGHVEEALGRIELLNQNGFEPDLDHLLSVLCEKEKTIAALKLF 625

Query: 1092 DFVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQE 1271
            DF LERD I+GFSIYDKVLD LLAAGKTLNA+SILCKILEKRG TDWSSRDELIKSLN+E
Sbjct: 626  DFALERDYIVGFSIYDKVLDPLLAAGKTLNAYSILCKILEKRGRTDWSSRDELIKSLNRE 685

Query: 1272 GNTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            GNTKQAD+LSRMIK  EG+PLKREGKKK   AT
Sbjct: 686  GNTKQADILSRMIKGTEGTPLKREGKKKFAPAT 718


>XP_019464106.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Lupinus angustifolius] XP_019464107.1 PREDICTED:
            pentatricopeptide repeat-containing protein At2g37230
            [Lupinus angustifolius] OIW00649.1 hypothetical protein
            TanjilG_09130 [Lupinus angustifolius]
          Length = 740

 Score =  701 bits (1808), Expect = 0.0
 Identities = 351/451 (77%), Positives = 399/451 (88%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKG++IAP+VISY TMLKG  A G++D AL+++EEMK  GI PNA+TF+ LLPG
Sbjct: 289  EKLFVEMKGKDIAPDVISYNTMLKGYFAVGQVDDALKIYEEMKGVGINPNAITFTMLLPG 348

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCD  K+ EA+NVLGEMVE+YVAPKDN++FMKL+T QCK+GDLD AA VLKAMIRL IPT
Sbjct: 349  LCDVGKIAEAQNVLGEMVEKYVAPKDNSIFMKLMTCQCKAGDLDAAAGVLKAMIRLRIPT 408

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIENFCKANV+D+A              RP+S++E+E SAYNPMI+YLCD+GQ
Sbjct: 409  EAGHYGVLIENFCKANVYDKAVNLLDRLIEKDIILRPKSTFEIEASAYNPMIQYLCDNGQ 468

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            T KAET FRQL+KKGV+D+VAFNNLIRGH+KEGNPDSALEI+ IMGRR VPRDADSYKLL
Sbjct: 469  TVKAETFFRQLLKKGVIDAVAFNNLIRGHSKEGNPDSALEILTIMGRREVPRDADSYKLL 528

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LD MLE+GH+PESSLYR+VMESLFEDGRVQTASRVMKSMVEKGV
Sbjct: 529  IESYLRKGEPADAKTALDGMLENGHIPESSLYRAVMESLFEDGRVQTASRVMKSMVEKGV 588

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KE+MDLVSKILEALL+RGHVEEA+GRIDLLM +GCE D+D LLS+LCEK+KTIAALKLLD
Sbjct: 589  KEHMDLVSKILEALLIRGHVEEAIGRIDLLMHSGCEPDIDRLLSILCEKKKTIAALKLLD 648

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERD ++  S+YDKVLDAL+AAGKTLNA+SILCKI+EKRGATDWSSRDELIKSLN EG
Sbjct: 649  FVLERDYVLDVSMYDKVLDALIAAGKTLNAYSILCKIVEKRGATDWSSRDELIKSLNLEG 708

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHA 1367
            NTKQADVLSRM+K K  SP KREGKK+A  A
Sbjct: 709  NTKQADVLSRMMKGKTQSPAKREGKKQAAAA 739


>XP_013463303.1 pentatricopeptide (PPR) repeat protein [Medicago truncatula]
            KEH37314.1 pentatricopeptide (PPR) repeat protein
            [Medicago truncatula]
          Length = 745

 Score =  699 bits (1803), Expect = 0.0
 Identities = 357/451 (79%), Positives = 388/451 (86%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E+LFVEMKG+N+ PNVISYTTMLKG V  G++DRA EVFEEMK CGIKPNAVTF+TLLPG
Sbjct: 294  ESLFVEMKGKNLMPNVISYTTMLKGFVDVGKVDRAFEVFEEMKDCGIKPNAVTFTTLLPG 353

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDADK+VEA NVLGEMVERY+APKDN+VFMKL+  QCK G+LD A DVL AMIRLSIPT
Sbjct: 354  LCDADKMVEAGNVLGEMVERYIAPKDNSVFMKLMECQCKGGNLDAAVDVLNAMIRLSIPT 413

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIENFCKANV+DRA              RP++SYE+E SAYN MI YLCD+G+
Sbjct: 414  EAGHYGVLIENFCKANVYDRAEKLLDKLIEKDIVLRPETSYEMEASAYNRMIGYLCDNGK 473

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            T KAE  FRQLMKKGVLD VAFNNL+ GH+KEGNPDSA EI  IM RR V  D  SY+LL
Sbjct: 474  TAKAEMFFRQLMKKGVLDPVAFNNLMCGHSKEGNPDSAFEIATIMSRRKVHSDEYSYRLL 533

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LD MLE GH P SSLYRSVMESLFEDGRVQTASRVMK+MVEKGV
Sbjct: 534  IESYLRKGEPADAKTALDHMLEGGHEPNSSLYRSVMESLFEDGRVQTASRVMKNMVEKGV 593

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            K NMDLVSKILEAL +RGHVEEALGRIDLLM +GCE D DHLLS+LCEKEK IAAL+LLD
Sbjct: 594  KNNMDLVSKILEALFIRGHVEEALGRIDLLMNSGCEPDFDHLLSILCEKEKRIAALRLLD 653

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            FVLERDIII FS YDKVLD LLAAGKTLNA+SILCKI+EKRGATDWSSRDELIKSLNQ+G
Sbjct: 654  FVLERDIIIDFSNYDKVLDTLLAAGKTLNAYSILCKIMEKRGATDWSSRDELIKSLNQQG 713

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHA 1367
            NTKQADVLSRM+KEK  SP K+EGKKKA+ A
Sbjct: 714  NTKQADVLSRMVKEKVASPPKKEGKKKASRA 744


>XP_018837769.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Juglans regia] XP_018837770.1 PREDICTED:
            pentatricopeptide repeat-containing protein At2g37230
            [Juglans regia] XP_018837771.1 PREDICTED:
            pentatricopeptide repeat-containing protein At2g37230
            [Juglans regia] XP_018837772.1 PREDICTED:
            pentatricopeptide repeat-containing protein At2g37230
            [Juglans regia] XP_018837773.1 PREDICTED:
            pentatricopeptide repeat-containing protein At2g37230
            [Juglans regia]
          Length = 766

 Score =  671 bits (1730), Expect = 0.0
 Identities = 339/451 (75%), Positives = 388/451 (86%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LF E+KGRNIAP VISYTTM+KG V+ GRID  L + +EMK+ G+KPNAVT+STLLPG
Sbjct: 315  EKLFDELKGRNIAPTVISYTTMIKGYVSVGRIDDGLRLLDEMKSFGVKPNAVTYSTLLPG 374

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K+ EAR +L EMVER+ APKD+++F++LLT QC SGDLD AADVLK+MIRLSIPT
Sbjct: 375  LCDAEKMSEARMMLKEMVERHFAPKDSSIFVRLLTCQCNSGDLDAAADVLKSMIRLSIPT 434

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIENFCKA V+DRA              RPQ+S E+E +AYNPMI+YLC+HGQ
Sbjct: 435  EAGHYGVLIENFCKAGVYDRAIKLLDKLIEKEIILRPQTSLEMESTAYNPMIQYLCEHGQ 494

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAE  FRQLMKKG+LDS AFNNLI GH++EGNPDSA EI++IMGRRGV RDADS+KLL
Sbjct: 495  TGKAEIFFRQLMKKGILDSFAFNNLICGHSREGNPDSAFEILRIMGRRGVSRDADSFKLL 554

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            I+SYL +GEPADAKT LDSM+E GHLP+SSL+RSVMESLFEDGR+QT+SRVMKSMVEKGV
Sbjct: 555  IKSYLNRGEPADAKTALDSMIEVGHLPDSSLFRSVMESLFEDGRIQTSSRVMKSMVEKGV 614

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KENMDLV+KILEALLMRGHVEEALGRIDLLM +GC  D D LL+VLCEK KTIAALK+LD
Sbjct: 615  KENMDLVAKILEALLMRGHVEEALGRIDLLMHSGCTPDFDSLLTVLCEKGKTIAALKVLD 674

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            F LERD  + FS YDKVLDALLAAGKTLNA+SILCKI+EK GATDWSSR+ LIKSLNQEG
Sbjct: 675  FALERDYTVDFSSYDKVLDALLAAGKTLNAYSILCKIMEKGGATDWSSREALIKSLNQEG 734

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHA 1367
            NTKQAD+LSRMIK+ E S   ++GKK AT A
Sbjct: 735  NTKQADILSRMIKDGEKSHAGKKGKKHATVA 765



 Score = 60.5 bits (145), Expect = 6e-06
 Identities = 73/363 (20%), Positives = 147/363 (40%), Gaps = 13/363 (3%)
 Frame = +3

Query: 270  DNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPTEAGHYGVLIENFCKANVFDRAXXXX 449
            D    +K++    ++  ++ A  +L  M +  +  +   + VLIE++ KA +   A    
Sbjct: 154  DRETHLKMIEILGRASKINHARCILLDMPKKGVEWDEDLFVVLIESYGKAGIVQEAVKIF 213

Query: 450  XXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQTGKAETLFRQLMKKGVLDSV-AFNN 626
                        +   E    +Y+ + K +   G+   A+  F  ++ +G+  +   FN 
Sbjct: 214  QKMK--------ELGVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTFNV 265

Query: 627  LIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLLIESYLGKGEPADAKTVLDSMLESG 806
            ++ G       ++A    + M  RGV  D  +Y  +I  Y       +A+ + D +    
Sbjct: 266  MLWGFFLSLRLETAKRFYEDMKSRGVSPDVVTYNTMINGYYRFKMMDEAEKLFDELKGRN 325

Query: 807  HLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGVKENMDLVSKILEALLMRGHVEEAL 986
              P    Y ++++     GR+    R++  M   GVK N    S +L  L     + EA 
Sbjct: 326  IAPTVISYTTMIKGYVSVGRIDDGLRLLDEMKSFGVKPNAVTYSTLLPGLCDAEKMSEAR 385

Query: 987  GRI-DLLMLNGCELDLD---HLLSVLCEKEKTIAALKLLDFVLERDIIIGFSIYDKVLDA 1154
              + +++  +    D      LL+  C      AA  +L  ++   I      Y  +++ 
Sbjct: 386  MMLKEMVERHFAPKDSSIFVRLLTCQCNSGDLDAAADVLKSMIRLSIPTEAGHYGVLIEN 445

Query: 1155 LLAAGKTLNAHSILCKILEK----RGAT----DWSSRDELIKSLNQEGNTKQADVLSRMI 1310
               AG    A  +L K++EK    R  T    + ++ + +I+ L + G T +A++  R +
Sbjct: 446  FCKAGVYDRAIKLLDKLIEKEIILRPQTSLEMESTAYNPMIQYLCEHGQTGKAEIFFRQL 505

Query: 1311 KEK 1319
             +K
Sbjct: 506  MKK 508


>GAU37748.1 hypothetical protein TSUD_102640 [Trifolium subterraneum]
          Length = 665

 Score =  666 bits (1718), Expect = 0.0
 Identities = 343/435 (78%), Positives = 375/435 (86%), Gaps = 1/435 (0%)
 Frame = +3

Query: 69   YTTMLK-GSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPGLCDADKVVEARNVLGEM 245
            Y  ML+ G V AGR+DRA+EVFEEMK  GIKPNAVT++TLLPGLCDADK+VEA NVLGEM
Sbjct: 231  YNAMLREGCVDAGRVDRAVEVFEEMKNNGIKPNAVTYTTLLPGLCDADKLVEAGNVLGEM 290

Query: 246  VERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPTEAGHYGVLIENFCKANV 425
            VERY+AP DN+VFMKL+   CK G+LD A +VLKAMIRLSIPTEAGHYGVLIENFCKANV
Sbjct: 291  VERYIAPNDNSVFMKLMNCHCKVGNLDDAVNVLKAMIRLSIPTEAGHYGVLIENFCKANV 350

Query: 426  FDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQTGKAETLFRQLMKKGVL 605
            +DRA              RP++S+E+ PSAYNPMI+YLCD+G+T KAET FRQLMKKGVL
Sbjct: 351  YDRAEKLLDKLVEKDIILRPENSFEMGPSAYNPMIEYLCDNGRTVKAETFFRQLMKKGVL 410

Query: 606  DSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLLIESYLGKGEPADAKTVL 785
            DSVAFNNLIRGH KEGNP+SALEI  IM RRGV  D+DSY+LL ESYL KGEPADAK  L
Sbjct: 411  DSVAFNNLIRGHLKEGNPESALEIATIMSRRGVSSDSDSYRLLTESYLRKGEPADAKIAL 470

Query: 786  DSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGVKENMDLVSKILEALLMR 965
            D M+ESGH P+SSLYRSVMESLFEDGRVQTASRVMKSMVEKGV ENMDLVSKILEALLMR
Sbjct: 471  DHMIESGHQPDSSLYRSVMESLFEDGRVQTASRVMKSMVEKGVMENMDLVSKILEALLMR 530

Query: 966  GHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLDFVLERDIIIGFSIYDKV 1145
            GHVEEALGRIDLLM NGCE D DHLLS+LCEKEK IAAL+LLDFVLE+DIII FS YDKV
Sbjct: 531  GHVEEALGRIDLLMQNGCEPDFDHLLSILCEKEKRIAALRLLDFVLEKDIIIDFSNYDKV 590

Query: 1146 LDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEGNTKQADVLSRMIKEKEG 1325
            LD LLAAGKTLNA+S+LCKIL K GATDWSSRD LIKSLNQEGNTKQAD+LSRMIKEK  
Sbjct: 591  LDTLLAAGKTLNAYSVLCKILAKGGATDWSSRDLLIKSLNQEGNTKQADILSRMIKEKVA 650

Query: 1326 SPLKREGKKKATHAT 1370
            S  K+EGKKKA+ AT
Sbjct: 651  SSPKKEGKKKASRAT 665


>XP_010111755.1 hypothetical protein L484_008414 [Morus notabilis] EXC31617.1
            hypothetical protein L484_008414 [Morus notabilis]
          Length = 768

 Score =  668 bits (1723), Expect = 0.0
 Identities = 336/449 (74%), Positives = 387/449 (86%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E +FVEMKGRNIAP VISYTTM+KG V+ GR+D  L +FEEMK+ GIKPNAVT++TLLPG
Sbjct: 317  EKMFVEMKGRNIAPTVISYTTMIKGYVSIGRVDDGLRLFEEMKSFGIKPNAVTYTTLLPG 376

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA+K+ EAR +L EMV+RY+APKDN++F++LL+SQCK GDLD AADVLKAMIRLSIPT
Sbjct: 377  LCDAEKMSEARTMLKEMVDRYIAPKDNSIFLRLLSSQCKVGDLDAAADVLKAMIRLSIPT 436

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYG+LIENFCKA V+DRA              RPQSS E+E SAYN MI++LC+HGQ
Sbjct: 437  EAGHYGILIENFCKAAVYDRAVKLLDKLIEKEIVLRPQSSTEMEASAYNAMIQFLCNHGQ 496

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAE  FRQLMKKGV D VAFNNLIRGH+KEGNPDSA EI+KIMGRRGV RDADSY+LL
Sbjct: 497  TGKAEIFFRQLMKKGVQDPVAFNNLIRGHSKEGNPDSAFEILKIMGRRGVARDADSYRLL 556

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            I+SYL KGEPADAKT LDSM+E+ HLPESSL+RSVMESL+EDGR QTASRVMKSM+EKGV
Sbjct: 557  IKSYLSKGEPADAKTALDSMIENDHLPESSLFRSVMESLYEDGRAQTASRVMKSMIEKGV 616

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KENMDLV+KILEALL+RGHVEEALGRIDLLM +GC  + D LLSVLCEK KTIAALKLLD
Sbjct: 617  KENMDLVAKILEALLVRGHVEEALGRIDLLMQSGCAPNFDSLLSVLCEKGKTIAALKLLD 676

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            F LERD ++ FS YDKVLDALLAAGKTLNA+SILCKI+ K G TDWS  ++LIKSLN+EG
Sbjct: 677  FCLERDYVVDFSSYDKVLDALLAAGKTLNAYSILCKIMGKGGVTDWSGCEDLIKSLNKEG 736

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKAT 1361
            NTKQAD++SRMIK  + +   R+GK+KA+
Sbjct: 737  NTKQADIISRMIKGGQEASGSRKGKRKAS 765


>EOY04385.1 Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao]
          Length = 743

 Score =  659 bits (1701), Expect = 0.0
 Identities = 338/452 (74%), Positives = 383/452 (84%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKG+N+AP VISYTTM+KG VA  ++D  L + EEMK+ GIKPNA T+STLLPG
Sbjct: 292  EKLFVEMKGKNLAPTVISYTTMIKGYVAVEQVDDGLRLLEEMKSFGIKPNATTYSTLLPG 351

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA K+ EA+++L EMVE Y+APKDN++F+ LL SQCKSGDLD AADVLKAMIRLSIPT
Sbjct: 352  LCDAGKMTEAKSILKEMVEWYIAPKDNSIFINLLNSQCKSGDLDAAADVLKAMIRLSIPT 411

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIENFCKAN+FDRA              RPQ+S ++E SAYN MI+YLC HGQ
Sbjct: 412  EAGHYGVLIENFCKANLFDRAIKLLDKLVEKEIILRPQNSLDMEASAYNAMIQYLCHHGQ 471

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAE  FRQLMKKGVLD  AFNNLIRGHAKEGNP  A EI+KIMGRRGVP+DAD+YKLL
Sbjct: 472  TGKAEVFFRQLMKKGVLDPTAFNNLIRGHAKEGNPGLAFEILKIMGRRGVPKDADAYKLL 531

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LDSM+E G LPES +++SVMESLFEDGR+QTASRVMKSMVEKGV
Sbjct: 532  IESYLRKGEPADAKTSLDSMIEDGLLPESGIFKSVMESLFEDGRIQTASRVMKSMVEKGV 591

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KE+MDLV+KILEALLMRGHVEEALGRI+LLM NGC  +LD LLSVL EK KTIAALKLLD
Sbjct: 592  KEHMDLVAKILEALLMRGHVEEALGRIELLMQNGCAPNLDSLLSVLSEKGKTIAALKLLD 651

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            F LERD  I FS Y+KVLDALLAAGKTLNA+SILCKI+EK G T+WSS ++LIKSLNQEG
Sbjct: 652  FGLERDCSIDFSSYEKVLDALLAAGKTLNAYSILCKIMEKGGITNWSSLEDLIKSLNQEG 711

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQAD+LSRMIK  E +   ++GKK+AT A+
Sbjct: 712  NTKQADILSRMIKGGEAASGSKKGKKQATVAS 743


>XP_007033459.2 PREDICTED: pentatricopeptide repeat-containing protein At2g37230
            [Theobroma cacao]
          Length = 743

 Score =  658 bits (1698), Expect = 0.0
 Identities = 338/452 (74%), Positives = 383/452 (84%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVEMKG+N+AP VISYTTM+KG VA  ++D  L + EEMK+ GIKPNA T+STLLPG
Sbjct: 292  EKLFVEMKGKNLAPTVISYTTMIKGYVAVEQVDDGLRLLEEMKSFGIKPNATTYSTLLPG 351

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LCDA K+ EA+++L EMVE Y+APKDN++F+ LL SQCKSGDLD AADVLKAMIRLSIPT
Sbjct: 352  LCDAGKMTEAKSILKEMVEWYIAPKDNSIFINLLNSQCKSGDLDAAADVLKAMIRLSIPT 411

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            EAGHYGVLIENFCKAN+FDRA              RPQ+S ++E SAYN MI+YLC HGQ
Sbjct: 412  EAGHYGVLIENFCKANLFDRAIKLLDKLVEKEIILRPQNSLDMEASAYNAMIQYLCHHGQ 471

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            TGKAE  FRQLMKKGVLDS AFNNLIRGHAKEGNP  A EI+KIMGRRGVP+DAD+YKLL
Sbjct: 472  TGKAEVFFRQLMKKGVLDSTAFNNLIRGHAKEGNPGLAFEILKIMGRRGVPKDADAYKLL 531

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            IESYL KGEPADAKT LDSM+E   LPES +++SVMESLFEDGR+QTASRVMKSMVEKGV
Sbjct: 532  IESYLRKGEPADAKTSLDSMIEDRLLPESGIFKSVMESLFEDGRIQTASRVMKSMVEKGV 591

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KE+MDLV+KILEALLMRGHVEEALGRI+LLM NGC  +LD LLSVL EK KTIAALKLLD
Sbjct: 592  KEHMDLVAKILEALLMRGHVEEALGRIELLMQNGCAPNLDSLLSVLSEKGKTIAALKLLD 651

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            F LERD  I FS Y+KVLDALLAAGKTLNA+SILCKI+EK G T+WSS ++LIKSLNQEG
Sbjct: 652  FGLERDCSIDFSSYEKVLDALLAAGKTLNAYSILCKIMEKGGITNWSSLEDLIKSLNQEG 711

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQAD+LSRMIK  E +   ++GKK+AT A+
Sbjct: 712  NTKQADILSRMIKGGEAASGSKKGKKQATVAS 743


>XP_008338950.1 PREDICTED: pentatricopeptide repeat-containing protein At2g37230-like
            [Malus domestica]
          Length = 763

 Score =  657 bits (1694), Expect = 0.0
 Identities = 340/452 (75%), Positives = 377/452 (83%)
 Frame = +3

Query: 15   ENLFVEMKGRNIAPNVISYTTMLKGSVAAGRIDRALEVFEEMKACGIKPNAVTFSTLLPG 194
            E LFVE+KGRNI PNVI YTTM+KG V  GR+D AL +F+EMK+ GIKPNAVTFSTLLPG
Sbjct: 312  EQLFVELKGRNIEPNVICYTTMIKGYVDVGRVDDALRLFQEMKSFGIKPNAVTFSTLLPG 371

Query: 195  LCDADKVVEARNVLGEMVERYVAPKDNAVFMKLLTSQCKSGDLDGAADVLKAMIRLSIPT 374
            LC+A+K  EA N+L EMV+RY+APKDNA+F KLLT  CKSGDLD AADVLKAMIRLSIPT
Sbjct: 372  LCEAEKKDEAVNMLKEMVQRYIAPKDNAIFEKLLTLMCKSGDLDSAADVLKAMIRLSIPT 431

Query: 375  EAGHYGVLIENFCKANVFDRAXXXXXXXXXXXXXXRPQSSYELEPSAYNPMIKYLCDHGQ 554
            E GHYG+LIENFCKA V+DRA              RPQSS ELE SAYNPMI++LC+HGQ
Sbjct: 432  EPGHYGILIENFCKAGVYDRAIKLLDKLIEKEIILRPQSSIELEASAYNPMIEHLCNHGQ 491

Query: 555  TGKAETLFRQLMKKGVLDSVAFNNLIRGHAKEGNPDSALEIVKIMGRRGVPRDADSYKLL 734
            T KAE  FRQLMKKGV DSVAFNNL+ GHAKEGN DSA EI++IMGRRGVP +ADSY+LL
Sbjct: 492  TEKAEVFFRQLMKKGVQDSVAFNNLMCGHAKEGNSDSAFEILRIMGRRGVPGEADSYRLL 551

Query: 735  IESYLGKGEPADAKTVLDSMLESGHLPESSLYRSVMESLFEDGRVQTASRVMKSMVEKGV 914
            I SYL KGEPADAKT LDSMLESGH+PES L+RSV+ESLFEDGRVQTASRVMKSMVEKGV
Sbjct: 552  INSYLSKGEPADAKTALDSMLESGHIPESPLFRSVLESLFEDGRVQTASRVMKSMVEKGV 611

Query: 915  KENMDLVSKILEALLMRGHVEEALGRIDLLMLNGCELDLDHLLSVLCEKEKTIAALKLLD 1094
            KENMDLV+KILEAL MRGHVEEALGRIDLLM +GC    D LLSVL EK KTI ALKLLD
Sbjct: 612  KENMDLVAKILEALFMRGHVEEALGRIDLLMQSGCTPQFDSLLSVLAEKGKTIGALKLLD 671

Query: 1095 FVLERDIIIGFSIYDKVLDALLAAGKTLNAHSILCKILEKRGATDWSSRDELIKSLNQEG 1274
            F LERD  + FS YDKVLDALL AGKTLNA+SILCKI+EK  A+DWSS  +LIKSLNQEG
Sbjct: 672  FCLERDCSVDFSSYDKVLDALLEAGKTLNAYSILCKIMEKGEASDWSSTKDLIKSLNQEG 731

Query: 1275 NTKQADVLSRMIKEKEGSPLKREGKKKATHAT 1370
            NTKQAD+LSRMIK  E S   ++GKK+A  A+
Sbjct: 732  NTKQADILSRMIKGGEKSGQSKKGKKEAVVAS 763


Top