BLASTX nr result

ID: Akebia27_contig00023973 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00023973
         (2535 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007052035.1| Tetratricopeptide repeat (TPR)-like superfam...   488   e-135
emb|CBI38862.3| unnamed protein product [Vitis vinifera]              485   e-134
ref|XP_002273719.1| PREDICTED: pentatricopeptide repeat-containi...   485   e-134
gb|EXB52663.1| hypothetical protein L484_022440 [Morus notabilis]     469   e-129
ref|XP_004134345.1| PREDICTED: pentatricopeptide repeat-containi...   461   e-126
ref|XP_006445236.1| hypothetical protein CICLE_v10020287mg [Citr...   460   e-126
ref|XP_004306911.1| PREDICTED: pentatricopeptide repeat-containi...   456   e-125
ref|XP_002320730.1| hypothetical protein POPTR_0014s06610g [Popu...   453   e-124
ref|XP_006339440.1| PREDICTED: pentatricopeptide repeat-containi...   450   e-123
ref|XP_007220375.1| hypothetical protein PRUPE_ppa018787mg [Prun...   446   e-122
ref|XP_004229820.1| PREDICTED: pentatricopeptide repeat-containi...   445   e-122
ref|XP_003552343.1| PREDICTED: pentatricopeptide repeat-containi...   442   e-121
gb|AFK47264.1| unknown [Lotus japonicus]                              441   e-120
ref|XP_003623723.1| Pentatricopeptide repeat-containing protein ...   440   e-120
ref|XP_003533639.2| PREDICTED: pentatricopeptide repeat-containi...   439   e-120
ref|XP_004492640.1| PREDICTED: pentatricopeptide repeat-containi...   437   e-120
ref|XP_007140011.1| hypothetical protein PHAVU_008G076800g [Phas...   435   e-119
ref|XP_007140090.1| hypothetical protein PHAVU_008G083500g [Phas...   431   e-117
ref|XP_002892034.1| pentatricopeptide repeat-containing protein ...   430   e-117
ref|NP_171699.1| pentatricopeptide repeat-containing protein [Ar...   427   e-117

>ref|XP_007052035.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao] gi|508704296|gb|EOX96192.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein,
            putative [Theobroma cacao]
          Length = 420

 Score =  488 bits (1257), Expect = e-135
 Identities = 256/419 (61%), Positives = 312/419 (74%), Gaps = 17/419 (4%)
 Frame = +1

Query: 250  MVILACNMLSYTYPITPFFISTTK----------TPYNLSQKATVLQKFRI-----FASH 384
            MV  ACN+   +Y   PF   T K           P    +K       ++      AS 
Sbjct: 1    MVTSACNIPYCSYSTYPFINKTKKQIHPQSWGNRNPLLFQKKGAKFSSCKVNNQPEIASS 60

Query: 385  KLSLNGEQE--EERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKT 558
             +   G+ E  EE+ R+KWVEIG D+ E QK +I+ LP KM+KRCKALM+Q+ICF P+K 
Sbjct: 61   NVEEKGKPETNEEKRRYKWVEIGPDIAEEQKQAITELPFKMTKRCKALMKQIICFCPEKG 120

Query: 559  NLSRLLVSWVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIID 738
            +L+ LL +WVKIMKP RADWL VLK++  +EH L FEV E ALLEESFEAN+RD+TKII 
Sbjct: 121  SLADLLAAWVKIMKPRRADWLVVLKELKIMEHPLYFEVAELALLEESFEANIRDFTKIIH 180

Query: 739  GYGKQNRLLDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPL 918
            GYGKQ RL +AE  L AMK RGF  DQ+TLT +V MYSKAGNL  A+ETFEEIKLLG  L
Sbjct: 181  GYGKQKRLQEAENILVAMKRRGFICDQVTLTTMVHMYSKAGNLKLAEETFEEIKLLGQQL 240

Query: 919  DKRSYGSMIMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVF 1098
            DKRSYGSMIMAYIR+G  + GE+LL EM++Q++ AGSEVYKALLRAYS +GD NGAQRVF
Sbjct: 241  DKRSYGSMIMAYIRSGTPEQGEALLREMDSQEIYAGSEVYKALLRAYSMLGDANGAQRVF 300

Query: 1099 DAIQFAGIAPDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKE 1278
            D IQ AGI+PDAR+C LL+NAY +AGQSD A I FENMR+AGLEPSDKCVA++++AYEK+
Sbjct: 301  DTIQLAGISPDARMCGLLINAYQLAGQSDKAHIAFENMRRAGLEPSDKCVALVVAAYEKQ 360

Query: 1279 NKLNAALDLLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKEIKRKTRA 1455
            NKLN ALD LM+LERDGI+V +EAS +LA+WF++LGVV++VE VLRE+A KE   K  A
Sbjct: 361  NKLNKALDFLMELERDGIVVGKEASGILAQWFKKLGVVEQVELVLREFAAKETNSKVPA 419


>emb|CBI38862.3| unnamed protein product [Vitis vinifera]
          Length = 353

 Score =  485 bits (1249), Expect = e-134
 Identities = 240/342 (70%), Positives = 290/342 (84%)
 Frame = +1

Query: 409  EEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWV 588
            E E++R+KW+EIG ++TEAQK +IS +  KM+KRCKAL++Q+ICF P++ +LS LL +WV
Sbjct: 3    EGEKKRYKWIEIGPNITEAQKMTISQISLKMTKRCKALVKQIICFSPEERSLSDLLAAWV 62

Query: 589  KIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLD 768
            KIMKP RADWLSVLK++ +L+H LL EV E ALLEESFEAN+RDYTKIIDGYGKQNRL D
Sbjct: 63   KIMKPRRADWLSVLKELGRLDHPLLLEVAELALLEESFEANIRDYTKIIDGYGKQNRLQD 122

Query: 769  AETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIM 948
            AE  L AMK RGF  DQ+TLT ++ MYSKAGNL  A++TFEEIKLLG PLDKRSYGSMIM
Sbjct: 123  AENTLSAMKRRGFICDQVTLTAMINMYSKAGNLELAEKTFEEIKLLGHPLDKRSYGSMIM 182

Query: 949  AYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAP 1128
            AYIRAGM D GE L+ EMEA+++ AG EVYKALLRAYS+  D  GAQRVFDAIQFAGI+P
Sbjct: 183  AYIRAGMPDQGEILVKEMEAKEIYAGREVYKALLRAYSNTSDAEGAQRVFDAIQFAGISP 242

Query: 1129 DARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLL 1308
            D +LCALL+NAY VAGQ+  A + FENMR++GL+P+DK +A+ML+AYEKENKLN ALD L
Sbjct: 243  DVKLCALLINAYRVAGQTQKAHVAFENMRRSGLKPNDKSIALMLAAYEKENKLNKALDFL 302

Query: 1309 MDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKE 1434
            +DLERDGI++ +EASE+LA WF+RLGVV EVE VLREY+ KE
Sbjct: 303  IDLERDGIVLGKEASELLAAWFQRLGVVKEVELVLREYSAKE 344


>ref|XP_002273719.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Vitis vinifera]
          Length = 352

 Score =  485 bits (1249), Expect = e-134
 Identities = 240/342 (70%), Positives = 290/342 (84%)
 Frame = +1

Query: 409  EEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWV 588
            E E++R+KW+EIG ++TEAQK +IS +  KM+KRCKAL++Q+ICF P++ +LS LL +WV
Sbjct: 3    EGEKKRYKWIEIGPNITEAQKMTISQISLKMTKRCKALVKQIICFSPEERSLSDLLAAWV 62

Query: 589  KIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLD 768
            KIMKP RADWLSVLK++ +L+H LL EV E ALLEESFEAN+RDYTKIIDGYGKQNRL D
Sbjct: 63   KIMKPRRADWLSVLKELGRLDHPLLLEVAELALLEESFEANIRDYTKIIDGYGKQNRLQD 122

Query: 769  AETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIM 948
            AE  L AMK RGF  DQ+TLT ++ MYSKAGNL  A++TFEEIKLLG PLDKRSYGSMIM
Sbjct: 123  AENTLSAMKRRGFICDQVTLTAMINMYSKAGNLELAEKTFEEIKLLGHPLDKRSYGSMIM 182

Query: 949  AYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAP 1128
            AYIRAGM D GE L+ EMEA+++ AG EVYKALLRAYS+  D  GAQRVFDAIQFAGI+P
Sbjct: 183  AYIRAGMPDQGEILVKEMEAKEIYAGREVYKALLRAYSNTSDAEGAQRVFDAIQFAGISP 242

Query: 1129 DARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLL 1308
            D +LCALL+NAY VAGQ+  A + FENMR++GL+P+DK +A+ML+AYEKENKLN ALD L
Sbjct: 243  DVKLCALLINAYRVAGQTQKAHVAFENMRRSGLKPNDKSIALMLAAYEKENKLNKALDFL 302

Query: 1309 MDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKE 1434
            +DLERDGI++ +EASE+LA WF+RLGVV EVE VLREY+ KE
Sbjct: 303  IDLERDGIVLGKEASELLAAWFQRLGVVKEVELVLREYSAKE 344


>gb|EXB52663.1| hypothetical protein L484_022440 [Morus notabilis]
          Length = 406

 Score =  469 bits (1207), Expect = e-129
 Identities = 239/378 (63%), Positives = 288/378 (76%)
 Frame = +1

Query: 322  TPYNLSQKATVLQKFRIFASHKLSLNGEQEEERERFKWVEIGLDLTEAQKHSISLLPPKM 501
            TP N   +    ++  +  S + +   E    + +FKWVE+G  +TE+QK +IS L PKM
Sbjct: 28   TPTNFPSRNLHFRRPLVATSVEETEKAENGGGKPKFKWVEVGPGITESQKEAISQLSPKM 87

Query: 502  SKRCKALMEQLICFDPQKTNLSRLLVSWVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEF 681
            +KRC+ALM+QLICF   K +L+ LL +WV+IMKP RADWL+++KQ+  ++H L F+V E 
Sbjct: 88   TKRCRALMKQLICFSAHKASLNELLAAWVRIMKPQRADWLAIIKQLKIMDHPLYFQVAEV 147

Query: 682  ALLEESFEANVRDYTKIIDGYGKQNRLLDAETALQAMKERGFTIDQITLTVLVQMYSKAG 861
            ALLEESFEAN+RDYTKII  YGKQNRL DAE  L AMK RGF  DQ+TLT  + MYSKAG
Sbjct: 148  ALLEESFEANIRDYTKIIHCYGKQNRLEDAEKTLLAMKSRGFIRDQVTLTTFIHMYSKAG 207

Query: 862  NLNRAKETFEEIKLLGLPLDKRSYGSMIMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYK 1041
            NL  A+ETFEE+KLLG PLDKRSYGSMIMAYIRAGM D GE++L EM+ +++ AGSEVYK
Sbjct: 208  NLKLAEETFEELKLLGQPLDKRSYGSMIMAYIRAGMPDQGENILREMDVEEIYAGSEVYK 267

Query: 1042 ALLRAYSSIGDTNGAQRVFDAIQFAGIAPDARLCALLMNAYAVAGQSDGARIVFENMRKA 1221
            ALLRAYS  GD  GAQRVFDAIQ AGI PD RLC LL+NAY  +GQS+ A + F NMR+A
Sbjct: 268  ALLRAYSMTGDAEGAQRVFDAIQLAGILPDPRLCGLLINAYVESGQSEKACVAFGNMRRA 327

Query: 1222 GLEPSDKCVAVMLSAYEKENKLNAALDLLMDLERDGIMVCQEASEVLARWFRRLGVVDEV 1401
            GLEPSDKCVA++L AYEKENKL  ALD LM+LER GIMV +EASE L  WFR+LGVV EV
Sbjct: 328  GLEPSDKCVALVLCAYEKENKLQRALDFLMELERHGIMVGEEASETLVGWFRKLGVVKEV 387

Query: 1402 EHVLREYATKEIKRKTRA 1455
            + VLREYA+K    K RA
Sbjct: 388  DLVLREYASKGASSKIRA 405


>ref|XP_004134345.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Cucumis sativus] gi|449480346|ref|XP_004155867.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g01970-like [Cucumis sativus]
          Length = 404

 Score =  461 bits (1185), Expect = e-126
 Identities = 228/354 (64%), Positives = 282/354 (79%)
 Frame = +1

Query: 382  HKLSLNGEQEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTN 561
            HKL    E E E+ RF+WVE+G D+TE QK +IS LPPKM+KRCKA+M+Q+ICF PQK  
Sbjct: 55   HKL----ESEREKPRFRWVEVGYDITETQKQAISQLPPKMTKRCKAVMKQIICFSPQKGE 110

Query: 562  LSRLLVSWVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDG 741
            LS +L +WV+IMKP RADWL VLK +  L H L  +V E AL E +FEAN RDYTKII  
Sbjct: 111  LSDMLAAWVRIMKPERADWLLVLKHLRILNHPLYIQVAEAALEEITFEANTRDYTKIIHH 170

Query: 742  YGKQNRLLDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLD 921
            YGKQN+L DAE  L +M+ERGF  DQITLT ++ +YSKA  LN AK+TFEE+KLL  PLD
Sbjct: 171  YGKQNQLEDAEKVLLSMRERGFVCDQITLTTMIHIYSKADKLNLAKQTFEELKLLEQPLD 230

Query: 922  KRSYGSMIMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFD 1101
            KRS+G+MIMAY+RAG  + GE +L EM+A+D+ AGSEVYKALLRAYS +G+  GAQRVFD
Sbjct: 231  KRSFGAMIMAYVRAGFPEEGEKILKEMDAKDIYAGSEVYKALLRAYSMVGNAEGAQRVFD 290

Query: 1102 AIQFAGIAPDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKEN 1281
            AIQ A I PD +LC LL+NAY +AGQS  A+I F+NMR+AG+EPSDKC+A+ LSAYEKEN
Sbjct: 291  AIQLAAITPDEKLCGLLINAYLMAGQSREAQIAFDNMRRAGIEPSDKCIALALSAYEKEN 350

Query: 1282 KLNAALDLLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKEIKR 1443
            +LN+AL+LL+DLE+D +MV +EAS++LA W +RLGVV+EVE VLREY  KE+ R
Sbjct: 351  RLNSALELLIDLEKDNVMVGKEASKILAAWLKRLGVVEEVEIVLREYTEKEVNR 404


>ref|XP_006445236.1| hypothetical protein CICLE_v10020287mg [Citrus clementina]
            gi|568875716|ref|XP_006490938.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g01970-like [Citrus sinensis]
            gi|557547498|gb|ESR58476.1| hypothetical protein
            CICLE_v10020287mg [Citrus clementina]
          Length = 423

 Score =  460 bits (1183), Expect = e-126
 Identities = 229/346 (66%), Positives = 278/346 (80%)
 Frame = +1

Query: 409  EEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWV 588
            +++   F W++IG ++TE QK +IS  P KM+KRCKA ++Q+IC  P+  NLS LL +WV
Sbjct: 69   KDDTSMFTWIQIGPNITEEQKQAISQFPRKMTKRCKAFVKQIICVSPETGNLSDLLAAWV 128

Query: 589  KIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLD 768
            + MKP RADWL+VLKQ+  +EH L  +V E ALLEESFEAN+RDYTKII GYGK+ ++ +
Sbjct: 129  RFMKPRRADWLAVLKQLKLMEHPLYLQVAELALLEESFEANIRDYTKIIHGYGKKMQIQN 188

Query: 769  AETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIM 948
            AE  L AMK RGF  DQ+TLTV+V MYSKAGNL  A+ETFEEIKLLG PLDKRSYGSM+M
Sbjct: 189  AENTLLAMKRRGFICDQVTLTVMVVMYSKAGNLKMAEETFEEIKLLGEPLDKRSYGSMVM 248

Query: 949  AYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAP 1128
            AY+RAGM+D GE LL EM+AQ++  GSEVYKALLR YS  G++ GAQRVF+AIQFAGI P
Sbjct: 249  AYVRAGMLDRGEVLLREMDAQEVYVGSEVYKALLRGYSMNGNSEGAQRVFEAIQFAGITP 308

Query: 1129 DARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLL 1308
            DAR+CALL+NAY +AGQS  A   F+NMRKAGLEPSDKCVA++LSA EKEN+LN AL+ L
Sbjct: 309  DARMCALLINAYQMAGQSQKAYTAFQNMRKAGLEPSDKCVALILSACEKENQLNRALEFL 368

Query: 1309 MDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKEIKRK 1446
            +DLERDG MV +EAS  LA WF+RLGVV+EVEHVLREY  +E   K
Sbjct: 369  IDLERDGFMVGKEASCTLAAWFKRLGVVEEVEHVLREYGLRETYSK 414


>ref|XP_004306911.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Fragaria vesca subsp. vesca]
          Length = 415

 Score =  456 bits (1172), Expect = e-125
 Identities = 230/378 (60%), Positives = 292/378 (77%), Gaps = 2/378 (0%)
 Frame = +1

Query: 298  PFFISTT--KTPYNLSQKATVLQKFRIFASHKLSLNGEQEEERERFKWVEIGLDLTEAQK 471
            P F  TT   TP NLS          +  S + +   E  E + RFKW EIG D+TEAQ+
Sbjct: 27   PKFSVTTFRPTPINLSSSGHRFHPPLMALSIEETAMAENTEGKPRFKWGEIGSDITEAQQ 86

Query: 472  HSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWVKIMKPIRADWLSVLKQMSKLE 651
             +I  LPPKMSKRC+A+M+Q+ICF P+K +L  +L +WV IMKP RADWL+VLK++   +
Sbjct: 87   DAIDELPPKMSKRCQAIMKQIICFAPEKGSLCEVLNAWVSIMKPSRADWLAVLKELRIKD 146

Query: 652  HSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLDAETALQAMKERGFTIDQITLT 831
            H L  +V E A+L++SFE NVRDYTKII GYGK+NR+ DAE+ L  MK RGF  DQ+TLT
Sbjct: 147  HPLYLQVAEIAVLDDSFEPNVRDYTKIIHGYGKRNRIEDAESTLLNMKSRGFVCDQVTLT 206

Query: 832  VLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIMAYIRAGMVDHGESLLGEMEAQ 1011
             ++ MYSKAG+L  A++TFE+IKLLG  +DKR+YGSMIMAYIRAGM + GE++L EM+AQ
Sbjct: 207  AMIDMYSKAGHLKLAEDTFEDIKLLGQQVDKRAYGSMIMAYIRAGMPEQGETVLIEMDAQ 266

Query: 1012 DLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAPDARLCALLMNAYAVAGQSDGA 1191
            ++ AGSEVYKALLRAYS +GDT GAQRVF+A+Q AGI+PDA++C LL+NAY ++GQS  A
Sbjct: 267  EIVAGSEVYKALLRAYSMVGDTEGAQRVFNALQLAGISPDAKICGLLINAYGISGQSQKA 326

Query: 1192 RIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLLMDLERDGIMVCQEASEVLARW 1371
            R  FENMRKAGL+PSDKC+A+ML+AYEKENKL  AL  LM LER+GIMV +E +E LA W
Sbjct: 327  RAAFENMRKAGLKPSDKCIALMLAAYEKENKLQMALKFLMGLEREGIMVGKEVAETLAGW 386

Query: 1372 FRRLGVVDEVEHVLREYA 1425
            F++LGVV+EV+ VLRE+A
Sbjct: 387  FKKLGVVEEVDMVLREFA 404


>ref|XP_002320730.1| hypothetical protein POPTR_0014s06610g [Populus trichocarpa]
            gi|222861503|gb|EEE99045.1| hypothetical protein
            POPTR_0014s06610g [Populus trichocarpa]
          Length = 407

 Score =  453 bits (1166), Expect = e-124
 Identities = 237/405 (58%), Positives = 295/405 (72%), Gaps = 13/405 (3%)
 Frame = +1

Query: 250  MVILACNMLSYTYPITPFFISTTKT-------------PYNLSQKATVLQKFRIFASHKL 390
            M     N+L ++ P  P      KT             P  L+   + +Q      + + 
Sbjct: 1    MATYVINILPFSSPTCPLHSEPKKTSNLHFLGNSLCQQPVTLTSCKSQIQPVLAAINVEE 60

Query: 391  SLNGEQEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSR 570
             + GE  +E+ +F+WVEIG ++ E QK +IS LP KM+KRCKALM Q+ICF+ +K +L  
Sbjct: 61   KVEGEIGKEKPKFRWVEIGPNIPEEQKQAISQLPFKMTKRCKALMRQIICFNDKKGSLRG 120

Query: 571  LLVSWVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGK 750
            LL +WVKIMKP R DWLS+LK+++K+EH L  EV E ALLEESFEANVRDYTKII  YG 
Sbjct: 121  LLSAWVKIMKPRRKDWLSILKELNKMEHPLYLEVVEIALLEESFEANVRDYTKIIHFYGM 180

Query: 751  QNRLLDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRS 930
             N+L +AE    AM+ERGF  DQ+TLT ++ MYSK GNL  A+ETFEE+KLLG PLD+RS
Sbjct: 181  NNQLEEAERTRLAMEERGFVSDQVTLTAMIHMYSKGGNLTLAEETFEELKLLGQPLDRRS 240

Query: 931  YGSMIMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQ 1110
            YGSMIMAYIRAGM + GE +L EM+AQ++ AGSEVYKALLRAYS IGD +GAQRVFDAIQ
Sbjct: 241  YGSMIMAYIRAGMPEKGEMILREMDAQEIRAGSEVYKALLRAYSIIGDADGAQRVFDAIQ 300

Query: 1111 FAGIAPDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLN 1290
             AGI PD R CA+L+NAY +AGQS  A   FENM +AG+EP+D+CVA++L+AYEKENKLN
Sbjct: 301  LAGIPPDDRTCAVLLNAYGMAGQSQNAYATFENMWRAGIEPTDRCVALVLAAYEKENKLN 360

Query: 1291 AALDLLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYA 1425
             ALD L+ LER+ +++ +EASEVLA WF RLGVV EVE VLREYA
Sbjct: 361  QALDFLIGLEREKLIIGKEASEVLAEWFGRLGVVKEVELVLREYA 405


>ref|XP_006339440.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Solanum tuberosum]
          Length = 415

 Score =  450 bits (1157), Expect = e-123
 Identities = 219/351 (62%), Positives = 278/351 (79%)
 Frame = +1

Query: 376  ASHKLSLNGEQEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQK 555
            A H+      Q  ++ R+KWV+IG D+TE Q+ +I  LPPKM  RCKALM+Q+IC+ P+K
Sbjct: 59   AVHQKGSAENQVNDKPRYKWVKIGSDVTEEQQRAILKLPPKMINRCKALMQQIICYSPEK 118

Query: 556  TNLSRLLVSWVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKII 735
             ++S LL +WVK MKP RADWL+VLK++ +L H +  EV E +LL ESFEAN+RDYTKII
Sbjct: 119  GSVSLLLEAWVKSMKPERADWLAVLKELDRLNHPMYLEVAELSLLAESFEANIRDYTKII 178

Query: 736  DGYGKQNRLLDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLP 915
             GY KQNRL +AE+   +MK RGFT DQ+TLT LV MYSKAGNL  A++TFEE++LLG+P
Sbjct: 179  HGYAKQNRLKEAESVFLSMKSRGFTCDQVTLTALVHMYSKAGNLKLAEDTFEEMRLLGVP 238

Query: 916  LDKRSYGSMIMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRV 1095
            LDKRS+GS+IMAY+RAG +  GE+LL EME Q++ AG EVYKALLRAYS  GD+ GAQRV
Sbjct: 239  LDKRSFGSIIMAYVRAGKLGQGEALLKEMEEQEIYAGPEVYKALLRAYSMSGDSKGAQRV 298

Query: 1096 FDAIQFAGIAPDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEK 1275
            FD  Q AG+ PDA +C LLMNAY +AGQ   A I FENMR+ G++P+DKC+ ++L AYE 
Sbjct: 299  FDTTQLAGVIPDATICGLLMNAYIMAGQLSEACITFENMRRVGIKPNDKCITLLLKAYET 358

Query: 1276 ENKLNAALDLLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            ENKL+ ALD+LMDLERDG+++ +EASE+LARWF+RLGVV EVE VLR+YA+
Sbjct: 359  ENKLSKALDVLMDLERDGVVLGREASELLARWFKRLGVVGEVELVLRDYAS 409


>ref|XP_007220375.1| hypothetical protein PRUPE_ppa018787mg [Prunus persica]
            gi|462416837|gb|EMJ21574.1| hypothetical protein
            PRUPE_ppa018787mg [Prunus persica]
          Length = 377

 Score =  446 bits (1146), Expect = e-122
 Identities = 224/360 (62%), Positives = 280/360 (77%)
 Frame = +1

Query: 376  ASHKLSLNGEQEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQK 555
            AS + +   E ++ + RFK   +  ++TEAQK +I+ LP  M+KRCKALM QLIC+ PQK
Sbjct: 17   ASVEETAQTESKDGKPRFKLDAVDPEITEAQKQAIAQLPYHMAKRCKALMRQLICYSPQK 76

Query: 556  TNLSRLLVSWVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKII 735
             +L  LL +WV+ MKP RA WL+VLK++   +H L  +V E A+LEESFE N+RDYTKII
Sbjct: 77   GSLCELLAAWVRAMKPSRAHWLAVLKELRIKDHPLYLQVAEIAVLEESFEVNLRDYTKII 136

Query: 736  DGYGKQNRLLDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLP 915
             GYGKQNR+ +A   L  MK RGF  DQ+TLT ++ MYSKAG++  A+ETFEEIKLLG P
Sbjct: 137  HGYGKQNRIEEAVKILSNMKARGFICDQVTLTAMIDMYSKAGHVKLAEETFEEIKLLGQP 196

Query: 916  LDKRSYGSMIMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRV 1095
            LDKRSYGSMIMAYIRAG+ D GESLL EM+AQ++ AGSEVYKALLRAYS +GDT GAQRV
Sbjct: 197  LDKRSYGSMIMAYIRAGVPDQGESLLIEMDAQEIYAGSEVYKALLRAYSMVGDTEGAQRV 256

Query: 1096 FDAIQFAGIAPDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEK 1275
            F+A+Q AGI+PDA+LC LL+NAY V+GQS  AR+ FENMR AG+ P+DKC+A++L+AYEK
Sbjct: 257  FNAVQLAGISPDAKLCGLLINAYGVSGQSQKARVAFENMRTAGIRPTDKCIALVLAAYEK 316

Query: 1276 ENKLNAALDLLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKEIKRKTRA 1455
            ENKL  AL  LM LERDGIMV +EA+E LA WFR+LGVV+EV+ +LRE+A  E   +  A
Sbjct: 317  ENKLQKALKFLMALERDGIMVGKEAAETLAAWFRKLGVVEEVDTILREFAETEANSRVPA 376


>ref|XP_004229820.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Solanum lycopersicum]
          Length = 415

 Score =  445 bits (1145), Expect = e-122
 Identities = 216/341 (63%), Positives = 274/341 (80%)
 Frame = +1

Query: 406  QEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSW 585
            Q  ++ R++WV+IG D+TE Q+ +I  LPPKM  RCKALM+Q+IC+ P+K ++S LL +W
Sbjct: 69   QVNDKPRYRWVKIGSDVTEEQQRAILKLPPKMINRCKALMQQIICYSPEKGSVSLLLEAW 128

Query: 586  VKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLL 765
            VK MKP RADWL+VLK++ +L H +  EV E +LL ESFEAN+RDYTKII GY KQNRL 
Sbjct: 129  VKSMKPDRADWLAVLKELDRLNHPMYLEVAELSLLAESFEANIRDYTKIIHGYAKQNRLK 188

Query: 766  DAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMI 945
            +AE+   +MK RGFT DQ+TLT LV MYSKA NL  A++TFEE++LLG+PLDKRS+GS+I
Sbjct: 189  EAESVFLSMKSRGFTCDQVTLTALVHMYSKASNLKLAEDTFEEMRLLGVPLDKRSFGSII 248

Query: 946  MAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIA 1125
            MAY+RAG +  GE+LL EME Q+  AG EVYKALLRAYS  GD+ GAQRVFD IQ AG+ 
Sbjct: 249  MAYVRAGKLGQGEALLKEMEEQETYAGPEVYKALLRAYSMSGDSKGAQRVFDTIQLAGVI 308

Query: 1126 PDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDL 1305
            PDA +C LLMNAY +AGQ     I FENMR+ G++P+DKC+ ++L+AYE ENKL+ ALD+
Sbjct: 309  PDATICGLLMNAYIMAGQLSETCIAFENMRRVGIKPNDKCITLLLTAYETENKLSKALDV 368

Query: 1306 LMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            LMDLERDGI++ +EASE+LARWF+RLGVV EVE VLR+YA+
Sbjct: 369  LMDLERDGIVLGREASELLARWFKRLGVVGEVELVLRDYAS 409


>ref|XP_003552343.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            isoform X1 [Glycine max] gi|571548118|ref|XP_006602756.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g01970-like isoform X2 [Glycine max]
          Length = 414

 Score =  442 bits (1138), Expect = e-121
 Identities = 219/341 (64%), Positives = 275/341 (80%)
 Frame = +1

Query: 406  QEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSW 585
            +E+   R++W+E+G ++ + Q+ +IS LP +M+ RCKALM Q+IC+  +K ++S LL SW
Sbjct: 69   KEDNDRRYRWIEVGKNVPKEQQQAISKLPFRMADRCKALMRQIICYSAEKGSMSDLLRSW 128

Query: 586  VKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLL 765
            VK+MKP RADWLSVLK++   EH +  EV + AL+EESFE N+RDYTKII  YG+ N L 
Sbjct: 129  VKLMKPTRADWLSVLKELKIREHPVYLEVAKHALMEESFEVNIRDYTKIIHYYGEHNLLE 188

Query: 766  DAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMI 945
            DAE  L  MK+RGF  DQ+ LT +V MYSKAGN +RAKE FEEIKLLG PLDKRSYGSMI
Sbjct: 189  DAEKFLTLMKQRGFIYDQVILTTMVHMYSKAGNHDRAKEYFEEIKLLGKPLDKRSYGSMI 248

Query: 946  MAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIA 1125
            MAYIRAGM + GE+LL EMEAQ++ AGSEVYKALLRAYS IG+  GAQRVFDAIQ AGI 
Sbjct: 249  MAYIRAGMPEEGENLLQEMEAQEILAGSEVYKALLRAYSMIGNAEGAQRVFDAIQLAGIT 308

Query: 1126 PDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDL 1305
            PD ++C+L++NAY +AGQS  A I FENMR+AG++PSDKC+A +L AYEKE+K+N AL+ 
Sbjct: 309  PDDKICSLVVNAYVMAGQSQKALIAFENMRRAGIKPSDKCIASVLVAYEKESKINTALEF 368

Query: 1306 LMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            L+DLERDGIMV +EAS VLA+WFR+LGVV+EVE VLR++ T
Sbjct: 369  LIDLERDGIMVEEEASAVLAKWFRKLGVVEEVELVLRDFVT 409


>gb|AFK47264.1| unknown [Lotus japonicus]
          Length = 414

 Score =  441 bits (1133), Expect = e-120
 Identities = 218/341 (63%), Positives = 274/341 (80%)
 Frame = +1

Query: 406  QEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSW 585
            ++E   RF+W EIG ++T  Q  +IS LP KM+KRCKALM Q+ICF  +K N+S LL +W
Sbjct: 69   KDENHRRFRWTEIGHNITHEQNEAISKLPFKMTKRCKALMRQIICFSAEKGNVSDLLNAW 128

Query: 586  VKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLL 765
            VKIMKPIRA+WLSVLK++  +EH L  EV E ALLEESFE N+RDYT II   GK N+L 
Sbjct: 129  VKIMKPIRAEWLSVLKELETMEHPLYLEVAEHALLEESFEVNIRDYTNIIHYCGKHNQLE 188

Query: 766  DAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMI 945
            +AE  L AMK+RGF  DQ+ LT +V +YSKAG+L+RA+E FEEI+LLG PLDKRSYGSMI
Sbjct: 189  EAENILTAMKQRGFICDQVILTTMVHIYSKAGHLDRAEEYFEEIRLLGEPLDKRSYGSMI 248

Query: 946  MAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIA 1125
             AYIRAGM + GESLL EM+A+++ AGSEVYKALLRAYS IG+  GAQRVFDAIQ AGI 
Sbjct: 249  TAYIRAGMPERGESLLEEMDAREIYAGSEVYKALLRAYSRIGNAEGAQRVFDAIQLAGII 308

Query: 1126 PDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDL 1305
            PD ++C L+  AY +AGQS+ ARI FENM++AG+EP+D+C+  +L AYEKE+KLN AL+ 
Sbjct: 309  PDDKICGLVTKAYGMAGQSEKARIAFENMKRAGIEPTDRCIGSVLVAYEKESKLNTALEF 368

Query: 1306 LMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            L+DLE++GIMV +EAS +LA WFR+LGVV+EVE VLR+++T
Sbjct: 369  LIDLEKEGIMVGEEASAILAGWFRKLGVVEEVELVLRDFST 409


>ref|XP_003623723.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355498738|gb|AES79941.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 426

 Score =  440 bits (1131), Expect = e-120
 Identities = 232/397 (58%), Positives = 295/397 (74%), Gaps = 7/397 (1%)
 Frame = +1

Query: 259  LACNMLSYTYPITPFFISTTKTPYNLSQKATVLQKFRI-FASHKLSLNGEQ------EEE 417
            L CN  +  Y IT  +    K   +LS+K + L   +  F S  +S+  E+      E  
Sbjct: 24   LVCNFYNPNYSITKLYQIHNKRN-SLSKKPSYLDIHKHHFDSVLVSVGTEEIVEEVIEGS 82

Query: 418  RERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWVKIM 597
             ++F+W EI  D+TE QK +I+ LP +M KRCKA+M Q+ICF  +K  L  +L +WV+IM
Sbjct: 83   YKKFRWNEIRNDITEEQKQAIAKLPFRMEKRCKAVMRQIICFSEEKGRLCDVLRAWVEIM 142

Query: 598  KPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLDAET 777
            KP RADWLSVLK++  ++H L  EV E AL+EESFE N+RDYTK+I  Y K+N+L  AE 
Sbjct: 143  KPTRADWLSVLKELKNMDHPLYLEVAEHALVEESFEPNLRDYTKLIHYYSKENQLEAAEN 202

Query: 778  ALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIMAYI 957
                MK+RGF  DQ+ LT +V MYSKAG+L+RA+E FEEIKLLG PLDKRSYGSMIMAYI
Sbjct: 203  IFTLMKQRGFICDQVILTTMVHMYSKAGHLDRAEEYFEEIKLLGEPLDKRSYGSMIMAYI 262

Query: 958  RAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAPDAR 1137
            RAGM + GESLL EM+AQD+ AGSEVYKALLRAYS IG+  GAQRVFDAIQ AGI PD +
Sbjct: 263  RAGMPEKGESLLEEMDAQDIYAGSEVYKALLRAYSVIGNAEGAQRVFDAIQLAGIIPDDK 322

Query: 1138 LCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLLMDL 1317
            +C+LL+ AY++AGQS  ARI FENM++AG+EP+DKC++ +L AYEKEN LN AL+ L++L
Sbjct: 323  MCSLLIYAYSMAGQSQKARIAFENMKRAGIEPTDKCISSVLVAYEKENMLNTALEFLIEL 382

Query: 1318 ERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            ERDGIMV +E S +LA WFR+LGVV+EVE VLR++AT
Sbjct: 383  ERDGIMVKEETSRILAGWFRKLGVVEEVELVLRDFAT 419


>ref|XP_003533639.2| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            [Glycine max]
          Length = 415

 Score =  439 bits (1130), Expect = e-120
 Identities = 220/344 (63%), Positives = 273/344 (79%)
 Frame = +1

Query: 406  QEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSW 585
            +E    R++W+E+G ++T+ Q+ +IS LP +M+ R KALM Q+ICF  +K  +S LL SW
Sbjct: 69   KEGNERRYRWIEVGKNVTKEQQQAISKLPFRMADRSKALMRQIICFSAEKGTISDLLRSW 128

Query: 586  VKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLL 765
            VK+MKP RADWLSVLK++   EH    EV +  LLEESFE N+RDYTKII  YG+ N L 
Sbjct: 129  VKLMKPTRADWLSVLKELRTTEHPFYLEVAKHTLLEESFEVNIRDYTKIIHYYGEHNLLE 188

Query: 766  DAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMI 945
            DAE  L  MK+RGF  DQ+ LT +V M SKAGN +RAKE FEEIKLLG PLDKRSYGSMI
Sbjct: 189  DAEKFLTLMKQRGFIYDQVILTTMVHMSSKAGNHDRAKEYFEEIKLLGEPLDKRSYGSMI 248

Query: 946  MAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIA 1125
            MAYIRAGM + GE+LL +MEAQ++ AGSE+YKALLRAYS IG+  GAQRVFDAIQ AGI 
Sbjct: 249  MAYIRAGMPEEGENLLQQMEAQEILAGSEIYKALLRAYSMIGNAEGAQRVFDAIQLAGIT 308

Query: 1126 PDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDL 1305
            PD ++C+LL+NAYA+AGQS  A I FENMR+AG++PSDKC+A +L AYEKE+K+N AL+ 
Sbjct: 309  PDDKICSLLVNAYAMAGQSQKALIAFENMRRAGIKPSDKCIASVLVAYEKESKINTALEF 368

Query: 1306 LMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKEI 1437
            L+DLERDGIMV +EAS VLA+WFR+LGVV+EVE VLR++A  E+
Sbjct: 369  LIDLERDGIMVGEEASAVLAKWFRKLGVVEEVELVLRDFAIIEV 412


>ref|XP_004492640.1| PREDICTED: pentatricopeptide repeat-containing protein At1g01970-like
            isoform X1 [Cicer arietinum]
            gi|502104764|ref|XP_004492641.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g01970-like isoform X2 [Cicer arietinum]
          Length = 425

 Score =  437 bits (1125), Expect = e-120
 Identities = 220/340 (64%), Positives = 271/340 (79%)
 Frame = +1

Query: 409  EEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWV 588
            E   +RF+WVEI  D+TE QK++I+ LP KM KRCKA+M Q+ICF  +K NL  +L +WV
Sbjct: 81   EGNNKRFRWVEIRNDITEEQKNAIAKLPFKMIKRCKAVMRQIICFSAEKGNLCDVLGAWV 140

Query: 589  KIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLD 768
            KIMKP RADWLSVLK++  ++H L  EV E ALLEESFE N+RDYTK+I  Y K+N+L  
Sbjct: 141  KIMKPTRADWLSVLKELKNMDHPLHLEVAEHALLEESFEPNLRDYTKLIHYYSKENQLEA 200

Query: 769  AETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIM 948
            AE     MK+RGF  DQ+ LT +V MYSKAG+L+RA+E FEEIKLLG  LDKRSYGSMIM
Sbjct: 201  AENIFTTMKQRGFICDQVILTTMVHMYSKAGHLDRAEEYFEEIKLLGEQLDKRSYGSMIM 260

Query: 949  AYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAP 1128
            AYIRAGM + GESLL EM+AQ++ AGSEVYKALLRAYS  G+  GAQRVFDAIQ AGI P
Sbjct: 261  AYIRAGMPEQGESLLEEMDAQEIYAGSEVYKALLRAYSGSGNAEGAQRVFDAIQLAGITP 320

Query: 1129 DARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLL 1308
            D ++C+LL+ AY +AGQS  A+I FENM+KAG+EP+DKC++++L AYEKEN L+ AL  L
Sbjct: 321  DDKMCSLLIYAYGMAGQSQKAQIAFENMKKAGIEPTDKCISLVLFAYEKENMLDTALAFL 380

Query: 1309 MDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            +DLERDGIMV +E S +LA WFR+LGVV+EVE VLR++AT
Sbjct: 381  IDLERDGIMVGEETSRILAGWFRKLGVVEEVELVLRDFAT 420


>ref|XP_007140011.1| hypothetical protein PHAVU_008G076800g [Phaseolus vulgaris]
            gi|593337615|ref|XP_007140012.1| hypothetical protein
            PHAVU_008G076800g [Phaseolus vulgaris]
            gi|561013144|gb|ESW12005.1| hypothetical protein
            PHAVU_008G076800g [Phaseolus vulgaris]
            gi|561013145|gb|ESW12006.1| hypothetical protein
            PHAVU_008G076800g [Phaseolus vulgaris]
          Length = 409

 Score =  435 bits (1119), Expect = e-119
 Identities = 216/340 (63%), Positives = 272/340 (80%)
 Frame = +1

Query: 409  EEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWV 588
            EE   RF+W+E+G ++T  Q+ +IS LP +MSKR KALM Q+ICF  +K  +S LL SWV
Sbjct: 65   EENERRFRWIEVGNNVTIEQRQAISELPFRMSKRSKALMRQIICFSAEKGTISDLLESWV 124

Query: 589  KIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLD 768
            +IM PIRADWLS+LK++S +EH L  EV ++AL EESFE N+RDYTKII  YGK N L D
Sbjct: 125  RIMNPIRADWLSILKELSIMEHPLYLEVAKYALQEESFEVNIRDYTKIIHYYGKHNLLED 184

Query: 769  AETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIM 948
            AE  L  MK+RGF  DQ+ LT +V MYSKAG  ++AKE FEEIK LG PLDKRSYGSMIM
Sbjct: 185  AENFLTLMKQRGFIYDQVILTTMVHMYSKAGRHDQAKEYFEEIKSLGEPLDKRSYGSMIM 244

Query: 949  AYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAP 1128
            AYIRAGM + GE+LL EMEAQ++ AGSEVYKALLR+YS IG+  GAQRVFDAIQ AGI P
Sbjct: 245  AYIRAGMPEEGENLLQEMEAQEITAGSEVYKALLRSYSMIGNAEGAQRVFDAIQLAGITP 304

Query: 1129 DARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLL 1308
            + ++C+L++NAYA+AGQS  A I FENMR+A ++P+DKC+A +L AYEKE+K+N AL+ L
Sbjct: 305  NDKMCSLVVNAYAMAGQSQKALIAFENMRRASIKPTDKCIASVLVAYEKESKINTALEFL 364

Query: 1309 MDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            +DLE+DG  + +EAS VLA+WFR+LGVV+EVE +LR++AT
Sbjct: 365  LDLEKDGNKIGKEASAVLAKWFRKLGVVEEVELILRDFAT 404


>ref|XP_007140090.1| hypothetical protein PHAVU_008G083500g [Phaseolus vulgaris]
            gi|561013223|gb|ESW12084.1| hypothetical protein
            PHAVU_008G083500g [Phaseolus vulgaris]
          Length = 409

 Score =  431 bits (1107), Expect = e-117
 Identities = 214/340 (62%), Positives = 271/340 (79%)
 Frame = +1

Query: 409  EEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVSWV 588
            EE   RF+W+E+G ++T  Q+ +IS LP +MSKR KALM Q+ICF  +K  +S LL SWV
Sbjct: 65   EENERRFRWIEVGKNVTIEQRQAISELPFRMSKRSKALMRQIICFSAEKGTISDLLESWV 124

Query: 589  KIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRLLD 768
            +IM PIRADWLSVLK++  +EH L  EV + A  EESF+ N+RDYTKII  YGK N L D
Sbjct: 125  RIMNPIRADWLSVLKELRIMEHPLYLEVAKHAFQEESFDVNIRDYTKIIHYYGKHNLLED 184

Query: 769  AETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSMIM 948
            AE  L  MK+RGF  DQ+ LT +V MYSKAG  ++AKE FEEIK LG PLDKRSYGS+IM
Sbjct: 185  AENFLTLMKQRGFIYDQVILTTMVHMYSKAGRHDQAKEYFEEIKSLGEPLDKRSYGSIIM 244

Query: 949  AYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGIAP 1128
            AYIRAGM + GE++L EMEAQ++ A SEVYKALLRAYS IG+  GAQRVFDAIQ AGI P
Sbjct: 245  AYIRAGMPEEGENVLEEMEAQEITASSEVYKALLRAYSMIGNAEGAQRVFDAIQLAGITP 304

Query: 1129 DARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALDLL 1308
            + ++C+L++NAYA+AGQS  A I FENMR+A ++P+DKC+A +L AYEK++K+NAAL+ L
Sbjct: 305  NDKMCSLVINAYAMAGQSQKALIAFENMRRASIKPTDKCIASVLVAYEKQSKINAALEFL 364

Query: 1309 MDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYAT 1428
            +DLE+DGIMV +EAS VLA+WF++LGVV+EVE +LR++AT
Sbjct: 365  LDLEKDGIMVGEEASAVLAKWFQKLGVVEEVELILRDFAT 404


>ref|XP_002892034.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297337876|gb|EFH68293.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 409

 Score =  430 bits (1106), Expect = e-117
 Identities = 212/344 (61%), Positives = 271/344 (78%)
 Frame = +1

Query: 403  EQEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVS 582
            E  E+  R  WV++GLDLTE Q  +I+ +P KMSKRC+ALM Q+ICF  +K +   LL +
Sbjct: 62   EDTEQIPRSNWVDVGLDLTEEQDEAITRIPIKMSKRCQALMRQIICFSSEKGSFCDLLGA 121

Query: 583  WVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRL 762
            WV+ M PIRADWLS+LK++  L+     +V EF+LLE+SFEAN RDYTKII  YGK N++
Sbjct: 122  WVRRMNPIRADWLSILKELKNLDSPFYIKVAEFSLLEDSFEANARDYTKIIHYYGKLNQV 181

Query: 763  LDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSM 942
             DAE  L +MK RGF IDQ+TLT +VQ+YSKAG    A+ETF EIKL+G PLD RSYGSM
Sbjct: 182  EDAERTLLSMKNRGFLIDQVTLTAIVQLYSKAGYHKLAEETFNEIKLIGEPLDNRSYGSM 241

Query: 943  IMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGI 1122
            IMAYIRAG  + GE+LL EM++Q++CAG EVYKALLRAYS  GD  GA+RVFDA+Q AGI
Sbjct: 242  IMAYIRAGAPEKGEALLREMDSQEICAGREVYKALLRAYSMGGDAEGAKRVFDAVQIAGI 301

Query: 1123 APDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALD 1302
             PD +LC LL+NAY+V+GQS  AR+ FENMRKAG++ +DKCVA++L+AYEKE KLN AL 
Sbjct: 302  TPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIKATDKCVALVLAAYEKEEKLNEALG 361

Query: 1303 LLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKE 1434
             L++LE+D IMV +EAS VLA+WF++LGVV+EVE +LRE+++ +
Sbjct: 362  FLVELEKDSIMVGKEASAVLAQWFKKLGVVEEVELLLREFSSSQ 405


>ref|NP_171699.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75264110|sp|Q9LPC4.1|PPR1_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g01970 gi|8570448|gb|AAF76475.1|AC020622_9 Contains
            similarity to an unknown protein gi|AAD26479 from
            Arabidopsis thaliana BAC gb|AC007169 and contains
            multiple PPR PF|01535 repeats [Arabidopsis thaliana]
            gi|34098825|gb|AAQ56795.1| At1g01970 [Arabidopsis
            thaliana] gi|110735700|dbj|BAE99830.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332189240|gb|AEE27361.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 409

 Score =  427 bits (1099), Expect = e-117
 Identities = 209/344 (60%), Positives = 271/344 (78%)
 Frame = +1

Query: 403  EQEEERERFKWVEIGLDLTEAQKHSISLLPPKMSKRCKALMEQLICFDPQKTNLSRLLVS 582
            E  E+   F W ++GL+LTE Q  +I+ +P KMSKRC+ALM Q+ICF P+K +   LL +
Sbjct: 62   EDAEQSRSFNWADVGLNLTEEQDEAITRIPIKMSKRCQALMRQIICFSPEKGSFCDLLGA 121

Query: 583  WVKIMKPIRADWLSVLKQMSKLEHSLLFEVTEFALLEESFEANVRDYTKIIDGYGKQNRL 762
            W++ M PIRADWLS+LK++  L+     +V EF+LL++SFEAN RDYTKII  YGK N++
Sbjct: 122  WLRRMNPIRADWLSILKELKNLDSPFYIKVAEFSLLQDSFEANARDYTKIIHYYGKLNQV 181

Query: 763  LDAETALQAMKERGFTIDQITLTVLVQMYSKAGNLNRAKETFEEIKLLGLPLDKRSYGSM 942
             DAE  L +MK RGF IDQ+TLT +VQ+YSKAG    A+ETF EIKLLG PLD RSYGSM
Sbjct: 182  EDAERTLLSMKNRGFLIDQVTLTAMVQLYSKAGCHKLAEETFNEIKLLGEPLDYRSYGSM 241

Query: 943  IMAYIRAGMVDHGESLLGEMEAQDLCAGSEVYKALLRAYSSIGDTNGAQRVFDAIQFAGI 1122
            IMAYIRAG+ + GESLL EM++Q++CAG EVYKALLR YS  GD  GA+RVFDA+Q AGI
Sbjct: 242  IMAYIRAGVPEKGESLLREMDSQEICAGREVYKALLRDYSMGGDAEGAKRVFDAVQIAGI 301

Query: 1123 APDARLCALLMNAYAVAGQSDGARIVFENMRKAGLEPSDKCVAVMLSAYEKENKLNAALD 1302
             PD +LC LL+NAY+V+GQS  AR+ FENMRKAG++ +DKCVA++L+AYEKE KLN AL 
Sbjct: 302  TPDVKLCGLLINAYSVSGQSQNARLAFENMRKAGIKATDKCVALVLAAYEKEEKLNEALG 361

Query: 1303 LLMDLERDGIMVCQEASEVLARWFRRLGVVDEVEHVLREYATKE 1434
             L++LE+D IM+ +EAS VLA+WF++LGVV+EVE +LRE+++ +
Sbjct: 362  FLVELEKDSIMLGKEASAVLAQWFKKLGVVEEVELLLREFSSSQ 405


Top