BLASTX nr result

ID: Mentha23_contig00042772 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00042772
         (768 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU31068.1| hypothetical protein MIMGU_mgv1a024034mg [Mimulus...   393   e-107
gb|EPS58459.1| hypothetical protein M569_16353, partial [Genlise...   373   e-101
ref|XP_004169853.1| PREDICTED: pentatricopeptide repeat-containi...   368   1e-99
ref|XP_004143565.1| PREDICTED: pentatricopeptide repeat-containi...   368   1e-99
ref|XP_006354698.1| PREDICTED: pentatricopeptide repeat-containi...   366   4e-99
ref|XP_004237613.1| PREDICTED: pentatricopeptide repeat-containi...   365   9e-99
ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containi...   365   1e-98
ref|XP_002306340.2| pentatricopeptide repeat-containing family p...   364   2e-98
ref|XP_004504032.1| PREDICTED: pentatricopeptide repeat-containi...   364   2e-98
ref|XP_007159746.1| hypothetical protein PHAVU_002G263600g [Phas...   364   2e-98
ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containi...   362   8e-98
ref|XP_007212021.1| hypothetical protein PRUPE_ppa004899mg [Prun...   361   2e-97
ref|XP_002532046.1| pentatricopeptide repeat-containing protein,...   360   3e-97
ref|XP_006585305.1| PREDICTED: pentatricopeptide repeat-containi...   358   9e-97
gb|EXB80843.1| hypothetical protein L484_020101 [Morus notabilis]     358   2e-96
ref|XP_006428072.1| hypothetical protein CICLE_v10025440mg [Citr...   358   2e-96
ref|XP_007048085.1| Tetratricopeptide repeat (TPR)-like superfam...   355   1e-95
ref|XP_003630096.1| Pentatricopeptide repeat-containing protein ...   352   6e-95
ref|XP_004296690.1| PREDICTED: pentatricopeptide repeat-containi...   350   4e-94
ref|XP_006411755.1| hypothetical protein EUTSA_v10024997mg [Eutr...   347   3e-93

>gb|EYU31068.1| hypothetical protein MIMGU_mgv1a024034mg [Mimulus guttatus]
          Length = 496

 Score =  393 bits (1010), Expect = e-107
 Identities = 191/251 (76%), Positives = 220/251 (87%), Gaps = 1/251 (0%)
 Frame = -2

Query: 755 PISTIRMCG-ISALXXXXXXXXXXXXKIMPSPSSEAEELVRSIVRNISDKQPLLNTLNKH 579
           P STIRMC  +SA             +   SPSSEAE+LV+ I+RN SDKQPL+NTL+K+
Sbjct: 37  PPSTIRMCSWVSARPERNPGPRKTHKRSTSSPSSEAEDLVKLIMRNFSDKQPLVNTLDKY 96

Query: 578 VQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQKQRWYVADNGVYSKLISVMGKKGQSRM 399
           V+F+RT+HCFLLFEELG++E+WLQCLEVFRWMQKQRWY+ADNGVYSKLISVMGKKGQ+RM
Sbjct: 97  VKFVRTDHCFLLFEELGKTEKWLQCLEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRM 156

Query: 398 AMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYFEKMKGMERCKPSVVTY 219
           AMWLFSEMRNSGC+PDTSVYN+LITAHLHSRDK+KAL+K+L YFEKMK MERCKPS+VTY
Sbjct: 157 AMWLFSEMRNSGCRPDTSVYNSLITAHLHSRDKSKALAKSLGYFEKMKSMERCKPSIVTY 216

Query: 218 NILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKS 39
           NILLRAFAQ+KNV+QVN LF DL+ESI++PD FTFNGVMDAYGK G+IREME VLAKMKS
Sbjct: 217 NILLRAFAQAKNVDQVNTLFKDLEESIVTPDIFTFNGVMDAYGKNGMIREMEFVLAKMKS 276

Query: 38  KKIKPGIITFN 6
            +IKP  ITFN
Sbjct: 277 NQIKPDTITFN 287


>gb|EPS58459.1| hypothetical protein M569_16353, partial [Genlisea aurea]
          Length = 415

 Score =  373 bits (957), Expect = e-101
 Identities = 177/222 (79%), Positives = 204/222 (91%)
 Frame = -2

Query: 668 SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 489
           S SSEAE+LVRS++RN +D QPL +TLNK+V+ LRT HCFL+FEELG+S+RWLQCLEVFR
Sbjct: 4   SSSSEAEDLVRSVMRNFTDSQPLTSTLNKYVKLLRTAHCFLIFEELGKSDRWLQCLEVFR 63

Query: 488 WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 309
           WMQKQRWYVADNGVYSKLISVMGK+G++RMAMWLFSEMRNSGC+PDTSVYN+LI+AHLHS
Sbjct: 64  WMQKQRWYVADNGVYSKLISVMGKQGKTRMAMWLFSEMRNSGCRPDTSVYNSLISAHLHS 123

Query: 308 RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 129
           RDK KAL K L YFEKMKG+ERC+P+VVTYNILLRAFAQ+KN+EQVN LF +LD SIISP
Sbjct: 124 RDKTKALDKVLWYFEKMKGIERCQPNVVTYNILLRAFAQAKNIEQVNALFKELDGSIISP 183

Query: 128 DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           D  T+NGVMDAYGK G+IREMELVL+KMKS +IKP +ITFNL
Sbjct: 184 DVLTYNGVMDAYGKNGMIREMELVLSKMKSAQIKPDVITFNL 225


>ref|XP_004169853.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Cucumis sativus]
          Length = 494

 Score =  368 bits (944), Expect = 1e-99
 Identities = 173/219 (78%), Positives = 200/219 (91%)
 Frame = -2

Query: 659 SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQ 480
           SEAEELVR I+RN SDK+PLL TL+K+V+ +RTEHCFLLFEELG+ ++WL+CLEVFRWMQ
Sbjct: 59  SEAEELVRGIIRNFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLECLEVFRWMQ 118

Query: 479 KQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDK 300
           KQRWY+ADNGVYSKLIS+MGKKGQ RMAMWLFSEMRNSGC+PDTSVYNALITAHLHS+DK
Sbjct: 119 KQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKDK 178

Query: 299 AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAF 120
           AKAL K LSYFEKMKGMERCKP++VTYNIL RAFAQ+  V+QVN LF DLDES++S D +
Sbjct: 179 AKALVKVLSYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADIY 238

Query: 119 TFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           T+NGVMDAYGK G I+EMEL+LA+MKS +IKP II+FNL
Sbjct: 239 TYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNL 277


>ref|XP_004143565.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Cucumis sativus]
          Length = 528

 Score =  368 bits (944), Expect = 1e-99
 Identities = 173/219 (78%), Positives = 200/219 (91%)
 Frame = -2

Query: 659 SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQ 480
           SEAEELVR I+RN SDK+PLL TL+K+V+ +RTEHCFLLFEELG+ ++WL+CLEVFRWMQ
Sbjct: 59  SEAEELVRGIIRNFSDKEPLLKTLDKYVRVMRTEHCFLLFEELGKRDKWLECLEVFRWMQ 118

Query: 479 KQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDK 300
           KQRWY+ADNGVYSKLIS+MGKKGQ RMAMWLFSEMRNSGC+PDTSVYNALITAHLHS+DK
Sbjct: 119 KQRWYIADNGVYSKLISIMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKDK 178

Query: 299 AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAF 120
           AKAL K LSYFEKMKGMERCKP++VTYNIL RAFAQ+  V+QVN LF DLDES++S D +
Sbjct: 179 AKALVKVLSYFEKMKGMERCKPNIVTYNILTRAFAQAAKVDQVNTLFKDLDESVVSADIY 238

Query: 119 TFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           T+NGVMDAYGK G I+EMEL+LA+MKS +IKP II+FNL
Sbjct: 239 TYNGVMDAYGKNGNIKEMELMLARMKSNQIKPDIISFNL 277


>ref|XP_006354698.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like isoform X1 [Solanum tuberosum]
           gi|565376411|ref|XP_006354699.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like isoform X2 [Solanum tuberosum]
           gi|565376413|ref|XP_006354700.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 457

 Score =  366 bits (940), Expect = 4e-99
 Identities = 170/222 (76%), Positives = 204/222 (91%)
 Frame = -2

Query: 668 SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 489
           S SSEA+ELV  ++RN SDK+PL++TL+K+V+ +RTEHCFLLFE+LG+++ WLQCLEVFR
Sbjct: 32  SSSSEAQELVTLVMRNFSDKKPLVSTLDKYVKLVRTEHCFLLFEQLGKTDNWLQCLEVFR 91

Query: 488 WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 309
           WMQKQRWY+ADNGVYSKLISVMGKKGQ RMAMWLFSEMRNSGC+PDTSVYNA+I+AHLHS
Sbjct: 92  WMQKQRWYIADNGVYSKLISVMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNAVISAHLHS 151

Query: 308 RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 129
           RDK+KAL+KA+ YFEKMKGMERC PS+VTYNILLRAFAQ+KNVEQV+ L  DLDESI++P
Sbjct: 152 RDKSKALTKAMGYFEKMKGMERCSPSIVTYNILLRAFAQAKNVEQVDALLKDLDESIVTP 211

Query: 128 DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           D FTFNG+MDAYGK G+I EME +L++MKS ++KP IITFN+
Sbjct: 212 DIFTFNGLMDAYGKNGMINEMEHILSRMKSNQLKPDIITFNI 253


>ref|XP_004237613.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Solanum lycopersicum]
          Length = 478

 Score =  365 bits (937), Expect = 9e-99
 Identities = 171/222 (77%), Positives = 203/222 (91%)
 Frame = -2

Query: 668 SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 489
           S SSEA+ELV  ++RN SDK+PL++TL+K+V+ +RTEHCFLLFE+LG+++ WLQCLEVFR
Sbjct: 53  SSSSEAQELVTLVMRNFSDKKPLVSTLDKYVKLVRTEHCFLLFEQLGKTDNWLQCLEVFR 112

Query: 488 WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 309
           WMQKQRWY+ADNGVYSKLISVMGKKGQ RMAMWLFSEMRNSGC+PDTSVYNA+I+AHLHS
Sbjct: 113 WMQKQRWYIADNGVYSKLISVMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNAVISAHLHS 172

Query: 308 RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 129
           RDK+KAL+KA+ YFEKMK MERC PS+VTYNILLRAFAQ+KNVEQV+ L  DLDESI++P
Sbjct: 173 RDKSKALTKAMGYFEKMKEMERCSPSIVTYNILLRAFAQAKNVEQVDALLKDLDESIVTP 232

Query: 128 DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           D FTFNG+MDAYGK G+I EME VL++MKS K+KP IITFN+
Sbjct: 233 DIFTFNGLMDAYGKNGMINEMEHVLSRMKSNKLKPDIITFNI 274


>ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Glycine max]
          Length = 503

 Score =  365 bits (936), Expect = 1e-98
 Identities = 173/221 (78%), Positives = 201/221 (90%), Gaps = 1/221 (0%)
 Frame = -2

Query: 662 SSEAEELVRSIVRNIS-DKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRW 486
           +SEA+ELVR +   IS DK+PLL TLNK+V+ +RT+HCFLLFEEL + + WLQCLEVFRW
Sbjct: 39  NSEAQELVRLLTSKISNDKEPLLKTLNKYVKQVRTQHCFLLFEELAKHDNWLQCLEVFRW 98

Query: 485 MQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSR 306
           MQKQRWY+ADNG+YSKLISVMGKKGQ+RMAMWLFSEMRN+GC+PDTSVYNALITAHLHSR
Sbjct: 99  MQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSEMRNTGCRPDTSVYNALITAHLHSR 158

Query: 305 DKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPD 126
           DK KAL+KA+ YF+KMKGMERCKP++VTYNILLRAFAQ++NVEQVN LF DLDESI+SPD
Sbjct: 159 DKTKALAKAIGYFQKMKGMERCKPNIVTYNILLRAFAQARNVEQVNSLFKDLDESIVSPD 218

Query: 125 AFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
            +TFNGVMDAYGK G+IREME VLA+MKS + KP +ITFNL
Sbjct: 219 IYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNL 259


>ref|XP_002306340.2| pentatricopeptide repeat-containing family protein, partial
           [Populus trichocarpa] gi|550338395|gb|EEE93336.2|
           pentatricopeptide repeat-containing family protein,
           partial [Populus trichocarpa]
          Length = 414

 Score =  364 bits (935), Expect = 2e-98
 Identities = 170/219 (77%), Positives = 203/219 (92%)
 Frame = -2

Query: 659 SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQ 480
           SEA+ELVR +VR+ SDKQPL+ TLNK+V+ +RTEHCF+LFEELG++++WLQCLEVFRWMQ
Sbjct: 26  SEAQELVRVLVRSFSDKQPLVKTLNKYVKVMRTEHCFMLFEELGKTDKWLQCLEVFRWMQ 85

Query: 479 KQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDK 300
           KQRWYVADNG YSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALITAHLHS+DK
Sbjct: 86  KQRWYVADNGCYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSKDK 145

Query: 299 AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAF 120
           AK+L+KAL+YFEKMK +ERC+P+VVTYNI+LRAFAQ++NV QVN LF DL+ESI+SPD +
Sbjct: 146 AKSLTKALAYFEKMKSIERCQPNVVTYNIILRAFAQARNVNQVNALFKDLEESIVSPDIY 205

Query: 119 TFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           T+NGV+DAYGK G+IREME VL++MK  + KP IITFNL
Sbjct: 206 TYNGVLDAYGKNGMIREMESVLSRMKIDQCKPDIITFNL 244


>ref|XP_004504032.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like isoform X1 [Cicer arietinum]
           gi|502140047|ref|XP_004504033.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like isoform X2 [Cicer arietinum]
          Length = 477

 Score =  364 bits (935), Expect = 2e-98
 Identities = 173/255 (67%), Positives = 215/255 (84%)
 Frame = -2

Query: 767 YRSYPISTIRMCGISALXXXXXXXXXXXXKIMPSPSSEAEELVRSIVRNISDKQPLLNTL 588
           Y S+P S++ +  +  +            KI  S  SE +ELVR + R IS+K+PL+ TL
Sbjct: 15  YYSFPTSSVNLPRVIRISCGSNPTRLNRKKIT-SERSETQELVRLLTRKISEKEPLVTTL 73

Query: 587 NKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQKQRWYVADNGVYSKLISVMGKKGQ 408
           NK+V+ +RTEHCFLLFEELG+ ++WLQCLEVFRWMQ+QRWY+ADNGVYSKLISVMGKKGQ
Sbjct: 74  NKYVKLVRTEHCFLLFEELGKHDKWLQCLEVFRWMQRQRWYIADNGVYSKLISVMGKKGQ 133

Query: 407 SRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALSKALSYFEKMKGMERCKPSV 228
            R+AMWLFSEMRN+GC+PDTSVYNALI+AHLH+R+K+ AL+KAL YFEKMKG+ERCKP++
Sbjct: 134 IRLAMWLFSEMRNTGCRPDTSVYNALISAHLHTRNKSNALAKALGYFEKMKGIERCKPNI 193

Query: 227 VTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAK 48
           VTYNILLRAFAQS+NV+QVN LF DLD+S++SPD +TFNGVMDAYGK G+IREME VLA+
Sbjct: 194 VTYNILLRAFAQSRNVDQVNSLFKDLDDSVVSPDIYTFNGVMDAYGKNGMIREMETVLAR 253

Query: 47  MKSKKIKPGIITFNL 3
           MKS ++KP +IT+NL
Sbjct: 254 MKSNQVKPDLITYNL 268


>ref|XP_007159746.1| hypothetical protein PHAVU_002G263600g [Phaseolus vulgaris]
           gi|561033161|gb|ESW31740.1| hypothetical protein
           PHAVU_002G263600g [Phaseolus vulgaris]
          Length = 534

 Score =  364 bits (934), Expect = 2e-98
 Identities = 170/223 (76%), Positives = 201/223 (90%)
 Frame = -2

Query: 671 PSPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVF 492
           P+ ++EA ELVR + R ISDK+PLL TLNK V+ +RTEHCFLLFEELG+   WLQC+EVF
Sbjct: 40  PNHNAEARELVRLLTRKISDKEPLLKTLNKFVKQVRTEHCFLLFEELGKEGNWLQCIEVF 99

Query: 491 RWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLH 312
           RWMQKQRWY+ADNG+YSKLISVMGK+GQ+RMAMWLFSEMRN+GC+PDTSVYNALITAHLH
Sbjct: 100 RWMQKQRWYIADNGIYSKLISVMGKRGQTRMAMWLFSEMRNAGCRPDTSVYNALITAHLH 159

Query: 311 SRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIIS 132
           SRDK KALSKA+ YF+KMKG+ERCKP++VTYNILLRAFAQ++N+EQV+ LF DLDES IS
Sbjct: 160 SRDKTKALSKAIGYFQKMKGIERCKPNIVTYNILLRAFAQARNLEQVSSLFKDLDESSIS 219

Query: 131 PDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           PD +TFNGVMDAYGK G+IREME +LA+M+S + KP +ITFNL
Sbjct: 220 PDIYTFNGVMDAYGKNGMIREMEAILAQMRSSQYKPDLITFNL 262


>ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic [Vitis vinifera]
           gi|296082481|emb|CBI21486.3| unnamed protein product
           [Vitis vinifera]
          Length = 489

 Score =  362 bits (929), Expect = 8e-98
 Identities = 170/224 (75%), Positives = 202/224 (90%), Gaps = 1/224 (0%)
 Frame = -2

Query: 671 PSPS-SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEV 495
           P P  SE EELVR +++N   ++PL++TLNK+V+ +RTEHCF LFEELG++++WLQCLEV
Sbjct: 61  PKPDKSEVEELVRVLMKNFGGERPLISTLNKYVKVIRTEHCFRLFEELGKTDKWLQCLEV 120

Query: 494 FRWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHL 315
           FRWMQKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALITAHL
Sbjct: 121 FRWMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHL 180

Query: 314 HSRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESII 135
           HSRDK+KAL KAL YF+KMKGMERCKP++VTYNILLRAFAQ++NV Q N LF +L+ESI+
Sbjct: 181 HSRDKSKALIKALGYFDKMKGMERCKPNIVTYNILLRAFAQAQNVNQANALFKELNESIV 240

Query: 134 SPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           SPD FTFNGVMDAYGK G+I+EME VL++MKS + KP IITFN+
Sbjct: 241 SPDIFTFNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNV 284


>ref|XP_007212021.1| hypothetical protein PRUPE_ppa004899mg [Prunus persica]
           gi|462407886|gb|EMJ13220.1| hypothetical protein
           PRUPE_ppa004899mg [Prunus persica]
          Length = 486

 Score =  361 bits (926), Expect = 2e-97
 Identities = 172/218 (78%), Positives = 198/218 (90%)
 Frame = -2

Query: 656 EAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQK 477
           +  E+VR ++R+ SDK+PLL TLNK+V+ +RTEHCFLLFEELG+S+ WLQCLEVFRWMQK
Sbjct: 64  DVREVVRMLMRSFSDKEPLLKTLNKYVRIVRTEHCFLLFEELGKSDEWLQCLEVFRWMQK 123

Query: 476 QRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKA 297
           QRWYVADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALI+AHL+S+DKA
Sbjct: 124 QRWYVADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALISAHLNSKDKA 183

Query: 296 KALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFT 117
           KAL KAL YF+KMKGMERC+P++VTYNILLRAFAQS+NVE+VN LF DLDESI SPD +T
Sbjct: 184 KALDKALRYFDKMKGMERCQPNIVTYNILLRAFAQSRNVEKVNSLFKDLDESIASPDIYT 243

Query: 116 FNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           +NGVMDAYGK G IREME VL+ MKS + KP IITFNL
Sbjct: 244 YNGVMDAYGKNGNIREMESVLSHMKSNQCKPDIITFNL 281


>ref|XP_002532046.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223528289|gb|EEF30336.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 478

 Score =  360 bits (924), Expect = 3e-97
 Identities = 172/223 (77%), Positives = 202/223 (90%), Gaps = 1/223 (0%)
 Frame = -2

Query: 668 SPSSEAEELVRSIVRNIS-DKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVF 492
           S  SE E+LVR ++R+ S DK PL+ TL+K+V+ +RTEHCFLLFEELGR ++WLQCLEVF
Sbjct: 60  SEESETEDLVRYVLRSFSSDKVPLVRTLDKYVRVVRTEHCFLLFEELGRRDKWLQCLEVF 119

Query: 491 RWMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLH 312
           RWMQKQRWY+AD+GVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PD+SVYNALITAHLH
Sbjct: 120 RWMQKQRWYIADSGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDSSVYNALITAHLH 179

Query: 311 SRDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIIS 132
           S+DKAKAL KAL YFEKMKGM+RC+P+VVTYNILLRAFAQ++NV QVN LF DLD+SI+S
Sbjct: 180 SKDKAKALIKALGYFEKMKGMQRCQPNVVTYNILLRAFAQARNVNQVNALFKDLDQSIVS 239

Query: 131 PDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           PD +T+NGVMDAYGK G+IREME VL++MKS + KP IITFNL
Sbjct: 240 PDIYTYNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIITFNL 282



 Score = 57.4 bits (137), Expect = 6e-06
 Identities = 44/179 (24%), Positives = 76/179 (42%), Gaps = 31/179 (17%)
 Frame = -2

Query: 446 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 315
           Y+ ++   GK G  R    + S M+++ CKPD   +N LI ++                L
Sbjct: 245 YNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIITFNLLIDSYGKKQDFDKMEQVFKSLL 304

Query: 314 HSRD---------------KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 180
           HS++               KA+    A S  +KM  M +  P+ +TY  L+  +    +V
Sbjct: 305 HSKERPTLPTFNSMITNYGKARQKENAESVLQKMTKM-KYTPNFITYESLIMMYGFCDSV 363

Query: 179 EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
            +   +F+D+ ES       T N ++D Y   GL  E +L+    ++  + P   T+ L
Sbjct: 364 SKAREIFDDMIESGKEVKVSTLNAMLDVYCLNGLPMEADLLFDNARNVGLLPDSTTYKL 422


>ref|XP_006585305.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Glycine max]
          Length = 526

 Score =  358 bits (920), Expect = 9e-97
 Identities = 171/222 (77%), Positives = 200/222 (90%), Gaps = 2/222 (0%)
 Frame = -2

Query: 662 SSEAEELVRSIVRNI--SDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 489
           +SEA+ELVR +   I  +DK+ LL TLNK+V+ +RT+HCFLLFEELG+ + WLQCLEVFR
Sbjct: 47  NSEAQELVRLLTSKIRSNDKEVLLKTLNKYVKQVRTQHCFLLFEELGKHDNWLQCLEVFR 106

Query: 488 WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 309
           WMQKQRWY+ADNG+YSKLISVMGKKGQ+RMAMWLFSEMRN+GC+PDTSVYNALITAHL S
Sbjct: 107 WMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSEMRNTGCRPDTSVYNALITAHLRS 166

Query: 308 RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 129
           RDK KAL+KA+ YF+KMKGMERCKP++VTYNILLRAFAQ++NVEQVN LF DLDESI+SP
Sbjct: 167 RDKIKALAKAIGYFQKMKGMERCKPNIVTYNILLRAFAQARNVEQVNSLFKDLDESIVSP 226

Query: 128 DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           D +TFNGVMDAYGK G+IREME VLA+MKS + KP +ITFNL
Sbjct: 227 DIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNL 268


>gb|EXB80843.1| hypothetical protein L484_020101 [Morus notabilis]
          Length = 485

 Score =  358 bits (918), Expect = 2e-96
 Identities = 172/220 (78%), Positives = 199/220 (90%), Gaps = 1/220 (0%)
 Frame = -2

Query: 659 SEAEELVRSIVRNI-SDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWM 483
           SEA +LVR ++R+  SDK+PL+ TLNK+V+ +RTEHCFLLFEELGRS++WLQCLEVFRWM
Sbjct: 61  SEALDLVRLLMRSFNSDKEPLVKTLNKYVKTVRTEHCFLLFEELGRSDKWLQCLEVFRWM 120

Query: 482 QKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRD 303
           QKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNS C+PDTSVYNALITAHLHS D
Sbjct: 121 QKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSSCRPDTSVYNALITAHLHSSD 180

Query: 302 KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDA 123
           K KAL KA+ YFEKMKG+ERCKP++VTYNILLRAFAQ++NV++VN LF DLD SI+SPD 
Sbjct: 181 KVKALDKAIGYFEKMKGIERCKPNIVTYNILLRAFAQARNVQRVNSLFKDLDGSIVSPDI 240

Query: 122 FTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           +T+NGVMDAYGK G+IREME VL+ MKS  IKP IITFNL
Sbjct: 241 YTYNGVMDAYGKNGMIREMESVLSLMKSNHIKPDIITFNL 280


>ref|XP_006428072.1| hypothetical protein CICLE_v10025440mg [Citrus clementina]
           gi|568819570|ref|XP_006464322.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Citrus sinensis]
           gi|557530062|gb|ESR41312.1| hypothetical protein
           CICLE_v10025440mg [Citrus clementina]
          Length = 500

 Score =  358 bits (918), Expect = 2e-96
 Identities = 165/222 (74%), Positives = 203/222 (91%)
 Frame = -2

Query: 668 SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 489
           S   E++ELVR ++R+ SDK+PL+ TLNK+V+ +R+EHCFLLFEELG+S++WLQCLEVFR
Sbjct: 74  SEELESKELVRVLMRSFSDKEPLVRTLNKYVKVVRSEHCFLLFEELGKSDKWLQCLEVFR 133

Query: 488 WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 309
           WMQKQRWY+AD G+YSKLI+VMGKKGQ+R+AMWLFSEMRNSGC+PD SVYNALITAHLH+
Sbjct: 134 WMQKQRWYIADTGIYSKLIAVMGKKGQTRLAMWLFSEMRNSGCRPDPSVYNALITAHLHT 193

Query: 308 RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 129
           RDKAKAL+KAL YF+KMKGMERCKP++VTYNILLRA AQ++NV+QVN LF +LDESI++P
Sbjct: 194 RDKAKALAKALGYFQKMKGMERCKPNIVTYNILLRACAQARNVDQVNALFKELDESILAP 253

Query: 128 DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           D +T+NGVMDAYGK G+I+EME VL++MKS + KP IITFNL
Sbjct: 254 DIYTYNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNL 295


>ref|XP_007048085.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
           [Theobroma cacao] gi|590707730|ref|XP_007048086.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein
           isoform 1 [Theobroma cacao] gi|508700346|gb|EOX92242.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein
           isoform 1 [Theobroma cacao] gi|508700347|gb|EOX92243.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein
           isoform 1 [Theobroma cacao]
          Length = 488

 Score =  355 bits (911), Expect = 1e-95
 Identities = 164/218 (75%), Positives = 198/218 (90%)
 Frame = -2

Query: 656 EAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQK 477
           EA ELVR ++R+ SDK+PL+ TLN++V+ +R EHCFLLFEELG++++WLQCLEVFRWMQK
Sbjct: 61  EALELVRVLMRSFSDKEPLVKTLNRYVRVVRCEHCFLLFEELGKTDKWLQCLEVFRWMQK 120

Query: 476 QRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKA 297
           QRWY+ADNG+YSKLI+VMGKKGQ+RMAMWLFSEMRNSGC+PD SVYNALITAHLHSRDK+
Sbjct: 121 QRWYIADNGIYSKLITVMGKKGQTRMAMWLFSEMRNSGCRPDVSVYNALITAHLHSRDKS 180

Query: 296 KALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFT 117
           KAL KA+ YF KMKGMERCKP++VTYNILLRAF+Q++NV+QVN LF DL ESII+PD +T
Sbjct: 181 KALDKAMGYFNKMKGMERCKPNIVTYNILLRAFSQARNVDQVNALFKDLAESIIAPDIYT 240

Query: 116 FNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           +NGVMDAYGK G+IREME VL++MKS + KP  ITFN+
Sbjct: 241 YNGVMDAYGKNGMIREMESVLSRMKSNQCKPDTITFNV 278


>ref|XP_003630096.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355524118|gb|AET04572.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 635

 Score =  352 bits (904), Expect = 6e-95
 Identities = 165/219 (75%), Positives = 195/219 (89%)
 Frame = -2

Query: 659 SEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQ 480
           SE +ELVR + R ISDK+PLL TLNK+V+ +RTEHCFLLFEELG+ ++WLQCLEVFRWMQ
Sbjct: 49  SETQELVRLLTRKISDKEPLLKTLNKYVKLVRTEHCFLLFEELGKHDKWLQCLEVFRWMQ 108

Query: 479 KQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDK 300
           +QRWY+ADNGVYSKLISVMGKKGQ R+AMWLFSEMRN+GC+PDTSVYN+LI+AHLHSRDK
Sbjct: 109 RQRWYIADNGVYSKLISVMGKKGQIRLAMWLFSEMRNTGCRPDTSVYNSLISAHLHSRDK 168

Query: 299 AKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAF 120
           +KAL KAL YFEKMK  ERCKP++VTYNILLRAFAQ+++V QVN LF DLDES +SPD +
Sbjct: 169 SKALVKALGYFEKMKTTERCKPNIVTYNILLRAFAQARDVNQVNYLFKDLDESSVSPDIY 228

Query: 119 TFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           TFNGVMD YGK G+IREME VL +MKS ++K  +IT+NL
Sbjct: 229 TFNGVMDGYGKNGMIREMESVLVRMKSNQVKLDLITYNL 267


>ref|XP_004296690.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 420

 Score =  350 bits (897), Expect = 4e-94
 Identities = 163/214 (76%), Positives = 194/214 (90%)
 Frame = -2

Query: 644 LVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFRWMQKQRWY 465
           +VR ++R+ SDK+PL+ TLNK+V+ +RTEHCFLLFEELG+S +WLQCLEVFRWMQKQRWY
Sbjct: 1   MVRMLIRSFSDKEPLVKTLNKYVKIVRTEHCFLLFEELGKSGKWLQCLEVFRWMQKQRWY 60

Query: 464 VADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHSRDKAKALS 285
           VADNGVYSKLISVMGKKGQ+RMAMWLFSEMRNSGC+PDTSVYNALI+AHL+S+DK KAL 
Sbjct: 61  VADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALISAHLNSKDKGKALE 120

Query: 284 KALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISPDAFTFNGV 105
           K L YF KMKGMERC+P++VTYNILLRA+AQ++NV++VN LF DLDESI  PD +T+NGV
Sbjct: 121 KGLVYFNKMKGMERCQPNIVTYNILLRAYAQARNVDKVNSLFKDLDESIACPDIYTYNGV 180

Query: 104 MDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           MDAYGK G+IR+ME VL++MKS + KP IITFNL
Sbjct: 181 MDAYGKNGMIRDMESVLSRMKSNQCKPDIITFNL 214



 Score = 62.0 bits (149), Expect = 2e-07
 Identities = 46/179 (25%), Positives = 79/179 (44%), Gaps = 31/179 (17%)
 Frame = -2

Query: 446 YSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAH----------------L 315
           Y+ ++   GK G  R    + S M+++ CKPD   +N LI ++                L
Sbjct: 177 YNGVMDAYGKNGMIRDMESVLSRMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLL 236

Query: 314 HSRD---------------KAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNV 180
           HS++               KA+   +A S F++M  M +  PS +TY  L+  +    +V
Sbjct: 237 HSKERPTLPTFNSMIINYGKARLKEQAESVFKRMIDM-KYSPSFITYESLMMMYGYCDSV 295

Query: 179 EQVNVLFNDLDESIISPDAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
            +   +F+ + ES       T N ++D Y + GL  E + +L    S  I+P + T+ L
Sbjct: 296 SKAREIFDGVAESGQEMKVSTLNVMLDVYCRNGLPMEADKLLLSANSIGIRPNVCTYKL 354


>ref|XP_006411755.1| hypothetical protein EUTSA_v10024997mg [Eutrema salsugineum]
           gi|557112925|gb|ESQ53208.1| hypothetical protein
           EUTSA_v10024997mg [Eutrema salsugineum]
          Length = 496

 Score =  347 bits (890), Expect = 3e-93
 Identities = 164/222 (73%), Positives = 193/222 (86%)
 Frame = -2

Query: 668 SPSSEAEELVRSIVRNISDKQPLLNTLNKHVQFLRTEHCFLLFEELGRSERWLQCLEVFR 489
           S   E   LVRS++  ISD++PL+ TL+K+V+ +R EHCFLLFEELG+S++WLQCLEVFR
Sbjct: 64  SAERENRVLVRSLMSRISDREPLVKTLDKYVKVVRCEHCFLLFEELGKSDKWLQCLEVFR 123

Query: 488 WMQKQRWYVADNGVYSKLISVMGKKGQSRMAMWLFSEMRNSGCKPDTSVYNALITAHLHS 309
           WMQKQRWY+ADNGVYSKLISVMGKKGQ+RMAMWLFSEM+NSGC+PD SVYNALITAHLH+
Sbjct: 124 WMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHT 183

Query: 308 RDKAKALSKALSYFEKMKGMERCKPSVVTYNILLRAFAQSKNVEQVNVLFNDLDESIISP 129
           RDKAKAL K   YF+KMKGMERC+P+VVTYNILLRAFAQS  V+QVN LF +LD S +SP
Sbjct: 184 RDKAKALEKVRGYFDKMKGMERCQPNVVTYNILLRAFAQSGKVDQVNALFKELDISAVSP 243

Query: 128 DAFTFNGVMDAYGKVGLIREMELVLAKMKSKKIKPGIITFNL 3
           D +TFNGVMDAYGK G+I+EME VL +M+S + KP IITFNL
Sbjct: 244 DVYTFNGVMDAYGKNGMIKEMESVLTRMRSNECKPDIITFNL 285


Top