BLASTX nr result

ID: Perilla23_contig00024941 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00024941
         (964 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011094657.1| PREDICTED: pentatricopeptide repeat-containi...   408   e-111
ref|XP_012840925.1| PREDICTED: pentatricopeptide repeat-containi...   397   e-108
gb|EYU34455.1| hypothetical protein MIMGU_mgv1a002067mg [Erythra...   397   e-108
emb|CDO97701.1| unnamed protein product [Coffea canephora]            351   4e-94
ref|XP_012481297.1| PREDICTED: pentatricopeptide repeat-containi...   341   4e-91
emb|CBI32743.3| unnamed protein product [Vitis vinifera]              338   5e-90
ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containi...   338   5e-90
ref|XP_009630943.1| PREDICTED: pentatricopeptide repeat-containi...   337   6e-90
ref|XP_004299746.2| PREDICTED: pentatricopeptide repeat-containi...   337   1e-89
ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfam...   334   5e-89
ref|XP_010111755.1| hypothetical protein L484_008414 [Morus nota...   333   2e-88
ref|XP_012082370.1| PREDICTED: pentatricopeptide repeat-containi...   331   5e-88
ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containi...   331   6e-88
ref|XP_009782844.1| PREDICTED: pentatricopeptide repeat-containi...   330   8e-88
gb|ACU25637.1| pentatricopeptide repeat-containing protein [Cith...   330   1e-87
ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containi...   330   1e-87
ref|NP_181260.1| pentatricopeptide repeat-containing protein [Ar...   328   5e-87
gb|ACU25641.1| pentatricopeptide repeat-containing protein [Bouc...   327   7e-87
gb|ACU25638.1| pentatricopeptide repeat-containing protein [Cith...   327   7e-87
ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citr...   327   9e-87

>ref|XP_011094657.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Sesamum indicum]
          Length = 756

 Score =  408 bits (1049), Expect = e-111
 Identities = 212/292 (72%), Positives = 230/292 (78%)
 Frame = -3

Query: 878 MALLAASKQTHFNTNLARVXXXXXXXXXXXXXSAPSQPPNPITNEEVSIPADPNPDYSSA 699
           MA LAA +QTH N NL  +             SAPS+P +   NEE S+ A+ N D SS 
Sbjct: 1   MASLAARRQTHLNANLTNLSSPLSIKSLSFCSSAPSKPSS---NEEPSVGANQNADSSST 57

Query: 698 KTTGDEAPLPADANPDYSSAKLTADATPLREGRRRRNPEKVEDTICRMMDSRPWTTRLQN 519
           KTT                   +A AT   EG+R++NPEK+ED ICRMM +R WTTRLQN
Sbjct: 58  KTTA-----------------ASASATFRSEGKRQKNPEKIEDIICRMMANRAWTTRLQN 100

Query: 518 SIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASKL 339
           SIR LV SFDHELVYNVLHGAK S HALQFFRWVERSNLFQHNRETHLKIIEILGRASKL
Sbjct: 101 SIRNLVPSFDHELVYNVLHGAKKSEHALQFFRWVERSNLFQHNRETHLKIIEILGRASKL 160

Query: 338 NHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDVL 159
           NHARCILLDMPKKGL+WDEDLWV+MIDSYGKAGIVQESVKLF+ MEELGV+R+IKSYD L
Sbjct: 161 NHARCILLDMPKKGLEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERSIKSYDAL 220

Query: 158 FKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           FKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNI+IWGFFLSGKVET NRFF
Sbjct: 221 FKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNIMIWGFFLSGKVETANRFF 272


>ref|XP_012840925.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Erythranthe guttatus]
          Length = 763

 Score =  397 bits (1019), Expect = e-108
 Identities = 205/294 (69%), Positives = 227/294 (77%), Gaps = 2/294 (0%)
 Frame = -3

Query: 878 MALLAASKQTHFNTNLARVXXXXXXXXXXXXXSAPSQPPNPITN--EEVSIPADPNPDYS 705
           MA LAASKQ HFN+N+ ++             +APS PPNP  N  EE+ +   P  D S
Sbjct: 1   MAFLAASKQPHFNSNITKLSSPFSIKSLLFCSAAPSPPPNPNPNPNEELPVSEIPIADSS 60

Query: 704 SAKTTGDEAPLPADANPDYSSAKLTADATPLREGRRRRNPEKVEDTICRMMDSRPWTTRL 525
           SA  T  E P P                T  R+ RR +NPEK+ED ICRMM +R WTTRL
Sbjct: 61  SANATAAEPPSPP---------------TFRRQLRRPKNPEKIEDIICRMMANRAWTTRL 105

Query: 524 QNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRAS 345
           QNSIR+LV +FDHELVYNVLH ++NS HALQFFRWVERS+LFQHNRETH KIIEILGRAS
Sbjct: 106 QNSIRKLVPAFDHELVYNVLHASRNSEHALQFFRWVERSSLFQHNRETHHKIIEILGRAS 165

Query: 344 KLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYD 165
           KLNHARCILLDMPKKGL+WDEDLWV+MIDSYGKAGIVQESVKLF+ MEELGV+R IKSY+
Sbjct: 166 KLNHARCILLDMPKKGLEWDEDLWVMMIDSYGKAGIVQESVKLFQKMEELGVERGIKSYN 225

Query: 164 VLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
            LFKVI RRGRYMMAKRYFNKMLSEGIEP RHTFN+LIWGFFLSGKVET NRFF
Sbjct: 226 TLFKVISRRGRYMMAKRYFNKMLSEGIEPNRHTFNLLIWGFFLSGKVETANRFF 279


>gb|EYU34455.1| hypothetical protein MIMGU_mgv1a002067mg [Erythranthe guttata]
          Length = 719

 Score =  397 bits (1019), Expect = e-108
 Identities = 205/294 (69%), Positives = 227/294 (77%), Gaps = 2/294 (0%)
 Frame = -3

Query: 878 MALLAASKQTHFNTNLARVXXXXXXXXXXXXXSAPSQPPNPITN--EEVSIPADPNPDYS 705
           MA LAASKQ HFN+N+ ++             +APS PPNP  N  EE+ +   P  D S
Sbjct: 1   MAFLAASKQPHFNSNITKLSSPFSIKSLLFCSAAPSPPPNPNPNPNEELPVSEIPIADSS 60

Query: 704 SAKTTGDEAPLPADANPDYSSAKLTADATPLREGRRRRNPEKVEDTICRMMDSRPWTTRL 525
           SA  T  E P P                T  R+ RR +NPEK+ED ICRMM +R WTTRL
Sbjct: 61  SANATAAEPPSPP---------------TFRRQLRRPKNPEKIEDIICRMMANRAWTTRL 105

Query: 524 QNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRAS 345
           QNSIR+LV +FDHELVYNVLH ++NS HALQFFRWVERS+LFQHNRETH KIIEILGRAS
Sbjct: 106 QNSIRKLVPAFDHELVYNVLHASRNSEHALQFFRWVERSSLFQHNRETHHKIIEILGRAS 165

Query: 344 KLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYD 165
           KLNHARCILLDMPKKGL+WDEDLWV+MIDSYGKAGIVQESVKLF+ MEELGV+R IKSY+
Sbjct: 166 KLNHARCILLDMPKKGLEWDEDLWVMMIDSYGKAGIVQESVKLFQKMEELGVERGIKSYN 225

Query: 164 VLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
            LFKVI RRGRYMMAKRYFNKMLSEGIEP RHTFN+LIWGFFLSGKVET NRFF
Sbjct: 226 TLFKVISRRGRYMMAKRYFNKMLSEGIEPNRHTFNLLIWGFFLSGKVETANRFF 279


>emb|CDO97701.1| unnamed protein product [Coffea canephora]
          Length = 753

 Score =  351 bits (901), Expect = 4e-94
 Identities = 185/294 (62%), Positives = 218/294 (74%), Gaps = 2/294 (0%)
 Frame = -3

Query: 878 MALLAASKQTHFNT-NLARVXXXXXXXXXXXXXSAPSQPPNPITNEEVSIPADPNPDYSS 702
           MA L+ASK +HF+  NL+ V             S+P                  NPD  +
Sbjct: 1   MAYLSASKPSHFHPRNLSNVSTPLSLKSLFFFCSSPGG----------------NPDQET 44

Query: 701 AKTTGDEAPLPADANPDYSSAKLTADATPLREGRR-RRNPEKVEDTICRMMDSRPWTTRL 525
           A  +  E+  P   N D ++   T   +P  + RR ++NPEK+ED ICRMM +R WTTRL
Sbjct: 45  AAISPTESTGPETRNVDPAA---TPSRSPRGKHRRLQKNPEKLEDIICRMMANRAWTTRL 101

Query: 524 QNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRAS 345
           QNSIR LV SFDHELVYNVLHGAKNS HALQFFRWVER+ LFQH RETHLKIIEILGRAS
Sbjct: 102 QNSIRNLVPSFDHELVYNVLHGAKNSEHALQFFRWVERAGLFQHTRETHLKIIEILGRAS 161

Query: 344 KLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYD 165
           KLNHARCILLD+P+KG++WDED+WV++I+SYG AGIVQESV+LF+ MEELGV RTIK+YD
Sbjct: 162 KLNHARCILLDLPQKGVEWDEDMWVLLIESYGSAGIVQESVQLFQKMEELGVQRTIKTYD 221

Query: 164 VLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
            LFKVI+RRGRY MAKRYFNKML EGIEPTRHT+N++IWGFFLS KVE+  RFF
Sbjct: 222 ALFKVIMRRGRYGMAKRYFNKMLKEGIEPTRHTYNLMIWGFFLSSKVESAVRFF 275


>ref|XP_012481297.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Gossypium raimondii] gi|763760358|gb|KJB27612.1|
           hypothetical protein B456_005G002200 [Gossypium
           raimondii]
          Length = 739

 Score =  341 bits (875), Expect = 4e-91
 Identities = 161/199 (80%), Positives = 182/199 (91%)
 Frame = -3

Query: 599 RRRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRW 420
           + RNPEKVED ICRMM++R WTTRLQNSIR LV  FDH LVYNVLHGAKNS HALQFFRW
Sbjct: 58  KTRNPEKVEDIICRMMENRAWTTRLQNSIRALVPEFDHALVYNVLHGAKNSDHALQFFRW 117

Query: 419 VERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAG 240
           VER+ L  H+RE HLKII+ILGRASKLNHARCILLDMPKKG++WDEDL+VV+IDSYGKAG
Sbjct: 118 VERAGLIHHDREAHLKIIQILGRASKLNHARCILLDMPKKGVEWDEDLFVVLIDSYGKAG 177

Query: 239 IVQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFN 60
           IVQE+VK+F+ MEELGVDRTIKSYD  FKVILRRGRYMMAKRYFNKMLSEGI+PTRHT+N
Sbjct: 178 IVQEAVKIFQKMEELGVDRTIKSYDAFFKVILRRGRYMMAKRYFNKMLSEGIQPTRHTYN 237

Query: 59  ILIWGFFLSGKVETVNRFF 3
           I++WGFFLS +++T NRF+
Sbjct: 238 IMLWGFFLSLRLDTANRFY 256


>emb|CBI32743.3| unnamed protein product [Vitis vinifera]
          Length = 772

 Score =  338 bits (866), Expect = 5e-90
 Identities = 169/258 (65%), Positives = 203/258 (78%), Gaps = 5/258 (1%)
 Frame = -3

Query: 761 NPITNEEVSIPADPNPDYSSAKTTGDEAP-LPADANP----DYSSAKLTADATPLREGRR 597
           NP +   +   +  +   S+   T    P  P   +P    + ++A+    A+P     +
Sbjct: 23  NPSSLNFIQSFSSVDESISAGDLTSSPIPETPVSGSPSEPGNLTAAEAGEKASPRTPRGK 82

Query: 596 RRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWV 417
            RNPEK+ED ICRMM +R WTTRLQNSIR LV  FDH LV+NVLHG++NS HALQFFRWV
Sbjct: 83  LRNPEKIEDIICRMMANRAWTTRLQNSIRSLVPQFDHSLVWNVLHGSRNSDHALQFFRWV 142

Query: 416 ERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGI 237
           ER+ LF+H+R+THLKIIEILGRASKLNHARCILLDMPKKG++WDEDL+V++IDSYGKAGI
Sbjct: 143 ERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLFVLLIDSYGKAGI 202

Query: 236 VQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNI 57
           VQESVK+F+ M+ELGV+RTIKSYD LFKVILRRGRYMMAKRYFN ML+EG+ PT HT+NI
Sbjct: 203 VQESVKVFQKMKELGVERTIKSYDALFKVILRRGRYMMAKRYFNAMLNEGVMPTCHTYNI 262

Query: 56  LIWGFFLSGKVETVNRFF 3
           +IWGFFLS KVET NRFF
Sbjct: 263 MIWGFFLSLKVETANRFF 280


>ref|XP_002276355.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Vitis vinifera]
          Length = 763

 Score =  338 bits (866), Expect = 5e-90
 Identities = 169/258 (65%), Positives = 203/258 (78%), Gaps = 5/258 (1%)
 Frame = -3

Query: 761 NPITNEEVSIPADPNPDYSSAKTTGDEAP-LPADANP----DYSSAKLTADATPLREGRR 597
           NP +   +   +  +   S+   T    P  P   +P    + ++A+    A+P     +
Sbjct: 23  NPSSLNFIQSFSSVDESISAGDLTSSPIPETPVSGSPSEPGNLTAAEAGEKASPRTPRGK 82

Query: 596 RRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWV 417
            RNPEK+ED ICRMM +R WTTRLQNSIR LV  FDH LV+NVLHG++NS HALQFFRWV
Sbjct: 83  LRNPEKIEDIICRMMANRAWTTRLQNSIRSLVPQFDHSLVWNVLHGSRNSDHALQFFRWV 142

Query: 416 ERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGI 237
           ER+ LF+H+R+THLKIIEILGRASKLNHARCILLDMPKKG++WDEDL+V++IDSYGKAGI
Sbjct: 143 ERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPKKGVEWDEDLFVLLIDSYGKAGI 202

Query: 236 VQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNI 57
           VQESVK+F+ M+ELGV+RTIKSYD LFKVILRRGRYMMAKRYFN ML+EG+ PT HT+NI
Sbjct: 203 VQESVKVFQKMKELGVERTIKSYDALFKVILRRGRYMMAKRYFNAMLNEGVMPTCHTYNI 262

Query: 56  LIWGFFLSGKVETVNRFF 3
           +IWGFFLS KVET NRFF
Sbjct: 263 MIWGFFLSLKVETANRFF 280


>ref|XP_009630943.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Nicotiana tomentosiformis]
          Length = 721

 Score =  337 bits (865), Expect = 6e-90
 Identities = 160/197 (81%), Positives = 177/197 (89%)
 Frame = -3

Query: 593 RNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVE 414
           + PEKVED ICRMM +R WTTRLQNSIR LV SFDHELVYNVLH AKNS HALQFFRWVE
Sbjct: 45  KTPEKVEDLICRMMSTRVWTTRLQNSIRNLVPSFDHELVYNVLHNAKNSEHALQFFRWVE 104

Query: 413 RSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIV 234
           RS LF+H+RETH KII+ILGR+ KLNHARCILLDMP KG+ WDEDLWV+MIDSYGKAGIV
Sbjct: 105 RSGLFRHDRETHFKIIQILGRSEKLNHARCILLDMPNKGVDWDEDLWVLMIDSYGKAGIV 164

Query: 233 QESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNIL 54
           QESVKLF+ MEELGV+RT+KSY+ LF VI RRGRYMMAKRYFNKM+SEGIEPTRHT+N+L
Sbjct: 165 QESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNKMVSEGIEPTRHTYNLL 224

Query: 53  IWGFFLSGKVETVNRFF 3
           IWGFFLS K++T  RFF
Sbjct: 225 IWGFFLSSKLDTAIRFF 241


>ref|XP_004299746.2| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Fragaria vesca subsp. vesca]
          Length = 766

 Score =  337 bits (863), Expect = 1e-89
 Identities = 170/247 (68%), Positives = 194/247 (78%), Gaps = 7/247 (2%)
 Frame = -3

Query: 722 PNPDYSSAKTTGDEAPLPADANPDYSSAKLTADATPL-------REGRRRRNPEKVEDTI 564
           P+P   SA +    A  P  + PD  +    A + P        R+ RR RNPEK ED I
Sbjct: 37  PSPQPGSA-SDAPPAETPTGSPPDPQNGSAAAASAPPPPQTPKPRQLRRARNPEKTEDII 95

Query: 563 CRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRE 384
           CRMM +R WTTRLQNSIR LV  FDH LV+NVLHGAK S  ALQFFRWVERS LFQH+RE
Sbjct: 96  CRMMANRAWTTRLQNSIRDLVPEFDHNLVWNVLHGAKTSDQALQFFRWVERSRLFQHDRE 155

Query: 383 THLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIM 204
           THLKIIEILGRASKLNHARCILLDMPKKG++WDEDL++ +IDSYGKAGIVQESVKLF  M
Sbjct: 156 THLKIIEILGRASKLNHARCILLDMPKKGVQWDEDLFIHLIDSYGKAGIVQESVKLFNQM 215

Query: 203 EELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKV 24
           +ELGV+R++KSY+ LFK ILRRGRYMM KRYFN ML+EGIEPTRHT+NI+IWGFFLS ++
Sbjct: 216 KELGVERSLKSYEALFKSILRRGRYMMGKRYFNHMLAEGIEPTRHTYNIMIWGFFLSLRL 275

Query: 23  ETVNRFF 3
           ET  RFF
Sbjct: 276 ETAKRFF 282


>ref|XP_007033459.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
           cacao] gi|508712488|gb|EOY04385.1| Tetratricopeptide
           repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 743

 Score =  334 bits (857), Expect = 5e-89
 Identities = 162/221 (73%), Positives = 187/221 (84%)
 Frame = -3

Query: 665 DANPDYSSAKLTADATPLREGRRRRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDH 486
           +A P     K+    T  R   + RNPEKVED ICRMM++R WTTRLQNSIR LV  FDH
Sbjct: 42  NAPPQQEGEKVVTQRTSPRG--KTRNPEKVEDVICRMMENRAWTTRLQNSIRALVPEFDH 99

Query: 485 ELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMP 306
            LVYNVLHGAKNS  ALQFFRWVER+ L +H+RE H+KII+ILGRASKLNHARCILLDMP
Sbjct: 100 ALVYNVLHGAKNSEQALQFFRWVERAGLIRHDREAHMKIIQILGRASKLNHARCILLDMP 159

Query: 305 KKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYM 126
           KKG++WDEDL+VV+IDSYGKAGIVQE+VK+F+ M ELGV+RTIKSYD  FKVILRRGRYM
Sbjct: 160 KKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMNELGVERTIKSYDAFFKVILRRGRYM 219

Query: 125 MAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           MAKRYFNKMLSEGI PTRHT+NI++WGFFLS +++T NRF+
Sbjct: 220 MAKRYFNKMLSEGIVPTRHTYNIMLWGFFLSLRLDTANRFY 260


>ref|XP_010111755.1| hypothetical protein L484_008414 [Morus notabilis]
           gi|587945196|gb|EXC31617.1| hypothetical protein
           L484_008414 [Morus notabilis]
          Length = 768

 Score =  333 bits (853), Expect = 2e-88
 Identities = 164/253 (64%), Positives = 198/253 (78%)
 Frame = -3

Query: 761 NPITNEEVSIPADPNPDYSSAKTTGDEAPLPADANPDYSSAKLTADATPLREGRRRRNPE 582
           +P    E S    PNPD   +     E+P P  + P+ ++ + T          + RNPE
Sbjct: 45  DPAPTTEKSPDPVPNPDCPPS-----ESPNPPKSRPENTAIQRTPRG-------KSRNPE 92

Query: 581 KVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNL 402
           K+ED ICRMM +R WTTRLQNSIR+LV  FDH LV+NVLHGA+NS HALQFFRWVERS L
Sbjct: 93  KIEDIICRMMANRAWTTRLQNSIRRLVPQFDHSLVWNVLHGARNSDHALQFFRWVERSGL 152

Query: 401 FQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESV 222
           F H+RETHLKIIEIL RASKLNHARCILLDMPKK ++WDEDL+V+ ID YGKAGIVQESV
Sbjct: 153 FNHDRETHLKIIEILTRASKLNHARCILLDMPKKSVQWDEDLFVLFIDGYGKAGIVQESV 212

Query: 221 KLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGF 42
           ++F  M+ELGV+R++KSYD LFKVILRRGRYMMAKRYFN M++EGIEPT+HT+NI++WGF
Sbjct: 213 RMFNKMKELGVERSVKSYDALFKVILRRGRYMMAKRYFNAMINEGIEPTKHTYNIMLWGF 272

Query: 41  FLSGKVETVNRFF 3
           FLS ++ET  RF+
Sbjct: 273 FLSLRLETAKRFY 285


>ref|XP_012082370.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Jatropha curcas] gi|643717679|gb|KDP29122.1|
           hypothetical protein JCGZ_16511 [Jatropha curcas]
          Length = 760

 Score =  331 bits (849), Expect = 5e-88
 Identities = 161/229 (70%), Positives = 190/229 (82%), Gaps = 9/229 (3%)
 Frame = -3

Query: 662 ANPDYSSAKLTADATPLREGR---------RRRNPEKVEDTICRMMDSRPWTTRLQNSIR 510
           ANP   S   T +A    + +         +R  PEK+ED IC+MM SRPWTTRLQNSIR
Sbjct: 49  ANPQQESQVETPNAVQENQSQQRIPRIPRGKRPEPEKLEDIICKMMASRPWTTRLQNSIR 108

Query: 509 QLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASKLNHA 330
            LV  FDH LVYNVLHGA+N  HALQFFRWVER+ LF+H+RETH+KIIEILGRASKLNHA
Sbjct: 109 DLVPEFDHSLVYNVLHGARNYEHALQFFRWVERAGLFRHDRETHMKIIEILGRASKLNHA 168

Query: 329 RCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDVLFKV 150
           RCILLDMPKKG++WDED++VV+I+SYGKAGIVQE+VK+F+ M ELGV R+IKSYD +FKV
Sbjct: 169 RCILLDMPKKGVEWDEDMFVVLIESYGKAGIVQEAVKIFQKMNELGVGRSIKSYDAVFKV 228

Query: 149 ILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           ILRRGRYMMAKR+FNKMLSEGIEPTRHT+NI++WGFFLS ++ET  RF+
Sbjct: 229 ILRRGRYMMAKRFFNKMLSEGIEPTRHTYNIMLWGFFLSLRLETAMRFY 277


>ref|XP_006353112.1| PREDICTED: pentatricopeptide repeat-containing protein
           At2g37230-like isoform X1 [Solanum tuberosum]
          Length = 731

 Score =  331 bits (848), Expect = 6e-88
 Identities = 157/202 (77%), Positives = 177/202 (87%)
 Frame = -3

Query: 608 EGRRRRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQF 429
           +G   +  EK+ED ICRMM +R WTTRLQNSIR +V SFDHELVYNVLH AKNS HALQF
Sbjct: 47  KGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAKNSEHALQF 106

Query: 428 FRWVERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYG 249
           FRWVERS LF+H+RETH KII+ILGRA KLNHARCILLDMP KG+ WDEDLWV+MIDSYG
Sbjct: 107 FRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLWVLMIDSYG 166

Query: 248 KAGIVQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRH 69
           KAGIVQESVKLF+ MEELGV+RT+KSY+ LF VI RRGRYMMAKRYFNKM+++GIEPT H
Sbjct: 167 KAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNKMVNQGIEPTGH 226

Query: 68  TFNILIWGFFLSGKVETVNRFF 3
           T+N+LIWGFFLS KV+T  RFF
Sbjct: 227 TYNLLIWGFFLSSKVDTAIRFF 248


>ref|XP_009782844.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Nicotiana sylvestris] gi|698423785|ref|XP_009782850.1|
           PREDICTED: pentatricopeptide repeat-containing protein
           At2g37230 [Nicotiana sylvestris]
           gi|698423808|ref|XP_009782863.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g37230
           [Nicotiana sylvestris]
          Length = 722

 Score =  330 bits (847), Expect = 8e-88
 Identities = 163/218 (74%), Positives = 181/218 (83%), Gaps = 2/218 (0%)
 Frame = -3

Query: 650 YSSAKLTAD--ATPLREGRRRRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELV 477
           YSS  L     +T +      + PEKVED ICRMM +R WTTRLQNSIR LV SFDHELV
Sbjct: 24  YSSESLNNPDPSTRIPTTHNPKTPEKVEDLICRMMSTRVWTTRLQNSIRNLVPSFDHELV 83

Query: 476 YNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKG 297
           YNVLH AKNS  ALQFFRWVERS LF+H+RETH KII+ILGR+ KLNHARCILLDMP KG
Sbjct: 84  YNVLHNAKNSEQALQFFRWVERSGLFRHDRETHFKIIQILGRSEKLNHARCILLDMPNKG 143

Query: 296 LKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAK 117
           + WDEDLWV+MIDSYGKAGIVQESVKLF+ MEELGV+RTIKSY+ LF VI RRGRYMMAK
Sbjct: 144 VDWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERTIKSYNALFNVITRRGRYMMAK 203

Query: 116 RYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           RYFNKM++EGIEPT HT+N+LIWGFFLS K +T  RFF
Sbjct: 204 RYFNKMVNEGIEPTTHTYNLLIWGFFLSSKPDTAIRFF 241


>gb|ACU25637.1| pentatricopeptide repeat-containing protein [Citharexylum
           ligustrinum]
          Length = 484

 Score =  330 bits (845), Expect = 1e-87
 Identities = 160/173 (92%), Positives = 167/173 (96%)
 Frame = -3

Query: 521 NSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASK 342
           NSIR LV SFDHELVYNVLHG+K S HALQFFRWVERSNLF+HNRETHLKIIEILGRASK
Sbjct: 1   NSIRNLVPSFDHELVYNVLHGSKKSEHALQFFRWVERSNLFEHNRETHLKIIEILGRASK 60

Query: 341 LNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDV 162
           LNHARCILLDMPKKGL+WDEDLWV+MIDSYGKAGIVQESVKLF+ MEELGV+RTIKSYDV
Sbjct: 61  LNHARCILLDMPKKGLEWDEDLWVLMIDSYGKAGIVQESVKLFQKMEELGVERTIKSYDV 120

Query: 161 LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNI+IWGFFLSGKVET NRFF
Sbjct: 121 LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNIMIWGFFLSGKVETANRFF 173


>ref|XP_004251992.1| PREDICTED: pentatricopeptide repeat-containing protein At2g37230
           [Solanum lycopersicum]
          Length = 731

 Score =  330 bits (845), Expect = 1e-87
 Identities = 156/202 (77%), Positives = 177/202 (87%)
 Frame = -3

Query: 608 EGRRRRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQF 429
           +G   +  EK+ED ICRMM +R WTTRLQNSIR +V SFDHELVYNVLH AKNS HALQF
Sbjct: 47  KGNSPKPQEKLEDLICRMMSTRAWTTRLQNSIRNIVPSFDHELVYNVLHSAKNSEHALQF 106

Query: 428 FRWVERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYG 249
           FRWVERS LF+H+RETH KII+ILGRA KLNHARCILLDMP KG+ WDEDLWV+MIDSYG
Sbjct: 107 FRWVERSGLFRHDRETHFKIIQILGRAEKLNHARCILLDMPNKGVDWDEDLWVLMIDSYG 166

Query: 248 KAGIVQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRH 69
           KAGIVQESVKLF+ MEELGV+RT+KSY+ LF VI RRGRYMMAKRYFN+M+++GIEPT H
Sbjct: 167 KAGIVQESVKLFQKMEELGVERTVKSYNALFNVITRRGRYMMAKRYFNRMVNQGIEPTGH 226

Query: 68  TFNILIWGFFLSGKVETVNRFF 3
           T+N+LIWGFFLS KV+T  RFF
Sbjct: 227 TYNLLIWGFFLSSKVDTAIRFF 248


>ref|NP_181260.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75216851|sp|Q9ZUU3.1|PP190_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At2g37230 gi|4056478|gb|AAC98044.1| unknown protein
           [Arabidopsis thaliana] gi|28973644|gb|AAO64144.1|
           unknown protein [Arabidopsis thaliana]
           gi|110736716|dbj|BAF00321.1| hypothetical protein
           [Arabidopsis thaliana] gi|330254276|gb|AEC09370.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 757

 Score =  328 bits (840), Expect = 5e-87
 Identities = 166/261 (63%), Positives = 205/261 (78%), Gaps = 15/261 (5%)
 Frame = -3

Query: 740 VSIPADPNPDYSSAK---TTGDEAPLPADANPDYSSA--------KLTADAT-PLREG-- 603
           +S+P   N    S     +T +E   PA+ANP+  S          LT+  T PLRE   
Sbjct: 18  LSLPRSSNSSLFSLPRLFSTIEETQTPANANPETQSPDAKSETKKNLTSTETRPLRERFQ 77

Query: 602 -RRRRNPEKVEDTICRMMDSRPWTTRLQNSIRQLVLSFDHELVYNVLHGAKNSGHALQFF 426
             +R+N EK+EDTICRMMD+R WTTRLQNSIR LV  +DH LVYNVLHGAK   HALQFF
Sbjct: 78  RGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEWDHSLVYNVLHGAKKLEHALQFF 137

Query: 425 RWVERSNLFQHNRETHLKIIEILGRASKLNHARCILLDMPKKGLKWDEDLWVVMIDSYGK 246
           RW ERS L +H+R+TH+K+I++LG  SKLNHARCILLDMP+KG+ WDED++VV+I+SYGK
Sbjct: 138 RWTERSGLIRHDRDTHMKMIKMLGEVSKLNHARCILLDMPEKGVPWDEDMFVVLIESYGK 197

Query: 245 AGIVQESVKLFRIMEELGVDRTIKSYDVLFKVILRRGRYMMAKRYFNKMLSEGIEPTRHT 66
           AGIVQESVK+F+ M++LGV+RTIKSY+ LFKVILRRGRYMMAKRYFNKM+SEG+EPTRHT
Sbjct: 198 AGIVQESVKIFQKMKDLGVERTIKSYNSLFKVILRRGRYMMAKRYFNKMVSEGVEPTRHT 257

Query: 65  FNILIWGFFLSGKVETVNRFF 3
           +N+++WGFFLS ++ET  RFF
Sbjct: 258 YNLMLWGFFLSLRLETALRFF 278


>gb|ACU25641.1| pentatricopeptide repeat-containing protein [Bouchea fluminensis]
          Length = 481

 Score =  327 bits (839), Expect = 7e-87
 Identities = 159/173 (91%), Positives = 165/173 (95%)
 Frame = -3

Query: 521 NSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASK 342
           NSIR LV SFDHELVYNVLHGAKNS HALQFFRWVERSNLF HNRETHLKIIEILGRASK
Sbjct: 1   NSIRNLVPSFDHELVYNVLHGAKNSEHALQFFRWVERSNLFXHNRETHLKIIEILGRASK 60

Query: 341 LNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDV 162
           LNHARCILLDMPKKGL+WDEDLWV+MIDSYGKAGIVQESVK+F+ MEELGV RTIKSYD 
Sbjct: 61  LNHARCILLDMPKKGLEWDEDLWVLMIDSYGKAGIVQESVKMFQKMEELGVXRTIKSYDA 120

Query: 161 LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFN++IWGFFLSGKVET NRFF
Sbjct: 121 LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNVMIWGFFLSGKVETSNRFF 173


>gb|ACU25638.1| pentatricopeptide repeat-containing protein [Citharexylum
           montevidense]
          Length = 481

 Score =  327 bits (839), Expect = 7e-87
 Identities = 159/173 (91%), Positives = 165/173 (95%)
 Frame = -3

Query: 521 NSIRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASK 342
           NSIR LV SFDHELVYNVLHGAK S HALQFFRWV+RSNLF+HNRETHLKIIEILGRASK
Sbjct: 1   NSIRNLVPSFDHELVYNVLHGAKKSEHALQFFRWVZRSNLFEHNRETHLKIIEILGRASK 60

Query: 341 LNHARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDV 162
           LNHARCILLDMPKKGL+WDEDLWV+MIDSYGKAGIVQESVKLF  MEELGV+RTIKSYDV
Sbjct: 61  LNHARCILLDMPKKGLQWDEDLWVLMIDSYGKAGIVQESVKLFHKMEELGVERTIKSYDV 120

Query: 161 LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNI+IWGFFLSGKVET  RFF
Sbjct: 121 LFKVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNIMIWGFFLSGKVETAXRFF 173


>ref|XP_006428766.1| hypothetical protein CICLE_v10011107mg [Citrus clementina]
           gi|557530823|gb|ESR42006.1| hypothetical protein
           CICLE_v10011107mg [Citrus clementina]
          Length = 787

 Score =  327 bits (838), Expect = 9e-87
 Identities = 161/231 (69%), Positives = 188/231 (81%), Gaps = 4/231 (1%)
 Frame = -3

Query: 683 EAPLPADANPDYSSAKLTADATPLREGR----RRRNPEKVEDTICRMMDSRPWTTRLQNS 516
           ++P P   NPD       AD  P +  R      R+P K+EDTIC++M  R WTTRLQN 
Sbjct: 82  DSPAP---NPD----PFQADEEPSQRQRIPRGNHRSPVKLEDTICKLMAERAWTTRLQNK 134

Query: 515 IRQLVLSFDHELVYNVLHGAKNSGHALQFFRWVERSNLFQHNRETHLKIIEILGRASKLN 336
           IR LV  FDH LVYNVLHGAKNS HALQFFRWVER+ LF H+RETHLK+IEILGR  KLN
Sbjct: 135 IRALVPQFDHNLVYNVLHGAKNSEHALQFFRWVERAGLFNHDRETHLKMIEILGRVGKLN 194

Query: 335 HARCILLDMPKKGLKWDEDLWVVMIDSYGKAGIVQESVKLFRIMEELGVDRTIKSYDVLF 156
           HARCILLDMPKKG++WDEDL+ V+I+SYGK GIVQESVK+F IM++LGV+R++KSYD LF
Sbjct: 195 HARCILLDMPKKGVQWDEDLFEVLIESYGKKGIVQESVKIFDIMKQLGVERSVKSYDALF 254

Query: 155 KVILRRGRYMMAKRYFNKMLSEGIEPTRHTFNILIWGFFLSGKVETVNRFF 3
           K+ILRRGRYMMAKRYFNKMLSEGIEPTRHT+N+++WGFFLS K+ET  RFF
Sbjct: 255 KLILRRGRYMMAKRYFNKMLSEGIEPTRHTYNVMLWGFFLSLKLETAIRFF 305


Top