BLASTX nr result
ID: Perilla23_contig00001846
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Perilla23_contig00001846 (1537 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011100044.1| PREDICTED: uncharacterized protein LOC105178... 327 2e-86 ref|XP_012845331.1| PREDICTED: uncharacterized protein LOC105965... 305 9e-80 ref|XP_011097551.1| PREDICTED: uncharacterized protein LOC105176... 277 2e-71 emb|CDP01491.1| unnamed protein product [Coffea canephora] 198 1e-47 ref|XP_012853562.1| PREDICTED: uncharacterized protein LOC105973... 191 1e-45 gb|EYU23966.1| hypothetical protein MIMGU_mgv1a017990mg, partial... 191 1e-45 ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592... 191 1e-45 ref|XP_007047892.1| Uncharacterized protein isoform 1 [Theobroma... 191 1e-45 ref|XP_009604212.1| PREDICTED: uncharacterized protein LOC104099... 188 1e-44 ref|XP_010312631.1| PREDICTED: uncharacterized protein LOC101259... 187 2e-44 ref|XP_007047893.1| Uncharacterized protein isoform 2 [Theobroma... 187 3e-44 ref|XP_009802512.1| PREDICTED: uncharacterized protein LOC104248... 184 1e-43 ref|XP_010646942.1| PREDICTED: uncharacterized protein LOC100244... 178 1e-41 ref|XP_012081607.1| PREDICTED: uncharacterized protein LOC105641... 177 2e-41 ref|XP_010087627.1| hypothetical protein L484_022154 [Morus nota... 175 1e-40 gb|KDO60721.1| hypothetical protein CISIN_1g020717mg [Citrus sin... 169 6e-39 gb|KDO60720.1| hypothetical protein CISIN_1g020717mg [Citrus sin... 169 6e-39 ref|XP_010245835.1| PREDICTED: uncharacterized protein LOC104589... 169 8e-39 ref|XP_006426358.1| hypothetical protein CICLE_v10026087mg [Citr... 168 1e-38 ref|XP_012469812.1| PREDICTED: uncharacterized protein LOC105787... 167 2e-38 >ref|XP_011100044.1| PREDICTED: uncharacterized protein LOC105178294 [Sesamum indicum] Length = 314 Score = 327 bits (837), Expect = 2e-86 Identities = 186/314 (59%), Positives = 217/314 (69%), Gaps = 1/314 (0%) Frame = -3 Query: 1361 MALTLHTILTNKYPTFSHLLLXXXXXXXXXXXXXXXXXSRTHFPNQRYFTSQPRLFAAAR 1182 MAL L TI NK P+ S+LL+ SRTHF N R+ TSQ LFAA + Sbjct: 1 MALMLQTISGNKNPSTSNLLVSQSRFLKSLKFYFPPSNSRTHFLNCRFLTSQSYLFAATK 60 Query: 1181 ISDSFQEPYVASQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXX 1002 ++ SF+EPY ASQ Q GN ++FDAFLS +EF+ VRC Sbjct: 61 VNASFEEPYGASQNQVGGNSSSSFDAFLSAVEFLSLASSAAVSVYIAVRCGIQKGGALGL 120 Query: 1001 XGSRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRS 822 GS+ILVWQCVVL GL AGA IR+RQWRR+CGVG SR PASSGANLLERVEKLE+DLRS Sbjct: 121 LGSKILVWQCVVLVGGLVAGAVIRQRQWRRVCGVGLSRAPASSGANLLERVEKLEDDLRS 180 Query: 821 AATIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGE 642 +ATI+QALSRRLEKLGIRFRLTRKALKEPIAET+ LVQKNSEAT+ALAAQ DILEKELGE Sbjct: 181 SATIIQALSRRLEKLGIRFRLTRKALKEPIAETSALVQKNSEATRALAAQEDILEKELGE 240 Query: 641 IQKVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAES-SNLDINQI 465 IQKV LA+GKAGKL + K VKS+D+KA S+ + +NL+INQI Sbjct: 241 IQKVLLAMQEQQQKQLELILAIGKAGKLLETKRVKSRDQKAPETSESSVDGVANLEINQI 300 Query: 464 ETMAWQKEGHNDRP 423 ET+A +KE +NDRP Sbjct: 301 ETLARKKEANNDRP 314 >ref|XP_012845331.1| PREDICTED: uncharacterized protein LOC105965332 [Erythranthe guttatus] gi|604319828|gb|EYU30992.1| hypothetical protein MIMGU_mgv1a010697mg [Erythranthe guttata] Length = 304 Score = 305 bits (780), Expect = 9e-80 Identities = 186/322 (57%), Positives = 208/322 (64%), Gaps = 10/322 (3%) Frame = -3 Query: 1361 MALTLHTILTN-----KYPTFSHLLLXXXXXXXXXXXXXXXXXSRTHFPNQRYFTSQPRL 1197 MALT HT+ TN K P F SRT F R+FT+ PR Sbjct: 1 MALTFHTVFTNPTRFLKSPKFPF----------------PPPNSRTQFETPRFFTTHPRR 44 Query: 1196 FAAARISDSFQEPYVASQQ-QANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXX 1020 FAAARI+ SFQEPY AS +G GG +DAFLS LEF+ V+C Sbjct: 45 FAAARINASFQEPYGASGNITTSGGGGGGYDAFLSTLEFLSLASSAGFSVYIAVKCGVQK 104 Query: 1019 XXXXXXXG--SRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVE 846 S+ LVWQCVVL GLAAGA IRRRQWRRICGVGFSRGP S GA+LL+RVE Sbjct: 105 GGGGALGVVGSKFLVWQCVVLVIGLAAGAVIRRRQWRRICGVGFSRGPPSYGASLLDRVE 164 Query: 845 KLEEDLRSAATIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGD 666 KLEEDLRS +TI+QALSRRLEKLGIRFRLTRKALKEPIAETA LV+KNSEATQALAAQ D Sbjct: 165 KLEEDLRSVSTIIQALSRRLEKLGIRFRLTRKALKEPIAETAALVRKNSEATQALAAQED 224 Query: 665 ILEKELGEIQKVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASK-PVAES 489 LEKELGEIQKV LA+GKAGKLWD K V++QD+KA +K PVA Sbjct: 225 NLEKELGEIQKVLLAMQEQQQKQLELILAIGKAGKLWDTKRVENQDKKAPEPTKSPVA-- 282 Query: 488 SNLDIN-QIETMAWQKEGHNDR 426 NL+IN +IET WQKE ND+ Sbjct: 283 -NLEINPKIETPTWQKESSNDK 303 >ref|XP_011097551.1| PREDICTED: uncharacterized protein LOC105176447 [Sesamum indicum] Length = 318 Score = 277 bits (709), Expect = 2e-71 Identities = 162/317 (51%), Positives = 191/317 (60%), Gaps = 5/317 (1%) Frame = -3 Query: 1361 MALTLHTILTNKYPTFSHLLLXXXXXXXXXXXXXXXXXSRTHFPNQRYFTSQPRLFAAAR 1182 M+ TL I TNK PTFS L SR HFPN R+ TSQ ++F+ R Sbjct: 1 MSFTLRAIFTNKNPTFSPATLSPTRLLKSVNLSFSATKSRIHFPNHRFLTSQCQVFSITR 60 Query: 1181 ISDSFQEPYVASQQQANGNGGA----NFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXX 1014 I S E Y S+ Q NG+G +FDAFLS LEF+ + Sbjct: 61 IKVSLNESYGTSESQVNGSGSTLNHFSFDAFLSTLEFLSLASSAAISVYVALSSGVQQGG 120 Query: 1013 XXXXXGSRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEE 834 GS+ILVWQCVVL + GA IRRRQWRRICG GFSR AS G NLL RVEKLEE Sbjct: 121 VLGRVGSKILVWQCVVLVSSVVVGAVIRRRQWRRICGAGFSRSSASYGVNLLGRVEKLEE 180 Query: 833 DLRSAATIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEK 654 DLRS+ATI++ LSR+LEKLGIR R+TRKAL+EPIAETA L QKNSEAT+ALA Q DILEK Sbjct: 181 DLRSSATIIRVLSRQLEKLGIRVRVTRKALQEPIAETAALAQKNSEATRALAVQEDILEK 240 Query: 653 ELGEIQKVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAES-SNLD 477 ELGEIQKV LA+GKAGKLW+ + ++++D+ S SKP E NL Sbjct: 241 ELGEIQKVLLAMQEQQQKQLELILAIGKAGKLWETQRLQTKDQNVSETSKPAVEGLPNLG 300 Query: 476 INQIETMAWQKEGHNDR 426 NQIE A + + N R Sbjct: 301 TNQIEAPALRSQADNKR 317 >emb|CDP01491.1| unnamed protein product [Coffea canephora] Length = 326 Score = 198 bits (503), Expect = 1e-47 Identities = 118/243 (48%), Positives = 144/243 (59%), Gaps = 3/243 (1%) Frame = -3 Query: 1148 SQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXG--SRILVWQ 975 S+ + N NFDAFLS+LEF V + +VWQ Sbjct: 86 SRSENPANWDVNFDAFLSILEFFCLVSSIAISGILAVNSGFLGGQRMVFRWLGEKGMVWQ 145 Query: 974 CVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAATIVQALS 795 CVVL G+ GA IRRRQWRRIC + P NL+ER+EKLEE+ +S+AT+++ALS Sbjct: 146 CVVLVAGVLVGAVIRRRQWRRICQAKYFSRPV----NLVERIEKLEENFKSSATVIRALS 201 Query: 794 RRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXX 615 R+LEKLGIRFR+ RKALKEPIAETA L QKNSEAT+ALA Q DILEKELGEIQKV Sbjct: 202 RQLEKLGIRFRVFRKALKEPIAETAALAQKNSEATRALAIQEDILEKELGEIQKVLLAMQ 261 Query: 614 XXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAES-SNLDINQIETMAWQKEG 438 LA+ K+GKLWD K ++ AS V + L NQI+ +A QKE Sbjct: 262 EQQQKQLELILAIAKSGKLWDTKREENHGNNTPAASNSVVDGVQQLGKNQIQAVAGQKES 321 Query: 437 HND 429 +ND Sbjct: 322 NND 324 >ref|XP_012853562.1| PREDICTED: uncharacterized protein LOC105973097 [Erythranthe guttatus] Length = 312 Score = 191 bits (485), Expect = 1e-45 Identities = 131/290 (45%), Positives = 159/290 (54%), Gaps = 8/290 (2%) Frame = -3 Query: 1361 MALTLHTILTNKYPTFSHLLLXXXXXXXXXXXXXXXXXSRTHFPNQRYFTS----QPRLF 1194 M+L L +I NK PT S L + P Q ++ QP L Sbjct: 1 MSLNLQSIFPNKNPTTSSASLSSPPPPHFLKPHLNFPFPPPNSPTQSPISTFSKPQPHLL 60 Query: 1193 ---AAARISDSFQEPYVASQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXX 1023 +I+ S E Y A N + + DAFLS +EF+ V Sbjct: 61 LRRVGKKINASSDEAYAAIGVAPNPS---SLDAFLSAVEFLSLASSAAVSVYVAVG-GGV 116 Query: 1022 XXXXXXXXGSRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLE-RVE 846 GSRILVWQCVVL G+ GAAIRRRQWRRICG A SG N L RVE Sbjct: 117 LKGGGLVLGSRILVWQCVVLVGGVLVGAAIRRRQWRRICGAA----AAPSGVNNLSARVE 172 Query: 845 KLEEDLRSAATIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGD 666 K+EEDLRS+ATI++ LSR+L+KLG RFR+TRKALKEP++ETA L QKNSEAT+ALAAQ D Sbjct: 173 KVEEDLRSSATIIRVLSRQLDKLGSRFRVTRKALKEPVSETAALAQKNSEATRALAAQED 232 Query: 665 ILEKELGEIQKVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKAS 516 ILE ELGEIQ V +A+GKAGKLW+ + V ++D AS Sbjct: 233 ILENELGEIQNVLLAMQEQQQKQLELIIALGKAGKLWETRSVPTKDHNAS 282 >gb|EYU23966.1| hypothetical protein MIMGU_mgv1a017990mg, partial [Erythranthe guttata] Length = 289 Score = 191 bits (485), Expect = 1e-45 Identities = 131/290 (45%), Positives = 159/290 (54%), Gaps = 8/290 (2%) Frame = -3 Query: 1361 MALTLHTILTNKYPTFSHLLLXXXXXXXXXXXXXXXXXSRTHFPNQRYFTS----QPRLF 1194 M+L L +I NK PT S L + P Q ++ QP L Sbjct: 1 MSLNLQSIFPNKNPTTSSASLSSPPPPHFLKPHLNFPFPPPNSPTQSPISTFSKPQPHLL 60 Query: 1193 ---AAARISDSFQEPYVASQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXX 1023 +I+ S E Y A N + + DAFLS +EF+ V Sbjct: 61 LRRVGKKINASSDEAYAAIGVAPNPS---SLDAFLSAVEFLSLASSAAVSVYVAVG-GGV 116 Query: 1022 XXXXXXXXGSRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLE-RVE 846 GSRILVWQCVVL G+ GAAIRRRQWRRICG A SG N L RVE Sbjct: 117 LKGGGLVLGSRILVWQCVVLVGGVLVGAAIRRRQWRRICGAA----AAPSGVNNLSARVE 172 Query: 845 KLEEDLRSAATIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGD 666 K+EEDLRS+ATI++ LSR+L+KLG RFR+TRKALKEP++ETA L QKNSEAT+ALAAQ D Sbjct: 173 KVEEDLRSSATIIRVLSRQLDKLGSRFRVTRKALKEPVSETAALAQKNSEATRALAAQED 232 Query: 665 ILEKELGEIQKVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKAS 516 ILE ELGEIQ V +A+GKAGKLW+ + V ++D AS Sbjct: 233 ILENELGEIQNVLLAMQEQQQKQLELIIALGKAGKLWETRSVPTKDHNAS 282 >ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592816 [Solanum tuberosum] Length = 313 Score = 191 bits (485), Expect = 1e-45 Identities = 121/249 (48%), Positives = 151/249 (60%), Gaps = 8/249 (3%) Frame = -3 Query: 1148 SQQQANGNGGA----NFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXGSRILV 981 S+ NG A NFD FLS+LEF+ V +R+L Sbjct: 66 SEGTVNGQVSAEYEFNFDGFLSILEFLCLLSSAVVAIGFAVNSWVLGSQKWLG--NRVLA 123 Query: 980 WQCVVLSCGLAAGAAIRRRQWRRICGVGFSR-GPASSGANLLERVEKLEEDLRSAATIVQ 804 QCVVL G+ G+ IRRRQWRRIC FSR G G NLLER+EK+EEDLRS+ATI++ Sbjct: 124 AQCVVLVGGVIIGSVIRRRQWRRICMNKFSRSGSDLKGVNLLERIEKVEEDLRSSATIIR 183 Query: 803 ALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXX 624 LSR+LEKLGIRFR+TRK LK+PI E A+L QKNSEAT+ALA Q + LEKELGEIQKV Sbjct: 184 VLSRQLEKLGIRFRVTRKTLKDPITEAAMLAQKNSEATRALALQDERLEKELGEIQKVLL 243 Query: 623 XXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQD--RKASGASKPVAES-SNLDINQIETMA 453 LA+GK GKL++ K SQD +K++ S A+ L +NQI+ + Sbjct: 244 AMQDQQHKQLELILAIGKTGKLFENKRGLSQDPNKKSNDVSNTAADGFPQLGVNQIQALK 303 Query: 452 WQKEGHNDR 426 Q+E +NDR Sbjct: 304 RQRETNNDR 312 >ref|XP_007047892.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508700153|gb|EOX92049.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 316 Score = 191 bits (485), Expect = 1e-45 Identities = 117/242 (48%), Positives = 141/242 (58%) Frame = -3 Query: 1151 ASQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXGSRILVWQC 972 ASQ+Q N D+FLS+ EF+ V R++VW Sbjct: 76 ASQEQNPIFNDFNLDSFLSIAEFLCILSSAVVSVVGAVS--GWKGVILGGIWRRVMVWGI 133 Query: 971 VVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAATIVQALSR 792 V L G+A GA IRRRQWRRIC G NL+ R+EKLEEDLRS ATI +ALSR Sbjct: 134 VGLVSGVAIGAWIRRRQWRRICAETVKGGGGGKNLNLIGRIEKLEEDLRSYATITRALSR 193 Query: 791 RLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXXX 612 +LEKLGIRFR+TRKALKEPIAETA L QKNSEAT+ALA Q DILEKELGEIQKV Sbjct: 194 QLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQE 253 Query: 611 XXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETMAWQKEGHN 432 LA+GK+GKL++ K SQ++ A E + ++INQ + + K N Sbjct: 254 QQGKQLELILAIGKSGKLFEDKREPSQEKNTVEACNLTEEVNQMEINQTQPLGTSKGSGN 313 Query: 431 DR 426 DR Sbjct: 314 DR 315 >ref|XP_009604212.1| PREDICTED: uncharacterized protein LOC104099043 [Nicotiana tomentosiformis] Length = 362 Score = 188 bits (478), Expect = 1e-44 Identities = 114/246 (46%), Positives = 150/246 (60%), Gaps = 4/246 (1%) Frame = -3 Query: 1151 ASQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXGSRILVWQC 972 A ++Q+ N DAFLS+LEF+ V +R+L QC Sbjct: 118 AVKEQSLAEFEFNIDAFLSILEFLCLFSSAVVSIGYAVNSWFLGSQKWLG--NRVLAAQC 175 Query: 971 VVLSCGLAAGAAIRRRQWRRICGVGFSR-GPASSGANLLERVEKLEEDLRSAATIVQALS 795 VVL G+ G+ IRRRQW RIC V FSR G S G NL+ER+EKLEEDLRS+ T+++ LS Sbjct: 176 VVLVGGVVIGSVIRRRQWSRICMVEFSRSGSGSRGVNLVERIEKLEEDLRSSTTLIRVLS 235 Query: 794 RRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXX 615 R+LEKLGIRFR+TRK LK+P+ E A L QKNSEAT+ALA QG+ LEKELGEIQKV Sbjct: 236 RQLEKLGIRFRITRKTLKDPVTEAATLAQKNSEATRALALQGEHLEKELGEIQKVLLAMQ 295 Query: 614 XXXXXXXXXXLAVGKAGKLWDAKPVKSQD--RKASGASKPVAES-SNLDINQIETMAWQK 444 LA+GK GKL++ K SQ+ + S S + L++N+++++ Q+ Sbjct: 296 EQQHKQLELILAIGKTGKLFENKRGPSQEPAQNTSNVSNTAIDGVPQLEVNRLQSLKGQR 355 Query: 443 EGHNDR 426 E +NDR Sbjct: 356 EINNDR 361 >ref|XP_010312631.1| PREDICTED: uncharacterized protein LOC101259600 [Solanum lycopersicum] Length = 310 Score = 187 bits (476), Expect = 2e-44 Identities = 117/242 (48%), Positives = 146/242 (60%), Gaps = 8/242 (3%) Frame = -3 Query: 1148 SQQQANGNGGA----NFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXGSRILV 981 S+ NG A NFD FLS+LEF+ V C +R+L Sbjct: 66 SEGTVNGQVSAEYEFNFDGFLSILEFLCLLSSAVVAIGFAVNCWFLGSHKWLG--NRVLA 123 Query: 980 WQCVVLSCGLAAGAAIRRRQWRRICGVGFSR-GPASSGANLLERVEKLEEDLRSAATIVQ 804 QCVVL G+ G+ IRRRQWRRIC FSR G G N+LER+EK+EEDLRS+ATI++ Sbjct: 124 AQCVVLVGGVIIGSVIRRRQWRRICMNNFSRPGSDLKGVNMLERIEKVEEDLRSSATIIR 183 Query: 803 ALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXX 624 LSR+LEKLGIRFR+TRK LK+PI E A+L QKNSEAT+ALA QG+ LEKELGE+QKV Sbjct: 184 VLSRQLEKLGIRFRVTRKTLKDPITEAAMLAQKNSEATRALALQGERLEKELGEVQKVLL 243 Query: 623 XXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQD--RKASGASKPVAES-SNLDINQIETMA 453 LA+GK GKL++ K SQD +K + S A+ L +NQI+ + Sbjct: 244 AMQDQQHKQLELILAIGKTGKLFENKRGPSQDPNQKTNDMSNTAADGFPQLGVNQIQALK 303 Query: 452 WQ 447 Q Sbjct: 304 RQ 305 >ref|XP_007047893.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508700154|gb|EOX92050.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 313 Score = 187 bits (474), Expect = 3e-44 Identities = 117/242 (48%), Positives = 141/242 (58%) Frame = -3 Query: 1151 ASQQQANGNGGANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXGSRILVWQC 972 ASQ+Q N D+FLS+ EF+ V R++VW Sbjct: 76 ASQEQNPIFNDFNLDSFLSIAEFLCILSSAVVSVVGAVS--GWKGVILGGIWRRVMVWGI 133 Query: 971 VVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAATIVQALSR 792 V L G+A GA IRRRQWRRIC G NL+ R+EKLEEDLRS ATI +ALSR Sbjct: 134 VGLVSGVAIGAWIRRRQWRRICAETVKGGGGGKNLNLIGRIEKLEEDLRSYATITRALSR 193 Query: 791 RLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXXX 612 +LEKLGIRFR+TRKALKEPIAETA L QKNSEAT+ALA Q DILEKELGEIQKV Sbjct: 194 QLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQEDILEKELGEIQKV---LLA 250 Query: 611 XXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETMAWQKEGHN 432 LA+GK+GKL++ K SQ++ A E + ++INQ + + K N Sbjct: 251 MQGKQLELILAIGKSGKLFEDKREPSQEKNTVEACNLTEEVNQMEINQTQPLGTSKGSGN 310 Query: 431 DR 426 DR Sbjct: 311 DR 312 >ref|XP_009802512.1| PREDICTED: uncharacterized protein LOC104248030 [Nicotiana sylvestris] Length = 363 Score = 184 bits (468), Expect = 1e-43 Identities = 111/235 (47%), Positives = 144/235 (61%), Gaps = 5/235 (2%) Frame = -3 Query: 1115 NFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXGSRILVWQCVVLSCGLAAGAA 936 N DAFLS+LEF+ V +R+L QCVVL G+ G+ Sbjct: 130 NIDAFLSILEFLCLFSSAVVAIGYAVNSWFWGSQKWLG--NRVLGAQCVVLVGGVIIGSV 187 Query: 935 IRRRQWRRICGVGFSR--GPASSGANLLERVEKLEEDLRSAATIVQALSRRLEKLGIRFR 762 IRRRQW RIC FS G S G NL+ER+EKLEEDLRS+AT+++ LSR+LEKLGIRFR Sbjct: 188 IRRRQWSRICTFEFSSRSGSGSRGVNLVERIEKLEEDLRSSATLIRVLSRQLEKLGIRFR 247 Query: 761 LTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXXXXXXXXXXXXL 582 +TRK LK+P+ E A L QKNSEAT+ALA QG+ LEKELGEIQKV L Sbjct: 248 VTRKTLKDPVTEAAALAQKNSEATRALALQGERLEKELGEIQKVLLAMQEQQHKQLELIL 307 Query: 581 AVGKAGKLWDAKPVKSQD--RKASGASKPVAES-SNLDINQIETMAWQKEGHNDR 426 A+GK GKL++ K SQ+ + S S + S L++N+++++ +E +NDR Sbjct: 308 AIGKTGKLFENKRGPSQEPAQNTSNVSNTAVDGVSQLEVNRLQSLKGHRETNNDR 362 >ref|XP_010646942.1| PREDICTED: uncharacterized protein LOC100244969 [Vitis vinifera] Length = 193 Score = 178 bits (451), Expect = 1e-41 Identities = 102/190 (53%), Positives = 129/190 (67%) Frame = -3 Query: 995 SRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAA 816 +RIL+WQ V L G+ G+ IRRRQW RI + P NL+ER+EK+EED+RS A Sbjct: 6 NRILLWQAVALVGGVVVGSWIRRRQWWRI--FNDTAKPGIESVNLVERMEKMEEDIRSMA 63 Query: 815 TIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQ 636 T+++ +SR+LEKLGIRFR+TRKALK+PIAETAVL QKNSEAT+ALA Q DILEKELGEIQ Sbjct: 64 TLIRVMSRQLEKLGIRFRVTRKALKQPIAETAVLAQKNSEATRALAIQEDILEKELGEIQ 123 Query: 635 KVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETM 456 KV LA+GKAGKLW+ + +S+++ A A AE + +QI Sbjct: 124 KVLLAMQEQQQKQLDLILAIGKAGKLWENRRGQSEEQDAIEACDS-AEVGQMKAHQIPAA 182 Query: 455 AWQKEGHNDR 426 A QK +NDR Sbjct: 183 ARQKGSNNDR 192 >ref|XP_012081607.1| PREDICTED: uncharacterized protein LOC105641632 [Jatropha curcas] gi|802673470|ref|XP_012081608.1| PREDICTED: uncharacterized protein LOC105641632 [Jatropha curcas] gi|643718519|gb|KDP29713.1| hypothetical protein JCGZ_18648 [Jatropha curcas] Length = 319 Score = 177 bits (450), Expect = 2e-41 Identities = 108/236 (45%), Positives = 139/236 (58%), Gaps = 3/236 (1%) Frame = -3 Query: 1121 GANFDAFLSVLEFIXXXXXXXXXXXXXVRCXXXXXXXXXXXG---SRILVWQCVVLSCGL 951 G N DAFLS+ E + V +R L W VV+ G+ Sbjct: 86 GFNLDAFLSIAEILCIISSAVVTVCYAVNSTFLSSKRTVFAVIGSNRALAWGLVVMMGGV 145 Query: 950 AAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAATIVQALSRRLEKLGI 771 GA IR+RQW R C V G S NL+ER+EKLEEDLRS+ATI++ LSR+LEKLGI Sbjct: 146 LIGALIRKRQWLRFCRVTVREGRES--VNLVERIEKLEEDLRSSATIIRVLSRQLEKLGI 203 Query: 770 RFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXXXXXXXXXX 591 RFR+TRKALKEPIAETA L +KNSEAT+ALA Q DILEKELGEIQKV Sbjct: 204 RFRVTRKALKEPIAETAALAKKNSEATRALAMQEDILEKELGEIQKVLLAMQEQQEKQLE 263 Query: 590 XXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETMAWQKEGHNDRP 423 LA+GK+GKLW+++ SQ + + S+ + + +++++ K +NDRP Sbjct: 264 LILAIGKSGKLWESRQEPSQQQGLNETSELTKGAKQSETHKVQSSNSIKGINNDRP 319 >ref|XP_010087627.1| hypothetical protein L484_022154 [Morus notabilis] gi|587838793|gb|EXB29482.1| hypothetical protein L484_022154 [Morus notabilis] Length = 374 Score = 175 bits (443), Expect = 1e-40 Identities = 117/278 (42%), Positives = 152/278 (54%), Gaps = 8/278 (2%) Frame = -3 Query: 1238 HFPNQRYFTSQPRLFAAARISDS-----FQEPYVASQQQANGNGGANFDAFLSVLEFIXX 1074 HF ++ +F R + S+S F+ S+ +G +FD+FLS++E + Sbjct: 40 HFASRSHFHISNRQLPSPCCSNSPRTHRFRLGVFESEGPVRRDGDLDFDSFLSIVETLCV 99 Query: 1073 XXXXXXXXXXXVRCXXXXXXXXXXXGSR---ILVWQCVVLSCGLAAGAAIRRRQWRRICG 903 V C + IL +V+ GL GA IRRRQWRR C Sbjct: 100 FSSAVVSLGFAVNCVVSSSKKTVMAAAMGNGILSCGMLVMVAGLGIGAWIRRRQWRRFCS 159 Query: 902 VGFSRGPASSGANLLERVEKLEEDLRSAATIVQALSRRLEKLGIRFRLTRKALKEPIAET 723 G RG NLLERVEKLEEDLR++AT+++ +SR+LEKLGIRFR+TRKALKEP+AET Sbjct: 160 -GSVRGGLE--VNLLERVEKLEEDLRNSATLIRVISRQLEKLGIRFRVTRKALKEPLAET 216 Query: 722 AVLVQKNSEATQALAAQGDILEKELGEIQKVXXXXXXXXXXXXXXXLAVGKAGKLWDAKP 543 A L QKNSEAT+ALA Q DILEKELGEIQKV LA+GK GKL++ +P Sbjct: 217 AALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQLELILAIGKTGKLFETRP 276 Query: 542 VKSQDRKASGASKPVAESSNLDINQIETMAWQKEGHND 429 +SQ+++ AES QKE H + Sbjct: 277 ERSQEQERIEIHDSTAESLK-----------QKESHQE 303 >gb|KDO60721.1| hypothetical protein CISIN_1g020717mg [Citrus sinensis] Length = 293 Score = 169 bits (428), Expect = 6e-39 Identities = 92/190 (48%), Positives = 122/190 (64%) Frame = -3 Query: 995 SRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAA 816 SR+L V L CG+ GA IRRRQWRR+CG +R NL+ R+EKLEED++S+A Sbjct: 104 SRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEK-ARAEGRESVNLVGRIEKLEEDMKSSA 162 Query: 815 TIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQ 636 TI++ LSR+LEKLG+RFR+TRKALK+PI + A L QKNSEAT+ALA QGD+LEKELGEIQ Sbjct: 163 TILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQ 222 Query: 635 KVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETM 456 KV LA+GK GKL++ + SQ++ S + + ++ + E Sbjct: 223 KVLLAMQEQQQKQLELILAIGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQETEAF 282 Query: 455 AWQKEGHNDR 426 + NDR Sbjct: 283 GSSRGNKNDR 292 >gb|KDO60720.1| hypothetical protein CISIN_1g020717mg [Citrus sinensis] Length = 322 Score = 169 bits (428), Expect = 6e-39 Identities = 92/190 (48%), Positives = 122/190 (64%) Frame = -3 Query: 995 SRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAA 816 SR+L V L CG+ GA IRRRQWRR+CG +R NL+ R+EKLEED++S+A Sbjct: 133 SRVLACGVVSLVCGVWIGAIIRRRQWRRVCGEK-ARAEGRESVNLVGRIEKLEEDMKSSA 191 Query: 815 TIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQ 636 TI++ LSR+LEKLG+RFR+TRKALK+PI + A L QKNSEAT+ALA QGD+LEKELGEIQ Sbjct: 192 TILRVLSRQLEKLGVRFRVTRKALKDPITQAAALAQKNSEATRALAMQGDVLEKELGEIQ 251 Query: 635 KVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETM 456 KV LA+GK GKL++ + SQ++ S + + ++ + E Sbjct: 252 KVLLAMQEQQQKQLELILAIGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQETEAF 311 Query: 455 AWQKEGHNDR 426 + NDR Sbjct: 312 GSSRGNKNDR 321 >ref|XP_010245835.1| PREDICTED: uncharacterized protein LOC104589273 isoform X1 [Nelumbo nucifera] Length = 334 Score = 169 bits (427), Expect = 8e-39 Identities = 98/191 (51%), Positives = 121/191 (63%), Gaps = 1/191 (0%) Frame = -3 Query: 995 SRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAA 816 +RI VWQ V+L +AAGA +RRRQWRRIC G S NL+ER+EK+EEDLRS+A Sbjct: 143 NRIFVWQFVLLVGAVAAGALVRRRQWRRICRDTIKTGAGGSSVNLIERIEKIEEDLRSSA 202 Query: 815 TIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQ 636 TI++ LSR+LEKLG RFR+TRKALKEPI +TA L QKNSEAT++LA Q D LEKEL EIQ Sbjct: 203 TIIRVLSRQLEKLGTRFRVTRKALKEPITQTAALAQKNSEATRSLAVQEDNLEKELVEIQ 262 Query: 635 KVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDI-NQIET 459 KV LA+GK GKL ++K +++ S + I NQ +T Sbjct: 263 KVLLAMQDQQQKQLKLILAIGKVGKLRESKHDTVTEQETIEPSNSFFKEDLQQIENQTQT 322 Query: 458 MAWQKEGHNDR 426 K NDR Sbjct: 323 SMEHKGTSNDR 333 >ref|XP_006426358.1| hypothetical protein CICLE_v10026087mg [Citrus clementina] gi|557528348|gb|ESR39598.1| hypothetical protein CICLE_v10026087mg [Citrus clementina] Length = 322 Score = 168 bits (426), Expect = 1e-38 Identities = 93/190 (48%), Positives = 121/190 (63%) Frame = -3 Query: 995 SRILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAA 816 SR+L V L CG+ GA IRRRQWRR+CG R NL+ R+EKLEED++S+A Sbjct: 133 SRVLACGVVSLVCGVWVGAVIRRRQWRRVCGETV-RVEGRERVNLVGRIEKLEEDMKSSA 191 Query: 815 TIVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQ 636 TI++ LSR+LEKLG+RFR+TRKALK+PI E A L QKNSEAT+ALA QGD+LEKELGEIQ Sbjct: 192 TILRVLSRQLEKLGVRFRVTRKALKDPITEAAALAQKNSEATRALAMQGDVLEKELGEIQ 251 Query: 635 KVXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETM 456 KV LA+GK GKL++ + SQ++ S + + ++ + E Sbjct: 252 KVLLAMQEQQQKQLELILAIGKTGKLFENRQEPSQEQDKLKTSDFIDGAKQMETQETEAF 311 Query: 455 AWQKEGHNDR 426 + NDR Sbjct: 312 GSSRGNKNDR 321 >ref|XP_012469812.1| PREDICTED: uncharacterized protein LOC105787802 [Gossypium raimondii] gi|763750833|gb|KJB18221.1| hypothetical protein B456_003G040700 [Gossypium raimondii] Length = 311 Score = 167 bits (423), Expect = 2e-38 Identities = 93/189 (49%), Positives = 120/189 (63%) Frame = -3 Query: 992 RILVWQCVVLSCGLAAGAAIRRRQWRRICGVGFSRGPASSGANLLERVEKLEEDLRSAAT 813 R++ W + L G A GA IRRRQWRRIC G NL++R+EKLEEDL+S+ Sbjct: 124 RVMAWNVLGLVSGFAIGAWIRRRQWRRICVETAKAG--GKRLNLVDRIEKLEEDLKSSVA 181 Query: 812 IVQALSRRLEKLGIRFRLTRKALKEPIAETAVLVQKNSEATQALAAQGDILEKELGEIQK 633 I++ LSR+LEKLGIRFR+TRK LK+PI ETA L QKNSEAT+ALAAQ +ILEKEL EIQK Sbjct: 182 IIRVLSRQLEKLGIRFRVTRKGLKQPIEETAALAQKNSEATRALAAQEEILEKELEEIQK 241 Query: 632 VXXXXXXXXXXXXXXXLAVGKAGKLWDAKPVKSQDRKASGASKPVAESSNLDINQIETMA 453 V LA+ K+GKL++ K SQ++ A K E+ +++NQ + Sbjct: 242 VLLAMQEQQQKQLELILAIAKSGKLFEEKREPSQEKDMVEACKSTEEAKQMEVNQTRPLG 301 Query: 452 WQKEGHNDR 426 + NDR Sbjct: 302 TTRGSGNDR 310