BLASTX nr result
ID: Perilla23_contig00024487
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Perilla23_contig00024487 (760 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_012847879.1| PREDICTED: uncharacterized protein LOC105967... 441 e-121 gb|EYU28595.1| hypothetical protein MIMGU_mgv1a009576mg [Erythra... 441 e-121 ref|XP_011087006.1| PREDICTED: uncharacterized protein LOC105168... 435 e-119 ref|XP_006357484.1| PREDICTED: uncharacterized protein LOC102595... 367 5e-99 ref|XP_010656966.1| PREDICTED: uncharacterized protein LOC100267... 367 6e-99 ref|XP_009802435.1| PREDICTED: uncharacterized protein LOC104247... 366 8e-99 ref|XP_009625804.1| PREDICTED: uncharacterized protein LOC104116... 365 1e-98 ref|XP_002273967.1| PREDICTED: uncharacterized protein LOC100267... 365 2e-98 ref|XP_010051902.1| PREDICTED: uncharacterized protein LOC104440... 364 4e-98 ref|XP_008229151.1| PREDICTED: uncharacterized protein LOC103328... 364 4e-98 gb|KCW89528.1| hypothetical protein EUGRSUZ_A01814 [Eucalyptus g... 364 4e-98 ref|XP_010542890.1| PREDICTED: uncharacterized protein LOC104815... 363 5e-98 emb|CAN81333.1| hypothetical protein VITISV_021624 [Vitis vinifera] 363 5e-98 ref|XP_007198832.1| hypothetical protein PRUPE_ppa007536mg [Prun... 363 7e-98 gb|KHN36634.1| hypothetical protein glysoja_001365 [Glycine soja] 363 9e-98 ref|XP_004243350.2| PREDICTED: uncharacterized protein LOC101268... 362 2e-97 ref|XP_007043587.1| DTW domain-containing protein isoform 1 [The... 362 2e-97 gb|KRH22395.1| hypothetical protein GLYMA_13G297500 [Glycine max] 361 3e-97 ref|XP_006593349.1| PREDICTED: uncharacterized protein LOC100306... 
361 3e-97 ref|XP_002517923.1| conserved hypothetical protein [Ricinus comm... 360 6e-97 >ref|XP_012847879.1| PREDICTED: uncharacterized protein LOC105967823 [Erythranthe guttatus] Length = 388 Score = 441 bits (1133), Expect = e-121 Identities = 210/253 (83%), Positives = 223/253 (88%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+YA LS SEEK+QFFSARQIACRLLGS GYLCQ CWLP EDC+CS+VT CPLWH +R Sbjct: 118 HRAKYACLSNSEEKLQFFSARQIACRLLGSRGYLCQKCWLPLEDCMCSRVTVCPLWHGVR 177 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 IWLYMHPKDFLRQNNTGKLLWQVFGV+AA LCL+GIAEHE MMWNELN+AGRNKVWCLYP Sbjct: 178 IWLYMHPKDFLRQNNTGKLLWQVFGVEAASLCLYGIAEHENMMWNELNQAGRNKVWCLYP 237 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESEWG 221 NKNA TESVKD VGCS NL A N+ DTMHFI IDGTWSNS AMFRRLQ RA+SEWG Sbjct: 238 NKNAETESVKDSVGCSENLRRLAAPGNKYDTMHFILIDGTWSNSGAMFRRLQDRAKSEWG 297 Query: 220 QEPHCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEAIDN 41 +E CISLNTG SLMHKLRPQPSWDRTCTAAAAIGLLYEL L+ E SSFGLDKQAEAI+N Sbjct: 298 KELGCISLNTGASLMHKLRPQPSWDRTCTAAAAIGLLYELQLITELSSFGLDKQAEAIEN 357 Query: 40 ALEVLFEALIARR 2 ALEVL EAL ARR Sbjct: 358 ALEVLLEALTARR 370 >gb|EYU28595.1| hypothetical protein MIMGU_mgv1a009576mg [Erythranthe guttata] Length = 337 Score = 441 bits (1133), Expect = e-121 Identities = 210/253 (83%), Positives = 223/253 (88%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+YA LS SEEK+QFFSARQIACRLLGS GYLCQ CWLP EDC+CS+VT CPLWH +R Sbjct: 67 HRAKYACLSNSEEKLQFFSARQIACRLLGSRGYLCQKCWLPLEDCMCSRVTVCPLWHGVR 126 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 IWLYMHPKDFLRQNNTGKLLWQVFGV+AA LCL+GIAEHE MMWNELN+AGRNKVWCLYP Sbjct: 127 IWLYMHPKDFLRQNNTGKLLWQVFGVEAASLCLYGIAEHENMMWNELNQAGRNKVWCLYP 186 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESEWG 221 NKNA TESVKD VGCS NL A N+ DTMHFI IDGTWSNS AMFRRLQ RA+SEWG Sbjct: 187 
NKNAETESVKDSVGCSENLRRLAAPGNKYDTMHFILIDGTWSNSGAMFRRLQDRAKSEWG 246 Query: 220 QEPHCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEAIDN 41 +E CISLNTG SLMHKLRPQPSWDRTCTAAAAIGLLYEL L+ E SSFGLDKQAEAI+N Sbjct: 247 KELGCISLNTGASLMHKLRPQPSWDRTCTAAAAIGLLYELQLITELSSFGLDKQAEAIEN 306 Query: 40 ALEVLFEALIARR 2 ALEVL EAL ARR Sbjct: 307 ALEVLLEALTARR 319 >ref|XP_011087006.1| PREDICTED: uncharacterized protein LOC105168567 [Sesamum indicum] Length = 516 Score = 435 bits (1118), Expect = e-119 Identities = 209/253 (82%), Positives = 223/253 (88%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+YASL+ SEEK+QFFSARQIACRLLGS GYLCQ CWLPREDC+CSKVT C LW RLR Sbjct: 120 HRAKYASLANSEEKLQFFSARQIACRLLGSRGYLCQKCWLPREDCMCSKVTMCSLWRRLR 179 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 IWLYMHPKDFLRQNNTGKLLWQV+GVQAA LCLFGI EHEEMMWNELN AGRNKVWCLYP Sbjct: 180 IWLYMHPKDFLRQNNTGKLLWQVYGVQAASLCLFGIVEHEEMMWNELNHAGRNKVWCLYP 239 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESEWG 221 NKNA TESVKD VGC+++ ANED MHFI IDGTWSNS+AMFRRLQ RA+ WG Sbjct: 240 NKNAATESVKDGVGCAVD------PANEDGIMHFILIDGTWSNSSAMFRRLQDRAKLVWG 293 Query: 220 QEPHCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEAIDN 41 QE CISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYEL L+PEFSSFGL+KQAEA++ Sbjct: 294 QELRCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELQLVPEFSSFGLEKQAEAVEK 353 Query: 40 ALEVLFEALIARR 2 ALEVL EAL ARR Sbjct: 354 ALEVLLEALTARR 366 >ref|XP_006357484.1| PREDICTED: uncharacterized protein LOC102595688 [Solanum tuberosum] Length = 361 Score = 367 bits (942), Expect = 5e-99 Identities = 178/256 (69%), Positives = 208/256 (81%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+Y SL SE+K+QFFSARQIACR+LGS GYLCQ CWL DC+CSK+ CPLW+R+R Sbjct: 88 HRAKYQSLGDSEQKLQFFSARQIACRVLGSRGYLCQKCWLSSGDCMCSKLITCPLWNRMR 147 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFGVQAA 
LCL+GI EHE+MMW+ L AG++KVWCLYP Sbjct: 148 FWLYMHPKDFLRQNNTGKLLWQVFGVQAATLCLYGITEHEDMMWDALKLAGKDKVWCLYP 207 Query: 400 NKNAVTESVKD-IVGCSM-NLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA T SVKD + G S N+E+ +A +++FI IDGTWSNS AMF RL+ R + Sbjct: 208 NKNAPTNSVKDSMAGLSFANVENHPEMAGGAHSLNFILIDGTWSNSGAMFSRLKDRYKLM 267 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG+E CI+LNTG SLMHKLRPQPSWDRTCTAAAAIGLL EL LP F S+GLDKQA+A Sbjct: 268 WGEEEIPCITLNTGASLMHKLRPQPSWDRTCTAAAAIGLLDELHDLPNFVSYGLDKQAKA 327 Query: 49 IDNALEVLFEALIARR 2 +++A+EVLFEAL RR Sbjct: 328 LEDAVEVLFEALTIRR 343 >ref|XP_010656966.1| PREDICTED: uncharacterized protein LOC100267683 isoform X2 [Vitis vinifera] gi|302143284|emb|CBI21845.3| unnamed protein product [Vitis vinifera] Length = 390 Score = 367 bits (941), Expect = 6e-99 Identities = 178/256 (69%), Positives = 203/256 (79%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L SE+K+QFFSARQIACRLLGS GYLCQ CWL EDC+CSKV C LWH +R Sbjct: 117 HRATYQALGDSEKKLQFFSARQIACRLLGSRGYLCQKCWLALEDCMCSKVIPCVLWHGIR 176 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFGV+AA LCLFGIAEHEE+MWN AG++ VWCLYP Sbjct: 177 FWLYMHPKDFLRQNNTGKLLWQVFGVKAATLCLFGIAEHEEIMWNTFALAGKSNVWCLYP 236 Query: 400 NKNAVTESVKDIVGCSM--NLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA T+SV+DI LE + N + ++FI IDGTW+NSAAMFRRL+ +A+ Sbjct: 237 NKNAPTKSVQDIFAQESLGGLECSSTTTNREKILNFILIDGTWNNSAAMFRRLKEQAKLA 296 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG+E CISL G S MHKLRPQPSWDRTCTAAAAIGLL EL L+PEF S+GLDKQAEA Sbjct: 297 WGEEDLPCISLAMGASAMHKLRPQPSWDRTCTAAAAIGLLSELQLIPEFGSYGLDKQAEA 356 Query: 49 IDNALEVLFEALIARR 2 +++AL VL EAL ARR Sbjct: 357 VEDALAVLLEALTARR 372 >ref|XP_009802435.1| PREDICTED: uncharacterized protein LOC104247967 [Nicotiana sylvestris] Length = 385 Score = 366 bits (940), Expect = 8e-99 Identities = 178/255 (69%), Positives = 202/255 
(79%), Gaps = 2/255 (0%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y SL SE K+QFFSARQIACR+LGS GYLCQ CWLP EDC+CSK+ +CPLWHR+R Sbjct: 113 HRATYQSLGDSETKLQFFSARQIACRVLGSRGYLCQKCWLPSEDCMCSKLISCPLWHRVR 172 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFG+QAA LCL+GIAEHEEMM L AG++KVWCLYP Sbjct: 173 FWLYMHPKDFLRQNNTGKLLWQVFGIQAATLCLYGIAEHEEMMGGALKLAGKDKVWCLYP 232 Query: 400 NKNAVTESVKDIVG--CSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA +SVKD + ++E + +++FI IDGTWSNS AMF RL+ R E Sbjct: 233 NKNAPAKSVKDSMAELSFSHVECPPETTDGGHSLNFILIDGTWSNSGAMFSRLKDRYELM 292 Query: 226 WGQEPHCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEAI 47 WG E CI LNTG SLMHKLRPQPSWDRTCTAAAAIGLL EL LP F S+GLDKQA+AI Sbjct: 293 WGDELPCIMLNTGASLMHKLRPQPSWDRTCTAAAAIGLLDELHDLPNFVSYGLDKQAKAI 352 Query: 46 DNALEVLFEALIARR 2 ++A+EVL EAL ARR Sbjct: 353 EDAVEVLLEALTARR 367 >ref|XP_009625804.1| PREDICTED: uncharacterized protein LOC104116618 isoform X1 [Nicotiana tomentosiformis] Length = 385 Score = 365 bits (938), Expect = 1e-98 Identities = 177/255 (69%), Positives = 203/255 (79%), Gaps = 2/255 (0%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y SL SE K+QFFSARQIACR+LGS GYLCQ CWLP EDC+CSK+T+CPLW R+R Sbjct: 113 HRATYQSLGDSETKLQFFSARQIACRVLGSRGYLCQKCWLPSEDCMCSKLTSCPLWRRVR 172 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFG+QAA LCL+GIAEHEEMMW+ L AG++KVWCLYP Sbjct: 173 FWLYMHPKDFLRQNNTGKLLWQVFGIQAATLCLYGIAEHEEMMWDALKLAGKDKVWCLYP 232 Query: 400 NKNAVTESVKDIVG--CSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA +SVKD + ++E + +++FI IDGTWSNS AMF RL+ R E Sbjct: 233 NKNAPAKSVKDSMAELSFAHVECPPETTDGGHSLNFILIDGTWSNSGAMFSRLKDRYELM 292 Query: 226 WGQEPHCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEAI 47 WG E CI+LNTG SLMH LRPQPSWDRTCTAAAAIGLL EL L F S+GLDKQA+AI Sbjct: 293 
WGDELPCITLNTGASLMHILRPQPSWDRTCTAAAAIGLLDELHDLTNFVSYGLDKQAKAI 352 Query: 46 DNALEVLFEALIARR 2 ++A+EVL EAL ARR Sbjct: 353 EDAVEVLLEALTARR 367 >ref|XP_002273967.1| PREDICTED: uncharacterized protein LOC100267683 isoform X1 [Vitis vinifera] Length = 401 Score = 365 bits (936), Expect = 2e-98 Identities = 180/267 (67%), Positives = 206/267 (77%), Gaps = 14/267 (5%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L SE+K+QFFSARQIACRLLGS GYLCQ CWL EDC+CSKV C LWH +R Sbjct: 117 HRATYQALGDSEKKLQFFSARQIACRLLGSRGYLCQKCWLALEDCMCSKVIPCVLWHGIR 176 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFGV+AA LCLFGIAEHEE+MWN AG++ VWCLYP Sbjct: 177 FWLYMHPKDFLRQNNTGKLLWQVFGVKAATLCLFGIAEHEEIMWNTFALAGKSNVWCLYP 236 Query: 400 NKNAVTESVKDI--------VGCSMNL-----ESQAGLANEDDTMHFIFIDGTWSNSAAM 260 NKNA T+SV+DI + CS +Q G N + ++FI IDGTW+NSAAM Sbjct: 237 NKNAPTKSVQDIFAQESLGGLECSSTTGEQHGGAQCGWTNREKILNFILIDGTWNNSAAM 296 Query: 259 FRRLQARAESEWGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEF 83 FRRL+ +A+ WG+E CISL G S MHKLRPQPSWDRTCTAAAAIGLL EL L+PEF Sbjct: 297 FRRLKEQAKLAWGEEDLPCISLAMGASAMHKLRPQPSWDRTCTAAAAIGLLSELQLIPEF 356 Query: 82 SSFGLDKQAEAIDNALEVLFEALIARR 2 S+GLDKQAEA+++AL VL EAL ARR Sbjct: 357 GSYGLDKQAEAVEDALAVLLEALTARR 383 >ref|XP_010051902.1| PREDICTED: uncharacterized protein LOC104440669 [Eucalyptus grandis] Length = 422 Score = 364 bits (934), Expect = 4e-98 Identities = 171/256 (66%), Positives = 207/256 (80%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+Y +L SE+K+QFF+ARQIACRLLGS YLCQ CWLP EDC+CSKV C LWH +R Sbjct: 149 HRAKYQALGDSEKKLQFFAARQIACRLLGSRNYLCQKCWLPMEDCMCSKVLPCSLWHGMR 208 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 W+YMHPKDFLRQNNTGKLLWQ+FGVQAA LCLFG+AE EE+MWN AG+++VWCLYP Sbjct: 209 FWVYMHPKDFLRQNNTGKLLWQLFGVQAATLCLFGVAEDEEIMWNTFKLAGKSRVWCLYP 268 Query: 400 
NKNAVTESVKDIVGCSMNLESQAGLA--NEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NK+A+TESV+D + +A + NED ++F+ IDGTWSNSAAMFRRL+ +A+ Sbjct: 269 NKSALTESVQDTFSRAQFQSERASITEKNEDRILNFVLIDGTWSNSAAMFRRLKEQAKIV 328 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG++ CISL TG S MHKLRPQPSWDRTCTAAAAI L+ EL +P+FSS+GLDKQAEA Sbjct: 329 WGEDDLPCISLATGASAMHKLRPQPSWDRTCTAAAAIALMSELQAVPDFSSYGLDKQAEA 388 Query: 49 IDNALEVLFEALIARR 2 +++AL+VL EAL ARR Sbjct: 389 VEDALQVLLEALTARR 404 >ref|XP_008229151.1| PREDICTED: uncharacterized protein LOC103328534 [Prunus mume] Length = 386 Score = 364 bits (934), Expect = 4e-98 Identities = 175/256 (68%), Positives = 201/256 (78%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HR Y +L SE+K+QFF+ARQIACRLLGS GYLCQ CWLP EDC+CS VT LWHR+R Sbjct: 113 HRKTYQALGDSEQKLQFFAARQIACRLLGSRGYLCQKCWLPLEDCMCSNVTQSTLWHRMR 172 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFG +AA LCLFGI+EHEE+MWN L AG+N VWCLYP Sbjct: 173 FWLYMHPKDFLRQNNTGKLLWQVFGTEAATLCLFGISEHEEIMWNALKLAGKNNVWCLYP 232 Query: 400 NKNAVTESVKDIVG--CSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA +SV+D+ G S LE NED T++FI IDGTW+NS ++F RL+ +A S Sbjct: 233 NKNAALKSVEDVFGQEPSPYLECTDTKTNEDGTLNFILIDGTWNNSVSIFSRLKDQATSV 292 Query: 226 WGQEPH-CISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG+ CISL TGVS MHKLRPQPSWDRTCTA AAIGLL EL LLPEFSSFG +KQAEA Sbjct: 293 WGEADFPCISLATGVSAMHKLRPQPSWDRTCTAGAAIGLLSELQLLPEFSSFGFEKQAEA 352 Query: 49 IDNALEVLFEALIARR 2 +++ L +L EAL RR Sbjct: 353 LEDTLVILLEALTTRR 368 >gb|KCW89528.1| hypothetical protein EUGRSUZ_A01814 [Eucalyptus grandis] Length = 404 Score = 364 bits (934), Expect = 4e-98 Identities = 171/256 (66%), Positives = 207/256 (80%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+Y +L SE+K+QFF+ARQIACRLLGS YLCQ CWLP EDC+CSKV C LWH +R Sbjct: 131 
HRAKYQALGDSEKKLQFFAARQIACRLLGSRNYLCQKCWLPMEDCMCSKVLPCSLWHGMR 190 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 W+YMHPKDFLRQNNTGKLLWQ+FGVQAA LCLFG+AE EE+MWN AG+++VWCLYP Sbjct: 191 FWVYMHPKDFLRQNNTGKLLWQLFGVQAATLCLFGVAEDEEIMWNTFKLAGKSRVWCLYP 250 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLA--NEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NK+A+TESV+D + +A + NED ++F+ IDGTWSNSAAMFRRL+ +A+ Sbjct: 251 NKSALTESVQDTFSRAQFQSERASITEKNEDRILNFVLIDGTWSNSAAMFRRLKEQAKIV 310 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG++ CISL TG S MHKLRPQPSWDRTCTAAAAI L+ EL +P+FSS+GLDKQAEA Sbjct: 311 WGEDDLPCISLATGASAMHKLRPQPSWDRTCTAAAAIALMSELQAVPDFSSYGLDKQAEA 370 Query: 49 IDNALEVLFEALIARR 2 +++AL+VL EAL ARR Sbjct: 371 VEDALQVLLEALTARR 386 >ref|XP_010542890.1| PREDICTED: uncharacterized protein LOC104815960 isoform X1 [Tarenaya hassleriana] Length = 387 Score = 363 bits (933), Expect = 5e-98 Identities = 170/256 (66%), Positives = 207/256 (80%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L E+K +FF+ARQIACRLLGS GYLCQ CWL EDC+CSKV C LW R+R Sbjct: 115 HRATYEALGDPEKKFRFFAARQIACRLLGSRGYLCQKCWLAMEDCMCSKVKPCDLWKRIR 174 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQ+FGVQ+A LCLFGIAE EE+MWNE RAG+N+VWCLYP Sbjct: 175 FWLYMHPKDFLRQNNTGKLLWQIFGVQSATLCLFGIAEDEEIMWNEFKRAGKNRVWCLYP 234 Query: 400 NKNAVTESVKDIVG--CSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 N+N+V+ SV+D+ G S +LE+++ + N D T++FI +DGTW+NSAAMF+RL+ A+ Sbjct: 235 NQNSVSMSVEDVFGNATSESLENESSMTNGDKTLNFILLDGTWNNSAAMFKRLKDHAKLI 294 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 W ++ CISL G S MHKLRPQPSWDRTCTAAAAIGLL ELTLLP FS++GLDKQ+E Sbjct: 295 WDEDDLPCISLAAGASAMHKLRPQPSWDRTCTAAAAIGLLSELTLLPHFSAYGLDKQSET 354 Query: 49 IDNALEVLFEALIARR 2 ++ AL +L E+L ARR Sbjct: 355 VEEALVILLESLTARR 370 >emb|CAN81333.1| hypothetical protein VITISV_021624 [Vitis vinifera] Length = 401 
Score = 363 bits (933), Expect = 5e-98 Identities = 180/267 (67%), Positives = 205/267 (76%), Gaps = 14/267 (5%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L SE+K+QFFSARQIACRLLGS GYLCQ CWL EDC+CSKV C LWH +R Sbjct: 117 HRATYQALGDSEKKLQFFSARQIACRLLGSRGYLCQKCWLALEDCMCSKVIPCXLWHGIR 176 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFGV+AA LCLFGIAEHEE+MWN AG++ VWCLYP Sbjct: 177 FWLYMHPKDFLRQNNTGKLLWQVFGVKAATLCLFGIAEHEEIMWNTFALAGKSNVWCLYP 236 Query: 400 NKNAVTESVKDIVGCSM--NLE-----------SQAGLANEDDTMHFIFIDGTWSNSAAM 260 NKNA T+SV+DI LE +Q G N + ++FI IDGTW+NSAAM Sbjct: 237 NKNAPTKSVQDIFAQESLGGLECPSTTGEQHGGAQCGRTNREKILNFILIDGTWNNSAAM 296 Query: 259 FRRLQARAESEWGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEF 83 FRRL+ +A+ WG+E CISL G S MHKLRPQPSWDRTCTAAAAIGLL EL L+PEF Sbjct: 297 FRRLKEQAKLAWGEEDLPCISLAMGASAMHKLRPQPSWDRTCTAAAAIGLLSELQLIPEF 356 Query: 82 SSFGLDKQAEAIDNALEVLFEALIARR 2 S+GLDKQAEA+++AL VL EAL ARR Sbjct: 357 GSYGLDKQAEAVEDALAVLLEALTARR 383 >ref|XP_007198832.1| hypothetical protein PRUPE_ppa007536mg [Prunus persica] gi|462394127|gb|EMJ00031.1| hypothetical protein PRUPE_ppa007536mg [Prunus persica] Length = 364 Score = 363 bits (932), Expect = 7e-98 Identities = 175/256 (68%), Positives = 200/256 (78%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HR Y +L SE+K+QFF+ARQIACRLLGS GYLCQ CWLP EDC+CS VT LWHR+R Sbjct: 91 HRKTYQALGDSEQKLQFFAARQIACRLLGSRGYLCQKCWLPLEDCMCSNVTQSTLWHRMR 150 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFG +AA LCLFGI+EHEE+MWN L AG+ VWCLYP Sbjct: 151 FWLYMHPKDFLRQNNTGKLLWQVFGTEAATLCLFGISEHEEIMWNALKLAGKKNVWCLYP 210 Query: 400 NKNAVTESVKDIVG--CSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA +SV+D+ G S LE NED T++FI IDGTW+NS ++F RL+ +A S Sbjct: 211 NKNAALKSVEDVFGQEPSPYLECTDTKTNEDGTLNFILIDGTWNNSVSIFSRLKDQATSV 270 Query: 226 
WGQEPH-CISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG+ CISL TGVS MHKLRPQPSWDRTCTA AAIGLL EL LLPEFSSFG DKQAEA Sbjct: 271 WGEADFPCISLATGVSAMHKLRPQPSWDRTCTAGAAIGLLSELQLLPEFSSFGFDKQAEA 330 Query: 49 IDNALEVLFEALIARR 2 +++ L +L EAL RR Sbjct: 331 LEDTLVILLEALTTRR 346 >gb|KHN36634.1| hypothetical protein glysoja_001365 [Glycine soja] Length = 338 Score = 363 bits (931), Expect = 9e-98 Identities = 175/257 (68%), Positives = 203/257 (78%), Gaps = 4/257 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L SE+K+QF+SARQIACR+LGS GYLCQ CWLP EDC+CSKVT+C L+ +R Sbjct: 65 HRATYQALGDSEKKLQFYSARQIACRILGSRGYLCQKCWLPMEDCMCSKVTSCSLYPGIR 124 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGK+LWQVFGV AA LCLFGI EHEE+MWN L AG++ VWCLYP Sbjct: 125 FWLYMHPKDFLRQNNTGKILWQVFGVDAATLCLFGIPEHEEIMWNSLKMAGKSNVWCLYP 184 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLANE---DDTMHFIFIDGTWSNSAAMFRRLQARAES 230 NKNAV +SV+++ G + + + D T HFI IDGTWSNSAAMFRRLQ +A+S Sbjct: 185 NKNAVLKSVQNVFGEEPVARDEVAPSKQLKVDTTQHFILIDGTWSNSAAMFRRLQDKAKS 244 Query: 229 EWGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAE 53 WG E CISLN G S MHKLRPQPSWDRTCTAAAA GLL EL LLP+FSSF L+KQAE Sbjct: 245 VWGDEDLACISLNPGASAMHKLRPQPSWDRTCTAAAAAGLLSELQLLPQFSSFELEKQAE 304 Query: 52 AIDNALEVLFEALIARR 2 A+++AL VL +AL RR Sbjct: 305 AVEHALTVLLDALTNRR 321 >ref|XP_004243350.2| PREDICTED: uncharacterized protein LOC101268201 [Solanum lycopersicum] Length = 392 Score = 362 bits (929), Expect = 2e-97 Identities = 176/256 (68%), Positives = 206/256 (80%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+Y SL SE+K+QFFSARQIACR+LGS GYLCQ CWL DC+CSK+ CPLW+R+R Sbjct: 119 HRAKYQSLGDSEQKLQFFSARQIACRVLGSRGYLCQKCWLCSADCMCSKLITCPLWNRMR 178 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFGV AA LCL+GI EHE+MMW+ L AG++KVWCLYP Sbjct: 179 
FWLYMHPKDFLRQNNTGKLLWQVFGVHAASLCLYGITEHEDMMWDALKLAGKDKVWCLYP 238 Query: 400 NKNAVTESVKD-IVGCSM-NLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 NKNA T SVKD + G S N+E+ +A+ +++FI IDGTWSNS AMF RL+ R + Sbjct: 239 NKNAPTNSVKDSMAGLSFANVENHPEMADGAHSLNFILIDGTWSNSGAMFSRLKDRYKVM 298 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG+E CI+LNTG SLMHKLRPQPSWDRTCTAAAAIGLL EL LP F S GLD QA+A Sbjct: 299 WGEEEIPCITLNTGASLMHKLRPQPSWDRTCTAAAAIGLLDELHDLPNFVSHGLDNQAKA 358 Query: 49 IDNALEVLFEALIARR 2 +++A+EVLFEAL RR Sbjct: 359 LEDAVEVLFEALTTRR 374 >ref|XP_007043587.1| DTW domain-containing protein isoform 1 [Theobroma cacao] gi|508707522|gb|EOX99418.1| DTW domain-containing protein isoform 1 [Theobroma cacao] Length = 401 Score = 362 bits (929), Expect = 2e-97 Identities = 174/256 (67%), Positives = 202/256 (78%), Gaps = 3/256 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA+Y +L S++K QFFSARQIACRLLGS GYLCQ CWLP EDC+CSKV C LWH ++ Sbjct: 128 HRAKYQALGDSDKKFQFFSARQIACRLLGSRGYLCQKCWLPMEDCMCSKVKPCSLWHGIK 187 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGKLLWQVFGVQAA LCL+GI+E EE+MWN AG+ KVWCLYP Sbjct: 188 FWLYMHPKDFLRQNNTGKLLWQVFGVQAATLCLYGISEDEEIMWNAFKDAGKGKVWCLYP 247 Query: 400 NKNAVTESVKDIVGC--SMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAESE 227 N+N V ++V+D C S +LE L N ++F+ IDGTWSNSAAMFRRL+ +A+ Sbjct: 248 NQNIVPKTVQDAFSCQSSADLECTHSLTNRYRPLNFVLIDGTWSNSAAMFRRLKEQAKLL 307 Query: 226 WGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAEA 50 WG+E CISL G S MHKLRPQPSWDRTCTAAAAIG+L EL LLPE SS+GLDKQ EA Sbjct: 308 WGEEDLPCISLAAGASAMHKLRPQPSWDRTCTAAAAIGVLAELQLLPECSSYGLDKQVEA 367 Query: 49 IDNALEVLFEALIARR 2 +++AL VL EAL ARR Sbjct: 368 VEDALVVLLEALTARR 383 >gb|KRH22395.1| hypothetical protein GLYMA_13G297500 [Glycine max] Length = 386 Score = 361 bits (926), Expect = 3e-97 Identities = 174/257 (67%), Positives = 202/257 (78%), Gaps = 4/257 (1%) Frame = -1 Query: 760 
HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L SE+K+QF+SARQIACR+LGS GYLCQ CWLP EDC+CSKVT+C L+ +R Sbjct: 113 HRATYQALGDSEKKLQFYSARQIACRILGSRGYLCQKCWLPMEDCMCSKVTSCSLYPGIR 172 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGK+LWQVFGV AA LCLFGI EHEE+MWN L AG++ VWCLYP Sbjct: 173 FWLYMHPKDFLRQNNTGKILWQVFGVDAATLCLFGIPEHEEIMWNSLKMAGKSNVWCLYP 232 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLANE---DDTMHFIFIDGTWSNSAAMFRRLQARAES 230 NKNAV +SV+++ G + + + D T HFI IDGTWSNSAAMFRRLQ +A+S Sbjct: 233 NKNAVLKSVQNVFGEESVARDEVAPSKQLKVDTTQHFILIDGTWSNSAAMFRRLQDKAKS 292 Query: 229 EWGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAE 53 WG E CISLN G S MHKLRPQPSWDRTCTAAAA GLL EL LLP+FSS L+KQAE Sbjct: 293 VWGDEDLSCISLNPGASAMHKLRPQPSWDRTCTAAAAAGLLSELQLLPQFSSVELEKQAE 352 Query: 52 AIDNALEVLFEALIARR 2 A+++AL VL +AL RR Sbjct: 353 AVEHALTVLLDALTNRR 369 >ref|XP_006593349.1| PREDICTED: uncharacterized protein LOC100306372 isoform X1 [Glycine max] gi|947073501|gb|KRH22392.1| hypothetical protein GLYMA_13G297500 [Glycine max] Length = 383 Score = 361 bits (926), Expect = 3e-97 Identities = 174/257 (67%), Positives = 202/257 (78%), Gaps = 4/257 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLPREDCICSKVTACPLWHRLR 581 HRA Y +L SE+K+QF+SARQIACR+LGS GYLCQ CWLP EDC+CSKVT+C L+ +R Sbjct: 110 HRATYQALGDSEKKLQFYSARQIACRILGSRGYLCQKCWLPMEDCMCSKVTSCSLYPGIR 169 Query: 580 IWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLYP 401 WLYMHPKDFLRQNNTGK+LWQVFGV AA LCLFGI EHEE+MWN L AG++ VWCLYP Sbjct: 170 FWLYMHPKDFLRQNNTGKILWQVFGVDAATLCLFGIPEHEEIMWNSLKMAGKSNVWCLYP 229 Query: 400 NKNAVTESVKDIVGCSMNLESQAGLANE---DDTMHFIFIDGTWSNSAAMFRRLQARAES 230 NKNAV +SV+++ G + + + D T HFI IDGTWSNSAAMFRRLQ +A+S Sbjct: 230 NKNAVLKSVQNVFGEESVARDEVAPSKQLKVDTTQHFILIDGTWSNSAAMFRRLQDKAKS 289 Query: 229 EWGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQAE 53 WG E CISLN G S MHKLRPQPSWDRTCTAAAA GLL EL LLP+FSS L+KQAE Sbjct: 290 
VWGDEDLSCISLNPGASAMHKLRPQPSWDRTCTAAAAAGLLSELQLLPQFSSVELEKQAE 349 Query: 52 AIDNALEVLFEALIARR 2 A+++AL VL +AL RR Sbjct: 350 AVEHALTVLLDALTNRR 366 >ref|XP_002517923.1| conserved hypothetical protein [Ricinus communis] gi|223542905|gb|EEF44441.1| conserved hypothetical protein [Ricinus communis] Length = 301 Score = 360 bits (924), Expect = 6e-97 Identities = 178/258 (68%), Positives = 209/258 (81%), Gaps = 5/258 (1%) Frame = -1 Query: 760 HRARYASLSGSEEKIQFFSARQIACRLLGSSGYLCQ*CWLP-REDCICSKVTACPLWHRL 584 HRA Y +L SEEK+QFFSARQIACR+LGS GYLCQ CWLP EDC+CSKV LW L Sbjct: 22 HRATYEALGDSEEKLQFFSARQIACRVLGSRGYLCQKCWLPLEEDCMCSKVKHSSLWPGL 81 Query: 583 RIWLYMHPKDFLRQNNTGKLLWQVFGVQAARLCLFGIAEHEEMMWNELNRAGRNKVWCLY 404 R WLYMHPKDFLRQNNTGKLLWQVFGVQ+A LCLFGI EHEE+MWN AG++KVWCLY Sbjct: 82 RFWLYMHPKDFLRQNNTGKLLWQVFGVQSAMLCLFGIPEHEEVMWNAFKHAGKDKVWCLY 141 Query: 403 PNKNAVTESVKDI---VGCSMNLESQAGLANEDDTMHFIFIDGTWSNSAAMFRRLQARAE 233 PNKNA+T+SV+D +G + +++S A ANE T++F+ IDGTWSNSAAMFRRL+ + + Sbjct: 142 PNKNAITKSVQDAFVQLGGNCHVDSTA-QANEHKTLNFVLIDGTWSNSAAMFRRLKEQTK 200 Query: 232 SEWGQEP-HCISLNTGVSLMHKLRPQPSWDRTCTAAAAIGLLYELTLLPEFSSFGLDKQA 56 S WG+E CISL TG S+MHKLRPQPS+DRTCTA AAIGLL EL LPE SS+GLDKQA Sbjct: 201 SVWGEEDLPCISLATGASMMHKLRPQPSYDRTCTAGAAIGLLSELQDLPELSSYGLDKQA 260 Query: 55 EAIDNALEVLFEALIARR 2 EA++++L+VL EAL RR Sbjct: 261 EALEDSLDVLLEALTTRR 278
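The per-hit statistics in the report above (bit score, E-value, identities, positives, gaps, frame) can be extracted programmatically. What follows is a minimal Python sketch, assuming the report is held as a single flattened string in the layout shown here. The names `HIT_STATS`, `parse_evalue`, `parse_hit_stats`, and `codons_spanned` are illustrative and not part of BLAST or any library. Legacy BLAST abbreviates 1e-121 as `e-121`, so the sketch restores the leading digit before converting.

```python
import re

# Statistics block of one hit, as it appears in the flattened report.
# The Gaps field is optional: ungapped alignments omit it.
HIT_STATS = re.compile(
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>[0-9.]*e-\d+|[0-9.]+) "
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (?P<gaps>\d+)/\d+ \(\d+%\))? "
    r"Frame = (?P<frame>[+-]?\d)"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float; 'e-121' means 1e-121."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_stats(report):
    """Return one dict of alignment statistics per hit found in the report."""
    hits = []
    for m in HIT_STATS.finditer(report):
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "evalue": parse_evalue(m["evalue"]),
            # Exact percentages; the report itself prints whole numbers only.
            "identity_pct": 100.0 * ident / length,
            "positive_pct": 100.0 * int(m["pos"]) / length,
            "gaps": int(m["gaps"] or 0),
            "frame": int(m["frame"]),
        })
    return hits


def codons_spanned(query_start, query_end):
    """Number of whole codons covered by a nucleotide coordinate range."""
    return (abs(query_start - query_end) + 1) // 3


# Two statistics blocks copied from the report above.
sample = (
    "Length = 388 Score = 441 bits (1133), Expect = e-121 "
    "Identities = 210/253 (83%), Positives = 223/253 (88%) Frame = -1 "
    "Length = 361 Score = 367 bits (942), Expect = 5e-99 "
    "Identities = 178/256 (69%), Positives = 208/256 (81%), "
    "Gaps = 3/256 (1%) Frame = -1"
)
hits = parse_hit_stats(sample)
for hit in hits:
    print(hit)
```

The query coordinates run from 760 down to 2 in frame -1, so `codons_spanned(760, 2)` gives 253 codons, which agrees with the 253 alignment columns reported for the first hit.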