BLASTX nr result
ID: Forsythia23_contig00029439
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00029439
         (838 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011093415.1| PREDICTED: uncharacterized protein LOC105173...   326   1e-86
ref|XP_011093414.1| PREDICTED: uncharacterized protein LOC105173...   326   1e-86
ref|XP_010661167.1| PREDICTED: uncharacterized protein LOC100252...   258   3e-66
ref|XP_010661166.1| PREDICTED: uncharacterized protein LOC100252...   258   3e-66
ref|XP_010661165.1| PREDICTED: uncharacterized protein LOC100252...   258   3e-66
ref|XP_012831417.1| PREDICTED: uncharacterized protein LOC105952...   256   1e-65
ref|XP_012831416.1| PREDICTED: uncharacterized protein LOC105952...   256   1e-65
gb|EYU42266.1| hypothetical protein MIMGU_mgv1a000058mg [Erythra...   256   1e-65
ref|XP_007029854.1| Uncharacterized protein isoform 4 [Theobroma...   252   2e-64
ref|XP_007029853.1| Uncharacterized protein isoform 3, partial [...   252   2e-64
ref|XP_007029852.1| Uncharacterized protein isoform 2 [Theobroma...   252   2e-64
ref|XP_007029851.1| Uncharacterized protein isoform 1 [Theobroma...   252   2e-64
ref|XP_012492387.1| PREDICTED: uncharacterized protein LOC105804...   246   2e-62
ref|XP_012492386.1| PREDICTED: uncharacterized protein LOC105804...   246   2e-62
ref|XP_012492385.1| PREDICTED: uncharacterized protein LOC105804...   246   2e-62
gb|KJB38961.1| hypothetical protein B456_007G249800 [Gossypium r...   246   2e-62
gb|KJB38960.1| hypothetical protein B456_007G249800 [Gossypium r...   246   2e-62
gb|KJB38959.1| hypothetical protein B456_007G249800 [Gossypium r...
246 2e-62 gb|KJB38958.1| hypothetical protein B456_007G249800 [Gossypium r... 246 2e-62 gb|KJB38957.1| hypothetical protein B456_007G249800 [Gossypium r... 246 2e-62 >ref|XP_011093415.1| PREDICTED: uncharacterized protein LOC105173395 isoform X2 [Sesamum indicum] gi|747091379|ref|XP_011093416.1| PREDICTED: uncharacterized protein LOC105173395 isoform X2 [Sesamum indicum] Length = 1974 Score = 326 bits (835), Expect = 1e-86 Identities = 167/288 (57%), Positives = 205/288 (71%), Gaps = 10/288 (3%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEFSKSSSGFLNDMCNTLSFTEVSEM 657 LMENC DLL+ T R +G IPLK SD S ++G+++KSSS FLND+C + S EVSE Sbjct: 1178 LMENCRDLLVATSRAWGVIPLKSPLHSDTSICSIGDYTKSSSWFLNDLCKS-SPIEVSER 1236 Query: 656 HRGNIGAATDVSQNI-QLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTSA 480 H+ + A +DV + QL LEE+ S SK L+ LISKLNPT+EQCWK+ +T A Sbjct: 1237 HQDDDDAVSDVRHKVCQLNLEEVKSLSKHLEALISKLNPTLEQCWKLHHKLSKKLAVTCA 1296 Query: 479 ECFLYSRCLCFVVEKVPASSGVEK---------LLESNIFYEFPDFWRTSLKGLSEMILV 327 ECF+YS+CL + EKV SSGVE+ LL S EFPD WRTSL GLS+MILV Sbjct: 1297 ECFMYSQCLSLIAEKVSDSSGVEEVFDSSGVENLLPSKFVDEFPDSWRTSLGGLSQMILV 1356 Query: 326 LQKNDCWEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXX 147 LQ+ CW+VA V+LDSLLG+P+C LDN I D+C+A+KNFSN AP I+WRLQ DK Sbjct: 1357 LQEKHCWDVACVLLDSLLGVPQCFCLDNVIADICSAVKNFSNSAPNISWRLQTDKMISLL 1416 Query: 146 LARGIHNLHESEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 LARGIHNL ++ PL+DLFCA++GHPEPEQRYIALKHLG +VG DV G Sbjct: 1417 LARGIHNLCQTVAPLVDLFCAILGHPEPEQRYIALKHLGGIVGQDVNG 1464 >ref|XP_011093414.1| PREDICTED: uncharacterized protein LOC105173395 isoform X1 [Sesamum indicum] Length = 2174 Score = 326 bits (835), Expect = 1e-86 Identities = 167/288 (57%), Positives = 205/288 (71%), Gaps = 10/288 (3%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEFSKSSSGFLNDMCNTLSFTEVSEM 657 LMENC DLL+ T R +G IPLK SD S ++G+++KSSS FLND+C + S EVSE Sbjct: 1378 LMENCRDLLVATSRAWGVIPLKSPLHSDTSICSIGDYTKSSSWFLNDLCKS-SPIEVSER 1436 Query: 656 
HRGNIGAATDVSQNI-QLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTSA 480 H+ + A +DV + QL LEE+ S SK L+ LISKLNPT+EQCWK+ +T A Sbjct: 1437 HQDDDDAVSDVRHKVCQLNLEEVKSLSKHLEALISKLNPTLEQCWKLHHKLSKKLAVTCA 1496 Query: 479 ECFLYSRCLCFVVEKVPASSGVEK---------LLESNIFYEFPDFWRTSLKGLSEMILV 327 ECF+YS+CL + EKV SSGVE+ LL S EFPD WRTSL GLS+MILV Sbjct: 1497 ECFMYSQCLSLIAEKVSDSSGVEEVFDSSGVENLLPSKFVDEFPDSWRTSLGGLSQMILV 1556 Query: 326 LQKNDCWEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXX 147 LQ+ CW+VA V+LDSLLG+P+C LDN I D+C+A+KNFSN AP I+WRLQ DK Sbjct: 1557 LQEKHCWDVACVLLDSLLGVPQCFCLDNVIADICSAVKNFSNSAPNISWRLQTDKMISLL 1616 Query: 146 LARGIHNLHESEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 LARGIHNL ++ PL+DLFCA++GHPEPEQRYIALKHLG +VG DV G Sbjct: 1617 LARGIHNLCQTVAPLVDLFCAILGHPEPEQRYIALKHLGGIVGQDVNG 1664 >ref|XP_010661167.1| PREDICTED: uncharacterized protein LOC100252352 isoform X3 [Vitis vinifera] Length = 1954 Score = 258 bits (660), Expect = 3e-66 Identities = 135/282 (47%), Positives = 189/282 (67%), Gaps = 4/282 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGE-FSKSSSGFLNDMCNTLSFTEVSE 660 +ME+C LL+ TLRV+G IPL+M+ SD S+GT + SKS S FLND+C+ +E Sbjct: 1160 VMESCKVLLVRTLRVFGIIPLQMTSFSDVSTGTPCDGCSKSYSWFLNDVCHDSCPMGDTE 1219 Query: 659 MHRGNIGAATDVSQNI-QLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + A + Q + L+ EEI +F++DL+ LI KL+PT+E CWK+ +TS Sbjct: 1220 NLESDKSDAVSLGQKVYHLSAEEITNFAQDLEGLICKLSPTVELCWKLHPQLAKKLTVTS 1279 Query: 482 AECFLYSRCLCFVVEKVPAS--SGVEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL V++V + E + N +F R L+GLS +I++LQ+N C Sbjct: 1280 AQCFMYSRCLSSFVKRVDNAREDDNENVFPPNSVDQFLIHSRIGLEGLSGIIMMLQENHC 1339 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 WEVAS++LD LLG+P+C LD+ IG +C+AI+NFS APKI+WRLQ DK +RG + Sbjct: 1340 WEVASMILDCLLGVPKCFSLDDVIGTICSAIRNFSCSAPKISWRLQTDKWLSILFSRGAY 1399 Query: 128 NLHESEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 LHESE+PL+ LFC+M+ HPEPEQR+I+L+HLGR VG D+ G Sbjct: 1400 RLHESELPLVGLFCSMLSHPEPEQRFISLQHLGRFVGQDLNG 1441 
>ref|XP_010661166.1| PREDICTED: uncharacterized protein LOC100252352 isoform X2 [Vitis vinifera] Length = 1991 Score = 258 bits (660), Expect = 3e-66 Identities = 135/282 (47%), Positives = 189/282 (67%), Gaps = 4/282 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGE-FSKSSSGFLNDMCNTLSFTEVSE 660 +ME+C LL+ TLRV+G IPL+M+ SD S+GT + SKS S FLND+C+ +E Sbjct: 1360 VMESCKVLLVRTLRVFGIIPLQMTSFSDVSTGTPCDGCSKSYSWFLNDVCHDSCPMGDTE 1419 Query: 659 MHRGNIGAATDVSQNI-QLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + A + Q + L+ EEI +F++DL+ LI KL+PT+E CWK+ +TS Sbjct: 1420 NLESDKSDAVSLGQKVYHLSAEEITNFAQDLEGLICKLSPTVELCWKLHPQLAKKLTVTS 1479 Query: 482 AECFLYSRCLCFVVEKVPAS--SGVEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL V++V + E + N +F R L+GLS +I++LQ+N C Sbjct: 1480 AQCFMYSRCLSSFVKRVDNAREDDNENVFPPNSVDQFLIHSRIGLEGLSGIIMMLQENHC 1539 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 WEVAS++LD LLG+P+C LD+ IG +C+AI+NFS APKI+WRLQ DK +RG + Sbjct: 1540 WEVASMILDCLLGVPKCFSLDDVIGTICSAIRNFSCSAPKISWRLQTDKWLSILFSRGAY 1599 Query: 128 NLHESEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 LHESE+PL+ LFC+M+ HPEPEQR+I+L+HLGR VG D+ G Sbjct: 1600 RLHESELPLVGLFCSMLSHPEPEQRFISLQHLGRFVGQDLNG 1641 >ref|XP_010661165.1| PREDICTED: uncharacterized protein LOC100252352 isoform X1 [Vitis vinifera] Length = 2154 Score = 258 bits (660), Expect = 3e-66 Identities = 135/282 (47%), Positives = 189/282 (67%), Gaps = 4/282 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGE-FSKSSSGFLNDMCNTLSFTEVSE 660 +ME+C LL+ TLRV+G IPL+M+ SD S+GT + SKS S FLND+C+ +E Sbjct: 1360 VMESCKVLLVRTLRVFGIIPLQMTSFSDVSTGTPCDGCSKSYSWFLNDVCHDSCPMGDTE 1419 Query: 659 MHRGNIGAATDVSQNI-QLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + A + Q + L+ EEI +F++DL+ LI KL+PT+E CWK+ +TS Sbjct: 1420 NLESDKSDAVSLGQKVYHLSAEEITNFAQDLEGLICKLSPTVELCWKLHPQLAKKLTVTS 1479 Query: 482 AECFLYSRCLCFVVEKVPAS--SGVEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL V++V + E + N +F R L+GLS +I++LQ+N C Sbjct: 1480 
AQCFMYSRCLSSFVKRVDNAREDDNENVFPPNSVDQFLIHSRIGLEGLSGIIMMLQENHC 1539 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 WEVAS++LD LLG+P+C LD+ IG +C+AI+NFS APKI+WRLQ DK +RG + Sbjct: 1540 WEVASMILDCLLGVPKCFSLDDVIGTICSAIRNFSCSAPKISWRLQTDKWLSILFSRGAY 1599 Query: 128 NLHESEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 LHESE+PL+ LFC+M+ HPEPEQR+I+L+HLGR VG D+ G Sbjct: 1600 RLHESELPLVGLFCSMLSHPEPEQRFISLQHLGRFVGQDLNG 1641 >ref|XP_012831417.1| PREDICTED: uncharacterized protein LOC105952416 isoform X2 [Erythranthe guttatus] Length = 1586 Score = 256 bits (655), Expect = 1e-65 Identities = 143/276 (51%), Positives = 170/276 (61%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEFSKSSSGFLNDMCNTLSFTEVSEM 657 LMENC DLLI T R+ G IPL ++ SD+ SKSSS FL D+CN S TEVSE Sbjct: 1098 LMENCRDLLIATSRLRGIIPLTIASLSDSDP------SKSSSCFLKDICNPSSPTEVSEK 1151 Query: 656 HRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTSAE 477 R QL EE+ SFSK+LD LI+KL PT+EQCWK+ L AE Sbjct: 1152 FR-------------QLNSEEVKSFSKELDALITKLYPTLEQCWKLHSTMSKKLALVCAE 1198 Query: 476 CFLYSRCLCFVVEKVPASSGVEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDCWEVA 297 CF+YSRCL ++ E DF T LKGL E IL+LQ CWEVA Sbjct: 1199 CFVYSRCLSLNID------------------ELTDFCGTGLKGLFETILILQDKHCWEVA 1240 Query: 296 SVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIHNLHE 117 SV+LDSL+ +PR RLD I +C+AIKNFSN AP I WRLQ DK RGI+N+ Sbjct: 1241 SVLLDSLIKVPRFFRLDYVISGICSAIKNFSNNAPNIVWRLQIDKMMSLLFERGINNICR 1300 Query: 116 SEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDV 9 +E L+DLFCA++G+PEPEQRYIA+KHLGRLVG DV Sbjct: 1301 NEASLVDLFCALLGNPEPEQRYIAVKHLGRLVGQDV 1336 >ref|XP_012831416.1| PREDICTED: uncharacterized protein LOC105952416 isoform X1 [Erythranthe guttatus] Length = 1781 Score = 256 bits (655), Expect = 1e-65 Identities = 143/276 (51%), Positives = 170/276 (61%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEFSKSSSGFLNDMCNTLSFTEVSEM 657 LMENC DLLI T R+ G IPL ++ SD+ SKSSS FL D+CN S TEVSE Sbjct: 1293 
LMENCRDLLIATSRLRGIIPLTIASLSDSDP------SKSSSCFLKDICNPSSPTEVSEK 1346 Query: 656 HRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTSAE 477 R QL EE+ SFSK+LD LI+KL PT+EQCWK+ L AE Sbjct: 1347 FR-------------QLNSEEVKSFSKELDALITKLYPTLEQCWKLHSTMSKKLALVCAE 1393 Query: 476 CFLYSRCLCFVVEKVPASSGVEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDCWEVA 297 CF+YSRCL ++ E DF T LKGL E IL+LQ CWEVA Sbjct: 1394 CFVYSRCLSLNID------------------ELTDFCGTGLKGLFETILILQDKHCWEVA 1435 Query: 296 SVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIHNLHE 117 SV+LDSL+ +PR RLD I +C+AIKNFSN AP I WRLQ DK RGI+N+ Sbjct: 1436 SVLLDSLIKVPRFFRLDYVISGICSAIKNFSNNAPNIVWRLQIDKMMSLLFERGINNICR 1495 Query: 116 SEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDV 9 +E L+DLFCA++G+PEPEQRYIA+KHLGRLVG DV Sbjct: 1496 NEASLVDLFCALLGNPEPEQRYIAVKHLGRLVGQDV 1531 >gb|EYU42266.1| hypothetical protein MIMGU_mgv1a000058mg [Erythranthe guttata] Length = 2003 Score = 256 bits (655), Expect = 1e-65 Identities = 143/276 (51%), Positives = 170/276 (61%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEFSKSSSGFLNDMCNTLSFTEVSEM 657 LMENC DLLI T R+ G IPL ++ SD+ SKSSS FL D+CN S TEVSE Sbjct: 1293 LMENCRDLLIATSRLRGIIPLTIASLSDSDP------SKSSSCFLKDICNPSSPTEVSEK 1346 Query: 656 HRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTSAE 477 R QL EE+ SFSK+LD LI+KL PT+EQCWK+ L AE Sbjct: 1347 FR-------------QLNSEEVKSFSKELDALITKLYPTLEQCWKLHSTMSKKLALVCAE 1393 Query: 476 CFLYSRCLCFVVEKVPASSGVEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDCWEVA 297 CF+YSRCL ++ E DF T LKGL E IL+LQ CWEVA Sbjct: 1394 CFVYSRCLSLNID------------------ELTDFCGTGLKGLFETILILQDKHCWEVA 1435 Query: 296 SVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIHNLHE 117 SV+LDSL+ +PR RLD I +C+AIKNFSN AP I WRLQ DK RGI+N+ Sbjct: 1436 SVLLDSLIKVPRFFRLDYVISGICSAIKNFSNNAPNIVWRLQIDKMMSLLFERGINNICR 1495 Query: 116 SEVPLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDV 9 +E L+DLFCA++G+PEPEQRYIA+KHLGRLVG DV Sbjct: 1496 NEASLVDLFCALLGNPEPEQRYIAVKHLGRLVGQDV 1531 >ref|XP_007029854.1| Uncharacterized protein isoform 4 [Theobroma cacao] 
gi|508718459|gb|EOY10356.1| Uncharacterized protein isoform 4 [Theobroma cacao] Length = 1785 Score = 252 bits (644), Expect = 2e-64 Identities = 141/283 (49%), Positives = 185/283 (65%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +ME+C L+ LRV +PL++ SD SG +GE S+S S FLND+ + + E+SE Sbjct: 990 VMESCKVFLLQHLRVSNFVPLQLPPFSD--SGKLGESGSESFSWFLNDILHGSTPNEISE 1047 Query: 659 MHRGNIGAATDVSQ-NIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 N A +++ N L+ EEI F+KDL+ +ISKL PTIEQCW + + S Sbjct: 1048 NLESNSFDAIVLNEKNYNLSEEEIEDFTKDLEGVISKLYPTIEQCWSLHHQLAKKLTIAS 1107 Query: 482 AECFLYSRCLCFVVEKVPASSGV--EKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL + + + G E L S P W+T L+GL+ IL+LQ+N C Sbjct: 1108 AQCFVYSRCLLSMAPAIHNAEGYKNENSLPSKSVDRLPAQWKTGLEGLAGTILMLQENAC 1167 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P LDN I +CTAIKNFS+KAPKI+WRLQ DK RGIH Sbjct: 1168 WQVASVMLDCLLGVPLGFPLDNVIDSICTAIKNFSSKAPKISWRLQTDKWLSILCIRGIH 1227 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 +LHESEV PL+++F M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1228 SLHESEVPPLVNMFLTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1270 >ref|XP_007029853.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] gi|508718458|gb|EOY10355.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 1882 Score = 252 bits (644), Expect = 2e-64 Identities = 141/283 (49%), Positives = 185/283 (65%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +ME+C L+ LRV +PL++ SD SG +GE S+S S FLND+ + + E+SE Sbjct: 1206 VMESCKVFLLQHLRVSNFVPLQLPPFSD--SGKLGESGSESFSWFLNDILHGSTPNEISE 1263 Query: 659 MHRGNIGAATDVSQ-NIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 N A +++ N L+ EEI F+KDL+ +ISKL PTIEQCW + + S Sbjct: 1264 NLESNSFDAIVLNEKNYNLSEEEIEDFTKDLEGVISKLYPTIEQCWSLHHQLAKKLTIAS 1323 Query: 482 AECFLYSRCLCFVVEKVPASSGV--EKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL + + + G E L S P 
W+T L+GL+ IL+LQ+N C Sbjct: 1324 AQCFVYSRCLLSMAPAIHNAEGYKNENSLPSKSVDRLPAQWKTGLEGLAGTILMLQENAC 1383 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P LDN I +CTAIKNFS+KAPKI+WRLQ DK RGIH Sbjct: 1384 WQVASVMLDCLLGVPLGFPLDNVIDSICTAIKNFSSKAPKISWRLQTDKWLSILCIRGIH 1443 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 +LHESEV PL+++F M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1444 SLHESEVPPLVNMFLTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1486 >ref|XP_007029852.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508718457|gb|EOY10354.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1949 Score = 252 bits (644), Expect = 2e-64 Identities = 141/283 (49%), Positives = 185/283 (65%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +ME+C L+ LRV +PL++ SD SG +GE S+S S FLND+ + + E+SE Sbjct: 1202 VMESCKVFLLQHLRVSNFVPLQLPPFSD--SGKLGESGSESFSWFLNDILHGSTPNEISE 1259 Query: 659 MHRGNIGAATDVSQ-NIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 N A +++ N L+ EEI F+KDL+ +ISKL PTIEQCW + + S Sbjct: 1260 NLESNSFDAIVLNEKNYNLSEEEIEDFTKDLEGVISKLYPTIEQCWSLHHQLAKKLTIAS 1319 Query: 482 AECFLYSRCLCFVVEKVPASSGV--EKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL + + + G E L S P W+T L+GL+ IL+LQ+N C Sbjct: 1320 AQCFVYSRCLLSMAPAIHNAEGYKNENSLPSKSVDRLPAQWKTGLEGLAGTILMLQENAC 1379 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P LDN I +CTAIKNFS+KAPKI+WRLQ DK RGIH Sbjct: 1380 WQVASVMLDCLLGVPLGFPLDNVIDSICTAIKNFSSKAPKISWRLQTDKWLSILCIRGIH 1439 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 +LHESEV PL+++F M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1440 SLHESEVPPLVNMFLTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1482 >ref|XP_007029851.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508718456|gb|EOY10353.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 2158 Score = 252 bits (644), Expect = 2e-64 Identities = 141/283 (49%), Positives = 185/283 (65%), Gaps = 5/283 
(1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +ME+C L+ LRV +PL++ SD SG +GE S+S S FLND+ + + E+SE Sbjct: 1363 VMESCKVFLLQHLRVSNFVPLQLPPFSD--SGKLGESGSESFSWFLNDILHGSTPNEISE 1420 Query: 659 MHRGNIGAATDVSQ-NIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 N A +++ N L+ EEI F+KDL+ +ISKL PTIEQCW + + S Sbjct: 1421 NLESNSFDAIVLNEKNYNLSEEEIEDFTKDLEGVISKLYPTIEQCWSLHHQLAKKLTIAS 1480 Query: 482 AECFLYSRCLCFVVEKVPASSGV--EKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A+CF+YSRCL + + + G E L S P W+T L+GL+ IL+LQ+N C Sbjct: 1481 AQCFVYSRCLLSMAPAIHNAEGYKNENSLPSKSVDRLPAQWKTGLEGLAGTILMLQENAC 1540 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P LDN I +CTAIKNFS+KAPKI+WRLQ DK RGIH Sbjct: 1541 WQVASVMLDCLLGVPLGFPLDNVIDSICTAIKNFSSKAPKISWRLQTDKWLSILCIRGIH 1600 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 +LHESEV PL+++F M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1601 SLHESEVPPLVNMFLTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1643 >ref|XP_012492387.1| PREDICTED: uncharacterized protein LOC105804353 isoform X3 [Gossypium raimondii] Length = 1984 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 1194 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 1251 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 1252 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 1311 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 1312 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1371 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1372 
WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1431 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1432 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1474 >ref|XP_012492386.1| PREDICTED: uncharacterized protein LOC105804353 isoform X2 [Gossypium raimondii] Length = 2152 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 1363 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 1420 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 1421 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 1480 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 1481 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1540 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1541 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1600 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1601 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1643 >ref|XP_012492385.1| PREDICTED: uncharacterized protein LOC105804353 isoform X1 [Gossypium raimondii] Length = 2153 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 1363 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 1420 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 
1421 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 1480 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 1481 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1540 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1541 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1600 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1601 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1643 >gb|KJB38961.1| hypothetical protein B456_007G249800 [Gossypium raimondii] Length = 1658 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 868 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 925 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 926 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 985 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 986 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1045 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1046 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1105 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1106 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1148 >gb|KJB38960.1| hypothetical protein B456_007G249800 [Gossypium raimondii] Length = 1666 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps 
= 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 876 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 933 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 934 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 993 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 994 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1053 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1054 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1113 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1114 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1156 >gb|KJB38959.1| hypothetical protein B456_007G249800 [Gossypium raimondii] Length = 2160 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 1371 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 1428 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 1429 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 1488 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 1489 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1548 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1549 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 
1608 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1609 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1651 >gb|KJB38958.1| hypothetical protein B456_007G249800 [Gossypium raimondii] Length = 1917 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 1371 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 1428 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 1429 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 1488 Query: 482 AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 1489 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1548 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1549 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1608 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1609 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1651 >gb|KJB38957.1| hypothetical protein B456_007G249800 [Gossypium raimondii] Length = 2161 Score = 246 bits (627), Expect = 2e-62 Identities = 139/283 (49%), Positives = 178/283 (62%), Gaps = 5/283 (1%) Frame = -3 Query: 836 LMENCNDLLIGTLRVYGAIPLKMSQCSDASSGTVGEF-SKSSSGFLNDMCNTLSFTEVSE 660 +M +C L+ LR Y IPL++ SD S T+GE S+S S FLND+ S E SE Sbjct: 1371 VMGSCKVFLLQNLRAYNFIPLQLPGSSD--SRTLGESGSESFSWFLNDILPCSSLNETSE 1428 Query: 659 -MHRGNIGAATDVSQNIQLTLEEINSFSKDLDTLISKLNPTIEQCWKIXXXXXXXXXLTS 483 + N AA ++ L+ EEI F+KDL+ LI KL PTIEQCW + +T Sbjct: 1429 KVESNNTDAAVLNEKDYHLSEEEIKEFTKDLEGLIPKLYPTIEQCWSLHLQLAKKLAITL 1488 Query: 482 
AECFLYSRCLCFVVEKVPASSG--VEKLLESNIFYEFPDFWRTSLKGLSEMILVLQKNDC 309 A CF+YSRCL V + + G EK L S + P W+T L+GL+ MIL+LQ+N C Sbjct: 1489 ARCFIYSRCLSSVAPGIHNAEGDISEKSLASTSIDQLPAQWKTGLEGLAGMILLLQENTC 1548 Query: 308 WEVASVMLDSLLGLPRCLRLDNAIGDMCTAIKNFSNKAPKIAWRLQADKXXXXXLARGIH 129 W+VASVMLD LLG+P L++ I +CTA+KNF KAPKI+WRLQ DK RG Sbjct: 1549 WQVASVMLDCLLGVPLSFPLNDVIDPICTALKNFCCKAPKISWRLQTDKWLSILSFRGFQ 1608 Query: 128 NLHESEV-PLIDLFCAMIGHPEPEQRYIALKHLGRLVGLDVEG 3 NLHESE+ PL++L M+GHPEPEQR+I L+HLGRLVG DV+G Sbjct: 1609 NLHESEIAPLVNLLVTMLGHPEPEQRFIVLQHLGRLVGQDVDG 1651