BLASTX nr result
ID: Dioscorea21_contig00024579
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00024579
         (1166 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CBI18109.3| unnamed protein product [Vitis vinifera]              286   4e-89
ref|XP_002264351.2| PREDICTED: uncharacterized protein LOC100241...   282   6e-88
ref|XP_004156673.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   275   8e-85
ref|XP_004137896.1| PREDICTED: uncharacterized protein LOC101215...   275   8e-85
ref|NP_195783.3| breast cancer 2 susceptibility protein [Arabido...   265   4e-82

>emb|CBI18109.3| unnamed protein product [Vitis vinifera]
          Length = 1134

 Score = 286 bits (731), Expect(2) = 4e-89
 Identities = 150/270 (55%), Positives = 187/270 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE ALE AGL  REVTPFMRVR+VGLT K    K+  KEGL+TIWNPTEKQ
Sbjct: 863  IRQSDLQKSIEMALEGAGLSTREVTPFMRVRVVGLTCKSYEGKIHHKEGLITIWNPTEKQ 922

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y V GLMPL+S S+ +YLQ RGS+T W  L+P   E FE F  PRK V LS
Sbjct: 923  QFELVEGQAYAVAGLMPLNSDSETLYLQARGSTTKWNPLSPLAIEHFEPFLNPRKSVLLS 982

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG++PL+SEFDIAA+VV+VGEV  +  +KKQWVF+TDGS     SE     +CLLA++F
Sbjct: 983  NLGEIPLSSEFDIAALVVYVGEVYTAAHQKKQWVFVTDGSVSELGSEEAS--NCLLAISF 1040

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C P + +DS + ++ NL G+ VGF NL+KRA+DQMN LWVA                  H
Sbjct: 1041 CSPSV-DDSFAPVNSNLEGSTVGFVNLIKRAKDQMNQLWVAEATENSDYFFSFDLPHCYH 1099

Query: 870  LYKTGECAQKWAKISCSTIQKLRERVLCII 959
            L      A++WAKIS  TI+KL+E+VL II
Sbjct: 1100 LKNAAASAERWAKISSLTIEKLKEKVLFII 1129

 Score = 70.1 bits (170), Expect(2) = 4e-89
 Identities = 33/47 (70%), Positives = 43/47 (91%)
 Frame = +2

Query: 20   SSDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
            ++D DSE+GAKIF+IL++AAEPE+LMA+MTSEQL+SFT YQAK E +
Sbjct: 817  NNDNDSEEGAKIFEILESAAEPEVLMAEMTSEQLASFTSYQAKLEAI 863

>ref|XP_002264351.2| PREDICTED: uncharacterized protein LOC100241398 [Vitis vinifera]
          Length = 1126

 Score = 282 bits (721), Expect(2) = 6e-88
 Identities = 147/266 (55%), Positives = 184/266 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE ALE AGL  REVTPFMRVR+VGLT K    K+  KEGL+TIWNPTEKQ
Sbjct: 843  IRQSDLQKSIEMALEGAGLSTREVTPFMRVRVVGLTCKSYEGKIHHKEGLITIWNPTEKQ 902

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y V GLMPL+S S+ +YLQ RGS+T W  L+P   E FE F  PRK V LS
Sbjct: 903  QFELVEGQAYAVAGLMPLNSDSETLYLQARGSTTKWNPLSPLAIEHFEPFLNPRKSVLLS 962

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG++PL+SEFDIAA+VV+VGEV  +  +KKQWVF+TDGS     SE     +CLLA++F
Sbjct: 963  NLGEIPLSSEFDIAALVVYVGEVYTAAHQKKQWVFVTDGSVSELGSEEAS--NCLLAISF 1020

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C P + +DS + ++ NL G+ VGF NL+KRA+DQMN LWVA                  H
Sbjct: 1021 CSPSV-DDSFAPVNSNLEGSTVGFVNLIKRAKDQMNQLWVAEATENSDYFFSFDLPHCYH 1079

Query: 870  LYKTGECAQKWAKISCSTIQKLRERV 947
            L      A++WAKIS  TI+KL+E+V
Sbjct: 1080 LKNAAASAERWAKISSLTIEKLKEKV 1105

 Score = 70.1 bits (170), Expect(2) = 6e-88
 Identities = 33/47 (70%), Positives = 43/47 (91%)
 Frame = +2

Query: 20   SSDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
            ++D DSE+GAKIF+IL++AAEPE+LMA+MTSEQL+SFT YQAK E +
Sbjct: 797  NNDNDSEEGAKIFEILESAAEPEVLMAEMTSEQLASFTSYQAKLEAI 843

>ref|XP_004156673.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101230245 [Cucumis sativus]
          Length = 1111

 Score = 275 bits (703), Expect(2) = 8e-85
 Identities = 134/273 (49%), Positives = 190/273 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE+AL DAGL  R+VTPFMRVR+VGLTSK S +K   KEGL+TIWNP+EKQ
Sbjct: 839  IRQSDMEKSIERALADAGLSGRDVTPFMRVRVVGLTSKSSQRKTHGKEGLITIWNPSEKQ 898

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y + GL+P++  +D++YLQ +GS+T W+ L+P+  + FE F+ PRK V LS
Sbjct: 899  QLELVEGQAYAIGGLVPINCDADILYLQTKGSTTKWQSLSPQSMKCFEPFYKPRKSVLLS 958

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG+VPL+SEFD+ A++VHVGEV  + ++KKQW+F+ DG    S+S  EG  + LLA++F
Sbjct: 959  NLGEVPLSSEFDVVAIIVHVGEVFATAQQKKQWIFVVDGF--VSESHSEGISNSLLAISF 1016

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C    ++DS   ++ NL+G+  GF NL+KR +DQ+NHLWVA                  H
Sbjct: 1017 CSQYADDDSFVPMNSNLTGSTAGFCNLIKRPKDQINHLWVAEATENTSYFLNFDSTDCSH 1076

Query: 870  LYKTGECAQKWAKISCSTIQKLRERVLCIIQGH 968
            +      A++WA+ S S I+ LRE++L +I  H
Sbjct: 1077 MKNAAVFAKRWAENSTSIIKNLREKILFMIDDH 1109

 Score = 66.6 bits (161), Expect(2) = 8e-85
 Identities = 31/46 (67%), Positives = 40/46 (86%)
 Frame = +2

Query: 23   SDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
            ++ DSE+GAK+FKIL+TAAEPELLMA+M+ EQL+SF  YQAK E +
Sbjct: 794  NESDSEEGAKLFKILETAAEPELLMAEMSPEQLTSFASYQAKIEAI 839

>ref|XP_004137896.1| PREDICTED: uncharacterized protein LOC101215906 [Cucumis sativus]
          Length = 1111

 Score = 275 bits (703), Expect(2) = 8e-85
 Identities = 134/273 (49%), Positives = 190/273 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE+AL DAGL  R+VTPFMRVR+VGLTSK S +K   KEGL+TIWNP+EKQ
Sbjct: 839  IRQSDMEKSIERALADAGLSGRDVTPFMRVRVVGLTSKSSQRKTHGKEGLITIWNPSEKQ 898

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y + GL+P++  +D++YLQ +GS+T W+ L+P+  + FE F+ PRK V LS
Sbjct: 899  QLELVEGQAYAIGGLVPINCDADILYLQTKGSTTKWQSLSPQSMKCFEPFYKPRKSVLLS 958

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG+VPL+SEFD+ A++VHVGEV  + ++KKQW+F+ DG    S+S  EG  + LLA++F
Sbjct: 959  NLGEVPLSSEFDVVAIIVHVGEVFATAQQKKQWIFVVDGF--VSESHSEGISNSLLAISF 1016

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C    ++DS   ++ NL+G+  GF NL+KR +DQ+NHLWVA                  H
Sbjct: 1017 CSQYADDDSFVPMNSNLTGSTAGFCNLIKRPKDQINHLWVAEATENTSYFLNFDSTDCSH 1076

Query: 870  LYKTGECAQKWAKISCSTIQKLRERVLCIIQGH 968
            +      A++WA+ S S I+ LRE++L +I  H
Sbjct: 1077 MKNAAVFAKRWAENSTSIIKNLREKILFMIDDH 1109

 Score = 66.6 bits (161), Expect(2) = 8e-85
 Identities = 31/46 (67%), Positives = 40/46 (86%)
 Frame = +2

Query: 23   SDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
            ++ DSE+GAK+FKIL+TAAEPELLMA+M+ EQL+SF  YQAK E +
Sbjct: 794  NESDSEEGAKLFKILETAAEPELLMAEMSPEQLTSFASYQAKIEAI 839

>ref|NP_195783.3| breast cancer 2 susceptibility protein [Arabidopsis thaliana]
 gi|31335362|emb|CAD32572.1| breast cancer susceptibility protein 2b [Arabidopsis thaliana]
 gi|332002986|gb|AED90369.1| breast cancer 2 susceptibility protein [Arabidopsis thaliana]
          Length = 1155

 Score = 265 bits (677), Expect(2) = 4e-82
 Identities = 138/272 (50%), Positives = 186/272 (68%), Gaps = 3/272 (1%)
 Frame = +3

Query: 153  RQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQK 332
            +Q ++ K + KALEDAGLG R VTPFMR+R+VGLTS  +  +  PKEG+VTIW+PTE+Q+
Sbjct: 881  KQMQMEKSVAKALEDAGLGERNVTPFMRIRLVGLTSLSNEGEHNPKEGIVTIWDPTERQR 940

Query: 333  QDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLSN 512
             +L EG+IY ++GL+P++S S+ +YL  RGSS+ W+ L+PK+ E F+ FF PRKP+SLSN
Sbjct: 941  TELTEGKIYIMKGLVPMNSDSETLYLHARGSSSRWQPLSPKDSENFQPFFNPRKPISLSN 1000

Query: 513  LGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYS-CLLAVNF 689
            LG++PL+SEFDIAA VV+VG+      +KKQWVF+TDGS     ++  G  S  LLA++F
Sbjct: 1001 LGEIPLSSEFDIAAYVVYVGDAYTDVLQKKQWVFVTDGS-----TQHSGEISNSLLAISF 1055

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
              P +++ S S ISHNL G+VVGF NL+KRA+D  N +WVA                  H
Sbjct: 1056 STPFMDDSSVSHISHNLVGSVVGFCNLIKRAKDATNEMWVAETTENSVYFINAEAAYSSH 1115

Query: 870  LYKTGECAQKWAKI--SCSTIQKLRERVLCII 959
            L       Q WAK+  S S I +LR+RVL II
Sbjct: 1116 HLKTRSAHIQTWAKLYSSKSVIHELRQRVLFII 1147

 Score = 67.4 bits (163), Expect(2) = 4e-82
 Identities = 31/44 (70%), Positives = 40/44 (90%)
 Frame = +2

Query: 23   SDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHE 154
            +D DSE+GAK+FK+L+TAAEPELLMA+M+ EQL+SFT Y+AK E
Sbjct: 835  NDTDSEEGAKVFKLLETAAEPELLMAEMSLEQLTSFTTYKAKFE 878