BLASTX nr result

ID: Dioscorea21_contig00024579 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00024579
         (1166 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI18109.3| unnamed protein product [Vitis vinifera]              286   4e-89
ref|XP_002264351.2| PREDICTED: uncharacterized protein LOC100241...   282   6e-88
ref|XP_004156673.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   275   8e-85
ref|XP_004137896.1| PREDICTED: uncharacterized protein LOC101215...   275   8e-85
ref|NP_195783.3| breast cancer 2 susceptibility protein [Arabido...   265   4e-82
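
The hit table above has a fixed shape: identifier, description (possibly truncated with "..."), bit score, E value. A minimal Python sketch for reading such lines follows. The names SUMMARY_LINE and parse_summary_line are hypothetical and belong to no BLAST tool or library. The regular expression is an assumption based only on the layout shown in this report.

```python
import re

# One line of the "Sequences producing significant alignments" table:
# hit identifier, free-text description, bit score, E value.
# The layout is assumed from the report shown here, not from a format spec.
SUMMARY_LINE = re.compile(
    r"^(?P<hit_id>\S+)\s+(?P<description>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_summary_line(line):
    """Return a dict for one summary line, or None if the line does not match."""
    match = SUMMARY_LINE.match(line)
    if match is None:
        return None
    evalue_text = match.group("evalue")
    # Assumption: an E value may be printed without a leading digit
    # (for example "e-100"); prefix "1" so float() can read it.
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return {
        "hit_id": match.group("hit_id"),
        "description": match.group("description"),
        "bits": float(match.group("bits")),
        "evalue": float(evalue_text),
    }


# Usage: two lines copied from the table above.
lines = [
    "emb|CBI18109.3| unnamed protein product [Vitis vinifera]              286   4e-89",
    "ref|NP_195783.3| breast cancer 2 susceptibility protein [Arabido...   265   4e-82",
]
hits = [parse_summary_line(line) for line in lines]
for hit in hits:
    print(hit["hit_id"], hit["bits"], hit["evalue"])
```

The description group is matched lazily, so digits inside a description (the "2" in "breast cancer 2") are not mistaken for the bit score. Only the last two whitespace-separated fields are read as score and E value.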

>emb|CBI18109.3| unnamed protein product [Vitis vinifera]
          Length = 1134

 Score =  286 bits (731), Expect(2) = 4e-89
 Identities = 150/270 (55%), Positives = 187/270 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE ALE AGL  REVTPFMRVR+VGLT K    K+  KEGL+TIWNPTEKQ
Sbjct: 863  IRQSDLQKSIEMALEGAGLSTREVTPFMRVRVVGLTCKSYEGKIHHKEGLITIWNPTEKQ 922

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y V GLMPL+S S+ +YLQ RGS+T W  L+P   E FE F  PRK V LS
Sbjct: 923  QFELVEGQAYAVAGLMPLNSDSETLYLQARGSTTKWNPLSPLAIEHFEPFLNPRKSVLLS 982

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG++PL+SEFDIAA+VV+VGEV  +  +KKQWVF+TDGS     SE     +CLLA++F
Sbjct: 983  NLGEIPLSSEFDIAALVVYVGEVYTAAHQKKQWVFVTDGSVSELGSEEAS--NCLLAISF 1040

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C P + +DS + ++ NL G+ VGF NL+KRA+DQMN LWVA                  H
Sbjct: 1041 CSPSV-DDSFAPVNSNLEGSTVGFVNLIKRAKDQMNQLWVAEATENSDYFFSFDLPHCYH 1099

Query: 870  LYKTGECAQKWAKISCSTIQKLRERVLCII 959
            L      A++WAKIS  TI+KL+E+VL II
Sbjct: 1100 LKNAAASAERWAKISSLTIEKLKEKVLFII 1129



 Score = 70.1 bits (170), Expect(2) = 4e-89
 Identities = 33/47 (70%), Positives = 43/47 (91%)
 Frame = +2

Query: 20  SSDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
           ++D DSE+GAKIF+IL++AAEPE+LMA+MTSEQL+SFT YQAK E +
Sbjct: 817 NNDNDSEEGAKIFEILESAAEPEVLMAEMTSEQLASFTSYQAKLEAI 863


>ref|XP_002264351.2| PREDICTED: uncharacterized protein LOC100241398 [Vitis vinifera]
          Length = 1126

 Score =  282 bits (721), Expect(2) = 6e-88
 Identities = 147/266 (55%), Positives = 184/266 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE ALE AGL  REVTPFMRVR+VGLT K    K+  KEGL+TIWNPTEKQ
Sbjct: 843  IRQSDLQKSIEMALEGAGLSTREVTPFMRVRVVGLTCKSYEGKIHHKEGLITIWNPTEKQ 902

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y V GLMPL+S S+ +YLQ RGS+T W  L+P   E FE F  PRK V LS
Sbjct: 903  QFELVEGQAYAVAGLMPLNSDSETLYLQARGSTTKWNPLSPLAIEHFEPFLNPRKSVLLS 962

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG++PL+SEFDIAA+VV+VGEV  +  +KKQWVF+TDGS     SE     +CLLA++F
Sbjct: 963  NLGEIPLSSEFDIAALVVYVGEVYTAAHQKKQWVFVTDGSVSELGSEEAS--NCLLAISF 1020

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C P + +DS + ++ NL G+ VGF NL+KRA+DQMN LWVA                  H
Sbjct: 1021 CSPSV-DDSFAPVNSNLEGSTVGFVNLIKRAKDQMNQLWVAEATENSDYFFSFDLPHCYH 1079

Query: 870  LYKTGECAQKWAKISCSTIQKLRERV 947
            L      A++WAKIS  TI+KL+E+V
Sbjct: 1080 LKNAAASAERWAKISSLTIEKLKEKV 1105



 Score = 70.1 bits (170), Expect(2) = 6e-88
 Identities = 33/47 (70%), Positives = 43/47 (91%)
 Frame = +2

Query: 20  SSDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
           ++D DSE+GAKIF+IL++AAEPE+LMA+MTSEQL+SFT YQAK E +
Sbjct: 797 NNDNDSEEGAKIFEILESAAEPEVLMAEMTSEQLASFTSYQAKLEAI 843


>ref|XP_004156673.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101230245
            [Cucumis sativus]
          Length = 1111

 Score =  275 bits (703), Expect(2) = 8e-85
 Identities = 134/273 (49%), Positives = 190/273 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE+AL DAGL  R+VTPFMRVR+VGLTSK S +K   KEGL+TIWNP+EKQ
Sbjct: 839  IRQSDMEKSIERALADAGLSGRDVTPFMRVRVVGLTSKSSQRKTHGKEGLITIWNPSEKQ 898

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y + GL+P++  +D++YLQ +GS+T W+ L+P+  + FE F+ PRK V LS
Sbjct: 899  QLELVEGQAYAIGGLVPINCDADILYLQTKGSTTKWQSLSPQSMKCFEPFYKPRKSVLLS 958

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG+VPL+SEFD+ A++VHVGEV  + ++KKQW+F+ DG    S+S  EG  + LLA++F
Sbjct: 959  NLGEVPLSSEFDVVAIIVHVGEVFATAQQKKQWIFVVDGF--VSESHSEGISNSLLAISF 1016

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C    ++DS   ++ NL+G+  GF NL+KR +DQ+NHLWVA                  H
Sbjct: 1017 CSQYADDDSFVPMNSNLTGSTAGFCNLIKRPKDQINHLWVAEATENTSYFLNFDSTDCSH 1076

Query: 870  LYKTGECAQKWAKISCSTIQKLRERVLCIIQGH 968
            +      A++WA+ S S I+ LRE++L +I  H
Sbjct: 1077 MKNAAVFAKRWAENSTSIIKNLREKILFMIDDH 1109



 Score = 66.6 bits (161), Expect(2) = 8e-85
 Identities = 31/46 (67%), Positives = 40/46 (86%)
 Frame = +2

Query: 23  SDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
           ++ DSE+GAK+FKIL+TAAEPELLMA+M+ EQL+SF  YQAK E +
Sbjct: 794 NESDSEEGAKLFKILETAAEPELLMAEMSPEQLTSFASYQAKIEAI 839


>ref|XP_004137896.1| PREDICTED: uncharacterized protein LOC101215906 [Cucumis sativus]
          Length = 1111

 Score =  275 bits (703), Expect(2) = 8e-85
 Identities = 134/273 (49%), Positives = 190/273 (69%)
 Frame = +3

Query: 150  MRQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQ 329
            +RQS++ K IE+AL DAGL  R+VTPFMRVR+VGLTSK S +K   KEGL+TIWNP+EKQ
Sbjct: 839  IRQSDMEKSIERALADAGLSGRDVTPFMRVRVVGLTSKSSQRKTHGKEGLITIWNPSEKQ 898

Query: 330  KQDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLS 509
            + +LVEGQ Y + GL+P++  +D++YLQ +GS+T W+ L+P+  + FE F+ PRK V LS
Sbjct: 899  QLELVEGQAYAIGGLVPINCDADILYLQTKGSTTKWQSLSPQSMKCFEPFYKPRKSVLLS 958

Query: 510  NLGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYSCLLAVNF 689
            NLG+VPL+SEFD+ A++VHVGEV  + ++KKQW+F+ DG    S+S  EG  + LLA++F
Sbjct: 959  NLGEVPLSSEFDVVAIIVHVGEVFATAQQKKQWIFVVDGF--VSESHSEGISNSLLAISF 1016

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
            C    ++DS   ++ NL+G+  GF NL+KR +DQ+NHLWVA                  H
Sbjct: 1017 CSQYADDDSFVPMNSNLTGSTAGFCNLIKRPKDQINHLWVAEATENTSYFLNFDSTDCSH 1076

Query: 870  LYKTGECAQKWAKISCSTIQKLRERVLCIIQGH 968
            +      A++WA+ S S I+ LRE++L +I  H
Sbjct: 1077 MKNAAVFAKRWAENSTSIIKNLREKILFMIDDH 1109



 Score = 66.6 bits (161), Expect(2) = 8e-85
 Identities = 31/46 (67%), Positives = 40/46 (86%)
 Frame = +2

Query: 23  SDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHETV 160
           ++ DSE+GAK+FKIL+TAAEPELLMA+M+ EQL+SF  YQAK E +
Sbjct: 794 NESDSEEGAKLFKILETAAEPELLMAEMSPEQLTSFASYQAKIEAI 839


>ref|NP_195783.3| breast cancer 2 susceptibility protein [Arabidopsis thaliana]
            gi|31335362|emb|CAD32572.1| breast cancer susceptibility
            protein 2b [Arabidopsis thaliana]
            gi|332002986|gb|AED90369.1| breast cancer 2
            susceptibility protein [Arabidopsis thaliana]
          Length = 1155

 Score =  265 bits (677), Expect(2) = 4e-82
 Identities = 138/272 (50%), Positives = 186/272 (68%), Gaps = 3/272 (1%)
 Frame = +3

Query: 153  RQSEIHKKIEKALEDAGLGAREVTPFMRVRIVGLTSKGSTKKVRPKEGLVTIWNPTEKQK 332
            +Q ++ K + KALEDAGLG R VTPFMR+R+VGLTS  +  +  PKEG+VTIW+PTE+Q+
Sbjct: 881  KQMQMEKSVAKALEDAGLGERNVTPFMRIRLVGLTSLSNEGEHNPKEGIVTIWDPTERQR 940

Query: 333  QDLVEGQIYCVRGLMPLHSVSDVIYLQGRGSSTLWELLAPKEHEKFEHFFTPRKPVSLSN 512
             +L EG+IY ++GL+P++S S+ +YL  RGSS+ W+ L+PK+ E F+ FF PRKP+SLSN
Sbjct: 941  TELTEGKIYIMKGLVPMNSDSETLYLHARGSSSRWQPLSPKDSENFQPFFNPRKPISLSN 1000

Query: 513  LGDVPLASEFDIAAVVVHVGEVCLSGRRKKQWVFMTDGSWGSSDSEFEGPYS-CLLAVNF 689
            LG++PL+SEFDIAA VV+VG+      +KKQWVF+TDGS     ++  G  S  LLA++F
Sbjct: 1001 LGEIPLSSEFDIAAYVVYVGDAYTDVLQKKQWVFVTDGS-----TQHSGEISNSLLAISF 1055

Query: 690  CGPILNNDSSSLISHNLSGTVVGFFNLVKRARDQMNHLWVAXXXXXXXXXXXXXXXXXXH 869
              P +++ S S ISHNL G+VVGF NL+KRA+D  N +WVA                  H
Sbjct: 1056 STPFMDDSSVSHISHNLVGSVVGFCNLIKRAKDATNEMWVAETTENSVYFINAEAAYSSH 1115

Query: 870  LYKTGECAQKWAKI--SCSTIQKLRERVLCII 959
            L       Q WAK+  S S I +LR+RVL II
Sbjct: 1116 LKTRSAHIQTWAKLYSSKSVIHELRQRVLFII 1147



 Score = 67.4 bits (163), Expect(2) = 4e-82
 Identities = 31/44 (70%), Positives = 40/44 (90%)
 Frame = +2

Query: 23  SDLDSEDGAKIFKILQTAAEPELLMADMTSEQLSSFTEYQAKHE 154
           +D DSE+GAK+FK+L+TAAEPELLMA+M+ EQL+SFT Y+AK E
Sbjct: 835 NDTDSEEGAKVFKLLETAAEPELLMAEMSLEQLTSFTTYKAKFE 878

