BLASTX nr result
ID: Dioscorea21_contig00004215
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00004215 (1600 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAH68012.1| OSIGBa0157K09-H0214G12.23 [Oryza sativa Indica G... 350 5e-94 ref|NP_001053298.1| Os04g0512400 [Oryza sativa Japonica Group] g... 350 5e-94 ref|XP_004163582.1| PREDICTED: protein BREAST CANCER SUSCEPTIBIL... 342 1e-91 ref|XP_004151994.1| PREDICTED: protein BREAST CANCER SUSCEPTIBIL... 339 1e-90 tpg|DAA36947.1| TPA: ATBRCA1 [Zea mays] 338 3e-90 >emb|CAH68012.1| OSIGBa0157K09-H0214G12.23 [Oryza sativa Indica Group] Length = 629 Score = 350 bits (899), Expect = 5e-94 Identities = 191/421 (45%), Positives = 258/421 (61%), Gaps = 21/421 (4%) Frame = -1 Query: 1303 NEPAKSKKRKLNTGIKTCSQIKPSHSSNQS----AADGPVKCAFCHSFKITDLTGLMQHY 1136 + P KR+ NT ++K S++Q A G KC FCHS K T+ TG + HY Sbjct: 209 SSPQSVLKREPNTANDDNRELKRQKSTDQDDRQPAVAGAWKCEFCHSSKTTESTGPLSHY 268 Query: 1135 VGERLVEKHQPSQANSIHAHQKCVDWAPQVYYEGDKVVNLEAEILRASXXXXXXXXXXXX 956 + +E +Q + N +H H+KC++WAPQ ++ GD NLE E+ RAS Sbjct: 269 LHGEPLEDNQAWKPNVLHVHEKCIEWAPQAFFTGDIANNLEPELARASKIKCSVCGLKGA 328 Query: 955 XXXCFLEKCKKSYHVPCAVQLRGCRWDCENYLVLCPSHTNLKLPCDDPGSTGMKIHTDQ- 779 C ++ C+KS+HVPCA + GCRWD EN+++LCPSH++ KLPC+ S K + Sbjct: 329 ALGCLVKSCRKSFHVPCAHGISGCRWDDENFVMLCPSHSSKKLPCERSKSKNKKTSLQRS 388 Query: 778 -----------PSPGQMD-----STTQNSKWILCGSALSEEEKELVDKFANFIGAAVRKT 647 PS MD S S+W++CGSALS +EKE++D+F + G V Sbjct: 389 SSDTMLDDLNSPSTIHMDGLWTASPFLTSEWVICGSALSSQEKEILDQFEHQTGITVTNG 448 Query: 646 WDQIVTHVIASTDEKGACSRTLKVLMAILTGKWVLNINWVKASMEARKLVSEEPYEINLD 467 W VTHVIA+TDE GAC+RTLKVLMAIL 
WDQIVTHVIASTDEKGACSRTLKVLMAILTGKWVLNINWVKASMEARKLVSEEPYEINLD 467 W VTHVIA+TDE GAC+RTLKVLMAIL
GKWVLNINW+KA MEA++ V EEPYEI+ D Sbjct: 449 WRSNVTHVIANTDECGACARTLKVLMAILAGKWVLNINWLKACMEAKEPVPEEPYEISSD 508 Query: 466 IYGSSDGPKTGRIRLMKKFPKLFAGLSFYFTGYFSPSRQRDLETLISVAGGVILDKNDAL 287 ++GS DGP+ GR+R M+ P LFAGL+FYF+G+F P+ + LE LI+ AGG ILDK D Sbjct: 509 VHGSFDGPRMGRLRAMQNAPHLFAGLTFYFSGHFMPNYKVHLEDLITAAGGSILDKADL- 567 Query: 286 VLDNSSSQLIYIVYNAEPPASNFSWDPVEDVRKRCEDAEALAGKIHAQVITHTRLLDAIA 107 SS+ L I+Y+ EPP + E +RKR +AE LA I ++ + HT +LD+IA Sbjct: 568 ----SSTSL--IIYSMEPPQGSDPDTLNEVIRKRKAEAEELAATIGSRAVPHTCVLDSIA 621 Query: 106 S 104 S Sbjct: 622 S 622 >ref|NP_001053298.1| Os04g0512400 [Oryza sativa Japonica Group] gi|38345319|emb|CAE03392.2| OSJNBa0004N05.16 [Oryza sativa Japonica Group] gi|113564869|dbj|BAF15212.1| Os04g0512400 [Oryza sativa Japonica Group] gi|215737022|dbj|BAG95951.1| unnamed protein product [Oryza sativa Japonica Group] gi|218195201|gb|EEC77628.1| hypothetical protein OsI_16617 [Oryza sativa Indica Group] gi|222629197|gb|EEE61329.1| hypothetical protein OsJ_15441 [Oryza sativa Japonica Group] Length = 629 Score = 350 bits (899), Expect = 5e-94 Identities = 191/421 (45%), Positives = 258/421 (61%), Gaps = 21/421 (4%) Frame = -1 Query: 1303 NEPAKSKKRKLNTGIKTCSQIKPSHSSNQS----AADGPVKCAFCHSFKITDLTGLMQHY 1136 + P KR+ NT ++K S++Q A G KC FCHS K T+ TG + HY Sbjct: 209 SSPQSVLKREPNTANDDNRELKRQKSTDQDDRQPAVAGAWKCEFCHSSKTTESTGPLSHY 268 Query: 1135 VGERLVEKHQPSQANSIHAHQKCVDWAPQVYYEGDKVVNLEAEILRASXXXXXXXXXXXX 956 + +E +Q + N +H H+KC++WAPQ ++ GD NLE E+ RAS Sbjct: 269 LHGEPLEDNQAWKPNVLHVHEKCIEWAPQAFFTGDIANNLEPELARASKIKCSVCGLKGA 328 Query: 955 XXXCFLEKCKKSYHVPCAVQLRGCRWDCENYLVLCPSHTNLKLPCDDPGSTGMKIHTDQ- 779 C ++ C+KS+HVPCA + GCRWD EN+++LCPSH++ KLPC+ S K + Sbjct: 329 ALGCLVKSCRKSFHVPCAHGISGCRWDDENFVMLCPSHSSKKLPCERSKSKNKKTSLQRS 388 Query: 778 -----------PSPGQMD-----STTQNSKWILCGSALSEEEKELVDKFANFIGAAVRKT 647 PS MD S S+W++CGSALS +EKE++D+F + G V Sbjct: 389 SSDTMLDDLNSPSTIHMDGLWTASPFLTSEWVICGSALSSQEKEILDQFEHQTGITVTNG 448 Query: 646 
WDQIVTHVIASTDEKGACSRTLKVLMAILTGKWVLNINWVKASMEARKLVSEEPYEINLD 467 W VTHVIA+TDE GAC+RTLKVLMAIL GKWVLNINW+KA MEA++ V EEPYEI+ D Sbjct: 449 WRSNVTHVIANTDECGACARTLKVLMAILAGKWVLNINWLKACMEAKEPVPEEPYEISSD 508 Query: 466 IYGSSDGPKTGRIRLMKKFPKLFAGLSFYFTGYFSPSRQRDLETLISVAGGVILDKNDAL 287 ++GS DGP+ GR+R M+ P LFAGL+FYF+G+F P+ + LE LI+ AGG ILDK D Sbjct: 509 VHGSFDGPRMGRLRAMQNAPHLFAGLTFYFSGHFMPNYKVHLEDLITAAGGSILDKADL- 567 Query: 286 VLDNSSSQLIYIVYNAEPPASNFSWDPVEDVRKRCEDAEALAGKIHAQVITHTRLLDAIA 107 SS+ L I+Y+ EPP + E +RKR +AE LA I ++ + HT +LD+IA Sbjct: 568 ----SSTSL--IIYSMEPPQGSDPDTLNEVIRKRKAEAEELAATIGSRAVPHTCVLDSIA 621 Query: 106 S 104 S Sbjct: 622 S 622 >ref|XP_004163582.1| PREDICTED: protein BREAST CANCER SUSCEPTIBILITY 1 homolog [Cucumis sativus] Length = 679 Score = 342 bits (878), Expect = 1e-91 Identities = 183/423 (43%), Positives = 255/423 (60%), Gaps = 14/423 (3%) Frame = -1 Query: 1303 NEPAKSKKRKLNTGIKTCSQIKPSHSSNQSAADGP----VKCAFCHSFKITDLTGLMQHY 1136 +EP S+ N+G++ SQ+ + S + AD VKCAFC S K+T+ TG + HY Sbjct: 259 SEPGNSETS--NSGMEHKSQVTNASSMPLADADDTIVRNVKCAFCQSSKVTEDTGAVLHY 316 Query: 1135 VGERLVEKHQPSQANSIHAHQKCVDWAPQVYYEGDKVVNLEAEILRASXXXXXXXXXXXX 956 + RLV+ + + N IH H+ CV+WAPQ Y++GD V NL+AE+ R S Sbjct: 317 MNGRLVDGVEAASPNVIHVHKLCVEWAPQAYFQGDDVHNLKAEVARGSKLKCSKCGLKGA 376 Query: 955 XXXCFLEKCKKSYHVPCAVQLRGCRWDCENYLVLCPSHTNLKLPCD--DPGSTGMKIHTD 782 C+L C+KSYHVPCA+++ CRWD +N+LVLCPSHT+ + P + P + Sbjct: 377 ALGCYLRSCQKSYHVPCALEIDECRWDMDNFLVLCPSHTSARFPDERSKPRKNNFDVFNI 436 Query: 781 QPSPGQMDSTTQNS------KWILCGSALSEEEKELVDKFANFIGAAVRKTWDQIVTHVI 620 S Q D + S KW CGSALS EE+ ++ KFA GA V K W VTHVI Sbjct: 437 VSSRNQKDLSNWASASDGVNKWTFCGSALSAEERNILVKFAKLTGATVSKLWKPDVTHVI 496 Query: 619 ASTDEKGACSRTLKVLMAILTGKWVLNINWVKASMEARKLVSEEPYEINLDIYGSSDGPK 440 ASTDE GAC+RT KVLM IL G W+LN++WVK M+ + ++EE YEI LD YG +DGPK Sbjct: 497 ASTDENGACTRTYKVLMGILNGIWILNMDWVKDCMKEKCPLNEEAYEIALDNYGCTDGPK 556 Query: 439 TGRIRLMKKFPKLFAGLSFYFTGYFSPSRQRDLETLISVAGGVILDKND--ALVLDNSSS 266 TGR+R++ K PKLF GLSFYFTG F P+ + DL+ 
L+ AGG +L+ + A ++ ++ Sbjct: 557 TGRLRVLNKEPKLFIGLSFYFTGDFPPAYEEDLQDLVITAGGTVLEDEELAATSSNDQAA 616 Query: 265 QLIYIVYNAEPPASNFSWDPVEDVRKRCEDAEALAGKIHAQVITHTRLLDAIASSKF*SC 86 + +VYN + P + V + +R +AE +A K+ AQVI HT L+++IA Sbjct: 617 PKVVVVYNLDSPGGCKVGEEVSILWQRMNEAEGIAAKVGAQVIGHTWLVESIAMGSLQPF 676 Query: 85 IHC 77 + C Sbjct: 677 VSC 679 >ref|XP_004151994.1| PREDICTED: protein BREAST CANCER SUSCEPTIBILITY 1 homolog [Cucumis sativus] Length = 679 Score = 339 bits (870), Expect = 1e-90 Identities = 182/423 (43%), Positives = 254/423 (60%), Gaps = 14/423 (3%) Frame = -1 Query: 1303 NEPAKSKKRKLNTGIKTCSQIKPSHSSNQSAADGP----VKCAFCHSFKITDLTGLMQHY 1136 +EP S+ N+G++ SQ+ + S + AD VKCAFC S K+T+ TG + HY Sbjct: 259 SEPGNSETS--NSGMEHKSQVTNASSMPLADADDTIVRNVKCAFCQSSKVTEDTGAVLHY 316 Query: 1135 VGERLVEKHQPSQANSIHAHQKCVDWAPQVYYEGDKVVNLEAEILRASXXXXXXXXXXXX 956 + RLV+ + + N IH H+ CV+WAPQ Y++GD V NL+AE+ R S Sbjct: 317 MNGRLVDGVEAASPNVIHVHKLCVEWAPQAYFQGDDVHNLKAEVARGSKLKCSKCGLKGA 376 Query: 955 XXXCFLEKCKKSYHVPCAVQLRGCRWDCENYLVLCPSHTNLKLPCD--DPGSTGMKIHTD 782 C+L C+KSYHVPCA+++ CRWD +N+LVLCPSHT+ + P + P + Sbjct: 377 ALGCYLRSCQKSYHVPCALEIDECRWDMDNFLVLCPSHTSARFPDERSKPRKNNFDVFNI 436 Query: 781 QPSPGQMDSTTQNS------KWILCGSALSEEEKELVDKFANFIGAAVRKTWDQIVTHVI 620 S Q D + S KW CGSALS EE+ ++ KFA GA V K W VTHVI Sbjct: 437 VSSRNQKDLSNWASASDGVNKWTFCGSALSAEERNILVKFAKLTGATVSKLWKPDVTHVI 496 Query: 619 ASTDEKGACSRTLKVLMAILTGKWVLNINWVKASMEARKLVSEEPYEINLDIYGSSDGPK 440 ASTDE GAC+RT KVLM IL G W+LN++WVK M+ + ++EE YEI LD YG +DGPK Sbjct: 497 ASTDENGACTRTYKVLMGILNGIWILNMDWVKDCMKEKCPLNEEAYEIALDNYGCTDGPK 556 Query: 439 TGRIRLMKKFPKLFAGLSFYFTGYFSPSRQRDLETLISVAGGVILDKND--ALVLDNSSS 266 TGR+R++ K KLF GLSFYFTG F P+ + DL+ L+ AGG +L+ + A ++ ++ Sbjct: 557 TGRLRVLNKESKLFIGLSFYFTGDFPPAYEEDLQDLVITAGGTVLEDEELAATSSNDQAA 616 Query: 265 QLIYIVYNAEPPASNFSWDPVEDVRKRCEDAEALAGKIHAQVITHTRLLDAIASSKF*SC 86 + +VYN + P + V + +R +AE +A K+ AQVI HT L+++IA Sbjct: 617 PKVVVVYNLDSPGGCKVGEEVSILWQRMNEAEGIAAKVGAQVIGHTWLVESIAMGSLQPF 676 Query: 85 IHC 77 + C 
Sbjct: 677 VSC 679 >tpg|DAA36947.1| TPA: ATBRCA1 [Zea mays] Length = 631 Score = 338 bits (866), Expect = 3e-90 Identities = 180/412 (43%), Positives = 249/412 (60%), Gaps = 19/412 (4%) Frame = -1 Query: 1282 KRKLNTGIKTCSQIKPSHSSNQSAADGPV----KCAFCHSFKITDLTGLMQHYVGERLVE 1115 KR+ N ++K S++Q + KC FCHS ++T+ TG + HY+ VE Sbjct: 216 KREANAMDDHTRELKKQKSNDQVQRQTNMASAWKCEFCHSSQVTECTGPLSHYLNGEPVE 275 Query: 1114 KHQPSQANSIHAHQKCVDWAPQVYYEGDKVVNLEAEILRASXXXXXXXXXXXXXXXCFLE 935 Q +++ H H+KC++WAPQ ++ GD NL E+ RAS C ++ Sbjct: 276 ADQAWKSSVQHVHEKCIEWAPQAFFTGDTANNLGPELARASKIKCSVCGLKGAALGCLVK 335 Query: 934 KCKKSYHVPCAVQLRGCRWDCENYLVLCPSHTNLKLPCDDPGSTGMKIHTDQPS------ 773 C+KS+HVPCA ++GC+WD EN+++LCP+H++ KLPC+ K QPS Sbjct: 336 SCRKSFHVPCAYNIKGCKWDQENFVMLCPTHSSKKLPCERL-KPKKKAKLQQPSSDIDGP 394 Query: 772 --PGQMD-------STTQNSKWILCGSALSEEEKELVDKFANFIGAAVRKTWDQIVTHVI 620 P M S S+W++CGSAL EKE++D+F G V TW VTHVI Sbjct: 395 ISPSPMQRAELWTASPFLTSEWVICGSALVGHEKEILDQFECHTGITVTNTWSSDVTHVI 454 Query: 619 ASTDEKGACSRTLKVLMAILTGKWVLNINWVKASMEARKLVSEEPYEINLDIYGSSDGPK 440 A+TDE+GAC+RTLKVLMAIL GKWVLN+NW+KA MEAR+ V EEPYEI D++GS DGP+ Sbjct: 455 ANTDERGACARTLKVLMAILAGKWVLNVNWLKACMEAREPVPEEPYEIRCDVHGSVDGPR 514 Query: 439 TGRIRLMKKFPKLFAGLSFYFTGYFSPSRQRDLETLISVAGGVILDKNDALVLDNSSSQL 260 +GR+R M++ P LFAGL+FYF+G+F P + +LE LI+ AGG +L+K + SS+ L Sbjct: 515 SGRLRAMQQAPGLFAGLTFYFSGHFMPGYRANLEDLIAAAGGSVLEKAEL-----SSTSL 569 Query: 259 IYIVYNAEPPASNFSWDPVEDVRKRCEDAEALAGKIHAQVITHTRLLDAIAS 104 I A PP +N D +E + KR +A+ LA + I HT LLD IAS Sbjct: 570 ILYSMEAPPPHNNL--DALETINKRLAEAQELATTAGCKAIPHTWLLDCIAS 619
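The per-hit statistics in the report above (Length, Score, Expect, Identities, Positives, Gaps, Frame) follow a fixed textual pattern, so they can be pulled out of the flattened text with a regular expression. The following is a minimal sketch in Python, not part of BLAST itself: the names `HIT_RE`, `parse_hits`, and `sample` are hypothetical, and the pattern assumes the whitespace-collapsed layout seen in this record rather than every BLAST output format. The `sample` string repeats two hit headers from the report verbatim, with the alignment rows omitted.

```python
import re

# Assumed layout of one hit header in the flattened report:
# ">ACCESSION description Length = N Score = B bits (R), Expect = E
#  Identities = a/b (p%), Positives = c/d (q%), Gaps = g/h (r%) Frame = f"
HIT_RE = re.compile(
    r">(?P<accession>\S+)\s.*?"
    r"Length = (?P<length>\d+)\s+"
    r"Score = (?P<bits>[\d.]+) bits \((?P<raw>\d+)\), "
    r"Expect = (?P<evalue>\S+)\s+"
    r"Identities = (?P<ident>\d+)/(?P<aln_len>\d+) \(\d+%\), "
    r"Positives = (?P<pos>\d+)/\d+ \(\d+%\), "
    r"Gaps = (?P<gaps>\d+)/\d+ \(\d+%\)\s+"
    r"Frame = (?P<frame>[+-]\d)",
    re.DOTALL,
)


def parse_hits(report_text):
    """Return one dict of summary statistics per hit header found."""
    hits = []
    for m in HIT_RE.finditer(report_text):
        evalue = m.group("evalue")
        # Some reports print E-values such as "e-100" without a leading
        # digit; prefix "1" so float() accepts them.
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        aln_len = int(m.group("aln_len"))
        hits.append({
            "accession": m.group("accession"),
            "length": int(m.group("length")),
            "bits": float(m.group("bits")),
            "evalue": float(evalue),
            "identity_pct": 100.0 * ident / aln_len,
            "frame": int(m.group("frame")),
        })
    return hits


# Two hit headers copied from the report above (alignment rows omitted).
sample = (
    ">emb|CAH68012.1| OSIGBa0157K09-H0214G12.23 [Oryza sativa Indica Group] "
    "Length = 629 Score = 350 bits (899), Expect = 5e-94 "
    "Identities = 191/421 (45%), Positives = 258/421 (61%), "
    "Gaps = 21/421 (4%) Frame = -1 "
    ">tpg|DAA36947.1| TPA: ATBRCA1 [Zea mays] "
    "Length = 631 Score = 338 bits (866), Expect = 3e-90 "
    "Identities = 180/412 (43%), Positives = 249/412 (60%), "
    "Gaps = 19/412 (4%) Frame = -1"
)

hits = parse_hits(sample)
for hit in hits:
    print(hit["accession"], hit["bits"], hit["evalue"], hit["frame"])
```

Because the pattern relies on a lazy match between the accession and `Length =`, it is only suited to records where every hit header carries a full statistics block, as is the case here.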