BLASTX nr result
ID: Dioscorea21_contig00003807
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00003807
(2087 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002883367.1| hydroxyproline-rich glycoprotein family prot...   157    1e-35
gb|EEE60086.1| hypothetical protein OsJ_12936 [Oryza sativa Japo...   155    3e-35
gb|EEC76321.1| hypothetical protein OsI_13877 [Oryza sativa Indi...   155    3e-35
ref|NP_001051543.1| Os03g0794900 [Oryza sativa Japonica Group] g...   155    3e-35
ref|XP_003536533.1| PREDICTED: uncharacterized protein LOC100779...   155    5e-35

>ref|XP_002883367.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp. lyrata] gi|297329207|gb|EFH59626.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 531

Score = 157 bits (396), Expect = 1e-35
Identities = 87/181 (48%), Positives = 116/181 (64%), Gaps = 12/181 (6%)
Frame = -2

Query: 2041 SFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXXXLDASTKRTLDSLNR 1862
SFE +KQ L+TSCT LW+ELSDHF++LE+ LD T+ +L+SL R
17 SFEEFQKQTSLMTSCTLLWQELSDHFTSLEQNLMKKSEALKQMIETLDNQTQTSLESLKR 76

Query: 1861 REHSIDTSVCIALAKLEDRRLAALHAL---------SSTDSTEADD---LAAKLRSLSTK 1718
RE +ID SV I K+ +R AAL +L S+ DS E DD L + L+SL K
77 REVTIDHSVEIVAGKVGERARAALESLEKARDGGDGSNDDSGEVDDEEGLLSALKSLCLK 136

Query: 1717 MDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSNDYG 1538
MD GF++ +T++KKE++ LRS++P AL DC+DPA V++AIS VFPVD+R K SNDYG
137 MDARGFWNFVTARKKELENLRSKIPAALVDCVDPAMLVLEAISEVFPVDKRGDKVSNDYG 196

Query: 1537 W 1535
W
197 W 197

Score = 133 bits (335), Expect = 2e-28
Identities = 61/102 (59%), Positives = 77/102 (75%)
Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
I+ K H FLQH+VTFGI ED LYR L++ AWR+QMPKLA+ +G D M DMI
247 IENVKTPDVHTFLQHLVTFGIVKSEDLALYRKLVVGSAWRKQMPKLAVSVGLGDQMPDMI 306

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKKTA 1072
EELI++GQ LDA++F YE GL DKFPP+PLLK +L+D+KK+A
307 EELISRGQQLDAVHFTYEVGLVDKFPPVPLLKAYLRDAKKSA 348

Score = 76.6 bits (187), Expect = 2e-11
Identities = 49/107 (45%), Positives = 58/107 (54%), Gaps = 11/107 (10%)
Frame = -2

Query: 871 PANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAAPTYVRSPSHPTYPAAASPYGQVGNGA 692
PANKRTRA+ GPMPPAKAGR+T NAYVSSFP ++RSPSH A+ + Y
415 PANKRTRASYNGPMPPAKAGRIT-NAYVSSFP----FIRSPSHSPQYASPAAYPSPPTTV 469

Query: 691 YGSRSPP-----------AIRDPYGYPAEEMSPHARGAPYLPAPLQY 584
Y +RSPP P GYPA + + G PAP Y
470 YSNRSPPYPYSPEIIPGSYQGSPIGYPA--YNGYCNGPVPAPAPPVY 514

>gb|EEE60086.1| hypothetical protein OsJ_12936 [Oryza sativa Japonica Group]
Length = 516

Score = 155 bits (393), Expect = 3e-35
Identities = 70/100 (70%), Positives = 85/100 (85%)
Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
++G KP AHAFLQHV TF +A KEDKELYR ++++F+WRRQMP+LA+ LG ED M D+I
211 VEGAKPPDAHAFLQHVATFAVAEKEDKELYRRIVVSFSWRRQMPRLAITLGLEDEMDDII 270

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKK 1078
EELI KGQ LDA+NFAYEAGLQ+KFPP PLLK +L+DSKK
271 EELITKGQQLDAVNFAYEAGLQEKFPPAPLLKAYLEDSKK 310

Score = 110 bits (274), Expect = 2e-21
Identities = 64/184 (34%), Positives = 96/184 (52%)
Frame = -2

Query: 2086 MGSAAELIPSSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXX 1907
+G + S+AVR+ F LE+Q++L+ +CT L+++L++HF +LER
11 VGVGVDSATSAAVRDGFAELERQQQLLATCTRLYQQLTEHFGSLERRLAARSETLRTKRR 70

Query: 1906 XLDASTKRTLDSLNRREHSIDTSVCIALAKLEDRRLAALHALSSTDSTEADDLAAKLRSL 1727
LD T R L++L RRE SID SV +AL++L+ S DD
71 LDVRTSRRLEALRRREASIDGSVSLALSRLD----------SLAKGQRGDD-------- 112

Query: 1726 STKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSN 1547
+EVD LR+E+P AL C+DPA+F +DA+S VFP+D+R V++
113 ----------------REVDALRAELPDALKRCVDPARFAMDAVSEVFPIDKRAVRSPT 155

Query: 1546 DYGW 1535
D W
156 DLAW 159

Score = 75.9 bits (185), Expect(2) = 2e-15
Identities = 47/110 (42%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
Frame = -2

Query: 889 SPSPNSPANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAA----PTYVRSPSHPTYPAAA 722
S + PANKR RA+ GGPMPPAKAGRLT+ S PA T++RSPSH +Y A
381 SGGSSGPANKRIRASTGGPMPPAKAGRLTDYTGTPSSPATTTTNATFIRSPSHASY-GTA 439

Query: 721 SPYG---QVGNGAYGSRSPPAIRDPYGY-PAEEMSPHARGAPYLPAPLQY 584
SPY + Y ++ A+R+PY Y E+S G Y P+ Y
440 SPYSYDRPAAHPLYCGQNTLAMREPYAYHHPSEVSSVGLGMSYPSPPITY 489

Score = 34.7 bits (78), Expect(2) = 2e-15
Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 7/31 (22%)
Frame = -1

Query: 479 PMTYPTYGGYSNG------LAPAY-QQGYYR 408
P+TYP Y GYSNG +APA+ Q YYR
486 PITYPAYAGYSNGIGYSNAMAPAFHHQAYYR 516

>gb|EEC76321.1| hypothetical protein OsI_13877 [Oryza sativa Indica Group]
Length = 550

Score = 155 bits (393), Expect = 3e-35
Identities = 70/100 (70%), Positives = 85/100 (85%)
Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
++G KP AHAFLQHV TF +A KEDKELYR ++++F+WRRQMP+LA+ LG ED M D+I
245 VEGAKPPDAHAFLQHVATFAVAEKEDKELYRRIVVSFSWRRQMPRLAITLGLEDEMDDII 304

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKK 1078
EELI KGQ LDA+NFAYEAGLQ+KFPP PLLK +L+DSKK
305 EELITKGQQLDAVNFAYEAGLQEKFPPAPLLKAYLEDSKK 344

Score = 144 bits (363), Expect = 9e-32
Identities = 75/184 (40%), Positives = 113/184 (61%)
Frame = -2

Query: 2086 MGSAAELIPSSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXX 1907
+G + S+AVR+ F LE+Q++L+ +CT L+++L++HF +LER
11 VGVGVDSATSAAVRDGFAELERQQQLLATCTRLYQQLTEHFGSLERRLAARSETLRTKRR 70

Query: 1906 XLDASTKRTLDSLNRREHSIDTSVCIALAKLEDRRLAALHALSSTDSTEADDLAAKLRSL 1727
LD T R L++L RRE SID SV +AL++L+ S S +A +A LRSL
71 LDVRTSRRLEALRRREASIDGSVSLALSRLDSLAKGDAGTTGSA-SADAAGIAEGLRSL 129

Query: 1726 STKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSN 1547
MD+ GFF + +++KEVD LR+E+P AL C+DPA+F +DA+S VFP+D+R V++
130 CASMDSAGFFTFVVARRKEVDALRAELPDALKRCVDPARFAMDAVSEVFPIDKRAVRSPT 189

Query: 1546 DYGW 1535
D W
190 DLAW 193

Score = 75.9 bits (185), Expect(2) = 7e-16
Identities = 47/110 (42%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
Frame = -2

Query: 889 SPSPNSPANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAA----PTYVRSPSHPTYPAAA 722
S + PANKR RA+ GGPMPPAKAGRLT+ S PA T++RSPSH +Y A
415 SGGSSGPANKRIRASTGGPMPPAKAGRLTDYTGTPSSPATTTTNATFIRSPSHASY-GTA 473

Query: 721 SPYG---QVGNGAYGSRSPPAIRDPYGY-PAEEMSPHARGAPYLPAPLQY 584
SPY + Y ++ A+R+PY Y E+S G Y P+ Y
474 SPYSYDRPAAHPLYCGQNTLAMREPYAYHHPSEVSSVGLGMSYPSPPMTY 523

Score = 36.2 bits (82), Expect(2) = 7e-16
Identities = 18/31 (58%), Positives = 20/31 (64%), Gaps = 7/31 (22%)
Frame = -1

Query: 479 PMTYPTYGGYSNG------LAPAY-QQGYYR 408
PMTYP Y GYSNG +APA+ Q YYR
520 PMTYPAYAGYSNGIGYSNAMAPAFHHQAYYR 550

>ref|NP_001051543.1| Os03g0794900 [Oryza sativa Japonica Group]
gi|50400037|gb|AAT76425.1| expressed protein [Oryza sativa Japonica Group]
gi|108711534|gb|ABF99329.1| hydroxyproline-rich glycoprotein family protein, putative, expressed [Oryza sativa Japonica Group]
gi|113550014|dbj|BAF13457.1| Os03g0794900 [Oryza sativa Japonica Group]
gi|215734812|dbj|BAG95534.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 550

Score = 155 bits (393), Expect = 3e-35
Identities = 70/100 (70%), Positives = 85/100 (85%)
Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
++G KP AHAFLQHV TF +A KEDKELYR ++++F+WRRQMP+LA+ LG ED M D+I
245 VEGAKPPDAHAFLQHVATFAVAEKEDKELYRRIVVSFSWRRQMPRLAITLGLEDEMDDII 304

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKK 1078
EELI KGQ LDA+NFAYEAGLQ+KFPP PLLK +L+DSKK
305 EELITKGQQLDAVNFAYEAGLQEKFPPAPLLKAYLEDSKK 344

Score = 144 bits (363), Expect = 9e-32
Identities = 75/184 (40%), Positives = 113/184 (61%)
Frame = -2

Query: 2086 MGSAAELIPSSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXX 1907
+G + S+AVR+ F LE+Q++L+ +CT L+++L++HF +LER
11 VGVGVDSATSAAVRDGFAELERQQQLLATCTRLYQQLTEHFGSLERRLAARSETLRTKRR 70

Query: 1906 XLDASTKRTLDSLNRREHSIDTSVCIALAKLEDRRLAALHALSSTDSTEADDLAAKLRSL 1727
LD T R L++L RRE SID SV +AL++L+ S S +A +A LRSL
71 LDVRTSRRLEALRRREASIDGSVSLALSRLDSLAKGDAGTTGSA-SADAAGIAEGLRSL 129

Query: 1726 STKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSN 1547
MD+ GFF + +++KEVD LR+E+P AL C+DPA+F +DA+S VFP+D+R V++
130 CASMDSAGFFTFVVARRKEVDALRAELPDALKRCVDPARFAMDAVSEVFPIDKRAVRSPT 189

Query: 1546 DYGW 1535
D W
190 DLAW 193

Score = 75.9 bits (185), Expect(2) = 2e-15
Identities = 47/110 (42%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
Frame = -2

Query: 889 SPSPNSPANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAA----PTYVRSPSHPTYPAAA 722
S + PANKR RA+ GGPMPPAKAGRLT+ S PA T++RSPSH +Y A
415 SGGSSGPANKRIRASTGGPMPPAKAGRLTDYTGTPSSPATTTTNATFIRSPSHASY-GTA 473

Query: 721 SPYG---QVGNGAYGSRSPPAIRDPYGY-PAEEMSPHARGAPYLPAPLQY 584
SPY + Y ++ A+R+PY Y E+S G Y P+ Y
474 SPYSYDRPAAHPLYCGQNTLAMREPYAYHHPSEVSSVGLGMSYPSPPITY 523

Score = 34.7 bits (78), Expect(2) = 2e-15
Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 7/31 (22%)
Frame = -1

Query: 479 PMTYPTYGGYSNG------LAPAY-QQGYYR 408
P+TYP Y GYSNG +APA+ Q YYR
520 PITYPAYAGYSNGIGYSNAMAPAFHHQAYYR 550

>ref|XP_003536533.1| PREDICTED: uncharacterized protein LOC100779694 [Glycine max]
Length = 530

Score = 155 bits (391), Expect = 5e-35
Identities = 88/186 (47%), Positives = 118/186 (63%), Gaps = 11/186 (5%)
Frame = -2

Query: 2059 SSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXXXLDASTKRT 1880
S + SF+ ++Q L+TSCT LWKELSDHFS+LE+ LD +T +
11 SELTQPSFDEFQRQTSLMTSCTLLWKELSDHFSSLEQDLNHKSEALKRKIRTLDNTTSDS 70

Query: 1879 LDSLNRREHSIDTSVCIALAKLEDRRLAALHAL--------SSTDSTEADD---LAAKLR 1733
L L+RRE S+D ++ IAL L+ RR AAL AL +S+ E DD L KL+
71 LRLLDRRETSLDATLQIALRTLDTRRTAALSALLTDADDIINSSPDGEVDDTTGLILKLK 130

Query: 1732 SLSTKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKT 1553
S +MD GFF +++KKKE+D LR+E+PVALA+C+DPAKFV++AIS VFPVD+R K
131 SFCLRMDAFGFFAFVSAKKKELDGLRAEMPVALAECVDPAKFVLEAISEVFPVDKRGDKA 190

Query: 1552 SNDYGW 1535
+D GW
191 GHDLGW 196

Score = 140 bits (353), Expect = 1e-30
Identities = 65/102 (63%), Positives = 78/102 (76%)
Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
++ K H FLQHVVTFGI ED +LYR L++A AWR+QMPKLAL LG M DMI
246 VENVKTPDVHTFLQHVVTFGIVKNEDSDLYRKLVIASAWRKQMPKLALSLGLAQQMPDMI 305

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKKTA 1072
EELI+KGQ LDA++F YE GL +KFPP+PLLK+FLKD+KK A
306 EELISKGQQLDAVHFTYEVGLVEKFPPVPLLKSFLKDAKKVA 347

Score = 107 bits (266), Expect(2) = 2e-26
Identities = 62/98 (63%), Positives = 70/98 (71%), Gaps = 2/98 (2%)
Frame = -2

Query: 871 PANKRTRANN--GGPMPPAKAGRLTNNAYVSSFPAAPTYVRSPSHPTYPAAASPYGQVGN 698
PANKRTRA+N GGPMPPAKAGRLTN AYVSSFPAAPT+VRSPSH YPAA PY +
417 PANKRTRASNSNGGPMPPAKAGRLTN-AYVSSFPAAPTFVRSPSHGQYPAALPPYPSPPH 475

Query: 697 GAYGSRSPPAIRDPYGYPAEEMSPHARGAPYLPAPLQY 584
YGSRSPP +PY + E +P G+ Y AP+ Y
476 -MYGSRSPPT--NPYAAYSPEPAPAIAGS-YPAAPMNY 509

Score = 40.8 bits (94), Expect(2) = 2e-26
Identities = 18/25 (72%), Positives = 18/25 (72%), Gaps = 1/25 (4%)
Frame = -1

Query: 479 PMTYP-TYGGYSNGLAPAYQQGYYR 408
PM YP YGGY N LAP YQQ YYR
506 PMNYPPAYGGYGNVLAPTYQQAYYR 530
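The per-HSP statistics in the report above (Score, Expect, Identities, Positives, Gaps, Frame) follow a fixed textual pattern, so they can be pulled out with a regular expression. The following is a minimal sketch, not part of the original record: the names `HSP_STATS` and `parse_hsps` are hypothetical, and the pattern is assumed to cover only the field layout seen in this report. It relies only on whitespace separation, so it does not depend on where the line breaks fall.

```python
import re

# One HSP statistics record as printed in this report. The "Expect(2)"
# form and the optional "Gaps" field both occur above, so both are allowed.
# HSP_STATS and parse_hsps are illustrative names, not part of any BLAST tool.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>[\de.+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsps(text):
    """Return one dict of numeric statistics per HSP record found in text."""
    hsps = []
    for m in HSP_STATS.finditer(text):
        evalue = m.group("evalue")
        if evalue.startswith("e"):
            # Assumption: a bare exponent such as "e-100" means "1e-100".
            evalue = "1" + evalue
        length = int(m.group("length"))
        identities = int(m.group("ident"))
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": identities,
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),  # field is absent when there are no gaps
            "align_length": length,
            "pct_identity": 100.0 * identities / length,
            "frame": int(m.group("frame")),
        })
    return hsps


# Two statistics records copied from the first hit in the report.
SAMPLE = (
    "Score = 157 bits (396), Expect = 1e-35 Identities = 87/181 (48%), "
    "Positives = 116/181 (64%), Gaps = 12/181 (6%) Frame = -2 "
    "Score = 133 bits (335), Expect = 2e-28 Identities = 61/102 (59%), "
    "Positives = 77/102 (75%) Frame = -3"
)

for hsp in parse_hsps(SAMPLE):
    print(hsp["bits"], hsp["evalue"], round(hsp["pct_identity"], 1), hsp["frame"])
```

Recomputing the percent identity from the raw counts gives more precision than the rounded value in parentheses. For anything beyond a quick check, a maintained BLAST parser is likely a safer choice than a hand-written pattern.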