BLASTX nr result

ID: Dioscorea21_contig00003807 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00003807
         (2087 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002883367.1| hydroxyproline-rich glycoprotein family prot...   157   1e-35
gb|EEE60086.1| hypothetical protein OsJ_12936 [Oryza sativa Japo...   155   3e-35
gb|EEC76321.1| hypothetical protein OsI_13877 [Oryza sativa Indi...   155   3e-35
ref|NP_001051543.1| Os03g0794900 [Oryza sativa Japonica Group] g...   155   3e-35
ref|XP_003536533.1| PREDICTED: uncharacterized protein LOC100779...   155   5e-35

>ref|XP_002883367.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329207|gb|EFH59626.1|
            hydroxyproline-rich glycoprotein family protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 531

 Score =  157 bits (396), Expect = 1e-35
 Identities = 87/181 (48%), Positives = 116/181 (64%), Gaps = 12/181 (6%)
 Frame = -2

Query: 2041 SFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXXXLDASTKRTLDSLNR 1862
            SFE  +KQ  L+TSCT LW+ELSDHF++LE+               LD  T+ +L+SL R
Sbjct: 17   SFEEFQKQTSLMTSCTLLWQELSDHFTSLEQNLMKKSEALKQMIETLDNQTQTSLESLKR 76

Query: 1861 REHSIDTSVCIALAKLEDRRLAALHAL---------SSTDSTEADD---LAAKLRSLSTK 1718
            RE +ID SV I   K+ +R  AAL +L         S+ DS E DD   L + L+SL  K
Sbjct: 77   REVTIDHSVEIVAGKVGERARAALESLEKARDGGDGSNDDSGEVDDEEGLLSALKSLCLK 136

Query: 1717 MDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSNDYG 1538
            MD  GF++ +T++KKE++ LRS++P AL DC+DPA  V++AIS VFPVD+R  K SNDYG
Sbjct: 137  MDARGFWNFVTARKKELENLRSKIPAALVDCVDPAMLVLEAISEVFPVDKRGDKVSNDYG 196

Query: 1537 W 1535
            W
Sbjct: 197  W 197



 Score =  133 bits (335), Expect = 2e-28
 Identities = 61/102 (59%), Positives = 77/102 (75%)
 Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
            I+  K    H FLQH+VTFGI   ED  LYR L++  AWR+QMPKLA+ +G  D M DMI
Sbjct: 247  IENVKTPDVHTFLQHLVTFGIVKSEDLALYRKLVVGSAWRKQMPKLAVSVGLGDQMPDMI 306

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKKTA 1072
            EELI++GQ LDA++F YE GL DKFPP+PLLK +L+D+KK+A
Sbjct: 307  EELISRGQQLDAVHFTYEVGLVDKFPPVPLLKAYLRDAKKSA 348



 Score = 76.6 bits (187), Expect = 2e-11
 Identities = 49/107 (45%), Positives = 58/107 (54%), Gaps = 11/107 (10%)
 Frame = -2

Query: 871 PANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAAPTYVRSPSHPTYPAAASPYGQVGNGA 692
           PANKRTRA+  GPMPPAKAGR+T NAYVSSFP    ++RSPSH    A+ + Y       
Sbjct: 415 PANKRTRASYNGPMPPAKAGRIT-NAYVSSFP----FIRSPSHSPQYASPAAYPSPPTTV 469

Query: 691 YGSRSPP-----------AIRDPYGYPAEEMSPHARGAPYLPAPLQY 584
           Y +RSPP               P GYPA   + +  G    PAP  Y
Sbjct: 470 YSNRSPPYPYSPEIIPGSYQGSPIGYPA--YNGYCNGPVPAPAPPVY 514


>gb|EEE60086.1| hypothetical protein OsJ_12936 [Oryza sativa Japonica Group]
          Length = 516

 Score =  155 bits (393), Expect = 3e-35
 Identities = 70/100 (70%), Positives = 85/100 (85%)
 Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
            ++G KP  AHAFLQHV TF +A KEDKELYR ++++F+WRRQMP+LA+ LG ED M D+I
Sbjct: 211  VEGAKPPDAHAFLQHVATFAVAEKEDKELYRRIVVSFSWRRQMPRLAITLGLEDEMDDII 270

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKK 1078
            EELI KGQ LDA+NFAYEAGLQ+KFPP PLLK +L+DSKK
Sbjct: 271  EELITKGQQLDAVNFAYEAGLQEKFPPAPLLKAYLEDSKK 310



 Score =  110 bits (274), Expect = 2e-21
 Identities = 64/184 (34%), Positives = 96/184 (52%)
 Frame = -2

Query: 2086 MGSAAELIPSSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXX 1907
            +G   +   S+AVR+ F  LE+Q++L+ +CT L+++L++HF +LER              
Sbjct: 11   VGVGVDSATSAAVRDGFAELERQQQLLATCTRLYQQLTEHFGSLERRLAARSETLRTKRR 70

Query: 1906 XLDASTKRTLDSLNRREHSIDTSVCIALAKLEDRRLAALHALSSTDSTEADDLAAKLRSL 1727
             LD  T R L++L RRE SID SV +AL++L+          S       DD        
Sbjct: 71   FLDVRTSRRLEALRRREASIDGSVSLALSRLD----------SLAKGQRGDD-------- 112

Query: 1726 STKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSN 1547
                             +EVD LR+E+P AL  C+DPA+F +DA+S VFP+D+R V++  
Sbjct: 113  -----------------REVDALRAELPDALKRCVDPARFAMDAVSEVFPIDKRAVRSPT 155

Query: 1546 DYGW 1535
            D  W
Sbjct: 156  DLAW 159



 Score = 75.9 bits (185), Expect(2) = 2e-15
 Identities = 47/110 (42%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
 Frame = -2

Query: 889 SPSPNSPANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAA----PTYVRSPSHPTYPAAA 722
           S   + PANKR RA+ GGPMPPAKAGRLT+     S PA      T++RSPSH +Y   A
Sbjct: 381 SGGSSGPANKRIRASTGGPMPPAKAGRLTDYTGTPSSPATTTTNATFIRSPSHASY-GTA 439

Query: 721 SPYG---QVGNGAYGSRSPPAIRDPYGY-PAEEMSPHARGAPYLPAPLQY 584
           SPY       +  Y  ++  A+R+PY Y    E+S    G  Y   P+ Y
Sbjct: 440 SPYSYDRPAAHPLYCGQNTLAMREPYAYHHPSEVSSVGLGMSYPSPPITY 489



 Score = 34.7 bits (78), Expect(2) = 2e-15
 Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 7/31 (22%)
 Frame = -1

Query: 479 PMTYPTYGGYSNG------LAPAY-QQGYYR 408
           P+TYP Y GYSNG      +APA+  Q YYR
Sbjct: 486 PITYPAYAGYSNGIGYSNAMAPAFHHQAYYR 516


>gb|EEC76321.1| hypothetical protein OsI_13877 [Oryza sativa Indica Group]
          Length = 550

 Score =  155 bits (393), Expect = 3e-35
 Identities = 70/100 (70%), Positives = 85/100 (85%)
 Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
            ++G KP  AHAFLQHV TF +A KEDKELYR ++++F+WRRQMP+LA+ LG ED M D+I
Sbjct: 245  VEGAKPPDAHAFLQHVATFAVAEKEDKELYRRIVVSFSWRRQMPRLAITLGLEDEMDDII 304

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKK 1078
            EELI KGQ LDA+NFAYEAGLQ+KFPP PLLK +L+DSKK
Sbjct: 305  EELITKGQQLDAVNFAYEAGLQEKFPPAPLLKAYLEDSKK 344



 Score =  144 bits (363), Expect = 9e-32
 Identities = 75/184 (40%), Positives = 113/184 (61%)
 Frame = -2

Query: 2086 MGSAAELIPSSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXX 1907
            +G   +   S+AVR+ F  LE+Q++L+ +CT L+++L++HF +LER              
Sbjct: 11   VGVGVDSATSAAVRDGFAELERQQQLLATCTRLYQQLTEHFGSLERRLAARSETLRTKRR 70

Query: 1906 XLDASTKRTLDSLNRREHSIDTSVCIALAKLEDRRLAALHALSSTDSTEADDLAAKLRSL 1727
             LD  T R L++L RRE SID SV +AL++L+           S  S +A  +A  LRSL
Sbjct: 71   FLDVRTSRRLEALRRREASIDGSVSLALSRLDSLAKGDAGTTGSA-SADAAGIAEGLRSL 129

Query: 1726 STKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSN 1547
               MD+ GFF  + +++KEVD LR+E+P AL  C+DPA+F +DA+S VFP+D+R V++  
Sbjct: 130  CASMDSAGFFTFVVARRKEVDALRAELPDALKRCVDPARFAMDAVSEVFPIDKRAVRSPT 189

Query: 1546 DYGW 1535
            D  W
Sbjct: 190  DLAW 193



 Score = 75.9 bits (185), Expect(2) = 7e-16
 Identities = 47/110 (42%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
 Frame = -2

Query: 889 SPSPNSPANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAA----PTYVRSPSHPTYPAAA 722
           S   + PANKR RA+ GGPMPPAKAGRLT+     S PA      T++RSPSH +Y   A
Sbjct: 415 SGGSSGPANKRIRASTGGPMPPAKAGRLTDYTGTPSSPATTTTNATFIRSPSHASY-GTA 473

Query: 721 SPYG---QVGNGAYGSRSPPAIRDPYGY-PAEEMSPHARGAPYLPAPLQY 584
           SPY       +  Y  ++  A+R+PY Y    E+S    G  Y   P+ Y
Sbjct: 474 SPYSYDRPAAHPLYCGQNTLAMREPYAYHHPSEVSSVGLGMSYPSPPMTY 523



 Score = 36.2 bits (82), Expect(2) = 7e-16
 Identities = 18/31 (58%), Positives = 20/31 (64%), Gaps = 7/31 (22%)
 Frame = -1

Query: 479 PMTYPTYGGYSNG------LAPAY-QQGYYR 408
           PMTYP Y GYSNG      +APA+  Q YYR
Sbjct: 520 PMTYPAYAGYSNGIGYSNAMAPAFHHQAYYR 550


>ref|NP_001051543.1| Os03g0794900 [Oryza sativa Japonica Group] gi|50400037|gb|AAT76425.1|
            expressed protein [Oryza sativa Japonica Group]
            gi|108711534|gb|ABF99329.1| hydroxyproline-rich
            glycoprotein family protein, putative, expressed [Oryza
            sativa Japonica Group] gi|113550014|dbj|BAF13457.1|
            Os03g0794900 [Oryza sativa Japonica Group]
            gi|215734812|dbj|BAG95534.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 550

 Score =  155 bits (393), Expect = 3e-35
 Identities = 70/100 (70%), Positives = 85/100 (85%)
 Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
            ++G KP  AHAFLQHV TF +A KEDKELYR ++++F+WRRQMP+LA+ LG ED M D+I
Sbjct: 245  VEGAKPPDAHAFLQHVATFAVAEKEDKELYRRIVVSFSWRRQMPRLAITLGLEDEMDDII 304

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKK 1078
            EELI KGQ LDA+NFAYEAGLQ+KFPP PLLK +L+DSKK
Sbjct: 305  EELITKGQQLDAVNFAYEAGLQEKFPPAPLLKAYLEDSKK 344



 Score =  144 bits (363), Expect = 9e-32
 Identities = 75/184 (40%), Positives = 113/184 (61%)
 Frame = -2

Query: 2086 MGSAAELIPSSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXX 1907
            +G   +   S+AVR+ F  LE+Q++L+ +CT L+++L++HF +LER              
Sbjct: 11   VGVGVDSATSAAVRDGFAELERQQQLLATCTRLYQQLTEHFGSLERRLAARSETLRTKRR 70

Query: 1906 XLDASTKRTLDSLNRREHSIDTSVCIALAKLEDRRLAALHALSSTDSTEADDLAAKLRSL 1727
             LD  T R L++L RRE SID SV +AL++L+           S  S +A  +A  LRSL
Sbjct: 71   FLDVRTSRRLEALRRREASIDGSVSLALSRLDSLAKGDAGTTGSA-SADAAGIAEGLRSL 129

Query: 1726 STKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKTSN 1547
               MD+ GFF  + +++KEVD LR+E+P AL  C+DPA+F +DA+S VFP+D+R V++  
Sbjct: 130  CASMDSAGFFTFVVARRKEVDALRAELPDALKRCVDPARFAMDAVSEVFPIDKRAVRSPT 189

Query: 1546 DYGW 1535
            D  W
Sbjct: 190  DLAW 193



 Score = 75.9 bits (185), Expect(2) = 2e-15
 Identities = 47/110 (42%), Positives = 60/110 (54%), Gaps = 8/110 (7%)
 Frame = -2

Query: 889 SPSPNSPANKRTRANNGGPMPPAKAGRLTNNAYVSSFPAA----PTYVRSPSHPTYPAAA 722
           S   + PANKR RA+ GGPMPPAKAGRLT+     S PA      T++RSPSH +Y   A
Sbjct: 415 SGGSSGPANKRIRASTGGPMPPAKAGRLTDYTGTPSSPATTTTNATFIRSPSHASY-GTA 473

Query: 721 SPYG---QVGNGAYGSRSPPAIRDPYGY-PAEEMSPHARGAPYLPAPLQY 584
           SPY       +  Y  ++  A+R+PY Y    E+S    G  Y   P+ Y
Sbjct: 474 SPYSYDRPAAHPLYCGQNTLAMREPYAYHHPSEVSSVGLGMSYPSPPITY 523



 Score = 34.7 bits (78), Expect(2) = 2e-15
 Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 7/31 (22%)
 Frame = -1

Query: 479 PMTYPTYGGYSNG------LAPAY-QQGYYR 408
           P+TYP Y GYSNG      +APA+  Q YYR
Sbjct: 520 PITYPAYAGYSNGIGYSNAMAPAFHHQAYYR 550


>ref|XP_003536533.1| PREDICTED: uncharacterized protein LOC100779694 [Glycine max]
          Length = 530

 Score =  155 bits (391), Expect = 5e-35
 Identities = 88/186 (47%), Positives = 118/186 (63%), Gaps = 11/186 (5%)
 Frame = -2

Query: 2059 SSAVRESFETLEKQRELITSCTALWKELSDHFSTLERGXXXXXXXXXXXXXXLDASTKRT 1880
            S   + SF+  ++Q  L+TSCT LWKELSDHFS+LE+               LD +T  +
Sbjct: 11   SELTQPSFDEFQRQTSLMTSCTLLWKELSDHFSSLEQDLNHKSEALKRKIRTLDNTTSDS 70

Query: 1879 LDSLNRREHSIDTSVCIALAKLEDRRLAALHAL--------SSTDSTEADD---LAAKLR 1733
            L  L+RRE S+D ++ IAL  L+ RR AAL AL        +S+   E DD   L  KL+
Sbjct: 71   LRLLDRRETSLDATLQIALRTLDTRRTAALSALLTDADDIINSSPDGEVDDTTGLILKLK 130

Query: 1732 SLSTKMDTEGFFDLLTSKKKEVDILRSEVPVALADCIDPAKFVIDAISMVFPVDRRVVKT 1553
            S   +MD  GFF  +++KKKE+D LR+E+PVALA+C+DPAKFV++AIS VFPVD+R  K 
Sbjct: 131  SFCLRMDAFGFFAFVSAKKKELDGLRAEMPVALAECVDPAKFVLEAISEVFPVDKRGDKA 190

Query: 1552 SNDYGW 1535
             +D GW
Sbjct: 191  GHDLGW 196



 Score =  140 bits (353), Expect = 1e-30
 Identities = 65/102 (63%), Positives = 78/102 (76%)
 Frame = -3

Query: 1377 IDGTKPSSAHAFLQHVVTFGIAVKEDKELYRMLILAFAWRRQMPKLALLLGFEDSMTDMI 1198
            ++  K    H FLQHVVTFGI   ED +LYR L++A AWR+QMPKLAL LG    M DMI
Sbjct: 246  VENVKTPDVHTFLQHVVTFGIVKNEDSDLYRKLVIASAWRKQMPKLALSLGLAQQMPDMI 305

Query: 1197 EELINKGQHLDAINFAYEAGLQDKFPPIPLLKTFLKDSKKTA 1072
            EELI+KGQ LDA++F YE GL +KFPP+PLLK+FLKD+KK A
Sbjct: 306  EELISKGQQLDAVHFTYEVGLVEKFPPVPLLKSFLKDAKKVA 347



 Score =  107 bits (266), Expect(2) = 2e-26
 Identities = 62/98 (63%), Positives = 70/98 (71%), Gaps = 2/98 (2%)
 Frame = -2

Query: 871 PANKRTRANN--GGPMPPAKAGRLTNNAYVSSFPAAPTYVRSPSHPTYPAAASPYGQVGN 698
           PANKRTRA+N  GGPMPPAKAGRLTN AYVSSFPAAPT+VRSPSH  YPAA  PY    +
Sbjct: 417 PANKRTRASNSNGGPMPPAKAGRLTN-AYVSSFPAAPTFVRSPSHGQYPAALPPYPSPPH 475

Query: 697 GAYGSRSPPAIRDPYGYPAEEMSPHARGAPYLPAPLQY 584
             YGSRSPP   +PY   + E +P   G+ Y  AP+ Y
Sbjct: 476 -MYGSRSPPT--NPYAAYSPEPAPAIAGS-YPAAPMNY 509



 Score = 40.8 bits (94), Expect(2) = 2e-26
 Identities = 18/25 (72%), Positives = 18/25 (72%), Gaps = 1/25 (4%)
 Frame = -1

Query: 479 PMTYP-TYGGYSNGLAPAYQQGYYR 408
           PM YP  YGGY N LAP YQQ YYR
Sbjct: 506 PMNYPPAYGGYGNVLAPTYQQAYYR 530


Top