BLASTX nr result

ID: Dioscorea21_contig00011117 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00011117
         (1171 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002325689.1| predicted protein [Populus trichocarpa] gi|2...   156   8e-36
ref|XP_002319928.1| predicted protein [Populus trichocarpa] gi|2...   147   5e-33
ref|XP_002517992.1| conserved hypothetical protein [Ricinus comm...   139   2e-30
ref|XP_002271315.1| PREDICTED: uncharacterized protein LOC100264...   128   2e-27
ref|NP_194552.1| uncharacterized protein [Arabidopsis thaliana] ...   114   4e-23

>ref|XP_002325689.1| predicted protein [Populus trichocarpa] gi|222862564|gb|EEF00071.1|
            predicted protein [Populus trichocarpa]
          Length = 443

 Score =  156 bits (395), Expect = 8e-36
 Identities = 120/339 (35%), Positives = 178/339 (52%), Gaps = 50/339 (14%)
 Frame = -2

Query: 1119 EISKLDSSRKLCFLKLDGVDEDKEPRSWSKTPKPWPSISRGTTPIGLGKAAQSPAGARDA 940
            EI +++   K    +L+ +  +K  R+ SKT +      RG          +      + 
Sbjct: 114  EIEEIEREIKRLSSRLEALRLEKVERNISKTIE-----KRGRIVAAKFMDQKQSVKIEEP 168

Query: 939  IVP-------RRGMSLGPSEIFSATRLR----------PQS----------KLQEIKEKE 841
            ++P       RRG+SLGPSEI S ++ R          P S          KL+EI E +
Sbjct: 169  LIPSSKSKINRRGVSLGPSEILSGSKSRLFCGKQDMNTPVSIQNRRKSCFWKLEEIDELK 228

Query: 840  VMGKERGRSSSTSPKSRRATDSKISDLRKGIATVGPKKLVKKEDTPLTNLKPKALFQEPK 661
               KERG+S S SP+SR+   SKI   ++ + TVG ++ VKKED  + +++PK LF++ +
Sbjct: 229  AT-KERGKSLSVSPRSRKNV-SKIQFPKQAVTTVGSRRSVKKEDGIIASIQPKNLFKDGE 286

Query: 660  NSITGKRPMKNVKGRVVASRYSLVNS-RATGDEQGSKRRKWSLPEVGKEEAPADESKRSL 484
             S+T K+P+K   GRVVASRYS + + ++ G+   S+ RK SLP+  KE    D +KR  
Sbjct: 287  KSVTNKKPLK--PGRVVASRYSQIGTNQSNGNLSASEARKRSLPDNEKE----DVNKRRA 340

Query: 483  SLG----------------EVEVRSMI------AQSPPSIMKVAALLPKIRTTRCAAQSP 370
            S G                E+ +  ++       +SPP++  VA +LPKIRT RC A++P
Sbjct: 341  SRGNGACQRMDSGRVKKKWEIPIEVVVYKGDDEGESPPTVSTVADVLPKIRTVRCVAETP 400

Query: 369  RDSGRAKRVAELVGKTSHFGTVTDDGACSPCQSLNFDEE 253
            RDSG AKRVA+LVGK   F     +   S CQ+L+F  E
Sbjct: 401  RDSGAAKRVADLVGKKPFFCIEEAEAGDSVCQALSFAGE 439


>ref|XP_002319928.1| predicted protein [Populus trichocarpa] gi|222858304|gb|EEE95851.1|
           predicted protein [Populus trichocarpa]
          Length = 446

 Score =  147 bits (371), Expect = 5e-33
 Identities = 104/269 (38%), Positives = 154/269 (57%), Gaps = 40/269 (14%)
 Frame = -2

Query: 936 VPRRGMSLGPSEIFSATRLR----------PQS----------KLQEIKEKEVMGKERGR 817
           + RRG+SLGPSEIFS ++ R          P S          KL+EI E +   KERG+
Sbjct: 177 INRRGVSLGPSEIFSGSKSRLLFGKQEMKTPVSTQNRRKSCFWKLEEIDELKAT-KERGK 235

Query: 816 SSSTSPKSRRATDSKISDLRKGIATVGPKKLVKKEDTPLTNLKPKALFQEPKNSITGKRP 637
           S S SP+SR+   SKI   ++ + TVG ++ VKKED  + +++PK LF++ + S+  K+P
Sbjct: 236 SLSVSPRSRKNV-SKIQVPKQAVTTVGSRRSVKKEDGVIASIQPKNLFKDGERSVPNKKP 294

Query: 636 MKNVKGRVVASRYSLVNS-RATGDEQGSKRRKWSLPEVGKEEAPADESKRSLSL------ 478
           +K   GRVVASRY+ + + ++ G+   S+ RK SLP+  KE+     + R   +      
Sbjct: 295 LK--PGRVVASRYNQIGTNQSNGNLTASEARKRSLPDNEKEDVNKRRASRGNGVSQRAES 352

Query: 477 GEVEVRSMIA------------QSPPSIMKVAALLPKIRTTRCAAQSPRDSGRAKRVAEL 334
           G V+ R  I             +SP ++  V  +LP I+T R  A++PRDSG AKRVA+L
Sbjct: 353 GRVKKRWEIPSEVVVYKDDAEEESPQAVSVVTDMLPNIKTVRSVAETPRDSGPAKRVADL 412

Query: 333 VGKTSHFGTVTDDGA-CSPCQSLNFDEEQ 250
           VG+ S+F  V +  A  S CQ+L+F EE+
Sbjct: 413 VGRKSYFPPVEETAAGDSVCQALSFAEEE 441


>ref|XP_002517992.1| conserved hypothetical protein [Ricinus communis]
           gi|223542974|gb|EEF44510.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 443

 Score =  139 bits (349), Expect = 2e-30
 Identities = 104/272 (38%), Positives = 146/272 (53%), Gaps = 43/272 (15%)
 Frame = -2

Query: 936 VPRRGMSLGPSEIFSATRLRPQSK-------------------LQEIKEKEVMGKERGRS 814
           + RRG+SLGPSEI+SAT+ R  SK                   L+EI E +   KERG+S
Sbjct: 176 INRRGVSLGPSEIYSATKARLLSKQEMSTPVSTKNRRKSCFWKLEEIDELKAT-KERGKS 234

Query: 813 SSTSPKSRRATDSKISDLRKGIATVGPKKLVKKEDTPLTNLKPKALFQEPKNSITGKRPM 634
           SS SP+SR+   SK+   +    T+G +K VKKED  L +++PK LF++ + S+  K+P+
Sbjct: 235 SSVSPRSRKNL-SKVQAPKMAATTIGSRKSVKKEDGILASIQPKTLFKDGQKSVPNKKPV 293

Query: 633 KNVKGRVVASRYSLVNSRATGDEQGS-KRRKWSLPEVGKEEAPA-------------DES 496
           K   GRVV SRY   N  AT    G+   RK SLP+  KE+A               + S
Sbjct: 294 K--PGRVVPSRY---NQIATNQSDGNFSARKRSLPDSDKEDANKRRASRENGANQRIESS 348

Query: 495 KRSLSLGEVEVRSMIAQSPPSIM--------KVAALLPKIRTTRCAAQSPRDSGRAKRVA 340
            ++    E+    ++ +S  +I+         VA +LPKI+T R   ++PRDSG AKRVA
Sbjct: 349 SKAKKKWEIPSELVMFKSDDAIVGESLKVKSPVADVLPKIKTFRSVNETPRDSGAAKRVA 408

Query: 339 ELVGKTSHFGTVTDDGAC--SPCQSLNFDEEQ 250
           +LVGK S F     +     S CQ+L FD E+
Sbjct: 409 DLVGKKSFFSINEGEETAGDSICQALRFDFEE 440


>ref|XP_002271315.1| PREDICTED: uncharacterized protein LOC100264907 [Vitis vinifera]
          Length = 466

 Score =  128 bits (322), Expect = 2e-27
 Identities = 109/274 (39%), Positives = 149/274 (54%), Gaps = 49/274 (17%)
 Frame = -2

Query: 930 RRGMSLGPSEIFSATRLR-----------PQS-------KLQEIKEKEVMGKERGRSSST 805
           RRGMSLGPSEI +  RLR            QS       KL++I E +V  KERG+S + 
Sbjct: 190 RRGMSLGPSEIAAGGRLRHLGKPEVTPISTQSRRKSCFWKLEDIDEGKVT-KERGKSMTV 248

Query: 804 SPKSRRATDSKISDLRKGIATVGPKKLVKKEDTPLTNLKPKALFQEPKNSITGKRPMKNV 625
           SPK+R+   SK    ++   T+  K+ VKKE   +++++PK LF + + S   K+P+KN 
Sbjct: 249 SPKNRKII-SKTQASKQAATTIASKRPVKKELGFVSSIQPKKLFTDGEKS--AKKPLKN- 304

Query: 624 KGRVVASRYSLVNSRATGDEQGSKRRKWSLPEV---GKE---------EAPA---DESKR 490
            GRVVASRYSL+ +++TG    S R++ SLPE    GK          E P     E+  
Sbjct: 305 -GRVVASRYSLIGNQSTGGCSSSLRKR-SLPENEDNGKRCDKRRNSSLEKPGGIFQENGE 362

Query: 489 SLSLGEV----EVRSMIA---------QSPPS---IMKVAALLPKIRTTRCAAQSPRDSG 358
           +L  G V    E+ S +          +SPPS   I K+  +LP IRT RC  +SPR+SG
Sbjct: 363 NLDKGRVKKKWEIPSEVVVVHKSLENDESPPSPRSITKMPDILPMIRTDRCINESPRNSG 422

Query: 357 RAKRVAELVGKTSHFGTVTDDGACSPCQSLNFDE 256
            AKRVAEL+G+ S+F    D    S   SL+F E
Sbjct: 423 PAKRVAELIGRKSYFCADEDGEDHSVSLSLDFAE 456


>ref|NP_194552.1| uncharacterized protein [Arabidopsis thaliana]
           gi|7269677|emb|CAB79625.1| putative protein [Arabidopsis
           thaliana] gi|30102712|gb|AAP21274.1| At4g28230
           [Arabidopsis thaliana] gi|110736440|dbj|BAF00188.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332660056|gb|AEE85456.1| uncharacterized protein
           [Arabidopsis thaliana]
          Length = 402

 Score =  114 bits (286), Expect = 4e-23
 Identities = 98/258 (37%), Positives = 139/258 (53%), Gaps = 33/258 (12%)
 Frame = -2

Query: 930 RRGMSLGPSEIFSATRL------------RPQS---KLQEIKEKEVMGKERGRSS-STSP 799
           RRG+SLGP+EIF++ +             R +S   KL  I+E +V  + +GR+S S SP
Sbjct: 164 RRGVSLGPAEIFNSAKKSETVTPLQSAQNRRKSCFFKLPGIEEGQVTTRGKGRTSLSLSP 223

Query: 798 KSRRATDSKISDLRKGIAT-VGPKKLVKKEDTPLTNLKPKALFQEPKNSITGKRPMKNVK 622
           +SR+A   K++  +K  AT VG K+ VKKE+  L  ++PK LF+E + +++ ++P+K   
Sbjct: 224 RSRKA---KMTAAQKQAATTVGSKRAVKKEEGVLLTIQPKRLFKEDEKNVSLRKPLK--P 278

Query: 621 GRVVASRYSLVNSRATGDEQGSKRRKWSLPEVGKEE--------APADESKRSLSL---- 478
           GRVVASRYS +    TG++   KR   SLPE  ++E          +DES +S       
Sbjct: 279 GRVVASRYSQMGKTQTGEKDVRKR---SLPEDEEKENHKRSEKRRASDESNKSEGRVKKR 335

Query: 477 ----GEVEVRSMIAQSPPSIMKVAALLPKIRTTRCAAQSPRDSGRAKRVAELVGKTSHFG 310
                EV++ S       S   +   LPKIRT R    SPRDSG AKRVAEL  K  +F 
Sbjct: 336 WEIPSEVDLYSSGENGDES--PIVKELPKIRTLRRVGGSPRDSGAAKRVAELQAKDRNF- 392

Query: 309 TVTDDGACSPCQSLNFDE 256
                   + CQ L F+E
Sbjct: 393 --------TFCQLLKFEE 402


Top