BLASTX nr result

ID: Dioscorea21_contig00005798

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00005798
         (2618 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241...   238   6e-60
ref|XP_002513834.1| conserved hypothetical protein [Ricinus comm...   210   1e-51
ref|XP_003614856.1| hypothetical protein MTR_5g060420 [Medicago ...   192   3e-46
ref|XP_003542703.1| PREDICTED: uncharacterized protein LOC100800...   190   1e-45
ref|XP_002301016.1| predicted protein [Populus trichocarpa] gi|2...   187   2e-44

>ref|XP_002265987.2| PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  238 bits (607), Expect = 6e-60
 Identities = 187/572 (32%), Positives = 256/572 (44%), Gaps = 34/572 (5%)
 Frame = -2

Query: 1873 GRNGKDALRRSQSMVTGKRGESWPRRPGNDLV----------NXXXXXXXXXXXXSKASF 1724
            GR  +D LRRSQSM+TGKRG+ WPR+   D+           +             KA+F
Sbjct: 131  GRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGIVTSSVQKAAF 190

Query: 1723 EQDFPTLGAEERQGPPEIGRVASPGLSSAIQSLPFSAPTTIRGDGWTSVLAEVPVVVGGN 1544
            +++FP+LGAE++QG P+IGRV SPGL+SAIQSLP      I GDGWTS LAEVPV++G N
Sbjct: 191  DRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSALAEVPVIIGSN 250

Query: 1543 GPSIQSSLQTATVAPASTSLSTTTGLNMAETLAQTPSRVRSD--PQSSVDTQKIDELTRR 1370
               + S  Q+ + +  S + STT+GLNMAETL Q P+R R++  PQ SV TQ+++EL  +
Sbjct: 251  TTGVSSVQQSVSASSVSVAPSTTSGLNMAETLVQGPARARANATPQLSVGTQRLEELALK 310

Query: 1369 QCFKLIPVTPAMXXXXXXXXXXXXXXKGARGVDLVAPSKATPQAPYQLSNHMLR-VPSRL 1193
            Q  +LIP+TP+M                        P       P  L NH  R  P+R 
Sbjct: 311  QSRQLIPMTPSMPKTLVPSPSD-------------KPKSKIGLQPLHLVNHSQRGGPARS 357

Query: 1192 DIPKASQAGNFQVL--NRERNGSASAGKD------GSNSMXXXXXXXXXXXXXSTAAPPL 1037
            D+ K S  G   VL  +RERNG +   KD      GS                ++   P 
Sbjct: 358  DVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPLAVTPSAAGSASLRSPR 417

Query: 1036 KNSTDPKGRVLSPVQNAFGEKRPTLQTQNRNDFFNSIRKKTSMAAPQPI--------STS 881
             N T         V     EKRPT Q Q+RNDFFN +RKK+S   P  +        S+ 
Sbjct: 418  NNPTLASAERRPSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTNPPSAVPESGPAVSSSV 477

Query: 880  SEKASNLTTDAISVT-----EEMASSPNSGLECIKESKNCLSGCSGPCEDAESSSFDNGA 716
            SEK+  L T+ ++        ++ SS NSGL+   E++                    G 
Sbjct: 478  SEKSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENR--------------------GD 517

Query: 715  STERSDDQPIGIVSVGQEMTCSPSSAGAGCVKENGNXXXXXXXXXGDAESSSADNGANTE 536
             TE  +++  G                   V +N              + S  D G   +
Sbjct: 518  KTENGNNEACG-------------------VSQNDRDDEIDNVNGDACDVSQRDQG---D 555

Query: 535  KLGDQMTESVSVGQEMVSSLTSGLGCVKENGNCAIGCSRDQSESCASDNGEDSFSDPVDP 356
            ++ D   ++  V Q+ +           +NG                 +  D    P D 
Sbjct: 556  EVHDGNGDACDVSQKFL-----------DNGE--------------KHSSPDEVLYP-DE 589

Query: 355  EEKAFLESLGWNSEDAADEYLTPQEITEFMAE 260
            EE AFL SLGW  E+  DE LT +EI  F  E
Sbjct: 590  EEAAFLRSLGW-EENGEDEGLTEEEINAFYKE 620


>ref|XP_002513834.1| conserved hypothetical protein [Ricinus communis]
            gi|223546920|gb|EEF48417.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 596

 Score =  210 bits (535), Expect = 1e-51
 Identities = 182/553 (32%), Positives = 252/553 (45%), Gaps = 31/553 (5%)
 Frame = -2

Query: 2245 EPKFIPGWLKXXXXXXXXXXXXXXHIATSSLHLDEHGGGRSSRNRFSTVSACDHDAPXXX 2066
            EP  +P WL+                A+SS H D       SR+R S  S  D D+P   
Sbjct: 5    EPTLVPEWLRSSGSVPGGGSSAHH-FASSSPHSDVSSSVHHSRSRNSK-STSDFDSPRSA 62

Query: 2065 XXXXXXXXXXXXXXXXXXTNHDRDNISQMYSSFSXXXXXXXXXXXXXXXXXXXXXPFQLD 1886
                                H        YSSFS                      +  D
Sbjct: 63   FLDRTSSSNSRRSSSNGSAKH-------AYSSFSRSHRDKDRERDKERLNFGNH--WDND 113

Query: 1885 SFDS-----GRNGKDALRRSQSMVTGKRGESWPRRPGNDLVNXXXXXXXXXXXXS----- 1736
            + D       RN KDALRRS SMV+ K GE  PRR   DL N                  
Sbjct: 114  ASDPLGSILSRNEKDALRRSHSMVSRKLGEVLPRRFAADLRNGSNSNHVNGNGLISGGGV 173

Query: 1735 -----KASFEQDFPTLGAEERQGPPEIGRVASPGLSSAIQSLPFSAPTTIRGDGWTSVLA 1571
                 KA FE+DFP+LG+EERQG P+IGRV+SPGLS+A+QSLP S+   I G+GWTS LA
Sbjct: 174  GNSIPKAVFEKDFPSLGSEERQGAPDIGRVSSPGLSTAVQSLPVSSSALIGGEGWTSALA 233

Query: 1570 EVPVVVGGNGPSIQSSLQTATVAPASTSLSTTTGLNMAETLAQTPSRVRSDPQSSVDTQK 1391
            EVP ++G N     SS+QT   + AS + ST  GLNMAE L Q P+R R+ PQ SV TQ+
Sbjct: 234  EVPAIIGNNSSGSSSSVQTVATS-ASGAPSTVAGLNMAEALTQAPTRTRTAPQLSVQTQR 292

Query: 1390 IDELTRRQCFKLIPVTPAMXXXXXXXXXXXXXXKG-ARGVDLVAPSKATPQAP---YQLS 1223
            ++EL  +Q  +LIPVTP+M              K   R  ++    K   Q P   + ++
Sbjct: 293  LEELAIKQSRQLIPVTPSMPKSSVLNSSDKSKPKTVVRSSEMNMAPKNLQQQPSSLHAVT 352

Query: 1222 NHMLRVPSRLDIPKASQAGNFQVLNRERNGSASAGKDGSN-SMXXXXXXXXXXXXXSTAA 1046
              +     + D  KAS    F +     NG++ + KD +N +              S  +
Sbjct: 353  QSLAGGHVKSDASKASHGKLFVLKPGWENGASPSPKDIANPNNAGRAANSQLAAAPSVPS 412

Query: 1045 PPLKNSTDP-------KGRVLSPVQNAFGEKRPTL-QTQNRNDFFNSIRKKTSMAAPQPI 890
             PL++  +P       K   L+ +     EKRP L QTQ+R+DFFN ++KKT   +   +
Sbjct: 413  APLRSPNNPKLSAGERKSASLNLISGFNVEKRPLLSQTQSRHDFFNLLKKKTLKNSSTAL 472

Query: 889  STSSEKASNLTTD-AISVTEEMASSPNSGLECIKESKNCLSGCSGPCEDA--ESSSFDNG 719
            + S+   S+ T + A  + +E AS+P S  + IK     L+G  G CE+   E ++F   
Sbjct: 473  TDSASAISSPTNEKACEINKEAASAP-SCPQAIKNGSE-LTGNGGTCEEVSEEEAAFLRS 530

Query: 718  ASTERSDDQPIGI 680
               E +  +  G+
Sbjct: 531  LGWEENSGEDEGL 543


>ref|XP_003614856.1| hypothetical protein MTR_5g060420 [Medicago truncatula]
            gi|355516191|gb|AES97814.1| hypothetical protein
            MTR_5g060420 [Medicago truncatula]
          Length = 685

 Score =  192 bits (489), Expect = 3e-46
 Identities = 138/387 (35%), Positives = 194/387 (50%), Gaps = 26/387 (6%)
 Frame = -2

Query: 1876 SGRNGKDALRRSQSMVTGKRGESWPRRPGNDLV----------NXXXXXXXXXXXXSKAS 1727
            SGR  +D LRRS SMV+ K+GE+ PRR   D            N             KA 
Sbjct: 125  SGRIERDTLRRSHSMVSRKQGETLPRRVAADTKSGGSSNHNNGNGALSVGSVGSSIQKAV 184

Query: 1726 FEQDFPTLGAEERQGPPEIGRVASPGL-SSAIQSLPFSAPTTIRGDGWTSVLAEVPVVVG 1550
            F++DFP+LGA+E+QG  EIGRV+SPGL ++A QSLP  +   I G+GWTS LAEVP V+G
Sbjct: 185  FDKDFPSLGADEKQGIAEIGRVSSPGLGATASQSLPVGSSALIGGEGWTSALAEVPSVIG 244

Query: 1549 GNGPSIQSSLQTATVAPASTSLSTTTGLNMAETLAQTPSRVRSDPQSSVDTQKIDELTRR 1370
             +     S+ QT      S S ST  GLNMAE LAQ PSR RS PQ SV TQ+++EL  +
Sbjct: 245  SSSAGSSSAQQTIAATSVSVSSSTAAGLNMAEALAQAPSRARSTPQVSVKTQRLEELAIK 304

Query: 1369 QCFKLIPVTPAMXXXXXXXXXXXXXXKGA-RGVDLVAPSKATPQAPYQL---SNHMLRVP 1202
            Q  +LIPVTP+M              K A R  ++   +K+  Q P  L   S  +  V 
Sbjct: 305  QSRQLIPVTPSMPKALALNSSEKSKPKTAVRNAEMNVATKSALQQPSALHIASQSVRIVN 364

Query: 1201 SRLDIPKASQAGNFQVLNRE--RNGSASAGKDGSNSMXXXXXXXXXXXXXSTAAPPL--- 1037
            +++D+PK S  G F  L      NG++   KD SN               ++AA P    
Sbjct: 365  AKVDVPKTS--GKFTDLKSVVWENGASPTSKDVSNPTNYANSKSANQHCVASAAAPTPVR 422

Query: 1036 --KNSTDPKGRVLSPVQ----NAFGEKRPTLQTQNRNDFFNSIRKKTSMAAPQPISTSSE 875
               N   P+ R  + +     +A  +K+   Q ++RNDFFN ++ KT+  +      S +
Sbjct: 423  NPSNLNSPRERKPASLDLKLGSALDKKQSISQVKSRNDFFNLLKNKTATNSSTVFPDSGQ 482

Query: 874  KASNLTTDAISVTEEMASSPNSGLECI 794
              S+ T +        +  P++  + +
Sbjct: 483  MVSSPTLEKSGEVNRESVMPSASPQSV 509


>ref|XP_003542703.1| PREDICTED: uncharacterized protein LOC100800475 [Glycine max]
          Length = 621

 Score =  190 bits (483), Expect = 1e-45
 Identities = 139/386 (36%), Positives = 186/386 (48%), Gaps = 32/386 (8%)
 Frame = -2

Query: 1873 GRNGKDALRRSQSMVTGKRGESWPRRPGNDLV---------NXXXXXXXXXXXXSKASFE 1721
            GR  +D LRRS SMV+ K+ E  PRR   D           N             KA F+
Sbjct: 124  GRMERDTLRRSHSMVSRKQSEVIPRRVAVDTKSGGSHQNNSNGILSGSNVSSSIQKAVFD 183

Query: 1720 QDFPTLGAEERQGPPEIGRVASPGLSSAI-QSLPFSAPTTIRGDGWTSVLAEVPVVVGGN 1544
            +DFP+L  EE+QG  E+ RV+SPGL +A+ QSLP  +   I G+GWTS LAEVP ++G +
Sbjct: 184  KDFPSLSTEEKQGIAEVVRVSSPGLGAAVSQSLPVGSSALIGGEGWTSALAEVPAIIGSS 243

Query: 1543 GPSIQSSLQTATVAPASTSLSTTTGLNMAETLAQTPSRVRSDPQSSVDTQKIDELTRRQC 1364
                 S  QT      S + STT GLNMAE LAQTPSR RS PQ  V TQ+++EL  +Q 
Sbjct: 244  STGSLSVQQTVNTTSGSVAPSTTAGLNMAEALAQTPSRARSAPQVLVKTQRLEELAIKQS 303

Query: 1363 FKLIPVTPAMXXXXXXXXXXXXXXKGARGVDLVAPSKATPQAPYQL---SNHMLRVPSRL 1193
             +LIPVTP+M                 R  D+   +K  PQ P  L   S  +  V +++
Sbjct: 304  RQLIPVTPSMPKASVHNSEKSKPKTAIRNADMNVVTKTVPQQPSALHIASQSVRSVNAKV 363

Query: 1192 DIPKASQAGNFQVLNRE--RNGSASAGKDGSN--SMXXXXXXXXXXXXXSTAAPPLKNST 1025
            D PK S  G F  L      NG++   KD SN  +                A+ PL+N  
Sbjct: 364  DTPKTS--GKFTDLKSVVWENGASPTSKDVSNPTNYSNSKPGNQHAVASGAASAPLRNPN 421

Query: 1024 D---PKGRVLSPVQNAFG----EKRPTLQTQNRNDFFNSIRKKTSM--------AAPQPI 890
            +   P  R  S +    G    +K    Q Q+RNDFFN I+KKT M        + P   
Sbjct: 422  NLKSPTERKPSSMDLKLGSNLEKKHSISQVQSRNDFFNLIKKKTLMNCSAVLPDSGPMVS 481

Query: 889  STSSEKASNLTTDAISVTEEMASSPN 812
            S + EK+  +  + ++ +    S  N
Sbjct: 482  SPAMEKSGEVNREIVNPSASPQSLGN 507


>ref|XP_002301016.1| predicted protein [Populus trichocarpa] gi|222842742|gb|EEE80289.1|
            predicted protein [Populus trichocarpa]
          Length = 591

 Score =  187 bits (474), Expect = 2e-44
 Identities = 160/506 (31%), Positives = 222/506 (43%), Gaps = 27/506 (5%)
 Frame = -2

Query: 2245 EPKFIPGWLKXXXXXXXXXXXXXXHIATSSLHLDEHGGGRSSRNRFSTVSACDHDAPXXX 2066
            EP  +P WL+                A+SS H D    G  +RNR S  S  D D+P   
Sbjct: 5    EPSLVPEWLRSPGSVSGAGNSAHH-FASSSSHSDVSSLGNHTRNR-SFKSINDFDSPRSA 62

Query: 2065 XXXXXXXXXXXXXXXXXXTNHDRDNISQMYSSFSXXXXXXXXXXXXXXXXXXXXXPFQLD 1886
                                H        YSSFS                      +  D
Sbjct: 63   FLDRQSSSNSRRSSINGSAKHP-------YSSFSRSHRDKDRERDKERSSFGDH--WDRD 113

Query: 1885 SFD------SGRNGKDALRRSQSMVTGKRGESWPRRPGNDLVNXXXXXXXXXXXXS---- 1736
            S D      + RN KD LR S SMV+ K  E   RR  ++L N                 
Sbjct: 114  SSDPLGGILTSRNEKDTLRHSHSMVSRKHSEVMLRRAASELKNGSSSNLANSNGLVSGGS 173

Query: 1735 ------KASFEQDFPTLGAEERQGPPEIGRVASPGLSSAIQSLPFSAPTTIRGDGWTSVL 1574
                  KA FE+DFP+LG E+R+G P+I RV+SPGLSS++Q+LP  +   I G+GWTS L
Sbjct: 174  FGSSSQKAVFEKDFPSLGNEDREGVPDIARVSSPGLSSSVQNLPVGSSALIGGEGWTSAL 233

Query: 1573 AEVPVVVGGNGPSIQSSLQTATVAPASTSLSTTTGLNMAETLAQTPSRVRSDPQSSVDTQ 1394
            AEVP ++G +  S  S+ QT   + + TS S   GLNMAE L Q P R R+ PQ SV TQ
Sbjct: 234  AEVPTIIGNSSTSSSSTAQTVAASSSGTS-SVMAGLNMAEALTQAPLRTRTAPQLSVQTQ 292

Query: 1393 KIDELTRRQCFKLIPVTPAM-XXXXXXXXXXXXXXKGAR--GVDLVAPSKATPQAPYQLS 1223
            +++EL  +Q  +LIPVTP+M                G R   +++ A S     + +  +
Sbjct: 293  RLEELAIKQSRQLIPVTPSMPKNLVLSSSDKSKPKTGIRPGEMNMAAKSSQQQSSLHPAN 352

Query: 1222 NHMLRVPSRLDIPKASQAGNFQVLNRE-RNGSASAGKDGSNSMXXXXXXXXXXXXXSTAA 1046
               + V  + D  K S  G   VL     NG + + KD ++               S  +
Sbjct: 353  QSSVGVHVKSDATKTS--GKLFVLKPVWENGVSPSPKDAASPNTSSRTANSQLAAPSVPS 410

Query: 1045 PPLK-------NSTDPKGRVLSPVQNAFGEKRPTLQTQNRNDFFNSIRKKTSMAAPQPIS 887
            PPL+       +S D K   L+      GEKR    TQ+RN+FFN ++KKT+M       
Sbjct: 411  PPLRSPNNPKISSVDRKPTSLNLNSGFGGEKR----TQSRNNFFNDLKKKTAMNTSSVAD 466

Query: 886  TSSEKASNLTTDAISVTEEMASSPNS 809
            ++S   S  +  +  V +E+ S+P S
Sbjct: 467  SASVVLSPASEKSCEVIKEVVSAPAS 492

