BLASTX nr result

ID: Dioscorea21_contig00012081

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00012081
         (1551 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280902.1| PREDICTED: uncharacterized WD repeat-contain...   423   e-116
ref|XP_003526921.1| PREDICTED: U5 small nuclear ribonucleoprotei...   421   e-115
ref|XP_002321752.1| predicted protein [Populus trichocarpa] gi|2...   419   e-114
ref|XP_003522451.1| PREDICTED: uncharacterized protein LOC100798...   416   e-114
ref|XP_004145140.1| PREDICTED: uncharacterized protein LOC101220...   416   e-113

>ref|XP_002280902.1| PREDICTED: uncharacterized WD repeat-containing protein alr3466
            [Vitis vinifera] gi|297734510|emb|CBI15757.3| unnamed
            protein product [Vitis vinifera]
          Length = 526

 Score =  423 bits (1087), Expect = e-116
 Identities = 228/484 (47%), Positives = 308/484 (63%), Gaps = 22/484 (4%)
 Frame = -2

Query: 1550 EHQLRKTCIMVVQLEEAEKRLVAAQLKLRRQSSSSELQAGNRTAGNGSSSGK-------- 1395
            EH  ++      QL+EAEK+L+ +Q +L R  S   L +      NG+   K        
Sbjct: 44   EHIRKRITYYQSQLQEAEKKLLDSQAQLGRLRSKDNLVSSRSPLDNGTKKVKVERRSTSP 103

Query: 1394 ----SRPQLLIPSVDR--SSKTHMAAKGPLLIGSTPKPK---SRVNAEDSKVASSSRPTA 1242
                SRP+LLIP+V+   S    +A  G      TP       +V  E S  +SS R   
Sbjct: 104  LQLQSRPELLIPAVNPKISQPVKLADAGTRARSPTPNHAHSVGKVKWEKSHGSSSEREMV 163

Query: 1241 SLGSQ-MRHKIEQKDHKDLIPSVGSCSSARLFDFAPMVRIPSQHNRKMRCLEANPTDDRM 1065
             +  +  + K E K+HK+LI S+ + SS  +        I SQH RK+R L   P ++++
Sbjct: 164  EVQDKGTKRKFEPKEHKELITSIATSSSPVIVRCYSSSHISSQHKRKLRSLVLCPVNNQL 223

Query: 1064 FVTSALNGTVNVWKLQADGSKPNALLLSTIDCEIPERKKWPEDLAWHPNGEMLFCAYSAD 885
            F TSAL+G VN+W++QA GS   A LLS  DC   ++++WPED+AWHP G  +F  Y+AD
Sbjct: 224  FATSALDGIVNLWQIQARGS--GASLLSATDCLSQKQRRWPEDMAWHPEGNSIFSVYNAD 281

Query: 884  GGGPQVSTVDLKAT-GKDRVAFLAKKPHLKGTINSIIFLHGIDLCFATGGSDHAVILWKR 708
             G  Q+S ++L  T G  RV FL +KPH+KG INSI F+   + CF TGGSDHAV+LW  
Sbjct: 282  DGDSQISVLNLNRTKGGARVTFLEEKPHVKGIINSISFMPWENPCFVTGGSDHAVVLWNE 341

Query: 707  RNG--FWKPKVLHQNQHSSTVKGVASLQHMKILLSVGLDKKIIGFDILADKCGFQHQINH 534
            ++    WKPK LH+N HSS V GVA +Q  +I+LSVG DK+IIG D+   +  F+HQI+ 
Sbjct: 342  KDEEKLWKPKALHRNMHSSAVMGVAGMQQKQIVLSVGADKRIIGLDLHTGRTDFKHQIDS 401

Query: 533  QCMSVLPNPCDVNLYMVQTSEHGRQLRLYDIRSREREIHTFGWKQ-SSESKSGLISQSWS 357
            +CMS++PNPCD NLYMVQ    G+QLRL+DIR R+ E+H FGWKQ SS+S S LI+Q+WS
Sbjct: 402  KCMSIVPNPCDFNLYMVQAGTPGKQLRLFDIRLRQTELHAFGWKQESSDSLSALINQTWS 461

Query: 356  HDGWHLSSGSLDPAIHLFDIRCNGKGPSQTIHAHQRRVFKAIWHQSIPLMTSVSSDHNIG 177
             DG +++SGS+DP IH+FDIR     PSQ+I AHQ+RVFKAIWH S+PL+ S+SSD NIG
Sbjct: 462  PDGLYITSGSVDPVIHIFDIRSYAHKPSQSIKAHQKRVFKAIWHHSLPLLISISSDLNIG 521

Query: 176  LHEM 165
            LH++
Sbjct: 522  LHKL 525


>ref|XP_003526921.1| PREDICTED: U5 small nuclear ribonucleoprotein 40 kDa protein-like
            [Glycine max]
          Length = 533

 Score =  421 bits (1081), Expect = e-115
 Identities = 229/491 (46%), Positives = 307/491 (62%), Gaps = 41/491 (8%)
 Frame = -2

Query: 1514 QLEEAEKRLVAAQLKLRRQ-----SSSSELQAG--------------NRTAGNGSSSGKS 1392
            QL+EAEKRL  ++ KL R      SS S L  G              +R  G+  +  +S
Sbjct: 44   QLDEAEKRLQDSESKLARLRGQTVSSRSTLDDGIKTLKTERRSNSPIDRNEGSTKNRHQS 103

Query: 1391 RPQLLIPS-------------------VDRSSKTHMAAKGPLLIGSTPKPKSRVNAEDSK 1269
            +P+LLIPS                   +  SS+         + G + + KS   +  ++
Sbjct: 104  KPELLIPSANPKISQPVLLPKSFSKASITSSSEATPGVHSSPITGGSSRGKSD-KSHSNR 162

Query: 1268 VASSSRPTASLGSQMRHKIEQKDHKDLIPSVGSCSSARLFDFAPMVRIPSQHNRKMRCLE 1089
            ++S  + T       + K EQK+HK+LIP +   SS  L        I SQH RK+R + 
Sbjct: 163  LSSEQQKTEVKEKGTKRKFEQKEHKELIPLIRKSSSPSLVTCQTSNHISSQHKRKLRSIA 222

Query: 1088 ANPTDDRMFVTSALNGTVNVWKLQADGSKPNALLLSTIDCEIPERKKWPEDLAWHPNGEM 909
              P +D++FVTSAL+G VN W++QA G+   A  LST DC   ++++WPEDLAWHP G  
Sbjct: 223  LCPVNDQLFVTSALDGVVNFWQVQAKGA--GASRLSTTDCASQKQRRWPEDLAWHPEGNS 280

Query: 908  LFCAYSADGGGPQVSTVDL-KATGKDRVAFLAKKPHLKGTINSIIFLHGIDLCFATGGSD 732
            LF  YSADGG  QVS  +L K  G +RV FL  KPH+KG IN I+F+   + CF TGGSD
Sbjct: 281  LFSVYSADGGDSQVSITNLNKGPGGERVKFLEDKPHVKGIINGIVFMPWENTCFVTGGSD 340

Query: 731  HAVILWKRRNGF-WKPKVLHQNQHSSTVKGVASLQHMKILLSVGLDKKIIGFDILADKCG 555
            HAVILW  ++   WKPK LH+N HSS V GVA +QH +++LSVG D++I G+D+ A +  
Sbjct: 341  HAVILWSEQDDEKWKPKALHRNLHSSAVMGVAGMQHKQMVLSVGADRRIFGYDVRAGRAD 400

Query: 554  FQHQINHQCMSVLPNPCDVNLYMVQTSEHGRQLRLYDIRSREREIHTFGWKQ-SSESKSG 378
            F+HQ++ +CMSVLPNP D NL+MVQT  H RQLRL+DIR R  E+H FGWKQ SS+S+S 
Sbjct: 401  FKHQVDSKCMSVLPNPSDFNLFMVQTGTHERQLRLFDIRLRNTELHAFGWKQESSDSQSA 460

Query: 377  LISQSWSHDGWHLSSGSLDPAIHLFDIRCNGKGPSQTIHAHQRRVFKAIWHQSIPLMTSV 198
            LI+Q+WS DG +++SGS DP IH+FDIR     PSQ+I  HQ+RVF+A+W QSIPL+ S+
Sbjct: 461  LINQAWSPDGLYITSGSADPVIHIFDIRYTAHKPSQSIRVHQKRVFRAMWLQSIPLVISI 520

Query: 197  SSDHNIGLHEM 165
            SSD NIGLH++
Sbjct: 521  SSDLNIGLHKV 531


>ref|XP_002321752.1| predicted protein [Populus trichocarpa] gi|222868748|gb|EEF05879.1|
            predicted protein [Populus trichocarpa]
          Length = 484

 Score =  419 bits (1077), Expect = e-114
 Identities = 228/470 (48%), Positives = 308/470 (65%), Gaps = 8/470 (1%)
 Frame = -2

Query: 1550 EHQLRKTCIMVVQLEEAEKRLVAAQLKLRRQSSSSELQAGNR-TAGNGSSSGKSRPQLLI 1374
            +H  ++      QL EAEKRL  +Q+KL R S      A N+ +  NG  + K   +   
Sbjct: 20   QHLKQRVSYYQTQLVEAEKRLEESQVKLGRLSGKGNATAPNKPSVENGIKNVKVERKS-- 77

Query: 1373 PSVDRSSKTHMAAKGPLLIGSTPKPKSRVNAEDSKVASSSRPTASLGSQMR---HKIEQK 1203
            PS  R ++    ++  +      K  S V  E S   SSS     +  Q R    KIEQK
Sbjct: 78   PSPVRVNEASPGSQPHVADVKVEKSNSDVKVERSH-KSSSPDVEVIEIQDRGTKRKIEQK 136

Query: 1202 DHKDLIPSVGSCSSARLFDFAPMVRIPSQHNRKMRCLEANPTDDRMFVTSALNGTVNVWK 1023
            +HK+LIP V   SS           IPSQH RK+R +   P +D++FV+SAL+G V++W+
Sbjct: 137  EHKELIPLVSRSSSPCTVHCHTSNHIPSQHKRKLRSVAVCPANDQLFVSSALDGMVHLWQ 196

Query: 1022 LQADGSKPNALLLSTIDCEIPERKKWPEDLAWHPNGEMLFCAYSADGGGPQVSTVDL-KA 846
            LQA GS   A +LST DC  P +++WPED+AWHP G  LF AY+AD G  Q+S ++L K 
Sbjct: 197  LQARGS--GASILSTTDCVSPLQRRWPEDIAWHPLGNSLFSAYTADSGDSQISILNLNKM 254

Query: 845  TGKDRVAFLAKKPHLKGTINSIIFLHGIDLCFATGGSDHAVILWKRRN--GFWKPKVLHQ 672
             G+ RV FL  KPH+KGTINSI F+   + CF TG SDH V+LW  ++    WKPK+LH+
Sbjct: 255  QGRARVTFLDDKPHIKGTINSIEFMPWENTCFVTGCSDHGVVLWNEKDDENLWKPKILHR 314

Query: 671  NQHSSTVKGVASLQHMKILLSVGLDKKIIGFDILADKCGFQHQINHQCMSVLPNPCDVNL 492
            N HSS V GVA +Q  +I+LS G DK+I+GFD+   +  F+HQ++ +CMSVLPNPCD NL
Sbjct: 315  NLHSSAVMGVAGMQQKQIVLSAGADKRIVGFDVQVGRADFKHQLDSKCMSVLPNPCDFNL 374

Query: 491  YMVQTSEHGRQLRLYDIRSREREIHTFGWKQ-SSESKSGLISQSWSHDGWHLSSGSLDPA 315
            +MVQT  HG+QLRL+D R ++ EIH+FG+KQ SS+S+S L +Q+WS DG +L+SGS+DP 
Sbjct: 375  FMVQTGTHGKQLRLFDNRLKQMEIHSFGFKQESSDSQSALTNQAWSPDGLYLTSGSVDPV 434

Query: 314  IHLFDIRCNGKGPSQTIHAHQRRVFKAIWHQSIPLMTSVSSDHNIGLHEM 165
            IH+FDIR N   PSQ+I AHQ+RVFKA+WH S+PL+ S+SSD +IGLH++
Sbjct: 435  IHIFDIRYNYDKPSQSIKAHQKRVFKAVWHYSLPLLISISSDLHIGLHKI 484


>ref|XP_003522451.1| PREDICTED: uncharacterized protein LOC100798141 [Glycine max]
          Length = 1494

 Score =  416 bits (1070), Expect = e-114
 Identities = 231/491 (47%), Positives = 307/491 (62%), Gaps = 41/491 (8%)
 Frame = -2

Query: 1514 QLEEAEKRLVAAQLKLRRQ-----SSSSELQAGNRTAGNGSSSG--------------KS 1392
            QL+EAEKRL  ++ KL R      SS S L  G  T    S S               +S
Sbjct: 44   QLDEAEKRLQDSESKLARLRGQTVSSRSTLDDGIETLKTKSRSSSPIDRNEVSTKSQHQS 103

Query: 1391 RPQLLIPSVD-----------RSSKTHMAAKGPL--------LIGSTPKPKSRVNAEDSK 1269
            +P+LLIPS++            SSK  + +            + G + + KS   ++ ++
Sbjct: 104  KPELLIPSLNPKISQPVLLPKSSSKASITSSSEATPGVHNSPITGGSSRGKSD-KSQSNR 162

Query: 1268 VASSSRPTASLGSQMRHKIEQKDHKDLIPSVGSCSSARLFDFAPMVRIPSQHNRKMRCLE 1089
            ++S  + T       + K EQK HK+LIP V   SS  L        I SQH RK+R + 
Sbjct: 163  LSSEQQKTEVKEKGTKRKFEQKGHKELIPLVRKSSSPSLVTCQTSNHISSQHKRKLRSIA 222

Query: 1088 ANPTDDRMFVTSALNGTVNVWKLQADGSKPNALLLSTIDCEIPERKKWPEDLAWHPNGEM 909
              P +D++FVTSAL+G VN W++QA G+   A  LST DC   ++++WPEDLAWHP G  
Sbjct: 223  LCPVNDQLFVTSALDGVVNFWQVQAKGA--GASRLSTTDCASQKQRRWPEDLAWHPEGNS 280

Query: 908  LFCAYSADGGGPQVSTVDL-KATGKDRVAFLAKKPHLKGTINSIIFLHGIDLCFATGGSD 732
            LF  YSADGG  QVS  +L K  G +RV FL  KPH+KG IN I+F+   + CF TGGSD
Sbjct: 281  LFSVYSADGGDSQVSITNLNKGQGGERVKFLEDKPHVKGIINGIVFMPWENTCFVTGGSD 340

Query: 731  HAVILWKRRNGF-WKPKVLHQNQHSSTVKGVASLQHMKILLSVGLDKKIIGFDILADKCG 555
            HAVILW  ++   WKPK LH+N HSS V GVA +Q  +++LSVG D++I G+D+   +  
Sbjct: 341  HAVILWNEQDDEKWKPKALHRNLHSSAVMGVAGMQQKQMVLSVGADRRIFGYDVRVGRAD 400

Query: 554  FQHQINHQCMSVLPNPCDVNLYMVQTSEHGRQLRLYDIRSREREIHTFGWKQ-SSESKSG 378
            F+HQ++ +CMSVLPNP D NL+MVQT  H RQLRL+DIR R  E+H FGWKQ SS+S+S 
Sbjct: 401  FKHQVDSKCMSVLPNPSDFNLFMVQTGTHERQLRLFDIRLRNTELHAFGWKQESSDSQSA 460

Query: 377  LISQSWSHDGWHLSSGSLDPAIHLFDIRCNGKGPSQTIHAHQRRVFKAIWHQSIPLMTSV 198
            LI+Q+WS DG +++SGS DP IH+FDIR     PSQ+I AHQ+RVF+A+W QSIPL+ S+
Sbjct: 461  LINQAWSPDGHYITSGSADPVIHIFDIRYTAHKPSQSIRAHQKRVFRAMWLQSIPLVISI 520

Query: 197  SSDHNIGLHEM 165
            SSD NIGLH++
Sbjct: 521  SSDLNIGLHKV 531


>ref|XP_004145140.1| PREDICTED: uncharacterized protein LOC101220523 [Cucumis sativus]
            gi|449473874|ref|XP_004154008.1| PREDICTED:
            uncharacterized protein LOC101207060 [Cucumis sativus]
            gi|449523433|ref|XP_004168728.1| PREDICTED:
            uncharacterized LOC101207060 [Cucumis sativus]
          Length = 524

 Score =  416 bits (1068), Expect = e-113
 Identities = 234/481 (48%), Positives = 310/481 (64%), Gaps = 32/481 (6%)
 Frame = -2

Query: 1514 QLEEAEKRLVAAQLKL---RRQSSSSELQAGNR----------TAGNGSSSGK-SRPQLL 1377
            QLEEA+KRL   + KL   RRQS++   +   R          T   GS     S+P+L+
Sbjct: 45   QLEEAQKRLQDTESKLARLRRQSNAVSSKDSLRSRAVSVKVEQTVNEGSRPQPVSKPELV 104

Query: 1376 IPSV--------------DRSSKTHMAAKGPLLIGSTPKPKSRVNAEDSKVASSSRPTAS 1239
            IP+V               ++S +  A   P  I +  K +   N  ++ +  +S  T  
Sbjct: 105  IPAVVPKTSQNSALAGNGAKASNSSRAQSSPSHIKNVVKVEGDKNIGNTSLRETSN-TPD 163

Query: 1238 LGSQMRHKIEQKDHKDLIPSVGSCSSARLFDFAPMVRIPSQHNRKMRCLEANPTDDRMFV 1059
             G++ + + + K+HK+LIP + S SS           I SQH RK+R L + P ++++FV
Sbjct: 164  RGTKRKLEQQFKEHKELIPLIRSSSSPSQIRCVGSNHISSQHKRKLRSLISCPVNEQLFV 223

Query: 1058 TSALNGTVNVWKLQADGSKPNALLLSTIDCEIPERKKWPEDLAWHPNGEMLFCAYSADGG 879
            TSAL+G VN+W+LQA GS  +A LLS+ DC  P++++WPED+AWHP G  +F  YSADGG
Sbjct: 224  TSALDGVVNLWQLQARGS--SASLLSSADCVSPKQRRWPEDMAWHPEGNRVFLVYSADGG 281

Query: 878  GPQVSTVDL-KATGKDRVAFLAKKPHLKGTINSIIFLHGIDLCFATGGSDHAVILWKRRN 702
              QVS ++L K+ GK RV FL  KPH+KG INSIIFL      F TGGSDHAVI WK  +
Sbjct: 282  DSQVSIMNLNKSEGKARVTFLEDKPHVKGIINSIIFLPWDSTSFITGGSDHAVIQWKEGD 341

Query: 701  GF--WKPKVLHQNQHSSTVKGVASLQHMKILLSVGLDKKIIGFDILADKCGFQHQINHQC 528
            G   WKPK LH++ HSS V GVA +Q   I+LS G DK+I+GFD+   +  F+HQI  +C
Sbjct: 342  GDKRWKPKALHRSLHSSAVMGVAGMQQKPIVLSAGADKRILGFDVNVGRTEFRHQIESKC 401

Query: 527  MSVLPNPCDVNLYMVQTSEHGRQLRLYDIRSREREIHTFGWKQ-SSESKSGLISQSWSHD 351
            MS+LPNPCD NL+MVQT   G+QLRLYDIR R+ E+H FGW+Q SSES+S LI+Q+WS D
Sbjct: 402  MSILPNPCDFNLFMVQTGTPGKQLRLYDIRLRQTELHAFGWEQKSSESQSALINQAWSPD 461

Query: 350  GWHLSSGSLDPAIHLFDIRCNGKGPSQTIHAHQRRVFKAIWHQSIPLMTSVSSDHNIGLH 171
            G  L+SGS DP IHLFDIR N   PSQ+I AH +RVFKA+W +S+PL+ S+SSD NIGLH
Sbjct: 462  GLILTSGSADPVIHLFDIRYNLHKPSQSISAHHKRVFKAVWLESLPLLISISSDLNIGLH 521

Query: 170  E 168
            +
Sbjct: 522  K 522

