BLASTX nr result

ID: Dioscorea21_contig00010128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00010128
         (1821 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABF95105.1| Vacuolar sorting protein 9 domain containing prot...   526   e-147
gb|EEC74907.1| hypothetical protein OsI_10843 [Oryza sativa Indi...   521   e-145
ref|XP_002468141.1| hypothetical protein SORBIDRAFT_01g040290 [S...   514   e-143
ref|NP_001141062.1| uncharacterized protein LOC100273143 [Zea ma...   503   e-140
ref|XP_002285827.1| PREDICTED: vacuolar protein sorting-associat...   498   e-138

>gb|ABF95105.1| Vacuolar sorting protein 9 domain containing protein, expressed
            [Oryza sativa Japonica Group] gi|222624609|gb|EEE58741.1|
            hypothetical protein OsJ_10226 [Oryza sativa Japonica
            Group]
          Length = 480

 Score =  526 bits (1354), Expect = e-147
 Identities = 277/467 (59%), Positives = 333/467 (71%), Gaps = 13/467 (2%)
 Frame = +3

Query: 93   MDAGAGGNTDVFGSSTAPLTWHDFLDRMRHPSAADFVKSIKSFIVSFSNRAPDPEKDSAS 272
            MD G GG  D FGS+TAPL WHDFL+RMR PSAADFVKSIK FIV+FSNRAPDPE DSA+
Sbjct: 1    MDGGGGG--DAFGSATAPLAWHDFLERMRQPSAADFVKSIKGFIVTFSNRAPDPEHDSAA 58

Query: 273  VQEFLLNMEGAFKVHMLWXXXXXXXXXXXXXXXXKYIMTKLFNRAFASLPEDVKHDEELY 452
            VQEFL NMEGAF+ H  W                KY+MTKLFNR FAS+PEDVK DEEL+
Sbjct: 59   VQEFLENMEGAFRAHTPWAGSSEEELESAGEGLEKYVMTKLFNRVFASVPEDVKSDEELF 118

Query: 453  EKISLLQQFVTPENLDIKPNFRNETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNL 632
            EK+SLLQQF+ PENLDIKP +++ETSWLLAQKELQKINMYKAPRDKL CILNCCKVINNL
Sbjct: 119  EKMSLLQQFIRPENLDIKPEYQSETSWLLAQKELQKINMYKAPRDKLACILNCCKVINNL 178

Query: 633  LINASIALKDNPPGADDFLPVLIYVTLKANPPQLYSNLLYIERYRRRSRLVSESAYFFTN 812
            L+NASI   +NPPGAD+FLPVLIYVT+KANPPQL+SNLLYI+RYRR+SRLVSE+ YFFTN
Sbjct: 179  LLNASIVSNENPPGADEFLPVLIYVTIKANPPQLHSNLLYIQRYRRQSRLVSEAQYFFTN 238

Query: 813  IVSAESFIWNITAESLSMDEMEFQRKMDSARAYLLGQSTNIENLKDQANESIPDHKTQA- 989
            I+SAESFIWNI  ESLSMDE +FQ+KMD AR  +LG S + EN  +Q N  + + K+Q  
Sbjct: 239  ILSAESFIWNIDGESLSMDERDFQKKMDLARERMLGLSASSENQDNQNNLDVREQKSQTL 298

Query: 990  -----SEVQRSVGSYMQ------KNVYNVPPTPIVKAPSLSDLENKGASEILKDDQPIKY 1136
                 S+V  S+    Q      +   +    P+ +  S+SDLE KGA+E+LKDD   K 
Sbjct: 299  KASRDSDVNLSLKDNFQGPGLEMRRDSDASSNPVERVQSISDLEKKGAAELLKDDDLNKK 358

Query: 1137 FQEYPFLFAQAGDLTVDDVENLLNCYKQVVFKYISLSKGMGIPAASPAASGTQSQSKADT 1316
             QEYPFLFA++GDLTV DVENLLN YKQ+V KY++LS+GMGI   +P     Q+ S    
Sbjct: 359  IQEYPFLFARSGDLTVADVENLLNSYKQLVLKYVALSQGMGINLENPPVQSMQTVSDLVE 418

Query: 1317 DKXXXXXXXXXXXXXXKPEIIQRSNDSSSENLFSGMDET-EQKTTID 1454
             +                   + S+D  ++ L+S +D T  Q+T +D
Sbjct: 419  SEEPKNVKNAVNFSEGSS---KTSDDIKNDTLYSEVDNTGTQQTAVD 462


>gb|EEC74907.1| hypothetical protein OsI_10843 [Oryza sativa Indica Group]
          Length = 470

 Score =  521 bits (1343), Expect = e-145
 Identities = 268/416 (64%), Positives = 314/416 (75%), Gaps = 12/416 (2%)
 Frame = +3

Query: 93   MDAGAGGNTDVFGSSTAPLTWHDFLDRMRHPSAADFVKSIKSFIVSFSNRAPDPEKDSAS 272
            MD G GG  D FGS+TAPL WHDFL+RMR PSAADFVKSIK FIV+FSNRAPDPE DSA+
Sbjct: 1    MDGGGGG--DAFGSATAPLAWHDFLERMRQPSAADFVKSIKGFIVTFSNRAPDPEHDSAA 58

Query: 273  VQEFLLNMEGAFKVHMLWXXXXXXXXXXXXXXXXKYIMTKLFNRAFASLPEDVKHDEELY 452
            VQEFL NMEGAF+ H  W                KY+MTKLFNR FAS+PEDVK DEEL+
Sbjct: 59   VQEFLENMEGAFRAHTPWAGSSEEELESAGEGLEKYVMTKLFNRVFASVPEDVKSDEELF 118

Query: 453  EKISLLQQFVTPENLDIKPNFRNETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNL 632
            EK+SLLQQF+ PENLDIKP +++ETSWLLAQKELQKINMYKAPRDKL CILNCCKVINNL
Sbjct: 119  EKMSLLQQFIRPENLDIKPEYQSETSWLLAQKELQKINMYKAPRDKLACILNCCKVINNL 178

Query: 633  LINASIALKDNPPGADDFLPVLIYVTLKANPPQLYSNLLYIERYRRRSRLVSESAYFFTN 812
            L+NASI   +NPPGAD+FLPVLIYVT+KANPPQL+SNLLYI+RYR +SRLVSE+ YFFTN
Sbjct: 179  LLNASIVSNENPPGADEFLPVLIYVTIKANPPQLHSNLLYIQRYRPQSRLVSEAQYFFTN 238

Query: 813  IVSAESFIWNITAESLSMDEMEFQRKMDSARAYLLGQSTNIENLKDQANESIPDHKTQA- 989
            I+SAESFIWNI  ESLSMDE +FQ+KMD AR  LLG S + EN  +Q N  + + K+Q  
Sbjct: 239  ILSAESFIWNIDGESLSMDERDFQKKMDLARERLLGLSASSENQDNQNNLDVREQKSQTL 298

Query: 990  -----SEVQRSVGSYMQ------KNVYNVPPTPIVKAPSLSDLENKGASEILKDDQPIKY 1136
                 S+V  S+    Q      +   +    P+ +  S+SDLE KGA+E+LKDD   K 
Sbjct: 299  KASRDSDVNLSLKDNFQGPGLEMRRDSDASSNPVERVQSISDLEKKGAAELLKDDDLNKK 358

Query: 1137 FQEYPFLFAQAGDLTVDDVENLLNCYKQVVFKYISLSKGMGIPAASPAASGTQSQS 1304
             QEYPFLFA++GDLTV DVENLLN YKQ+V KY++LS+GMGI   +P     Q+ S
Sbjct: 359  IQEYPFLFARSGDLTVADVENLLNSYKQLVLKYVALSQGMGINLENPPVQSMQTVS 414


>ref|XP_002468141.1| hypothetical protein SORBIDRAFT_01g040290 [Sorghum bicolor]
            gi|241921995|gb|EER95139.1| hypothetical protein
            SORBIDRAFT_01g040290 [Sorghum bicolor]
          Length = 470

 Score =  514 bits (1325), Expect = e-143
 Identities = 269/464 (57%), Positives = 328/464 (70%), Gaps = 14/464 (3%)
 Frame = +3

Query: 111  GNTDVFGSSTAPLTWHDFLDRMRHPSAADFVKSIKSFIVSFSNRAPDPEKDSASVQEFLL 290
            GN D   SSTAPL WHDFL+RMR PSAA+FVKSIKSFIV+FSNRAPDPEKDSA+VQEFL 
Sbjct: 3    GNADA--SSTAPLAWHDFLERMRQPSAAEFVKSIKSFIVTFSNRAPDPEKDSAAVQEFLE 60

Query: 291  NMEGAFKVHMLWXXXXXXXXXXXXXXXXKYIMTKLFNRAFASLPEDVKHDEELYEKISLL 470
            NMEGAF+ H  W                KY+MTKLFNR FAS+PEDVK DEEL+EK+SLL
Sbjct: 61   NMEGAFRAHTPWAGSSEEELKSAGEGLEKYVMTKLFNRVFASVPEDVKSDEELFEKMSLL 120

Query: 471  QQFVTPENLDIKPNFRNETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNLLINASI 650
            QQF+ PENLDIKP ++NETSWLLAQKELQKINMYKAPRDKL CILNCCKVINNLL+NASI
Sbjct: 121  QQFIRPENLDIKPEYQNETSWLLAQKELQKINMYKAPRDKLACILNCCKVINNLLLNASI 180

Query: 651  ALKDNPPGADDFLPVLIYVTLKANPPQLYSNLLYIERYRRRSRLVSESAYFFTNIVSAES 830
               +NPPGAD+FLPVLIYVT+KANPPQL+SNLLYI+RYRR++RLVSE+ YFFTNI+SAES
Sbjct: 181  VSNENPPGADEFLPVLIYVTIKANPPQLHSNLLYIQRYRRQTRLVSEAQYFFTNILSAES 240

Query: 831  FIWNITAESLSMDEMEFQRKMDSARAYLLGQSTNIENLKDQANESIPDHKTQ-------- 986
            FIWNI  ESLSM+E++FQRKMDSAR  LLG S + EN   QAN  + D K+Q        
Sbjct: 241  FIWNIDGESLSMNELDFQRKMDSARERLLGLSADSENQDSQANPDVQDWKSQNLKANRNS 300

Query: 987  ------ASEVQRSVGSYMQKNVYNVPPTPIVKAPSLSDLENKGASEILKDDQPIKYFQEY 1148
                     VQ S     + +   V    + +  S+SDLE KGA+E+L +D   K FQEY
Sbjct: 301  DASLSLKDHVQGSGQDMRRDSDVTVSGKHVEQVQSVSDLEKKGAAELLNEDDLNKKFQEY 360

Query: 1149 PFLFAQAGDLTVDDVENLLNCYKQVVFKYISLSKGMGIPAASPAASGTQSQSKADTDKXX 1328
            PFLFA+AGDLTV DVE+LLN YKQ+V +Y++L++GMG+   +  A   Q+     ++   
Sbjct: 361  PFLFARAGDLTVADVESLLNSYKQLVLRYVALAQGMGVSPETTLARSGQTSDLVVSEDPD 420

Query: 1329 XXXXXXXXXXXXKPEIIQRSNDSSSENLFSGMDETEQKTTIDEA 1460
                          ++I  ++ S + +     ++  QKT +D +
Sbjct: 421  NLNSVVNDNEKKVDDVISENHHSEAVDT-EASEQMTQKTAVDSS 463


>ref|NP_001141062.1| uncharacterized protein LOC100273143 [Zea mays]
            gi|194702456|gb|ACF85312.1| unknown [Zea mays]
            gi|413956249|gb|AFW88898.1| hypothetical protein
            ZEAMMB73_627333 [Zea mays]
          Length = 483

 Score =  503 bits (1295), Expect = e-140
 Identities = 256/415 (61%), Positives = 310/415 (74%), Gaps = 14/415 (3%)
 Frame = +3

Query: 111  GNTDVFGSSTAPLTWHDFLDRMRHPSAADFVKSIKSFIVSFSNRAPDPEKDSASVQEFLL 290
            G+ D FGS TAPL WHDFL+RMR PSAA+FVKSIKSFIV+FSNRAPDPEKDS ++QEFL 
Sbjct: 3    GSADSFGSLTAPLAWHDFLERMRQPSAAEFVKSIKSFIVTFSNRAPDPEKDSTAIQEFLE 62

Query: 291  NMEGAFKVHMLWXXXXXXXXXXXXXXXXKYIMTKLFNRAFASLPEDVKHDEELYEKISLL 470
            NMEGAF+ H  W                KY+MTKLFNR FAS+PEDVK DEEL+EK+SLL
Sbjct: 63   NMEGAFRAHTPWAGSSEEELESAGEGLEKYVMTKLFNRVFASVPEDVKSDEELFEKMSLL 122

Query: 471  QQFVTPENLDIKPNFRNETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNLLINASI 650
            QQFV PENLDIKP ++NETSWLLAQKELQKINMYKAPRDKL CILNCCKVINNLL+NASI
Sbjct: 123  QQFVRPENLDIKPEYQNETSWLLAQKELQKINMYKAPRDKLACILNCCKVINNLLLNASI 182

Query: 651  ALKDNPPGADDFLPVLIYVTLKANPPQLYSNLLYIERYRRRSRLVSESAYFFTNIVSAES 830
               + PPGAD+FLPVLIYVT+KANPPQL+SNLLYI+RYRR++RLVSE+ YFFTNI+SAES
Sbjct: 183  VSNETPPGADEFLPVLIYVTIKANPPQLHSNLLYIQRYRRQTRLVSEAQYFFTNILSAES 242

Query: 831  FIWNITAESLSMDEMEFQRKMDSARAYLLGQSTNIENLKDQANESIPDHKTQA------- 989
            FIWNI  ESLSM+E++FQR+MDSAR  +LG S + E   +QAN  + D  +Q+       
Sbjct: 243  FIWNIDGESLSMNELDFQRRMDSARERMLGLSADSEYQDNQANPDVQDRTSQSLGANRNS 302

Query: 990  -------SEVQRSVGSYMQKNVYNVPPTPIVKAPSLSDLENKGASEILKDDQPIKYFQEY 1148
                     VQ S     + +   V      +  S+S+LE KG +E+L +D   K FQEY
Sbjct: 303  DASLSLKDHVQGSGQDMRRDSDVTVSGKQAEQVQSISELEKKGTAELLNEDDLNKKFQEY 362

Query: 1149 PFLFAQAGDLTVDDVENLLNCYKQVVFKYISLSKGMGIPAASPAASGTQSQSKAD 1313
            PFLFA+AGDLT+ DVE+LLN YK +V +Y++L++GMG+   SP  + TQ+   +D
Sbjct: 363  PFLFARAGDLTIADVESLLNSYKHLVLRYVALAQGMGV---SPETTLTQNGQTSD 414


>ref|XP_002285827.1| PREDICTED: vacuolar protein sorting-associated protein 9A isoform 1
            [Vitis vinifera]
          Length = 463

 Score =  498 bits (1283), Expect = e-138
 Identities = 265/459 (57%), Positives = 330/459 (71%), Gaps = 9/459 (1%)
 Frame = +3

Query: 114  NTDVFGSSTAPLTWHDFLDRMRHPSAADFVKSIKSFIVSFSNRAPDPEKDSASVQEFLLN 293
            N D F SSTAPLTWHDFL+RMR PSAADFVK+IKSFIVSFSN APDPE+DSA+VQEFL N
Sbjct: 3    NADPFASSTAPLTWHDFLERMRQPSAADFVKAIKSFIVSFSNNAPDPERDSAAVQEFLAN 62

Query: 294  MEGAFKVHMLWXXXXXXXXXXXXXXXXKYIMTKLFNRAFASLPEDVKHDEELYEKISLLQ 473
            ME AF+ H LW                KY+MTKL+ R FAS+P+D K DE+L+EKI L+Q
Sbjct: 63   MEMAFRAHPLWAGCSEEELESAGEGLEKYVMTKLYTRVFASVPDDSKLDEQLFEKIGLVQ 122

Query: 474  QFVTPENLDIKPNFRNETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNLLINASIA 653
            QF+ PE LDIK  F+NETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNLL+NASIA
Sbjct: 123  QFIRPEQLDIKTTFQNETSWLLAQKELQKINMYKAPRDKLVCILNCCKVINNLLLNASIA 182

Query: 654  LKDNPPGADDFLPVLIYVTLKANPPQLYSNLLYIERYRRRSRLVSESAYFFTNIVSAESF 833
              ++PPGAD+FLPVLIYVTLKANPPQL+SNLLYI RYRR+SR+V+E+AYFFTN++SAESF
Sbjct: 183  SNEDPPGADEFLPVLIYVTLKANPPQLHSNLLYILRYRRQSRMVAEAAYFFTNMLSAESF 242

Query: 834  IWNITAESLSMDEMEFQRKMDSARAYLLGQSTNIEN-LKDQANESIPDHKTQASEVQRSV 1010
            I NI AESLSMDE EF+  M+SARA L G S++++  LK+   +S+   K +   +   +
Sbjct: 243  ISNINAESLSMDEREFEMNMESARALLSGLSSDLDGVLKEPQQKSLYSTKEKDPSIGSDL 302

Query: 1011 GSYMQKNVYNVPPTP------IVKAPSLSDLENKGASEILKDDQPIKYFQEYPFLFAQAG 1172
                 +        P      I K PS+SDLENKGA+ +LK+DQ    F+EYP+L+A  G
Sbjct: 303  SLLSSEATSGAKLEPHAKDQLITKVPSISDLENKGAAMLLKEDQASLAFREYPYLYANVG 362

Query: 1173 DLTVDDVENLLNCYKQVVFKYISLSKGMGIPA--ASPAASGTQSQSKADTDKXXXXXXXX 1346
            DLTV+DVE+LLN YKQ+VFK++ LSKG+G+PA     + S TQ+Q  A+T K        
Sbjct: 363  DLTVNDVEDLLNHYKQLVFKHVCLSKGLGVPAPPLPLSISQTQAQKHAETMKDSADTRAA 422

Query: 1347 XXXXXXKPEIIQRSNDSSSENLFSGMDETEQKTTIDEAI 1463
                    +I   ++ S+  +LF  ++ +E K   +EA+
Sbjct: 423  EVKDNTLNDIGSTNDVSNQVSLFE-VETSESKLPQEEAV 460

