BLASTX nr result

ID: Dioscorea21_contig00038709 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00038709
         (389 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI28363.3| unnamed protein product [Vitis vinifera]              140   1e-31
ref|XP_002269762.1| PREDICTED: pentatricopeptide repeat-containi...   140   1e-31
gb|AFW71662.1| hypothetical protein ZEAMMB73_019211 [Zea mays]        121   5e-26
ref|XP_002453928.1| hypothetical protein SORBIDRAFT_04g021580 [S...   119   3e-25
ref|XP_003575063.1| PREDICTED: pentatricopeptide repeat-containi...   118   6e-25

>emb|CBI28363.3| unnamed protein product [Vitis vinifera]
          Length = 930

 Score =  140 bits (353), Expect = 1e-31
 Identities = 69/129 (53%), Positives = 92/129 (71%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +KMG DSC +L NSLMDFY +   +  MR+VF  +  K+LVSWNT I+ Y  +    EAL
Sbjct: 249 IKMGFDSCLYLENSLMDFYAKCGDLEGMRRVFSHMSEKNLVSWNTFINGYVHNFHYLEAL 308

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
            +F +LM E  +CDD++L S+L+AV+ L  L+HGKEIHGY+L+AGI  N YV+S+LL MY
Sbjct: 309 RIFQILMEEVSQCDDFSLLSILKAVSGLGHLDHGKEIHGYILRAGIETNRYVVSSLLDMY 368

Query: 363 IECNDNNSL 389
           I C D+ SL
Sbjct: 369 IGCIDHESL 377



 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 34/117 (29%), Positives = 59/117 (50%)
 Frame = +3

Query: 27  TFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLML 206
           +F+ N+L+  YG    + +   VF G+   DLV W++ +S Y  +    E L +F  ++ 
Sbjct: 156 SFVENALVSMYGSCGALEDAAVVFGGIDKPDLVGWSSILSGYVKNGLEEEGLRIFCDMVS 215

Query: 207 EAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMYIECND 377
              E D +    VL A T+L   + G + H Y++K G     Y+ ++L+  Y +C D
Sbjct: 216 GGIEPDAFAFSMVLGACTNLECWDFGTQAHCYIIKMGFDSCLYLENSLMDFYAKCGD 272



 Score = 59.3 bits (142), Expect = 3e-07
 Identities = 32/113 (28%), Positives = 59/113 (52%)
 Frame = +3

Query: 30  FLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLMLE 209
           F+M SL+ +      +   ++VF  +   D   W+  IS ++ +    EAL LF  +  +
Sbjct: 399 FIMTSLLKWCSLESSLETAKRVFTRVEQPDTAPWSALISGHSWNGCFAEALKLFRKMQFD 458

Query: 210 AFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMYIE 368
             + +++T  SV+ A   L +L  GKE+H  +L++G   N  V++ L+ +Y E
Sbjct: 459 GIKANEFTFTSVILACLALENLRKGKELHCKILRSGYESNFSVVNTLINLYSE 511



 Score = 58.2 bits (139), Expect = 7e-07
 Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 4/119 (3%)
 Frame = +3

Query: 27  TFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLML 206
           T L N  +  Y  +  M E RK+FD +  + LVSW   +S Y     ++E L +F  ++ 
Sbjct: 51  TRLFNLYLRMYVNAGAMQEARKLFDEMPERSLVSWTIVMSGYARHGPASEVLMMFWDMLC 110

Query: 207 EA----FECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMYIEC 371
            +       D +    VL+A   +  L++G+ +HG V+K     +S+V +AL+ MY  C
Sbjct: 111 GSGGGLLRPDSFVFAVVLRACGMVECLSYGRGVHGLVVKQSSVVDSFVENALVSMYGSC 169


>ref|XP_002269762.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g13650-like [Vitis vinifera]
          Length = 906

 Score =  140 bits (353), Expect = 1e-31
 Identities = 69/129 (53%), Positives = 92/129 (71%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +KMG DSC +L NSLMDFY +   +  MR+VF  +  K+LVSWNT I+ Y  +    EAL
Sbjct: 190 IKMGFDSCLYLENSLMDFYAKCGDLEGMRRVFSHMSEKNLVSWNTFINGYVHNFHYLEAL 249

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
            +F +LM E  +CDD++L S+L+AV+ L  L+HGKEIHGY+L+AGI  N YV+S+LL MY
Sbjct: 250 RIFQILMEEVSQCDDFSLLSILKAVSGLGHLDHGKEIHGYILRAGIETNRYVVSSLLDMY 309

Query: 363 IECNDNNSL 389
           I C D+ SL
Sbjct: 310 IGCIDHESL 318



 Score = 66.2 bits (160), Expect = 3e-09
 Identities = 35/122 (28%), Positives = 67/122 (54%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +K+ L S +++++SL+D Y +       ++VF  +   D   W+  IS ++ +    EAL
Sbjct: 366 IKLDLKSDSYVLSSLIDMYSKCGIWEAAKRVFTRVEQPDTAPWSALISGHSWNGCFAEAL 425

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
            LF  +  +  + +++T  SV+ A   L +L  GKE+H  +L++G   N  V++ L+ +Y
Sbjct: 426 KLFRKMQFDGIKANEFTFTSVILACLALENLRKGKELHCKILRSGYESNFSVVNTLINLY 485

Query: 363 IE 368
            E
Sbjct: 486 SE 487



 Score = 63.9 bits (154), Expect = 1e-08
 Identities = 34/117 (29%), Positives = 59/117 (50%)
 Frame = +3

Query: 27  TFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLML 206
           +F+ N+L+  YG    + +   VF G+   DLV W++ +S Y  +    E L +F  ++ 
Sbjct: 97  SFVENALVSMYGSCGALEDAAVVFGGIDKPDLVGWSSILSGYVKNGLEEEGLRIFCDMVS 156

Query: 207 EAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMYIECND 377
              E D +    VL A T+L   + G + H Y++K G     Y+ ++L+  Y +C D
Sbjct: 157 GGIEPDAFAFSMVLGACTNLECWDFGTQAHCYIIKMGFDSCLYLENSLMDFYAKCGD 213



 Score = 54.7 bits (130), Expect = 8e-06
 Identities = 33/109 (30%), Positives = 56/109 (51%), Gaps = 4/109 (3%)
 Frame = +3

Query: 57  YGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLMLEA----FECD 224
           Y  +  M E RK+FD +  + LVSW   +S Y     ++E L +F  ++  +       D
Sbjct: 2   YVNAGAMQEARKLFDEMPERSLVSWTIVMSGYARHGPASEVLMMFWDMLCGSGGGLLRPD 61

Query: 225 DYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMYIEC 371
            +    VL+A   +  L++G+ +HG V+K     +S+V +AL+ MY  C
Sbjct: 62  SFVFAVVLRACGMVECLSYGRGVHGLVVKQSSVVDSFVENALVSMYGSC 110


>gb|AFW71662.1| hypothetical protein ZEAMMB73_019211 [Zea mays]
          Length = 798

 Score =  121 bits (304), Expect = 5e-26
 Identities = 60/121 (49%), Positives = 80/121 (66%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +KMGL    FL N L+ FYGRS  +  MR VFD +  KDLVSWNT I CY ++    EA 
Sbjct: 246 IKMGLVGKEFLDNCLIGFYGRSGELQLMRNVFDEMNGKDLVSWNTVIQCYAENLCHEEAS 305

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
             F  +M E  ECD++TLGS+LQ VT   +  HG EIHGY+++AG+  + +V+SAL+ MY
Sbjct: 306 AHFRAMMFEFAECDEFTLGSILQVVTRTGAFGHGMEIHGYLIRAGLDSDKHVMSALMDMY 365

Query: 363 I 365
           +
Sbjct: 366 V 366



 Score = 57.4 bits (137), Expect = 1e-06
 Identities = 33/113 (29%), Positives = 56/113 (49%), Gaps = 2/113 (1%)
 Frame = +3

Query: 30  FLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLMLE 209
           F+ N L+  Y     +    KVF  +   DLVSW + +S YT++ +  EAL LF  +   
Sbjct: 152 FVANGLVTMYSSCQSLPCAEKVFGSIASPDLVSWTSMLSAYTENGRDAEALVLFMEMARG 211

Query: 210 AFECDDYTLGSVLQAVTDLS--SLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
              CD +TL   L+A + L    +  G ++H  ++K G+    ++ + L+  Y
Sbjct: 212 GVACDAFTLSVALRAASSLGHVGVGLGHQLHCCMIKMGLVGKEFLDNCLIGFY 264



 Score = 57.0 bits (136), Expect = 2e-06
 Identities = 35/113 (30%), Positives = 59/113 (52%), Gaps = 3/113 (2%)
 Frame = +3

Query: 42  SLMDFYGRSCRMSEMRKVFDGLFHK--DLVSWNTAISCYTDDHQSNEALDLFHLLMLEAF 215
           SL+  + R+ RM   R+VFD +  +   LV+W T +S Y     ++EAL+L   ++    
Sbjct: 52  SLLRAHARAGRMQPAREVFDAMPDRGRSLVAWTTLMSGYATHGPASEALELLLCMLGLLV 111

Query: 216 ECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAG-IWYNSYVISALLVMYIEC 371
             D +     L+A   + SL  G+++HG V K G +  + +V + L+ MY  C
Sbjct: 112 RPDAFVFSVALRACAAVGSLRLGRQLHGAVAKLGYVGVDLFVANGLVTMYSSC 164


>ref|XP_002453928.1| hypothetical protein SORBIDRAFT_04g021580 [Sorghum bicolor]
           gi|241933759|gb|EES06904.1| hypothetical protein
           SORBIDRAFT_04g021580 [Sorghum bicolor]
          Length = 798

 Score =  119 bits (298), Expect = 3e-25
 Identities = 59/121 (48%), Positives = 79/121 (65%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +KMGL    FL N L+ FYGRS  +  MR VFD +  KDLVSWNT I CY ++    EA 
Sbjct: 245 IKMGLVGKEFLDNCLIGFYGRSGELQLMRNVFDEMNGKDLVSWNTIIQCYAENLCHEEAS 304

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
             F  +M E  ECD++TLGS+L  VT   +  HG EIHGY+++AG+  + +V+SAL+ MY
Sbjct: 305 AHFRAMMFEFAECDEFTLGSILHVVTATGAFGHGMEIHGYLIRAGLDSDKHVMSALIDMY 364

Query: 363 I 365
           +
Sbjct: 365 V 365



 Score = 62.4 bits (150), Expect = 4e-08
 Identities = 33/111 (29%), Positives = 56/111 (50%)
 Frame = +3

Query: 30  FLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEALDLFHLLMLE 209
           F+ N L+  Y     +    KVF  +   DLVSW + +S YT++    EAL LF  +  +
Sbjct: 153 FVANGLVTMYSSCQSLRCAEKVFGSITSPDLVSWTSMLSAYTENGCDAEALMLFMEMARD 212

Query: 210 AFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
              CD +TL   L+A + L  +  G ++H  ++K G+    ++ + L+  Y
Sbjct: 213 GIACDAFTLSVALRAASSLGHVGLGHQLHCCMIKMGLVGKEFLDNCLIGFY 263



 Score = 55.1 bits (131), Expect = 6e-06
 Identities = 35/113 (30%), Positives = 59/113 (52%), Gaps = 3/113 (2%)
 Frame = +3

Query: 42  SLMDFYGRSCRMSEMRKVFDGLFHK--DLVSWNTAISCYTDDHQSNEALDLFHLLMLEAF 215
           SL+  + R+ RM   R+VFD +  +   LV+W T +S Y     ++EAL+L   ++    
Sbjct: 53  SLLRAHVRAGRMRPAREVFDAMPDRGRSLVAWTTLMSGYATHGPASEALELLLCMLGLLV 112

Query: 216 ECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAG-IWYNSYVISALLVMYIEC 371
             D +     L+A   + SL  G+++HG V K G +  + +V + L+ MY  C
Sbjct: 113 RPDAFVFSVALRACAAVGSLGLGRQLHGAVAKLGYVGADLFVANGLVTMYSSC 165



 Score = 55.1 bits (131), Expect = 6e-06
 Identities = 30/121 (24%), Positives = 60/121 (49%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +K  ++  +F+ +SL+D Y +   + E   +F    +     W+ AIS    + Q   A+
Sbjct: 421 LKSNMNPDSFVTSSLVDMYAKCGALEESNMLFSRTKNPGTAVWSAAISGNCLNGQYGRAV 480

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
            LF  +  E  + +++T  ++L A   L   + G EIH   +++G   N+ V+ +L+  Y
Sbjct: 481 HLFRRMQSEHVQPNEFTYTAILTACMALGDTDSGMEIHSNSIRSGYGTNTSVLKSLITFY 540

Query: 363 I 365
           +
Sbjct: 541 L 541


>ref|XP_003575063.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g39530-like [Brachypodium distachyon]
          Length = 789

 Score =  118 bits (295), Expect = 6e-25
 Identities = 55/121 (45%), Positives = 82/121 (67%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +K+G  +  FL N L++FYG+S  +  M+KVFD +  KDLVS NT I CY D+    +AL
Sbjct: 235 IKLGFSNSGFLENCLIEFYGKSSELHLMQKVFDDMDDKDLVSSNTVIQCYADNMCDEQAL 294

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
             F  +M E  ECD++TLGS+L  VT   + ++G EIHGY+++AG+  + +V+SAL+ MY
Sbjct: 295 SHFRAMMFEGSECDEFTLGSILHVVTRRGAFDYGMEIHGYLIRAGLDSDKHVMSALMDMY 354

Query: 363 I 365
           +
Sbjct: 355 V 355



 Score = 66.6 bits (161), Expect = 2e-09
 Identities = 37/120 (30%), Positives = 66/120 (55%), Gaps = 1/120 (0%)
 Frame = +3

Query: 6   KMG-LDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           KMG + +  F+ N L+  Y     +    KVF+G+   DLVSW + +S YT++    EA+
Sbjct: 134 KMGYVGADLFVANGLLTMYASCRSLGCAEKVFNGIATPDLVSWTSMLSGYTENGCHAEAV 193

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
            LF  ++     CD +TL   L+A + L++L+ G ++H  ++K G   + ++ + L+  Y
Sbjct: 194 MLFVEMVHAGIRCDAFTLSVALRAASSLANLSLGHQLHCCIIKLGFSNSGFLENCLIEFY 253



 Score = 58.9 bits (141), Expect = 4e-07
 Identities = 34/121 (28%), Positives = 61/121 (50%)
 Frame = +3

Query: 3   VKMGLDSCTFLMNSLMDFYGRSCRMSEMRKVFDGLFHKDLVSWNTAISCYTDDHQSNEAL 182
           +K+ ++S  F+ +SL+D Y +   + E   +F    +     W+  IS    + Q   AL
Sbjct: 411 LKLNMNSDAFVTSSLVDMYAKCGCLEESHLLFSTTKYPGTAEWSAVISGNCLNGQFERAL 470

Query: 183 DLFHLLMLEAFECDDYTLGSVLQAVTDLSSLNHGKEIHGYVLKAGIWYNSYVISALLVMY 362
            LF  + L+    +++T  SVL A  DL  +  G EIHG  ++ G   ++ V+ +L+  Y
Sbjct: 471 HLFRRMQLDHVRPNEFTYTSVLTACIDLGDVVGGIEIHGNSVRNGYGTHASVVKSLISFY 530

Query: 363 I 365
           +
Sbjct: 531 L 531

