BLASTX nr result

ID: Dioscorea21_contig00016274 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00016274
         (921 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAT85295.1| FYVE zinc finger containing protein [Oryza sativa...   162   1e-37
gb|AAP44653.1| unknown protein [Oryza sativa Japonica Group]          162   1e-37
ref|XP_002456991.1| hypothetical protein SORBIDRAFT_03g046860 [S...   151   2e-34
ref|XP_002305636.1| predicted protein [Populus trichocarpa] gi|2...   149   1e-33
ref|XP_002452359.1| hypothetical protein SORBIDRAFT_04g024360 [S...   147   3e-33

>gb|AAT85295.1| FYVE zinc finger containing protein [Oryza sativa Japonica Group]
            gi|108710311|gb|ABF98106.1| FYVE zinc finger family
            protein, expressed [Oryza sativa Japonica Group]
          Length = 1094

 Score =  162 bits (409), Expect = 1e-37
 Identities = 109/229 (47%), Positives = 126/229 (55%), Gaps = 30/229 (13%)
 Frame = -3

Query: 598  EQNLKQGIDIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQ------- 440
            E  +  G D  KDEIL HKRKA+A KREGK+AEAREEL+QAKLLEK LE  Q+       
Sbjct: 831  EPQIPHGHDTLKDEILHHKRKAVAFKREGKMAEAREELKQAKLLEKRLEVSQENSANGRD 890

Query: 439  -------------PQVDSKTAPTQSFDDAPVVQEIKPKQAQKPLSGRERLKLQQESLAHK 299
                          Q  S  + T     AP  QEIKP Q  K LS R+RLK+Q+ESLAHK
Sbjct: 891  ESMKPVVQETNLIQQSASAKSCTDDISSAPPAQEIKPVQPPKALSSRDRLKIQRESLAHK 950

Query: 298  RNALKLRREGKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPDDAVVEDLLDPQ 119
            RNALKLRREGK                                     +DA VEDLLDPQ
Sbjct: 951  RNALKLRREGK----TAEADAEFELAKSLESQLEESESQVSGGKSSDANDAAVEDLLDPQ 1006

Query: 118  LMSVLKSIGWNDSDMF-----SQPSKKAEPK---AVTSK--SEKSHLEE 2
            +MS LKSIGW+D+D+      +QPSKKAE K   A T+K  SEK+ LEE
Sbjct: 1007 IMSALKSIGWSDADLSAQSSNAQPSKKAEAKPTVAATTKPQSEKTQLEE 1055



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 42/102 (41%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
 Frame = -3

Query: 565 KDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLE-EVQQPQVDSKTAPTQSFDDAP 389
           K+++LA KR+A+A K+ G +AEA   LR+AKLLEK LE E  + +V S      +  +  
Sbjct: 459 KEQVLALKREAIAQKKAGNVAEAMSLLRKAKLLEKDLETEQSESKVPSPQGHRSTRTEDI 518

Query: 388 VVQEIKPKQAQKPLSGRERLKLQQESLAHKRNALKLRREGKI 263
            V E+  +    P   + +L +Q+E LA K+ AL LRREGK+
Sbjct: 519 TVAEMNTRPVSAP---KSKLAIQRELLALKKKALALRREGKV 557


>gb|AAP44653.1| unknown protein [Oryza sativa Japonica Group]
          Length = 1142

 Score =  162 bits (409), Expect = 1e-37
 Identities = 109/229 (47%), Positives = 126/229 (55%), Gaps = 30/229 (13%)
 Frame = -3

Query: 598  EQNLKQGIDIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQ------- 440
            E  +  G D  KDEIL HKRKA+A KREGK+AEAREEL+QAKLLEK LE  Q+       
Sbjct: 879  EPQIPHGHDTLKDEILHHKRKAVAFKREGKMAEAREELKQAKLLEKRLEVSQENSANGRD 938

Query: 439  -------------PQVDSKTAPTQSFDDAPVVQEIKPKQAQKPLSGRERLKLQQESLAHK 299
                          Q  S  + T     AP  QEIKP Q  K LS R+RLK+Q+ESLAHK
Sbjct: 939  ESMKPVVQETNLIQQSASAKSCTDDISSAPPAQEIKPVQPPKALSSRDRLKIQRESLAHK 998

Query: 298  RNALKLRREGKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPDDAVVEDLLDPQ 119
            RNALKLRREGK                                     +DA VEDLLDPQ
Sbjct: 999  RNALKLRREGK----TAEADAEFELAKSLESQLEESESQVSGGKSSDANDAAVEDLLDPQ 1054

Query: 118  LMSVLKSIGWNDSDMF-----SQPSKKAEPK---AVTSK--SEKSHLEE 2
            +MS LKSIGW+D+D+      +QPSKKAE K   A T+K  SEK+ LEE
Sbjct: 1055 IMSALKSIGWSDADLSAQSSNAQPSKKAEAKPTVAATTKPQSEKTQLEE 1103



 Score = 67.8 bits (164), Expect = 3e-09
 Identities = 42/102 (41%), Positives = 62/102 (60%), Gaps = 1/102 (0%)
 Frame = -3

Query: 565 KDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLE-EVQQPQVDSKTAPTQSFDDAP 389
           K+++LA KR+A+A K+ G +AEA   LR+AKLLEK LE E  + +V S      +  +  
Sbjct: 507 KEQVLALKREAIAQKKAGNVAEAMSLLRKAKLLEKDLETEQSESKVPSPQGHRSTRTEDI 566

Query: 388 VVQEIKPKQAQKPLSGRERLKLQQESLAHKRNALKLRREGKI 263
            V E+  +    P   + +L +Q+E LA K+ AL LRREGK+
Sbjct: 567 TVAEMNTRPVSAP---KSKLAIQRELLALKKKALALRREGKV 605


>ref|XP_002456991.1| hypothetical protein SORBIDRAFT_03g046860 [Sorghum bicolor]
            gi|241928966|gb|EES02111.1| hypothetical protein
            SORBIDRAFT_03g046860 [Sorghum bicolor]
          Length = 955

 Score =  151 bits (382), Expect = 2e-34
 Identities = 113/285 (39%), Positives = 140/285 (49%), Gaps = 21/285 (7%)
 Frame = -3

Query: 793  SVDAPGPVLDSQSMVVQSGTQKVQHRESLESSNLSQATASVKVGDQITAGMQETKXXXXX 614
            SVD  G   DS S +    T   Q   + ESS+ + +  S +     + G ++       
Sbjct: 618  SVDTLG---DSPSKLQVETTGSKQIHVAKESSDGASSALS-RPSYTNSLGSEKGSHSPSE 673

Query: 613  XXVNTEQNLKQGIDIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQPQ 434
              V  E +   G DI  DEIL HKRKA+A KREGK+AEAREEL+ A+LLEK LE  QQ  
Sbjct: 674  LRVRKEPHKTHGDDILTDEILFHKRKAVAFKREGKMAEAREELKLARLLEKRLEGAQQDN 733

Query: 433  VD--------------------SKTAPTQSFDDAPVVQEIKPKQAQKPLSGRERLKLQQE 314
            +D                    S +  T     AP  Q  K  Q QK +S R+RLK+Q+E
Sbjct: 734  MDGDDNFIAPAGGQSIVAQQRASSSIQTDGVASAPPAQASKSTQPQKVMSSRDRLKIQRE 793

Query: 313  SLAHKRNALKLRREGKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPDDAVVED 134
            SLAHKRNALKLRREGKI                                   P+DA+VED
Sbjct: 794  SLAHKRNALKLRREGKI--AEADAAFELAKALESQLEESDNQGSSSGVKSGEPNDAMVED 851

Query: 133  LLDPQLMSVLKSIGWNDSDMFSQPSKKAEPK-AVTSKSEKSHLEE 2
            LLDPQ+MS LKSIGW+D D+  Q S    PK   TSK + +   E
Sbjct: 852  LLDPQIMSALKSIGWSDMDLSMQSSSTQPPKPPQTSKGQPTQKVE 896



 Score = 63.9 bits (154), Expect = 5e-08
 Identities = 42/106 (39%), Positives = 68/106 (64%), Gaps = 2/106 (1%)
 Frame = -3

Query: 574 DIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQPQVDSKTAPTQSFDD 395
           D+ K+++LA K++A+A +R G +AEA   L++AKLLEK + E ++P  +SK A  +    
Sbjct: 307 DVLKEQMLALKKEAVANRRSGNVAEAMTLLKKAKLLEKDM-ETEEP--ESKVASPEG-QK 362

Query: 394 APVVQEI--KPKQAQKPLSGRERLKLQQESLAHKRNALKLRREGKI 263
             + ++I      A+  L+ R +L +Q+E LA K+ AL LRREGK+
Sbjct: 363 TMLAEDITFAGTTARPVLAHRSKLAIQRELLALKKKALALRREGKV 408



 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 32/64 (50%), Positives = 43/64 (67%), Gaps = 5/64 (7%)
 Frame = -3

Query: 571 IRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQ-----QPQVDSKTAPTQ 407
           I K ++ A KR+AL LKREG+LAEA+EEL++AK+LEK LEE +     +   D   A  Q
Sbjct: 169 INKSQVNALKRQALLLKREGRLAEAKEELKKAKILEKQLEEQEILGETEDSDDDLAAIIQ 228

Query: 406 SFDD 395
           + DD
Sbjct: 229 NMDD 232


>ref|XP_002305636.1| predicted protein [Populus trichocarpa] gi|222848600|gb|EEE86147.1|
            predicted protein [Populus trichocarpa]
          Length = 1213

 Score =  149 bits (375), Expect = 1e-33
 Identities = 96/201 (47%), Positives = 116/201 (57%), Gaps = 4/201 (1%)
 Frame = -3

Query: 595  QNLKQGIDIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQPQVDSKTA 416
            QN K  +   + E+LA KRKA+ALKREGKLAEAREELRQAKLLEKSLE      V     
Sbjct: 971  QNNKNAL---QQEVLARKRKAVALKREGKLAEAREELRQAKLLEKSLEVETLEPVSGTHD 1027

Query: 415  PTQSFDDAPVVQE---IKPKQAQKPLSGRERLKLQQESLAHKRNALKLRREGKIXXXXXX 245
             + S  +AP  Q+     PK + KPLSGR+R KLQQESL+HKR ALKLRREG++      
Sbjct: 1028 GSTSVSNAPPFQQKDPSAPKFSPKPLSGRDRFKLQQESLSHKRQALKLRREGQVEEAEAE 1087

Query: 244  XXXXXXXXXXXXXXXXXXXXXXXXXXXXAPDDAVVEDLLDPQLMSVLKSIGWNDSDMFSQ 65
                                          DD VVED LDPQL+S LK+IG  DS + SQ
Sbjct: 1088 FELAKALEAQLDEMSSNDSGKSSVNIAEPVDDVVVEDFLDPQLLSALKAIGIEDSSIISQ 1147

Query: 64   PSKKAEPKAVT-SKSEKSHLE 5
             S++  P  V+ +KSEK+  E
Sbjct: 1148 SSERPGPAKVSPTKSEKNSQE 1168



 Score = 57.4 bits (137), Expect = 5e-06
 Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 23/127 (18%)
 Frame = -3

Query: 760 QSMVVQSGTQKVQHRES-LESSNLSQATASVKVGDQITAGMQETKXXXXXXXVNTEQNLK 584
           + ++  S T ++Q+ +   ES   S+  A V   D  TA ++E            ++ +K
Sbjct: 208 RKVLSSSNTVEIQNEDGPKESVRKSKRLAQVNEKDSFTAELRELGWSDMDLHDKDKKLVK 267

Query: 583 QGID----------------------IRKDEILAHKRKALALKREGKLAEAREELRQAKL 470
             ++                      I K ++   KRKALALKREGKLAEA+EEL++AK+
Sbjct: 268 MSLEGELSSLLGEISGRTNKNTGSSGIDKTQVFELKRKALALKREGKLAEAKEELKKAKV 327

Query: 469 LEKSLEE 449
           LE+ LEE
Sbjct: 328 LEQQLEE 334


>ref|XP_002452359.1| hypothetical protein SORBIDRAFT_04g024360 [Sorghum bicolor]
            gi|241932190|gb|EES05335.1| hypothetical protein
            SORBIDRAFT_04g024360 [Sorghum bicolor]
          Length = 1103

 Score =  147 bits (371), Expect = 3e-33
 Identities = 102/240 (42%), Positives = 123/240 (51%), Gaps = 41/240 (17%)
 Frame = -3

Query: 598  EQNLKQGIDIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQPQVD--- 428
            E    QG D  KD+IL HKRKA+A KREGK+AEAREEL+ AKLLEK L+  QQ  +D   
Sbjct: 828  EHQKTQGDDTLKDDILLHKRKAVAFKREGKMAEAREELKLAKLLEKRLQGAQQDSMDGVG 887

Query: 427  -----------------SKTAPTQSFDDAPVVQEIKPKQAQKPLSGRERLKLQQESLAHK 299
                             S +  T     AP  Q  K  Q QK +S R+RLK+Q+ESLAHK
Sbjct: 888  DSITPAVEQNIVVQQPASSSNHTDDVTSAPPAQVSKSTQPQKAMSSRDRLKIQRESLAHK 947

Query: 298  RNALKLRREGKIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAPDDAVVEDLLDPQ 119
            RNALKLRREGK                                    P+DA+VE+LLDPQ
Sbjct: 948  RNALKLRREGK--TAEADAEFELAKELESQLEEPDNQSSSSGGKSSEPNDAIVENLLDPQ 1005

Query: 118  LMSVLKSIGWNDSDM----------------FSQPSKKAEPKAV---TSK--SEKSHLEE 2
            +MS L+SIGW+D D+                 SQP +K E K+    TSK  SE+S LEE
Sbjct: 1006 IMSALRSIGWSDMDLSMQSSSAQPQKPMQPSTSQPPQKVEAKSSVTGTSKPQSERSQLEE 1065



 Score = 65.9 bits (159), Expect = 1e-08
 Identities = 57/189 (30%), Positives = 87/189 (46%), Gaps = 24/189 (12%)
 Frame = -3

Query: 574 DIRKDEILAHKRKALALKREGKLAEAREELRQAKLLEKSLEEVQQPQVDSKTAP----TQ 407
           ++ K+++LA KR+A+A +R G +AEA   L++AKLLEK L E+++P V    +P    T 
Sbjct: 449 EVLKEQVLALKREAVANRRSGNVAEAMLLLKKAKLLEKDL-EIEEP-VSKVPSPEGQKTT 506

Query: 406 SFDDAPVVQEIKPKQAQKPLSGRERLKLQQESLAHKRNALKLRREGKIXXXXXXXXXXXX 227
           + +DA          A+   + + +L +Q+E LA K+ AL LRREGK+            
Sbjct: 507 NVEDA----TFAGMNARSISAPKSKLAIQRELLALKKKALALRREGKVDESEEELKKGSV 562

Query: 226 XXXXXXXXXXXXXXXXXXXXXXAPD--------------DAVVE------DLLDPQLMSV 107
                                  P               D V E      D+ DP L+SV
Sbjct: 563 LGKQLEELENSSKPPVPKETRSLPSNPPYKVEPPNISLADEVYEPEVTDNDMQDPALLSV 622

Query: 106 LKSIGWNDS 80
           LK++GW D+
Sbjct: 623 LKNMGWEDA 631

