BLASTX nr result

ID: Dioscorea21_contig00009151 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00009151
         (2226 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EEE67857.1| hypothetical protein OsJ_25665 [Oryza sativa Japo...   353   8e-95
gb|EEC82727.1| hypothetical protein OsI_27422 [Oryza sativa Indi...   349   2e-93
ref|XP_003559755.1| PREDICTED: uncharacterized protein LOC100840...   348   5e-93
ref|XP_002278421.1| PREDICTED: uncharacterized protein LOC100240...   322   3e-85
ref|XP_002330183.1| predicted protein [Populus trichocarpa] gi|2...   309   2e-81

>gb|EEE67857.1| hypothetical protein OsJ_25665 [Oryza sativa Japonica Group]
          Length = 786

 Score =  353 bits (907), Expect = 8e-95
 Identities = 214/487 (43%), Positives = 279/487 (57%), Gaps = 17/487 (3%)
 Frame = -2

Query: 1904 DSESDFSDFVISDEELRDLGIGGESNGXXXXXXXXXXXXXXQPKRIVXXXXXXXXXXXXX 1725
            +S SD SD+VIS++EL+DL +    +                P+R               
Sbjct: 311  ESSSD-SDYVISEQELKDLEVSMPPDAALQSPATL-------PRRTFLSRRVGEKGKEPE 362

Query: 1724 EVDSGKQVCGICLSEEKKTTVQGLLECCAHYFCFACIMEWSKVESRCPVCKRRFASISKS 1545
              ++ KQ CGICLSEE++ T+QG+L CCAHYFCFACIMEWSKVESRCP+CKRRF +I+KS
Sbjct: 363  --EAWKQTCGICLSEEQRATIQGVLNCCAHYFCFACIMEWSKVESRCPLCKRRFTTITKS 420

Query: 1544 GGPLNPGLGNRRAVIRVPKRDQVYQPSEEEMRGFLDPYENVVCVECQQGGDDYLMLLCDI 1365
                + GLG+R+AVIRV KRDQVYQP+EEEMR +LDPYENVVC+EC +GGDD LMLLCDI
Sbjct: 421  S-MADLGLGSRKAVIRVEKRDQVYQPTEEEMRRWLDPYENVVCIECNRGGDDNLMLLCDI 479

Query: 1364 CDTPAHTYCVGLGRDVPEGNWYCEGCRPMEDGFSYAQ-FFNGMDQGASSADPLSGHFSFS 1188
            CD+ AHTYCVGLGR VPEGNWYC GCR   +G S      +  +   + A+  SG F  +
Sbjct: 480  CDSSAHTYCVGLGRQVPEGNWYCGGCRSGGEGPSAQDTVVHCRESNTNPANSSSGSFGSA 539

Query: 1187 GIDNYRNTHSSTIQQPITPQGQSPFLGIDLNVSPR----YPWREEHEC---ASQSPRTGA 1029
                   T S   Q+P     Q    G DLN+SPR       REE      A  +P    
Sbjct: 540  -------TPSGVFQRPPPINTQPSLQGFDLNLSPRETPDEDKREESHVSADAVSTPTGRH 592

Query: 1028 STLSGRRAIQQRIRILFNRSR-------QSFARDAAYEPVISSGLQRGGPLPHSDSLIHP 870
            +TL  RRA  +RIRIL  R R            D        +      P   + S    
Sbjct: 593  ATLDRRRAFNRRIRILLFRPRVTPNGWQNPIQSDRTIPENEQNPQSTSTPTEVNPSCSRD 652

Query: 869  SPLQNLSDSVQFRQNSGPIIQQSTVNEGSSFRVADGAKEQVQSMVRSHMKSLSRSIALER 690
            S +QN   S  F Q +  +I++ T   GS+F+  +GAKEQ+  +V+ ++K +     L +
Sbjct: 653  SSMQNQQSSSSFVQPARGLIER-TYGGGSNFQQTEGAKEQLIPIVKRNLKLMCAQSPLGQ 711

Query: 689  ATYKDIARHSTHTILAACGIEHRREIVMS-PVQLPNTCIHDSE-DGPENLMKNCCSACFS 516
            + +K++AR +THTILA  GI H  + V+S P  LP+ C H  +   P  LM+ CCS+CF+
Sbjct: 712  SDFKNVARRATHTILALSGIAHNEDFVVSTPHPLPSHCNHACDGQEPAFLMRTCCSSCFN 771

Query: 515  LFVQGVV 495
             FV GVV
Sbjct: 772  SFVGGVV 778


>gb|EEC82727.1| hypothetical protein OsI_27422 [Oryza sativa Indica Group]
          Length = 455

 Score =  349 bits (896), Expect = 2e-93
 Identities = 200/425 (47%), Positives = 257/425 (60%), Gaps = 17/425 (4%)
 Frame = -2

Query: 1718 DSGKQVCGICLSEEKKTTVQGLLECCAHYFCFACIMEWSKVESRCPVCKRRFASISKSGG 1539
            ++ KQ CGICLSEE++ T+QG+L CCAHYFCFACIMEWSKVESRCP+CKRRF +I+KS  
Sbjct: 32   EAWKQTCGICLSEEQRATIQGVLNCCAHYFCFACIMEWSKVESRCPLCKRRFTTITKSS- 90

Query: 1538 PLNPGLGNRRAVIRVPKRDQVYQPSEEEMRGFLDPYENVVCVECQQGGDDYLMLLCDICD 1359
              + GLG+R+AVIRV KRDQVYQP+EEEMR +LDPYENVVC+EC +GGDD LMLLCDICD
Sbjct: 91   MADLGLGSRKAVIRVEKRDQVYQPTEEEMRRWLDPYENVVCIECNRGGDDNLMLLCDICD 150

Query: 1358 TPAHTYCVGLGRDVPEGNWYCEGCRPMEDGFSYAQ-FFNGMDQGASSADPLSGHFSFSGI 1182
            + AHTYCVGLGR VPEGNWYC GCR   +G S      +  +   + A+  SG F  +  
Sbjct: 151  SSAHTYCVGLGRQVPEGNWYCGGCRSGGEGPSAQDTVVHCRESNTNPANSSSGSFGSA-- 208

Query: 1181 DNYRNTHSSTIQQPITPQGQSPFLGIDLNVSPR----YPWREEHEC---ASQSPRTGAST 1023
                 T S   Q+P     Q    G DLN+SPR       REE      A  +P    +T
Sbjct: 209  -----TPSGVFQRPPPINTQPSLQGFDLNLSPRETPDEDKREESHVSADAVSTPTGRHAT 263

Query: 1022 LSGRRAIQQRIRILFNRSR-------QSFARDAAYEPVISSGLQRGGPLPHSDSLIHPSP 864
            L  RRA  +RIRIL  R R            D        +      P   + S    S 
Sbjct: 264  LDRRRAFNRRIRILLFRPRVTPNGWQNPIQSDRTIPENEQNPQSTSTPTEVNPSCSRDSS 323

Query: 863  LQNLSDSVQFRQNSGPIIQQSTVNEGSSFRVADGAKEQVQSMVRSHMKSLSRSIALERAT 684
            +QN   S  F Q +  +I++ T   GS+F+  +GAKEQ+  +V+ ++K +     L ++ 
Sbjct: 324  MQNQQSSSSFVQPARGLIER-TYGGGSNFQQTEGAKEQLIPIVKRNLKLMCAQSPLGQSD 382

Query: 683  YKDIARHSTHTILAACGIEHRREIVMS-PVQLPNTCIHDSE-DGPENLMKNCCSACFSLF 510
            +K++AR +THTILA  GI H  + V+S P  LP+ C H  +   P  LM+ CCS+CF+ F
Sbjct: 383  FKNVARRATHTILALSGIAHNEDFVVSTPHPLPSHCNHACDGQEPAFLMRTCCSSCFNSF 442

Query: 509  VQGVV 495
            V GVV
Sbjct: 443  VGGVV 447


>ref|XP_003559755.1| PREDICTED: uncharacterized protein LOC100840975 [Brachypodium
            distachyon]
          Length = 1111

 Score =  348 bits (892), Expect = 5e-93
 Identities = 207/492 (42%), Positives = 278/492 (56%), Gaps = 19/492 (3%)
 Frame = -2

Query: 1904 DSESDFSDFVISDEELRDLGIGGESNGXXXXXXXXXXXXXXQPKRIVXXXXXXXXXXXXX 1725
            DS SD SD+VIS+EEL+DLG+                      +R V             
Sbjct: 633  DSSSD-SDYVISEEELKDLGL-------PRPLETVPQLPPHPTRRTVVPRGIDGKGKEPE 684

Query: 1724 EVDSGKQVCGICLSEEKKTTVQGLLECCAHYFCFACIMEWSKVESRCPVCKRRFASISKS 1545
              ++ KQ+CGICLSEE++ T+QG+L CC+HYFCFACIMEWSKVESRCP+CKRRF +I+KS
Sbjct: 685  --ETLKQICGICLSEEQRATIQGVLNCCSHYFCFACIMEWSKVESRCPLCKRRFNTITKS 742

Query: 1544 GGPLNPGLGNRRAVIRVPKRDQVYQPSEEEMRGFLDPYENVVCVECQQGGDDYLMLLCDI 1365
              P + GLG+R   IRV KRDQVYQP+E+EMR +LDPYENVVC+EC QGGDD LMLLCDI
Sbjct: 743  SVP-DLGLGSRNVAIRVEKRDQVYQPTEDEMRRWLDPYENVVCIECNQGGDDNLMLLCDI 801

Query: 1364 CDTPAHTYCVGLGRDVPEGNWYCEGCRPMEDGFSYAQFFNGM----DQGASSADPLSGHF 1197
            CD+ AHT+CVGLGR+VPEGNWYC GCR   +G SYAQ  + +    +   ++AD  SG  
Sbjct: 802  CDSSAHTFCVGLGREVPEGNWYCGGCRSSVEGPSYAQTEDRVVHHGENNMNTADSSSGSV 861

Query: 1196 SFSGIDNYRNTHSSTIQQPITPQGQSPFLGIDLNVSPRYPWREEHECASQSPRTGASTLS 1017
                    R   S   Q+P     Q    G DLN+SP     E+    S       ST +
Sbjct: 862  G-------RALSSGIFQRPPPLNIQPSLQGFDLNLSPIETPDEDKRAESHISAEPVSTPT 914

Query: 1016 GRRAIQQRIRILFNRSRQSFARDAAYEPVISSGLQRGGPLPHSDS-------LIHPSPLQ 858
            GR A   R R L  R R    R         +G+Q    +P ++            SP  
Sbjct: 915  GRHATVDRRRALNRRIRILLFRPRTATNPWQNGVQHDSIIPGTEQNNQNTCPSTEVSPSC 974

Query: 857  NLSDSVQFRQNSGPIIQQS------TVNEGSSFRVADGAKEQVQSMVRSHMKSLSRSIAL 696
            + +D +Q +Q+S P +Q S      T   GS+FR    AK+Q+  +V+  +K +     L
Sbjct: 975  SSADFMQSQQSSSPFVQSSSNLTHCTYGGGSNFREIANAKDQLIPIVKRSIKHIYAQSPL 1034

Query: 695  ERATYKDIARHSTHTILAACGIEHRRE-IVMSPVQLPNTCIHDSED-GPENLMKNCCSAC 522
            ++ ++ ++AR +T+T+LA  GI H R+ +V +P   P+ C H  +   P  LM+  CS+C
Sbjct: 1035 DQTSFMNVARRATNTVLALSGIAHNRDRVVATPFPFPSHCRHACDGREPAFLMRTVCSSC 1094

Query: 521  FSLFVQGVVEKL 486
            F+ FV  VV  +
Sbjct: 1095 FNSFVGDVVSHI 1106


>ref|XP_002278421.1| PREDICTED: uncharacterized protein LOC100240780 [Vitis vinifera]
          Length = 733

 Score =  322 bits (824), Expect = 3e-85
 Identities = 199/491 (40%), Positives = 262/491 (53%), Gaps = 81/491 (16%)
 Frame = -2

Query: 1712 GKQVCGICLSEEKKTTVQGLLECCAHYFCFACIMEWSKVESRCPVCKRRFASISKSGGPL 1533
            GKQVCGICLSEE K  V+G L+CC+HYFCF CIMEWSKVESRCP+CK+RF +ISK     
Sbjct: 253  GKQVCGICLSEEGKRRVRGTLDCCSHYFCFGCIMEWSKVESRCPLCKQRFMTISKPARA- 311

Query: 1532 NPGLGNRRAVIRVPKRDQVYQPSEEEMRGFLDPYENVVCVECQQGGDDYLMLLCDICDTP 1353
            N G+  R  +I+VP+RDQVY PSEEE+RG+LDPYENV+C EC QGGDD LMLLCD+CD+P
Sbjct: 312  NTGIDLRDVMIQVPERDQVYLPSEEEIRGYLDPYENVICTECHQGGDDGLMLLCDLCDSP 371

Query: 1352 AHTYCVGLGRDVPEGNWYCEGCRPMEDGFSYAQFFNGMDQGASSADPLSGHFSFSGIDNY 1173
            AHTYCVGLGR+VPEGNWYCEGCRP +                   DPLS H +     + 
Sbjct: 372  AHTYCVGLGREVPEGNWYCEGCRPSQ-----------------VQDPLSDHRTTQNTLSD 414

Query: 1172 RNTHSSTIQQPITPQGQ---SPFL-GIDLNVSP----------------------RYPWR 1071
            R +    I + +        +PF  GI +N+SP                      R+  R
Sbjct: 415  RPSPVGNIGESLVTSLSLLSTPFTQGIGINISPRYRNAEAASPVSASGASTLSGRRWIHR 474

Query: 1070 EEHECASQS--------------PRTGASTLSGR------RAIQQRIRILFNRSRQSFAR 951
            + H+  S +              P +G+  L+ +       A Q    +    S  +F  
Sbjct: 475  QIHQIRSNNRMSHVAVRNIGNSAPNSGSDFLNSQIDQGLDTASQHTKALETGTSHSTFLE 534

Query: 950  DAAYE-------------PVISSGLQRGGPLPHSDSLIHP-------------SPLQNLS 849
            ++  +             P +S   ++    P + +   P             S   ++S
Sbjct: 535  ESLQDNRYPSLQNMDLLSPRLSQSRRQDIQAPTTTATAGPARGTLWDELVGISSAFNSIS 594

Query: 848  DSVQFRQ-------NSGPIIQQSTVNEGSSFRVADGAKEQVQSMVRSHMKSLSRSIALER 690
             + Q  Q        S   +  + V EG+ F V    KEQ+QSMVRSH+KSLS+ I L  
Sbjct: 595  GNEQLHQCSSRSSIRSDGSVSPNAVREGNHFHVV---KEQLQSMVRSHLKSLSKDIDLGL 651

Query: 689  ATYKDIARHSTHTILAACGIEHRREIVMSPVQLPNTCIHDSE--DGPENLMKNCCSACFS 516
            +T+KD+AR STHTILAA G+EHRR  V S V  P  C H     DG  +LMK+ CS CF 
Sbjct: 652  STFKDVARSSTHTILAAYGLEHRRSEVHS-VPTPPICSHIERIADGQMSLMKSSCSCCFD 710

Query: 515  LFVQGVVEKLM 483
             +V+ VV +++
Sbjct: 711  SYVRDVVRRIL 721


>ref|XP_002330183.1| predicted protein [Populus trichocarpa] gi|222871639|gb|EEF08770.1|
            predicted protein [Populus trichocarpa]
          Length = 735

 Score =  309 bits (792), Expect = 2e-81
 Identities = 189/487 (38%), Positives = 269/487 (55%), Gaps = 77/487 (15%)
 Frame = -2

Query: 1712 GKQVCGICLSEEKKTTVQGLLECCAHYFCFACIMEWSKVESRCPVCKRRFASISKSGGPL 1533
            G+QVCGICLSEE K   +G L+CC+HYFCF CIMEWSKVESRCP+CK+RF +I+K+G  +
Sbjct: 243  GRQVCGICLSEEDKRRFRGTLDCCSHYFCFTCIMEWSKVESRCPLCKQRFRTITKNGRSI 302

Query: 1532 NPGLGNRRAVIRVPKRDQVYQPSEEEMRGFLDPYENVVCVECQQGGDDYLMLLCDICDTP 1353
              G+  R  VI+VPKRDQVYQP+EEE+R ++DPYENV+C EC +GGDD LMLLCD+CD+ 
Sbjct: 303  -VGVDLRNMVIQVPKRDQVYQPTEEEIRSYIDPYENVICKECHEGGDDGLMLLCDLCDSS 361

Query: 1352 AHTYCVGLGRDVPEGNWYCEGCRPMEDGFSYAQ--------------FFN---------- 1245
            AHTYCVGLGR VPEGNWYC+ CRP+  G S +Q               FN          
Sbjct: 362  AHTYCVGLGRQVPEGNWYCDDCRPVALGSSSSQTQDSLPDQWNISSNIFNRPSLMLNLEE 421

Query: 1244 GMDQGASSADPLSGHFSFSGIDNYR-NTHSSTIQQPITPQGQSPFLG-------IDLNVS 1089
            G+D    S+  L+   +F  + + R  T  + +  P++  G S   G       I + +S
Sbjct: 422  GLDPNLESSPRLTVPQAFGSLSSPRFPTGDNHVASPVSGAGASTLSGRRHIHRNIRILLS 481

Query: 1088 PRYPWREEHECASQSPRTGASTLSG-----------RRAIQ----------------QRI 990
               P    +  A++     A++L G             A+Q                +R+
Sbjct: 482  NMNPSTNMNPMANRIDVISAASLRGDLSNSQIDLGRETALQNLRTQEVDTLEQTHHEERL 541

Query: 989  RILFNRSRQSFARDAAY--EPVISSGLQRGGPLPHSDSLIHPS------PLQNLSDSVQF 834
            +   ++      RD+ Y     ++  + +G  +P SD  ++ +       + ++S S Q 
Sbjct: 542  QTNDHQPSSFQNRDSFYLTPNQLTRQIVQGPTIPTSDRPVNLTLWPELMGINSMSGSEQL 601

Query: 833  RQ--------NSGPIIQQSTVNEGSSFRVADGAKEQVQSMVRSHMKSLSRSIALERATYK 678
             +        + G +       E   + V    KE++QSMV++H+ SLS    L+  T+K
Sbjct: 602  HEFDSRAMTGHGGTLSSYQVRGESQFYDV----KEKLQSMVKNHLGSLSHDTELDHDTFK 657

Query: 677  DIARHSTHTILAACGIEHRREIVMSPVQLPNTCIHDSE--DGPENLMKNCCSACFSLFVQ 504
            DI+R STHTILAACG+EH+R  V + V  P+TCIH      G  + MK CCS+CF  FV+
Sbjct: 658  DISRSSTHTILAACGLEHKRSEVHT-VPPPSTCIHIDRVVAGQTSPMKGCCSSCFDSFVR 716

Query: 503  GVVEKLM 483
             VV+++M
Sbjct: 717  DVVKRIM 723


Top