BLASTX nr result

ID: Dioscorea21_contig00014441 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00014441
         (1975 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Viti...   524   e-146
emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]   519   e-144
ref|XP_002309707.1| predicted protein [Populus trichocarpa] gi|2...   514   e-143
gb|ACN76570.1| cysteine proteinase [Triticum aestivum]                513   e-143
ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Bra...   507   e-141

>ref|XP_003635099.1| PREDICTED: cysteine protease ATG4-like [Vitis vinifera]
            gi|296086874|emb|CBI33041.3| unnamed protein product
            [Vitis vinifera]
          Length = 486

 Score =  524 bits (1349), Expect = e-146
 Identities = 269/504 (53%), Positives = 334/504 (66%), Gaps = 14/504 (2%)
 Frame = +1

Query: 262  MTGLLERAVASNFPAGEQQSRNSSERETLAVAVSEEESATSNTQTRSSKASLWSSFIAST 441
            M G  E+AVAS F    +   ++SE ++              + T+ SK SLWSS  AS 
Sbjct: 1    MKGFCEKAVASKFSCKTKSDSSNSEPQS--------------SDTKLSKVSLWSSVFASA 46

Query: 442  FTIFETERSSGDKEGKRKSY------GWTXXXXXXXXXXXMRRLQERILGTNRVDVSSSN 603
            F++FET   S     ++K+       GWT           MRR+QER+LGT++  +SSS 
Sbjct: 47   FSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSST 106

Query: 604  SDIWLLGINYKVSQEESSDR--DNYGLDAFLQDFSSRIWITYRKGFDPIVDTKFVSDVNW 777
            SDIWLLG+ YK+SQEESS+    + GL  F QDFSSRI +TYRKGF+ I D+K  SDVNW
Sbjct: 107  SDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNW 166

Query: 778  GCMIRSSQMLVAQAMLFHHMGRSWRKPTQKPYEREYVRILHHFGDSGMSAFSIHNLLQAG 957
            GCM+RSSQMLVAQA+L H MGRSWRK + KP +++Y+ ILHHFGDS  SAFSIHN+LQAG
Sbjct: 167  GCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAG 226

Query: 958  RNYGLAAGSWVGPYAMCRAWAALTQPNGQHGDKTKEILPMVVYVVSGDEDGERGGAPVIC 1137
            + YGLAAGSWVGPYAMCR+W  L +   +  D   + LPM +Y+VSGDEDGERGGAPV+ 
Sbjct: 227  KAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVY 286

Query: 1138 IDNVARLCSDATSDHVTWXXXXXXXXXXXXXEKINPRYIPLLCETFTFPQSLGILGGKTG 1317
            I+  +R C + +   V W             EK+NPRYIP L  TFTFPQSLGILGGK G
Sbjct: 287  IEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPG 346

Query: 1318 ASTYIVGVQDNKALYLDPHEVQQAVDIKEDDLEAETSSYHCSVVRQMQLDLIDPSLAIGF 1497
            ASTYIVGVQD KA YLDPHE Q  VDI+ ++LEA+TSSYHC+++R + LD IDPSLAIGF
Sbjct: 347  ASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNIIRHICLDSIDPSLAIGF 406

Query: 1498 YCRDKDDFDDFCSRASALGDRSNGAPLFTVTQSPQSSRTIHQGALMENIDGSDDFRVGET 1677
            YCRDKDDFDDFC RAS L D+SNGAPLFTV       + I   +  + +D    FR  ++
Sbjct: 407  YCRDKDDFDDFCIRASKLADKSNGAPLFTVAHIHSLPKPI---SCSDGMDDCSGFREDDS 463

Query: 1678 FNTEDICDDSQTQ------EDEWQ 1731
            F   D+  +   +      ED+WQ
Sbjct: 464  F---DVVSNKGAEGYEHEHEDDWQ 484


>emb|CAN81102.1| hypothetical protein VITISV_021940 [Vitis vinifera]
          Length = 489

 Score =  519 bits (1336), Expect = e-144
 Identities = 270/507 (53%), Positives = 333/507 (65%), Gaps = 17/507 (3%)
 Frame = +1

Query: 262  MTGLLERAVASNFPAGEQQSRNSSERETLAVAVSEEESATSNTQTRSSKASLWSSFIAST 441
            M G  E+AVAS F    +   ++SE ++              + T+ SK SLWSS  AS 
Sbjct: 1    MKGFCEKAVASKFSCKTKSDSSNSEPQS--------------SDTKLSKVSLWSSVFASA 46

Query: 442  FTIFETERSSGDKEGKRKSY------GWTXXXXXXXXXXXMRRLQERILGTNRVDVSSSN 603
            F++FET   S     ++K+       GWT           MRR+QER+LGT++  +SSS 
Sbjct: 47   FSVFETNSESSPSASEKKAIDNGRNNGWTTAVRKVVTGVSMRRIQERVLGTSKTGISSST 106

Query: 604  SDIWLLGINYKVSQEESSDR--DNYGLDAFLQDFSSRIWITYRKGFDPIVDTKFVSDVNW 777
            SDIWLLG+ YK+SQEESS+    + GL  F QDFSSRI +TYRKGF+ I D+K  SDVNW
Sbjct: 107  SDIWLLGLCYKISQEESSNHASSSNGLAEFEQDFSSRILMTYRKGFEAIGDSKLTSDVNW 166

Query: 778  GCMIRSSQMLVAQAMLFHHMGRSWRKPTQKPYEREYVRILHHFGDSGMSAFSIHNLLQAG 957
            GCM+RSSQMLVAQA+L H MGRSWRK + KP +++Y+ ILHHFGDS  SAFSIHN+LQAG
Sbjct: 167  GCMLRSSQMLVAQALLLHRMGRSWRKTSHKPMDQDYIEILHHFGDSKASAFSIHNILQAG 226

Query: 958  RNYGLAAGSWVGPYAMCRAWAALTQPNGQHGDKTKEILPMVVYVVSGDEDGERGGAPVIC 1137
            + YGLAAGSWVGPYAMCR+W  L +   +  D   + LPM +Y+VSGDEDGERGGAPV+ 
Sbjct: 227  KAYGLAAGSWVGPYAMCRSWETLARSKREETDLECQSLPMAIYIVSGDEDGERGGAPVVY 286

Query: 1138 IDNVARLCSDATSDHVTWXXXXXXXXXXXXXEKINPRYIPLLCETFTFPQSLGILGGKTG 1317
            I+  +R C + +   V W             EK+NPRYIP L  TFTFPQSLGILGGK G
Sbjct: 287  IEEASRHCLEFSKGQVDWTPILLLVPLVLGLEKVNPRYIPSLAATFTFPQSLGILGGKPG 346

Query: 1318 ASTYIVGVQDNKALYLDPHEVQQAVDIKEDDLEAETSSYHC---SVVRQMQLDLIDPSLA 1488
            ASTYIVGVQD KA YLDPHE Q  VDI+ ++LEA+TSSYHC   S++R + LD IDPSLA
Sbjct: 347  ASTYIVGVQDEKAFYLDPHEAQSVVDIRRENLEADTSSYHCNCSSIIRHICLDSIDPSLA 406

Query: 1489 IGFYCRDKDDFDDFCSRASALGDRSNGAPLFTVTQSPQSSRTIHQGALMENIDGSDDFRV 1668
            IGFYCRDKDDFDDFC RAS L D SNGAPLFTV       + I   +  + +D    FR 
Sbjct: 407  IGFYCRDKDDFDDFCIRASKLADESNGAPLFTVAHIHSLPKPI---SCSDGMDDCSGFRE 463

Query: 1669 GETFNTEDICDDSQTQ------EDEWQ 1731
             ++F   D+  +   +      ED+WQ
Sbjct: 464  DDSF---DVVSNKGAEGYEHEHEDDWQ 487


>ref|XP_002309707.1| predicted protein [Populus trichocarpa] gi|222852610|gb|EEE90157.1|
            predicted protein [Populus trichocarpa]
          Length = 481

 Score =  514 bits (1323), Expect = e-143
 Identities = 270/464 (58%), Positives = 317/464 (68%), Gaps = 9/464 (1%)
 Frame = +1

Query: 367  EESATSNTQTRSSKASLWSSFIASTFTIFETERSSGDKEGK-----RKSYGWTXXXXXXX 531
            + S   +T T+ SK SLWSSF AS F++F+  R S           R S GWT       
Sbjct: 28   DSSEPGSTDTKVSKPSLWSSFFASAFSVFDIYRDSSSTSHNEAPHIRHSNGWTSSVKKIV 87

Query: 532  XXXXMRRLQERILGTNRVDVSSSNSDIWLLGINYKVSQEESS---DRDNYGLDAFLQDFS 702
                MRR+QER+LGT++  +S++ SDIWLLG  YK+SQ++SS   D  N  L AF +DFS
Sbjct: 88   AGGTMRRIQERVLGTSKTGISNTTSDIWLLGARYKISQDDSSGNADATN-ALAAFHRDFS 146

Query: 703  SRIWITYRKGFDPIVDTKFVSDVNWGCMIRSSQMLVAQAMLFHHMGRSWRKPTQKPYERE 882
            SRI ITYRKGFD I D+K  SDVNWGCM+RSSQMLVAQA+LFH +GRSWRKP  KP +R+
Sbjct: 147  SRILITYRKGFDMIEDSKLTSDVNWGCMLRSSQMLVAQALLFHRLGRSWRKPVDKPLDRD 206

Query: 883  YVRILHHFGDSGMSAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWAALTQPNGQHGDKTK 1062
            YV ILH FGDS  SAFSIHNLLQAG+ YGLAAGSWVGPYAMCR+W +L +   +  +   
Sbjct: 207  YVEILHLFGDSEASAFSIHNLLQAGKAYGLAAGSWVGPYAMCRSWESLARSKREETNLEY 266

Query: 1063 EILPMVVYVVSGDEDGERGGAPVICIDNVARLCSDATSDHVTWXXXXXXXXXXXXXEKIN 1242
            + LPM VYVVSG EDGERGGAPV+ I++ AR CS+ +     W             +KIN
Sbjct: 267  QTLPMAVYVVSGCEDGERGGAPVLSIEDAARHCSEFSKGREDWTPILLLVPLVLGLDKIN 326

Query: 1243 PRYIPLLCETFTFPQSLGILGGKTGASTYIVGVQDNKALYLDPHEVQQAVDIKEDDLEAE 1422
            PRYIP L  TFTFPQSLGILGGK GASTYIVGVQD  A YLDPHEVQ  V+   DD+EA 
Sbjct: 327  PRYIPSLQATFTFPQSLGILGGKPGASTYIVGVQDENAFYLDPHEVQPVVNFSRDDVEAN 386

Query: 1423 TSSYHCSVVRQMQLDLIDPSLAIGFYCRDKDDFDDFCSRASALGDRSNGAPLFTVTQSPQ 1602
            TSSYHC VVR + LDLIDPSLAIGFYCRDKDDFDDFCS AS L D SNGAPLFTV  S +
Sbjct: 387  TSSYHCDVVRHIPLDLIDPSLAIGFYCRDKDDFDDFCSLASKLADESNGAPLFTVANSYK 446

Query: 1603 SSRTIHQGALMENIDGSDDFRVG-ETFNTEDICDDSQTQEDEWQ 1731
            SS+        ++ +  DD  +G  T N  + C      ED+WQ
Sbjct: 447  SSK-------HDSSEVRDDDPLGVMTMNDAEGC----LNEDDWQ 479


>gb|ACN76570.1| cysteine proteinase [Triticum aestivum]
          Length = 484

 Score =  513 bits (1320), Expect = e-143
 Identities = 275/496 (55%), Positives = 330/496 (66%), Gaps = 6/496 (1%)
 Frame = +1

Query: 262  MTGLLERAVASNFPAGEQQSRNSSERETLAVAVSEEESATSNTQTRSSKASLWSSFIAST 441
            MT L ER  A   P       +S E +  AVA S   SA+ + +         +S ++S 
Sbjct: 1    MTSLPERGAA---PPSNPTPSSSCEGDA-AVASSSASSASEDQRKDGGPKQCKASILSSV 56

Query: 442  FTIFETERSSGDKEGKRKS--YGWTXXXXXXXXXXXMRRLQERILGTNRVDVSSSNSDIW 615
             TIFE ++    + G   S  Y W+           M R     LG  +   + +  D+W
Sbjct: 57   LTIFEPDQDQSGRSGGHASGSYAWSRVLRRFVGGGSMWRF----LGCGK---ALTAGDVW 109

Query: 616  LLGINYKVSQEESS-DRDNYGLDA-FLQDFSSRIWITYRKGFDPIVDTKFVSDVNWGCMI 789
             LG  YK+S EESS D D+ G  A FL+DFSSR+WITYRKGFD I D+K  SDVNWGCM+
Sbjct: 110  FLGKCYKLSSEESSSDSDSEGGHAAFLEDFSSRVWITYRKGFDVISDSKLTSDVNWGCMV 169

Query: 790  RSSQMLVAQAMLFHHMGRSWRKPTQKPYEREYVRILHHFGDSGMSAFSIHNLLQAGRNYG 969
            RSSQMLVAQA++FHH+GRSWRKP Q P + E+ RILH FGDS + AFSIHNLLQAG++YG
Sbjct: 170  RSSQMLVAQALIFHHLGRSWRKPAQNPSDPEHTRILHLFGDSEVCAFSIHNLLQAGKSYG 229

Query: 970  LAAGSWVGPYAMCRAWAALTQPNGQHGD--KTKEILPMVVYVVSGDEDGERGGAPVICID 1143
            LAAGSWVGPYAMCRAW  L + N +  +     E  PMV+YVVSGDEDGERGGAPV+CID
Sbjct: 230  LAAGSWVGPYAMCRAWQTLIRTNREQPEVINRNESFPMVLYVVSGDEDGERGGAPVVCID 289

Query: 1144 NVARLCSDATSDHVTWXXXXXXXXXXXXXEKINPRYIPLLCETFTFPQSLGILGGKTGAS 1323
              A+LC D       W             +KINPRYIPLL ETFTFPQSLGILGGK GAS
Sbjct: 290  VAAQLCYDFNKGQSAWSPILLLVPLVLGLDKINPRYIPLLKETFTFPQSLGILGGKPGAS 349

Query: 1324 TYIVGVQDNKALYLDPHEVQQAVDIKEDDLEAETSSYHCSVVRQMQLDLIDPSLAIGFYC 1503
            TYI GVQD++ALYLDPHEVQ AV+I  D+LEA+TSSYHCS VR M LDLIDPSLAIGFYC
Sbjct: 350  TYIAGVQDDRALYLDPHEVQLAVNIASDNLEADTSSYHCSTVRDMPLDLIDPSLAIGFYC 409

Query: 1504 RDKDDFDDFCSRASALGDRSNGAPLFTVTQSPQSSRTIHQGALMENIDGSDDFRVGETFN 1683
            RDKDDFDDFCSRAS L +++NGAPLFTV QS Q S+ ++     ++  G   + V +  +
Sbjct: 410  RDKDDFDDFCSRASELAEQANGAPLFTVVQSVQPSKQMYN---QDDGSGCSGYGVSDNID 466

Query: 1684 TEDICDDSQTQEDEWQ 1731
            TED+    +T EDEWQ
Sbjct: 467  TEDLDGSGETGEDEWQ 482


>ref|XP_003580841.1| PREDICTED: cysteine protease ATG4B-like [Brachypodium distachyon]
          Length = 493

 Score =  507 bits (1305), Expect = e-141
 Identities = 277/512 (54%), Positives = 336/512 (65%), Gaps = 22/512 (4%)
 Frame = +1

Query: 262  MTGLLERAVA--SNFPAGEQQSRNSSERETLAVAVSEEESATSNTQTRSSKASLWSSFIA 435
            MT L ER  A  S+ P+    SR   +  T AVA S   SA+ +  ++  K S+    ++
Sbjct: 1    MTSLPERGAAPPSDLPS---PSRRKGDAATAAVASS---SASEDIGSKHCKGSI----LS 50

Query: 436  STFTIFETERSS---------------GDKEGKRKSYGWTXXXXXXXXXXXMRRLQERIL 570
            S FTIFE ++ S               G   G      W+           M R     L
Sbjct: 51   SVFTIFEAQQDSSSSVAAAAACENKSPGHSSGPSYGGAWSRALRRFVGGGSMWRF----L 106

Query: 571  GTNRVDVSSSNSDIWLLGINYKVSQEESS---DRDNYGLDAFLQDFSSRIWITYRKGFDP 741
            G  +V    +N D+W LG  YK S EESS   D D+ G  AFL+DFSSRIW+TYRKGFD 
Sbjct: 107  GCAKV---LTNGDVWFLGKCYKFSSEESSSDLDTDS-GHAAFLEDFSSRIWVTYRKGFDA 162

Query: 742  IVDTKFVSDVNWGCMIRSSQMLVAQAMLFHHMGRSWRKPTQKPYEREYVRILHHFGDSGM 921
            I D+KF SDVNWGCM+RSSQMLVAQA++FHH+GRSWRKP+QKP   EY+RILH FGDS +
Sbjct: 163  ISDSKFTSDVNWGCMVRSSQMLVAQALMFHHLGRSWRKPSQKPCNPEYIRILHLFGDSEV 222

Query: 922  SAFSIHNLLQAGRNYGLAAGSWVGPYAMCRAWAALTQPNGQHGDKTK--EILPMVVYVVS 1095
             AFS+HNLLQAG++YGLAAGSWVGPYAMCRAW  L + N +  + +   E  PM +YVVS
Sbjct: 223  CAFSVHNLLQAGKSYGLAAGSWVGPYAMCRAWQTLIRTNREQPEVSNGNESFPMALYVVS 282

Query: 1096 GDEDGERGGAPVICIDNVARLCSDATSDHVTWXXXXXXXXXXXXXEKINPRYIPLLCETF 1275
            GDEDGERGGAPV+CID  A+LC D   D  TW             +KINPRYIPLL ETF
Sbjct: 283  GDEDGERGGAPVVCIDVAAQLCYDFNKDQSTWSPILLLVPLVLGLDKINPRYIPLLKETF 342

Query: 1276 TFPQSLGILGGKTGASTYIVGVQDNKALYLDPHEVQQAVDIKEDDLEAETSSYHCSVVRQ 1455
            TFPQSLGILGGK G STYI G+QD++ALYLDPH+VQ AV+I  D+L+A+TSSYHCS VR 
Sbjct: 343  TFPQSLGILGGKPGTSTYIAGIQDDRALYLDPHDVQMAVNIASDNLDADTSSYHCSTVRD 402

Query: 1456 MQLDLIDPSLAIGFYCRDKDDFDDFCSRASALGDRSNGAPLFTVTQSPQSSRTIHQGALM 1635
            M LDL+DPSLAIGFYCRDKDDFDDFCSRAS L  ++NGAPLFTV QS Q S+ ++     
Sbjct: 403  MALDLLDPSLAIGFYCRDKDDFDDFCSRASELVVKANGAPLFTVVQSIQPSKQMYN---Q 459

Query: 1636 ENIDGSDDFRVGETFNTEDICDDSQTQEDEWQ 1731
            ++  GS    + +  N ED+    +  E+EWQ
Sbjct: 460  DDGSGSSGDGMADNINMEDLDGSGEAGEEEWQ 491

