BLASTX nr result

ID: Atractylodes21_contig00021720 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00021720
         (1555 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]            583   e-164
ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|2...   575   e-162
ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [V...   572   e-161
dbj|BAD29954.1| cysteine protease [Daucus carota]                     571   e-160
ref|XP_002518705.1| cysteine protease, putative [Ricinus communi...   568   e-159

>gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  583 bits (1504), Expect = e-164
 Identities = 288/448 (64%), Positives = 333/448 (74%), Gaps = 7/448 (1%)
 Frame = -1

Query: 1363 FICSGHDVKSEMSIINYDQTHMRPHRLRTHDEVLAIYRSWLTHHRRFYNALGENERRFEI 1184
            F+C       +MSII+YDQTH      RT  E +AIY  WLT H + YNA+GE ERRFEI
Sbjct: 14   FLCFAFSSALDMSIISYDQTHPPQ---RTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEI 70

Query: 1183 FKDNLQFIDEHNADPNRSYKLGLNRFADMTNEEYRSKFMGMKTEMRNR-ARRVSHRYAVK 1007
            FKDNL+F+DEHNA    SY++GLNRFAD+TNEEYRS F+G   EM+ R A   S RYA +
Sbjct: 71   FKDNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKSDRYAFR 129

Query: 1006 LGGENLPESVDWREKGAVSPIKDQGQCGSCWAFSSIAAVEGINQIVTXXXXXXXXXXLVD 827
              G+ LP SVDWREKGAVSP+KDQGQCGSCWAFS+I+AVEGINQIVT          LVD
Sbjct: 130  -AGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVD 188

Query: 826  CE-TQSSGCNGGLMDYAFEFILKNGGIDTEQDYPYHAVEGPCDTSRKKARVVSIDGYEDV 650
            C+ + + GCNGGLMDY F+FI+ NGGIDTE+DYPY AV+G CD  RK ARVVSI+GYEDV
Sbjct: 189  CDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDV 248

Query: 649  PENDEYALKKAVAHQPVSVAVEAGGRAFQLYQSGVFSGTCGTEIDHGVVAVGYGTENGID 470
            PE+DE +LKKAVA+QPVSVA+EAGGRAFQLY+SGVF+G CGT +DHGVVAVGYGTENG+D
Sbjct: 249  PEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYGTENGVD 308

Query: 469  YWLVRNSWGPSWGENGYIKLERNLKFTNTGKCGITMMASYPVK-----XXXXXXXXXXXX 305
            YW VRNSWGP WGENGYIKLERN+  T +GKCGI  MASYP K                 
Sbjct: 309  YWTVRNSWGPKWGENGYIKLERNINAT-SGKCGIASMASYPTKTGSNPPNPGPSPPTPVN 367

Query: 304  XXTVCDDYYDCPPGNTXXXXXXXXXXXFGWGCCPFESATCCEDKYSCCPHDHPICDLHAG 125
              TVCDDYY CP G+T            GWGCCP ESATCC+D  SCCPH++PICDL  G
Sbjct: 368  PPTVCDDYYSCPEGSTCCCVYQYGDFCIGWGCCPLESATCCDDHSSCCPHEYPICDLDGG 427

Query: 124  TCLISKNNPIGVKALERTPARLYSSHVH 41
            TCL+SK+NP+GVKAL+R PAR    H+H
Sbjct: 428  TCLMSKDNPLGVKALKRGPARRNVGHLH 455


>ref|XP_002313136.1| predicted protein [Populus trichocarpa] gi|222849544|gb|EEE87091.1|
            predicted protein [Populus trichocarpa]
          Length = 477

 Score =  575 bits (1483), Expect = e-162
 Identities = 280/468 (59%), Positives = 338/468 (72%), Gaps = 12/468 (2%)
 Frame = -1

Query: 1414 MPMAFPAIISVTLFLFFFICSGHDVKSEMSIINYDQTHMRPHRLRTHDEVLAIYRSWLTH 1235
            M   + +   +  F F  +C   D    MSII+Y+  H +    RT  E L +Y  WL  
Sbjct: 1    MASLYRSFAFLATFYFLSVCLAID----MSIIDYNLKHGQVPE-RTEAETLRLYEMWLVK 55

Query: 1234 HRRFYNALGENERRFEIFKDNLQFIDEHNADPNRSYKLGLNRFADMTNEEYRSKFMGMKT 1055
            + + YNALGE ERRFEIFKDNL+F+D+HN+  N SYKLGLN+FAD++NEEYR+ ++G + 
Sbjct: 56   YGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRM 115

Query: 1054 EMRNR--ARRVSHRYAVKLGGENLPESVDWREKGAVSPIKDQGQCGSCWAFSSIAAVEGI 881
            + + R      S RY  K  G++LPESVDWREKGAV+P+KDQGQCGSCWAFS++ AVEGI
Sbjct: 116  DGKRRLLGGPKSARYLFK-DGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGI 174

Query: 880  NQIVTXXXXXXXXXXLVDCE-TQSSGCNGGLMDYAFEFILKNGGIDTEQDYPYHAVEGPC 704
            NQIVT          LVDC+   + GCNGGLMDYAFEFI+KNGGIDTE+DYPY AV+  C
Sbjct: 175  NQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMC 234

Query: 703  DTSRKKARVVSIDGYEDVPENDEYALKKAVAHQPVSVAVEAGGRAFQLYQSGVFSGTCGT 524
            D +RK ARVV+IDGYEDVP+NDE +L+KAVA+QPVSVA+EAGGRAFQLYQSGVF+G+CGT
Sbjct: 235  DPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGT 294

Query: 523  EIDHGVVAVGYGTENGIDYWLVRNSWGPSWGENGYIKLERNLKFTNTGKCGITMMASYPV 344
            ++DHGVVAVGYGTENG+DYW+VRNSWGP+WGENGYI++ERN+  T TGKCGI M ASYP 
Sbjct: 295  QLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPT 354

Query: 343  KXXXXXXXXXXXXXXTV---------CDDYYDCPPGNTXXXXXXXXXXXFGWGCCPFESA 191
            K               V         CDDYY CP G+T           FGWGCCP ESA
Sbjct: 355  KKGANPPNPGPSPPSPVNPSPPPSSECDDYYSCPAGSTCCCIYPYGDYCFGWGCCPLESA 414

Query: 190  TCCEDKYSCCPHDHPICDLHAGTCLISKNNPIGVKALERTPARLYSSH 47
            TCC+D  SCCPH++P+CDL AGTC +SKNNP GVKAL R PAR+  SH
Sbjct: 415  TCCDDHNSCCPHEYPVCDLEAGTCRMSKNNPFGVKALTRAPARIAQSH 462


>ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  572 bits (1474), Expect = e-161
 Identities = 283/463 (61%), Positives = 340/463 (73%), Gaps = 12/463 (2%)
 Frame = -1

Query: 1414 MPMAFPAIISVTLFLFFFIC----SGHDVKSEMSIINYDQTHMRPHRLRTHDEVLAIYRS 1247
            M ++   I++  LF FFF+     S     ++MSII+Y     +    RT  EV+A+Y +
Sbjct: 1    MAVSQSPIMASFLFSFFFLLAALFSASASAADMSIISYGDRLEK----RTDAEVMAVYEA 56

Query: 1246 WLTHHRRFYNALGENERRFEIFKDNLQFIDEHNADPNRSYKLGLNRFADMTNEEYRSKFM 1067
            WL  H + YNALGE ERRFEIFKDNL+FI+EHNA  NR+YK+GLNRFAD+TNEEYRS+++
Sbjct: 57   WLVKHGKSYNALGERERRFEIFKDNLRFIEEHNA-VNRTYKVGLNRFADLTNEEYRSRYL 115

Query: 1066 GMKTEMRN--RARRVSHRYAVKLGGENLPESVDWREKGAVSPIKDQGQCGSCWAFSSIAA 893
            G + E R   RA RVS RY+ +  GE+LPESVDWREKGAV P+KDQG CGSCWAFS+IAA
Sbjct: 116  GRRDETRRGLRASRVSDRYSFR-AGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIAA 174

Query: 892  VEGINQIVTXXXXXXXXXXLVDCE-TQSSGCNGGLMDYAFEFILKNGGIDTEQDYPYHAV 716
            VEGINQI T          LVDC+ + + GCNGGLMDYAFEFI+ NGGID+E+DYPY A 
Sbjct: 175  VEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYRAA 234

Query: 715  EGPCDTSRKKARVVSIDGYEDVPENDEYALKKAVAHQPVSVAVEAGGRAFQLYQSGVFSG 536
            +  CD +RK ARVVSIDGYEDVP+NDE +LKKAVA+QPVSVA+EAGGRAFQLYQSGVF+G
Sbjct: 235  DTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVFTG 294

Query: 535  TCGTEIDHGVVAVGYGTENGIDYWLVRNSWGPSWGENGYIKLERNLKFTNTGKCGITMMA 356
             CGT++DHGVVAVGYGTEN +DYW+VRNSWGP+WGE+GYIKLERNL  T TGKCGI +  
Sbjct: 295  QCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAIEP 354

Query: 355  SYPVK-----XXXXXXXXXXXXXXTVCDDYYDCPPGNTXXXXXXXXXXXFGWGCCPFESA 191
            SYP+K                    VCD+YY CP  +T           F WGCCP E A
Sbjct: 355  SYPIKNGQNPPNPGPSPPSPSKPSVVCDEYYTCPEESTCCCIYEYAGFCFEWGCCPLEGA 414

Query: 190  TCCEDKYSCCPHDHPICDLHAGTCLISKNNPIGVKALERTPAR 62
            TCC+D YSCCPH++P+CD+ AGTC +SK NP+ VKA  RTPAR
Sbjct: 415  TCCDDHYSCCPHEYPVCDVDAGTCQMSKGNPLSVKAWRRTPAR 457


>dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  571 bits (1472), Expect = e-160
 Identities = 279/450 (62%), Positives = 334/450 (74%), Gaps = 12/450 (2%)
 Frame = -1

Query: 1378 LFLFFFICSGHDVKS--EMSIINYDQTH-MRPHRLRTHDEVLAIYRSWLTHHRRFYNALG 1208
            L LFF + S   + S  +MSII YD+TH +    LRTHD++L++Y SWL  H + YNALG
Sbjct: 16   LVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYESWLVKHHKNYNALG 75

Query: 1207 ENERRFEIFKDNLQFIDEHNADPNRSYKLGLNRFADMTNEEYRSKFMG---MKTEMRNRA 1037
            E E RF IFKDN+ F+D HN+  N+SYKLGLN+FAD+TN+EYRS ++    MK E +N  
Sbjct: 76   EKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKNED 135

Query: 1036 RRVSHRYAVKLGGENLPESVDWREKGAVSPIKDQGQCGSCWAFSSIAAVEGINQIVTXXX 857
               S R+  +  G++LPESVDWR++GAV+P+KDQGQCGSCWAFS++ AVEGIN+IVT   
Sbjct: 136  GFRSDRFVFE-DGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGEL 194

Query: 856  XXXXXXXLVDCET-QSSGCNGGLMDYAFEFILKNGGIDTEQDYPYHAVEGPCDTSRKKAR 680
                   LVDC+   + GCNGGLMDYAFEFI+KNGGIDTE DYPY  V+G CD +RK A+
Sbjct: 195  ISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRKNAK 254

Query: 679  VVSIDGYEDVPENDEYALKKAVAHQPVSVAVEAGGRAFQLYQSGVFSGTCGTEIDHGVVA 500
            VV+I+GYEDVP NDE +LKKAVAHQPVSVA+EAGGRAFQLY+SGVF+G CGTE+DHGVVA
Sbjct: 255  VVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQCGTELDHGVVA 314

Query: 499  VGYGTENGIDYWLVRNSWGPSWGENGYIKLERNLKFTNTGKCGITMMASYPVK-----XX 335
            VGYG+ENG DYW+VRNSWGP WGE+GYI+LERN+  T+TGKCGI M ASYP K       
Sbjct: 315  VGYGSENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASYPTKTGDNPPK 374

Query: 334  XXXXXXXXXXXXTVCDDYYDCPPGNTXXXXXXXXXXXFGWGCCPFESATCCEDKYSCCPH 155
                        TVCDDYY CP   T           FGWGCCP  SATCC+D YSCCP 
Sbjct: 375  PGPSPPSPVKPQTVCDDYYSCPESTTCCCLYEIGQYCFGWGCCPLASATCCDDHYSCCPQ 434

Query: 154  DHPICDLHAGTCLISKNNPIGVKALERTPA 65
            + P+CDL AGTCL+SK+NPIGVKALER PA
Sbjct: 435  EFPVCDLDAGTCLMSKDNPIGVKALERRPA 464


>ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
            gi|223542086|gb|EEF43630.1| cysteine protease, putative
            [Ricinus communis]
          Length = 471

 Score =  568 bits (1465), Expect = e-159
 Identities = 278/463 (60%), Positives = 337/463 (72%), Gaps = 18/463 (3%)
 Frame = -1

Query: 1375 FLFFFIC-----SGHDVKSEMSIINYDQTHMRPHRLRTHDEVLAIYRSWLTHHRRFYNAL 1211
            FL FFI      S      +MSI++Y+  H   + LRT  +V  +Y  WL  H + YNAL
Sbjct: 6    FLAFFILFSGLLSSFSSALDMSIVDYNIKHGTKYPLRTDSQVRRMYEMWLVEHGKAYNAL 65

Query: 1210 GENERRFEIFKDNLQFIDEHNADPNRSYKLGLNRFADMTNEEYRSKFMGMKTEMRNRARR 1031
            GE E+RFEIFKDNL+FIDEHN+  +RSYK+GLNRFAD+TNEEY++ F+G K E +NR   
Sbjct: 66   GEKEKRFEIFKDNLRFIDEHNS-VDRSYKVGLNRFADLTNEEYKAMFLGTKMERKNRFLG 124

Query: 1030 V-SHRYAVKLGGENLPESVDWREKGAVSPIKDQGQCGSCWAFSSIAAVEGINQIVTXXXX 854
              S RY  K  G++LPE+VDWREKGAV P+KDQGQCGSCWAFS++ AVEGINQIVT    
Sbjct: 125  TRSQRYLFK-DGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTVGAVEGINQIVTGELI 183

Query: 853  XXXXXXLVDCE-TQSSGCNGGLMDYAFEFILKNGGIDTEQDYPYHAVEGPCDTSRKKARV 677
                  LVDC+ + + GCNGGLMDYAFEFI+ NGGIDTE+DYPY A +  CD +RK A+V
Sbjct: 184  SLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDTEEDYPYKASDNICDPNRKNAKV 243

Query: 676  VSIDGYEDVPENDEYALKKAVAHQPVSVAVEAGGRAFQLYQSGVFSGTCGTEIDHGVVAV 497
            V+IDGYEDVPENDE +LKKAVAHQPVSVA+EAGGRAFQLY+SGVF+G CGTE+DHGVVAV
Sbjct: 244  VTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQLYKSGVFTGRCGTELDHGVVAV 303

Query: 496  GYGTENGIDYWLVRNSWGPSWGENGYIKLERNLKFTNTGKCGITMMASYPVK-------- 341
            GYGTENG++YW+VRNSWG +WGE+GYI++ERN+  T TGKCGI +  SYP K        
Sbjct: 304  GYGTENGVNYWIVRNSWGSAWGESGYIRMERNVANTKTGKCGIAIQPSYPTKKGANPPNP 363

Query: 340  ---XXXXXXXXXXXXXXTVCDDYYDCPPGNTXXXXXXXXXXXFGWGCCPFESATCCEDKY 170
                             TVCDDY+ CP GNT           FGWGCCP ESATCC+D  
Sbjct: 364  GPSPPSPVNPPPPVSPSTVCDDYFSCPDGNTCCCIYEYSGYCFGWGCCPLESATCCDDHN 423

Query: 169  SCCPHDHPICDLHAGTCLISKNNPIGVKALERTPARLYSSHVH 41
            SCCPH++P+CDL AGTC +SK+NP+GVKAL R PA+   +H++
Sbjct: 424  SCCPHEYPVCDLKAGTCRLSKDNPLGVKALRRGPAKRTHTHLN 466

