BLASTX nr result

ID: Cnidium21_contig00003913 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00003913
         (1411 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533377.1| cysteine protease, putative [Ricinus communi...   499   e-139
ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th...   491   e-136
ref|NP_001236888.1| cysteine proteinase precursor [Glycine max] ...   491   e-136
gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]             489   e-136
ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab...   486   e-135

>ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
            gi|223526784|gb|EEF29008.1| cysteine protease, putative
            [Ricinus communis]
          Length = 381

 Score =  499 bits (1286), Expect = e-139
 Identities = 240/355 (67%), Positives = 278/355 (78%), Gaps = 5/355 (1%)
 Frame = -3

Query: 1394 PLLTYALISVLLNYAPTIST----NSNIRQVTD-MDFTGNNNKLIGTATERHFISFMNNY 1230
            PL   A  ++ L  + T       +  I QVTD    T +N K +GT TE +F  FM  Y
Sbjct: 15   PLAILAFTTLTLTTSATSGDATLQDPTILQVTDDPSVTLSNRKFLGTNTEENFKMFMIKY 74

Query: 1229 GKEYSTREEYMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXX 1050
             KEY TREEYMHRLG+FAKN++RAAEHQ LDPTAVHG+T F DL+EEEFE  +       
Sbjct: 75   DKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPTAVHGITPFMDLTEEEFERMYTGVVGGG 134

Query: 1049 XXXXXXXXGEAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIAT 870
                      +  ++  GLP SFDWR+KGAVT VKMQG+CGSCWAFSTTG+IEGANFIAT
Sbjct: 135  AVGAEGVTATS-FLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFSTTGAIEGANFIAT 193

Query: 869  GKLIGLSEQQLVDCDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKP 690
            GKL+ LSEQQLVDCD  CD K+K++C+DGC GGLMTNAY YLI+AGG+E+E +YPYTGKP
Sbjct: 194  GKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGLEDEISYPYTGKP 253

Query: 689  GDCKFDPKKIAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICG 510
            G CKFD KKIAVRV NFT+IP DE QIAAHLVHHGPLA+GLNAVFMQTYIGGVSCPLICG
Sbjct: 254  GKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQTYIGGVSCPLICG 313

Query: 509  KKFLNHGVLLVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345
            KK++NHGVLLVGYGAKGFSILRLG +PYWIIKNSWG+ WGE+GYYR+C+G+ MCG
Sbjct: 314  KKWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEEGYYRICKGYGMCG 368


>ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
            gi|17979125|gb|AAL49820.1| putative cysteine proteinase
            [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1| Papain
            family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  491 bits (1265), Expect = e-136
 Identities = 233/346 (67%), Positives = 278/346 (80%), Gaps = 2/346 (0%)
 Frame = -3

Query: 1376 LISVLLNYAPTIST--NSNIRQVTDMDFTGNNNKLIGTATERHFISFMNNYGKEYSTREE 1203
            LI+ ++ +   +++  +  IRQVT  D       L+GT TE  F  FM++YGK YSTREE
Sbjct: 9    LITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREE 67

Query: 1202 YMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXXXXXXXXXXG 1023
            Y+HRLGIFAKN+L+AAEHQ +DP+AVHGVTQFSDL+EEEF+  +                
Sbjct: 68   YIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGA 127

Query: 1022 EAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIATGKLIGLSEQ 843
            EAP+V+V GLPE FDWREKG VT VK QG+CGSCWAFSTTG+ EGA+F++TGKL+ LSEQ
Sbjct: 128  EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

Query: 842  QLVDCDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKPGDCKFDPKK 663
            QLVDCD  CD KDK +C++GC GGLMTNAYEYL++AGG+EEE +YPYTGK G CKFDP+K
Sbjct: 188  QLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEK 247

Query: 662  IAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKKFLNHGVL 483
            +AVRV NFT IP DE QIAA+LV HGPLAVGLNAVFMQTYIGGVSCPLIC K+ +NHGVL
Sbjct: 248  VAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVL 307

Query: 482  LVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345
            LVGYG+KGFSILRL N+PYWIIKNSWG+ WGE GYY+LCRGH++CG
Sbjct: 308  LVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICG 353


>ref|NP_001236888.1| cysteine proteinase precursor [Glycine max] gi|479060|emb|CAA83673.1|
            cysteine proteinase [Glycine max]
            gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine
            max] gi|300507425|gb|ADK24077.1| cysteine proteinase
            [Glycine max] gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score =  491 bits (1263), Expect = e-136
 Identities = 225/307 (73%), Positives = 263/307 (85%)
 Frame = -3

Query: 1265 TERHFISFMNNYGKEYSTREEYMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEE 1086
            TE+ F  FM NYG+ YST EEY+ RLGIFA+NM+RAAEHQALDPTAVHGVTQFSDL+E+E
Sbjct: 50   TEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDE 109

Query: 1085 FETSFLXXXXXXXXXXXXXXGEAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFST 906
            FE  +               G AP ++V GLPE+FDWREKGAVT VK+QG CGSCWAFST
Sbjct: 110  FEKLYTGVNGGFPSSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFST 169

Query: 905  TGSIEGANFIATGKLIGLSEQQLVDCDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGI 726
            TGSIEGANF+ATGKL+ LSEQQL+DCD+ CD  +K+SC++GC+GGLMTNAY YL+++GG+
Sbjct: 170  TGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGL 229

Query: 725  EEEDAYPYTGKPGDCKFDPKKIAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQT 546
            EEE +YPYTG+ G+CKFDP+KIAV++TNFTNIP DE QIAA+LV +GPLA+G+NA+FMQT
Sbjct: 230  EEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQT 289

Query: 545  YIGGVSCPLICGKKFLNHGVLLVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLC 366
            YIGGVSCPLIC KK LNHGVLLVGYGAKGFSILRLGN+PYWIIKNSWGE WGE GYY+LC
Sbjct: 290  YIGGVSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLC 349

Query: 365  RGHNMCG 345
            RGH MCG
Sbjct: 350  RGHGMCG 356


>gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score =  489 bits (1259), Expect = e-136
 Identities = 233/342 (68%), Positives = 272/342 (79%), Gaps = 15/342 (4%)
 Frame = -3

Query: 1325 IRQVTDMDF-------TGNNNKLIGTATERHFISFMNNYGKEYSTREEYMHRLGIFAKNM 1167
            IRQVTD          +  N++L+GT TE HF SF+  Y K YST EEY+HRLGIFAKN+
Sbjct: 43   IRQVTDNHHHRHHPGRSSANHRLLGTTTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNL 102

Query: 1166 LRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXXXXXXXXXXGEAP--------V 1011
            ++AAEHQA+DP+A+HGVTQFSDL+EEEFE +++              G+          +
Sbjct: 103  IKAAEHQAMDPSAIHGVTQFSDLTEEEFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVM 162

Query: 1010 VDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIATGKLIGLSEQQLVD 831
            +DV  LPESFDWREKGAVT VK QG CGSCWAFSTTG+IEGANFIATGKL+ LSEQQLVD
Sbjct: 163  MDVSDLPESFDWREKGAVTEVKTQGRCGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVD 222

Query: 830  CDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKPGDCKFDPKKIAVR 651
            CDH CD K+K  C+DGCSGGLMT A+ YLI+AGGIEEE  YPYTGK G+CKF+P+K+AV+
Sbjct: 223  CDHMCDLKEKDDCDDGCSGGLMTTAFNYLIEAGGIEEEVTYPYTGKRGECKFNPEKVAVK 282

Query: 650  VTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKKFLNHGVLLVGY 471
            V NF  IP DE QIAA++VH+GPLA+GLNAVFMQTYIGGVSCPLIC KK +NHGVLLVGY
Sbjct: 283  VRNFAKIPEDESQIAANVVHNGPLAIGLNAVFMQTYIGGVSCPLICDKKRINHGVLLVGY 342

Query: 470  GAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345
            G++GFSILRLG +PYWIIKNSWG+ WGE GYYRLCRGHNMCG
Sbjct: 343  GSRGFSILRLGYKPYWIIKNSWGKRWGEHGYYRLCRGHNMCG 384


>ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
            lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein
            ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata]
          Length = 368

 Score =  486 bits (1252), Expect = e-135
 Identities = 233/347 (67%), Positives = 279/347 (80%), Gaps = 3/347 (0%)
 Frame = -3

Query: 1376 LISVLLNYAPTIST--NSNIRQVTDMDFTGNNNKLIGTATERHFISFMNNYGKEYSTREE 1203
            LI+ ++ +   +++  +  IRQVT  +     N L+GT TE  F  FM++YGK YSTREE
Sbjct: 9    LITCIIFFCHVVASVEDLTIRQVTADERRVRPN-LLGTHTESKFRVFMSDYGKNYSTREE 67

Query: 1202 YMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXXXXXXXXXXG 1023
            Y+HRLGIFAKN+L+AAEHQ +DPTAVHGVTQFSDL+EEEF+  +                
Sbjct: 68   YIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGHAVGA 127

Query: 1022 EAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIATGKLIGLSEQ 843
            EAP+V+V GLPE FDWREKG VT VK QG+CGSCWAFSTTG+ EGA+F++TGKL+ LSEQ
Sbjct: 128  EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

Query: 842  QLVDCDHT-CDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKPGDCKFDPK 666
            QLVDCD   CD KDK +C++GC GGLMTNAYEYL++AGG+EEE +YPYTGK G CKFDP+
Sbjct: 188  QLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPE 247

Query: 665  KIAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKKFLNHGV 486
            K+AVRV NFT IP DE+QIAA+LV  GPLAVGLNAVFMQTYIGGVSCPLIC K+ +NHGV
Sbjct: 248  KVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHGV 307

Query: 485  LLVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345
            LLVGYG+KGFSILRL N+PYWIIKNSWG+ WGE GYY+LCRGH++CG
Sbjct: 308  LLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICG 354


Top