BLASTX nr result
ID: Cnidium21_contig00003913
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cnidium21_contig00003913 (1411 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002533377.1| cysteine protease, putative [Ricinus communi... 499 e-139 ref|NP_567010.5| Papain family cysteine protease [Arabidopsis th... 491 e-136 ref|NP_001236888.1| cysteine proteinase precursor [Glycine max] ... 491 e-136 gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora] 489 e-136 ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arab... 486 e-135 >ref|XP_002533377.1| cysteine protease, putative [Ricinus communis] gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis] Length = 381 Score = 499 bits (1286), Expect = e-139 Identities = 240/355 (67%), Positives = 278/355 (78%), Gaps = 5/355 (1%) Frame = -3 Query: 1394 PLLTYALISVLLNYAPTIST----NSNIRQVTD-MDFTGNNNKLIGTATERHFISFMNNY 1230 PL A ++ L + T + I QVTD T +N K +GT TE +F FM Y Sbjct: 15 PLAILAFTTLTLTTSATSGDATLQDPTILQVTDDPSVTLSNRKFLGTNTEENFKMFMIKY 74 Query: 1229 GKEYSTREEYMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXX 1050 KEY TREEYMHRLG+FAKN++RAAEHQ LDPTAVHG+T F DL+EEEFE + Sbjct: 75 DKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPTAVHGITPFMDLTEEEFERMYTGVVGGG 134 Query: 1049 XXXXXXXXGEAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIAT 870 + ++ GLP SFDWR+KGAVT VKMQG+CGSCWAFSTTG+IEGANFIAT Sbjct: 135 AVGAEGVTATS-FLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFSTTGAIEGANFIAT 193 Query: 869 GKLIGLSEQQLVDCDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKP 690 GKL+ LSEQQLVDCD CD K+K++C+DGC GGLMTNAY YLI+AGG+E+E +YPYTGKP Sbjct: 194 GKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIEAGGLEDEISYPYTGKP 253 Query: 689 GDCKFDPKKIAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICG 510 G CKFD KKIAVRV NFT+IP DE QIAAHLVHHGPLA+GLNAVFMQTYIGGVSCPLICG Sbjct: 254 GKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQTYIGGVSCPLICG 313 Query: 509 KKFLNHGVLLVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345 KK++NHGVLLVGYGAKGFSILRLG +PYWIIKNSWG+ WGE+GYYR+C+G+ MCG Sbjct: 314 KKWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEEGYYRICKGYGMCG 368 >ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana] gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana] gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana] Length = 367 Score = 491 bits (1265), Expect = e-136 Identities = 233/346 (67%), Positives = 278/346 (80%), Gaps = 2/346 (0%) Frame = -3 Query: 1376 LISVLLNYAPTIST--NSNIRQVTDMDFTGNNNKLIGTATERHFISFMNNYGKEYSTREE 1203 LI+ ++ + +++ + IRQVT D L+GT TE F FM++YGK YSTREE Sbjct: 9 LITCIILFCHVVASVEDLTIRQVT-ADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREE 67 Query: 1202 YMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXXXXXXXXXXG 1023 Y+HRLGIFAKN+L+AAEHQ +DP+AVHGVTQFSDL+EEEF+ + Sbjct: 68 YIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGA 127 Query: 1022 EAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIATGKLIGLSEQ 843 EAP+V+V GLPE FDWREKG VT VK QG+CGSCWAFSTTG+ EGA+F++TGKL+ LSEQ Sbjct: 128 EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187 Query: 842 QLVDCDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKPGDCKFDPKK 663 QLVDCD CD KDK +C++GC GGLMTNAYEYL++AGG+EEE +YPYTGK G CKFDP+K Sbjct: 188 QLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEK 247 Query: 662 IAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKKFLNHGVL 483 +AVRV NFT IP DE QIAA+LV HGPLAVGLNAVFMQTYIGGVSCPLIC K+ +NHGVL Sbjct: 248 VAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGVSCPLICSKRNVNHGVL 307 Query: 482 LVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345 LVGYG+KGFSILRL N+PYWIIKNSWG+ WGE GYY+LCRGH++CG Sbjct: 308 LVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICG 353 >ref|NP_001236888.1| cysteine proteinase precursor [Glycine max] gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max] gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max] gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max] gi|1096153|prf||2111244A Cys protease Length = 380 Score = 491 bits (1263), Expect = e-136 Identities = 225/307 (73%), Positives = 263/307 (85%) Frame = -3 Query: 1265 TERHFISFMNNYGKEYSTREEYMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEE 1086 TE+ F FM NYG+ YST EEY+ RLGIFA+NM+RAAEHQALDPTAVHGVTQFSDL+E+E Sbjct: 50 TEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQALDPTAVHGVTQFSDLTEDE 109 Query: 1085 FETSFLXXXXXXXXXXXXXXGEAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFST 906 FE + G AP ++V GLPE+FDWREKGAVT VK+QG CGSCWAFST Sbjct: 110 FEKLYTGVNGGFPSSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQGRCGSCWAFST 169 Query: 905 TGSIEGANFIATGKLIGLSEQQLVDCDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGI 726 TGSIEGANF+ATGKL+ LSEQQL+DCD+ CD +K+SC++GC+GGLMTNAY YL+++GG+ Sbjct: 170 TGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGL 229 Query: 725 EEEDAYPYTGKPGDCKFDPKKIAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQT 546 EEE +YPYTG+ G+CKFDP+KIAV++TNFTNIP DE QIAA+LV +GPLA+G+NA+FMQT Sbjct: 230 EEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNAIFMQT 289 Query: 545 YIGGVSCPLICGKKFLNHGVLLVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLC 366 YIGGVSCPLIC KK LNHGVLLVGYGAKGFSILRLGN+PYWIIKNSWGE WGE GYY+LC Sbjct: 290 YIGGVSCPLICSKKRLNHGVLLVGYGAKGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLC 349 Query: 365 RGHNMCG 345 RGH MCG Sbjct: 350 RGHGMCG 356 >gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora] Length = 397 Score = 489 bits (1259), Expect = e-136 Identities = 233/342 (68%), Positives = 272/342 (79%), Gaps = 15/342 (4%) Frame = -3 Query: 1325 IRQVTDMDF-------TGNNNKLIGTATERHFISFMNNYGKEYSTREEYMHRLGIFAKNM 1167 IRQVTD + N++L+GT TE HF SF+ Y K YST EEY+HRLGIFAKN+ Sbjct: 43 IRQVTDNHHHRHHPGRSSANHRLLGTTTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNL 102 Query: 1166 LRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXXXXXXXXXXGEAP--------V 1011 ++AAEHQA+DP+A+HGVTQFSDL+EEEFE +++ G+ + Sbjct: 103 IKAAEHQAMDPSAIHGVTQFSDLTEEEFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVM 162 Query: 1010 VDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIATGKLIGLSEQQLVD 831 +DV LPESFDWREKGAVT VK QG CGSCWAFSTTG+IEGANFIATGKL+ LSEQQLVD Sbjct: 163 MDVSDLPESFDWREKGAVTEVKTQGRCGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVD 222 Query: 830 CDHTCDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKPGDCKFDPKKIAVR 651 CDH CD K+K C+DGCSGGLMT A+ YLI+AGGIEEE YPYTGK G+CKF+P+K+AV+ Sbjct: 223 CDHMCDLKEKDDCDDGCSGGLMTTAFNYLIEAGGIEEEVTYPYTGKRGECKFNPEKVAVK 282 Query: 650 VTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKKFLNHGVLLVGY 471 V NF IP DE QIAA++VH+GPLA+GLNAVFMQTYIGGVSCPLIC KK +NHGVLLVGY Sbjct: 283 VRNFAKIPEDESQIAANVVHNGPLAIGLNAVFMQTYIGGVSCPLICDKKRINHGVLLVGY 342 Query: 470 GAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345 G++GFSILRLG +PYWIIKNSWG+ WGE GYYRLCRGHNMCG Sbjct: 343 GSRGFSILRLGYKPYWIIKNSWGKRWGEHGYYRLCRGHNMCG 384 >ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata] gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp. lyrata] Length = 368 Score = 486 bits (1252), Expect = e-135 Identities = 233/347 (67%), Positives = 279/347 (80%), Gaps = 3/347 (0%) Frame = -3 Query: 1376 LISVLLNYAPTIST--NSNIRQVTDMDFTGNNNKLIGTATERHFISFMNNYGKEYSTREE 1203 LI+ ++ + +++ + IRQVT + N L+GT TE F FM++YGK YSTREE Sbjct: 9 LITCIIFFCHVVASVEDLTIRQVTADERRVRPN-LLGTHTESKFRVFMSDYGKNYSTREE 67 Query: 1202 YMHRLGIFAKNMLRAAEHQALDPTAVHGVTQFSDLSEEEFETSFLXXXXXXXXXXXXXXG 1023 Y+HRLGIFAKN+L+AAEHQ +DPTAVHGVTQFSDL+EEEF+ + Sbjct: 68 YIHRLGIFAKNVLKAAEHQMMDPTAVHGVTQFSDLTEEEFKRMYTGVADVGGSRGHAVGA 127 Query: 1022 EAPVVDVKGLPESFDWREKGAVTPVKMQGSCGSCWAFSTTGSIEGANFIATGKLIGLSEQ 843 EAP+V+V GLPE FDWREKG VT VK QG+CGSCWAFSTTG+ EGA+F++TGKL+ LSEQ Sbjct: 128 EAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187 Query: 842 QLVDCDHT-CDTKDKSSCNDGCSGGLMTNAYEYLIKAGGIEEEDAYPYTGKPGDCKFDPK 666 QLVDCD CD KDK +C++GC GGLMTNAYEYL++AGG+EEE +YPYTGK G CKFDP+ Sbjct: 188 QLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPE 247 Query: 665 KIAVRVTNFTNIPGDEEQIAAHLVHHGPLAVGLNAVFMQTYIGGVSCPLICGKKFLNHGV 486 K+AVRV NFT IP DE+QIAA+LV GPLAVGLNAVFMQTYIGGVSCPLIC K+ +NHGV Sbjct: 248 KVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPLICSKRKVNHGV 307 Query: 485 LLVGYGAKGFSILRLGNRPYWIIKNSWGENWGEQGYYRLCRGHNMCG 345 LLVGYG+KGFSILRL N+PYWIIKNSWG+ WGE GYY+LCRGH++CG Sbjct: 308 LLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRGHDICG 354