BLASTX nr result
ID: Coptis21_contig00003219
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00003219
         (2054 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1...   582   e-163
ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor,...   552   e-154
ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1...   546   e-152
ref|NP_188636.2| aspartyl protease family protein [Arabidopsis t...   540   e-151
ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata] ...   536   e-150

>ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 582 bits (1501), Expect = e-163
 Identities = 292/448 (65%), Positives = 332/448 (74%)
 Frame = +3

Query: 396  HLNVQEIITGIKARPSKAMAYHKLSENDDASTGSKWKLNLVHRDVISEGNVSDYHQLFIG 575
            HLNV+E I G + P + H+ G KW + +VHRD +S GN D+ G
Sbjct: 44   HLNVKETIAGTRIIPLEVSEDHE-------EGGEKWMMKVVHRDQLSFGNSDDHRHRLDG 96

Query: 576  LMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGSEVISGMEQGSGEYFVRIGVGSPPRS 755
            +KRD KRV SY V+DFG++VISGMEQGSGEYFVRIGVGSPPRS
Sbjct: 97   RLKRDAKRVASLIRRLSSGGGG----SYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRS 152

Query: 756  QYMVIDSGSDIVWVQCQPCNQCYKQSDPVFDPANSASFAGVACSSNVCDRLENAGCRAGR 935
            QYMVIDSGSDIVWVQCQPC QCY QSDPVFDPA+SASF GV+CSS+VCDRLENAGC AGR
Sbjct: 153  QYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLENAGCHAGR 212

Query: 936  CRYAVSYGDGSYTRGTLALETITLGRTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSMSF 1115
            CRY VSYGDGSYT+GTLALET+T GRT+V +VAIGCGH+NRGMFV SMSF
Sbjct: 213  CRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSF 272

Query: 1116 IGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXXXX 1295
            +GQL GQTGG F YCLVSRG++S GSLVFGR+ALP GA WVPLVRNPRAPSF
Sbjct: 273  VGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGL 332

Query: 1296 XXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRLPLAAYEALRDSFIAGTAELPRTPVG 1475
            +RVP+SE++FRLTE G+GGVVMDTGTAVTRLP AY+A RD+F+A TA LPR G
Sbjct: 333  GVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRA-TG 391

Query: 1476 VSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPARNFLIPVDEMGTFCFAFAPSSTKLS 1655
            V+IFDTCYDL + VRVPTVSFYFS GP+LTLPARNFLIP+D+ GTFCFAFAPS++ LS
Sbjct: 392  VAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLS 451

Query: 1656 XXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
            SFDGANG+VGFGPN C
Sbjct: 452  ILGNIQQEGIQISFDGANGYVGFGPNIC 479

>ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 481

 Score = 552 bits (1422), Expect = e-154
 Identities = 282/450 (62%), Positives = 323/450 (71%), Gaps = 3/450 (0%)
 Frame = +3

Query: 399  LNVQEIITGIKARPSKAMAYHKLSEN-DDASTGSKWKLNLVHRDVISEGNVSDYHQL--F 569
            LNV+E IT +KA Y +L +N +D T KWKL LVHRD I+ N S Y F
Sbjct: 41   LNVKEAIT-----ETKASQYQELFDNQNDTLTEGKWKLKLVHRDKITAFNKSSYDHSHNF 95

Query: 570  IGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGSEVISGMEQGSGEYFVRIGVGSPP 749
            ++RD KRV SY V +FG+EV+SGM QGSGEYF+RIGVGSPP
Sbjct: 96   HARIQRDKKRVATLIRRLSPRDATS---SYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPP 152

Query: 750  RSQYMVIDSGSDIVWVQCQPCNQCYKQSDPVFDPANSASFAGVACSSNVCDRLENAGCRA 929
            R QY+VIDSGSDIVWVQCQPC QCY Q+DPVFDPA+SASF GV CSS+VC+R+ENAGC A
Sbjct: 153  REQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVCERIENAGCHA 212

Query: 930  GRCRYAVSYGDGSYTRGTLALETITLGRTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSM 1109
            G CRY V YGDGSYT+GTLALET+T GRTVV NVAIGCGH+NRGMFV SM
Sbjct: 213  GGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSM 272

Query: 1110 SFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXX 1289
            S +GQL GQTGG F YCLVSRG++S GSL FGR A+PVGA W+PL+RNPRAPSF
Sbjct: 273  SLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLS 332

Query: 1290 XXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRLPLAAYEALRDSFIAGTAELPRTP 1469
            M+VP+SED+F+L E G GGVVMDTGTAVTR+P AY A RD+FI T LPR
Sbjct: 333  GVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRAS 392

Query: 1470 VGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPARNFLIPVDEMGTFCFAFAPSSTK 1649
            GVSIFDTCY+L+ + VRVPTVSFYF+ GP+LTLPARNFLIPVD++GTFCFAFA S +
Sbjct: 393  -GVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSG 451

Query: 1650 LSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
            LS SFDGANGFVGFGPN C
Sbjct: 452  LSIIGNIQQEGIQISFDGANGFVGFGPNVC 481

>ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score = 546 bits (1406), Expect = e-152
 Identities = 276/451 (61%), Positives = 327/451 (72%), Gaps = 4/451 (0%)
 Frame = +3

Query: 399  LNVQEIITGIKARPS---KAMAYHKLSENDDASTGSKWKLNLVHRDVISEGNVS-DYHQL 566
            LNV++I+T K P+ K + + KL+ +AS+ +K+KL LVHRD + N S D+
Sbjct: 29   LNVKQILTETKLNPTNTYKHLQHQKLNIATEASSPAKYKLKLVHRDKVPTFNTSHDHRTR 88

Query: 567  FIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVNDFGSEVISGMEQGSGEYFVRIGVGSP 746
            F M+RD KRV +Y FGS+V+SGMEQGSGEYFVRIGVGSP
Sbjct: 89   FNARMQRDTKRVAALRRHLAAGKP-----TYAEEAFGSDVVSGMEQGSGEYFVRIGVGSP 143

Query: 747  PRSQYMVIDSGSDIVWVQCQPCNQCYKQSDPVFDPANSASFAGVACSSNVCDRLENAGCR 926
            PR+QY+VIDSGSDI+WVQC+PC QCY QSDPVF+PA+S+S+AGV+C+S VC ++NAGC
Sbjct: 144  PRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCH 203

Query: 927  AGRCRYAVSYGDGSYTRGTLALETITLGRTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXS 1106
            GRCRY VSYGDGSYT+GTLALET+T GRT++ NVAIGCGH N+GMFV
Sbjct: 204  EGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGP 263

Query: 1107 MSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGRDALPVGAVWVPLVRNPRAPSFXXXXX 1286
            MSF+GQL GQ GGTF YCLVSRG S G L FGR+A+PVGA WVPL+ NPRA SF
Sbjct: 264  MSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGL 323

Query: 1287 XXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTAVTRLPLAAYEALRDSFIAGTAELPRT 1466
            +RVP+SED+F+L+E G+GGVVMDTGTAVTRLP AAYEA RD+FIA T LPR
Sbjct: 324  SGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRA 383

Query: 1467 PVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVLTLPARNFLIPVDEMGTFCFAFAPSST 1646
            GVSIFDTCYDL + VRVPTVSFYFS GP+LTLPARNFLIPVD++G+FCFAFAPSS+
Sbjct: 384  S-GVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSS 442

Query: 1647 KLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
            LS S DGANGFVGFGPN C
Sbjct: 443  GLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473

>ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2; Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score = 540 bits (1390), Expect = e-151
 Identities = 265/417 (63%), Positives = 304/417 (72%)
 Frame = +3

Query: 489  TGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXVSYEVN 668
            + SK+ L L+HRD ++H M+RD RV YEVN
Sbjct: 55   SSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVN 114

Query: 669  DFGSEVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYKQSDPVFD 848
            DFGS+++SGM+QGSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC CYKQSDPVFD
Sbjct: 115  DFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFD 174

Query: 849  PANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLGRTVVHN 1028
            PA S S+ GV+C S+VCDR+EN+GC +G CRY V YGDGSYT+GTLALET+T +TVV N
Sbjct: 175  PAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRN 234

Query: 1029 VAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYGSLVFGR 1208
            VA+GCGH+NRGMF+ SMSF+GQL GQTGG FGYCLVSRG++S GSLVFGR
Sbjct: 235  VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGR 294

Query: 1209 DALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVVMDTGTA 1388
            +ALPVGA WVPLVRNPRAPSF +R+P+ + +F LTE G+GGVVMDTGTA
Sbjct: 295  EALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTA 354

Query: 1389 VTRLPLAAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYFSSGPVL 1568
            VTRLP AAY A RD F + TA LPR GVSIFDTCYDLS + VRVPTVSFYF+ GPVL
Sbjct: 355  VTRLPTAAYVAFRDGFKSQTANLPRAS-GVSIFDTCYDLSGFVSVRVPTVSFYFTEGPVL 413

Query: 1569 TLPARNFLIPVDEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGPNTC 1739
            TLPARNFL+PVD+ GT+CFAFA S T LS SFDGANGFVGFGPN C
Sbjct: 414  TLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470

>ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score = 536 bits (1381), Expect = e-150
 Identities = 268/423 (63%), Positives = 308/423 (72%), Gaps = 1/423 (0%)
 Frame = +3

Query: 474  NDDASTGSKWKLNLVHRDVISEGNVSDYHQLFIGLMKRDLKRVXXXXXXXXXXXXXXXXV 653
            +DD++ SK+ L L+HRD ++H M+RD RV
Sbjct: 52   SDDSN--SKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSD 109

Query: 654  S-YEVNDFGSEVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCNQCYKQ 830
            S YEVNDFGS+V+SGM+QGSGEYFVRIGVGSPPR QYMVIDSGSD+VWVQCQPC CYKQ
Sbjct: 110  SRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQ 169

Query: 831  SDPVFDPANSASFAGVACSSNVCDRLENAGCRAGRCRYAVSYGDGSYTRGTLALETITLG 1010
            SDPVFDPA S S+ GV+C S+VCDR+EN+GC +G CRY V YGDGSYT+GTLALET+T
Sbjct: 170  SDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFA 229

Query: 1011 RTVVHNVAIGCGHKNRGMFVXXXXXXXXXXXSMSFIGQLEGQTGGTFGYCLVSRGSNSYG 1190
            +TVV NVA+GCGH+NRGMF+ SMSF+GQL GQTGG FGYCLVSRG++S G
Sbjct: 230  KTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTG 289

Query: 1191 SLVFGRDALPVGAVWVPLVRNPRAPSFXXXXXXXXXXXXMRVPVSEDLFRLTEWGEGGVV 1370
            SLVFGR+ALPVGA WVPLVRNPRAPSF +R+P+ + +F LTE G+GGVV
Sbjct: 290  SLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVV 349

Query: 1371 MDTGTAVTRLPLAAYEALRDSFIAGTAELPRTPVGVSIFDTCYDLSNYNMVRVPTVSFYF 1550
            MDTGTAVTRLP AY A RD F + TA LPR GVSIFDTCYDLS + VRVPTVSFYF
Sbjct: 350  MDTGTAVTRLPTGAYAAFRDGFKSQTANLPRAS-GVSIFDTCYDLSGFVSVRVPTVSFYF 408

Query: 1551 SSGPVLTLPARNFLIPVDEMGTFCFAFAPSSTKLSXXXXXXXXXXXXSFDGANGFVGFGP 1730
            + GPVLTLPARNFL+PVD+ GT+CFAFA S T LS SFDGANGFVGFGP
Sbjct: 409  TEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGP 468

Query: 1731 NTC 1739
            N C
Sbjct: 469  NVC 471