BLASTX nr result
ID: Coptis23_contig00012468
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00012468
         (1848 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ric...   728   0.0
gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]            726   0.0
ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ric...   725   0.0
gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]            724   0.0
ref|XP_002298827.1| predicted protein [Populus trichocarpa] gi|2...   723   0.0

>ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ricinus communis]
 gi|223542290|gb|EEF43832.1| Aspartic proteinase precursor, putative [Ricinus communis]
          Length = 511

 Score = 728 bits (1878), Expect = 0.0
 Identities = 349/489 (71%), Positives = 404/489 (82%), Gaps = 11/489 (2%)
 Frame = +2

Query: 206  TTSNDDALVRITLKKRTLDTDNQRVARLESQER-----------MKYSGANSDIVGLKNY 352
            +++ +D LVR+ LKK LD +++ ARLES+ ++ ++DIV LKNY
Sbjct: 23   SSAPNDGLVRLGLKKMKLDENSRLAARLESKNAEALRASVRKYGLRGDSKDTDIVALKNY 82

Query: 353  MDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACFFHSKYKASQSSTYKQNG 532
            +DAQYYGEIG+GTPPQKFTV+FDTGSSNLWVPSSKC SVACFFHS+YK+ QSSTYK+NG
Sbjct: 83   LDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSSKCIFSVACFFHSRYKSGQSSTYKKNG 142

Query: 533  KPAEIHYGSGSIAGYFSEDDVTVGDIVVKQQEFIEATSEPGVTFMAAKFDGILGLGFQEI 712
            K AEIHYGSG+I+G+FS D+V VG++VVK QEFIEAT EPGVTF+AAKFDGILGLGFQEI
Sbjct: 143  KSAEIHYGSGAISGFFSSDNVVVGNLVVKDQEFIEATKEPGVTFVAAKFDGILGLGFQEI 202

Query: 713  SVGKAVPVWYNMLTQGLIKEPAFSFWLNRNAQEEEGGEIVFGGSDPNHYKGKHTYVPVTQ 892
            SVG AVPVWYNM+ QGLIKEP FSFWLNRN Q EEGGEIVFGG D NHYKGKHTYVPVTQ
Sbjct: 203  SVGNAVPVWYNMIKQGLIKEPVFSFWLNRNTQGEEGGEIVFGGVDLNHYKGKHTYVPVTQ 262

Query: 893  KGYWQFDMGDVLIXXXXXXXXXXXXAAIADSGTSLLAGPTAIITEINHAIGAAGVLSWEC 1072
            KGYWQF+MGDVLI +AIADSGTSLLAGPT ++T IN AIGA GV S EC
Sbjct: 263  KGYWQFEMGDVLIGHKPTEYCAGGCSAIADSGTSLLAGPTTVVTLINEAIGATGVASQEC 322

Query: 1073 KAVVSQYGEVIMDLLLNEVQPKKVCSKIGLCTFDGARGVSAGIESVVDENGEGLSGGLND 1252
            K V++QYGE IMDLL+ E QPKK+CS+IGLCTFDG RGVS GI+SVVD+N + SG + D
Sbjct: 323  KTVIAQYGETIMDLLIAEAQPKKICSQIGLCTFDGTRGVSMGIQSVVDDNNDKSSGIVRD 382

Query: 1253 AMCSACEMTVVWMKNQLSQNQTQDRAFSYVNELCEKLPNPMGESAVDCNRLSTMPKVSFT 1432
            AMCSACEMTVVWM+NQL +NQTQDR +YVNELC+++PNP+GES VDC +S+MP VSFT
Sbjct: 383  AMCSACEMTVVWMQNQLRENQTQDRILNYVNELCDRIPNPLGESIVDCGSISSMPVVSFT 442

Query: 1433 IASKLFELEPHEYVLKVGEGSAAQCISGFMAMDVPRPRGPLWILGDIFMGRYHTVFDFGG 1612
            I K+F+L P EY+LKVGEG+ AQCISGFMA+DVP PRGPLWILGDIFMGRYHTVFD+G
Sbjct: 443  IGGKVFDLSPQEYILKVGEGAQAQCISGFMALDVPPPRGPLWILGDIFMGRYHTVFDYGN 502

Query: 1613 QRVGFAEAA 1639
            RVGFAEAA
Sbjct: 503  LRVGFAEAA 511

>gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
          Length = 513

 Score = 726 bits (1874), Expect = 0.0
 Identities = 354/491 (72%), Positives = 402/491 (81%), Gaps = 14/491 (2%)
 Frame = +2

Query: 209  TSNDDALVRITLKKRTLDTDNQRVARLESQE--------RMKYSGANS------DIVGLK 346
            ++++ LVRI LKK LD +N+ A+LES++ R Y NS DIV LK
Sbjct: 24   SASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSASIRKYYLRGNSGDPEDIDIVSLK 83

Query: 347  NYMDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACFFHSKYKASQSSTYKQ 526
            NYMDAQY+GEIGVGTPPQKFTVIFDTGSSNLWVPSSKCY SVAC+FHSKYK+S SSTYK+
Sbjct: 84   NYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKK 143

Query: 527  NGKPAEIHYGSGSIAGYFSEDDVTVGDIVVKQQEFIEATSEPGVTFMAAKFDGILGLGFQ 706
            NGKPA+IHYG+G+I+GYFS+D V VGD+VVK QEFIEAT EP +TF+ AKFDGILGLGF+
Sbjct: 144  NGKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFK 203

Query: 707  EISVGKAVPVWYNMLTQGLIKEPAFSFWLNRNAQEEEGGEIVFGGSDPNHYKGKHTYVPV 886
            EISVG AVPVWYNM+ QGL+KEP FSFW NRN EEEGGEIVFGG DPNHYKGKHTYVPV
Sbjct: 204  EISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPV 263

Query: 887  TQKGYWQFDMGDVLIXXXXXXXXXXXXAAIADSGTSLLAGPTAIITEINHAIGAAGVLSW 1066
            TQKGYWQFDMGDVLI +AIADSGTSLLAGPT IITE+NHAIGA GV+S
Sbjct: 264  TQKGYWQFDMGDVLIDGQTTGFCARGCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQ 323

Query: 1067 ECKAVVSQYGEVIMDLLLNEVQPKKVCSKIGLCTFDGARGVSAGIESVVDENGEGLSGGL 1246
            ECKAVV++YGE I+ +LL + QP K+CS+IGLCTFDG RGVS IESVVD N S GL
Sbjct: 324  ECKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDGVRGVSMDIESVVD-NTRKASNGL 382

Query: 1247 NDAMCSACEMTVVWMKNQLSQNQTQDRAFSYVNELCEKLPNPMGESAVDCNRLSTMPKVS 1426
            DAMCS CEMTVVWM+NQL QNQTQDR +YVNELC++LP+PMGESAVDC LS++P VS
Sbjct: 383  RDAMCSTCEMTVVWMQNQLKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVS 442

Query: 1427 FTIASKLFELEPHEYVLKVGEGSAAQCISGFMAMDVPRPRGPLWILGDIFMGRYHTVFDF 1606
            TI ++F+L P +YVLKVGEG AAQCISGF A+DVP PRGPLWILGD+FMGRYHTVFD+
Sbjct: 443  LTIGGRVFDLSPEQYVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDY 502

Query: 1607 GGQRVGFAEAA 1639
            G QRVGFAEAA
Sbjct: 503  GNQRVGFAEAA 513

>ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
 gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis]
          Length = 514

 Score = 725 bits (1871), Expect = 0.0
 Identities = 345/493 (69%), Positives = 407/493 (82%), Gaps = 14/493 (2%)
 Frame = +2

Query: 203  STTSNDDALVRITLKKRTLDTDNQRVARLESQERMKYSGA--------------NSDIVG 340
            + +S++D LVRI LKKR D +N+ A+ ES+E + + + DIV
Sbjct: 22   TASSSNDGLVRIGLKKRKFDQNNRVAAQFESKEGEAFRASIKKYHIRGNLGDAEDIDIVS 81

Query: 341  LKNYMDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACFFHSKYKASQSSTY 520
            LKNYMDAQY+GEIG+GTPPQKFTVIFDTGSSNLWVPSSKCY SVAC+FHSKYK+ QSSTY
Sbjct: 82   LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSGQSSTY 141

Query: 521  KQNGKPAEIHYGSGSIAGYFSEDDVTVGDIVVKQQEFIEATSEPGVTFMAAKFDGILGLG 700
            K+NGK A+IHYG+G+I+G+FS+D+V VG++V+K QEFIEAT EP +TF+ AKFDGILGLG
Sbjct: 142  KKNGKSADIHYGTGAISGFFSQDNVKVGELVIKNQEFIEATREPSITFLVAKFDGILGLG 201

Query: 701  FQEISVGKAVPVWYNMLTQGLIKEPAFSFWLNRNAQEEEGGEIVFGGSDPNHYKGKHTYV 880
            FQEISVG AVPVWYNM+ QGL+KEP FSFW NRNA E+EGGEIVFGG DPNHYKG+HTYV
Sbjct: 202  FQEISVGNAVPVWYNMVNQGLVKEPVFSFWFNRNADEDEGGEIVFGGMDPNHYKGEHTYV 261

Query: 881  PVTQKGYWQFDMGDVLIXXXXXXXXXXXXAAIADSGTSLLAGPTAIITEINHAIGAAGVL 1060
            PVTQKGYWQFDMGDVLI AAIADSGTSLLAGPT IITE+NHAIGA GV+
Sbjct: 262  PVTQKGYWQFDMGDVLIDGKTTGICSSGCAAIADSGTSLLAGPTTIITEVNHAIGATGVV 321

Query: 1061 SWECKAVVSQYGEVIMDLLLNEVQPKKVCSKIGLCTFDGARGVSAGIESVVDENGEGLSG 1240
            S ECKAVV+QYGE I+ +LL + QP+K+CS+IGLCTFDG+RGVS GIESVV+E + ++G
Sbjct: 322  SQECKAVVAQYGETIIAMLLAKDQPQKICSQIGLCTFDGSRGVSMGIESVVNEKIQEVAG 381

Query: 1241 GLNDAMCSACEMTVVWMKNQLSQNQTQDRAFSYVNELCEKLPNPMGESAVDCNRLSTMPK 1420
            GL+DAMCS CEM VVWM+NQL QNQTQ+ +YVNELCE+LP+PMGESAVDC LSTMP
Sbjct: 382  GLHDAMCSTCEMAVVWMQNQLKQNQTQEHILNYVNELCERLPSPMGESAVDCGSLSTMPN 441

Query: 1421 VSFTIASKLFELEPHEYVLKVGEGSAAQCISGFMAMDVPRPRGPLWILGDIFMGRYHTVF 1600
            VSFTI ++F+L P +YVLKVG+G AAQCISGF A+DVP PRGPLWILGD+FMG +HTVF
Sbjct: 442  VSFTIGGRVFDLAPEQYVLKVGDGEAAQCISGFTALDVPPPRGPLWILGDVFMGPFHTVF 501

Query: 1601 DFGGQRVGFAEAA 1639
            D+G +RVGFAE A
Sbjct: 502  DYGNKRVGFAEVA 514

>gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]
          Length = 513

 Score = 724 bits (1870), Expect = 0.0
 Identities = 353/491 (71%), Positives = 402/491 (81%), Gaps = 14/491 (2%)
 Frame = +2

Query: 209  TSNDDALVRITLKKRTLDTDNQRVARLESQE--------RMKYSGANS------DIVGLK 346
            ++++ LVRI LKK LD +N+ A+LES++ R Y NS DIV LK
Sbjct: 24   SASNGGLVRIGLKKMKLDKNNRVAAQLESKDGEVRSASIRKYYLRGNSGDPEDIDIVSLK 83

Query: 347  NYMDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACFFHSKYKASQSSTYKQ 526
            NYMDAQY+GEIGVGTPPQKFTVIFDTGSSNLWVPSSKCY SVAC+FHSKYK+S SSTYK+
Sbjct: 84   NYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSSSSTYKK 143

Query: 527  NGKPAEIHYGSGSIAGYFSEDDVTVGDIVVKQQEFIEATSEPGVTFMAAKFDGILGLGFQ 706
            NGKPA+IHYG+G+I+GYFS+D V VGD+VVK QEFIEAT EP +TF+ AKFDGILGLGF+
Sbjct: 144  NGKPADIHYGTGAISGYFSQDHVKVGDLVVKNQEFIEATREPSITFLVAKFDGILGLGFK 203

Query: 707  EISVGKAVPVWYNMLTQGLIKEPAFSFWLNRNAQEEEGGEIVFGGSDPNHYKGKHTYVPV 886
            EISVG AVPVWYNM+ QGL+KEP FSFW NRN EEEGGEIVFGG DPNHYKGKHTYVPV
Sbjct: 204  EISVGNAVPVWYNMVKQGLVKEPVFSFWFNRNTDEEEGGEIVFGGVDPNHYKGKHTYVPV 263

Query: 887  TQKGYWQFDMGDVLIXXXXXXXXXXXXAAIADSGTSLLAGPTAIITEINHAIGAAGVLSW 1066
            TQKGYWQFDMGDVLI +AIADSGTSLLAGPT IITE+NHAIGA GV+S
Sbjct: 264  TQKGYWQFDMGDVLIDGQTTGFCVTTCSAIADSGTSLLAGPTTIITEVNHAIGATGVVSQ 323

Query: 1067 ECKAVVSQYGEVIMDLLLNEVQPKKVCSKIGLCTFDGARGVSAGIESVVDENGEGLSGGL 1246
            ECKAVV++YGE I+ +LL + QP K+CS+IGLCTFDG +GVS IESVVD N S GL
Sbjct: 324  ECKAVVAEYGETIIKMLLEKDQPMKICSQIGLCTFDGTQGVSMDIESVVD-NTHKASNGL 382

Query: 1247 NDAMCSACEMTVVWMKNQLSQNQTQDRAFSYVNELCEKLPNPMGESAVDCNRLSTMPKVS 1426
            DAMCS CEMTVVWM+NQL QNQTQDR +YVNELC++LP+PMGESAVDC LS++P VS
Sbjct: 383  RDAMCSTCEMTVVWMQNQLKQNQTQDRILTYVNELCDRLPSPMGESAVDCGSLSSLPNVS 442

Query: 1427 FTIASKLFELEPHEYVLKVGEGSAAQCISGFMAMDVPRPRGPLWILGDIFMGRYHTVFDF 1606
            TI ++F+L P +YVLKVGEG AAQCISGF A+DVP PRGPLWILGD+FMGRYHTVFD+
Sbjct: 443  LTIGGRVFDLSPEQYVLKVGEGEAAQCISGFTALDVPPPRGPLWILGDVFMGRYHTVFDY 502

Query: 1607 GGQRVGFAEAA 1639
            G QRVGFAEAA
Sbjct: 503  GNQRVGFAEAA 513

>ref|XP_002298827.1| predicted protein [Populus trichocarpa]
 gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score = 723 bits (1867), Expect = 0.0
 Identities = 347/484 (71%), Positives = 401/484 (82%), Gaps = 11/484 (2%)
 Frame = +2

Query: 218  DDALVRITLKKRTLDTDNQRVARLESQER---MKY--------SGANSDIVGLKNYMDAQ 364
            +D L+RI LKKR + +N+ A+LES+E KY ++DIV LKNYMDAQ
Sbjct: 10   NDGLIRIGLKKRKYERNNRLAAKLESKEGESIKKYHLLRNLGGDAEDTDIVSLKNYMDAQ 69

Query: 365  YYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYLSVACFFHSKYKASQSSTYKQNGKPAE 544
            Y+GEIG+GTPPQKFTVIFDTGSSNLWVPSSKCY SVAC+FHSKYK+S S TYK+NGK AE
Sbjct: 70   YFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKCYFSVACYFHSKYKSSHSRTYKENGKSAE 129

Query: 545  IHYGSGSIAGYFSEDDVTVGDIVVKQQEFIEATSEPGVTFMAAKFDGILGLGFQEISVGK 724
            IHYG+G+I+G+FS+D V VGD+VVK QEFIEAT EP VTF+ AKFDGILGLGFQEISVGK
Sbjct: 130  IHYGTGAISGFFSQDHVKVGDLVVKNQEFIEATREPSVTFLVAKFDGILGLGFQEISVGK 189

Query: 725  AVPVWYNMLTQGLIKEPAFSFWLNRNAQEEEGGEIVFGGSDPNHYKGKHTYVPVTQKGYW 904
            AVPVWYNM+ QGL+KEP FSFW NRNA E+EGGEIVFGG DP+HYKG+HTYVPVTQKGYW
Sbjct: 190  AVPVWYNMVEQGLVKEPVFSFWFNRNADEKEGGEIVFGGVDPDHYKGEHTYVPVTQKGYW 249

Query: 905  QFDMGDVLIXXXXXXXXXXXXAAIADSGTSLLAGPTAIITEINHAIGAAGVLSWECKAVV 1084
            QFDMGDVLI AAIADSGTSLLAGPT IITE+NHAIGA GV+S ECKAVV
Sbjct: 250  QFDMGDVLIGGQTSGFCASGCAAIADSGTSLLAGPTTIITEVNHAIGATGVVSQECKAVV 309

Query: 1085 SQYGEVIMDLLLNEVQPKKVCSKIGLCTFDGARGVSAGIESVVDENGEGLSGGLNDAMCS 1264
            +QYG+ IM++LL + QP+K+C++IGLCTFDG RGVS GIESVV+E+ + S G +DAMCS
Sbjct: 310  AQYGDTIMEMLLAKDQPQKICAQIGLCTFDGTRGVSMGIESVVNEHAQKASDGFHDAMCS 369

Query: 1265 ACEMTVVWMKNQLSQNQTQDRAFSYVNELCEKLPNPMGESAVDCNRLSTMPKVSFTIASK 1444
            CEM VVWM+NQL QNQTQ+R YVNELCE+LP+PMGESAVDC+ LS+MP VSFTI +
Sbjct: 370  TCEMAVVWMQNQLKQNQTQERILDYVNELCERLPSPMGESAVDCDGLSSMPNVSFTIGGR 429

Query: 1445 LFELEPHEYVLKVGEGSAAQCISGFMAMDVPRPRGPLWILGDIFMGRYHTVFDFGGQRVG 1624
            +FEL P +YVLKVGEG AQCISGF A+DVP PRGPLWILGD+FMG +HTVFD+G RVG
Sbjct: 430  VFELSPEQYVLKVGEGDVAQCISGFTALDVPPPRGPLWILGDVFMGSFHTVFDYGNMRVG 489

Query: 1625 FAEA 1636
            FAEA
Sbjct: 490  FAEA 493