BLASTX nr result
ID: Coptis24_contig00008280
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00008280 (2300 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002283247.2| PREDICTED: uncharacterized protein LOC100247... 278 6e-72 ref|XP_004134381.1| PREDICTED: uncharacterized protein LOC101205... 275 3e-71 ref|XP_002304703.1| predicted protein [Populus trichocarpa] gi|2... 263 2e-67 ref|XP_002518133.1| hypothetical protein RCOM_1020610 [Ricinus c... 256 1e-65 ref|XP_003535221.1| PREDICTED: uncharacterized protein LOC100789... 239 3e-60 >ref|XP_002283247.2| PREDICTED: uncharacterized protein LOC100247656 [Vitis vinifera] Length = 363 Score = 278 bits (710), Expect = 6e-72 Identities = 170/333 (51%), Positives = 216/333 (64%), Gaps = 7/333 (2%) Frame = +2 Query: 1322 SFNPGPTPFLAQASSNIRSEMDRESGLRYGATKRVRRQRKRGSACSDGGESL--IFDDLE 1495 S GP P AQ +SN+ S ++ + + ++RQRK GSA S+ GES +F+D E Sbjct: 2 SLQHGPAPSGAQTTSNLMSGSNQVNVRHRARSILIKRQRKHGSATSNTGESSTSVFNDSE 61 Query: 1496 I--LGPSEEPSNARSTRIRSNRHCGSLHPVIEIEDVSP-VQSGGLRNMTQMDNDDSDVRA 1666 I LG S EPSN+ STR +S+ G + PVIEI++ SP ++ RN M+NDDSD RA Sbjct: 62 IMFLGSSGEPSNSGSTRSQSHHGQGVMEPVIEIDEQSPEIRHVASRNGGSMNNDDSDARA 121 Query: 1667 RQVEADEMLARELQEEFYHELPGVGGGEIDASIAWGLQQEEDVERAASLRRQRVHPRDA- 1843 RQ+EADE+LARELQE+ YHE+P GG IDA IA LQQ+E V+ +S R RV PR + Sbjct: 122 RQIEADEILARELQEQLYHEMPVDGGVGIDAHIAQMLQQQEQVQPTSSSRNHRV-PRASG 180 Query: 1844 -SMSHLYRQQQTQALQNSSVRYTNRAIRARSRAPTSARGAQFRDRRIGSSPTISARGRHM 2020 ++S LYRQ Q+++ QN S+R +A R PTS R AQ R R S I + R++ Sbjct: 181 PAISRLYRQSQSRSSQNPSIRRGTQA-----RGPTSTRMAQLRSRFPNQSHAIPSGERNL 235 Query: 2021 
HFPQNMDVEMRIHIXXXXXXXXXGISEDVTRASTLFHAQRDFNENDYEMLLALDENNHQH 2200 HFP NMD++MRI I G D+ + QRDFNENDYEMLLALDENNH + Sbjct: 236 HFPLNMDLDMRIDI-LEALEAAVGDFGDMRMPGHILQIQRDFNENDYEMLLALDENNH-N 293 Query: 2201 VGATSNQINGLPQSTVQTDNFAEACAICLETPT 2299 VGA+ NQ+N LPQSTVQTDNF E+CAICLETPT Sbjct: 294 VGASVNQMNSLPQSTVQTDNFEESCAICLETPT 326 >ref|XP_004134381.1| PREDICTED: uncharacterized protein LOC101205482 [Cucumis sativus] Length = 803 Score = 275 bits (704), Expect = 3e-71 Identities = 206/591 (34%), Positives = 311/591 (52%), Gaps = 52/591 (8%) Frame = +2 Query: 683 LAADGKTLGKNKEKVIDLMSSPVSASR---NLRKKTSLQDDRSYVDEGLCSSSSMGVDRG 853 +A D K + E+ M P+++ + N++ K + ++ S+ D GL + G+++ Sbjct: 202 VAKDFKIENTSNEQSASYM--PIASKKLNVNIKGKEKVVEE-SFQDVGLSMINRDGIEKS 258 Query: 854 KGIYIPSDGKHNTEPVMSRSLHSPTLPRKSGQRRLVRNGCISPHNVA-KAKLIAESHVKS 1030 ++ +H + + R S PR +G +RLVRNGCISPHN+A +AK ++E KS Sbjct: 259 NN----TNNRHEKQGLGPRQFVSS--PRATGHKRLVRNGCISPHNIAIRAKSLSEQCEKS 312 Query: 1031 HEDGRQVDTGVVPG---------------ERDTDRVKGKGVMEESS------------AR 1129 + + + G +P + +++ KGKG+M + S + Sbjct: 313 SREVDKSNLGNMPSSSPSCPIDINDIVAEDNFSNKDKGKGIMRQPSLSHDKDDVRVIFSS 372 Query: 1130 KSSLNPTVEAN--RTE--GSFEAFDAMGGWXXXXXXXXXXXXFLSDSAGHLSRTISGVGH 1297 S V AN RT G+ E + +G W LS+ +G+ + I VG Sbjct: 373 SSDTGKDVGANPGRTSRLGTSEHCEKVGVWRRTHNHLKNGIV-LSNPSGNSFKKIDSVGR 431 Query: 1298 SHEINDAASFN---PGPTPFLAQA--------SSNIRSEMDRESGLRYGATKRVRRQRKR 1444 + P +A+A S ++D+ +G + +K ++Q+K Sbjct: 432 LSNGKTEIAMERQIPSRQELIAEADCGGSADTSQRASPKLDQTNGPIHAESKLNKKQKKH 491 Query: 1445 GSACSDGGESLIFDDLEILGPSEEPSNARSTRIRSNRHCGSLHPVIEIEDVSPV------ 1606 S I D+ LG S E SN+RSTR++S C +L+ VIE++++SP Sbjct: 492 ESTYQINSSRRI-PDVVCLGTSGESSNSRSTRLKSKIVCDNLNEVIEVDELSPEMRHPVS 550 Query: 1607 QSGGLRNMTQMDNDDSDVRARQVEADEMLARELQEEFYHELPGVGGGEIDASIAWGLQQE 1786 Q+GG +++D SDVRARQ+EADE+LARELQE+ Y E+P +GG EID +A LQQ Sbjct: 551 QTGG-----SLNDDTSDVRARQLEADEILARELQEQLYQEIP-IGGEEIDEHLAMALQQV 604 Query: 1787 EDVERAASLRRQRVHPRDASMSHLYRQQQTQALQNSSVRYTNRAIRARSRAPTSARGAQF 1966 E A S RR R + ++ 
R+ ++Q+LQN S R R+R SAR AQ Sbjct: 605 EHGLLAPS-RRSHNSQRGSLVAQANRRTRSQSLQNPSNR-------TRTRVTHSARMAQI 656 Query: 1967 RDRRIGSSPTISARGRHMHFPQNMDVEMRIHIXXXXXXXXXGISEDVTRASTLFHAQRDF 2146 R++ G S +S R R+++FP +MD++MR+ I G +DV + H QRDF Sbjct: 657 RNQFFGGSHRVSTRQRNLNFPMHMDLDMRLDILEALEAAV-GDMDDVRMNRDILHMQRDF 715 Query: 2147 NENDYEMLLALDENNHQHVGATSNQINGLPQSTVQTDNFAEACAICLETPT 2299 NENDYEMLL+LDENNH+H GA++N+IN LPQSTVQTD+ EACAICL+TPT Sbjct: 716 NENDYEMLLSLDENNHRHAGASTNRINSLPQSTVQTDSTQEACAICLDTPT 766 >ref|XP_002304703.1| predicted protein [Populus trichocarpa] gi|222842135|gb|EEE79682.1| predicted protein [Populus trichocarpa] Length = 740 Score = 263 bits (671), Expect = 2e-67 Identities = 201/536 (37%), Positives = 273/536 (50%), Gaps = 30/536 (5%) Frame = +2 Query: 782 SLQDDRSYVDEGLCSSSSMGVDRGKGIYIPSDGKHNTEPVMSRSLHSPTLPRKSGQRRLV 961 S + +D C+ S + K I S +H E + S T PR G++RLV Sbjct: 205 STSKGKEKIDVNTCNGSGSASNNVKEIDHASGHQHKIEKQLPACHLSVTSPRVGGKKRLV 264 Query: 962 RNGCISPHNVA-KAKLIAESHV-------KSHEDGRQVD-------TGVVPGERDTDRVK 1096 RNGCISPHN+A +A+ +AES ++H + D +V + D R K Sbjct: 265 RNGCISPHNIATRAQKLAESSQDGSPGDERNHARNKLSDGPPNIDLREIVAEDNDCYRAK 324 Query: 1097 GKGVMEESSARKSSLNPTVEANRT-EGSFEAFDAMGGWXXXXXXXXXXXXFLSD-SAGHL 1270 GK + SA K +AN T +G +A GGW LS G L Sbjct: 325 GKKAIVHPSASKEH-----DANMTRDGCRDAL--FGGWRSTHKRSKTQDQPLSYMEQGIL 377 Query: 1271 SRTISGVGHSHEINDAASFNPGPTPFLAQASSNIRSEMDRESGL--RYGATKRVRRQRKR 1444 R ++E +D L + S+ ++ L YG T R + + Sbjct: 378 GRDDHARCSTNEHDDR----------LVERDSSSGGKLHHVGNLVATYGLTSRNQGE--- 424 Query: 1445 GSACSDGGESLIFDDLEIL--GPSEEPSNARSTRIRSNRHCGSLHPVIEIEDV-SPVQSG 1615 CS +++ DD E+L G S E S++RS+R+ +++H G+L P+ EI+++ + V++ Sbjct: 425 ---CS----TIVPDDTEVLFLGSSRESSSSRSSRVHNHQHDGNLEPIYEIDELLTEVRNN 477 Query: 1616 GLRNMTQMDNDDSDVRARQVEADEMLARELQEEFYHELPGVGGGEIDASIAWGLQQEEDV 1795 + + N+DSDV ARQVEADEMLARELQE YHE P GGGEID +IAW LQQEED Sbjct: 478 DPQLIGFRSNEDSDVTARQVEADEMLARELQERLYHEEPTFGGGEIDENIAWVLQQEEDA 537 Query: 1796 
ERAASLRRQRV-HPRDASMSHLYRQQQTQALQNSSVR-------YTNRAIRARSRAPTSA 1951 A S V H R++ ++H RQ+ ++ N S R T RA RSR Sbjct: 538 LPATSGHNHPVPHLRNSLVAHSSRQRLPRSSHNPSNRRGNQVQVTTTRASGLRSRLSNRT 597 Query: 1952 RGAQFRDRRIGSSPTISARGRHMHFPQNMDVEMRIHIXXXXXXXXXGISEDVTRASTLFH 2131 R+R PT+ G + FP MD+EMR++I E A+ + H Sbjct: 598 PVRISRER--NPFPTVFPGGLNFQFPSGMDLEMRLNILENL--------EASMTATRMLH 647 Query: 2132 AQRDFNENDYEMLLALDENNHQHVGATSNQINGLPQSTVQTDNFAEACAICLETPT 2299 QRDFNENDYEMLLALDENN QH GA++NQIN LP+S VQTDNF E CA+CLE PT Sbjct: 648 VQRDFNENDYEMLLALDENNSQH-GASANQINCLPESVVQTDNFGETCAVCLEAPT 702 >ref|XP_002518133.1| hypothetical protein RCOM_1020610 [Ricinus communis] gi|223542729|gb|EEF44266.1| hypothetical protein RCOM_1020610 [Ricinus communis] Length = 791 Score = 256 bits (655), Expect = 1e-65 Identities = 196/559 (35%), Positives = 276/559 (49%), Gaps = 52/559 (9%) Frame = +2 Query: 779 TSLQDDRSYVDEGLCSSSSMGVDRGKGIYIPSDGKHNTEPVMSRSLHSPTLPRKSGQRRL 958 ++L + VD C+ S ++ GKGI + H E S S S T PR +G +RL Sbjct: 193 SNLFKGKEKVDVNACNGSDSALNHGKGIDLTGSSPHKIEKQASASHLSVTSPRVTGHKRL 252 Query: 959 VRNGCISPHNVA-KAKLIAESHVK------------------SHEDGRQVDTGVVPGERD 1081 VRNGCISPHN+A + + +AES S D R++ G E + Sbjct: 253 VRNGCISPHNIATRQQKLAESRQDCSIDVGTDDSKNIVSDGPSEVDIREIIVGEKNEENN 312 Query: 1082 TDRVKGKGVM---EESSARKSSLNPTVEANRTEG------SFEAFDA-MGGWXXXXXXXX 1231 R KGKG++ S+ + + ++R E S + DA +GGW Sbjct: 313 HYRAKGKGLVTYPSTSTENDAQIFHVSTSSRIENKAANVTSDTSRDASLGGWRSTRNHAK 372 Query: 1232 XXXX----FLSDSA--GHLSRTISGVGHSHEINDAASFNPGPTPFLAQASSNIRSEMDRE 1393 F +D ++R +G + ++++ AQ +S S +++ Sbjct: 373 KLYHADDEFSADEQHENRVARRNTGTANVKNVHESGD------RVQAQTASRHVSGLNQT 426 Query: 1394 SGLRYGATKRVRRQRKRGSACSDGGE--SLIFDDLEI--LGPSEEPSNARSTRIRSNRHC 1561 + + +RQ+K G + GE + + DD EI LG S+E S +RS+R + Sbjct: 427 NRPHHIGNIHTKRQKKYGLTSRNDGEYSTTVPDDSEIMLLGSSDESSRSRSSRTSYRQRR 486 Query: 1562 GSLHPVIEIEDVSPVQSGGLRNMTQMDND-DSDVRARQVEADEMLARELQEEFYHELPGV 1738 G LHP+ E+++ P + G +ND ++D RARQVEADEMLARELQE+ Y E P Sbjct: 487 
GILHPIYEVDESLPERRTGSSQGLSSENDIEADARARQVEADEMLARELQEQLYQETPAS 546 Query: 1739 GGGEIDASIAWGLQQEEDVERAASLRRQRV-HPRDASMSHLYRQQQTQALQNSSVRYTNR 1915 GG EID AW LQQ EDV AS + + R + H Q Q ++ QN S R Sbjct: 547 GGSEIDEDAAWLLQQVEDVFPTASSQSYPISRLRRPATMHSNTQPQPRSFQNPSNRR--- 603 Query: 1916 AIRARSRAPTSARGAQFRDRRIGSSP-----------TISARGRHMHFPQNMDVEMRIHI 2062 +SR P + R +Q R+R P T S+ R+ FP +MD+EMR+ I Sbjct: 604 --GTQSRLP-ATRTSQLRNRLFNRPPARLLRARNHSLTSSSTTRNFQFPLSMDLEMRLDI 660 Query: 2063 XXXXXXXXXGISEDVTRASTLFHAQRDFNENDYEMLLALDENNHQHVGATSNQINGLPQS 2242 + + S + QRDFNENDYEMLLALDENN QH GA++N+IN LP+S Sbjct: 661 -------LEALEDMSVTNSHILQVQRDFNENDYEMLLALDENNQQH-GASTNRINSLPES 712 Query: 2243 TVQTDNFAEACAICLETPT 2299 +QTDNF E CAICLETPT Sbjct: 713 VLQTDNFEETCAICLETPT 731 >ref|XP_003535221.1| PREDICTED: uncharacterized protein LOC100789823 [Glycine max] Length = 735 Score = 239 bits (609), Expect = 3e-60 Identities = 186/542 (34%), Positives = 253/542 (46%), Gaps = 54/542 (9%) Frame = +2 Query: 833 SMGVDRGKGIYIPSDGKHNTEPVMSRSLHSPTLPRKSGQRRLVRNGCISPHNVA------ 994 ++ VD GKGI + +D + E +S T PR G +RLVRNGCISPHN+A Sbjct: 173 NISVDHGKGISLSNDSQLQNEKQVSLPPRVSTSPRGRGHKRLVRNGCISPHNIATMEKQL 232 Query: 995 ------KAKLIAESHVKSHEDGRQVDTGV---VPGERDTDRVKGKGVMEESSARK----S 1135 K K + +SH S V V GER R KGK V+ S + + Sbjct: 233 AEQSNHKTKDVEQSHGHSVSSSTVSPVSVDDIVAGERGNGRGKGKEVLAYRSPHRLTFRT 292 Query: 1136 SLNPTVEANRTEGSFEA------FDAMGGWXXXXXXXXXXXXFLSDSAGHLSRTISGVG- 1294 + +P G A + G L D GH R + VG Sbjct: 293 ASSPVTNYEEINGPSNAIRNPLQYSGGQGGRRTTHNERNANWHLHDVNGHHLRINNDVGR 352 Query: 1295 --HSHEINDAASFNPG-----------PTPFLAQASSNIRSEMDRESGLRYGATKRVRRQ 1435 + H N G + AQ +S I ++D+ SG A +RQ Sbjct: 353 FINGHNTTGMDRRNTGNGQSSNHIHGSQSDHTAQPTSVIIPDVDQSSGTHRTADILTKRQ 412 Query: 1436 RKRGSAC-----------SDGGESLIFDD--LEILGPSEEPSNARSTRIRSNRHCGSLHP 1576 RKR S S + + D +E+L P S++ T + H Sbjct: 413 RKRESPSGFMFRGSTGDSSSSSRNPVSDPEVIELLSPPRGSSSSSRTSVLD-------HE 465 Query: 1577 VIEIEDVSPVQSGGLRNMTQMDNDDSDVRARQVEADEMLARELQEEFYHELPGVGGGEID 1756 V+++ + ++ DN+ 
S+ RARQVEADE LARELQE+ YH+ P G G ID Sbjct: 466 VVDLLSTPRYANRSSEDLDDNDNNSSEARARQVEADERLARELQEQLYHDDPFEGRG-ID 524 Query: 1757 ASIAWGLQQEEDVERAASLRRQRVHPRDASMSHLYRQQQTQALQNSSVRYTNRAIRARSR 1936 +AW LQ+ E + RA PR + RQ +T+ +N S R RA ++ Sbjct: 525 EDLAWDLQRAEALMRATIDSHSISQPRQ--LPRAIRQPRTRFPENPSRR------RAMAQ 576 Query: 1937 APTSARGAQFRDRRIGSS--PTISARGRHMHFPQNMDVEMRIHIXXXXXXXXXGISEDVT 2110 A S R +Q+R R + P+ S+RGR FP +MD++MR+ I S D+ Sbjct: 577 ASFSNRMSQWRSRATSRTRAPSTSSRGRGPRFPLDMDLDMRLDILEALEDSVGDFS-DMG 635 Query: 2111 RASTLFHAQRDFNENDYEMLLALDENNHQHVGATSNQINGLPQSTVQTDNFAEACAICLE 2290 +F+A+RDF + DYEMLLALDE NHQH GA+SN IN LPQST+QTDNF +ACAICLE Sbjct: 636 ITDGIFNARRDFTDADYEMLLALDEGNHQHTGASSNLINSLPQSTIQTDNFTDACAICLE 695 Query: 2291 TP 2296 TP Sbjct: 696 TP 697