BLASTX nr result
ID: Coptis24_contig00002431
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00002431 (2494 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI27324.3| unnamed protein product [Vitis vinifera] 261 6e-67 ref|XP_002514618.1| hypothetical protein RCOM_1467930 [Ricinus c... 250 1e-63 ref|XP_002314922.1| predicted protein [Populus trichocarpa] gi|2... 245 4e-62 ref|XP_002512034.1| conserved hypothetical protein [Ricinus comm... 239 3e-60 ref|XP_003520133.1| PREDICTED: uncharacterized protein LOC100778... 235 5e-59 >emb|CBI27324.3| unnamed protein product [Vitis vinifera] Length = 620 Score = 261 bits (667), Expect = 6e-67 Identities = 165/352 (46%), Positives = 211/352 (59%), Gaps = 15/352 (4%) Frame = -1 Query: 1480 EEVFLQGGERSNETHVSYDRATSPGLNEMIHREPCLSTIQSPKDKIYEGAQFNYEEKSFL 1301 EE S ET V+ S +E +H P S P + Y G F ++KS Sbjct: 284 EEASFCKSHSSVETQVNL--VASSEKDEGLHEVPNGSHF--PNESKYGGTIFKGKKKSVS 339 Query: 1300 GEPSLRSNQPCGETRVNEVDRSVWWAYFPPDDQIQ--------------RESEVINGNVS 1163 E S QP ET +N P D++Q + +V N S Sbjct: 340 REQSFWKPQPSDETWIN-------CGVAPFKDKVQVVPEKAFFCTSLLAEKLDVSNEKGS 392 Query: 1162 HQDNVKTAS-ENLSPDSPRIETQVNHQTGRFTDEKEQVLESVNNTEQAVERSPLLHSPDG 986 + K S E S +P ETQ+ KE + ES+ +EQ + + H+ +G Sbjct: 393 CLERKKPVSGEPSSCHAPPNETQIKKSKSCLM-RKELIAESMPVSEQDNKIDDITHAQNG 451 Query: 985 NGPLDYENNIIGEREKLKETDEFKRAAEEEWASRQRQLQIQAEEAQRLRKRKKAETQRLL 806 + +II EREKLKET+E+KRA EEEW SRQRQLQ+QAEE QRL+KR+KAE RLL Sbjct: 452 ------QRDIINEREKLKETEEYKRAIEEEWTSRQRQLQLQAEEVQRLKKRRKAENTRLL 505 Query: 805 DMERRQKERIEEMRETQKKNVETINLKDQLRAEVRKELHRMELVYTDMVSLLRALGIRVG 626 DMERRQK+R++EMRETQKK+ E +N+K+++R EVRKEL ++E+ +DM SLLR L I VG Sbjct: 506 DMERRQKQRVQEMRETQKKDEENMNMKEKIRLEVRKELDKLEMTCSDMASLLRGLEIHVG 565 Query: 625 TGFCPSSREVNAAYKQALLKFHPDRASRTDVRQQVEAEEKFKLVSRLKEKLL 470 GFCPSS EV+AAYK+ALLKFHPDRASRTD+ QVEAEEKFKL+SR+KEK L Sbjct: 566 GGFCPSSNEVHAAYKRALLKFHPDRASRTDIYHQVEAEEKFKLISRMKEKFL 617 Score = 139 bits (351), Expect = 3e-30 Identities = 104/322 (32%), Positives = 154/322 (47%), Gaps = 25/322 (7%) Frame = -3 Query: 2492 VIIIDAPESSEKKLLGPSVSQ--KKLPTGKVISIDDDEISASETENADGGGNHQDSGATS 2319 VIIID P+S ++ + V + K++P +ISIDDDE + EN DS A+S Sbjct: 16 VIIIDGPKSVQQNVQDSGVMRRDKRVPLESIISIDDDESTDIHPENGAESRGDLDSDASS 75 Query: 2318 SKASCPSV--SQSSEEANVDECQLNRKRKIPMKFSKSKRTYSEKTFSRNYFGWGPL---- 2157 SK SCP+ SQ+S +ECQ R+RK P+K SK KRTYS K SRN +G P+ Sbjct: 76 SKRSCPASNHSQNSVGLEAEECQFIRERKSPVKLSKCKRTYSGKAPSRNRYGLDPMPEST 135 Query: 2156 -CXXXXXXXXXXDCEIIEGSCGKISEQWDRAALKKRTFEGIPIGQPGLDDEASGSGSFAD 1980 DCE++EGS GK+ EQW++A LK++ + L D+ S SGS D Sbjct: 136 SPESTSSESGLSDCELMEGSRGKLHEQWEQAYLKRKDVP--QTAKSDLGDQPSASGSNTD 193 Query: 1979 VPKHVEVENSAEQHVRDPFXXXXXXXXXXXXXXXXSRGCDDNIRED---SFLSPEENVVG 1809 P ++EVEN EQH P S ++N ++ SF +P+ + +G Sbjct: 194 TPPNIEVENMTEQHQETP---------------VCSSSSNENFEKENLPSFFAPDGSNLG 238 Query: 1808 DSGKTVEPDSP------------SWCKTPVREEAHFFPKKVDIQETGKTFGGEPSFCDAE 1665 + E ++P S CK E+ F D+Q+ +F S + + Sbjct: 239 ATSPNPEVENPFAEFEFKFDEESSRCKIESMEKTQFSDVNNDVQDEEASFCKSHSSVETQ 298 Query: 1664 VQ-SDTDDSDDELYSLPKDREF 1602 V + + D+ L+ +P F Sbjct: 299 VNLVASSEKDEGLHEVPNGSHF 320 >ref|XP_002514618.1| hypothetical protein RCOM_1467930 [Ricinus communis] gi|223546222|gb|EEF47724.1| hypothetical protein RCOM_1467930 [Ricinus communis] Length = 451 Score = 250 bits (638), Expect = 1e-63 Identities = 140/264 (53%), Positives = 178/264 (67%), Gaps = 20/264 (7%) Frame = -1 Query: 1201 IQRESEVINGNVSHQDNVKTAS----ENLSPDSPRIETQVNHQTGRFTDEKEQVL----- 1049 I E+ N N +H ++ S EN PDS + + Q F + Q + Sbjct: 187 ITAEARTSNSNSNHDNDKFNQSACVGENARPDSSNLMSDGKSQHKDFEETVLQEMCTKSS 246 Query: 1048 ----ESVNNT-------EQAVERSPLLHSPDGNGPLDYENNIIGEREKLKETDEFKRAAE 902 +++N T ++ R LL G D +N IIG+RE LK+TD ++RA E Sbjct: 247 FCNTQAMNETTSIHYAHSESQVRLDLLPLVSNGGLSDVQNYIIGDREMLKKTDAYRRAQE 306 Query: 901 EEWASRQRQLQIQAEEAQRLRKRKKAETQRLLDMERRQKERIEEMRETQKKNVETINLKD 722 EEWASRQR+LQIQAEEA+RLRKR+KAETQRLL+ ERRQK+R+EE+RE QKK+ ET+NLK+ Sbjct: 307 EEWASRQRELQIQAEEARRLRKRRKAETQRLLETERRQKQRVEEVREAQKKDEETLNLKE 366 Query: 721 QLRAEVRKELHRMELVYTDMVSLLRALGIRVGTGFCPSSREVNAAYKQALLKFHPDRASR 542 QLR EVR+EL ++E TDM SLLR LGI +G GF PSS EV AYKQALL+FHPDRASR Sbjct: 367 QLRIEVRRELDKLEANCTDMASLLRGLGIDIGGGFYPSSTEVRTAYKQALLRFHPDRASR 426 Query: 541 TDVRQQVEAEEKFKLVSRLKEKLL 470 +D+RQQ+EAEEKFKL+SR KEK + Sbjct: 427 SDIRQQIEAEEKFKLISRAKEKFM 450 >ref|XP_002314922.1| predicted protein [Populus trichocarpa] gi|222863962|gb|EEF01093.1| predicted protein [Populus trichocarpa] Length = 724 Score = 245 bits (626), Expect = 4e-62 Identities = 136/253 (53%), Positives = 176/253 (69%), Gaps = 9/253 (3%) Frame = -1 Query: 1192 ESEVINGNVSHQDNVKTASENLSPD---SPRIETQVNHQT------GRFTDEKEQVLESV 1040 E +V+ V+ + S+ S +PR ++ H T + KE + Sbjct: 471 EDKVVENVVAPSWTTQEVSDEKSDHYERAPREKSSQCHDTLSKRGISNSAEGKEAFTDFA 530 Query: 1039 NNTEQAVERSPLLHSPDGNGPLDYENNIIGEREKLKETDEFKRAAEEEWASRQRQLQIQA 860 ++++ ER PL S G+ L E +II EREKLKETDE+K+A EEEWA+RQRQLQIQA Sbjct: 531 SSSQPCYERDPLCAS-HGDLLLSAERDIINEREKLKETDEYKQAIEEEWAARQRQLQIQA 589 Query: 859 EEAQRLRKRKKAETQRLLDMERRQKERIEEMRETQKKNVETINLKDQLRAEVRKELHRME 680 EE QRLRKR+KAET R+LDMERRQK+R+EE+RETQKK+ E +N+K++ R EVRKEL+R+E Sbjct: 590 EEVQRLRKRRKAETLRILDMERRQKQRLEEVRETQKKDEENLNMKERFRVEVRKELYRLE 649 Query: 679 LVYTDMVSLLRALGIRVGTGFCPSSREVNAAYKQALLKFHPDRASRTDVRQQVEAEEKFK 500 + +M SLLR LGI V G P +V+AAYK+ALLK HPDRAS+TD+RQQVEAEEKFK Sbjct: 650 VTCFNMASLLRGLGIHVEGGLKPLPNQVHAAYKRALLKLHPDRASKTDIRQQVEAEEKFK 709 Query: 499 LVSRLKEKLLPVS 461 L+SR+KEK L S Sbjct: 710 LISRMKEKFLSTS 722 Score = 91.7 bits (226), Expect = 9e-16 Identities = 62/214 (28%), Positives = 97/214 (45%), Gaps = 29/214 (13%) Frame = -3 Query: 2492 VIIIDAPESSEKKLLGPSVSQKKLPTGKVISIDDDEISASETE--------------NAD 2355 VII+D PES ++KL G S ++ + +IS+DDD+ + E N Sbjct: 47 VIIVDVPESLQQKLRGSSAVREGTRSPCIISVDDDDDDDDDDEEEEDECYTVDDHEINEQ 106 Query: 2354 GGGNHQDSGATSSKASCPSVSQSSEEANVDECQLNRKRKIPMKFSKSKRTYSEKTFSRNY 2175 GN G +S + + + D C++ + + K K RTY+EK SRN Sbjct: 107 VDGNLDSDGTSSPSSPASDHIEKPVHRDADGCRVAEENRPVFKLRKCNRTYAEKATSRNR 166 Query: 2174 FGWGPLCXXXXXXXXXXD---------------CEIIEGSCGKISEQWDRAALKKRTFEG 2040 +G CE++EGS G++ EQW++A+LK+++ Sbjct: 167 YGLDSDAEKATSRNRYGLDSDAESDSSEDNTSDCEVMEGSFGEVREQWEKASLKRKSMR- 225 Query: 2039 IPIGQPGLDDEASGSGSFADVPKHVEVENSAEQH 1938 GLDD+AS S +DV +VEVEN +Q+ Sbjct: 226 ----CKGLDDQASPCSSHSDVHPNVEVENKTKQN 255 >ref|XP_002512034.1| conserved hypothetical protein [Ricinus communis] gi|223549214|gb|EEF50703.1| conserved hypothetical protein [Ricinus communis] Length = 632 Score = 239 bits (609), Expect = 3e-60 Identities = 120/181 (66%), Positives = 150/181 (82%), Gaps = 4/181 (2%) Frame = -1 Query: 991 DGNGPLD----YENNIIGEREKLKETDEFKRAAEEEWASRQRQLQIQAEEAQRLRKRKKA 824 DG PLD + +II EREKLKETDEFKRA EEEWA+RQ++L QAEEA+RLRKRKKA Sbjct: 450 DGKDPLDALSPVQRDIINEREKLKETDEFKRAVEEEWAARQKELNFQAEEAKRLRKRKKA 509 Query: 823 ETQRLLDMERRQKERIEEMRETQKKNVETINLKDQLRAEVRKELHRMELVYTDMVSLLRA 644 E+ R+L +E+R K+R+EE+RETQ+K+ E +N+K+ LR EVRKEL+++E DM SLLR Sbjct: 510 ESLRILALEKRLKQRVEEVRETQRKDEENLNMKENLRTEVRKELYQLENACIDMASLLRG 569 Query: 643 LGIRVGTGFCPSSREVNAAYKQALLKFHPDRASRTDVRQQVEAEEKFKLVSRLKEKLLPV 464 LGI+VG GF P ++EV+AAYK+ALLKFHPDRASRTD+RQQVEAEEKFKL+SR+K+K L Sbjct: 570 LGIQVGGGFHPLTQEVHAAYKRALLKFHPDRASRTDIRQQVEAEEKFKLISRMKQKFLST 629 Query: 463 S 461 S Sbjct: 630 S 630 Score = 85.9 bits (211), Expect = 5e-14 Identities = 64/197 (32%), Positives = 97/197 (49%), Gaps = 11/197 (5%) Frame = -3 Query: 2492 VIIIDAPESSEKKLLGPSVSQ--KKLPTGKVISIDDDEI-------SASETENADGGGNH 2340 VI ID PE +K G SV + + P +IS+DDDEI + EN ++ Sbjct: 21 VITIDVPEPLRQKFCGSSVHKDGESFPIPCIISLDDDEICDYVDNHEVNVDENVGLDSDY 80 Query: 2339 QDSGATSSKASCPSVSQSSEEANVDECQLNRKRKIPMKFSKSKRTYSEKTFSRNYFG--W 2166 D +S AS + SE+A+VD+CQ+ +++ K SK +TY++ + SRN +G + Sbjct: 81 MDFSNKTSPAS--DFMRKSEDADVDDCQVVFEKRPAFKLSKCNKTYNKDSSSRNRYGLNF 138 Query: 2165 GPLCXXXXXXXXXXDCEIIEGSCGKISEQWDRAALKKRTFEGIPIGQPGLDDEASGSGSF 1986 DCE++E SC ++ EQW + A +KR G D+AS S Sbjct: 139 ESETESGSSKSDSSDCEVVEISCRELHEQWVK-AFQKRKLNTDYKGPLSPQDKASPCSSH 197 Query: 1985 ADVPKHVEVENSAEQHV 1935 +D +V VENS Q + Sbjct: 198 SDAHPNVRVENSTRQKI 214 >ref|XP_003520133.1| PREDICTED: uncharacterized protein LOC100778452 [Glycine max] Length = 555 Score = 235 bits (599), Expect = 5e-59 Identities = 122/164 (74%), Positives = 143/164 (87%) Frame = -1 Query: 967 ENNIIGEREKLKETDEFKRAAEEEWASRQRQLQIQAEEAQRLRKRKKAETQRLLDMERRQ 788 E +II EREKLKETDE+K+A EEEWASRQRQLQIQAEE QRLRKR+KAE +RLLDM+RRQ Sbjct: 386 EKDIINEREKLKETDEYKQAMEEEWASRQRQLQIQAEEVQRLRKRRKAE-KRLLDMQRRQ 444 Query: 787 KERIEEMRETQKKNVETINLKDQLRAEVRKELHRMELVYTDMVSLLRALGIRVGTGFCPS 608 KERIEE+RETQKK+ E +NLK+QLR E++K L+++E+ DM SLLR LGI+VG F P Sbjct: 445 KERIEEVRETQKKDEEVMNLKEQLRVEIQKGLNQLEMRCHDMPSLLRGLGIQVGGSFIPL 504 Query: 607 SREVNAAYKQALLKFHPDRASRTDVRQQVEAEEKFKLVSRLKEK 476 EV+AAYK+ALLKFHPDRAS+TDVR QVEAEEKFKL+SRLKEK Sbjct: 505 PNEVHAAYKRALLKFHPDRASKTDVRAQVEAEEKFKLISRLKEK 548