BLASTX nr result
ID: Coptis21_contig00025804
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00025804
         (665 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containi...   337   1e-90
ref|XP_002532248.1| pentatricopeptide repeat-containing protein,...   336   3e-90
ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containi...   319   3e-85
ref|XP_002301082.1| predicted protein [Populus trichocarpa] gi|2...   315   5e-84
ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containi...   305   5e-81

>ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Vitis vinifera]
          Length = 513

 Score = 337 bits (864), Expect = 1e-90
 Identities = 161/220 (73%), Positives = 186/220 (84%)
 Frame = -1

Query: 665 ETCKFHEGLFLNLMIHFSKSCLHERIVEMLFAIQPIVRERPSLKAISTCLNLLIEAREID 486
           ETCKFHEG+FLNLM HFSK LHER+VEM AI+PIVRE+PSLKAISTCLNLL+E+ ++D
Sbjct: 122 ETCKFHEGIFLNLMKHFSKLSLHERVVEMFDAIRPIVREKPSLKAISTCLNLLVESNQVD 181

Query: 485 LARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYST 306
           L R LL K+ + +PNTC+FNILVK+HCK GD+DSAFEVV EMKK SYPNLITYST
Sbjct: 182 LTRKFLLNSKKSLNLEPNTCIFNILVKHHCKNGDIDSAFEVVEEMKKSHVSYPNLITYST 241

Query: 305 LMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKN 126
           L+ GLC +GRL+EAI +FEEMVSKDQILPD LTYN LINGFC G KVDRA KIM+FM+KN
Sbjct: 242 LINGLCGSGRLKEAIELFEEMVSKDQILPDALTYNALINGFCHGEKVDRALKIMEFMKKN 301

Query: 125 ECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEMRRSRLEID 6
           C PN+FNYS+LMNG+CKEGR EEAKEVFDEM+ L+ D
Sbjct: 302 GCNPNVFNYSALMNGFCKEGRLEEAKEVFDEMKSLGLKPD 341

 Score = 84.3 bits (207), Expect = 2e-14
 Identities = 47/169 (27%), Positives = 93/169 (55%), Gaps = 1/169 (0%)
 Frame = -1

Query: 545 PSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFE 366
           P + +N ++D A ++ E +++ PN ++ L+ CK+G L+ A E
Sbjct: 270 PDALTYNALINGFCHGEKVDRALKIM-EFMKKNGCNPNVFNYSALMNGFCKEGRLEEAKE 328

Query: 365 VVNEMKK-GEKSYPNLITYSTLMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILIN 189
           V +EMK G K P+ + Y+TL+ C GR+ EA+ + ++M +++ DT+T+N+++
Sbjct: 329 VFDEMKSLGLK--PDTVGYTTLINFFCRAGRVDEAMELLKDM-RENKCRADTVTFNVILG 385

Query: 188 GFCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNGYCKEGRYEEAKEV 42
           G CR G+ + AR +++ + N +Y ++N C+EG ++A ++
Sbjct: 386 GLCREGRFEEARGMLERLPYEGVYLNKASYRIVLNSLCREGELQKATQL 434

 Score = 61.2 bits (147), Expect = 2e-07
 Identities = 48/206 (23%), Positives = 91/206 (44%), Gaps = 34/206 (16%)
 Frame = -1

Query: 545 PSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFE 366
           P++ S +N + ++ A+ V E+K KP+T + L+ + C+ G +D A E
Sbjct: 305 PNVFNYSALMNGFCKEGRLEEAKEVFDEMKSLG-LKPDTVGYTTLINFFCRAGRVDEAME 363

Query: 365 VVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAIGVFE--------------------- 249
           ++ +M++ K + +T++ ++GGLC GR +EA G+ E
Sbjct: 364 LLKDMREN-KCRADTVTFNVILGGLCREGRFEEARGMLERLPYEGVYLNKASYRIVLNSL 422

Query: 248 ----EMVSKDQ---------ILPDTLTYNILINGFCRGGKVDRARKIMDFMRKNECEPNI 108
           E+ Q +LP T N L+ C GKV A + + + +P
Sbjct: 423 CREGELQKATQLVGLMLGRGVLPHFATSNELLVHLCEAGKVGDAVMALLGLLELGFKPEP 482

Query: 107 FNYSSLMNGYCKEGRYEEAKEVFDEM 30
           +++ L+ C+E + A E+ D++
Sbjct: 483 NSWALLVELICRERKLLPAFELLDDL 508

>ref|XP_002532248.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223528066|gb|EEF30142.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 521

 Score = 336 bits (861), Expect = 3e-90
 Identities = 157/220 (71%), Positives = 188/220 (85%)
 Frame = -1

Query: 665 ETCKFHEGLFLNLMIHFSKSCLHERIVEMLFAIQPIVRERPSLKAISTCLNLLIEAREID 486
           ETCKFHE +FLNLM HF KS LHER++EM +AIQPIVRE+PSLKAISTCLN+L+E+++ID
Sbjct: 121 ETCKFHENIFLNLMKHFYKSSLHERVLEMFYAIQPIVREKPSLKAISTCLNILVESKQID 180

Query: 485 LARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYST 306
           LA+ LL + E +PNTC+FNILVK+HCK GDL+SA EV++EMKK +SYPN+ITYST
Sbjct: 181 LAQKCLLYVNEHLKVRPNTCIFNILVKHHCKSGDLESALEVMHEMKKSRRSYPNVITYST 240

Query: 305 LMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKN 126
           L+ GLC GRL+EAI +FEEMVSKDQILPD LTY++LI GFC GGK DRARKIM+FMR N
Sbjct: 241 LIDGLCGNGRLKEAIELFEEMVSKDQILPDALTYSVLIKGFCHGGKADRARKIMEFMRSN 300

Query: 125 ECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEMRRSRLEID 6
           C+PN+FNYS LMNG+CKEGR EEAKEVFDEM+ S L+ D
Sbjct: 301 GCDPNVFNYSVLMNGFCKEGRLEEAKEVFDEMKSSGLKPD 340

 Score = 80.5 bits (197), Expect = 3e-13
 Identities = 50/188 (26%), Positives = 94/188 (50%), Gaps = 3/188 (1%)
 Frame = -1

Query: 605 CLHERIVEMLFAIQPIVRER---PSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKP 435
           C + R+ E + + +V + P S + + D AR ++ E + P
Sbjct: 246 CGNGRLKEAIELFEEMVSKDQILPDALTYSVLIKGFCHGGKADRARKIM-EFMRSNGCDP 304

Query: 434 NTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAIGV 255
           N +++L+ CK+G L+ A EV +EMK P+ + Y+TL+ C GR+ EA+ +
Sbjct: 305 NVFNYSVLMNGFCKEGRLEEAKEVFDEMKSSGLK-PDTVGYTTLINCFCGVGRIDEAMEL 363

Query: 254 FEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNGYC 75
           +EM + D +T+N+L+ G CR G+ D A ++++ + N +Y ++N C
Sbjct: 364 LKEMTEM-KCKADAVTFNVLLKGLCREGRFDEALRMLENLAYEGVYLNKGSYRIVLNFLC 422

Query: 74  KEGRYEEA 51
           ++G E++
Sbjct: 423 QKGELEKS 430

 Score = 59.7 bits (143), Expect = 5e-07
 Identities = 44/173 (25%), Positives = 79/173 (45%)
 Frame = -1

Query: 548 RPSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAF 369
           +P +T +N ID A +L E+ E K + FN+L+K C++G D A
Sbjct: 338 KPDTVGYTTLINCFCGVGRIDEAMELLKEMTEMK-CKADAVTFNVLLKGLCREGRFDEAL 396

Query: 368 EVVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILIN 189
           ++ + E Y N +Y ++ LC G L+++ + M+S+ +P T N L+
Sbjct: 397 RMLENLAY-EGVYLNKGSYRIVLNFLCQKGELEKSCALLGLMLSRG-FVPHYATSNELLV 454

Query: 188 GFCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEM 30
           C G VD A + + + P +++ L+ C+E + E+ DE+
Sbjct: 455 CLCEAGMVDNAVTALFGLTQMGFTPEPKSWAHLIEYICRERKLLFVFELVDEL 507

>ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Cucumis sativus]
 gi|449497032|ref|XP_004160294.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Cucumis sativus]
          Length = 504

 Score = 319 bits (818), Expect = 3e-85
 Identities = 146/220 (66%), Positives = 184/220 (83%)
 Frame = -1

Query: 665 ETCKFHEGLFLNLMIHFSKSCLHERIVEMLFAIQPIVRERPSLKAISTCLNLLIEAREID 486
           +TCK HEG+FLNLM HFSKS +HER+++M +AI+ IVRE+PSLKAISTCLNLL+E+ +D
Sbjct: 112 DTCKVHEGIFLNLMKHFSKSSMHERVLDMFYAIKSIVREKPSLKAISTCLNLLVESDRVD 171

Query: 485 LARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYST 306
           LAR +L+ + + + +PNTC+FNILVK+HC+ GDL +AFEVV EMK SYPNL+TYST
Sbjct: 172 LARKLLVNARSKLNLRPNTCIFNILVKHHCRNGDLQAAFEVVKEMKSARVSYPNLVTYST 231

Query: 305 LMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKN 126
           L+GGLC+ G+L+EAI FEEMVSKD ILPD LTYNILINGFC+ GKVDRAR I++FM+ N
Sbjct: 232 LIGGLCENGKLKEAIEFFEEMVSKDNILPDALTYNILINGFCQRGKVDRARTILEFMKSN 291

Query: 125 ECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEMRRSRLEID 6
           C PN+FNYS LMNGYCKEGR +EAKEVF+E++ ++ D
Sbjct: 292 GCSPNVFNYSVLMNGYCKEGRLQEAKEVFNEIKSLGMKPD 331

 Score = 112 bits (280), Expect = 6e-23
 Identities = 59/172 (34%), Positives = 97/172 (56%)
 Frame = -1

Query: 545 PSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFE 366
           P+L ST + L E ++ A E+ + + P+ +NIL+ C++G +D A
Sbjct: 224 PNLVTYSTLIGGLCENGKLKEAIEFFEEMVSKDNILPDALTYNILINGFCQRGKVDRART 283

Query: 365 VVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILING 186
           ++ MK S PN+ YS LM G C GRLQEA VF E+ S + PDT++Y LIN
Sbjct: 284 ILEFMKSNGCS-PNVFNYSVLMNGYCKEGRLQEAKEVFNEIKSLG-MKPDTISYTTLINC 341

Query: 185 FCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEM 30
           CR G+VD A +++ M+ +C + ++ ++ G C+EGR++EA ++ ++
Sbjct: 342 LCRTGRVDEATELLQQMKDKDCRADTVTFNVMLGGLCREGRFDEALDMVQKL 393

>ref|XP_002301082.1| predicted protein [Populus trichocarpa]
 gi|222842808|gb|EEE80355.1| predicted protein [Populus trichocarpa]
          Length = 509

 Score = 315 bits (807), Expect = 5e-84
 Identities = 151/220 (68%), Positives = 181/220 (82%)
 Frame = -1

Query: 665 ETCKFHEGLFLNLMIHFSKSCLHERIVEMLFAIQPIVRERPSLKAISTCLNLLIEAREID 486
           ETCKFHE LFLNLM +F+KS ER+VEM IQPIVRE+PSLKAISTCLNLL+E++++D
Sbjct: 112 ETCKFHESLFLNLMKYFAKSSEFERVVEMFNKIQPIVREKPSLKAISTCLNLLVESKQVD 171

Query: 485 LARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYST 306
           L R LL+L + KPNTC+FNI +KYHCK GDL+SAF VV EMKK SYPNLITYST
Sbjct: 172 LLRGFLLDLNKDHMLKPNTCIFNIFIKYHCKSGDLESAFAVVKEMKKSSISYPNLITYST 231

Query: 305 LMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKN 126
           LM GLC++GRL+EAI +FEEMVSKDQILPD LTYN+LINGF GKVDRA+KIM+FM+ N
Sbjct: 232 LMDGLCESGRLKEAIELFEEMVSKDQILPDALTYNVLINGFSCWGKVDRAKKIMEFMKSN 291

Query: 125 ECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEMRRSRLEID 6
           C PN+FNYS+LM+G+CKEGR EEA + F+EM+ L+ D
Sbjct: 292 GCSPNVFNYSALMSGFCKEGRLEEAMDAFEEMKIFGLKQD 331

 Score = 73.9 bits (180), Expect = 3e-11
 Identities = 38/130 (29%), Positives = 71/130 (54%)
 Frame = -1

Query: 440 KPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAI 261
           K +T + IL+ Y C+ G +D A ++ EMK+ K +++T + L+ G C GR +EA+
Sbjct: 329 KQDTVGYTILINYFCRFGRIDEAMALLEEMKE-TKCKADIVTVNVLLRGFCGEGRTEEAL 387

Query: 260 GVFEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNG 81
           G+ + S + I + +Y I++N C+ G +D+A +++ P+ + L+ G
Sbjct: 388 GMLNRL-SSEGIYLNKASYRIVLNSLCQKGDLDKALELLGLTLSRGFVPHHATSNELLVG 446

Query: 80  YCKEGRYEEA 51
           CK G ++A
Sbjct: 447 LCKAGMADDA 456

 Score = 58.2 bits (139), Expect = 1e-06
 Identities = 43/169 (25%), Positives = 80/169 (47%)
 Frame = -1

Query: 518 LNLLIEAREIDLARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGE 339
           +N ID A +L E+KE + K + N+L++ C +G + A ++N + E
Sbjct: 339 INYFCRFGRIDEAMALLEEMKE-TKCKADIVTVNVLLRGFCGEGRTEEALGMLNRLSS-E 396

Query: 338 KSYPNLITYSTLMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILINGFCRGGKVDR 159
           Y N +Y ++ LC G L +A+ + +S+ +P T N L+ G C+ G D
Sbjct: 397 GIYLNKASYRIVLNSLCQKGDLDKALELLGLTLSRG-FVPHHATSNELLVGLCKAGMADD 455

Query: 158 ARKIMDFMRKNECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEMRRSRLE 12
           A + + + +P +++ L+ C+E + A E+ DE+ + E
Sbjct: 456 AVVALYGLAEMGFKPEQDSWALLVEFVCRERKLLLAFELLDELTANECE 504

>ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like [Glycine max]
          Length = 546

 Score = 305 bits (781), Expect = 5e-81
 Identities = 142/221 (64%), Positives = 177/221 (80%)
 Frame = -1

Query: 665 ETCKFHEGLFLNLMIHFSKSCLHERIVEMLFAIQPIVRERPSLKAISTCLNLLIEAREID 486
           ETCKFHEG+F+NLM HFSKS LHE+++ F+IQPIVRE+PS KA+STCLNLL+++ +D
Sbjct: 155 ETCKFHEGIFVNLMKHFSKSSLHEKLLHAYFSIQPIVREKPSPKALSTCLNLLLDSNRVD 214

Query: 485 LARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYST 306
           LAR +LL K KPN CVFNILVKYHCK GDLDSAFE+V EM+ E SYPNL+TYST
Sbjct: 215 LARKLLLHAKRDLTRKPNVCVFNILVKYHCKNGDLDSAFEIVEEMRNSEFSYPNLVTYST 274

Query: 305 LMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKN 126
           LM GLC GR++EA +FEEMVS+D I+PD LTYN+LINGFCRGGK DRAR ++ FM+ N
Sbjct: 275 LMDGLCRNGRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGFCRGGKPDRARNVIQFMKSN 334

Query: 125 ECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEMRRSRLEIDA 3
           C PN++NYS+L++G CK G+ E+AK V E++ S L+ DA
Sbjct: 335 GCYPNVYNYSALVDGLCKVGKLEDAKGVLAEIKGSGLKPDA 375

 Score = 111 bits (277), Expect = 1e-22
 Identities = 57/172 (33%), Positives = 97/172 (56%)
 Frame = -1

Query: 545 PSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKPNTCVFNILVKYHCKKGDLDSAFE 366
           P+L ST ++ L + A ++ E+ R P+ +N+L+ C+ G D A
Sbjct: 267 PNLVTYSTLMDGLCRNGRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGFCRGGKPDRARN 326

Query: 365 VVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAIGVFEEMVSKDQILPDTLTYNILING 186
           V+ MK YPN+ YS L+ GLC G+L++A GV E + + PD +TY LIN
Sbjct: 327 VIQFMKSNG-CYPNVYNYSALVDGLCKVGKLEDAKGVLAE-IKGSGLKPDAVTYTSLINF 384

Query: 185 FCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNGYCKEGRYEEAKEVFDEM 30
           CR GK D A ++++ M++N C+ + ++ L+ G C+EG++EEA ++ +++
Sbjct: 385 LCRNGKSDEAIELLEEMKENGCQADSVTFNVLLGGLCREGKFEEALDMVEKL 436

 Score = 78.6 bits (192), Expect = 1e-12
 Identities = 56/197 (28%), Positives = 98/197 (49%), Gaps = 3/197 (1%)
 Frame = -1

Query: 605 CLHERIVEMLFAIQPIVRER---PSLKAISTCLNLLIEAREIDLARNVLLELKERSDFKP 435
           C + R+ E + +V P + +N + D ARNV+ +K + P
Sbjct: 280 CRNGRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGFCRGGKPDRARNVIQFMKSNGCY-P 338

Query: 434 NTCVFNILVKYHCKKGDLDSAFEVVNEMKKGEKSYPNLITYSTLMGGLCDTGRLQEAIGV 255
           N ++ LV CK G L+ A V+ E+K G P+ +TY++L+ LC G+ EAI +
Sbjct: 339 NVYNYSALVDGLCKVGKLEDAKGVLAEIK-GSGLKPDAVTYTSLINFLCRNGKSDEAIEL 397

Query: 254 FEEMVSKDQILPDTLTYNILINGFCRGGKVDRARKIMDFMRKNECEPNIFNYSSLMNGYC 75
           EEM ++ D++T+N+L+ G CR GK + A +++ + + N +Y ++N
Sbjct: 398 LEEM-KENGCQADSVTFNVLLGGLCREGKFEEALDMVEKLPQQGVYLNKGSYRIVLNSLT 456

Query: 74  KEGRYEEAKEVFDEMRR 24
           ++ + AKE+ M R
Sbjct: 457 QKCELKRAKELLGLMLR 473