BLASTX nr result
ID: Coptis24_contig00027287
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00027287 (670 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002306741.1| predicted protein [Populus trichocarpa] gi|2... 306 2e-81 ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containi... 304 9e-81 ref|XP_003617444.1| Pentatricopeptide repeat-containing protein ... 286 3e-75 ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containi... 282 5e-74 ref|NP_181820.1| pentatricopeptide repeat-containing protein [Ar... 281 1e-73 >ref|XP_002306741.1| predicted protein [Populus trichocarpa] gi|222856190|gb|EEE93737.1| predicted protein [Populus trichocarpa] Length = 509 Score = 306 bits (784), Expect = 2e-81 Identities = 147/223 (65%), Positives = 185/223 (82%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMI 491 GAL QG+WIH YI KN LN+IV+TAIIDMY KCG I++A +VFK++ K GLSCWNS+I Sbjct: 243 GALRQGEWIHDYIVKNNFALNSIVITAIIDMYSKCGSIDKALQVFKSAPKKGLSCWNSLI 302 Query: 490 IGLAVNGHGEEAIQLFSDLESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVI 311 +GLA++G G EA++LFS LESS L+PD VSFIGVLTACNH+GMV+ AK Y LM+ETY I Sbjct: 303 LGLAMSGRGNEAVRLFSKLESSNLKPDHVSFIGVLTACNHAGMVDRAKDYFLLMSETYKI 362 Query: 310 EPTIKHYGCMVDVLGRAGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAAK 131 EP+IKHY CMVDVLGRAGLLEEAE LI +M + DAIIWGSLLS+CR++GN+EM ++AAK Sbjct: 363 EPSIKHYSCMVDVLGRAGLLEEAEELIKSMPVNPDAIIWGSLLSSCREYGNIEMAKQAAK 422 Query: 130 RLVELDPSDSGSYILLSNVYASSGRYEEAMDARRSMQERGTKK 2 R+ ELDP++S S+ILLSNVYA+ +EEA++ R S++E+ K Sbjct: 423 RVNELDPNESSSFILLSNVYAAHNHFEEAIEQRLSLKEKQMDK 465 Score = 59.3 bits (142), Expect = 7e-07 Identities = 54/254 (21%), Positives = 105/254 (41%), Gaps = 36/254 (14%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSK---------- 521 G +G +H + K +E + + I++MY CG + EA ++F ++ Sbjct: 111 GLAHEGAQLHGRVIKLGLENDQFIQNTILNMYVNCGFLGEAQRIFDGATGFDVVTWNTMI 170 Query: 520 IGLS---------------------CWNSMIIGLAVNGHGEEAIQLFSDLESSGLRPDDV 404 IGL+ WNSMI G G EA++LFS ++ G++P + Sbjct: 171 IGLAKCGEIDKSRRLFDKMLLRNTVSWNSMISGYVRKGRFFEAMELFSRMQEEGIKPSEF 230 Query: 403 SFIGVLTACNHSGMVNEAKY-YLSLMTETYVIEPTIKHYGCMVDVLGRAGLLEEAEALIT 227 + + +L AC G + + ++ + ++ + + + ++D+ + G +++A + Sbjct: 231 TMVSLLNACACLGALRQGEWIHDYIVKNNFALNSIV--ITAIIDMYSKCGSIDKA-LQVF 287 Query: 226 NMSIKADAIIWGSLLSACRDHGNVEMGERAAKRLVELDPS----DSGSYILLSNVYASSG 59 + K W SL+ G G A + +L+ S D S+I + +G Sbjct: 288 KSAPKKGLSCWNSLILGLAMSGR---GNEAVRLFSKLESSNLKPDHVSFIGVLTACNHAG 344 Query: 58 RYEEAMDARRSMQE 17 + A D M E Sbjct: 345 MVDRAKDYFLLMSE 358 >ref|XP_002279693.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic [Vitis vinifera] gi|302143555|emb|CBI22116.3| unnamed protein product [Vitis vinifera] Length = 533 Score = 304 bits (779), Expect = 9e-81 Identities = 148/223 (66%), Positives = 181/223 (81%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMI 491 GAL+QG+WIH YI KN ELN IV +IIDMYCKCG I EAF+VF+ + GLS WN+MI Sbjct: 271 GALKQGEWIHDYIRKNNFELNVIVTASIIDMYCKCGSIGEAFQVFEMAPLKGLSSWNTMI 330 Query: 490 IGLAVNGHGEEAIQLFSDLESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVI 311 +GLA+NG EAIQLFS LE S LRPDDV+F+GVLTACN+SG+V++AK Y SLM++TY I Sbjct: 331 LGLAMNGCENEAIQLFSRLECSNLRPDDVTFVGVLTACNYSGLVDKAKEYFSLMSKTYKI 390 Query: 310 EPTIKHYGCMVDVLGRAGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAAK 131 EP+IKHY CMVD LGRAGLLEEAE LI NM + DAIIW SLLSACR HGNVE+ +RAAK Sbjct: 391 EPSIKHYSCMVDTLGRAGLLEEAEELIRNMPVNPDAIIWSSLLSACRKHGNVELAKRAAK 450 Query: 130 RLVELDPSDSGSYILLSNVYASSGRYEEAMDARRSMQERGTKK 2 +V+LD +DS Y+LLSN+YA+S ++EEAM+ R SM+E+ +K Sbjct: 451 HIVDLDGNDSCGYVLLSNIYAASDQFEEAMEQRLSMKEKQIEK 493 Score = 56.6 bits (135), Expect = 4e-06 Identities = 43/166 (25%), Positives = 79/166 (47%), Gaps = 3/166 (1%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMI 491 G G +H + K ++ + + II MY CG + E +K F + WNSMI Sbjct: 139 GLAHYGAQLHGRVIKLGLQFDPFIRNTIIYMYANCGFLSEMWKAFYERMDFDIVAWNSMI 198 Query: 490 IGLAVNGHGEEAIQLFSDLESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVI 311 +GLA G +E+ +LF ++ LR + VS+ +++ +G + EA M E I Sbjct: 199 MGLAKCGEVDESRKLFDEMP---LR-NTVSWNSMISGYVRNGRLREALDLFGQMQEER-I 253 Query: 310 EPTIKHYGCMVDVLGRAGLLEEAEAL---ITNMSIKADAIIWGSLL 182 +P+ +++ R G L++ E + I + + + I+ S++ Sbjct: 254 KPSEFTMVSLLNASARLGALKQGEWIHDYIRKNNFELNVIVTASII 299 >ref|XP_003617444.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355518779|gb|AET00403.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 542 Score = 286 bits (732), Expect = 3e-75 Identities = 138/224 (61%), Positives = 174/224 (77%), Gaps = 1/224 (0%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMI 491 GAL+ GKW+H YI +N ELN IV+TAIIDMYCKCG +E A +VF+ + GLSCWNS+I Sbjct: 277 GALQHGKWVHDYIKRNHFELNVIVVTAIIDMYCKCGSVENAVEVFETCPRRGLSCWNSII 336 Query: 490 IGLAVNGHGEEAIQLFSDLESSGL-RPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYV 314 IGLA+NGH EA + FS LESS L +PD VSFIGVLTAC H G +N+A+ Y LM Y Sbjct: 337 IGLAMNGHEREAFEFFSKLESSKLLKPDSVSFIGVLTACKHLGAINKARDYFELMMNKYE 396 Query: 313 IEPTIKHYGCMVDVLGRAGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAA 134 IEP+IKHY C+VDVLG+AGLLEEAE LI M +K DAIIWGSLLS+CR H NV++ RAA Sbjct: 397 IEPSIKHYTCIVDVLGQAGLLEEAEELIKGMPLKPDAIIWGSLLSSCRKHRNVQIARRAA 456 Query: 133 KRLVELDPSDSGSYILLSNVYASSGRYEEAMDARRSMQERGTKK 2 +R+ EL+PSD+ Y+L+SNV+A+S ++EEA++ R M+E T+K Sbjct: 457 QRVYELNPSDASGYVLMSNVHAASNKFEEAIEQRLLMKENLTEK 500 Score = 60.5 bits (145), Expect = 3e-07 Identities = 52/241 (21%), Positives = 100/241 (41%), Gaps = 41/241 (17%) Frame = -1 Query: 622 KIEL---NTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMIIGLAVNGHGEEAI 452 K+EL + + + ++I Y KCG I+E+ +F WNSMI G NG EA+ Sbjct: 189 KLELYDHDVVAINSMIMGYAKCGEIDESRNLFDDMITRTSVSWNSMISGYVRNGKLMEAL 248 Query: 451 QLFSDLESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVIEPTIKHYGCMVDV 272 +LF+ ++ G + + + +L AC H G + K+ + + E + ++D+ Sbjct: 249 ELFNKMQVEGFEVSEFTMVSLLNACAHLGALQHGKWVHDYIKRNH-FELNVIVVTAIIDM 307 Query: 271 LGRAGLLEEA-----------------------------------EALITNMSIKADAII 197 + G +E A L ++ +K D++ Sbjct: 308 YCKCGSVENAVEVFETCPRRGLSCWNSIIIGLAMNGHEREAFEFFSKLESSKLLKPDSVS 367 Query: 196 WGSLLSACRDHGNVEMGERAAKRLV---ELDPSDSGSYILLSNVYASSGRYEEAMDARRS 26 + +L+AC+ G + + ++ E++PS Y + +V +G EEA + + Sbjct: 368 FIGVLTACKHLGAINKARDYFELMMNKYEIEPSIK-HYTCIVDVLGQAGLLEEAEELIKG 426 Query: 25 M 23 M Sbjct: 427 M 427 >ref|XP_003545143.1| PREDICTED: pentatricopeptide repeat-containing protein At2g42920, chloroplastic-like [Glycine max] Length = 534 Score = 282 bits (721), Expect = 5e-74 Identities = 138/223 (61%), Positives = 175/223 (78%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMI 491 GAL+ G+W+H Y+ + ELN IVLTAIIDMYCKCG I +A +VF+AS GLSCWNS+I Sbjct: 270 GALKHGEWVHDYVKRGHFELNVIVLTAIIDMYCKCGVIVKAIEVFEASPTRGLSCWNSII 329 Query: 490 IGLAVNGHGEEAIQLFSDLESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVI 311 IGLA+NG+ +AI+ FS LE+S L+PD VSFIGVLTAC + G V +A+ Y SLM Y I Sbjct: 330 IGLALNGYERKAIEYFSKLEASDLKPDHVSFIGVLTACKYIGAVGKARDYFSLMMNKYEI 389 Query: 310 EPTIKHYGCMVDVLGRAGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAAK 131 EP+IKHY CMV+VLG+A LLEEAE LI M +KAD IIWGSLLS+CR HGNVE+ +RAA+ Sbjct: 390 EPSIKHYTCMVEVLGQAALLEEAEQLIKGMPLKADFIIWGSLLSSCRKHGNVEIAKRAAQ 449 Query: 130 RLVELDPSDSGSYILLSNVYASSGRYEEAMDARRSMQERGTKK 2 R+ EL+PSD+ Y+L+SNV A+S ++EEAM+ R M+ER +K Sbjct: 450 RVCELNPSDASGYLLMSNVQAASNQFEEAMEQRILMRERLAEK 492 >ref|NP_181820.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206274|sp|Q9SJG6.1|PP200_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g42920, chloroplastic; Flags: Precursor gi|4512663|gb|AAD21717.1| hypothetical protein [Arabidopsis thaliana] gi|20197867|gb|AAM15291.1| hypothetical protein [Arabidopsis thaliana] gi|110738441|dbj|BAF01146.1| hypothetical protein [Arabidopsis thaliana] gi|330255093|gb|AEC10187.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 559 Score = 281 bits (718), Expect = 1e-73 Identities = 141/223 (63%), Positives = 170/223 (76%) Frame = -1 Query: 670 GALEQGKWIHAYINKNKIELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMI 491 GA EQG+WIH YI +N+ ELN+IV+TA+IDMYCKCG IEE VF+ + K LSCWNSMI Sbjct: 272 GASEQGRWIHEYIVRNRFELNSIVVTALIDMYCKCGCIEEGLNVFECAPKKQLSCWNSMI 331 Query: 490 IGLAVNGHGEEAIQLFSDLESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVI 311 +GLA NG E A+ LFS+LE SGL PD VSFIGVLTAC HSG V+ A + LM E Y+I Sbjct: 332 LGLANNGFEERAMDLFSELERSGLEPDSVSFIGVLTACAHSGEVHRADEFFRLMKEKYMI 391 Query: 310 EPTIKHYGCMVDVLGRAGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAAK 131 EP+IKHY MV+VLG AGLLEEAEALI NM ++ D +IW SLLSACR GNVEM +RAAK Sbjct: 392 EPSIKHYTLMVNVLGGAGLLEEAEALIKNMPVEEDTVIWSSLLSACRKIGNVEMAKRAAK 451 Query: 130 RLVELDPSDSGSYILLSNVYASSGRYEEAMDARRSMQERGTKK 2 L +LDP ++ Y+LLSN YAS G +EEA++ R M+ER +K Sbjct: 452 CLKKLDPDETCGYVLLSNAYASYGLFEEAVEQRLLMKERQMEK 494 Score = 68.2 bits (165), Expect = 1e-09 Identities = 49/207 (23%), Positives = 97/207 (46%), Gaps = 5/207 (2%) Frame = -1 Query: 619 IELNTIVLTAIIDMYCKCGRIEEAFKVFKASSKIGLSCWNSMIIGLAVNGHGEEAIQLFS 440 I + + ++I + KCG I++A +F + WNSMI G NG ++A+ +F Sbjct: 188 IGFDVVAWNSMIMGFAKCGLIDQAQNLFDEMPQRNGVSWNSMISGFVRNGRFKDALDMFR 247 Query: 439 DLESSGLRPDDVSFIGVLTACNHSGMVNEAKY-YLSLMTETYVIEPTIKHYGCMVDVLGR 263 +++ ++PD + + +L AC + G + ++ + ++ + + + ++D+ + Sbjct: 248 EMQEKDVKPDGFTMVSLLNACAYLGASEQGRWIHEYIVRNRFELNSIV--VTALIDMYCK 305 Query: 262 AGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAAKRLVELDPS----DSGS 95 G +EE + + K W S++ ++G ERA EL+ S DS S Sbjct: 306 CGCIEEG-LNVFECAPKKQLSCWNSMILGLANNG---FEERAMDLFSELERSGLEPDSVS 361 Query: 94 YILLSNVYASSGRYEEAMDARRSMQER 14 +I + A SG A + R M+E+ Sbjct: 362 FIGVLTACAHSGEVHRADEFFRLMKEK 388 Score = 55.5 bits (132), Expect = 9e-06 Identities = 49/220 (22%), Positives = 99/220 (45%), Gaps = 4/220 (1%) Frame = -1 Query: 652 KWIHAYINKNKIELNTIVLTAIIDMYCKC-GRIEEAFKVFKASSKIGLSCWNSMIIGLAV 476 K IHA + K + +T+ + ++ C + A+ VF + WN++I G + Sbjct: 42 KQIHASLIKTGLISDTVTASRVLAFCCASPSDMNYAYLVFTRINHKNPFVWNTIIRGFSR 101 Query: 475 NGHGEEAIQLFSDL--ESSGLRPDDVSFIGVLTACNHSGMVNEAKYYLSLMTETYVIEPT 302 + E AI +F D+ S ++P +++ V A G + + ++ + + + + Sbjct: 102 SSFPEMAISIFIDMLCSSPSVKPQRLTYPSVFKAYGRLGQARDGRQLHGMVIKEGLEDDS 161 Query: 301 IKHYGCMVDVLGRAGLLEEAEALITNMSIKADAIIWGSLLSACRDHGNVEMGERAAKRLV 122 M+ + G L EA + M I D + W S++ G ++ A+ L Sbjct: 162 FIR-NTMLHMYVTCGCLIEAWRIFLGM-IGFDVVAWNSMIMGFAKCGLIDQ----AQNLF 215 Query: 121 ELDPSDSG-SYILLSNVYASSGRYEEAMDARRSMQERGTK 5 + P +G S+ + + + +GR+++A+D R MQE+ K Sbjct: 216 DEMPQRNGVSWNSMISGFVRNGRFKDALDMFREMQEKDVK 255