BLASTX nr result
ID: Coptis24_contig00033663
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00033663
         (428 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_003551787.1| PREDICTED: pentatricopeptide repeat-containi...      92  6e-17
ref|XP_003530462.1| PREDICTED: pentatricopeptide repeat-containi...      91  1e-16
ref|XP_002878586.1| pentatricopeptide repeat-containing protein ...      90  2e-16
ref|NP_179798.1| pentatricopeptide repeat-containing protein [Ar...      89  5e-16
emb|CBI22025.3| unnamed protein product [Vitis vinifera]                 88  8e-16

>ref|XP_003551787.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02330-like [Glycine max]
          Length = 852

 Score = 91.7 bits (226), Expect = 6e-17
 Identities = 50/134 (37%), Positives = 81/134 (60%), Gaps = 4/134 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFA-FATVLTGCARVS 177
           FD MPER+VVSWN++++ Y+HNG     +E+F  M+ ++  +P ++A FA +L  C+ +
Sbjct: 95  FDSMPERDVVSWNSLLSCYLHNGVNRKSIEIFVRMRSLK--IPHDYATFAVILKACSGIE 152

Query: 178 ELKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRERGE-NATLKVLMIK 348
           +  LG+Q+H   + +GFE + V   A+ +MY +C ++  A R  RE  E N      +I
Sbjct: 153 DYGLGLQVHCLAIQMGFENDVVTGSALVDMYSKCKKLDDAFRVFREMPERNLVCWSAVIA 212

Query: 349 GYVSNKRCDDALRL 390
           GYV N R  + L+L
Sbjct: 213 GYVQNDRFIEGLKL 226

 Score = 70.9 bits (172), Expect = 1e-10
 Identities = 44/132 (33%), Positives = 72/132 (54%), Gaps = 3/132 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           F+EM  R+ VSWNA+IAA+  N + V  L LF +M     + PD+F + +V+  CA
Sbjct: 398 FEEMERRDAVSWNAIIAAHEQNEEIVKTLSLFVSM-LRSTMEPDDFTYGSVVKACAGQQA 456

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRERGENATLK-VLMIKG 351
           L  G +IH  ++  G  ++  +  A+ +MY +CG +  AE+      E  T+    +I G
Sbjct: 457 LNYGTEIHGRIIKSGMGLDWFVGSALVDMYGKCGMLMEAEKIHARLEEKTTVSWNSIISG 516

Query: 352 YVSNKRCDDALR 387
           + S K+ ++A R
Sbjct: 517 FSSQKQSENAQR 528

 Score = 70.1 bits (170), Expect = 2e-10
 Identities = 38/93 (40%), Positives = 55/93 (59%), Gaps = 2/93 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           F EMPERN+V W+A+IA YV N + ++GL+LF  M  V   V  +  +A+V   CA +S
Sbjct: 196 FREMPERNLVCWSAVIAGYVQNDRFIEGLKLFKDMLKVGMGV-SQSTYASVFRSCAGLSA 254

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRC 273
            KLG Q+H   L   F  + ++  A  +MY +C
Sbjct: 255 FKLGTQLHGHALKSDFAYDSIIGTATLDMYAKC 287

 Score = 58.5 bits (140), Expect = 5e-07
 Identities = 27/93 (29%), Positives = 56/93 (60%), Gaps = 2/93 (2%)
 Frame = +1

Query: 10  MPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSELKL 189
           + E+  VSWN++I+ +  + K+ +  + +++      ++PD + +ATVL  CA ++ ++L
Sbjct: 502 LEEKTTVSWNSIISGF-SSQKQSENAQRYFSQMLEMGIIPDNYTYATVLDVCANMATIEL 560

Query: 190 GMQIHAGVLVVGFEIECVLA--VCNMYLRCGEV 282
           G QIHA +L +    +  +A  + +MY +CG +
Sbjct: 561 GKQIHAQILKLQLHSDVYIASTLVDMYSKCGNM 593

>ref|XP_003530462.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02330-like [Glycine max]
          Length = 852

 Score = 90.9 bits (224), Expect = 1e-16
 Identities = 50/134 (37%), Positives = 81/134 (60%), Gaps = 4/134 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFA-FATVLTGCARVS 177
           FD MPER+VVSWN++++ Y+HNG     +E+F  M+ ++  +P ++A F+ VL  C+ +
Sbjct: 95  FDTMPERDVVSWNSLLSCYLHNGVNRKSIEIFVRMRSLK--IPHDYATFSVVLKACSGIE 152

Query: 178 ELKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRERGE-NATLKVLMIK 348
           +  LG+Q+H   + +GFE + V   A+ +MY +C ++  A R  RE  E N      +I
Sbjct: 153 DYGLGLQVHCLAIQMGFENDVVTGSALVDMYSKCKKLDGAFRIFREMPERNLVCWSAVIA 212

Query: 349 GYVSNKRCDDALRL 390
           GYV N R  + L+L
Sbjct: 213 GYVQNDRFIEGLKL 226

 Score = 72.8 bits (177), Expect = 3e-11
 Identities = 45/132 (34%), Positives = 73/132 (55%), Gaps = 3/132 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           FD+M  R+ VSWNA+IAA+  N + V  L LF +M     + PD+F + +V+  CA
Sbjct: 398 FDDMERRDAVSWNAIIAAHEQNEEIVKTLSLFVSM-LRSTMEPDDFTYGSVVKACAGQQA 456

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRERGENATLK-VLMIKG 351
           L  GM+IH  ++  G  ++  +  A+ +MY +CG +  AE+      E  T+    +I G
Sbjct: 457 LNYGMEIHGRIVKSGMGLDWFVGSALVDMYGKCGMLMEAEKIHDRLEEKTTVSWNSIISG 516

Query: 352 YVSNKRCDDALR 387
           + S K+ ++A R
Sbjct: 517 FSSQKQSENAQR 528

 Score = 72.0 bits (175), Expect = 5e-11
 Identities = 40/99 (40%), Positives = 58/99 (58%), Gaps = 2/99 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           F EMPERN+V W+A+IA YV N + ++GL+LF  M  V   V  +  +A+V   CA +S
Sbjct: 196 FREMPERNLVCWSAVIAGYVQNDRFIEGLKLFKDMLKVGMGV-SQSTYASVFRSCAGLSA 254

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFA 291
            KLG Q+H   L   F  + ++  A  +MY +C  +S A
Sbjct: 255 FKLGTQLHGHALKSDFAYDSIIGTATLDMYAKCDRMSDA 293

 Score = 63.2 bits (152), Expect = 2e-08
 Identities = 30/95 (31%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
 Frame = +1

Query: 4   DEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSEL 183
           D + E+  VSWN++I+ +  + K+ +  + +++      V+PD F +ATVL  CA ++ +
Sbjct: 500 DRLEEKTTVSWNSIISGF-SSQKQSENAQRYFSQMLEMGVIPDNFTYATVLDVCANMATI 558

Query: 184 KLGMQIHAGVLVVGFEIECVLA--VCNMYLRCGEV 282
           +LG QIHA +L +    +  +A  + +MY +CG +
Sbjct: 559 ELGKQIHAQILKLNLHSDVYIASTLVDMYSKCGNM 593

>ref|XP_002878586.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324425|gb|EFH54845.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 786

 Score = 89.7 bits (221), Expect = 2e-16
 Identities = 45/122 (36%), Positives = 75/122 (61%), Gaps = 2/122 (1%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           F++M ER++V+WN+MI+ Y   G ++  L++F  M     + PD F  A+VL+ CA + +
Sbjct: 235 FEQMAERDIVTWNSMISGYNQRGYDLRALDMFSKMLRDSMLSPDRFTLASVLSACANLEK 294

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRERGENATLKVLMIKGY 354
           L +G QIH+ ++  GF+I  ++  A+ +MY RCG V  A R + +RG     K L I+G+
Sbjct: 295 LCIGEQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRG----TKDLKIEGF 350

Query: 355 VS 360
            +
Sbjct: 351 TA 352

>ref|NP_179798.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75206010|sp|Q9SHZ8.1|PP168_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g22070
 gi|4587589|gb|AAD25817.1| hypothetical protein [Arabidopsis thaliana]
 gi|330252165|gb|AEC07259.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 786

 Score = 88.6 bits (218), Expect = 5e-16
 Identities = 44/122 (36%), Positives = 75/122 (61%), Gaps = 2/122 (1%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           F++M ER++V+WN+MI+ +   G ++  L++F  M     + PD F  A+VL+ CA + +
Sbjct: 235 FEQMAERDIVTWNSMISGFNQRGYDLRALDIFSKMLRDSLLSPDRFTLASVLSACANLEK 294

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRERGENATLKVLMIKGY 354
           L +G QIH+ ++  GF+I  ++  A+ +MY RCG V  A R + +RG     K L I+G+
Sbjct: 295 LCIGKQIHSHIVTTGFDISGIVLNALISMYSRCGGVETARRLIEQRG----TKDLKIEGF 350

Query: 355 VS 360
            +
Sbjct: 351 TA 352

>emb|CBI22025.3| unnamed protein product [Vitis vinifera]
          Length = 489

 Score = 87.8 bits (216), Expect = 8e-16
 Identities = 51/131 (38%), Positives = 72/131 (54%), Gaps = 3/131 (2%)
 Frame = +1

Query: 1   FDEMPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSE 180
           FDEMPERNV +WNAM+A  +      +GL LF  M  + F +PDEFA  +VL GCA +
Sbjct: 141 FDEMPERNVATWNAMVAGLIQFEFNEEGLGLFSRMNELGF-LPDEFALGSVLRGCAGLRA 199

Query: 181 LKLGMQIHAGVLVVGFEIECVL--AVCNMYLRCGEVSFAERFLRER-GENATLKVLMIKG 351
           L  G Q+H  V   GFE   V+  ++ +MY++CG +   ER +R    +N      +I G
Sbjct: 200 LVAGRQVHGYVRKCGFEFNLVVVSSLAHMYMKCGSLGEGERLIRAMPSQNVVAWNTLIAG 259

Query: 352 YVSNKRCDDAL 384
              N   ++ L
Sbjct: 260 RAQNGYPEEVL 270

 Score = 65.5 bits (158), Expect = 4e-09
 Identities = 41/130 (31%), Positives = 68/130 (52%), Gaps = 3/130 (2%)
 Frame = +1

Query: 10  MPERNVVSWNAMIAAYVHNGKEVDGLELFYTMKCVEFVVPDEFAFATVLTGCARVSELKL 189
           MP +NVV+WN +IA    NG   + L+ +  MK   F  PD+  F +V++ C+ ++ L
Sbjct: 245 MPSQNVVAWNTLIAGRAQNGYPEEVLDQYNMMKMAGF-RPDKITFVSVISSCSELATLGQ 303

Query: 190 GMQIHAGVLVVGFE--IECVLAVCNMYLRCGEVSFAER-FLRERGENATLKVLMIKGYVS 360
           G QIHA V+  G    +  + ++ +MY RCG + ++ + FL     +      MI  Y
Sbjct: 304 GQQIHAEVIKAGASLIVSVISSLISMYSRCGCLEYSLKVFLECENGDVVCWSSMIAAYGF 363

Query: 361 NKRCDDALRL 390
           + R  +A+ L
Sbjct: 364 HGRGVEAIDL 373