BLASTX nr result
ID: Coptis21_contig00026454
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00026454 (938 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003535689.1| PREDICTED: pentatricopeptide repeat-containi... 396 e-108 ref|XP_002283562.1| PREDICTED: pentatricopeptide repeat-containi... 392 e-107 ref|XP_003595326.1| Pentatricopeptide repeat-containing protein ... 366 4e-99 ref|NP_199192.1| pentatricopeptide repeat-containing protein [Ar... 321 1e-85 ref|XP_002863633.1| pentatricopeptide repeat-containing protein ... 315 7e-84 >ref|XP_003535689.1| PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like [Glycine max] Length = 591 Score = 396 bits (1017), Expect = e-108 Identities = 186/305 (60%), Positives = 236/305 (77%), Gaps = 7/305 (2%) Frame = +1 Query: 43 IFNQISNPSIFLYNTLISSFTNNPHHTHIAFSLYSQILTHNSLKPTCFTYPSLLKASCSH 222 IFN I NP++FLYNTLISS T++ H+AFSLY+ ILTH +L+P FT+PSL KA SH Sbjct: 58 IFNHIPNPTLFLYNTLISSLTHHSDQIHLAFSLYNHILTHKTLQPNSFTFPSLFKACASH 117 Query: 223 PWLYYGKPLHAHILKFLVPSYDHFVQASLLNFYSKCGMLDVSRYLFDQITEPDLATWNTI 402 PWL +G PLHAH+LKFL P YD FVQ SLLNFY+K G L VSRYLFDQI+EPDLATWNT+ Sbjct: 118 PWLQHGPPLHAHVLKFLQPPYDPFVQNSLLNFYAKYGKLCVSRYLFDQISEPDLATWNTM 177 Query: 403 LSAYAKSSSN-------EDNSLSIETLFLFNEMQVSLTRPNEKTLVALIGACANLGVFNQ 561 L+AYA+S+S+ ED +S+E L LF +MQ+S +PNE TLVALI AC+NLG +Q Sbjct: 178 LAAYAQSASHVSYSTSFEDADMSLEALHLFCDMQLSQIKPNEVTLVALISACSNLGALSQ 237 Query: 562 GTWAHAYILRNNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVRKDTLCYNAMIGGFAI 741 G WAH Y+LRNNL+LNRFV TAL++ Y+KCGCLNLA Q+F+++ +DT CYNAMIGGFA+ Sbjct: 238 GAWAHGYVLRNNLKLNRFVGTALVDMYSKCGCLNLACQLFDELSDRDTFCYNAMIGGFAV 297 Query: 742 HGQSRGALYVFDRMRCDGVRPDDITLLVVMYTCAHAGLVKDGRRIFNTMKDEYGIEPKLE 921 HG AL ++ M+ + + PD T++V M+ C+H GLV++G IF +MK +G+EPKLE Sbjct: 298 HGHGNQALELYRNMKLEDLVPDGATIVVTMFACSHGGLVEEGLEIFESMKGVHGMEPKLE 357 Query: 922 HYGCL 936 HYGCL Sbjct: 358 HYGCL 362 >ref|XP_002283562.1| PREDICTED: pentatricopeptide repeat-containing protein At5g43790-like [Vitis vinifera] Length = 590 Score = 392 bits (1006), Expect = e-107 Identities = 187/300 (62%), Positives = 234/300 (78%), Gaps = 2/300 (0%) Frame = +1 Query: 43 IFNQISNPSIFLYNTLISSFTNNPHHTHIAFSLYSQILTHNSLKPTCFTYPSLLKASCSH 222 IFN I NP+IFLYNTLISS N HTHIAFSLYS++LTH +LKP FT+PSL KA S Sbjct: 62 IFNHIPNPTIFLYNTLISSLANIKPHTHIAFSLYSRVLTHTTLKPNGFTFPSLFKACGSQ 121 Query: 223 PWLYYGKPLHAHILKFLVPSYDHFVQASLLNFYSKCGMLDVSRYLFDQITEPDLATWNTI 402 PWL +G+ LH H+LKFL P+ D FVQA+LLN+Y+KCG + RYLF+QI++PDLA+WN+I Sbjct: 122 PWLRHGRALHTHVLKFLEPTCDPFVQAALLNYYAKCGKVGACRYLFNQISKPDLASWNSI 181 Query: 403 LSAYAKSSSN--EDNSLSIETLFLFNEMQVSLTRPNEKTLVALIGACANLGVFNQGTWAH 576 LSAY +S ED SLS+E L LF EMQ SL + NE TLVALI ACA LG +QG WAH Sbjct: 182 LSAYVHNSGAICEDVSLSLEVLTLFIEMQKSLIKANEVTLVALISACAELGALSQGAWAH 241 Query: 577 AYILRNNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVRKDTLCYNAMIGGFAIHGQSR 756 Y+L++NL+LN FV TALI+ Y+KCGCL+LA Q+F+Q+ +DTLCYNAMIGGFAIHG Sbjct: 242 VYVLKHNLKLNHFVGTALIDMYSKCGCLDLACQLFDQLPHRDTLCYNAMIGGFAIHGYGH 301 Query: 757 GALYVFDRMRCDGVRPDDITLLVVMYTCAHAGLVKDGRRIFNTMKDEYGIEPKLEHYGCL 936 AL +F +M +G+ PDD+TL+V M +C+H GLV++G +F +MK+ YG+EPKLEHYGCL Sbjct: 302 QALDLFKKMTLEGLAPDDVTLVVTMCSCSHVGLVEEGCDVFESMKEVYGVEPKLEHYGCL 361 >ref|XP_003595326.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355484374|gb|AES65577.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 740 Score = 366 bits (940), Expect = 4e-99 Identities = 176/310 (56%), Positives = 232/310 (74%), Gaps = 12/310 (3%) Frame = +1 Query: 43 IFNQISNPSIFLYNTLISSFTN--NPHHTHIAFSLYSQILTHNSLKPTCFTYPSLLKASC 216 IFN ISNP+IFLYNTLISS N N + H+AFSLY++ILT+ +L+P FT+PSL KA C Sbjct: 202 IFNYISNPTIFLYNTLISSLINQTNQNQIHLAFSLYNKILTNKNLQPNSFTFPSLFKACC 261 Query: 217 SHP-WLYYGKPLHAHILKFLVPSYDHFVQASLLNFYSKCGMLDVSRYLFDQITEPDLATW 393 S+ W +YG LH H+LKFL P +D+FVQASLLNFY+K G + VSRY+FD+I EPDLATW Sbjct: 262 SNQSWFHYGPLLHTHVLKFLQPPFDNFVQASLLNFYAKYGKMCVSRYIFDRINEPDLATW 321 Query: 394 NTILSAYAKSSSN-------EDNSLSIETLFLFNEMQVSLTRPNEKTLVALIGACANLGV 552 N IL+AYA+SSS +D S+E+L+LF +MQV RPNE T+VALI AC+NLG Sbjct: 322 NVILNAYARSSSYHSYSNSFDDADFSLESLYLFRDMQVIGIRPNEVTIVALISACSNLGA 381 Query: 553 FNQGTWAHAYILRNNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVR--KDTLCYNAMI 726 +QG W H ++LRN +++NRFV TA ++ Y+KCGCLNLA QVF+++ +D+ CY AMI Sbjct: 382 VSQGFWVHCFVLRNKIKMNRFVGTAFVDMYSKCGCLNLACQVFDKMPENDRDSFCYTAMI 441 Query: 727 GGFAIHGQSRGALYVFDRMRCDGVRPDDITLLVVMYTCAHAGLVKDGRRIFNTMKDEYGI 906 GGFA+HG AL ++ +M+ G+ PD T +V M+ C+H GLV++G IF +MK+ +G+ Sbjct: 442 GGFAVHGYGNQALELYRKMKFKGLVPDSATFVVTMFACSHVGLVEEGLEIFKSMKEVHGV 501 Query: 907 EPKLEHYGCL 936 EPKLEHYGCL Sbjct: 502 EPKLEHYGCL 511 >ref|NP_199192.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75262420|sp|Q9FG85.1|PP415_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g43790 gi|10177948|dbj|BAB11307.1| unnamed protein product [Arabidopsis thaliana] gi|332007627|gb|AED95010.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 460 Score = 321 bits (823), Expect = 1e-85 Identities = 165/318 (51%), Positives = 220/318 (69%), Gaps = 6/318 (1%) Frame = +1 Query: 1 LHVXXXXXXXXXXXIFNQISNPSIFLYNTLISSFTNNPH--HTHIAFSLYSQILTHNS-- 168 LH+ I QI NPS+FLYNTLISS +N + TH+AFSLY QIL+ S Sbjct: 48 LHLSSTVCLSYALSILRQIPNPSVFLYNTLISSIVSNHNSTQTHLAFSLYDQILSSRSNF 107 Query: 169 LKPTCFTYPSLLKAS-CSHPWLYYGKPLHAHILKFLVP-SYDHFVQASLLNFYSKCGMLD 342 ++P FTYPSL KAS W +G+ LHAH+LKFL P ++D FVQA+L+ FY+ CG L Sbjct: 108 VRPNEFTYPSLFKASGFDAQWHRHGRALHAHVLKFLEPVNHDRFVQAALVGFYANCGKLR 167 Query: 343 VSRYLFDQITEPDLATWNTILSAYAKSSSNEDNSLSIETLFLFNEMQVSLTRPNEKTLVA 522 +R LF++I EPDLATWNT+L+AYA S + + E L LF MQV RPNE +LVA Sbjct: 168 EARSLFERIREPDLATWNTLLAAYANSEEIDSDE---EVLLLFMRMQV---RPNELSLVA 221 Query: 523 LIGACANLGVFNQGTWAHAYILRNNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVRKD 702 LI +CANLG F +G WAH Y+L+NNL LN+FV T+LI+ Y+KCGCL+ A++VF+++ ++D Sbjct: 222 LIKSCANLGEFVRGVWAHVYVLKNNLTLNQFVGTSLIDLYSKCGCLSFARKVFDEMSQRD 281 Query: 703 TLCYNAMIGGFAIHGQSRGALYVFDRMRCDGVRPDDITLLVVMYTCAHAGLVKDGRRIFN 882 CYNAMI G A+HG + + ++ + G+ PD T +V + C+H+GLV +G +IFN Sbjct: 282 VSCYNAMIRGLAVHGFGQEGIELYKSLISQGLVPDSATFVVTISACSHSGLVDEGLQIFN 341 Query: 883 TMKDEYGIEPKLEHYGCL 936 +MK YGIEPK+EHYGCL Sbjct: 342 SMKAVYGIEPKVEHYGCL 359 Score = 73.6 bits (179), Expect = 6e-11 Identities = 55/210 (26%), Positives = 99/210 (47%), Gaps = 6/210 (2%) Frame = +1 Query: 241 KPLHAHILKFLVPSYDHFVQASLLNFYSKCGMLDVSRYLFDQITEPDLATWNTILSAYAK 420 K +HA I+ + H S L S L + + QI P + +NT++S+ Sbjct: 26 KQIHAQIIT--IGLSHHTYPLSKLLHLSSTVCLSYALSILRQIPNPSVFLYNTLISSIVS 83 Query: 421 SSSNEDNSLSIETLFLFNEMQVSLTRPNEKTLVALIGACA-NLGVFNQGTWAHAYILR-- 591 + ++ L+ + + RPNE T +L A + G HA++L+ Sbjct: 84 NHNSTQTHLAFSLYDQILSSRSNFVRPNEFTYPSLFKASGFDAQWHRHGRALHAHVLKFL 143 Query: 592 NNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVRKDTLCYNAMIGGFAIHGQ---SRGA 762 + +RFV AL+ FYA CG L A+ +FE++ D +N ++ +A + Sbjct: 144 EPVNHDRFVQAALVGFYANCGKLREARSLFERIREPDLATWNTLLAAYANSEEIDSDEEV 203 Query: 763 LYVFDRMRCDGVRPDDITLLVVMYTCAHAG 852 L +F RM+ VRP++++L+ ++ +CA+ G Sbjct: 204 LLLFMRMQ---VRPNELSLVALIKSCANLG 230 >ref|XP_002863633.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297309468|gb|EFH39892.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 460 Score = 315 bits (808), Expect = 7e-84 Identities = 163/318 (51%), Positives = 219/318 (68%), Gaps = 6/318 (1%) Frame = +1 Query: 1 LHVXXXXXXXXXXXIFNQISNPSIFLYNTLISSFTNNPH--HTHIAFSLYSQILTHNS-- 168 LH+ I QI NPS+FLYNTLISS +N + TH+AFSLY QIL+ S Sbjct: 48 LHLSSTVCLSYALSILRQIPNPSVFLYNTLISSIVSNHNSTQTHLAFSLYDQILSSRSNF 107 Query: 169 LKPTCFTYPSLLKAS-CSHPWLYYGKPLHAHILKFLVP-SYDHFVQASLLNFYSKCGMLD 342 ++P FTYPSL KAS W +G+ LHAH+LKF+ P ++D FVQA+L+ FY+ CG L Sbjct: 108 VRPNEFTYPSLFKASGFETKWHRHGRALHAHVLKFIEPVNHDRFVQAALVGFYANCGELR 167 Query: 343 VSRYLFDQITEPDLATWNTILSAYAKSSSNEDNSLSIETLFLFNEMQVSLTRPNEKTLVA 522 +R L ++I EPDLATWNT+L+AYA S E + E L LF MQV RPNE +LVA Sbjct: 168 EARSLLERIREPDLATWNTLLAAYANSEETESDE---EVLKLFVRMQV---RPNELSLVA 221 Query: 523 LIGACANLGVFNQGTWAHAYILRNNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVRKD 702 LI +CANLG F G WAH Y+L+ NL LN+FV T+LI+FY+KCGCL+ A+QVF+++ +D Sbjct: 222 LIKSCANLGEFWGGVWAHVYLLKKNLTLNQFVGTSLIDFYSKCGCLSFARQVFDEMSERD 281 Query: 703 TLCYNAMIGGFAIHGQSRGALYVFDRMRCDGVRPDDITLLVVMYTCAHAGLVKDGRRIFN 882 C+NAMI G A+HG + + +++ + G+ PD+ T +V + C+H+GLV +G +IF+ Sbjct: 282 ISCFNAMIRGLAVHGFGQEGIELYNSLISQGLVPDNATFVVTISACSHSGLVDEGLQIFH 341 Query: 883 TMKDEYGIEPKLEHYGCL 936 +MK YGIEPK+EHYGCL Sbjct: 342 SMKTVYGIEPKVEHYGCL 359 Score = 71.2 bits (173), Expect = 3e-10 Identities = 54/210 (25%), Positives = 98/210 (46%), Gaps = 6/210 (2%) Frame = +1 Query: 241 KPLHAHILKFLVPSYDHFVQASLLNFYSKCGMLDVSRYLFDQITEPDLATWNTILSAYAK 420 K +HA I+ + H S L S L + + QI P + +NT++S+ Sbjct: 26 KQIHAQIIT--IGLSHHTYPLSKLLHLSSTVCLSYALSILRQIPNPSVFLYNTLISSIVS 83 Query: 421 SSSNEDNSLSIETLFLFNEMQVSLTRPNEKTLVALIGACA-NLGVFNQGTWAHAYILR-- 591 + ++ L+ + + RPNE T +L A G HA++L+ Sbjct: 84 NHNSTQTHLAFSLYDQILSSRSNFVRPNEFTYPSLFKASGFETKWHRHGRALHAHVLKFI 143 Query: 592 NNLELNRFVATALIEFYAKCGCLNLAKQVFEQVVRKDTLCYNAMIGGFAIHGQSRG---A 762 + +RFV AL+ FYA CG L A+ + E++ D +N ++ +A ++ Sbjct: 144 EPVNHDRFVQAALVGFYANCGELREARSLLERIREPDLATWNTLLAAYANSEETESDEEV 203 Query: 763 LYVFDRMRCDGVRPDDITLLVVMYTCAHAG 852 L +F RM+ VRP++++L+ ++ +CA+ G Sbjct: 204 LKLFVRMQ---VRPNELSLVALIKSCANLG 230