BLASTX nr result
ID: Coptis24_contig00004220
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis24_contig00004220 (2090 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI35029.3| unnamed protein product [Vitis vinifera] 616 e-174 ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat... 616 e-174 ref|NP_189505.2| pentatricopeptide repeat-containing protein [Ar... 530 e-148 ref|NP_189507.2| pentatricopeptide repeat-containing protein [Ar... 528 e-147 ref|XP_002877120.1| pentatricopeptide repeat-containing protein ... 464 e-128 >emb|CBI35029.3| unnamed protein product [Vitis vinifera] Length = 1596 Score = 616 bits (1588), Expect = e-174 Identities = 298/474 (62%), Positives = 365/474 (77%) Frame = +2 Query: 2 MQQLKTIHAIFITNGHHDNNYAMSKLISFTALSNNGNLIYASQLFDQIPNPNSFIYNTLI 181 M+Q K IHA+FI NG H NNYA+SKLISF ALSN+G+L YAS +F QI NPN F YNTLI Sbjct: 17 MRQFKAIHALFIVNGLHLNNYAISKLISFCALSNSGSLSYASLIFSQIQNPNLFAYNTLI 76 Query: 182 RAYSRSTQPQVSLHYFCLMLKDDCIVPDYHTLPFVLIACANACCVSLGEGIHCWVLKNGW 361 RAYSRS+ PQ++LHYF LML D+ + PD HT PF++ AC N+ + LG+ IH WVLKNG Sbjct: 77 RAYSRSSTPQLALHYFQLMLDDENVGPDQHTFPFIISACTNSLWMLLGKQIHNWVLKNGV 136 Query: 362 GFSDKHIQTALLRFYLECGSLVEFRKVFDEIHERDVFQWNVFMNGCLRYGMDVEALSAFR 541 SD+H+QTAL+RFY EC ++ + RK+FDEI DV QWNV +NG +R G+ EAL+AFR Sbjct: 137 ASSDRHVQTALVRFYAECCAMGDARKLFDEIPNLDVVQWNVLLNGYVRRGLAPEALNAFR 196 Query: 542 DMLSSGVELDEYCVATGLTACAHSGALRQGMWIHEYVKKRDEFSEDVFVGTALVDMYAKC 721 +ML SGVE DE+C+ T L CA GAL+QG WIHEYV KR DVF+GTALVDMYAKC Sbjct: 197 NMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLEADVFIGTALVDMYAKC 256 Query: 722 GCIDKAVEVFRRMRKRNEFSWAAMIGGFALHGFAKEAIHCLDRMIMEDGLRPDGVVLLAV 901 
GCID++VEVF M KRN FSW+AMIGGFALHG ++A+ CL+RM +EDGLRPDGVVLL V Sbjct: 257 GCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLERMQVEDGLRPDGVVLLGV 316 Query: 902 LTACTHAGCEEEGRLLLRNMKALYGIVPKHEHYSCTVDLLCRAGRLNEALELIQTMPMRP 1081 + AC HAG +EEG+ LL NM+A YGI+PKHEHYSC VDLLCRAG+L+EAL+LI+ MPM+P Sbjct: 317 IMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRAGQLDEALKLIRRMPMKP 376 Query: 1082 LASVWGSLLSACKTHGNXXXXXXXXXXXXQIEHDSGAQEDGAYVQLSNIYLNAQRCSDAG 1261 A+VWG+LLS C+TH N + + G +EDGAYVQLSNIYL AQ+C DA Sbjct: 377 RAAVWGALLSGCRTHNNVDLAELAARELLMVGNGDGTEEDGAYVQLSNIYLAAQKCEDAC 436 Query: 1262 RIRMMMDDKGIKKTPGCSVIEVNGKVNEFVAGDVVHPLRLEIHAILDLLSLRIS 1423 RIR M+ DK IK PGCS+IEV G+VN+FV+GD+ HP +IH +LDL+SL+ S Sbjct: 437 RIRRMIGDKRIKTKPGCSLIEVEGEVNQFVSGDISHPCLAQIHEMLDLVSLQHS 490 >ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat-containing protein At3g28640-like [Vitis vinifera] Length = 511 Score = 616 bits (1588), Expect = e-174 Identities = 298/474 (62%), Positives = 365/474 (77%) Frame = +2 Query: 2 MQQLKTIHAIFITNGHHDNNYAMSKLISFTALSNNGNLIYASQLFDQIPNPNSFIYNTLI 181 M+Q K IHA+FI NG H NNYA+SKLISF ALSN+G+L YAS +F QI NPN F YNTLI Sbjct: 17 MRQFKAIHALFIVNGLHLNNYAISKLISFCALSNSGSLSYASLIFSQIQNPNLFAYNTLI 76 Query: 182 RAYSRSTQPQVSLHYFCLMLKDDCIVPDYHTLPFVLIACANACCVSLGEGIHCWVLKNGW 361 RAYSRS+ PQ++LHYF LML D+ + PD HT PF++ AC N+ + LG+ IH WVLKNG Sbjct: 77 RAYSRSSTPQLALHYFQLMLDDENVGPDQHTFPFIISACTNSLWMLLGKQIHNWVLKNGV 136 Query: 362 GFSDKHIQTALLRFYLECGSLVEFRKVFDEIHERDVFQWNVFMNGCLRYGMDVEALSAFR 541 SD+H+QTAL+RFY EC ++ + RK+FDEI DV QWNV +NG +R G+ EAL+AFR Sbjct: 137 ASSDRHVQTALVRFYAECCAMGDARKLFDEIPNLDVVQWNVLLNGYVRRGLAPEALNAFR 196 Query: 542 DMLSSGVELDEYCVATGLTACAHSGALRQGMWIHEYVKKRDEFSEDVFVGTALVDMYAKC 721 +ML SGVE DE+C+ T L CA GAL+QG WIHEYV KR DVF+GTALVDMYAKC Sbjct: 197 NMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLEADVFIGTALVDMYAKC 256 Query: 722 GCIDKAVEVFRRMRKRNEFSWAAMIGGFALHGFAKEAIHCLDRMIMEDGLRPDGVVLLAV 901 GCID++VEVF M KRN FSW+AMIGGFALHG ++A+ CL+RM +EDGLRPDGVVLL V Sbjct: 257 
GCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLERMQVEDGLRPDGVVLLGV 316 Query: 902 LTACTHAGCEEEGRLLLRNMKALYGIVPKHEHYSCTVDLLCRAGRLNEALELIQTMPMRP 1081 + AC HAG +EEG+ LL NM+A YGI+PKHEHYSC VDLLCRAG+L+EAL+LI+ MPM+P Sbjct: 317 IMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRAGQLDEALKLIRRMPMKP 376 Query: 1082 LASVWGSLLSACKTHGNXXXXXXXXXXXXQIEHDSGAQEDGAYVQLSNIYLNAQRCSDAG 1261 A+VWG+LLS C+TH N + + G +EDGAYVQLSNIYL AQ+C DA Sbjct: 377 RAAVWGALLSGCRTHNNVDLAELAARELLMVGNGDGTEEDGAYVQLSNIYLAAQKCEDAC 436 Query: 1262 RIRMMMDDKGIKKTPGCSVIEVNGKVNEFVAGDVVHPLRLEIHAILDLLSLRIS 1423 RIR M+ DK IK PGCS+IEV G+VN+FV+GD+ HP +IH +LDL+SL+ S Sbjct: 437 RIRRMIGDKRIKTKPGCSLIEVEGEVNQFVSGDISHPCLAQIHEMLDLVSLQHS 490 >ref|NP_189505.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75273576|sp|Q9LJJ1.1|PP259_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At3g28640 gi|9294278|dbj|BAB02180.1| unnamed protein product [Arabidopsis thaliana] gi|332643948|gb|AEE77469.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 530 bits (1364), Expect = e-148 Identities = 258/475 (54%), Positives = 345/475 (72%), Gaps = 4/475 (0%) Frame = +2 Query: 2 MQQLKTIHAIFITNGHHDNNYAMSKLIS-FTALSN-NGNLIYASQLFDQIPNPNSFIYNT 175 ++Q+K+ H++FI +G H N YA+SKL++ F L N N + YAS +FD I PNSF+Y+T Sbjct: 24 VKQIKSTHSLFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYASSIFDSIEIPNSFVYDT 83 Query: 176 LIRAYSRSTQPQVSLHYFCLMLKDD--CIVPDYHTLPFVLIACANACCVSLGEGIHCWVL 349 +IR SRS+QP + L YF LM+K++ I P Y T F+++AC AC S+G+ IHCWV+ Sbjct: 84 MIRICSRSSQPHLGLRYFLLMVKEEEEDIAPSYLTFHFLIVACLKACFFSVGKQIHCWVV 143 Query: 350 KNGWGFSDKHIQTALLRFYLECGSLVEFRKVFDEIHERDVFQWNVFMNGCLRYGMDVEAL 529 KNG SD H+QT +LR Y+E L++ RKVFDEI + DV +W+V MNG +R G+ E L Sbjct: 144 KNGVFLSDSHVQTGVLRIYVEDKLLLDARKVFDEIPQPDVVKWDVLMNGYVRCGLGSEGL 203 Query: 530 SAFRDMLSSGVELDEYCVATGLTACAHSGALRQGMWIHEYVKKRDEFSEDVFVGTALVDM 709 FR+ML G+E DE+ V T LTACA GAL QG WIHE+VKK+ DVFVGTALVDM Sbjct: 204 EVFREMLVKGLEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKSWIESDVFVGTALVDM 
263 Query: 710 YAKCGCIDKAVEVFRRMRKRNEFSWAAMIGGFALHGFAKEAIHCLDRMIMEDGLRPDGVV 889 YAKCGCI+ AVEVF+++ +RN FSWAA+IGG+A +G+AK+A+ CL+R+ EDG++PD VV Sbjct: 264 YAKCGCIETAVEVFKKLTRRNVFSWAALIGGYAAYGYAKKAMTCLERLEREDGIKPDSVV 323 Query: 890 LLAVLTACTHAGCEEEGRLLLRNMKALYGIVPKHEHYSCTVDLLCRAGRLNEALELIQTM 1069 LL VL AC H G EEGR +L NM+A Y I PKHEHYSC VDL+CRAGRL++AL LI+ M Sbjct: 324 LLGVLAACAHGGFLEEGRSMLENMEARYEITPKHEHYSCIVDLMCRAGRLDDALNLIEKM 383 Query: 1070 PMRPLASVWGSLLSACKTHGNXXXXXXXXXXXXQIEHDSGAQEDGAYVQLSNIYLNAQRC 1249 PM+PLASVWG+LL+ C+TH N +E + +E+ A VQLSNIY + QR Sbjct: 384 PMKPLASVWGALLNGCRTHKNVELGELAVKNLLDLEKGNVEEEEAALVQLSNIYFSVQRN 443 Query: 1250 SDAGRIRMMMDDKGIKKTPGCSVIEVNGKVNEFVAGDVVHPLRLEIHAILDLLSL 1414 +A ++R M++ +G++KTPG SV+EV+G V +FV+GDV HP L+IH ++ LLS+ Sbjct: 444 PEASKVRGMIEQRGVRKTPGWSVLEVDGNVTKFVSGDVSHPNLLQIHTVIHLLSV 498 >ref|NP_189507.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75273574|sp|Q9LJI9.1|PP260_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g28660 gi|9294280|dbj|BAB02182.1| unnamed protein product [Arabidopsis thaliana] gi|20259531|gb|AAM13885.1| unknown protein [Arabidopsis thaliana] gi|24030460|gb|AAN41382.1| unknown protein [Arabidopsis thaliana] gi|332643950|gb|AEE77471.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 504 Score = 528 bits (1359), Expect = e-147 Identities = 261/481 (54%), Positives = 346/481 (71%), Gaps = 4/481 (0%) Frame = +2 Query: 2 MQQLKTIHAIFITNGHHDNNYAMSKLIS-FTALSN-NGNLIYASQLFDQIPNPNSFIYNT 175 ++Q+K+ H++FI +G H N YA+SKL++ F L N N + YAS +FD I PNSF+Y+T Sbjct: 24 VKQIKSTHSLFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYASSIFDSIEIPNSFVYDT 83 Query: 176 LIRAYSRSTQPQVSLHYFCLMLKDD--CIVPDYHTLPFVLIACANACCVSLGEGIHCWVL 349 +IR SRS+QP + L YF LM+K++ I P Y T F+++AC AC S+G+ IHCWV+ Sbjct: 84 MIRICSRSSQPHLGLRYFLLMVKEEEEDITPSYLTFHFLIVACLKACFFSVGKQIHCWVV 143 Query: 350 KNGWGFSDKHIQTALLRFYLECGSLVEFRKVFDEIHERDVFQWNVFMNGCLRYGMDVEAL 529 KNG SD H+QT +LR Y+E L + RKVFDEI + DV +W+V MNG +R G+ E L 
Sbjct: 144 KNGVFLSDGHVQTGVLRIYVEDKLLFDARKVFDEIPQPDVVKWDVLMNGYVRCGLGSEGL 203 Query: 530 SAFRDMLSSGVELDEYCVATGLTACAHSGALRQGMWIHEYVKKRDEFSEDVFVGTALVDM 709 F++ML G+E DE+ V T LTACA GAL QG WIHE+VKK+ DVFVGTALVDM Sbjct: 204 EVFKEMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKRWIESDVFVGTALVDM 263 Query: 710 YAKCGCIDKAVEVFRRMRKRNEFSWAAMIGGFALHGFAKEAIHCLDRMIMEDGLRPDGVV 889 YAKCGCI+ AVEVF ++ +RN FSWAA+IGG+A +G+AK+A CLDR+ EDG++PD VV Sbjct: 264 YAKCGCIETAVEVFEKLTRRNVFSWAALIGGYAAYGYAKKATTCLDRIEREDGIKPDSVV 323 Query: 890 LLAVLTACTHAGCEEEGRLLLRNMKALYGIVPKHEHYSCTVDLLCRAGRLNEALELIQTM 1069 LL VL AC H G EEGR +L NM+A YGI PKHEHYSC VDL+CRAGRL++AL+LI+ M Sbjct: 324 LLGVLAACAHGGFLEEGRTMLENMEARYGITPKHEHYSCIVDLMCRAGRLDDALDLIEKM 383 Query: 1070 PMRPLASVWGSLLSACKTHGNXXXXXXXXXXXXQIEHDSGAQEDGAYVQLSNIYLNAQRC 1249 PM+PLASVWG+LL+ C+TH N +E + +E+ A VQLSNIY + QR Sbjct: 384 PMKPLASVWGALLNGCRTHKNVELGELAVQNLLDLEKGNVEEEEAALVQLSNIYFSVQRN 443 Query: 1250 SDAGRIRMMMDDKGIKKTPGCSVIEVNGKVNEFVAGDVVHPLRLEIHAILDLLSLRISDH 1429 +A ++R M++ +GI+KTPG S++EV+G V +FV+GDV HP L+IH ++ LLS+ S Sbjct: 444 PEAFKVRGMIEQRGIRKTPGWSLLEVDGIVTKFVSGDVSHPNLLQIHTLIHLLSVDASQI 503 Query: 1430 L 1432 L Sbjct: 504 L 504 >ref|XP_002877120.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297322958|gb|EFH53379.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 399 Score = 464 bits (1193), Expect = e-128 Identities = 221/393 (56%), Positives = 288/393 (73%) Frame = +2 Query: 236 MLKDDCIVPDYHTLPFVLIACANACCVSLGEGIHCWVLKNGWGFSDKHIQTALLRFYLEC 415 M+K++ I P Y T F+++AC AC S+G+ IHCWV+KNG SD H+QT +LR Y+E Sbjct: 1 MVKEEDIAPSYLTFYFLIVACFKACLFSVGKQIHCWVVKNGVFLSDGHVQTGILRIYVED 60 Query: 416 GSLVEFRKVFDEIHERDVFQWNVFMNGCLRYGMDVEALSAFRDMLSSGVELDEYCVATGL 595 L++ KVFDEI + DV +W+V MNG +R G+ E L FR+ML GVE DE+ V T L Sbjct: 61 KVLLDAHKVFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVRGVEPDEFSVTTAL 120 Query: 596 TACAHSGALRQGMWIHEYVKKRDEFSEDVFVGTALVDMYAKCGCIDKAVEVFRRMRKRNE 775 TACA GAL QG WIHE+VKK+ DVFVGTALVDMYAKCGCI+ AVEVF ++ +RN Sbjct: 121 TACAQVGALAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCGCIEMAVEVFEKLSRRNV 180 Query: 776 FSWAAMIGGFALHGFAKEAIHCLDRMIMEDGLRPDGVVLLAVLTACTHAGCEEEGRLLLR 955 FSWAA+IGG+A +G+AK+A+ CLDRM EDG++PD VVLL VL AC H G +EGR +L Sbjct: 181 FSWAALIGGYAAYGYAKKAMTCLDRMEREDGIKPDSVVLLGVLAACAHGGFLQEGRAMLG 240 Query: 956 NMKALYGIVPKHEHYSCTVDLLCRAGRLNEALELIQTMPMRPLASVWGSLLSACKTHGNX 1135 NM+A YGI PKHEHYSC VDL+CRAGRL++AL+LI+ MPM+PLASVWG+LL+ C+TH N Sbjct: 241 NMEARYGITPKHEHYSCIVDLMCRAGRLDDALDLIEKMPMKPLASVWGALLNGCRTHKNV 300 Query: 1136 XXXXXXXXXXXQIEHDSGAQEDGAYVQLSNIYLNAQRCSDAGRIRMMMDDKGIKKTPGCS 1315 +E + +E+ A VQLSNIY + QR +A ++R M++ +GI+KTPG S Sbjct: 301 ELGELAVKNLLDLEKGNAEEEEAALVQLSNIYFSVQRNPEASKVRGMIEQRGIRKTPGWS 360 Query: 1316 VIEVNGKVNEFVAGDVVHPLRLEIHAILDLLSL 1414 V+EV+G V +FV+GDV HP L+IH ++ LLS+ Sbjct: 361 VLEVDGNVTKFVSGDVSHPNLLQIHTVIHLLSV 393 Score = 102 bits (253), Expect = 5e-19 Identities = 67/236 (28%), Positives = 116/236 (49%), Gaps = 2/236 (0%) Frame = +2 Query: 113 LIYASQLFDQIPNPNSFIYNTLIRAYSRSTQPQVSLHYFCLMLKDDCIVPDYHTLPFVLI 292 L+ A ++FD+IP P+ ++ L+ Y R L F ML + PD ++ L Sbjct: 63 LLDAHKVFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVRG-VEPDEFSVTTALT 121 Query: 293 ACANACCVSLGEGIHCWVLKNGWGFSDKHIQTALLRFYLECGSLVEFRKVFDEIHERDVF 472 ACA ++ G+ IH +V K W SD + TAL+ Y +CG + +VF+++ R+VF Sbjct: 122 
ACAQVGALAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCGCIEMAVEVFEKLSRRNVF 181 Query: 473 QWNVFMNGCLRYGMDVEALSAFRDM-LSSGVELDEYCVATGLTACAHSGALRQGMWIHEY 649 W + G YG +A++ M G++ D + L ACAH G L++G + Sbjct: 182 SWAALIGGYAAYGYAKKAMTCLDRMEREDGIKPDSVVLLGVLAACAHGGFLQEGRAMLGN 241 Query: 650 VKKRDEFSEDVFVGTALVDMYAKCGCIDKAVEVFRRMRKRNEFS-WAAMIGGFALH 814 ++ R + + +VD+ + G +D A+++ +M + S W A++ G H Sbjct: 242 MEARYGITPKHEHYSCIVDLMCRAGRLDDALDLIEKMPMKPLASVWGALLNGCRTH 297
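The per-alignment statistics in the report above (Score, Expect, Identities) follow a regular pattern, so they can be recovered even from flattened text. The following is a minimal Python sketch, assuming the legacy BLAST 2.2.x text layout shown here. The names `HSP_RE`, `parse_evalue`, and `parse_hsps` are illustrative and are not part of BLAST or any library.

```python
import re

# Assumed pattern for the statistics of one alignment (HSP) in a legacy
# BLAST text report, e.g.
#   "Score = 616 bits (1588), Expect = e-174 Identities = 298/474 (62%)"
# \s+ is used between fields so the pattern works whether the fields are
# separated by newlines (original layout) or single spaces (flattened).
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>\d+(?:\.\d+)?)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert an E-value string to float; 'e-174' is read as '1e-174'."""
    if text.startswith(("e", "E")):
        text = "1" + text
    return float(text)


def parse_hsps(report):
    """Return one dict of statistics per alignment found in the report text."""
    hsps = []
    for m in HSP_RE.finditer(report):
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hsps.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": parse_evalue(m.group("evalue")),
            "identities": ident,
            "align_len": length,
            # Unrounded percentage; the report prints an integer percentage.
            "pct_identity": 100.0 * ident / length,
        })
    return hsps
```

Calling `parse_hsps` on the full report text should yield one entry per `Score = ...` block, including the second, weaker alignment reported for the last hit. Filtering by `evalue` or `pct_identity` is then a plain list comprehension.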