BLASTX nr result
ID: Coptis25_contig00012911
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00012911
         (1622 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   541   e-151
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   486   e-135
ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|2...   479   e-133
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   474   e-131
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   461   e-127

>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610 [Vitis vinifera]
          Length = 763

 Score = 541 bits (1394), Expect = e-151
 Identities = 290/509 (56%), Positives = 349/509 (68%), Gaps = 39/509 (7%)
 Frame = +2

Query: 212  MQSLSVWSVRGDFLAVPHFSFGLNTSRTKDKH-----------------------FTCTS 322
            MQ+LSVW +G F AVP + L +S + + +S
Sbjct: 1    MQALSVWPSKGVFWAVPQLDYNLGSSSIPSRRRGRRKLWNPEDPVCQYRSLAFLWVSSSS 60

Query: 323  RYHLVFVP----------------SNLKVFKLXXXXXXXXXXXXXALTWAVEQEETGNGV 454
            R V V S LK+F L AL WA+EQ+ GN
Sbjct: 61   RSDRVGVYCGSPKFDFGCGLLSGYSKLKIF-LLCERKRGSFGASFALAWALEQQAIGNEF 119

Query: 455  SVESTSLIGELSDKSDSVEVDCGNTDEGADSETVNVVTEGNGIGQNENDLGEQTSIRVDV 634
            E ++ I L+ +++V++DC D D + + E + ++ E+ S VDV
Sbjct: 120  VKEDSNSIHSLAGNTETVDIDCLKVDGARDGDEND--NEEEKEAEKNGEVIEEKSRNVDV 177

Query: 635  RALAGRLQFAETADDVEEVLREMEELPLPVYSSMIRGFGLDKRMESALALFEWLKMKKKT 814
            RALA L+FA TADDVEEVL++ ELPL VYS+MIRGFG DKR+++A+AL EWLK KK+T
Sbjct: 178  RALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGFGTDKRLDAAMALVEWLKRKKET 237

Query: 815  TGGRIGPNLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSNMVTYNTLMAIYLEQGWP 994
            G + GPNLF+YNSLLGA+KQ ++F VEKVM DMA EG+ N+VTYNTLM+IYLEQG
Sbjct: 238  NGSK-GPNLFVYNSLLGAVKQSEKFALVEKVMNDMAREGILPNVVTYNTLMSIYLEQGRS 296

Query: 995  NRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFTELRDRYKKGEIGKYDDE 1174
            AL LEEI+K G+ PSPVSYSTAL+ YRRMEDG+G LKFF ELR+ Y KGEIGK DE
Sbjct: 297  VEALNILEEIQKNGLCPSPVSYSTALLVYRRMEDGHGALKFFIELRENYLKGEIGKDADE 356

Query: 1175 DWENEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMDKAGLNPGRAEYERLVWA 1354
            DWENEFVKL+ FTIRICYQVMRRWLVK + S +L+LL++MD AGL PGRAEYERLVWA
Sbjct: 357  DWENEFVKLKNFTIRICYQVMRRWLVKEGNQSPILLKLLADMDNAGLQPGRAEYERLVWA 416

Query: 1355 CTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPN 1534
            CT E H++VAK+LY RIRER +EISLSVCNH IWLMGKAKKWWAALEIYEDLLDKGPKPN
Sbjct: 417  CTREEHYVVAKELYTRIRERHTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPKPN 476

Query: 1535 NLSYELVVSHFNVLLTAASRRGIWRWGVR 1621
            NLSYELVVSHFN+LLTAA ++GIWRWGVR
Sbjct: 477  NLSYELVVSHFNILLTAARKKGIWRWGVR 505

>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223533700|gb|EEF35435.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score = 486 bits (1252), Expect = e-135
 Identities = 245/406 (60%), Positives = 304/406 (74%), Gaps = 2/406 (0%)
 Frame = +2

Query: 410  ALTWAVEQEETGNGVSVESTSLIGELSDKSDSVEVDCGNTDEGADSETVNVVTEGNGIG- 586
            A WA+++++ + SL L KS+ +V+ N DS+ N E N
Sbjct: 9    AFAWALQKQDISSEFHGVEPSLDDGLLGKSEKEDVNPHNLGRLEDSDDDNNNQEDNIELD 68

Query: 587  -QNENDLGEQTSIRVDVRALAGRLQFAETADDVEEVLREMEELPLPVYSSMIRGFGLDKR 763
            +++ +GE+ +DVR+LA L A+TADDVEEVL++ ELPL VYSSMI+ FG D +
Sbjct: 69   LRSKEGVGEEKCRSIDVRSLARSLHSAQTADDVEEVLKDKGELPLQVYSSMIKAFGWDNK 128

Query: 764  MESALALFEWLKMKKKTTGGRIGPNLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSN 943
            MESALAL EWLK ++K G IGPNLFIYNSLL A+K+ K F EK++ DM +EG+ N
Sbjct: 129  MESALALVEWLK-RRKEIGSSIGPNLFIYNSLLSAVKKSKLFEEAEKILNDMTQEGIAPN 187

Query: 944  MVTYNTLMAIYLEQGWPNRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFT 1123
            +VTYNTLM IY+E+G +AL LE++ +KG P+ SYSTAL+AYR MEDG+G L FF
Sbjct: 188  VVTYNTLMGIYVEKGQATKALNILEQMHEKGFIPTAASYSTALLAYRGMEDGHGALAFFV 247

Query: 1124 ELRDRYKKGEIGKYDDEDWENEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMD 1303
            +++D+Y KG+IGK DE+WENEFVKLE F IRICYQVMRRWLV+ D+ ST VL+LL++MD
Sbjct: 248  DIKDKYLKGKIGKNSDENWENEFVKLETFIIRICYQVMRRWLVRHDNFSTDVLKLLTDMD 307

Query: 1304 KAGLNPGRAEYERLVWACTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWW 1483
            KAGL P +AEYERLVWACT E H+ V K+LY RIRER S+ISLSVCNH IWLMGKAKKWW
Sbjct: 308  KAGLQPSQAEYERLVWACTREDHYAVGKELYIRIRERHSKISLSVCNHLIWLMGKAKKWW 367

Query: 1484 AALEIYEDLLDKGPKPNNLSYELVVSHFNVLLTAASRRGIWRWGVR 1621
            AALEIYEDLLDKGP PNN+SYEL+VSHFN+LLTAA +RGIWRWGVR
Sbjct: 368  AALEIYEDLLDKGPNPNNMSYELIVSHFNILLTAARKRGIWRWGVR 413

>ref|XP_002324000.1| predicted protein [Populus trichocarpa]
 gi|222867002|gb|EEF04133.1| predicted protein [Populus trichocarpa]
          Length = 709

 Score = 479 bits (1234), Expect = e-133
 Identities = 258/489 (52%), Positives = 328/489 (67%), Gaps = 19/489 (3%)
 Frame = +2

Query: 212  MQSLSVWSVRGDFLAVPHFSF------------GLNTSRTKDKHFT-CTSRYHLV----- 337
            MQ+LSVW + G AVPH F G+ D F +S + +V
Sbjct: 1    MQTLSVWPLSGGSCAVPHLEFEEDSSCFLSTRRGIKRWGLVDNVFQGASSGFPMVSGDLR 60

Query: 338  FVPSNLKV-FKLXXXXXXXXXXXXXALTWAVEQEETGNGVSVESTSLIGELSDKSDSVEV 514
            F+ ++ K+ + AL A+EQ++ GN E + L D+S
Sbjct: 61   FLSNHSKIKYVCFRETKEGSFGSSLALASALEQQKIGN----EFHRVESSLDDRS----- 111

Query: 515  DCGNTDEGADSETVNVVTEGNGIGQNENDLGEQTSIRVDVRALAGRLQFAETADDVEEVL 694
            + GE+ ++DV ALA L FA+T DD+EEVL
Sbjct: 112  --------------------------LGEAGEERDEKIDVPALAQSLYFAKTVDDIEEVL 145

Query: 695  REMEELPLPVYSSMIRGFGLDKRMESALALFEWLKMKKKTTGGRIGPNLFIYNSLLGAMK 874
            ++ ELP+ VY SMI+GFG DK+ME A+AL +WLK+KK+T G I PNLFIYNSLL A+K
Sbjct: 146  KDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKIKKETDG-TIVPNLFIYNSLLSAVK 204

Query: 875  QCKQFGGVEKVMEDMAEEGVGSNMVTYNTLMAIYLEQGWPNRALTFLEEIEKKGMSPSPV 1054
            Q +Q+ EK++E M +EGV N+VTYN LM IY++QG +AL LEE+ + G +PS
Sbjct: 205  QSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVKQGQAKKALDVLEEMRRNGFTPSAA 264

Query: 1055 SYSTALVAYRRMEDGNGGLKFFTELRDRYKKGEIGKYDDEDWENEFVKLEKFTIRICYQV 1234
            SYS+AL+AYR+MEDG+G LKFF E++D+Y KGEIGK DEDWE E+VKLE FTIR+CYQV
Sbjct: 265  SYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGKDADEDWEREYVKLENFTIRVCYQV 324

Query: 1235 MRRWLVKGDHSSTKVLQLLSEMDKAGLNPGRAEYERLVWACTLEGHHIVAKDLYKRIRER 1414
            MRRWLV+ ++ +T VL+LL++MDKA L PGR++YERLVWACT E H++VAK+LY RIRER
Sbjct: 325  MRRWLVRLENLNTNVLKLLTDMDKAELQPGRSDYERLVWACTREEHYVVAKELYIRIRER 384

Query: 1415 ESEISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELVVSHFNVLLTAASR 1594
            S+ISLSVCNH IWLMGKAKKWWAALE+YEDLLDKGPKPNNLSYEL+VS+FNVLLTAA +
Sbjct: 385  CSDISLSVCNHVIWLMGKAKKWWAALEVYEDLLDKGPKPNNLSYELIVSYFNVLLTAAKK 444

Query: 1595 RGIWRWGVR 1621
            RGIWRWGVR
Sbjct: 445  RGIWRWGVR 453

>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319497|gb|EFH49919.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 674

 Score = 474 bits (1221), Expect = e-131
 Identities = 260/491 (52%), Positives = 322/491 (65%), Gaps = 21/491 (4%)
 Frame = +2

Query: 212  MQSLSVWSVRGDFLAVPHFSFGLNTS------RTKDKHFT----CTSRYH-LVFVPSNLK 358
            MQ+LS+W ++ L F L+ S +++ +H + C R L+ V SN K
Sbjct: 1    MQALSIWPLKSGLLVGSRLEFELDCSCFVVSHKSRKRHCSAQQGCFGRISSLILVSSNRK 60

Query: 359  VFKLXXXXXXXXXXXXX----------ALTWAVEQEETGNGVSVESTSLIGELSDKSDSV 508
            L + WA EQ E G VS E +S
Sbjct: 61   FEGLAVNPTSKVLFLCEPKRNLSGSSVGVGWATEQRELGEEVSTEDSSY----------- 109

Query: 509  EVDCGNTDEGADSETVNVVTEGNGIGQNENDLGEQTSIRVDVRALAGRLQFAETADDVEE 688
            +TVN GE+T+ RVDVR LA L+ A+TADDV+
Sbjct: 110  ------------PQTVNG--------------GEKTNSRVDVRELAYSLRAAKTADDVDI 143

Query: 689  VLREMEELPLPVYSSMIRGFGLDKRMESALALFEWLKMKKKTTGGRIGPNLFIYNSLLGA 868
            V++EM ELPL VY +MIRGFG DKR++ A+A+ +WL+ KK +GG IGPNLFIYNSLLGA
Sbjct: 144  VIKEMGELPLQVYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGVIGPNLFIYNSLLGA 203

Query: 869  MKQCKQFGGVEKVMEDMAEEGVGSNMVTYNTLMAIYLEQGWPNRALTFLEEIEKKGMSPS 1048
            MKQ G EK++ DM EEG+ N+VTYNTLM IY+E+G ++AL L+ +++KG P+
Sbjct: 204  MKQ-SSVGEAEKILSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKALGILDLVKEKGFEPN 262

Query: 1049 PVSYSTALVAYRRMEDGNGGLKFFTELRDRYKKGEIGKYDDEDWENEFVKLEKFTIRICY 1228
            P++YSTAL+ YRRMEDG G L+FF ELR++Y K EIG D DWE EFVKLE F RICY
Sbjct: 263  PITYSTALLVYRRMEDGMGALEFFVELREKYSKREIGNDADYDWEFEFVKLENFIGRICY 322

Query: 1229 QVMRRWLVKGDHSSTKVLQLLSEMDKAGLNPGRAEYERLVWACTLEGHHIVAKDLYKRIR 1408
            QVMRRWLVK ++ +T+VL+LL+ MD AG P R E+ERL+WACT E H+IV K+LYKRIR
Sbjct: 323  QVMRRWLVKDENWTTRVLKLLNAMDNAGPKPSREEHERLIWACTREEHYIVGKELYKRIR 382

Query: 1409 ERESEISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELVVSHFNVLLTAA 1588
            ER EISLSVCNH IWLMGKAKKWWAALEIYEDLLD+GP+PNNLSYELVVSHFN+LL+AA
Sbjct: 383  ERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAA 442

Query: 1589 SRRGIWRWGVR 1621
            SRRGIWRWGVR
Sbjct: 443  SRRGIWRWGVR 453

 Score = 57.8 bits (138), Expect = 8e-06
 Identities = 58/262 (22%), Positives = 106/262 (40%), Gaps = 21/262 (8%)
 Frame = +2

Query: 836  NLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSNMVTY-------NTLMAIYLEQGWP 994
            +L + N L+ M + K++ ++ ED+ +EG N ++Y N L++ +G
Sbjct: 389  SLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASRRGIW 448

Query: 995  NRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFTELRDRYKKG-------- 1150
            + L ++E KG+ P ++ LVA + + ++ F + D +K
Sbjct: 449  RWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGAL 508

Query: 1151 ----EIGKYDDEDWE--NEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMDKAG 1312
            E GK DE + N +K+ Y M ++ G + LL EM G
Sbjct: 509  LSALEKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMAS-VLTGQQKFNLLDTLLKEMASKG 567

Query: 1313 LNPGRAEYERLVWACTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWWAAL 1492
            + P Y ++ C G VA + + R+R + E + I + K A
Sbjct: 568  IEPSVVTYNAVISGCARNGLSGVAYEWFHRMRGEKVEPNEITYEMLIEALANDAKPRLAY 627

Query: 1493 EIYEDLLDKGPKPNNLSYELVV 1558
            E++ + G K ++ Y+ VV
Sbjct: 628  ELHLKAQNDGLKLSSKPYDAVV 649

>ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g46610
 gi|6523064|emb|CAB62331.1| hypothetical protein [Arabidopsis thaliana]
 gi|332644660|gb|AEE78181.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 665

 Score = 461 bits (1185), Expect = e-127
 Identities = 225/339 (66%), Positives = 276/339 (81%)
 Frame = +2

Query: 605  GEQTSIRVDVRALAGRLQFAETADDVEEVLREMEELPLPVYSSMIRGFGLDKRMESALAL 784
            GE+ ++RVDVR LA L+ A+TADDV+ VL++ ELPL V+ +MI+GFG DKR++ A+A+
Sbjct: 109  GEKNNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQVFCAMIKGFGKDKRLKPAVAV 168

Query: 785  FEWLKMKKKTTGGRIGPNLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSNMVTYNTL 964
            +WLK KK +GG IGPNLFIYNSLLGAM+ FG EK+++DM EEG+ N+VTYNTL
Sbjct: 169  VDWLKRKKSESGGVIGPNLFIYNSLLGAMRG---FGEAEKILKDMEEEGIVPNIVTYNTL 225

Query: 965  MAIYLEQGWPNRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFTELRDRYK 1144
            M IY+E+G +AL L+ ++KG P+P++YSTAL+ YRRMEDG G L+FF ELR++Y
Sbjct: 226  MVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRMEDGMGALEFFVELREKYA 285

Query: 1145 KGEIGKYDDEDWENEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMDKAGLNPG 1324
            K EIG DWE EFVKLE F RICYQVMRRWLVK D+ +T+VL+LL+ MD AG+ P
Sbjct: 286  KREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDDNWTTRVLKLLNAMDSAGVRPS 345

Query: 1325 RAEYERLVWACTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWWAALEIYE 1504
            R E+ERL+WACT E H+IV K+LYKRIRER SEISLSVCNH IWLMGKAKKWWAALEIYE
Sbjct: 346  REEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWLMGKAKKWWAALEIYE
 405

Query: 1505 DLLDKGPKPNNLSYELVVSHFNVLLTAASRRGIWRWGVR 1621
            DLLD+GP+PNNLSYELVVSHFN+LL+AAS+RGIWRWGVR
Sbjct: 406  DLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRwGVR 444
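
The per-alignment statistics in this report ("Score = ... bits (...), Expect = ..., Identities = .../...") can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output: the names `STATS_RE`, `parse_evalue`, and `parse_hsp_stats` are hypothetical, and the pattern assumes the legacy text layout seen here, including the shorthand in which very small E-values are printed without a mantissa (`e-151` for 1e-151).

```python
import re

# Hypothetical helper pattern; assumes the legacy BLAST text layout shown in
# this report. \s matches newlines too, so wrapped statistics lines also work.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert an E-value string to float, handling the 'e-151' shorthand."""
    return float("1" + text if text.startswith("e") else text)


def parse_hsp_stats(text):
    """Extract score, E-value and identity counts from one statistics line."""
    match = STATS_RE.search(text)
    if match is None:
        raise ValueError("no HSP statistics found")
    ident = int(match["ident"])
    length = int(match["length"])
    return {
        "bits": float(match["bits"]),
        "raw_score": int(match["raw"]),
        "evalue": parse_evalue(match["evalue"]),
        "identities": ident,
        "align_length": length,
        # Unrounded percentage; the report itself prints a whole number.
        "percent_identity": 100.0 * ident / length,
    }


if __name__ == "__main__":
    line = ("Score = 541 bits (1394), Expect = e-151 "
            "Identities = 290/509 (56%), Positives = 349/509 (68%)")
    print(parse_hsp_stats(line))
```

For the first hit, 290/509 is about 56.97%, while the report prints 56%; the percentages printed in this report all match the integer part of the ratio rather than the rounded value.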