BLASTX nr result
ID: Coptis25_contig00037763
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00037763
         (693 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002280725.1| PREDICTED: pentatricopeptide repeat-containi...   242   4e-62
ref|XP_002884507.1| pentatricopeptide repeat-containing protein ...   204   1e-50
ref|NP_187185.2| pentatricopeptide repeat-containing protein [Ar...   203   2e-50
gb|AAF27040.1|AC009177_30 hypothetical protein [Arabidopsis thal...   203   2e-50
ref|XP_003626292.1| Pentatricopeptide repeat-containing protein ...   183   2e-44

>ref|XP_002280725.1| PREDICTED: pentatricopeptide repeat-containing protein
 At3g05340 [Vitis vinifera]
          Length = 656

 Score = 242 bits (618), Expect = 4e-62
 Identities = 122/228 (53%), Positives = 155/228 (67%)
 Frame = +3

Query: 6   SSQISSFLTKKLQHSPNPHISDITLSHVGLSTLLSICGREGHFRLGLSLHASIIKNYEYF 185
           +S +SS L + +P S ++ V +S LLS+CGREG+ LG SLHASIIKN+ +
Sbjct: 18  TSPVSSPLKTLILQNPYSETSKFAINQVDISFLLSLCGREGYLHLGSSLHASIIKNFGFL 77

Query: 186 HPMLQSNTRNVVAVWNSLVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXX 365
           + N RNV+ VWNSL+SMYS+ G L A KVFD MPMKDT+SWNS IS
Sbjct: 78  DGNNRDNLRNVIVVWNSLLSMYSRCGELRDATKVFDHMPMKDTISWNSRISGLLGNGDIE 137

Query: 366 XXXXXXKQMRGLGVFRVDQASLTCVLSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNAL 545
           KQ+ G+++ DQA+LT VL+ACD P+ YV M+H LV GYE+E++VGNAL
Sbjct: 138 MGFRVFKQLYESGIYQFDQATLTTVLTACDKPEFCYVSKMIHSLVFLYGYEREITVGNAL 197

Query: 546 ITNYSKCGCCESGRRVFCEMFVTNVITWTAVISGLAQSEFCEESLRLF 689
           IT+Y +CGCC SGRRVF EM NV+TWTAVISGL+Q +F EESL+LF
Sbjct: 198 ITSYFRCGCCSSGRRVFDEMSEKNVVTWTAVISGLSQGQFYEESLKLF 245

 Score = 76.6 bits (187), Expect = 4e-12
 Identities = 57/201 (28%), Positives = 96/201 (47%), Gaps = 2/201 (0%)
 Frame = +3

Query: 93  LSTLLSICGREGHFRLGLSLHASIIKNYEYFHPMLQSNTRNVVAVWNSLVSMYSKFGLLV 272
           L+T+L+ C + + +H S++ Y Y + V N+L++ Y + G
Sbjct: 159 LTTVLTACDKPEFCYVSKMIH-SLVFLYGY---------EREITVGNALITSYFRCGCCS 208

Query: 273 SAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCV--LS 446
           S +VFDEM K+ V+W ++IS +MR VD SLT + L
Sbjct: 209 SGRRVFDEMSEKNVVTWTAVISGLSQGQFYEESLKLFGKMRD---GPVDPNSLTYLSSLM 265

Query: 447 ACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCESGRRVFCEMFVTNVIT 626
           AC + +H LV + G ++ + +AL+ YSKCG E ++F + ++
Sbjct: 266 ACSGLQAIREGRQIHGLVWKLGVHFDLCIESALMDMYSKCGSLEDAWKIFESAEEVDEVS 325

Query: 627 WTAVISGLAQSEFCEESLRLF 689
           T ++ GLAQ+ F EES+++F
Sbjct: 326 MTVILVGLAQNGFEEESIQVF 346

 Score = 63.9 bits (154), Expect = 3e-08
 Identities = 49/199 (24%), Positives = 87/199 (43%)
 Frame = +3

Query: 93  LSTLLSICGREGHFRLGLSLHASIIKNYEYFHPMLQSNTRNVVAVWNSLVSMYSKFGLLV 272
           LS+L++ G + R G +H + K +F ++S +L+ MYSK G L
Sbjct: 261 LSSLMACSGLQA-IREGRQIHGLVWKLGVHFDLCIES----------ALMDMYSKCGSLE 309

Query: 273 SAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCVLSAC 452
           A K+F+ D VS ++ +M GV +D ++ +L
Sbjct: 310 DAWKIFESAEEVDEVSMTVILVGLAQNGFEEESIQVFVKMVKNGVV-IDPNMISAILGVF 368

Query: 453 DNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCESGRRVFCEMFVTNVITWT 632
           L + +H L+++ + V N LI YSKCG + ++FC M N ++W
Sbjct: 369 GIDTSLALGKQIHSLIIKKSFGSNYFVNNGLINMYSKCGDLDDSIKIFCWMPQRNSVSWN 428

Query: 633 AVISGLAQSEFCEESLRLF 689
           ++I+ A+ +L+L+
Sbjct: 429 SMIAAFARHGNGSRALQLY 447

>ref|XP_002884507.1| pentatricopeptide repeat-containing protein [Arabidopsis
 lyrata subsp. lyrata]
 gi|297330347|gb|EFH60766.1| pentatricopeptide repeat-containing protein
 [Arabidopsis lyrata subsp. lyrata]
          Length = 676

 Score = 204 bits (520), Expect = 1e-50
 Identities = 111/218 (50%), Positives = 141/218 (64%), Gaps = 1/218 (0%)
 Frame = +3

Query: 39  LQHSPNPHISDITLSHVGLSTLLSICGREGHFR-LGLSLHASIIKNYEYFHPMLQSNTRN 215
           ++ SP+ +S L+HV +S LLSICGREG F LG LHASI+KN E+F P+ RN
Sbjct: 29  IRQSPSYQVSTFLLNHVDMSLLLSICGREGWFPYLGPCLHASIVKNPEFFDPVDADIHRN 88

Query: 216 VVAVWNSLVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMR 395
           + VWNSL+S+Y K G L A K+FDEMP++D +S N + K+M
Sbjct: 89  ALVVWNSLLSLYVKCGKLGDALKLFDEMPVRDVISQNIVFYGFLRNRETESGFVLLKRML 148

Query: 396 GLGVFRVDQASLTCVLSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCC 575
           G G F DQA+LT VLS CD P+ V M+H L + +GY++E+SVGN LIT+Y KCGC
Sbjct: 149 GSGGF--DQATLTIVLSVCDTPEFCLVTKMIHALAILSGYDKEISVGNKLITSYFKCGCS 206

Query: 576 ESGRRVFCEMFVTNVITWTAVISGLAQSEFCEESLRLF 689
           SGR VF EM NVITWTAVISGL ++E E+ LRLF
Sbjct: 207 VSGRWVFSEMAHRNVITWTAVISGLIENELHEDGLRLF 244

 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 52/200 (26%), Positives = 93/200 (46%), Gaps = 1/200 (0%)
 Frame = +3

Query: 93  LSTLLSICGREGHFRLGLSLHA-SIIKNYEYFHPMLQSNTRNVVAVWNSLVSMYSKFGLL 269
           L+ +LS+C + +HA +I+ Y+ ++V N L++ Y K G
Sbjct: 158 LTIVLSVCDTPEFCLVTKMIHALAILSGYD-----------KEISVGNKLITSYFKCGCS 206

Query: 270 VSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCVLSA 449
           VS VF EM ++ ++W ++IS MR G+ + + L+A
Sbjct: 207 VSGRWVFSEMAHRNVITWTAVISGLIENELHEDGLRLFCLMRR-GLVHPNSVTYLSALAA 265

Query: 450 CDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCESGRRVFCEMFVTNVITW 629
           C M+ +H L+ + G E E+ + +AL+ YSKCG E ++F + ++
Sbjct: 266 CSGSQMIVEGQQIHALLWKFGIESELCIESALMDMYSKCGSIEDAWKIFESSQEVDEVSM 325

Query: 630 TAVISGLAQSEFCEESLRLF 689
           T ++ GLAQ+ EE+++ F
Sbjct: 326 TVILVGLAQNGSEEEAIQFF 345

 Score = 55.8 bits (133), Expect = 8e-06
 Identities = 39/157 (24%), Positives = 70/157 (44%)
 Frame = +3

Query: 219 VAVWNSLVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRG 398
           + + ++L+ MYSK G + A K+F+ D VS ++ +M
Sbjct: 291 LCIESALMDMYSKCGSIEDAWKIFESSQEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQ 350

Query: 399 LGVFRVDQASLTCVLSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCE 578
           GV +D ++ +L + L + +H LV++ + V N LI YSKCG
Sbjct: 351 AGV-EIDANVVSAILGVSFVDNSLGLGKQLHSLVIKRKFCGNTFVNNGLINMYSKCGDLT 409

Query: 579 SGRRVFCEMFVTNVITWTAVISGLAQSEFCEESLRLF 689
           + VF M N ++W ++I+ A+ +L+L+
Sbjct: 410 DSQTVFRRMPKRNYVSWNSMIAAFARHGHGLAALKLY 446

>ref|NP_187185.2| pentatricopeptide repeat-containing protein [Arabidopsis
 thaliana]
 gi|218546760|sp|Q9MA85.2|PP215_ARATH RecName: Full=Pentatricopeptide
 repeat-containing protein At3g05340
 gi|332640702|gb|AEE74223.1| pentatricopeptide repeat-containing protein
 [Arabidopsis thaliana]
          Length = 658

 Score = 203 bits (517), Expect = 2e-50
 Identities = 112/218 (51%), Positives = 140/218 (64%), Gaps = 1/218 (0%)
 Frame = +3

Query: 39  LQHSPNPHISDITLSHVGLSTLLSICGREGHF-RLGLSLHASIIKNYEYFHPMLQSNTRN 215
           ++ SPN +S L+HV +S LLSICGREG F LG LHASIIKN E+F P+ RN
Sbjct: 29  IRQSPNYQVSTFLLNHVDMSLLLSICGREGWFPHLGPCLHASIIKNPEFFEPVDADIHRN 88

Query: 216 VVAVWNSLVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMR 395
           + VWNSL+S+Y+K G LV A K+FDEMPM+D +S N + K+M
Sbjct: 89  ALVVWNSLLSLYAKCGKLVDAIKLFDEMPMRDVISQNIVFYGFLRNRETESGFVLLKRML 148

Query: 396 GLGVFRVDQASLTCVLSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCC 575
           G G F D A+LT VLS CD P+ V M+H L + +GY++E+SVGN LIT+Y KCGC
Sbjct: 149 GSGGF--DHATLTIVLSVCDTPEFCLVTKMIHALAILSGYDKEISVGNKLITSYFKCGCS 206

Query: 576 ESGRRVFCEMFVTNVITWTAVISGLAQSEFCEESLRLF 689
           SGR VF M NVIT TAVISGL ++E E+ LRLF
Sbjct: 207 VSGRGVFDGMSHRNVITLTAVISGLIENELHEDGLRLF 244

 Score = 73.2 bits (178), Expect = 5e-11
 Identities = 51/203 (25%), Positives = 91/203 (44%), Gaps = 1/203 (0%)
 Frame = +3

Query: 84  HVGLSTLLSICGREGHFRLGLSLHA-SIIKNYEYFHPMLQSNTRNVVAVWNSLVSMYSKF 260
           H L+ +LS+C + +HA +I+ Y+ ++V N L++ Y K
Sbjct: 155 HATLTIVLSVCDTPEFCLVTKMIHALAILSGYD-----------KEISVGNKLITSYFKC 203

Query: 261 GLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCV 440
           G VS VFD M ++ ++ ++IS MR G+ + +
Sbjct: 204 GCSVSGRGVFDGMSHRNVITLTAVISGLIENELHEDGLRLFSLMRR-GLVHPNSVTYLSA 262

Query: 441 LSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCESGRRVFCEMFVTNV 620
           L+AC + +H L+ + G E E+ + +AL+ YSKCG E +F +
Sbjct: 263 LAACSGSQRIVEGQQIHALLWKYGIESELCIESALMDMYSKCGSIEDAWTIFESTTEVDE 322

Query: 621 ITWTAVISGLAQSEFCEESLRLF 689
           ++ T ++ GLAQ+ EE+++ F
Sbjct: 323 VSMTVILVGLAQNGSEEEAIQFF 345

>gb|AAF27040.1|AC009177_30 hypothetical protein [Arabidopsis thaliana]
          Length = 770

 Score = 203 bits (517), Expect = 2e-50
 Identities = 112/218 (51%), Positives = 140/218 (64%), Gaps = 1/218 (0%)
 Frame = +3

Query: 39  LQHSPNPHISDITLSHVGLSTLLSICGREGHF-RLGLSLHASIIKNYEYFHPMLQSNTRN 215
           ++ SPN +S L+HV +S LLSICGREG F LG LHASIIKN E+F P+ RN
Sbjct: 29  IRQSPNYQVSTFLLNHVDMSLLLSICGREGWFPHLGPCLHASIIKNPEFFEPVDADIHRN 88

Query: 216 VVAVWNSLVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMR 395
           + VWNSL+S+Y+K G LV A K+FDEMPM+D +S N + K+M
Sbjct: 89  ALVVWNSLLSLYAKCGKLVDAIKLFDEMPMRDVISQNIVFYGFLRNRETESGFVLLKRML 148

Query: 396 GLGVFRVDQASLTCVLSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCC 575
           G G F D A+LT VLS CD P+ V M+H L + +GY++E+SVGN LIT+Y KCGC
Sbjct: 149 GSGGF--DHATLTIVLSVCDTPEFCLVTKMIHALAILSGYDKEISVGNKLITSYFKCGCS 206

Query: 576 ESGRRVFCEMFVTNVITWTAVISGLAQSEFCEESLRLF 689
           SGR VF M NVIT TAVISGL ++E E+ LRLF
Sbjct: 207 VSGRGVFDGMSHRNVITLTAVISGLIENELHEDGLRLF 244

 Score = 73.2 bits (178), Expect = 5e-11
 Identities = 51/203 (25%), Positives = 91/203 (44%), Gaps = 1/203 (0%)
 Frame = +3

Query: 84  HVGLSTLLSICGREGHFRLGLSLHA-SIIKNYEYFHPMLQSNTRNVVAVWNSLVSMYSKF 260
           H L+ +LS+C + +HA +I+ Y+ ++V N L++ Y K
Sbjct: 155 HATLTIVLSVCDTPEFCLVTKMIHALAILSGYD-----------KEISVGNKLITSYFKC 203

Query: 261 GLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCV 440
           G VS VFD M ++ ++ ++IS MR G+ + +
Sbjct: 204 GCSVSGRGVFDGMSHRNVITLTAVISGLIENELHEDGLRLFSLMRR-GLVHPNSVTYLSA 262

Query: 441 LSACDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCESGRRVFCEMFVTNV 620
           L+AC + +H L+ + G E E+ + +AL+ YSKCG E +F +
Sbjct: 263 LAACSGSQRIVEGQQIHALLWKYGIESELCIESALMDMYSKCGSIEDAWTIFESTTEVDE 322

Query: 621 ITWTAVISGLAQSEFCEESLRLF 689
           ++ T ++ GLAQ+ EE+++ F
Sbjct: 323 VSMTVILVGLAQNGSEEEAIQFF 345

>ref|XP_003626292.1| Pentatricopeptide repeat-containing protein [Medicago
 truncatula]
 gi|355501307|gb|AES82510.1| Pentatricopeptide repeat-containing protein
 [Medicago truncatula]
          Length = 650

 Score = 183 bits (465), Expect = 2e-44
 Identities = 103/216 (47%), Positives = 134/216 (62%), Gaps = 5/216 (2%)
 Frame = +3

Query: 57  PHISDITLSHVGLSTLLSICGREGHFRLGLSLHASIIKNYEYFHPMLQSNTRNVVAVWNS 236
           P + L+H L++LL++CGR+ + LG S+HA IIK F + RN + +WNS
Sbjct: 25  PSTTKSLLNHADLTSLLTLCGRDRNLTLGSSIHARIIKQPPSFD--FDGSQRNALFIWNS 82

Query: 237 LVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLG--VF 410
           L+SMYSK G +A VFD MP++DTVSWN+MIS KQM
Sbjct: 83  LLSMYSKCGEFRNAGNVFDYMPVRDTVSWNTMISGFLRNGDFDTSFKFFKQMTESNRVCC 142

Query: 411 RVDQASLTCVLSACDNPDM---LYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCES 581
           R D+A+LT +LS CD + V M+H LV G+E+E++VGNALIT+Y KC C
Sbjct: 143 RFDKATLTTMLSGCDGLRLGISTSVTQMIHGLVFVGGFEREITVGNALITSYFKCECFSQ 202

Query: 582 GRRVFCEMFVTNVITWTAVISGLAQSEFCEESLRLF 689
           GR+VF EM NV+TWTAVISGLAQ+EF E+SLRLF
Sbjct: 203 GRKVFDEMIERNVVTWTAVISGLAQNEFYEDSLRLF 238

 Score = 70.1 bits (170), Expect = 4e-10
 Identities = 51/200 (25%), Positives = 93/200 (46%), Gaps = 1/200 (0%)
 Frame = +3

Query: 93  LSTLLSICGREGHFRLGLSLHASIIKNYEYFHPML-QSNTRNVVAVWNSLVSMYSKFGLL 269
           L+T+LS C RLG+S + + H ++ + V N+L++ Y K
Sbjct: 149 LTTMLSGCDG---LRLGISTSVT-----QMIHGLVFVGGFEREITVGNALITSYFKCECF 200

Query: 270 VSAAKVFDEMPMKDTVSWNSMISXXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCVLSA 449
           KVFDEM ++ V+W ++IS QMR G + + L A
Sbjct: 201 SQGRKVFDEMIERNVVTWTAVISGLAQNEFYEDSLRLFAQMRCCGSVSPNVLTYLSSLMA 260

Query: 450 CDNPDMLYVCMMMHCLVLRNGYEQEVSVGNALITNYSKCGCCESGRRVFCEMFVTNVITW 629
           C +L +H L+ + G + ++ + +AL+ YSKCG ++ ++F + ++
Sbjct: 261 CSGLQVLRDGQKIHGLLWKLGMQSDLCIESALMDLYSKCGSLDAAWQIFESAEELDGVSL 320

Query: 630 TAVISGLAQSEFCEESLRLF 689
           T ++ AQ+ F EE++++F
Sbjct: 321 TVILVAFAQNGFEEEAIQIF 340

 Score = 59.7 bits (143), Expect = 5e-07
 Identities = 40/166 (24%), Positives = 79/166 (47%), Gaps = 1/166 (0%)
 Frame = +3

Query: 162 IIKNYEYFHPML-QSNTRNVVAVWNSLVSMYSKFGLLVSAAKVFDEMPMKDTVSWNSMIS 338
           ++++ + H +L + ++ + + ++L+ +YSK G L +A ++F+ D VS ++
Sbjct: 266 VLRDGQKIHGLLWKLGMQSDLCIESALMDLYSKCGSLDAAWQIFESAEELDGVSLTVILV 325

Query: 339 XXXXXXXXXXXXXXXKQMRGLGVFRVDQASLTCVLSACDNPDMLYVCMMMHCLVLRNGYE 518
           +M LG+ VD ++ VL L + +H L+++ +
Sbjct: 326 AFAQNGFEEEAIQIFTKMVALGM-EVDANMVSAVLGVFGVGTYLALGKQIHSLIIKKNFC 384

Query: 519 QEVSVGNALITNYSKCGCCESGRRVFCEMFVTNVITWTAVISGLAQ 656
           + VGN L+ YSKCG VF +M N ++W +VI+ A+
Sbjct: 385 ENPFVGNGLVNMYSKCGDLSDSLLVFYQMTQKNSVSWNSVIAAFAR 430