BLASTX nr result
ID: Coptis21_contig00030799
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis21_contig00030799 (700 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002265138.1| PREDICTED: pentatricopeptide repeat-containi... 322 5e-86 emb|CAN67593.1| hypothetical protein VITISV_000699 [Vitis vinifera] 319 3e-85 ref|XP_002308772.1| predicted protein [Populus trichocarpa] gi|2... 315 4e-84 ref|XP_002883344.1| pentatricopeptide repeat-containing protein ... 311 6e-83 ref|XP_002520572.1| pentatricopeptide repeat-containing protein,... 308 5e-82 >ref|XP_002265138.1| PREDICTED: pentatricopeptide repeat-containing protein At3g22150, chloroplastic [Vitis vinifera] Length = 825 Score = 322 bits (825), Expect = 5e-86 Identities = 152/231 (65%), Positives = 186/231 (80%) Frame = +3 Query: 6 PSGGISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTT 185 P G I LGKQIHG+AIR ++ NVFVGTAL+DMYSK +I+YAE V EKN+VTYTT Sbjct: 537 PMGTIGLGKQIHGFAIRCFLNRNVFVGTALLDMYSKSGAITYAENVFAETLEKNSVTYTT 596 Query: 186 IILGYGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSY 365 +I YGQHG+G+RALS+F M G+ PD++TFVAILSACSY+GLVD+GL+IF+SME Y Sbjct: 597 MISSYGQHGMGERALSLFHAMLGSGIKPDSVTFVAILSACSYAGLVDEGLRIFQSMEREY 656 Query: 366 GILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHGEMELGRI 545 I P+ EHYCCV DMLGR GRVVEAYEFVK LG+ N GIWGSLLGACR+HGE ELG++ Sbjct: 657 KIQPSAEHYCCVADMLGRVGRVVEAYEFVKGLGEEGNTFGIWGSLLGACRIHGEFELGKV 716 Query: 546 ISDRLFEVEEDNDVTGYHVLLSNIYADKEMWDSADRVRKNMREKGLRKETG 698 ++++L E+E+ + +TGYHVLLSNIYA + WD+ DRVRK MR+KGL KE G Sbjct: 717 VANKLLEMEKGSSLTGYHVLLSNIYAAEGNWDNVDRVRKEMRQKGLMKEAG 767 Score = 78.6 bits (192), Expect = 1e-12 
Identities = 50/194 (25%), Positives = 93/194 (47%), Gaps = 11/194 (5%) Frame = +3 Query: 18 ISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTTIILG 197 + LG+Q+H Y ++ S V + A+I MYS+C SI + KV ++ E++ VT+ T++ Sbjct: 338 LELGRQLHAYILKSSTILQVVILNAIIVMYSRCGSIGTSFKVFSNMLERDVVTWNTMVSA 397 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACS-----------YSGLVDKGLKIF 344 + Q+GL D L + M + G D++T A+LS S ++ L+ G++ F Sbjct: 398 FVQNGLDDEGLMLVFAMQKQGFMVDSVTLTALLSLASNLRSQEIGKQAHAYLIRHGIQ-F 456 Query: 345 ESMESSYGILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHG 524 E M+S ++DM ++G + A + + D W +++ +G Sbjct: 457 EGMDS------------YLIDMYAKSGLITTAQQLFEKNSDYDRDEATWNAMIAGYTQNG 504 Query: 525 EMELGRIISDRLFE 566 E G + ++ E Sbjct: 505 LSEEGFAVFRKMIE 518 Score = 70.1 bits (170), Expect = 4e-10 Identities = 38/108 (35%), Positives = 64/108 (59%), Gaps = 2/108 (1%) Frame = +3 Query: 24 LGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCD--SIPEKNTVTYTTIILG 197 +GKQ H Y IRH I + + LIDMY+K I+ A+++ + S +++ T+ +I G Sbjct: 441 IGKQAHAYLIRHGIQFEG-MDSYLIDMYAKSGLITTAQQLFEKNSDYDRDEATWNAMIAG 499 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKI 341 Y Q+GL + ++F+ M E + P+A+T +IL AC+ G + G +I Sbjct: 500 YTQNGLSEEGFAVFRKMIEQNVRPNAVTLASILPACNPMGTIGLGKQI 547 >emb|CAN67593.1| hypothetical protein VITISV_000699 [Vitis vinifera] Length = 825 Score = 319 bits (818), Expect = 3e-85 Identities = 151/231 (65%), Positives = 186/231 (80%) Frame = +3 Query: 6 PSGGISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTT 185 P G I LGKQIHG+AIR ++ NVFVGTAL+DMYSK +I+YAE V EKN+VTYTT Sbjct: 537 PMGTIGLGKQIHGFAIRCFLNQNVFVGTALLDMYSKSGAITYAENVFAETLEKNSVTYTT 596 Query: 186 IILGYGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSY 365 +IL YGQHG+G+RALS+F M G+ PD++TFVAILSACSY+GLVD+GL+IF+SME Y Sbjct: 597 MILSYGQHGMGERALSLFHAMLGSGIKPDSVTFVAILSACSYAGLVDEGLRIFQSMEREY 656 Query: 366 GILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHGEMELGRI 545 I P++EHYCCV DMLGR GRV EAYEFVK LG+ N IWGSLLGACR+HGE ELG++ Sbjct: 657 
KIQPSSEHYCCVADMLGRVGRVXEAYEFVKGLGEEGNTFRIWGSLLGACRIHGEFELGKV 716 Query: 546 ISDRLFEVEEDNDVTGYHVLLSNIYADKEMWDSADRVRKNMREKGLRKETG 698 ++++L E+E+ + +TGYHVLLSNIYA + WD+ DRVRK MR+KGL KE G Sbjct: 717 VANKLLEMEKGSXLTGYHVLLSNIYAAEGNWDNVDRVRKEMRQKGLMKEAG 767 Score = 73.9 bits (180), Expect = 3e-11 Identities = 48/194 (24%), Positives = 91/194 (46%), Gaps = 11/194 (5%) Frame = +3 Query: 18 ISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTTIILG 197 + LG+Q+H Y ++ S V + A+I MYS+C SI + KV ++ E++ VT+ T++ Sbjct: 338 LDLGRQLHAYILKSSTILQVVILNAIIVMYSRCGSIGTSFKVFSNMLERDVVTWNTMVSA 397 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACS-----------YSGLVDKGLKIF 344 + Q+GL D L + M + G D++T A+LS S ++ L+ G++ F Sbjct: 398 FVQNGLDDEGLMLVFEMQKQGFMVDSVTLTALLSLASNLRSQEIGKQAHAYLIRHGIQ-F 456 Query: 345 ESMESSYGILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHG 524 E M+ ++DM ++G + A + + W +++ +G Sbjct: 457 EGMDG------------YLIDMYAKSGLITTAQQLFEKNSXYDRDEATWNAMIAGYTQNG 504 Query: 525 EMELGRIISDRLFE 566 E G + ++ E Sbjct: 505 LSEEGFAVFRKMIE 518 Score = 71.6 bits (174), Expect = 1e-10 Identities = 39/108 (36%), Positives = 63/108 (58%), Gaps = 2/108 (1%) Frame = +3 Query: 24 LGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCD--SIPEKNTVTYTTIILG 197 +GKQ H Y IRH I G LIDMY+K I+ A+++ + S +++ T+ +I G Sbjct: 441 IGKQAHAYLIRHGIQFEGMDGY-LIDMYAKSGLITTAQQLFEKNSXYDRDEATWNAMIAG 499 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKI 341 Y Q+GL + ++F+ M E + P+A+T +IL AC+ G + G +I Sbjct: 500 YTQNGLSEEGFAVFRKMIEQNVRPNAVTLASILPACNPMGTIGLGKQI 547 >ref|XP_002308772.1| predicted protein [Populus trichocarpa] gi|222854748|gb|EEE92295.1| predicted protein [Populus trichocarpa] Length = 320 Score = 315 bits (808), Expect = 4e-84 Identities = 145/231 (62%), Positives = 186/231 (80%) Frame = +3 Query: 6 PSGGISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTT 185 P G I LGKQ+HG +IR +D N+FV T+L+DMYSK SI+YAE V +P+KN+VTYTT Sbjct: 42 PVGNIDLGKQLHGVSIRLLLDKNIFVSTSLVDMYSKSGSINYAESVFTKLPDKNSVTYTT 101 Query: 186 
IILGYGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSY 365 +IL YGQHG+G+RALS+F +M + G+ PDAITF+A+LSACS+SGLVD+GL+IFESME + Sbjct: 102 MILAYGQHGMGERALSLFHSMKKSGIEPDAITFIAVLSACSHSGLVDEGLQIFESMEKDF 161 Query: 366 GILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHGEMELGRI 545 I P+T HYCCV DMLGR GRVVEAYEFVK LG+ N IWGSLLGACR+H +ELG + Sbjct: 162 KIQPSTPHYCCVTDMLGRVGRVVEAYEFVKQLGEAGNVLEIWGSLLGACRLHEHVELGEV 221 Query: 546 ISDRLFEVEEDNDVTGYHVLLSNIYADKEMWDSADRVRKNMREKGLRKETG 698 ++ +L E+E+ ++TGYHVLLSNIYA++ W + D+VR+ MREKGL+KE G Sbjct: 222 VAKKLLEMEKTGNITGYHVLLSNIYAEEGNWVNVDKVRREMREKGLQKEVG 272 >ref|XP_002883344.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297329184|gb|EFH59603.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 824 Score = 311 bits (798), Expect = 6e-83 Identities = 141/229 (61%), Positives = 182/229 (79%) Frame = +3 Query: 12 GGISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTTII 191 G + LGKQ+HG++IR +D NVFV +AL+DMYSK +I YAE + E+N+VTYTT+I Sbjct: 539 GSVDLGKQLHGFSIRQYLDQNVFVASALVDMYSKAGAIKYAENMFSQTKERNSVTYTTMI 598 Query: 192 LGYGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSYGI 371 LGYGQHG+G+RA+S+F +M E G+ PDAI FVA+LSACSYSGLVD+GLKIFE M Y I Sbjct: 599 LGYGQHGMGERAISLFLSMQELGIKPDAIAFVAVLSACSYSGLVDEGLKIFEDMREVYNI 658 Query: 372 LPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHGEMELGRIIS 551 P++EHYCC+ DMLGR GRV EAYEFVK LG+ N A +WGSLLG+CR+HGE+EL +S Sbjct: 659 QPSSEHYCCITDMLGRVGRVNEAYEFVKGLGEEGNIAELWGSLLGSCRLHGELELAETVS 718 Query: 552 DRLFEVEEDNDVTGYHVLLSNIYADKEMWDSADRVRKNMREKGLRKETG 698 +RL ++++ + +GY VLLSN+YA+++ W S DRVRK MREKGL+KE G Sbjct: 719 ERLAKLDKGKNFSGYEVLLSNMYAEEQNWKSVDRVRKGMREKGLKKEVG 767 Score = 68.6 bits (166), Expect = 1e-09 Identities = 44/138 (31%), Positives = 72/138 (52%), Gaps = 2/138 (1%) Frame = +3 Query: 24 LGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDS--IPEKNTVTYTTIILG 197 +GKQ HG+ IR I + + LIDMY+K I ++K+ + E++ T+ ++I G Sbjct: 441 
IGKQTHGFLIRQGIQFEG-MNSYLIDMYAKSGLIRISQKLFEGSGYAERDQATWNSMISG 499 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSYGILP 377 Y Q+G + +F+ M E + P+A+T +IL ACS G VD G ++ Y + Sbjct: 500 YTQNGHTEETFLVFRKMLEQNIRPNAVTVASILPACSQVGSVDLGKQLHGFSIRQY-LDQ 558 Query: 378 TTEHYCCVVDMLGRAGRV 431 +VDM +AG + Sbjct: 559 NVFVASALVDMYSKAGAI 576 Score = 67.8 bits (164), Expect = 2e-09 Identities = 46/194 (23%), Positives = 90/194 (46%), Gaps = 11/194 (5%) Frame = +3 Query: 18 ISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTTIILG 197 + LG+Q HG+ ++ + + + +L+ MYS+C + + V S+ E++ V++ T+I Sbjct: 338 VELGRQFHGFVSKNFRELPIVIINSLMVMYSRCGFVQKSFGVFHSMRERDVVSWNTMISA 397 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACS-----------YSGLVDKGLKIF 344 + Q+GL D L + M + G D IT A+LSA S + L+ +G++ F Sbjct: 398 FVQNGLDDEGLMLVYEMQKQGFKIDYITVTALLSAASNLRNKEIGKQTHGFLIRQGIQ-F 456 Query: 345 ESMESSYGILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHG 524 E M S ++DM ++G + + + + G W S++ +G Sbjct: 457 EGMNS------------YLIDMYAKSGLIRISQKLFEGSGYAERDQATWNSMISGYTQNG 504 Query: 525 EMELGRIISDRLFE 566 E ++ ++ E Sbjct: 505 HTEETFLVFRKMLE 518 >ref|XP_002520572.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223540232|gb|EEF41805.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 695 Score = 308 bits (790), Expect = 5e-82 Identities = 142/229 (62%), Positives = 185/229 (80%) Frame = +3 Query: 12 GGISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTTII 191 G I+LGKQ+HG +IR+S+D N+FV TAL+DMYSK +I+YAE V E+N+VTYTT+I Sbjct: 421 GSINLGKQLHGVSIRYSLDQNIFVRTALVDMYSKSGAINYAESVFTQSSERNSVTYTTMI 480 Query: 192 LGYGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSYGI 371 LGYGQHG+G+ ALS+F +M + G+ PDAITFVA+LSACSY+GLVD+GL+IFESM+ + I Sbjct: 481 LGYGQHGMGENALSLFHSMKKSGIQPDAITFVAVLSACSYAGLVDEGLRIFESMKRDFKI 540 Query: 372 LPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHGEMELGRIIS 551 P+T HYCCV DMLGR GRV+EAYEFVK LG+ + IWGSLLGACR+HG +ELG +S Sbjct: 541 
QPSTAHYCCVADMLGRVGRVIEAYEFVKQLGEEGHVIEIWGSLLGACRLHGHIELGEEVS 600 Query: 552 DRLFEVEEDNDVTGYHVLLSNIYADKEMWDSADRVRKNMREKGLRKETG 698 +RL E+ + + GY VLLSN+YA++ W++ D++RK+MREKGLRKE G Sbjct: 601 NRLLEMNSVDRLAGYQVLLSNMYAEEANWETVDKLRKSMREKGLRKEVG 649 Score = 75.1 bits (183), Expect = 1e-11 Identities = 45/138 (32%), Positives = 75/138 (54%), Gaps = 2/138 (1%) Frame = +3 Query: 24 LGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDS--IPEKNTVTYTTIILG 197 +GKQ H Y IRH I + + + LIDMY+K I +++V ++ I ++ T+ +I G Sbjct: 323 IGKQTHAYLIRHGIKFDG-MDSYLIDMYAKSGLIRISQRVFENNNIQNRDQATWNAVIAG 381 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACSYSGLVDKGLKIFESMESSYGILP 377 Y Q+GL ++A F+ M E + P+A+T +IL ACS G ++ G K + Y + Sbjct: 382 YTQNGLVEQAFITFRLMLEQNLRPNAVTLASILPACSSLGSINLG-KQLHGVSIRYSLDQ 440 Query: 378 TTEHYCCVVDMLGRAGRV 431 +VDM ++G + Sbjct: 441 NIFVRTALVDMYSKSGAI 458 Score = 68.6 bits (166), Expect = 1e-09 Identities = 44/183 (24%), Positives = 92/183 (50%), Gaps = 11/183 (6%) Frame = +3 Query: 18 ISLGKQIHGYAIRHSIDHNVFVGTALIDMYSKCASISYAEKVCDSIPEKNTVTYTTIILG 197 + LG+Q+H + +++ +V V A++ MYS+C S+ + +V + +PEK+ V++ T+I G Sbjct: 220 LGLGQQMHAFTMKNHTVLSVTVLNAILVMYSRCNSVQTSFEVFEKMPEKDVVSWNTMISG 279 Query: 198 YGQHGLGDRALSMFQTMSEFGMNPDAITFVAILSACS-----------YSGLVDKGLKIF 344 + Q+GL + L + M + G D++T ++LSA S ++ L+ G+K F Sbjct: 280 FIQNGLDEEGLMLVYEMQKQGFIADSVTVTSLLSAASNLRNREIGKQTHAYLIRHGIK-F 338 Query: 345 ESMESSYGILPTTEHYCCVVDMLGRAGRVVEAYEFVKALGDRANAAGIWGSLLGACRMHG 524 + M+S ++DM ++G + + + + W +++ +G Sbjct: 339 DGMDS------------YLIDMYAKSGLIRISQRVFENNNIQNRDQATWNAVIAGYTQNG 386 Query: 525 EME 533 +E Sbjct: 387 LVE 389
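The per-HSP statistics in the report above (bit score, E-value, identities, positives, gaps, frame) follow a fixed wording, so they can be recovered even when the line layout has been lost. The following is a minimal sketch, assuming the legacy BLAST text format shown here. The names `HSP_PATTERN`, `TOKEN_RE` and `parse_hsps` are illustrative and are not part of BLAST or any library.

```python
import re

# Statistics block of one HSP in a legacy BLAST text report.
# Whitespace is matched loosely so flattened or re-wrapped text still parses.
HSP_PATTERN = (
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]\d)"
)

# Either a subject header (">ref|...|") or an HSP statistics block.
TOKEN_RE = re.compile(r">(?P<subject>\S+)|" + HSP_PATTERN)


def parse_hsps(report):
    """Return one dict per HSP, tagged with the most recent subject ID."""
    hsps = []
    subject = None
    for m in TOKEN_RE.finditer(report):
        if m.group("subject"):
            subject = m.group("subject")
            continue
        evalue = m.group("evalue")
        if evalue.startswith("e"):  # old reports may print "e-100"
            evalue = "1" + evalue
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hsps.append({
            "subject": subject,
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": ident,
            "align_length": length,
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),  # the Gaps field is optional
            "frame": m.group("frame"),
            "pct_identity": round(100.0 * ident / length, 1),
        })
    return hsps
```

Calling `parse_hsps` on the full report text yields one record per HSP, which can then be filtered, for example by E-value or by subject ID. The computed `pct_identity` is rounded to one decimal place, so it can differ slightly from the whole-number percentage printed in the report.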