BLASTX nr result
ID: Cimicifuga21_contig00021384
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00021384
         (1442 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   614   e-173
ref|XP_002326162.1| predicted protein [Populus trichocarpa] gi|2...   555   e-155
ref|XP_002525196.1| pentatricopeptide repeat-containing protein,...   540   e-151
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                529   e-148
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   527   e-147

>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830, chloroplastic [Vitis vinifera]
 gi|297741486|emb|CBI32618.3| unnamed protein product [Vitis vinifera]
          Length = 842

 Score = 614 bits (1583), Expect = e-173
 Identities = 313/488 (64%), Positives = 380/488 (77%), Gaps = 16/488 (3%)
 Frame = -3

Query: 1440 VFADAKMWHMALKIKEDMLLAGVTPNVVTWSSLISACANAGLVEQAIQVFEEMLLAGCEP 1261
            VFADAK+W MALKIKEDML AGV PN VTWS+LIS+CANAG+ EQAIQ+F+EMLLAGCEP
Sbjct: 361  VFADAKLWQMALKIKEDMLSAGVIPNTVTWSALISSCANAGITEQAIQLFKEMLLAGCEP 420

Query: 1260 NSQCYNSLLYACVEACQYDRAFRIFKSWRESGFQK-SCHSRNCNASYQVHIGTEKSSEKC 1084
            NSQCYN LL+ACVEACQYDRAFR+F+SW++S FQ+ S + N N +G E + C
Sbjct: 421  NSQCYNILLHACVEACQYDRAFRLFQSWKDSRFQEISGGTGNGNT-----VGVELKHQNC 475

Query: 1083 NTNSQSCYSGPYYLSFFKAVPFTPTIATFNIMMKACGTDYYRAKALMDEIKILGLSPNHI 904
            T+ +C S ++LSF K+ PFTPT T+NI+MKACGTDYYRAKALMDE+K GLSPNHI
Sbjct: 476  ITSMPNCLSNSHHLSFSKSFPFTPTTTTYNILMKACGTDYYRAKALMDEMKTAGLSPNHI 535

Query: 903 SWSILIDICGSTGNVGGSVQALKAMRDSGINPDIIAYTTTIKACVRNKSLKLAFSLFEEL 724
           SWSILIDICG TGN+ G+V+ LK MR++GI PD++AYTT IK CV +K+LK+AFSLF E+
Sbjct: 536 SWSILIDICGGTGNIVGAVRILKTMREAGIKPDVVAYTTAIKYCVESKNLKIAFSLFAEM 595

Query: 723 KRYRLKPNLVTYNTLLRARRRYGSLREVQQCLAIYQDMRKAGYSPNDYFLKELIEEWCEG 544
           KRY+++PNLVTYNTLLRAR RYGSL EVQQCLAIYQ MRKAGY NDY+LKELIEEWCEG
Sbjct: 596 KRYQIQPNLVTYNTLLRARSRYGSLHEVQQCLAIYQHMRKAGYKSNDYYLKELIEEWCEG 655

Query: 543 VIKDNNRDAGIVGRSDSHNKTHRAAPQSLLLEKVAAHLQRDISQSLVVDLRGLTKVEARI 364
           VI+DNN + + S N+ PQSLLLEKVAAHLQ+ +++SL +DL+GLT+VEARI
Sbjct: 656 VIQDNNLNQ---SKFSSVNRADWGRPQSLLLEKVAAHLQKSVAESLAIDLQGLTQVEARI 712

Query: 363 VVLAVLRMIKEKYTKGDVLEDDIVIIIGVGKEMAGAGNHKYEVQDAVVKLLRDELGLSVI 184
           VVLAVLRMIKE Y G ++DDI+II+G+ K A H+ V+ A++KLL+DELGL V
Sbjct: 713 VVLAVLRMIKENYILGHPIKDDILIILGIKKVDANLVEHESPVKGAIIKLLQDELGLEVA 772

Query: 183 LAGPRISLD----VGNPLDTDPVDVES-----------SSARRPAVLRRVKVTKKSLYHW 49
           AGP+I+LD +G P +DP E+ SS RRPAVL+R KVT+KSL HW
Sbjct: 773 FAGPKIALDKRINLGGPPGSDPDWQEALGRNRLPTELESSTRRPAVLQRFKVTRKSLDHW 832

Query: 48  LQKRVGMT 25
           LQ+RVG T
Sbjct: 833 LQRRVGAT 840

 Score = 68.6 bits (166), Expect = 4e-09
 Identities = 57/228 (25%), Positives = 96/228 (42%), Gaps = 5/228 (2%)
 Frame = -3

Query: 1374 VTPNVVTWSSLISACANAGLVEQAIQVFEEMLLAGCEPNSQCYNSLLYACVEACQYDRAF 1195
            + PN+ + ++I C +++ ++EE+L PN +NSL+ V Y F
Sbjct: 241  IGPNMYCYRTMIDVCGLCSHYQKSRYIYEELLAQKITPNIYVFNSLMNVNVHDLSY--TF 298

Query: 1194 RIFKSWRESGFQKSCHSRN-----CNASYQVHIGTEKSSEKCNTNSQSCYSGPYYLSFFK 1030
            ++K+ + G S N C + +V + E E N S +G L F
Sbjct: 299  NVYKNMQNLGVTADMASYNILLKACCVAGRVDLAQEIYREVQNLES----NGMLKLDVF- 353

Query: 1029 AVPFTPTIATFNIMMKACGTDYYRAKALMDEIKILGLSPNHISWSILIDICGSTGNVGGS 850
            T +T I + A + A + +++ G+ PN ++WS LI C + G +
Sbjct: 354  ------TYSTI-IKVFADAKLWQMALKIKEDMLSAGVIPNTVTWSALISSCANAGITEQA 406

Query: 849 VQALKAMRDSGINPDIIAYTTTIKACVRNKSLKLAFSLFEELKRYRLK 706
           +Q K M +G P+ Y + ACV AF LF+ K R +
Sbjct: 407 IQLFKEMLLAGCEPNSQCYNILLHACVEACQYDRAFRLFQSWKDSRFQ 454

>ref|XP_002326162.1| predicted protein [Populus trichocarpa]
 gi|222833355|gb|EEE71832.1| predicted protein [Populus trichocarpa]
          Length = 828

 Score = 555 bits (1429), Expect = e-155
 Identities = 286/474 (60%), Positives = 360/474 (75%)
 Frame = -3

Query: 1440 VFADAKMWHMALKIKEDMLLAGVTPNVVTWSSLISACANAGLVEQAIQVFEEMLLAGCEP 1261
            +FADAKMW MALKIKEDML +GVTPN+ WSSLISACANAGLVEQAIQ+FEEMLL+GC+P
Sbjct: 373  IFADAKMWQMALKIKEDMLSSGVTPNMHIWSSLISACANAGLVEQAIQLFEEMLLSGCKP 432

Query: 1260 NSQCYNSLLYACVEACQYDRAFRIFKSWRESGFQKSCHSRNCNASYQVHIGTEKSSEKCN 1081
            NSQC N LL+ACV+ACQYDRAFR+F+ W+ S Q+ H + + ++ E + + C
Sbjct: 433  NSQCCNILLHACVQACQYDRAFRLFQCWKGSEAQEVFHGDHSGNADEI----EHAQKHC- 487

Query: 1080 TNSQSCYSGPYYLSFFKAVPFTPTIATFNIMMKACGTDYYRAKALMDEIKILGLSPNHIS 901
            N + ++L+F K PFTPT AT++++MKACG+DY+RAKALMDE+K +G+SPNHIS
Sbjct: 488  PNMTTIVPNSHHLNFIKKFPFTPTPATYHMLMKACGSDYHRAKALMDEMKTVGISPNHIS 547

Query: 900 WSILIDICGSTGNVGGSVQALKAMRDSGINPDIIAYTTTIKACVRNKSLKLAFSLFEELK 721
           WSILIDICG +GNV G+VQ LK MR +G+ PD++AYTT IK CV K+LKLAFSLF E+K
Sbjct: 548 WSILIDICGVSGNVSGAVQILKNMRMAGVEPDVVAYTTAIKVCVETKNLKLAFSLFAEMK 607

Query: 720 RYRLKPNLVTYNTLLRARRRYGSLREVQQCLAIYQDMRKAGYSPNDYFLKELIEEWCEGV 541
           R ++ PNLVTYNTLLRAR RYGSLREVQQCLAIYQDMRKAGY NDY+LK+LIEEWCEGV
Sbjct: 608 RCQINPNLVTYNTLLRARTRYGSLREVQQCLAIYQDMRKAGYKSNDYYLKQLIEEWCEGV 667

Query: 540 IKDNNRDAGIVGRSDSHNKTHRAAPQSLLLEKVAAHLQRDISQSLVVDLRGLTKVEARIV 361
           I+DNN+ I G S +T P+SLLLEKVAAHLQ +IS++L +DL+GLTKVEARIV
Sbjct: 668 IQDNNQ---IQGGFASCKRTDLGRPRSLLLEKVAAHLQNNISENLAIDLQGLTKVEARIV 724

Query: 360 VLAVLRMIKEKYTKGDVLEDDIVIIIGVGKEMAGAGNHKYEVQDAVVKLLRDELGLSVIL 181
           VLAVLRMIKE YT G +++D+ I + V K + A EV++A+++LLR+ELGL V++
Sbjct: 725 VLAVLRMIKENYTLGYSVKEDMWITLDVSK-VDPASKRDSEVKNAIIELLRNELGLEVLV 783

Query: 180 AGPRISLDVGNPLDTDPVDVESSSARRPAVLRRVKVTKKSLYHWLQKRVGMTTR 19
           A P D + +S S+ P V +R+KV +KSL+ WLQ+R G R
Sbjct: 784 AVPG---------HLDDIKTDSKSSLDPVVTQRLKVRRKSLHEWLQRRAGAIRR 828

>ref|XP_002525196.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223535493|gb|EEF37162.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 786

 Score = 540 bits (1391), Expect = e-151
 Identities = 283/489 (57%), Positives = 355/489 (72%), Gaps = 15/489 (3%)
 Frame = -3

Query: 1440 VFADAKMWHMALKIKEDMLLAGVTPNVVTWSSLISACANAGLVEQAIQVFEEMLLAGCEP 1261
            +FADAK+W +ALKIKEDML +GVTPN TWSSLISA ANAGLV+QAI++FEEMLLAGC P
Sbjct: 305  IFADAKLWQLALKIKEDMLSSGVTPNTFTWSSLISASANAGLVDQAIKLFEEMLLAGCVP 364

Query: 1260 NSQCYNSLLYACVEACQYDRAFRIFKSWRESGFQKSCHSRNCNASYQVHIGTEKSSEKCN 1081
            NS C N LL+ACVEACQYDRAFR+F +W+ S Q + + + N + E
Sbjct: 365  NSHCCNILLHACVEACQYDRAFRLFNAWKGSEIQNT-FTTDYNCPVDDISSAMHACEDYI 423

Query: 1080 TNSQSCYSGPYYLSFFKAVPFTPTIATFNIMMKACGTDYYRAKALMDEIKILGLSPNHIS 901
            + S +LSF K PFTP+ AT+N +MKACG+DY RAKALMDE++ +GLSPNHIS
Sbjct: 424  ITVPNLASNSLHLSFLKKFPFTPSSATYNTLMKACGSDYNRAKALMDEMQAVGLSPNHIS 483

Query: 900 WSILIDICGSTGNVGGSVQALKAMRDSGINPDIIAYTTTIKACVRNKSLKLAFSLFEELK 721
           WSILIDICGS+GN+ G++Q LK MR +GI PD+IAYTT IK V +K+LK+AFSLF E+K
Sbjct: 484 WSILIDICGSSGNMEGAIQILKNMRMAGIEPDVIAYTTAIKVSVESKNLKMAFSLFAEMK 543

Query: 720 RYRLKPNLVTYNTLLRARRRYGSLREVQQCLAIYQDMRKAGYSPNDYFLKELIEEWCEGV 541
           RY+LKPNLVTY+TLLRAR RYGSL+EVQQCLAIYQDMRKAGY ND +LK+LIEEWCEGV
Sbjct: 544 RYQLKPNLVTYDTLLRARTRYGSLKEVQQCLAIYQDMRKAGYKSNDNYLKQLIEEWCEGV 603

Query: 540 IKDNNRDAGIVGRSDSHNKTHRA---APQSLLLEKVAAHLQRDISQSLVVDLRGLTKVEA 370
           I+DN++ D RA P SLLLEKVAAHL ++++SL VDL+GLTKVEA
Sbjct: 604 IQDNDQ------CQDDFKPCKRAEFGRPHSLLLEKVAAHLHHNVAESLSVDLQGLTKVEA 657

Query: 369 RIVVLAVLRMIKEKYTKGDVLEDDIVIIIGVGKEMAGAGNHKYEVQDAVVKLLRDELGLS 190
           RIVVLAVLRM+KE Y +G +++DD+ I +G+ K K EV+DA+ KLL +ELGL
Sbjct: 658 RIVVLAVLRMVKENYIQGHLVKDDMSITLGIDKVDVLPATQKAEVKDAIFKLLHNELGLE 717

Query: 189 VILAGPRISLDVGNPLD------------TDPVDVESSSARRPAVLRRVKVTKKSLYHWL 46
           V++ PR + D+ L+ + ++ SSARRP VL+R+KVT+ SL+ WL
Sbjct: 718 VLIVVPRYTADLETDLEIPLNSYQNWSKSSGRENIRVSSARRPLVLQRLKVTRNSLHSWL 777

Query: 45  QKRVGMTTR 19
           Q++ G R
Sbjct: 778 QRKAGALRR 786

>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score = 529 bits (1362), Expect = e-148
 Identities = 276/477 (57%), Positives = 339/477 (71%), Gaps = 9/477 (1%)
 Frame = -3

Query: 1440 VFADAKMWHMALKIKEDMLLAGVTPNVVTWSSLISACANAGLVEQAIQVFEEMLLAGCEP 1261
            VFADAKMW ALK+K+DM GVTPN TWSSLISACANAGLVEQA +FEEML +GCEP
Sbjct: 383  VFADAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEP 442

Query: 1260 NSQCYNSLLYACVEACQYDRAFRIFKSWRESGFQKSCHSRNCNASYQVHIGTEKSSEKCN 1081
            NSQC+N LL+ACVEACQYDRAFR+F+SW+ S +S ++ + V G S
Sbjct: 443  NSQCFNILLHACVEACQYDRAFRLFQSWKGSSVNESLYADDI-----VSKGRTSSPNILK 497

Query: 1080 TNSQSCY----SGPYYLSFFKAVPFTPTIATFNIMMKACGTDYYRAKALMDEIKILGLSP 913
            N S Y+ K F PT AT+NI++KACGTDYYR K LMDE+K LGLSP
Sbjct: 498  NNGPGSLVNRNSNSPYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSP 557

Query: 912 NHISWSILIDICGSTGNVGGSVQALKAMRDSGINPDIIAYTTTIKACVRNKSLKLAFSLF 733
           N I+WS LID+CG +G+V G+V+ L+ M +G PD++AYTT IK C NK LKLAFSLF
Sbjct: 558 NQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLF 617

Query: 732 EELKRYRLKPNLVTYNTLLRARRRYGSLREVQQCLAIYQDMRKAGYSPNDYFLKELIEEW 553
           EE++RY++KPN VTYNTLL+AR +YGSL EV+QCLAIYQDMR AGY PND+FLKELIEEW
Sbjct: 618 EEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEW 677

Query: 552 CEGVIKDNNRDAGIVGRSDSHNKTHRAAPQSLLLEKVAAHLQRDISQSLVVDLRGLTKVE 373
           CEGVI++N R + + N P SLL+EKVA H+Q + +L +DL+GLTK+E
Sbjct: 678 CEGVIQENGRSQDKISDQEGDN---AGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIE 734

Query: 372 ARIVVLAVLRMIKEKYTKGDVLEDDIVIIIGVGKEMAGAGNHKYEVQDAVVKLLRDELGL 193
           AR+VVLAVLRMIKE Y +GDV+ DD++IIIG + +G + VQ+A+VKLLRDEL L
Sbjct: 735 ARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSL 794

Query: 192 SVILAGPR-ISLDVGNPLDTDPVDVES----SSARRPAVLRRVKVTKKSLYHWLQKR 37
           V+ AG R I D D D + +S SS RRPA+L R+ VTK SLY WLQ+R
Sbjct: 795 VVLPAGQRNIIQDAHCVDDADQENTKSFVSISSTRRPAILERLMVTKASLYQWLQRR 851

>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g02830, chloroplastic; Flags: Precursor
 gi|332003140|gb|AED90523.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score = 527 bits (1358), Expect = e-147
 Identities = 275/477 (57%), Positives = 339/477 (71%), Gaps = 9/477 (1%)
 Frame = -3

Query: 1440 VFADAKMWHMALKIKEDMLLAGVTPNVVTWSSLISACANAGLVEQAIQVFEEMLLAGCEP 1261
            VFADAKMW ALK+K+DM GVTPN TWSSLISACANAGLVEQA +FEEML +GCEP
Sbjct: 383  VFADAKMWKWALKVKDDMKSVGVTPNTHTWSSLISACANAGLVEQANHLFEEMLASGCEP 442

Query: 1260 NSQCYNSLLYACVEACQYDRAFRIFKSWRESGFQKSCHSRNCNASYQVHIGTEKSSEKCN 1081
            NSQC+N LL+ACVEACQYDRAFR+F+SW+ S +S ++ + V G S
Sbjct: 443  NSQCFNILLHACVEACQYDRAFRLFQSWKGSSVNESLYADDI-----VSKGRTSSPNILK 497

Query: 1080 TNSQSCY----SGPYYLSFFKAVPFTPTIATFNIMMKACGTDYYRAKALMDEIKILGLSP 913
            N S Y+ K F PT AT+NI++KACGTDYYR K LMDE+K LGLSP
Sbjct: 498  NNGPGSLVNRNSNSPYIQASKRFCFKPTTATYNILLKACGTDYYRGKELMDEMKSLGLSP 557

Query: 912 NHISWSILIDICGSTGNVGGSVQALKAMRDSGINPDIIAYTTTIKACVRNKSLKLAFSLF 733
           N I+WS LID+CG +G+V G+V+ L+ M +G PD++AYTT IK C NK LKLAFSLF
Sbjct: 558 NQITWSTLIDMCGGSGDVEGAVRILRTMHSAGTRPDVVAYTTAIKICAENKCLKLAFSLF 617

Query: 732 EELKRYRLKPNLVTYNTLLRARRRYGSLREVQQCLAIYQDMRKAGYSPNDYFLKELIEEW 553
           EE++RY++KPN VTYNTLL+AR +YGSL EV+QCLAIYQDMR AGY PND+FLKELIEEW
Sbjct: 618 EEMRRYQIKPNWVTYNTLLKARSKYGSLLEVRQCLAIYQDMRNAGYKPNDHFLKELIEEW 677

Query: 552 CEGVIKDNNRDAGIVGRSDSHNKTHRAAPQSLLLEKVAAHLQRDISQSLVVDLRGLTKVE 373
           CEGVI++N + + + N P SLL+EKVA H+Q + +L +DL+GLTK+E
Sbjct: 678 CEGVIQENGQSQDKISDQEGDN---AGRPVSLLIEKVATHMQERTAGNLAIDLQGLTKIE 734

Query: 372 ARIVVLAVLRMIKEKYTKGDVLEDDIVIIIGVGKEMAGAGNHKYEVQDAVVKLLRDELGL 193
           AR+VVLAVLRMIKE Y +GDV+ DD++IIIG + +G + VQ+A+VKLLRDEL L
Sbjct: 735 ARLVVLAVLRMIKEDYMRGDVVIDDVLIIIGTDEANTVSGKQEITVQEALVKLLRDELSL 794

Query: 192 SVILAGPR-ISLDVGNPLDTDPVDVES----SSARRPAVLRRVKVTKKSLYHWLQKR 37
           V+ AG R I D D D + +S SS RRPA+L R+ VTK SLY WLQ+R
Sbjct: 795 VVLPAGQRNIIQDAHCVDDADQENTKSFVSISSTRRPAILERLMVTKASLYQWLQRR 851
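
Each alignment in this report carries a statistics line giving Identities, Positives and Gaps as a fraction plus an integer percentage. The sketch below is one possible way to pull those fractions out and check the arithmetic, assuming the line layout seen in this result. The names `FRACTION` and `parse_fractions` are hypothetical helpers for illustration and are not part of BLAST. In this report the printed percentages are consistent with truncation rather than rounding (380/488 is about 77.9%, printed as 77%).

```python
import re

# Statistics line copied from the first alignment of this report.
STATS = ("Score = 614 bits (1583), Expect = e-173 "
         "Identities = 313/488 (64%), Positives = 380/488 (77%), "
         "Gaps = 16/488 (3%) Frame = -3")

# Hypothetical pattern (an assumption, not a BLAST API):
# matches "Name = count/length (percent%)".
FRACTION = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_fractions(line):
    """Return {name: (count, aligned_length, reported_percent)}."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in FRACTION.finditer(line)
    }


if __name__ == "__main__":
    for name, (count, length, reported) in parse_fractions(STATS).items():
        # Integer division reproduces the truncated percentage.
        truncated = count * 100 // length
        print(name, count, length, reported, truncated == reported)
```

The same function can be applied to the statistics line of any other alignment in the report; a line without a Gaps field simply yields no "Gaps" key.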