BLASTX nr result

ID: Cimicifuga21_contig00020815 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cimicifuga21_contig00020815
         (776 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containi...   313   3e-83
ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   302   5e-80
ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containi...   300   2e-79
dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]                294   2e-77
ref|NP_195903.2| pentatricopeptide repeat-containing protein [Ar...   294   2e-77

>ref|XP_002273255.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic [Vitis vinifera]
           gi|297741486|emb|CBI32618.3| unnamed protein product
           [Vitis vinifera]
          Length = 842

 Score =  313 bits (801), Expect = 3e-83
 Identities = 157/258 (60%), Positives = 196/258 (75%)
 Frame = +1

Query: 1   VSTLICDGRLDSVVQFFTKIHKLGIIPSALFNKSVIKLLALECRRLVKARKSKEVVDLME 180
           +S L+ +GR+  VV+   K+ KLGI P  LF+ S ++LL+ ECRR++   + +EVV+L+E
Sbjct: 109 ISGLLREGRVYCVVEVLRKVDKLGICPLELFDGSTLELLSKECRRILNCGQVEEVVELIE 168

Query: 181 TLAGFLFSVKEFVKPCEILKLCVEESDPETAVRYACMLPNAPIYFSSIILEFGKKGDLMS 360
            L GF F VK+ ++P + +K+CV + +P  AVRYAC+LP+A I F +II EFGKK DL S
Sbjct: 169 ILDGFHFPVKKLLEPLDFIKICVNKRNPNLAVRYACILPHAQILFCTIIHEFGKKRDLGS 228

Query: 361 SVIAYEACMCKSDGPNMYVCRTMIDVCGLCGDFLKSRYIYEALLAQNVTPNAYVFNSLMN 540
           ++ A+EA   K  GPNMY  RTMIDVCGLC  + KSRYIYE LLAQ +TPN YVFNSLMN
Sbjct: 229 ALTAFEASKQKLIGPNMYCYRTMIDVCGLCSHYQKSRYIYEELLAQKITPNIYVFNSLMN 288

Query: 541 VNAHDLSYALQVYKHMKNLGVTPDLASYNILLKACCCAGRVDLAQDTYEEVRHIALMGRL 720
           VN HDLSY   VYK+M+NLGVT D+ASYNILLKACC AGRVDLAQ+ Y EV+++   G L
Sbjct: 289 VNVHDLSYTFNVYKNMQNLGVTADMASYNILLKACCVAGRVDLAQEIYREVQNLESNGML 348

Query: 721 KLDVITYSTIIKVFADAK 774
           KLDV TYSTIIKVFADAK
Sbjct: 349 KLDVFTYSTIIKVFADAK 366


>ref|XP_004167767.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
           protein At5g02830, chloroplastic-like [Cucumis sativus]
          Length = 855

 Score =  302 bits (774), Expect = 5e-80
 Identities = 149/258 (57%), Positives = 195/258 (75%)
 Frame = +1

Query: 1   VSTLICDGRLDSVVQFFTKIHKLGIIPSALFNKSVIKLLALECRRLVKARKSKEVVDLME 180
           +S  + +G++ SVVQ   K+ +LGI    L ++  ++ L  +CRR+ K+ + +E+V+LME
Sbjct: 125 ISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRDCRRMAKSGELEELVELME 184

Query: 181 TLAGFLFSVKEFVKPCEILKLCVEESDPETAVRYACMLPNAPIYFSSIILEFGKKGDLMS 360
            L+GF FSV+E +KP E++KLCV+  +P+ A+RYA +LP+A I F + I EFGKK DL S
Sbjct: 185 VLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILFCTTINEFGKKRDLKS 244

Query: 361 SVIAYEACMCKSDGPNMYVCRTMIDVCGLCGDFLKSRYIYEALLAQNVTPNAYVFNSLMN 540
           + IAY       +G NMY+ RT+IDVCGLCGD+ KSR IY+ L+ QNVTPN +VFNSLMN
Sbjct: 245 AYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDLVNQNVTPNIFVFNSLMN 304

Query: 541 VNAHDLSYALQVYKHMKNLGVTPDLASYNILLKACCCAGRVDLAQDTYEEVRHIALMGRL 720
           VNAHDL+Y  Q+YK+M+NLGV  D+ASYNILLKACC AGRVDLAQD Y EV+H+   G L
Sbjct: 305 VNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVL 364

Query: 721 KLDVITYSTIIKVFADAK 774
           KLDV TYSTI+KVFADAK
Sbjct: 365 KLDVFTYSTIVKVFADAK 382


>ref|XP_004142106.1| PREDICTED: pentatricopeptide repeat-containing protein At5g02830,
           chloroplastic-like [Cucumis sativus]
          Length = 849

 Score =  300 bits (768), Expect = 2e-79
 Identities = 148/258 (57%), Positives = 194/258 (75%)
 Frame = +1

Query: 1   VSTLICDGRLDSVVQFFTKIHKLGIIPSALFNKSVIKLLALECRRLVKARKSKEVVDLME 180
           +S  + +G++ SVVQ   K+ +LGI    L ++  ++ L  +CRR+ K+ + +E+V+LME
Sbjct: 125 ISRCLREGKVWSVVQVLRKVEELGISVLELCDEPAVESLRRDCRRMAKSGELEELVELME 184

Query: 181 TLAGFLFSVKEFVKPCEILKLCVEESDPETAVRYACMLPNAPIYFSSIILEFGKKGDLMS 360
            L+GF FSV+E +KP E++KLCV+  +P+ A+RYA +LP+A I F + I EFGKK DL S
Sbjct: 185 VLSGFGFSVREMMKPSEVIKLCVDYRNPKMAIRYASILPHADILFCTTINEFGKKRDLKS 244

Query: 361 SVIAYEACMCKSDGPNMYVCRTMIDVCGLCGDFLKSRYIYEALLAQNVTPNAYVFNSLMN 540
           + IAY       +G NMY+ RT+IDVCGLCGD+ KSR IY+ L+ QNV PN +VFNSLMN
Sbjct: 245 AYIAYTESKANMNGSNMYIYRTIIDVCGLCGDYKKSRNIYQDLVNQNVIPNIFVFNSLMN 304

Query: 541 VNAHDLSYALQVYKHMKNLGVTPDLASYNILLKACCCAGRVDLAQDTYEEVRHIALMGRL 720
           VNAHDL+Y  Q+YK+M+NLGV  D+ASYNILLKACC AGRVDLAQD Y EV+H+   G L
Sbjct: 305 VNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVL 364

Query: 721 KLDVITYSTIIKVFADAK 774
           KLDV TYSTI+KVFADAK
Sbjct: 365 KLDVFTYSTIVKVFADAK 382


>dbj|BAC42187.2| unknown protein [Arabidopsis thaliana]
          Length = 852

 Score =  294 bits (752), Expect = 2e-77
 Identities = 143/258 (55%), Positives = 187/258 (72%)
 Frame = +1

Query: 1   VSTLICDGRLDSVVQFFTKIHKLGIIPSALFNKSVIKLLALECRRLVKARKSKEVVDLME 180
           +S+ +  G+++SVV    +I K+GI P  L + S +KL+  + R +  + + ++ +DLME
Sbjct: 131 ISSNLRQGKIESVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEKAIDLME 190

Query: 181 TLAGFLFSVKEFVKPCEILKLCVEESDPETAVRYACMLPNAPIYFSSIILEFGKKGDLMS 360
            LAG  F +KE V P +++K CVE S+P+ A+RYAC+LP+  +    II  FGKKGD++S
Sbjct: 191 ILAGLGFKIKELVDPFDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVS 250

Query: 361 SVIAYEACMCKSDGPNMYVCRTMIDVCGLCGDFLKSRYIYEALLAQNVTPNAYVFNSLMN 540
            + AYEAC    D PNMY+CRTMIDVCGLCGD++KSRYIYE LL +N+ PN YV NSLMN
Sbjct: 251 VMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMN 310

Query: 541 VNAHDLSYALQVYKHMKNLGVTPDLASYNILLKACCCAGRVDLAQDTYEEVRHIALMGRL 720
           VN+HDL Y L+VYK+M+ L VT D+ SYNILLK CC AGRVDLAQD Y+E + +   G L
Sbjct: 311 VNSHDLGYTLKVYKNMQILDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLL 370

Query: 721 KLDVITYSTIIKVFADAK 774
           KLD  TY TIIKVFADAK
Sbjct: 371 KLDAFTYCTIIKVFADAK 388


>ref|NP_195903.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|332278227|sp|Q8GYL7.3|PP361_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g02830, chloroplastic; Flags: Precursor
           gi|332003140|gb|AED90523.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 852

 Score =  294 bits (752), Expect = 2e-77
 Identities = 143/258 (55%), Positives = 187/258 (72%)
 Frame = +1

Query: 1   VSTLICDGRLDSVVQFFTKIHKLGIIPSALFNKSVIKLLALECRRLVKARKSKEVVDLME 180
           +S+ +  G+++SVV    +I K+GI P  L + S +KL+  + R +  + + ++ +DLME
Sbjct: 131 ISSNLRQGKIESVVYTLKRIEKVGIAPLDLVDDSSVKLMRKQFRAMANSVQVEKAIDLME 190

Query: 181 TLAGFLFSVKEFVKPCEILKLCVEESDPETAVRYACMLPNAPIYFSSIILEFGKKGDLMS 360
            LAG  F +KE V P +++K CVE S+P+ A+RYAC+LP+  +    II  FGKKGD++S
Sbjct: 191 ILAGLGFKIKELVDPFDVVKSCVEISNPQLAIRYACLLPHTELLLCRIIHGFGKKGDMVS 250

Query: 361 SVIAYEACMCKSDGPNMYVCRTMIDVCGLCGDFLKSRYIYEALLAQNVTPNAYVFNSLMN 540
            + AYEAC    D PNMY+CRTMIDVCGLCGD++KSRYIYE LL +N+ PN YV NSLMN
Sbjct: 251 VMTAYEACKQILDTPNMYICRTMIDVCGLCGDYVKSRYIYEDLLKENIKPNIYVINSLMN 310

Query: 541 VNAHDLSYALQVYKHMKNLGVTPDLASYNILLKACCCAGRVDLAQDTYEEVRHIALMGRL 720
           VN+HDL Y L+VYK+M+ L VT D+ SYNILLK CC AGRVDLAQD Y+E + +   G L
Sbjct: 311 VNSHDLGYTLKVYKNMQILDVTADMTSYNILLKTCCLAGRVDLAQDIYKEAKRMESSGLL 370

Query: 721 KLDVITYSTIIKVFADAK 774
           KLD  TY TIIKVFADAK
Sbjct: 371 KLDAFTYCTIIKVFADAK 388


Top