BLASTX nr result
ID: Cephaelis21_contig00013448
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00013448
         (887 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002309408.1| predicted protein [Populus trichocarpa] gi|2...   211   1e-77
ref|XP_004143391.1| PREDICTED: putative pentatricopeptide repeat...   208   6e-74
ref|XP_003521411.1| PREDICTED: putative pentatricopeptide repeat...   204   8e-74
ref|XP_002870540.1| pentatricopeptide repeat-containing protein ...   192   1e-69
ref|NP_198573.1| pentatricopeptide repeat-containing protein [Ar...   193   4e-67

>ref|XP_002309408.1| predicted protein [Populus trichocarpa]
 gi|222855384|gb|EEE92931.1| predicted protein [Populus trichocarpa]
          Length = 547

 Score = 211 bits (538), Expect(2) = 1e-77
 Identities = 99/168 (58%), Positives = 134/168 (79%)
 Frame = -3

Query: 504 QRYLHSWNVMIKGFVKLGDLDSARRLFNEMPERDAFSYTAMIDGYAKAGDMASARMLFEQ 325
           +R L SWN MI G  K GDL  AR++F+EM ER+  S+T MIDGYAK GDMASAR LF++
Sbjct: 199 ERNLPSWNAMISGLGKAGDLSGARKVFDEMVERNVVSFTVMIDGYAKVGDMASARALFDE 258

Query: 324 SEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELA 145
           + +K VV WSALISGY++N Q  EA+++F++M S++++PD++ MVSLMSACSQLG+ +LA
Sbjct: 259 APEKDVVAWSALISGYSRNEQPNEAVKIFFEMVSMNVKPDEFIMVSLMSACSQLGNSDLA 318

Query: 144 KLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTLFEQLPTRDV 1
           K +DSYL   S+D  Q HV AAL+DM+AKCGNME+A+ LF+ +P+RD+
Sbjct: 319 KWVDSYLSQTSIDTRQAHVLAALIDMHAKCGNMEKAVKLFQDMPSRDL 366

 Score = 105 bits (262), Expect(2) = 1e-77
 Identities = 48/94 (51%), Positives = 67/94 (71%)
 Frame = -1

Query: 782 FSVMLLINNCANALVLREGQVLHGFVVRYGIESDVFVGCSLINLYGKCREVECARKVFDA 603
           ++  LLI  C+N L L+EG+++HG  +R G+  DV+VG SLI+ YGKC+E+  ARKVFD
Sbjct: 106 YTYPLLIKVCSNELRLKEGEIVHGSAIRCGVSDDVYVGSSLISFYGKCKEILSARKVFDE 165

Query: 602 MPIRNEVSWTAMIFVYMIFGDALEANRLFDKMPK 501
           +P RN VSWTAM+  Y   GD   A R+F++MP+
Sbjct: 166 IPERNVVSWTAMVAGYASVGDLENAKRVFERMPE 199

 Score = 64.3 bits (155), Expect(2) = 4e-12
 Identities = 43/157 (27%), Positives = 81/157 (51%), Gaps = 6/157 (3%)
 Frame = -3

Query: 459 KLGDLDSARRLFNEMPE-----RDAFSYTAMIDGYAKAGDMASARMLFEQSEQKGVVLWS 295
           +LG+ D A+ + + + +     R A    A+ID +AK G+M  A  LF+    + ++
Sbjct: 311 QLGNSDLAKWVDSYLSQTSIDTRQAHVLAALIDMHAKCGNMEKAVKLFQDMPSRDLIPCC 370

Query: 294 ALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELA-KLIDSYLHS 118
           +LI G + +G+  EA+++F +M    + PD  A   +++ACS+ G +E      D+  +
Sbjct: 371 SLIQGLSIHGRGVEAVELFNRMLDEGLIPDTVAFTVILTACSRGGLIEDGWHFFDTMKNK 430

Query: 117 GSVDVCQGHVAAALVDMNAKCGNMERALTLFEQLPTR 7
            SV     H  A +VD+ ++ G +  A  L + +P +
Sbjct: 431 YSVVPSPNHY-ACMVDLLSRAGQLRAAYDLLKSMPLK 466

 Score = 33.5 bits (75), Expect(2) = 4e-12
 Identities = 19/62 (30%), Positives = 34/62 (54%)
 Frame = -1

Query: 692 IESDVFVGCSLINLYGKCREVECARKVFDAMPIRNEVSWTAMIFVYMIFGDALEANRLFD 513
           +E +V     +I+ Y K  ++  AR +FD  P ++ V+W+A+I  Y       EA ++F
Sbjct: 229 VERNVVSFTVMIDGYAKVGDMASARALFDEAPEKDVVAWSALISGYSRNEQPNEAVKIFF 288

Query: 512 KM 507
           +M
Sbjct: 289 EM 290

 Score = 71.6 bits (174), Expect = 2e-10
 Identities = 38/110 (34%), Positives = 67/110 (60%), Gaps = 5/110 (4%)
 Frame = -3

Query: 477 MIKGFVKLGDLDSARRLFNEMPERDAFSYTAMIDGYAKAGDMASARMLFEQSEQKGVVLW 298
           +I  + K  ++ SAR++F+E+PER+  S+TAM+ GYA  GD+ +A+ +FE+  ++ +  W
Sbjct: 146 LISFYGKCKEILSARKVFDEIPERNVVSWTAMVAGYASVGDLENAKRVFERMPERNLPSW 205

Query: 297 SALISGYAQNGQSKEAIQMFYKM-----QSLSIQPDKYAMVSLMSACSQL 163
           +A+ISG  + G    A ++F +M      S ++  D YA V  M++   L
Sbjct: 206 NAMISGLGKAGDLSGARKVFDEMVERNVVSFTVMIDGYAKVGDMASARAL 255

 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 38/135 (28%), Positives = 69/135 (51%)
 Frame = -3

Query: 405 DAFSYTAMIDGYAKAGDMASARMLFEQSEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQ 226
           D +  +++I  Y K  ++ SAR +F++  ++ VV W+A+++GYA  G  + A ++F +M
Sbjct: 139 DVYVGSSLISFYGKCKEILSARKVFDEIPERNVVSWTAMVAGYASVGDLENAKRVFERMP 198

Query: 225 SLSIQPDKYAMVSLMSACSQLGSLELAKLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNM 46
             ++ P   AM+S +     L      K+ D  +    V          ++D  AK G+M
Sbjct: 199 ERNL-PSWNAMISGLGKAGDLSGAR--KVFDEMVERNVVSF------TVMIDGYAKVGDM 249

Query: 45  ERALTLFEQLPTRDV 1
             A  LF++ P +DV
Sbjct: 250 ASARALFDEAPEKDV 264

>ref|XP_004143391.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g37570-like [Cucumis sativus]
 gi|449519310|ref|XP_004166678.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g37570-like [Cucumis sativus]
          Length = 565

 Score = 208 bits (529), Expect(2) = 6e-74
 Identities = 96/168 (57%), Positives = 132/168 (78%)
 Frame = -3

Query: 504 QRYLHSWNVMIKGFVKLGDLDSARRLFNEMPERDAFSYTAMIDGYAKAGDMASARMLFEQ 325
           +R + SWN +I G++K+GD+ SA + F+EMPE++  S+T MIDGYAKAGDM SAR LF++
Sbjct: 221 ERNVASWNAIIGGYMKMGDVKSAEKAFDEMPEKNVVSFTTMIDGYAKAGDMLSARNLFQK 280

Query: 324 SEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELA 145
           + ++ ++ WSALISGY QNGQ  EA++ F +M S +++PDK+ + SLM ACSQLG+L+LA
Sbjct: 281 APERDIIAWSALISGYTQNGQPNEAVKTFLEMSSRNVKPDKFVLTSLMLACSQLGNLDLA 340

Query: 144 KLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTLFEQLPTRDV 1
           K +DSY    SVD+   HV AAL+DMNAKCGNMERA+ LFE++P RD+
Sbjct: 341 KWVDSYATRCSVDLRGAHVTAALIDMNAKCGNMERAMYLFEKMPKRDL 388

 Score = 96.7 bits (239), Expect(2) = 6e-74
 Identities = 46/89 (51%), Positives = 61/89 (68%)
 Frame = -1

Query: 767 LINNCANALVLREGQVLHGFVVRYGIESDVFVGCSLINLYGKCREVECARKVFDAMPIRN 588
           L+  CA+   + EG  LHG ++R G++ D++V  SL+NLYGK   ++CARKVFD M  RN
Sbjct: 133 LLKVCASEGKMMEGMALHGSILRCGVDEDIYVTTSLVNLYGKGGLIDCARKVFDGMSERN 192

Query: 587 EVSWTAMIFVYMIFGDALEANRLFDKMPK 501
            VSWTAMI  Y   G+ +EA RLFD MP+
Sbjct: 193 VVSWTAMIVGYSSIGNLVEAKRLFDLMPE 221

 Score = 62.0 bits (149), Expect(3) = 1e-12
 Identities = 41/162 (25%), Positives = 82/162 (50%), Gaps = 11/162 (6%)
 Frame = -3

Query: 459 KLGDLDSARRLFN-----EMPERDAFSYTAMIDGYAKAGDMASARMLFEQSEQKGVVLWS 295
           +LG+LD A+ + +      +  R A    A+ID  AK G+M  A  LFE+  ++ ++ +
Sbjct: 333 QLGNLDLAKWVDSYATRCSVDLRGAHVTAALIDMNAKCGNMERAMYLFEKMPKRDLISYC 392

Query: 294 ALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELAKLIDSYLHSG 115
           +++ G + +G   +A+ +F +M    + PD  A   +++ACS+ G      L+D   H
Sbjct: 393 SVMQGLSIHGHGDQAVSLFERMLGEDLTPDDVAFTVILTACSRAG------LVDEGWHYF 446

Query: 114 SVDVCQGHVA------AALVDMNAKCGNMERALTLFEQLPTR 7
            +  C+  +       A +VD+ ++ G ++ A  L + +P +
Sbjct: 447 EMMRCKYSMVPSVDHYACIVDLLSRSGRLKEAYELIKSVPVQ 488

 Score = 33.5 bits (75), Expect(3) = 1e-12
 Identities = 19/61 (31%), Positives = 33/61 (54%)
 Frame = -1

Query: 689 ESDVFVGCSLINLYGKCREVECARKVFDAMPIRNEVSWTAMIFVYMIFGDALEANRLFDK 510
           E +V    ++I+ Y K  ++  AR +F   P R+ ++W+A+I  Y   G   EA + F +
Sbjct: 252 EKNVVSFTTMIDGYAKAGDMLSARNLFQKAPERDIIAWSALISGYTQNGQPNEAVKTFLE 311

Query: 509 M 507
           M
Sbjct: 312 M 312

 Score = 23.1 bits (48), Expect(3) = 1e-12
 Identities = 8/28 (28%), Positives = 14/28 (50%)
 Frame = -3

Query: 885 NIYLWNTLIEGYCRHSSLSNCISLFNRM 802
           N+  WN +I GY +   + +    F+ M
Sbjct: 223 NVASWNAIIGGYMKMGDVKSAEKAFDEM 250

>ref|XP_003521411.1| PREDICTED: putative pentatricopeptide repeat-containing protein At5g37570-like [Glycine max]
          Length = 566

 Score = 204 bits (519), Expect(2) = 8e-74
 Identities = 98/167 (58%), Positives = 132/167 (79%)
 Frame = -3

Query: 501 RYLHSWNVMIKGFVKLGDLDSARRLFNEMPERDAFSYTAMIDGYAKAGDMASARMLFEQS 322
           R + SWN M++GFVK+GDL  AR +F+ MPE++  S+T MIDGYAKAGDMA+AR LF+ S
Sbjct: 223 RNVASWNSMLQGFVKMGDLSGARGVFDAMPEKNVVSFTTMIDGYAKAGDMAAARFLFDCS 282

Query: 321 EQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELAK 142
            +K VV WSALISGY QNG   +A+++F +M+ ++++PD++ +VSLMSA +QLG LELA+
Sbjct: 283 LEKDVVAWSALISGYVQNGLPNQALRVFLEMELMNVKPDEFILVSLMSASAQLGHLELAQ 342

Query: 141 LIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTLFEQLPTRDV 1
            +DSY+    +D+ Q HV AAL+DMNAKCGNMERAL LF++ P RDV
Sbjct: 343 WVDSYVSKICIDLQQDHVIAALLDMNAKCGNMERALKLFDEKPRRDV 389

 Score = 100 bits (248), Expect(2) = 8e-74
 Identities = 47/93 (50%), Positives = 63/93 (67%)
 Frame = -1

Query: 782 FSVMLLINNCANALVLREGQVLHGFVVRYGIESDVFVGCSLINLYGKCREVECARKVFDA 603
           F+   +I  C+     REG+ LHG   R G++ D++VG SLI++YGKC E+  ARKVFD
Sbjct: 129 FTYPSVIKACSGTCKAREGKSLHGSAFRCGVDQDLYVGTSLIDMYGKCGEIADARKVFDG 188

Query: 602 MPIRNEVSWTAMIFVYMIFGDALEANRLFDKMP 504
           M  RN VSWTAM+  Y+  GD +EA +LFD+MP
Sbjct: 189 MSDRNVVSWTAMLVGYVAVGDVVEARKLFDEMP 221

 Score = 65.1 bits (157), Expect = 2e-08
 Identities = 36/128 (28%), Positives = 72/128 (56%), Gaps = 3/128 (2%)
 Frame = -3

Query: 387 AMIDGYAKAGDMASARMLFEQSEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQP 208
           A++D  AK G+M  A  LF++  ++ VVL+ ++I G + +G+ +EA+ +F +M    + P
Sbjct: 363 ALLDMNAKCGNMERALKLFDEKPRRDVVLYCSMIQGLSIHGRGEEAVNLFNRMLMEGLTP 422

Query: 207 DKYAMVSLMSACSQLGSLELAKLIDSYLHSGSVDVCQGHVA---AALVDMNAKCGNMERA 37
           D+ A   +++ACS+ G ++  +   +Y  S     C   +    A +VD+ ++ G++  A
Sbjct: 423 DEVAFTVILTACSRAGLVDEGR---NYFQSMKQKYCISPLPDHYACMVDLLSRSGHIRDA 479

Query: 36  LTLFEQLP 13
             L + +P
Sbjct: 480 YELIKLIP 487

>ref|XP_002870540.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316376|gb|EFH46799.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 547

 Score = 192 bits (489), Expect(3) = 1e-69
 Identities = 93/168 (55%), Positives = 122/168 (72%)
 Frame = -3

Query: 504 QRYLHSWNVMIKGFVKLGDLDSARRLFNEMPERDAFSYTAMIDGYAKAGDMASARMLFEQ 325
           +R L +WN ++ G VK GDL +AR+LF+EMP+RD  SYT+MIDGYAK GDM SAR LFE
Sbjct: 201 ERNLGTWNALVDGLVKSGDLVNARKLFDEMPKRDIISYTSMIDGYAKGGDMVSARDLFEN 260

Query: 324 SEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELA 145
           +    V  WSALI GYAQNGQ  EA ++F +M + +++PD++ MV LMSACSQ+G  EL
Sbjct: 261 ARGVDVRAWSALILGYAQNGQPNEAFKVFSEMCAKNVKPDEFIMVGLMSACSQMGCFELC 320

Query: 144 KLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTLFEQLPTRDV 1
           + +DSYLH         +V  AL+DMNAKCG+M+RA  LFE++P RD+
Sbjct: 321 EKVDSYLHQSMNKFSSHYVIPALIDMNAKCGHMDRAAKLFEEMPQRDL 368

 Score = 91.3 bits (225), Expect(3) = 1e-69
 Identities = 42/94 (44%), Positives = 61/94 (64%)
 Frame = -1

Query: 782 FSVMLLINNCANALVLREGQVLHGFVVRYGIESDVFVGCSLINLYGKCREVECARKVFDA 603
           ++  L++  C+N    R G  +HG V+R G + DV +G S ++ YGKC+++  ARKVF
Sbjct: 108 YTFPLVMKVCSNNAEFRVGSTVHGLVLRIGFDKDVVLGTSFVDFYGKCKDLCSARKVFGE 167

Query: 602 MPIRNEVSWTAMIFVYMIFGDALEANRLFDKMPK 501
           MP RN VSWTA+I  Y+  G+  EA R+FD MP+
Sbjct: 168 MPERNVVSWTALIVAYVKSGELEEAKRMFDLMPE 201

 Score = 27.7 bits (60), Expect(3) = 1e-69
 Identities = 12/29 (41%), Positives = 16/29 (55%)
 Frame = -3

Query: 879 YLWNTLIEGYCRHSSLSNCISLFNRMTKS 793
           YLWN LI+GY         +SL  RM ++
Sbjct: 72  YLWNHLIKGYSNKFLFFETVSLLMRMMRT 100

 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 34/125 (27%), Positives = 62/125 (49%)
 Frame = -3

Query: 387 AMIDGYAKAGDMASARMLFEQSEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQP 208
           A+ID  AK G M  A  LFE+  Q+ +V + +++ G A +G   EA+++F KM    I P
Sbjct: 342 ALIDMNAKCGHMDRAAKLFEEMPQRDLVSYCSMMEGMAIHGCGSEAVRLFEKMVDEGIVP 401

Query: 207 DKYAMVSLMSACSQLGSLELAKLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTL 28
           D+ A   ++  CSQ   +E        +      +      + +V++ ++ G ++ A  L
Sbjct: 402 DEVAFTVILKVCSQSRLVEEGLRYFELMRKEYSILASPDHYSCIVNLLSRTGKLKEAYEL 461

Query: 27  FEQLP 13
            + +P
Sbjct: 462 IKSMP 466

>ref|NP_198573.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75170664|sp|Q9FHR3.1|PP403_ARATH RecName: Full=Putative pentatricopeptide repeat-containing protein At5g37570
 gi|9757967|dbj|BAB08303.1| unnamed protein product [Arabidopsis thaliana]
 gi|332006824|gb|AED94207.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 550

 Score = 193 bits (490), Expect(2) = 4e-67
 Identities = 93/168 (55%), Positives = 123/168 (73%)
 Frame = -3

Query: 504 QRYLHSWNVMIKGFVKLGDLDSARRLFNEMPERDAFSYTAMIDGYAKAGDMASARMLFEQ 325
           +R L SWN ++ G VK GDL +A++LF+EMP+RD  SYT+MIDGYAK GDM SAR LFE+
Sbjct: 204 ERNLGSWNALVDGLVKSGDLVNAKKLFDEMPKRDIISYTSMIDGYAKGGDMVSARDLFEE 263

Query: 324 SEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQPDKYAMVSLMSACSQLGSLELA 145
           +    V  WSALI GYAQNGQ  EA ++F +M + +++PD++ MV LMSACSQ+G  EL
Sbjct: 264 ARGVDVRAWSALILGYAQNGQPNEAFKVFSEMCAKNVKPDEFIMVGLMSACSQMGCFELC 323

Query: 144 KLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTLFEQLPTRDV 1
           + +DSYLH         +V  AL+DMNAKCG+M+RA  LFE++P RD+
Sbjct: 324 EKVDSYLHQRMNKFSSHYVVPALIDMNAKCGHMDRAAKLFEEMPQRDL 371

 Score = 89.0 bits (219), Expect(2) = 4e-67
 Identities = 41/94 (43%), Positives = 61/94 (64%)
 Frame = -1

Query: 782 FSVMLLINNCANALVLREGQVLHGFVVRYGIESDVFVGCSLINLYGKCREVECARKVFDA 603
           ++  L++  C+N   +R G  +HG V+R G + DV VG S ++ YGKC+++  ARKVF
Sbjct: 111 YTFPLVMKVCSNNGQVRVGSSVHGLVLRIGFDKDVVVGTSFVDFYGKCKDLFSARKVFGE 170

Query: 602 MPIRNEVSWTAMIFVYMIFGDALEANRLFDKMPK 501
           MP RN VSWTA++  Y+  G+  EA  +FD MP+
Sbjct: 171 MPERNAVSWTALVVAYVKSGELEEAKSMFDLMPE 204

 Score = 57.0 bits (136), Expect = 6e-06
 Identities = 34/125 (27%), Positives = 61/125 (48%)
 Frame = -3

Query: 387 AMIDGYAKAGDMASARMLFEQSEQKGVVLWSALISGYAQNGQSKEAIQMFYKMQSLSIQP 208
           A+ID  AK G M  A  LFE+  Q+ +V + +++ G A +G   EAI++F KM    I P
Sbjct: 345 ALIDMNAKCGHMDRAAKLFEEMPQRDLVSYCSMMEGMAIHGCGSEAIRLFEKMVDEGIVP 404

Query: 207 DKYAMVSLMSACSQLGSLELAKLIDSYLHSGSVDVCQGHVAAALVDMNAKCGNMERALTL 28
           D+ A   ++  C Q   +E        +      +      + +V++ ++ G ++ A  L
Sbjct: 405 DEVAFTVILKVCGQSRLVEEGLRYFELMRKKYSILASPDHYSCIVNLLSRTGKLKEAYEL 464

Query: 27  FEQLP 13
            + +P
Sbjct: 465 IKSMP 469