BLASTX nr result
ID: Cephaelis21_contig00005071
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00005071 (1547 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAS79604.1| putative pentatricopeptide repeat-containing prot... 408 e-111 ref|XP_002283651.1| PREDICTED: pentatricopeptide repeat-containi... 404 e-110 ref|XP_003543566.1| PREDICTED: pentatricopeptide repeat-containi... 386 e-104 ref|XP_003603974.1| Pentatricopeptide repeat-containing protein ... 383 e-104 ref|XP_003523513.1| PREDICTED: pentatricopeptide repeat-containi... 366 1e-98 >gb|AAS79604.1| putative pentatricopeptide repeat-containing protein [Ipomoea trifida] gi|118562903|dbj|BAF37793.1| hypothetical protein [Ipomoea trifida] Length = 575 Score = 408 bits (1048), Expect = e-111 Identities = 201/341 (58%), Positives = 252/341 (73%), Gaps = 3/341 (0%) Frame = -1 Query: 1541 NSKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMK 1362 N+ VSTALIDMYAKC CID A EVF+ET+ KDV WTA+I GLA G C +A+E FE MK Sbjct: 236 NANVSTALIDMYAKCGCIDGALEVFDETLEKDVYVWTAIIAGLASHGLCMKAIEFFENMK 295 Query: 1361 SLEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHL 1182 ++K DER + AVL A RN VSEGL FR +KK HK++PT+QHY C+VDML G L Sbjct: 296 KSDVKMDERAITAVLSAYRNAGLVSEGLLFFRRLKK-HKIKPTIQHYGCVVDMLTRAGRL 354 Query: 1181 EDAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPKD 1002 +DAE F++KMP++ D +LWRTLI C++LGDVER ERL++ LELL M S D+ S+++ ++ Sbjct: 355 KDAEEFIRKMPIEPDAVLWRTLIWGCKILGDVERSERLVRELELLNMDSRDTGSYVLLEN 414 Query: 1001 LYDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLDE 822 +Y + KW+EK R L QRG K P SRIEI+G VHEFT GD R E + KL++ Sbjct: 415 VYAATGKWEEKAKTRELMYQRGLMKPPACSRIEIDGVVHEFTAGDSRHDEATAVYEKLED 474 Query: 821 IALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLRP 651 + L EGY+P S ++IDD++KA +LLHHSEKLAVSFGL+++SPGS IRIVKNLR Sbjct: 475 VEERLRGEGYNPIVSEVLLEIDDDEKASQLLHHSEKLAVSFGLVKSSPGSVIRIVKNLRS 534 Query: 650 CVDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528 C DCHSFMKL+SKVY+R+IIVRD IRFHHF G C CG+ W Sbjct: 535 CEDCHSFMKLISKVYQRDIIVRDRIRFHHFSGGNCSCGDRW 575 Score = 58.2 bits (139), Expect = 6e-06 Identities = 37/141 (26%), Positives = 74/141 (52%) Frame = -1 Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353 ++ ALI +Y+ + A +VF++ ++DV++WT++I G + EA+ LF M Sbjct: 138 INNALIHLYSVSGEPNLAYKVFDKMPDRDVVSWTSIIDGFVDNDRPIEAIRLFTHMIENG 197 Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLEDA 1173 I+P+E T+ +VL AC + ++ G +K+K+ ++DM A G ++ A Sbjct: 198 IEPNEVTVASVLRACADTGALNTGERIHSFVKEKN-FSSNANVSTALIDMYAKCGCIDGA 256 Query: 1172 ESFMKKMPVQADILLWRTLIS 1110 + ++ D+ +W +I+ Sbjct: 257 LEVFDE-TLEKDVYVWTAIIA 276 >ref|XP_002283651.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065 [Vitis vinifera] gi|297744424|emb|CBI37686.3| unnamed protein product [Vitis vinifera] Length = 571 Score = 404 bits (1038), Expect = e-110 Identities = 197/342 (57%), Positives = 256/342 (74%), Gaps = 3/342 (0%) Frame = -1 Query: 1544 LNSKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQM 1365 L + V TALIDMYAKC I SA +VF+ VNKDV AWTAMI+GLA G C+EA+ LF+QM Sbjct: 230 LEANVRTALIDMYAKCGSIGSARKVFDGIVNKDVFAWTAMISGLANHGLCEEAVTLFDQM 289 Query: 1364 KSLEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGH 1185 +S ++PDERT+ AVL ACRN SEG F +M K+ ++PT+QHY C+VD+LA TGH Sbjct: 290 ESFGLRPDERTMTAVLSACRNAGWFSEGFAYFNSMWCKYGIKPTIQHYGCMVDLLARTGH 349 Query: 1184 LEDAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPK 1005 L++AE F++KMP++ D++LWRTLI A ++ GD++R E+LMK LLKM S D S+++ Sbjct: 350 LDEAEEFIRKMPIEPDVVLWRTLIWASKVHGDIDRSEQLMKDRGLLKMDSDDCGSYVLLG 409 Query: 1004 DLYDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLD 825 ++Y S KW +K MR L NQ+G SK PG SRIE++G VHEF GD E EKI KLD Sbjct: 410 NVYASAGKWHDKAKMRELMNQKGLSKPPGCSRIEVDGLVHEFAAGDSGHIEAEKIYAKLD 469 Query: 824 EIALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLR 654 E+ L EGY PK S ++ID+++KAF+L HHSEKLAV+FGLI+TSPG+ IRIVKNLR Sbjct: 470 EVEERLKAEGYHPKLSEVLLEIDNKEKAFQLRHHSEKLAVAFGLIKTSPGTEIRIVKNLR 529 Query: 653 PCVDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528 C DCHS +KL+SK+Y+++IIVRD IRFHHF NG+C C ++W Sbjct: 530 SCEDCHSVLKLISKIYQQDIIVRDRIRFHHFINGDCSCKDYW 571 Score = 72.4 bits (176), Expect = 3e-10 Identities = 47/160 (29%), Positives = 76/160 (47%) Frame = -1 Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353 VS LI MY+ C A +VF + ++DV++WT+MI G + EA+ LFE+M Sbjct: 133 VSNGLIHMYSSCGKSGRAYKVFGKMRDRDVVSWTSMIDGFVDDDRALEAIRLFEEMVEDG 192 Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLEDA 1173 ++P+E T+ +VL AC + V G ++++ K+ ++DM A G + A Sbjct: 193 VEPNEATVVSVLRACADAGAVGMGRRVQGVIEER-KIGLEANVRTALIDMYAKCGSIGSA 251 Query: 1172 ESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLE 1053 + V D+ W +IS G E L +E Sbjct: 252 RKVFDGI-VNKDVFAWTAMISGLANHGLCEEAVTLFDQME 290 >ref|XP_003543566.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Glycine max] Length = 572 Score = 386 bits (991), Expect = e-104 Identities = 182/340 (53%), Positives = 250/340 (73%), Gaps = 3/340 (0%) Frame = -1 Query: 1538 SKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKS 1359 S VSTAL+DMYAK CI SA +VF++ V++DV WTAMI+GLA G CK+A+++F M+S Sbjct: 233 SNVSTALVDMYAKGGCIASARKVFDDVVHRDVFVWTAMISGLASHGLCKDAIDMFVDMES 292 Query: 1358 LEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLE 1179 +KPDERT+ AVL ACRN + EG F +++++ ++P++QH+ C+VD+LA G L+ Sbjct: 293 SGVKPDERTVTAVLTACRNAGLIREGFMLFSDVQRRYGMKPSIQHFGCLVDLLARAGRLK 352 Query: 1178 DAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPKDL 999 +AE F+ MP++ D +LWRTLI AC++ GD +R ERLMKHLE+ M + DS S+++ ++ Sbjct: 353 EAEDFVNAMPIEPDTVLWRTLIWACKVHGDADRAERLMKHLEIQDMRADDSGSYILASNV 412 Query: 998 YDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLDEI 819 Y S KW K +R L N++G K PG SRIE++G VHEF GD E E+I +L E+ Sbjct: 413 YASTGKWCNKAEVRELMNKKGLVKPPGTSRIEVDGGVHEFVMGDYNHPEAEEIFVELAEV 472 Query: 818 ALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLRPC 648 + EGYDP+ S +++DDE+KA +LLHHSEKLA+++GLIR GSTIRIVKNLR C Sbjct: 473 VDKIRKEGYDPRVSEVLLEMDDEEKAVQLLHHSEKLALAYGLIRIGHGSTIRIVKNLRSC 532 Query: 647 VDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528 DCH FMKL+SK+Y+R+IIVRD IRFHHF+NGEC C ++W Sbjct: 533 EDCHEFMKLISKIYKRDIIVRDRIRFHHFKNGECSCKDYW 572 Score = 59.7 bits (143), Expect = 2e-06 Identities = 38/142 (26%), Positives = 70/142 (49%), Gaps = 1/142 (0%) Frame = -1 Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353 + L+ MY++ + A +F+ ++DV++WT+MI GL EA+ LFE+M Sbjct: 132 IQNVLLHMYSEFGDLLLARSLFDRMPHRDVVSWTSMIGGLVNHDLPVEAINLFERMLQCG 191 Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKK-KHKVRPTMQHYRCIVDMLAHTGHLED 1176 ++ +E T+ +VL AC + +S G +++ ++ +VDM A G + Sbjct: 192 VEVNEATVISVLRACADSGALSMGRKVHANLEEWGIEIHSKSNVSTALVDMYAKGGCIAS 251 Query: 1175 AESFMKKMPVQADILLWRTLIS 1110 A + V D+ +W +IS Sbjct: 252 ARKVFDDV-VHRDVFVWTAMIS 272 >ref|XP_003603974.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355493022|gb|AES74225.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 566 Score = 383 bits (984), Expect = e-104 Identities = 184/343 (53%), Positives = 250/343 (72%), Gaps = 3/343 (0%) Frame = -1 Query: 1547 NLNSKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQ 1368 + + V TALI MY+KC C++SA EVF++ +++DV WTAMI GLA G CKEA+ELF + Sbjct: 224 DFKANVCTALIHMYSKCGCLESAREVFDDVLDRDVFVWTAMIYGLACHGMCKEAIELFLE 283 Query: 1367 MKSLEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTG 1188 M++ +KPDERT+ VL A RN V EG F ++K++ ++P ++H+ C+VD+LA G Sbjct: 284 METCNVKPDERTIMVVLSAYRNAGLVREGYMFFNDVQKRYSMKPNIKHFGCMVDLLAKGG 343 Query: 1187 HLEDAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVP 1008 LE+AE F+ MP++ D ++WRTLI AC++ D ER ERLMKHLEL M + DS S+++ Sbjct: 344 CLEEAEDFINAMPMKPDAVIWRTLIWACKVHADTERAERLMKHLELQGMSAHDSGSYILA 403 Query: 1007 KDLYDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKL 828 ++Y S KW +K +R L N++G K PG SRIE++G VHEF GD + EKI KL Sbjct: 404 SNVYASTGKWCDKAEVRELMNKKGLVKPPGSSRIEVDGVVHEFVMGDYDHPDTEKIFIKL 463 Query: 827 DEIALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNL 657 D++ L EGY+PK S +++DDE+KA +LLHHSEKLA+++GLIRT PGS IRIVKNL Sbjct: 464 DQMVDKLRKEGYNPKVSEVMLEMDDEEKAIQLLHHSEKLALAYGLIRTCPGSKIRIVKNL 523 Query: 656 RPCVDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528 R C DCH FMKL+SKVY+R+IIVRD IRFHHF+NG+C C ++W Sbjct: 524 RSCEDCHEFMKLISKVYQRDIIVRDRIRFHHFKNGDCSCKDYW 566 Score = 60.1 bits (144), Expect = 2e-06 Identities = 40/160 (25%), Positives = 75/160 (46%) Frame = -1 Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353 + ALI MY++ + A +VF+ ++DV++WT+MI G EA++LF++M + Sbjct: 128 IQNALIHMYSEIGELVIARQVFDRMSHRDVVSWTSMIAGFVNHHLTVEAIQLFQRMLEVG 187 Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLEDA 1173 + +E T+ +VL C + +S G +K+K + ++ M + G LE A Sbjct: 188 VDVNEATVISVLRGCADSGALSVGRKVHGIVKEK-GIDFKANVCTALIHMYSKCGCLESA 246 Query: 1172 ESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLE 1053 + + D+ +W +I G + L +E Sbjct: 247 REVFDDV-LDRDVFVWTAMIYGLACHGMCKEAIELFLEME 285 >ref|XP_003523513.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Glycine max] Length = 542 Score = 366 bits (939), Expect = 1e-98 Identities = 176/340 (51%), Positives = 244/340 (71%), Gaps = 3/340 (0%) Frame = -1 Query: 1538 SKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKS 1359 S VSTAL+DMYAK CI +VF++ V++DV WTAMI+GLA G CK+A+++F M+S Sbjct: 205 SNVSTALVDMYAKSGCI--VRKVFDDVVDRDVFVWTAMISGLASHGLCKDAIDMFVDMES 262 Query: 1358 LEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLE 1179 +KPDERT+ VL ACRN + EG F +++++ ++P++QH+ C+VD+LA G L+ Sbjct: 263 SGVKPDERTVTTVLTACRNAGLIREGFMLFSDVQRRYGMKPSIQHFGCLVDLLARAGRLK 322 Query: 1178 DAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPKDL 999 +AE F+ MP++ D +LWRTLI AC++ GD +R ERLMKHLE+ M + DS S+++ ++ Sbjct: 323 EAEDFVNAMPIEPDAVLWRTLIWACKVHGDDDRAERLMKHLEIQDMRADDSGSYILTSNV 382 Query: 998 YDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLDEI 819 Y S KW K +R L N++G K G SRIEI+G VHEF GD E E+I +L E+ Sbjct: 383 YASTGKWCNKAEVRELMNKKGLVKPLGSSRIEIDGGVHEFVMGDYNHPEAEEIFVELAEV 442 Query: 818 ALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLRPC 648 + EGYDP+ S +++DDE+KA +LLHHSEKLA+++GLIR GSTI IVKNLR C Sbjct: 443 MDKIRKEGYDPRVSEVLLEMDDEEKAVQLLHHSEKLALAYGLIRIGHGSTIWIVKNLRSC 502 Query: 647 VDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528 DCH FMKL+SK+ +R+I+VRD IRFHHF+NGEC C ++W Sbjct: 503 EDCHEFMKLISKICKRDIVVRDRIRFHHFKNGECSCKDYW 542