BLASTX nr result

ID: Cephaelis21_contig00005071 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00005071
         (1547 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAS79604.1| putative pentatricopeptide repeat-containing prot...   408   e-111
ref|XP_002283651.1| PREDICTED: pentatricopeptide repeat-containi...   404   e-110
ref|XP_003543566.1| PREDICTED: pentatricopeptide repeat-containi...   386   e-104
ref|XP_003603974.1| Pentatricopeptide repeat-containing protein ...   383   e-104
ref|XP_003523513.1| PREDICTED: pentatricopeptide repeat-containi...   366   1e-98

>gb|AAS79604.1| putative pentatricopeptide repeat-containing protein [Ipomoea
            trifida] gi|118562903|dbj|BAF37793.1| hypothetical
            protein [Ipomoea trifida]
          Length = 575

 Score =  408 bits (1048), Expect = e-111
 Identities = 201/341 (58%), Positives = 252/341 (73%), Gaps = 3/341 (0%)
 Frame = -1

Query: 1541 NSKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMK 1362
            N+ VSTALIDMYAKC CID A EVF+ET+ KDV  WTA+I GLA  G C +A+E FE MK
Sbjct: 236  NANVSTALIDMYAKCGCIDGALEVFDETLEKDVYVWTAIIAGLASHGLCMKAIEFFENMK 295

Query: 1361 SLEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHL 1182
              ++K DER + AVL A RN   VSEGL  FR +KK HK++PT+QHY C+VDML   G L
Sbjct: 296  KSDVKMDERAITAVLSAYRNAGLVSEGLLFFRRLKK-HKIKPTIQHYGCVVDMLTRAGRL 354

Query: 1181 EDAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPKD 1002
            +DAE F++KMP++ D +LWRTLI  C++LGDVER ERL++ LELL M S D+ S+++ ++
Sbjct: 355  KDAEEFIRKMPIEPDAVLWRTLIWGCKILGDVERSERLVRELELLNMDSRDTGSYVLLEN 414

Query: 1001 LYDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLDE 822
            +Y +  KW+EK   R L  QRG  K P  SRIEI+G VHEFT GD R  E   +  KL++
Sbjct: 415  VYAATGKWEEKAKTRELMYQRGLMKPPACSRIEIDGVVHEFTAGDSRHDEATAVYEKLED 474

Query: 821  IALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLRP 651
            +   L  EGY+P  S   ++IDD++KA +LLHHSEKLAVSFGL+++SPGS IRIVKNLR 
Sbjct: 475  VEERLRGEGYNPIVSEVLLEIDDDEKASQLLHHSEKLAVSFGLVKSSPGSVIRIVKNLRS 534

Query: 650  CVDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528
            C DCHSFMKL+SKVY+R+IIVRD IRFHHF  G C CG+ W
Sbjct: 535  CEDCHSFMKLISKVYQRDIIVRDRIRFHHFSGGNCSCGDRW 575



 Score = 58.2 bits (139), Expect = 6e-06
 Identities = 37/141 (26%), Positives = 74/141 (52%)
 Frame = -1

Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353
            ++ ALI +Y+     + A +VF++  ++DV++WT++I G     +  EA+ LF  M    
Sbjct: 138  INNALIHLYSVSGEPNLAYKVFDKMPDRDVVSWTSIIDGFVDNDRPIEAIRLFTHMIENG 197

Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLEDA 1173
            I+P+E T+ +VL AC +   ++ G      +K+K+           ++DM A  G ++ A
Sbjct: 198  IEPNEVTVASVLRACADTGALNTGERIHSFVKEKN-FSSNANVSTALIDMYAKCGCIDGA 256

Query: 1172 ESFMKKMPVQADILLWRTLIS 1110
                 +  ++ D+ +W  +I+
Sbjct: 257  LEVFDE-TLEKDVYVWTAIIA 276


>ref|XP_002283651.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065
            [Vitis vinifera] gi|297744424|emb|CBI37686.3| unnamed
            protein product [Vitis vinifera]
          Length = 571

 Score =  404 bits (1038), Expect = e-110
 Identities = 197/342 (57%), Positives = 256/342 (74%), Gaps = 3/342 (0%)
 Frame = -1

Query: 1544 LNSKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQM 1365
            L + V TALIDMYAKC  I SA +VF+  VNKDV AWTAMI+GLA  G C+EA+ LF+QM
Sbjct: 230  LEANVRTALIDMYAKCGSIGSARKVFDGIVNKDVFAWTAMISGLANHGLCEEAVTLFDQM 289

Query: 1364 KSLEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGH 1185
            +S  ++PDERT+ AVL ACRN    SEG   F +M  K+ ++PT+QHY C+VD+LA TGH
Sbjct: 290  ESFGLRPDERTMTAVLSACRNAGWFSEGFAYFNSMWCKYGIKPTIQHYGCMVDLLARTGH 349

Query: 1184 LEDAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPK 1005
            L++AE F++KMP++ D++LWRTLI A ++ GD++R E+LMK   LLKM S D  S+++  
Sbjct: 350  LDEAEEFIRKMPIEPDVVLWRTLIWASKVHGDIDRSEQLMKDRGLLKMDSDDCGSYVLLG 409

Query: 1004 DLYDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLD 825
            ++Y S  KW +K  MR L NQ+G SK PG SRIE++G VHEF  GD    E EKI  KLD
Sbjct: 410  NVYASAGKWHDKAKMRELMNQKGLSKPPGCSRIEVDGLVHEFAAGDSGHIEAEKIYAKLD 469

Query: 824  EIALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLR 654
            E+   L  EGY PK S   ++ID+++KAF+L HHSEKLAV+FGLI+TSPG+ IRIVKNLR
Sbjct: 470  EVEERLKAEGYHPKLSEVLLEIDNKEKAFQLRHHSEKLAVAFGLIKTSPGTEIRIVKNLR 529

Query: 653  PCVDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528
             C DCHS +KL+SK+Y+++IIVRD IRFHHF NG+C C ++W
Sbjct: 530  SCEDCHSVLKLISKIYQQDIIVRDRIRFHHFINGDCSCKDYW 571



 Score = 72.4 bits (176), Expect = 3e-10
 Identities = 47/160 (29%), Positives = 76/160 (47%)
 Frame = -1

Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353
            VS  LI MY+ C     A +VF +  ++DV++WT+MI G     +  EA+ LFE+M    
Sbjct: 133  VSNGLIHMYSSCGKSGRAYKVFGKMRDRDVVSWTSMIDGFVDDDRALEAIRLFEEMVEDG 192

Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLEDA 1173
            ++P+E T+ +VL AC +   V  G      ++++ K+         ++DM A  G +  A
Sbjct: 193  VEPNEATVVSVLRACADAGAVGMGRRVQGVIEER-KIGLEANVRTALIDMYAKCGSIGSA 251

Query: 1172 ESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLE 1053
                  + V  D+  W  +IS     G  E    L   +E
Sbjct: 252  RKVFDGI-VNKDVFAWTAMISGLANHGLCEEAVTLFDQME 290


>ref|XP_003543566.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like
            [Glycine max]
          Length = 572

 Score =  386 bits (991), Expect = e-104
 Identities = 182/340 (53%), Positives = 250/340 (73%), Gaps = 3/340 (0%)
 Frame = -1

Query: 1538 SKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKS 1359
            S VSTAL+DMYAK  CI SA +VF++ V++DV  WTAMI+GLA  G CK+A+++F  M+S
Sbjct: 233  SNVSTALVDMYAKGGCIASARKVFDDVVHRDVFVWTAMISGLASHGLCKDAIDMFVDMES 292

Query: 1358 LEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLE 1179
              +KPDERT+ AVL ACRN   + EG   F  +++++ ++P++QH+ C+VD+LA  G L+
Sbjct: 293  SGVKPDERTVTAVLTACRNAGLIREGFMLFSDVQRRYGMKPSIQHFGCLVDLLARAGRLK 352

Query: 1178 DAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPKDL 999
            +AE F+  MP++ D +LWRTLI AC++ GD +R ERLMKHLE+  M + DS S+++  ++
Sbjct: 353  EAEDFVNAMPIEPDTVLWRTLIWACKVHGDADRAERLMKHLEIQDMRADDSGSYILASNV 412

Query: 998  YDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLDEI 819
            Y S  KW  K  +R L N++G  K PG SRIE++G VHEF  GD    E E+I  +L E+
Sbjct: 413  YASTGKWCNKAEVRELMNKKGLVKPPGTSRIEVDGGVHEFVMGDYNHPEAEEIFVELAEV 472

Query: 818  ALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLRPC 648
               +  EGYDP+ S   +++DDE+KA +LLHHSEKLA+++GLIR   GSTIRIVKNLR C
Sbjct: 473  VDKIRKEGYDPRVSEVLLEMDDEEKAVQLLHHSEKLALAYGLIRIGHGSTIRIVKNLRSC 532

Query: 647  VDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528
             DCH FMKL+SK+Y+R+IIVRD IRFHHF+NGEC C ++W
Sbjct: 533  EDCHEFMKLISKIYKRDIIVRDRIRFHHFKNGECSCKDYW 572



 Score = 59.7 bits (143), Expect = 2e-06
 Identities = 38/142 (26%), Positives = 70/142 (49%), Gaps = 1/142 (0%)
 Frame = -1

Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353
            +   L+ MY++   +  A  +F+   ++DV++WT+MI GL       EA+ LFE+M    
Sbjct: 132  IQNVLLHMYSEFGDLLLARSLFDRMPHRDVVSWTSMIGGLVNHDLPVEAINLFERMLQCG 191

Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKK-KHKVRPTMQHYRCIVDMLAHTGHLED 1176
            ++ +E T+ +VL AC +   +S G      +++   ++         +VDM A  G +  
Sbjct: 192  VEVNEATVISVLRACADSGALSMGRKVHANLEEWGIEIHSKSNVSTALVDMYAKGGCIAS 251

Query: 1175 AESFMKKMPVQADILLWRTLIS 1110
            A      + V  D+ +W  +IS
Sbjct: 252  ARKVFDDV-VHRDVFVWTAMIS 272


>ref|XP_003603974.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355493022|gb|AES74225.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 566

 Score =  383 bits (984), Expect = e-104
 Identities = 184/343 (53%), Positives = 250/343 (72%), Gaps = 3/343 (0%)
 Frame = -1

Query: 1547 NLNSKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQ 1368
            +  + V TALI MY+KC C++SA EVF++ +++DV  WTAMI GLA  G CKEA+ELF +
Sbjct: 224  DFKANVCTALIHMYSKCGCLESAREVFDDVLDRDVFVWTAMIYGLACHGMCKEAIELFLE 283

Query: 1367 MKSLEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTG 1188
            M++  +KPDERT+  VL A RN   V EG   F  ++K++ ++P ++H+ C+VD+LA  G
Sbjct: 284  METCNVKPDERTIMVVLSAYRNAGLVREGYMFFNDVQKRYSMKPNIKHFGCMVDLLAKGG 343

Query: 1187 HLEDAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVP 1008
             LE+AE F+  MP++ D ++WRTLI AC++  D ER ERLMKHLEL  M + DS S+++ 
Sbjct: 344  CLEEAEDFINAMPMKPDAVIWRTLIWACKVHADTERAERLMKHLELQGMSAHDSGSYILA 403

Query: 1007 KDLYDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKL 828
             ++Y S  KW +K  +R L N++G  K PG SRIE++G VHEF  GD    + EKI  KL
Sbjct: 404  SNVYASTGKWCDKAEVRELMNKKGLVKPPGSSRIEVDGVVHEFVMGDYDHPDTEKIFIKL 463

Query: 827  DEIALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNL 657
            D++   L  EGY+PK S   +++DDE+KA +LLHHSEKLA+++GLIRT PGS IRIVKNL
Sbjct: 464  DQMVDKLRKEGYNPKVSEVMLEMDDEEKAIQLLHHSEKLALAYGLIRTCPGSKIRIVKNL 523

Query: 656  RPCVDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528
            R C DCH FMKL+SKVY+R+IIVRD IRFHHF+NG+C C ++W
Sbjct: 524  RSCEDCHEFMKLISKVYQRDIIVRDRIRFHHFKNGDCSCKDYW 566



 Score = 60.1 bits (144), Expect = 2e-06
 Identities = 40/160 (25%), Positives = 75/160 (46%)
 Frame = -1

Query: 1532 VSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKSLE 1353
            +  ALI MY++   +  A +VF+   ++DV++WT+MI G        EA++LF++M  + 
Sbjct: 128  IQNALIHMYSEIGELVIARQVFDRMSHRDVVSWTSMIAGFVNHHLTVEAIQLFQRMLEVG 187

Query: 1352 IKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLEDA 1173
            +  +E T+ +VL  C +   +S G      +K+K  +         ++ M +  G LE A
Sbjct: 188  VDVNEATVISVLRGCADSGALSVGRKVHGIVKEK-GIDFKANVCTALIHMYSKCGCLESA 246

Query: 1172 ESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLE 1053
                  + +  D+ +W  +I      G  +    L   +E
Sbjct: 247  REVFDDV-LDRDVFVWTAMIYGLACHGMCKEAIELFLEME 285


>ref|XP_003523513.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like
            [Glycine max]
          Length = 542

 Score =  366 bits (939), Expect = 1e-98
 Identities = 176/340 (51%), Positives = 244/340 (71%), Gaps = 3/340 (0%)
 Frame = -1

Query: 1538 SKVSTALIDMYAKCWCIDSATEVFNETVNKDVIAWTAMITGLAIRGKCKEAMELFEQMKS 1359
            S VSTAL+DMYAK  CI    +VF++ V++DV  WTAMI+GLA  G CK+A+++F  M+S
Sbjct: 205  SNVSTALVDMYAKSGCI--VRKVFDDVVDRDVFVWTAMISGLASHGLCKDAIDMFVDMES 262

Query: 1358 LEIKPDERTLNAVLLACRNGRCVSEGLNCFRTMKKKHKVRPTMQHYRCIVDMLAHTGHLE 1179
              +KPDERT+  VL ACRN   + EG   F  +++++ ++P++QH+ C+VD+LA  G L+
Sbjct: 263  SGVKPDERTVTTVLTACRNAGLIREGFMLFSDVQRRYGMKPSIQHFGCLVDLLARAGRLK 322

Query: 1178 DAESFMKKMPVQADILLWRTLISACELLGDVERGERLMKHLELLKMHSSDSESHLVPKDL 999
            +AE F+  MP++ D +LWRTLI AC++ GD +R ERLMKHLE+  M + DS S+++  ++
Sbjct: 323  EAEDFVNAMPIEPDAVLWRTLIWACKVHGDDDRAERLMKHLEIQDMRADDSGSYILTSNV 382

Query: 998  YDSDDKWQEKGNMRALTNQRGFSKAPGYSRIEINGEVHEFTTGDMRLFEVEKIDGKLDEI 819
            Y S  KW  K  +R L N++G  K  G SRIEI+G VHEF  GD    E E+I  +L E+
Sbjct: 383  YASTGKWCNKAEVRELMNKKGLVKPLGSSRIEIDGGVHEFVMGDYNHPEAEEIFVELAEV 442

Query: 818  ALNLSHEGYDPKYS---VDIDDEDKAFELLHHSEKLAVSFGLIRTSPGSTIRIVKNLRPC 648
               +  EGYDP+ S   +++DDE+KA +LLHHSEKLA+++GLIR   GSTI IVKNLR C
Sbjct: 443  MDKIRKEGYDPRVSEVLLEMDDEEKAVQLLHHSEKLALAYGLIRIGHGSTIWIVKNLRSC 502

Query: 647  VDCHSFMKLLSKVYEREIIVRDHIRFHHFRNGECCCGNFW 528
             DCH FMKL+SK+ +R+I+VRD IRFHHF+NGEC C ++W
Sbjct: 503  EDCHEFMKLISKICKRDIVVRDRIRFHHFKNGECSCKDYW 542


Top