BLASTX nr result

ID: Cephaelis21_contig00012143 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00012143
         (2169 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi...   728   0.0  
emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]   728   0.0  
ref|XP_002521980.1| pentatricopeptide repeat-containing protein,...   684   0.0  
ref|XP_002884468.1| pentatricopeptide repeat-containing protein ...   674   0.0  
ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar...   672   0.0  

>ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760,
            chloroplastic-like [Vitis vinifera]
          Length = 582

 Score =  728 bits (1880), Expect = 0.0
 Identities = 355/580 (61%), Positives = 458/580 (78%), Gaps = 3/580 (0%)
 Frame = -1

Query: 1989 TIFSADVFSLLFPC--VTNPTSNLHKKAVVRCRSLMSNDR-KSESSQNAKVSTETRSTHL 1819
            TI+S D F    P      PTS+ H  ++V CR+   ND   S ++    VS E R  HL
Sbjct: 2    TIYSTDFFPRCPPFNPQLKPTSHSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEARPAHL 61

Query: 1818 QPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPDVILSTKLIQGFFSSKNAEK 1639
            Q  D  E + +KLL RS KA K+NE+LYFLEC+VN+   PDVIL TKLI+GFF+ KN EK
Sbjct: 62   QSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKLIKGFFNFKNIEK 121

Query: 1638 AIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNRMRARGFSPDVVTYNIMIGS 1459
            A RVM+ILE + EPDVFAYNAVISGFCK+N+ ++A ++LNRM+ARGF PD+VTYNIMIGS
Sbjct: 122  ASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNIMIGS 181

Query: 1458 LCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGGTREAMKLFDEMLSKGLQPD 1279
            LCNR KLGLALKV DQLL DNC P+V+TYTILIEAT VEGG  EAMKL +EML++GL PD
Sbjct: 182  LCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGLLPD 241

Query: 1278 MYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNVLLRALLSQGRWKDGEKLVA 1099
            MYTYNAIIR MCK+G+++ A + + SL ++GCK D++SYN+LLRA L+QG+W +GEKLVA
Sbjct: 242  MYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGEKLVA 301

Query: 1098 EMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEGLTPNTYTYDPLISAFCKEG 919
            EM S   EPN VTY+ILIS+LC  G+++E++  LK+M+++ LTP+TY+YDPLISA CKEG
Sbjct: 302  EMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALCKEG 361

Query: 918  KMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQALELFGNLSEVGCPPDVSTYN 739
            ++DLAI  +++M+S+GCLPDIVNYNT+L+A+CKN N +QALE+F  L  +GCPP+VS+YN
Sbjct: 362  RLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVSSYN 421

Query: 738  TMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCLCRDGMVDEARELLTGMESS 559
            TMISALW+ G+R++AL +   MISK +DPDEIT+ +LISCLCRDG+V+EA  LL  ME S
Sbjct: 422  TMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDDMEQS 481

Query: 558  GFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNETTYILLLEGIGFAGWQAEA 379
            GF  TV +YNIVLLGLCKV R+DDAI +   MI+KGC PNETTYILL+EGIGFAGW+ EA
Sbjct: 482  GFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWRTEA 541

Query: 378  IETASTLLRKHVITKESVQRLKRTFPTLNADKAVTHTKRK 259
            +E A++L  + VI+++S +RL +TFP L+  K +++++ K
Sbjct: 542  MELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETK 581


>emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera]
          Length = 592

 Score =  728 bits (1879), Expect = 0.0
 Identities = 356/582 (61%), Positives = 458/582 (78%), Gaps = 3/582 (0%)
 Frame = -1

Query: 1995 MTTIFSADVFSLL--FPCVTNPTSNLHKKAVVRCRSLMSNDR-KSESSQNAKVSTETRST 1825
            + TI+S D F     F     PTS+ H  ++V CR+   ND   S +S    VS E R  
Sbjct: 10   LMTIYSTDFFPHCPPFSPQLKPTSHSHHTSIVTCRNPNPNDGYNSRNSPKVGVSAEARPA 69

Query: 1824 HLQPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPDVILSTKLIQGFFSSKNA 1645
            HLQ  D  E + +KLL RS KA K+NE+LYFLEC+VN+   PDVIL TKLI+GFF+ KN 
Sbjct: 70   HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKLIKGFFNFKNI 129

Query: 1644 EKAIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNRMRARGFSPDVVTYNIMI 1465
            EKA RVM+ILE + EPDVFAYNAVISGFCK+NQ ++A ++LNRM+ARGF PD+VTYNIMI
Sbjct: 130  EKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFLPDIVTYNIMI 189

Query: 1464 GSLCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGGTREAMKLFDEMLSKGLQ 1285
            GSLCNR KLGLAL V DQLL DNC P+V+TYTILIEAT VEGG  EAMKL +EML++GL 
Sbjct: 190  GSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGLL 249

Query: 1284 PDMYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNVLLRALLSQGRWKDGEKL 1105
            PDMYTYNAIIR MCK+G+++ A + + SL ++GC+ D++SYN+LLRA L+QG+W +GEKL
Sbjct: 250  PDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLNQGKWDEGEKL 309

Query: 1104 VAEMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEGLTPNTYTYDPLISAFCK 925
            VAEM S   EPN VTY+ILIS+LC  G+++E++  LK+M+++ LTP+TY+YDPLISA CK
Sbjct: 310  VAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALCK 369

Query: 924  EGKMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQALELFGNLSEVGCPPDVST 745
            EG++DLAI  +++M+S+GCLPDIVNYNT+L+A+CKN N +QALE+F  L  +GCPP+VS+
Sbjct: 370  EGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVSS 429

Query: 744  YNTMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCLCRDGMVDEARELLTGME 565
            YNTMISALW+ G+R++AL +   MISK IDPDEIT+ +LISCLCRDG+V+EA  LL  ME
Sbjct: 430  YNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVEEAIGLLDDME 489

Query: 564  SSGFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNETTYILLLEGIGFAGWQA 385
             SGF  TV +YNIVLLGLCKV R+DDAI +   MI+KGC PNETTYILL+EGIGFAGW+ 
Sbjct: 490  QSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWRT 549

Query: 384  EAIETASTLLRKHVITKESVQRLKRTFPTLNADKAVTHTKRK 259
            EA+E A++L  + VI+++S +RL +TFP L+  K +++++ K
Sbjct: 550  EAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETK 591


>ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538784|gb|EEF40384.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 584

 Score =  684 bits (1764), Expect = 0.0
 Identities = 326/556 (58%), Positives = 440/556 (79%), Gaps = 1/556 (0%)
 Frame = -1

Query: 1938 PTSNLHKKAVVRC-RSLMSNDRKSESSQNAKVSTETRSTHLQPDDSTEPNFVKLLYRSFK 1762
            PTSN     +V C R  +++  K  + Q  +VS ETR TH+   D  E + +KLL RS +
Sbjct: 22   PTSNSLHSTIVSCIRPELNDANKVRNPQKVRVSAETRQTHVLSFDFKEVHLMKLLNRSCR 81

Query: 1761 AAKYNEALYFLECMVNRSLKPDVILSTKLIQGFFSSKNAEKAIRVMQILEQYGEPDVFAY 1582
            A KYNE+LYFLECMV++   PDVIL TKLI+GFF+S+N  KA RVM+ILE+YG+PDVFAY
Sbjct: 82   AGKYNESLYFLECMVDKGYTPDVILCTKLIKGFFNSRNIGKATRVMEILERYGKPDVFAY 141

Query: 1581 NAVISGFCKLNQFDSANKILNRMRARGFSPDVVTYNIMIGSLCNRGKLGLALKVFDQLLE 1402
            NA+ISGF K NQ ++AN++L+RM++RGF PDVVTYNIMIGS C+RGKL LAL++F++LL+
Sbjct: 142  NALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNIMIGSFCSRGKLDLALEIFEELLK 201

Query: 1401 DNCQPSVVTYTILIEATAVEGGTREAMKLFDEMLSKGLQPDMYTYNAIIRAMCKDGLMDE 1222
            DNC+P+V+TYTILIEAT ++GG   AMKL DEMLSKGL+PD  TYNAIIR MCK+ ++D+
Sbjct: 202  DNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKGLEPDTLTYNAIIRGMCKEMMVDK 261

Query: 1221 AFDFVKSLPARGCKADLVSYNVLLRALLSQGRWKDGEKLVAEMLSEDIEPNVVTYTILIS 1042
            AF+ ++SL +RGCK D+++YN+LLR LLS+G+W +GEKL++EM+S   +PNVVT++ILI 
Sbjct: 262  AFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGEKLISEMISIGCKPNVVTHSILIG 321

Query: 1041 ALCHLGKLEESLEFLKLMMDEGLTPNTYTYDPLISAFCKEGKMDLAIAFLNHMVSSGCLP 862
             LC  GK+EE++  L+ M ++GL P+ Y YDPLI+ FC+EG++DLA  FL +M+S GCLP
Sbjct: 322  TLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGFCREGRLDLATEFLEYMISDGCLP 381

Query: 861  DIVNYNTLLSAMCKNKNVDQALELFGNLSEVGCPPDVSTYNTMISALWNTGERAQALRIA 682
            DIVNYNT+++ +C+    DQALE+F  L EVGCPP+VS+YNT+ SALW++G+R +AL + 
Sbjct: 382  DIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNVSSYNTLFSALWSSGDRYRALEMI 441

Query: 681  SEMISKAIDPDEITFKTLISCLCRDGMVDEARELLTGMESSGFPLTVATYNIVLLGLCKV 502
             +++++ IDPDEIT+ +LISCLCRDGMVDEA ELL  M+S  +   V +YNI+LLGLCKV
Sbjct: 442  LKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVDMQSGRYRPNVVSYNIILLGLCKV 501

Query: 501  HRVDDAIEVLELMIQKGCLPNETTYILLLEGIGFAGWQAEAIETASTLLRKHVITKESVQ 322
            +R +DAIEVL  M +KGC PNETTYILL+EGIGF+G +AEA+E A++L   + I+++S  
Sbjct: 502  NRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGLRAEAMELANSLHGMNAISEDSFN 561

Query: 321  RLKRTFPTLNADKAVT 274
            RL +TFP L+  K +T
Sbjct: 562  RLNKTFPLLDVYKDLT 577


>ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297330308|gb|EFH60727.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  674 bits (1740), Expect = 0.0
 Identities = 314/525 (59%), Positives = 421/525 (80%)
 Frame = -1

Query: 1866 SSQNAKVSTETRSTHLQPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPDVIL 1687
            ++ +A + TE R  H Q     +   +K+ +RS ++  Y E+L+ LE MV +   PDVIL
Sbjct: 63   TTTDAAIPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPDVIL 122

Query: 1686 STKLIQGFFSSKNAEKAIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNRMRA 1507
             TKLI+GFF+ +N  KA+RVM+ILE++G+PDVFAYNA+I+GFCK+N+ D A ++L+RMR+
Sbjct: 123  CTKLIKGFFTLRNVPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRS 182

Query: 1506 RGFSPDVVTYNIMIGSLCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGGTRE 1327
            + FSPD VTYNIMIGSLC+RGKL LALKV DQLL DNCQP+V+TYTILIEAT +EGG  E
Sbjct: 183  KDFSPDTVTYNIMIGSLCSRGKLDLALKVLDQLLSDNCQPTVITYTILIEATMLEGGVDE 242

Query: 1326 AMKLFDEMLSKGLQPDMYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNVLLR 1147
            A+KL DEMLS+GL+PDM+TYN IIR MCK+G++D AF+ +++L  +GC+ D++SYN+LLR
Sbjct: 243  ALKLLDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMIRNLELKGCEPDVISYNILLR 302

Query: 1146 ALLSQGRWKDGEKLVAEMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEGLTP 967
            ALL+QG+W++GEKL+ +M SE  +PNVVTY+ILI+ LC  GK+EE++  LKLM ++GLTP
Sbjct: 303  ALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTP 362

Query: 966  NTYTYDPLISAFCKEGKMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQALELF 787
            + Y+YDPLI+AFC+EG++D+AI FL  M+S GCLPDIVNYNT+L+ +CKN   DQALE+F
Sbjct: 363  DAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIF 422

Query: 786  GNLSEVGCPPDVSTYNTMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCLCRD 607
            G L EVGC P+ S+YNTM SALW++G++ +AL +  EM+S  IDPDEIT+ ++ISCLCR+
Sbjct: 423  GKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMVSNGIDPDEITYNSMISCLCRE 482

Query: 606  GMVDEARELLTGMESSGFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNETTY 427
            GMVD+A ELL  M S  F  +V TYNIVLLG CK HR++DAI+VL+ M+  GC PNETTY
Sbjct: 483  GMVDKAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAIDVLDSMVGNGCRPNETTY 542

Query: 426  ILLLEGIGFAGWQAEAIETASTLLRKHVITKESVQRLKRTFPTLN 292
             +L+EGIGFAG++AEA+E A+ L+R + I++ S +RL RTFP LN
Sbjct: 543  TVLIEGIGFAGYRAEAMELANDLVRINAISEYSFKRLHRTFPLLN 587


>ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g04760, chloroplastic; Flags: Precursor
            gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein
            [Arabidopsis thaliana] gi|15810359|gb|AAL07067.1| unknown
            protein [Arabidopsis thaliana] gi|22136960|gb|AAM91709.1|
            unknown protein [Arabidopsis thaliana]
            gi|332640611|gb|AEE74132.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 602

 Score =  672 bits (1735), Expect = 0.0
 Identities = 316/528 (59%), Positives = 421/528 (79%)
 Frame = -1

Query: 1875 KSESSQNAKVSTETRSTHLQPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPD 1696
            ++ ++ +A + TE R  H Q     +   +K+ +RS ++  Y E+L+ LE MV +   PD
Sbjct: 64   QTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPD 123

Query: 1695 VILSTKLIQGFFSSKNAEKAIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNR 1516
            VIL TKLI+GFF+ +N  KA+RVM+ILE++G+PDVFAYNA+I+GFCK+N+ D A ++L+R
Sbjct: 124  VILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDR 183

Query: 1515 MRARGFSPDVVTYNIMIGSLCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGG 1336
            MR++ FSPD VTYNIMIGSLC+RGKL LALKV +QLL DNCQP+V+TYTILIEAT +EGG
Sbjct: 184  MRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGG 243

Query: 1335 TREAMKLFDEMLSKGLQPDMYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNV 1156
              EA+KL DEMLS+GL+PDM+TYN IIR MCK+G++D AF+ V++L  +GC+ D++SYN+
Sbjct: 244  VDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNI 303

Query: 1155 LLRALLSQGRWKDGEKLVAEMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEG 976
            LLRALL+QG+W++GEKL+ +M SE  +PNVVTY+ILI+ LC  GK+EE++  LKLM ++G
Sbjct: 304  LLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKG 363

Query: 975  LTPNTYTYDPLISAFCKEGKMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQAL 796
            LTP+ Y+YDPLI+AFC+EG++D+AI FL  M+S GCLPDIVNYNT+L+ +CKN   DQAL
Sbjct: 364  LTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQAL 423

Query: 795  ELFGNLSEVGCPPDVSTYNTMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCL 616
            E+FG L EVGC P+ S+YNTM SALW++G++ +AL +  EM+S  IDPDEIT+ ++ISCL
Sbjct: 424  EIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSMISCL 483

Query: 615  CRDGMVDEARELLTGMESSGFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNE 436
            CR+GMVDEA ELL  M S  F  +V TYNIVLLG CK HR++DAI VLE M+  GC PNE
Sbjct: 484  CREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGNGCRPNE 543

Query: 435  TTYILLLEGIGFAGWQAEAIETASTLLRKHVITKESVQRLKRTFPTLN 292
            TTY +L+EGIGFAG++AEA+E A+ L+R   I++ S +RL RTFP LN
Sbjct: 544  TTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPLLN 591


Top