BLASTX nr result

ID: Cephaelis21_contig00012052 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00012052
         (1916 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI40732.3| unnamed protein product [Vitis vinifera]              586   e-165
emb|CAN84084.1| hypothetical protein VITISV_018999 [Vitis vinifera]   585   e-164
ref|XP_002307761.1| predicted protein [Populus trichocarpa] gi|2...   575   e-161
ref|NP_199195.4| pentatricopeptide repeat-containing protein [Ar...   543   e-152
ref|XP_003597616.1| hypothetical protein MTR_2g100200 [Medicago ...   533   e-149

>emb|CBI40732.3| unnamed protein product [Vitis vinifera]
          Length = 520

 Score =  586 bits (1510), Expect = e-165
 Identities = 290/518 (55%), Positives = 390/518 (75%), Gaps = 9/518 (1%)
 Frame = -2

Query: 1657 KQDIHVDESHILTQLSDILPISNGPSRICCD----KPLNSVTIKA-----AADGFLSPED 1505
            K+  + +E  +L QLS +LPI       CC+    KP    + K      A DGFLSP +
Sbjct: 10   KRPSNFNERDVLYQLSGLLPI-------CCNTSISKPFTENSPKEQLKTRAVDGFLSPGE 62

Query: 1504 KFRGVFLQKLRGKSAVEQALTSVGIEITVDLLDKVVSRGNLSGDSMVVFFNWARLRSKFS 1325
            K RGVF+Q+LRGK+A+E ALT+VGI++T+D++ +V++RGNL G++MV+FFNWA  +    
Sbjct: 63   KLRGVFIQRLRGKAAIELALTNVGIDLTIDIVSEVINRGNLGGEAMVIFFNWAVKQPTIP 122

Query: 1324 EDVDSYNLIIKALGRRKYFRHMIEMLGDMSKRGINPNSDTFFAVLDSFIRSRQVSKGIKM 1145
            +DVD+YN+IIKALGRRK+   ++++L DM  +GI+PN +T   V+DSFI++RQVSK I+M
Sbjct: 123  KDVDTYNVIIKALGRRKFIEFVVKVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEM 182

Query: 1144 WDNLEDFGLKCNTGMFNVLLKCLTLRAYVGTACSLINKMRGKIHFDSTTYNLVIGGWSRF 965
            + NLE+FG KC+T   NVLL+CL  R++VG A    N M+G I F+  TYN++IGGWS++
Sbjct: 183  FRNLEEFGGKCDTESLNVLLQCLCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKY 242

Query: 964  GRIIEVERTLEAMVEDGINPDSSTYSYILEGLGRAGRIDSAVKVFKELQESGSMLDVEVY 785
            G+I E+ER L+AMV DG +P+  T+S+++EGLGRAGRID AV+VF  ++E+G + +  VY
Sbjct: 243  GKIGEMERCLKAMVADGFSPNCLTFSHLIEGLGRAGRIDDAVEVFHHMEETGCVPNACVY 302

Query: 784  NAMIANYISCGEMDEGLKYFEQLLNSNCEPNMVTYVRLISGCLKARRVADAIEMFDQMLD 605
            NA+I+N+IS  + DE LKY+  +++SNC+PNM TY +LI   LKAR+VADA+EM D+M+ 
Sbjct: 303  NALISNFISTRDFDECLKYYNFMVSSNCDPNMDTYTKLIVAFLKARKVADALEMLDEMVG 362

Query: 604  RGIIPSTGTITSFIEPLCGYGPPHAALMIYEKSRKVGCRISLNSYKLLLNRLSKFGKCQM 425
            RG+IP+TG ITSFIEPLC YGPPHAA+MIY+K+RKVGCRISL++YKLLL RLS+FGKC M
Sbjct: 363  RGMIPTTGAITSFIEPLCQYGPPHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGM 422

Query: 424  LVNIWREMEKCGYSSDVQVYEYAINGFCKIGQLENAVLVMEESLSKGFCPSRLICXXXXX 245
            L+N+W EM++ GYSSD +VYEY ING C IGQL+ AVLVMEESL KGFCPSRLI      
Sbjct: 423  LLNLWDEMQESGYSSDTEVYEYVINGLCNIGQLDTAVLVMEESLHKGFCPSRLIRSKLNN 482

Query: 244  XXXXXXKAEVAYRLSLKIKVAHGNEKAKTRWRVKGWHF 131
                  K E+AY+L LKIK+A  N+ A+  WR  GWHF
Sbjct: 483  KLLASNKVEMAYKLFLKIKIARQNDNARRFWRGNGWHF 520


>emb|CAN84084.1| hypothetical protein VITISV_018999 [Vitis vinifera]
          Length = 561

 Score =  585 bits (1507), Expect = e-164
 Identities = 297/542 (54%), Positives = 394/542 (72%), Gaps = 9/542 (1%)
 Frame = -2

Query: 1729 LFPFSTHETPLTFTKDPSLQNHLHKQDIHVDESHILTQLSDILPISNGPSRICCD----K 1562
            LF FST +       D    N + K+  + +E  +L QLS +LPI       CC+    K
Sbjct: 28   LFQFSTLQVTSNPLMDEPTDNQI-KRPSNFNERDVLYQLSGLLPI-------CCNTSISK 79

Query: 1561 PLNSVTIKA-----AADGFLSPEDKFRGVFLQKLRGKSAVEQALTSVGIEITVDLLDKVV 1397
            P    + K      A DGFLSP +K RGVF+Q+LRGK+A+E ALT+VGI++T+D++ +V 
Sbjct: 80   PFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRGKAAIELALTNVGIDLTIDIVSEVX 139

Query: 1396 SRGNLSGDSMVVFFNWARLRSKFSEDVDSYNLIIKALGRRKYFRHMIEMLGDMSKRGINP 1217
            +RGNL G++MV FFNWA  +    +DVD+YN+IIKALGRRK+    + +L DM  +GI+P
Sbjct: 140  NRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKALGRRKFIEFXVXVLKDMHIQGISP 199

Query: 1216 NSDTFFAVLDSFIRSRQVSKGIKMWDNLEDFGLKCNTGMFNVLLKCLTLRAYVGTACSLI 1037
            N +T   V+DSFI++RQVSK I+M+ NLE+FG KC+T   NVLL+CL  R++VG A    
Sbjct: 200  NYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCDTESLNVLLQCLCQRSHVGAANLFF 259

Query: 1036 NKMRGKIHFDSTTYNLVIGGWSRFGRIIEVERTLEAMVEDGINPDSSTYSYILEGLGRAG 857
            N M+G I F+  TYN++IGGWS++G+I E+ER L+AMV DG +P+  T+S+++EGLGRAG
Sbjct: 260  NAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKAMVADGFSPNCLTFSHLIEGLGRAG 319

Query: 856  RIDSAVKVFKELQESGSMLDVEVYNAMIANYISCGEMDEGLKYFEQLLNSNCEPNMVTYV 677
            RID AV+VF  ++E+G + +  VYNA+I+N+IS  + DE LKY+  +++SNC+PNM TY 
Sbjct: 320  RIDDAVEVFHHMEETGCVPNACVYNALISNFISTRDFDECLKYYNFMVSSNCDPNMDTYT 379

Query: 676  RLISGCLKARRVADAIEMFDQMLDRGIIPSTGTITSFIEPLCGYGPPHAALMIYEKSRKV 497
            +LI   LKAR+VADA+EM D+M+ RG+IP+TG ITSFIEPLC YGPPHAA+MIY+K+RKV
Sbjct: 380  KLIVAFLKARKVADALEMLDEMVGRGMIPTTGAITSFIEPLCQYGPPHAAMMIYKKARKV 439

Query: 496  GCRISLNSYKLLLNRLSKFGKCQMLVNIWREMEKCGYSSDVQVYEYAINGFCKIGQLENA 317
            GCRISL++YKLLL RLS+FGKC ML+N+W EM++ GYSSD +VYEY ING C IGQL+ A
Sbjct: 440  GCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESGYSSDTEVYEYVINGLCNIGQLDTA 499

Query: 316  VLVMEESLSKGFCPSRLICXXXXXXXXXXXKAEVAYRLSLKIKVAHGNEKAKTRWRVKGW 137
            VLVMEESL KGFCPSRLI            K E+AY+L LKIK A  N+ A+  WR  GW
Sbjct: 500  VLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAYKLFLKIKXARQNDNARRFWRGNGW 559

Query: 136  HF 131
            HF
Sbjct: 560  HF 561


>ref|XP_002307761.1| predicted protein [Populus trichocarpa] gi|222857210|gb|EEE94757.1|
            predicted protein [Populus trichocarpa]
          Length = 563

 Score =  575 bits (1481), Expect = e-161
 Identities = 283/548 (51%), Positives = 391/548 (71%), Gaps = 9/548 (1%)
 Frame = -2

Query: 1747 PSLLAFLFPFSTHETPLTFTKDPSLQNHLHKQDIHVDESHILTQLSDILPISNGPSRICC 1568
            P LL    PFST + P   T   ++     K   ++ E ++L +LS++LPIS  P     
Sbjct: 24   PHLLRSSIPFSTLDPPQFATSQDNI-----KLQYNLGEDYVLNELSNLLPISPKPP---I 75

Query: 1567 DKPLN--------SVTIKAAADGFLSPEDKFRGVFLQKLRGKSAVEQALTSVGIEITVDL 1412
              P N         V I+   D FLSPE+K RGVF+QK++GKS +E+ALT   +++++D+
Sbjct: 76   PHPYNHDRSISNKQVEIRPVFDRFLSPEEKLRGVFVQKIKGKSGIERALTECSVDLSLDV 135

Query: 1411 LDKVVSRGNLSGDSMVVFFNWARLRSKFSEDVDSYNLIIKALGRRKYFRHMIEMLGDMSK 1232
            + +V++RGNL G++M++FFNWA  +   S+DVDSYN++I+ALGRRK+   M++ L ++  
Sbjct: 136  VAEVLNRGNLGGEAMIMFFNWAIKQPMISKDVDSYNVVIRALGRRKFIDFMVKFLHELRV 195

Query: 1231 RGINPNSDTFFAVLDSFIRSRQVSKGIKMWDNLED-FGLKCNTGMFNVLLKCLTLRAYVG 1055
             G++ NS+TF  V+DS +R+R+V K I+M+ NLE+ FG + +    NVLL+CL  R++VG
Sbjct: 196  EGVSMNSETFSIVIDSLVRARRVYKAIQMFGNLEEEFGFERDAESLNVLLQCLCRRSHVG 255

Query: 1054 TACSLINKMRGKIHFDSTTYNLVIGGWSRFGRIIEVERTLEAMVEDGINPDSSTYSYILE 875
             A S  N ++GKI F+  TYN++IGGWS+FGR+ E++R  E M EDG +PD  ++SY+LE
Sbjct: 256  AANSYFNSVKGKIPFNCMTYNVIIGGWSKFGRVSEMQRVFEEMEEDGFSPDCLSFSYLLE 315

Query: 874  GLGRAGRIDSAVKVFKELQESGSMLDVEVYNAMIANYISCGEMDEGLKYFEQLLNSNCEP 695
            GLGRAG+I+ AV +F  ++E G + D  VYNAMI+N+IS G  DE +KY+  LL+ NC+P
Sbjct: 316  GLGRAGKIEDAVMIFGSMEEKGCVPDTNVYNAMISNFISVGNFDECMKYYRCLLSKNCDP 375

Query: 694  NMVTYVRLISGCLKARRVADAIEMFDQMLDRGIIPSTGTITSFIEPLCGYGPPHAALMIY 515
            N+ TY R+ISG +KA +VADA+EMFD+MLDRG++  TGT+TSFIEPLC +GPPHAA++IY
Sbjct: 376  NIDTYTRMISGLIKASKVADALEMFDEMLDRGMVTKTGTVTSFIEPLCSFGPPHAAMVIY 435

Query: 514  EKSRKVGCRISLNSYKLLLNRLSKFGKCQMLVNIWREMEKCGYSSDVQVYEYAINGFCKI 335
             K+RKVGC+ISL++YKLLL RLS+FGKC M++ IW EM++ GYSSD++VYEY I+G C I
Sbjct: 436  TKARKVGCKISLSAYKLLLMRLSRFGKCGMMLKIWDEMQESGYSSDMEVYEYLISGLCNI 495

Query: 334  GQLENAVLVMEESLSKGFCPSRLICXXXXXXXXXXXKAEVAYRLSLKIKVAHGNEKAKTR 155
            GQ ENAVLVMEES+ KGFCPSR IC           K E AYRL LKIK A  +E A+  
Sbjct: 496  GQFENAVLVMEESMRKGFCPSRFICSKLNNKLLASNKVERAYRLFLKIKHARHSENARRC 555

Query: 154  WRVKGWHF 131
            WR  GWHF
Sbjct: 556  WRSNGWHF 563


>ref|NP_199195.4| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|223635652|sp|P0C8R0.1|PP416_ARATH RecName:
            Full=Putative pentatricopeptide repeat-containing protein
            At5g43820 gi|332007631|gb|AED95014.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 546

 Score =  543 bits (1400), Expect = e-152
 Identities = 252/504 (50%), Positives = 369/504 (73%)
 Frame = -2

Query: 1642 VDESHILTQLSDILPISNGPSRICCDKPLNSVTIKAAADGFLSPEDKFRGVFLQKLRGKS 1463
            VDES++L +LS +LPIS+  + +   K  +S   + A D FLS EDK RGVFLQKL+GKS
Sbjct: 45   VDESYVLAELSSLLPISSNKTSV--SKEDSSSKNQVAIDSFLSAEDKLRGVFLQKLKGKS 102

Query: 1462 AVEQALTSVGIEITVDLLDKVVSRGNLSGDSMVVFFNWARLRSKFSEDVDSYNLIIKALG 1283
            A++++L+S+GI +++D++  V++RGNLSG++MV FF+WA      ++DV SY++I++ALG
Sbjct: 103  AIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSVILRALG 162

Query: 1282 RRKYFRHMIEMLGDMSKRGINPNSDTFFAVLDSFIRSRQVSKGIKMWDNLEDFGLKCNTG 1103
            RRK F  M+++L  M   G+NP+ +     +DSF+R   V + I++++  E FG+KC+T 
Sbjct: 163  RRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFGVKCSTE 222

Query: 1102 MFNVLLKCLTLRAYVGTACSLINKMRGKIHFDSTTYNLVIGGWSRFGRIIEVERTLEAMV 923
             FN LL+CL  R++V  A S+ N  +G I FDS +YN++I GWS+ G + E+E+ L+ MV
Sbjct: 223  SFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEKVLKEMV 282

Query: 922  EDGINPDSSTYSYILEGLGRAGRIDSAVKVFKELQESGSMLDVEVYNAMIANYISCGEMD 743
            E G  PD  +YS+++EGLGR GRI+ +V++F  ++  G++ D  VYNAMI N+IS  + D
Sbjct: 283  ESGFGPDCLSYSHLIEGLGRTGRINDSVEIFDNIKHKGNVPDANVYNAMICNFISARDFD 342

Query: 742  EGLKYFEQLLNSNCEPNMVTYVRLISGCLKARRVADAIEMFDQMLDRGIIPSTGTITSFI 563
            E ++Y+ ++L+  CEPN+ TY +L+SG +K R+V+DA+E+F++ML RG++P+TG +TSF+
Sbjct: 343  ESMRYYRRMLDEECEPNLETYSKLVSGLIKGRKVSDALEIFEEMLSRGVLPTTGLVTSFL 402

Query: 562  EPLCGYGPPHAALMIYEKSRKVGCRISLNSYKLLLNRLSKFGKCQMLVNIWREMEKCGYS 383
            +PLC YGPPHAA++IY+KSRK GCRIS ++YKLLL RLS+FGKC ML+N+W EM++ GY 
Sbjct: 403  KPLCSYGPPHAAMVIYQKSRKAGCRISESAYKLLLKRLSRFGKCGMLLNVWDEMQESGYP 462

Query: 382  SDVQVYEYAINGFCKIGQLENAVLVMEESLSKGFCPSRLICXXXXXXXXXXXKAEVAYRL 203
            SDV+VYEY ++G C IG LENAVLVMEE++ KGFCP+R +            K E+AY+L
Sbjct: 463  SDVEVYEYIVDGLCIIGHLENAVLVMEEAMRKGFCPNRFVYSRLSSKLMASNKTELAYKL 522

Query: 202  SLKIKVAHGNEKAKTRWRVKGWHF 131
             LKIK A   E A++ WR  GWHF
Sbjct: 523  FLKIKKARATENARSFWRSNGWHF 546


>ref|XP_003597616.1| hypothetical protein MTR_2g100200 [Medicago truncatula]
            gi|124360397|gb|ABN08410.1| Pentatricopeptide repeat
            [Medicago truncatula] gi|355486664|gb|AES67867.1|
            hypothetical protein MTR_2g100200 [Medicago truncatula]
          Length = 527

 Score =  533 bits (1374), Expect = e-149
 Identities = 270/538 (50%), Positives = 375/538 (69%), Gaps = 2/538 (0%)
 Frame = -2

Query: 1738 LAFLFPFST--HETPLTFTKDPSLQNHLHKQDIHVDESHILTQLSDILPISNGPSRICCD 1565
            L F+  FS   H  P    +  SLQN       ++DE  IL Q+S +LPI          
Sbjct: 6    LQFVLHFSKPKHPLPKLHQRFSSLQN-----SSNLDERLILHQISQLLPIPTS------- 53

Query: 1564 KPLNSVTIKAAADGFLSPEDKFRGVFLQKLRGKSAVEQALTSVGIEITVDLLDKVVSRGN 1385
            K  +S +   + DGFLSPEDK RG+FLQKL+GK+A+EQAL++V I++ VD++ KV++ GN
Sbjct: 54   KTPDSQSDSKSIDGFLSPEDKLRGIFLQKLKGKAAIEQALSNVCIDVNVDIIGKVLNFGN 113

Query: 1384 LSGDSMVVFFNWARLRSKFSEDVDSYNLIIKALGRRKYFRHMIEMLGDMSKRGINPNSDT 1205
            L G++MV+FFNWA  +     DV SY++I+KALGRRK+F  M+++L +M   GI  +   
Sbjct: 114  LGGEAMVMFFNWALKQPMVPRDVGSYHVIVKALGRRKFFVFMMQVLDEMRLNGIKADLLM 173

Query: 1204 FFAVLDSFIRSRQVSKGIKMWDNLEDFGLKCNTGMFNVLLKCLTLRAYVGTACSLINKMR 1025
               V+DSF+ +  VSK I+++ NL+D GL  +T + NVLL CL  R +VG A S+ N M+
Sbjct: 174  LSIVIDSFVNAGHVSKAIQLFGNLDDLGLCRDTEVLNVLLSCLCRRCHVGAAASVFNSMK 233

Query: 1024 GKIHFDSTTYNLVIGGWSRFGRIIEVERTLEAMVEDGINPDSSTYSYILEGLGRAGRIDS 845
            GK+ F+  TYN+V+GGWS+ GR+ E+E+ ++ M  +G +PD +T ++ LEGLGRAGR+D 
Sbjct: 234  GKVSFNVDTYNVVVGGWSKLGRVNEIEKVMKEMEVEGFSPDFNTLAFFLEGLGRAGRMDE 293

Query: 844  AVKVFKELQESGSMLDVEVYNAMIANYISCGEMDEGLKYFEQLLNSNCEPNMVTYVRLIS 665
            AV+VF  ++E     D  +YNAMI N+IS G+ D  +KY+  +L+ NCEPN+ TY R+I+
Sbjct: 294  AVEVFGSMKEK----DTAIYNAMIFNFISIGDFDGFMKYYNGMLSDNCEPNIHTYSRMIT 349

Query: 664  GCLKARRVADAIEMFDQMLDRGIIPSTGTITSFIEPLCGYGPPHAALMIYEKSRKVGCRI 485
              L+ R+VADA+ MFD+ML +G++P TGTITSFI+ LC YGPP+AA+MIY+K+RK+ C+I
Sbjct: 350  AFLRTRKVADALLMFDEMLRQGVVPPTGTITSFIKQLCSYGPPYAAMMIYKKTRKLECKI 409

Query: 484  SLNSYKLLLNRLSKFGKCQMLVNIWREMEKCGYSSDVQVYEYAINGFCKIGQLENAVLVM 305
            S+ +YK+LL RLSKFGKC  L+++W+EM++CGYSSDV+VYEY I+G   IGQLENAVLVM
Sbjct: 410  SMEAYKILLMRLSKFGKCGSLLSVWQEMQECGYSSDVEVYEYIISGLYNIGQLENAVLVM 469

Query: 304  EESLSKGFCPSRLICXXXXXXXXXXXKAEVAYRLSLKIKVAHGNEKAKTRWRVKGWHF 131
            EE+L KGFCPSRL+              E AYRL LKIK A   + A++ WR  GWHF
Sbjct: 470  EEALRKGFCPSRLVYSKLSNKLLASNLTERAYRLFLKIKHARSLKNARSYWRDNGWHF 527

