BLASTX nr result

ID: Cephaelis21_contig00015197 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00015197
         (1638 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003520278.1| PREDICTED: uncharacterized protein LOC100814...   559   e-157
ref|XP_002282555.1| PREDICTED: uncharacterized protein LOC100251...   557   e-156
ref|XP_002522160.1| conserved hypothetical protein [Ricinus comm...   551   e-154
ref|XP_003534779.1| PREDICTED: uncharacterized protein LOC100796...   547   e-153
ref|XP_004144331.1| PREDICTED: uncharacterized protein LOC101219...   536   e-150

>ref|XP_003520278.1| PREDICTED: uncharacterized protein LOC100814636 [Glycine max]
          Length = 490

 Score =  559 bits (1441), Expect = e-157
 Identities = 290/428 (67%), Positives = 323/428 (75%)
 Frame = +1

Query: 67   MAIHKTQKGKPQQRPPRSSILFVLISTLACIALLYLASSFFSTNGFXXXXXXXXXXXXXX 246
            MA H+  K KP+    RS IL +L + +  IALL+L SS  ST                 
Sbjct: 101  MAFHRASKAKPKSA--RSRIL-LLAAAVTAIALLFLFSSLLSTT-----------VSKAN 146

Query: 247  XEDRHRHKGPEKYLYWGDKIDCPGKHCQSCEGLGHQESSLRCALEEALFLGRTFVMPSRM 426
               R + +  EKYLYWG +IDCPGKHC SCEGLGHQESSLRCALEEALFLGRTFVMPSRM
Sbjct: 147  LLQRRQQQPFEKYLYWGTRIDCPGKHCGSCEGLGHQESSLRCALEEALFLGRTFVMPSRM 206

Query: 427  CINPIHNKKGILHQSSDTNSEERWAASSCAMDSLYDITLISERIPVILDNSKQWHRVLST 606
            CINPIHNKKGILH S++ +SEE W ASSCAMDSLYD  L+S+ +PVILDNSK+W+RVLST
Sbjct: 207  CINPIHNKKGILHHSTNASSEELWDASSCAMDSLYDTELMSDTVPVILDNSKEWYRVLST 266

Query: 607  SMKLGSRGIANVQGISRDDLQSKXXXXXXXXXXRTASPLSWY*SW*SFRFMECKDRNNHS 786
            SMKLG+RG+A+V+G+SR +L+            RTASPLSW        FMECKDRNN S
Sbjct: 267  SMKLGARGVAHVEGVSRFELKENSRYSDLLLINRTASPLSW--------FMECKDRNNRS 318

Query: 787  AILLPYSFLPSMAAKKLRNAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGDDRTLHPHLD 966
            AI+LPYSFLPSMAA KLR+AAEKIKALLGDYDAIHVRRGDKIKTRKDRFG  R+LHPHLD
Sbjct: 319  AIMLPYSFLPSMAAGKLRDAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLD 378

Query: 967  RDTRPEFILCRILKWSPPGRTLFIASNERTPGFFSPLALRCTEWGLMTWN*VMYLSCFGY 1146
            RDTRPEF+LCRI KW PPGRTLFIASNERTPGFFSPL+ R                   Y
Sbjct: 379  RDTRPEFMLCRIAKWVPPGRTLFIASNERTPGFFSPLSAR-------------------Y 419

Query: 1147 KLAYSSNYSSILDSLIENNYQLFMVERLIMMGAKTFIKTFKEDENDLGLTDDPKKNTKIW 1326
            +LAYSSNYS ILD LIENNYQLFM+ERLIMMG KTFI+TFKEDE DL LTDDPKKNTK+W
Sbjct: 420  RLAYSSNYSHILDPLIENNYQLFMIERLIMMGGKTFIRTFKEDETDLSLTDDPKKNTKLW 479

Query: 1327 QKPVYTKD 1350
            Q PVY  D
Sbjct: 480  QIPVYNVD 487


>ref|XP_002282555.1| PREDICTED: uncharacterized protein LOC100251727 [Vitis vinifera]
            gi|297741651|emb|CBI32783.3| unnamed protein product
            [Vitis vinifera]
          Length = 398

 Score =  557 bits (1436), Expect = e-156
 Identities = 294/429 (68%), Positives = 323/429 (75%), Gaps = 1/429 (0%)
 Frame = +1

Query: 67   MAIHKTQKGKPQQRPPRSSILFVLISTLACIALLYLASSFFSTNGFXXXXXXXXXXXXXX 246
            MAI+KT + KP  +P RS ILF+++S LA IA L + SS  S NG               
Sbjct: 1    MAIYKTLRTKP--KPNRSPILFLVLSILA-IAFLCILSSSISINGSLFRSEKTQLNTL-- 55

Query: 247  XEDRHRHK-GPEKYLYWGDKIDCPGKHCQSCEGLGHQESSLRCALEEALFLGRTFVMPSR 423
               + RH  G EKYLYWG KIDCPGKHC SCEGLGHQESSLRCALEEALFL RT VMPSR
Sbjct: 56   ---KSRHSVGHEKYLYWGYKIDCPGKHCDSCEGLGHQESSLRCALEEALFLQRTLVMPSR 112

Query: 424  MCINPIHNKKGILHQSSDTNSEERWAASSCAMDSLYDITLISERIPVILDNSKQWHRVLS 603
            MCINPIHNKKGILH SS+  SEE WAA+SCAMDSLYD+ L+S  +PVILDNSK W+RVLS
Sbjct: 113  MCINPIHNKKGILHHSSNATSEEMWAANSCAMDSLYDLDLMSNTVPVILDNSKMWYRVLS 172

Query: 604  TSMKLGSRGIANVQGISRDDLQSKXXXXXXXXXXRTASPLSWY*SW*SFRFMECKDRNNH 783
            TSMKLG+RG+A+V G+SR  L+            RTASPLSW        FMECKDRNN 
Sbjct: 173  TSMKLGARGVAHVAGVSRIALRDNSHYSNLLLINRTASPLSW--------FMECKDRNNR 224

Query: 784  SAILLPYSFLPSMAAKKLRNAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGDDRTLHPHL 963
            SAI+LPYSFLPSMAAKKLR+AA KIK LLGDYDA+HVRRGDKIKTRKDRFG DRTLHPHL
Sbjct: 225  SAIMLPYSFLPSMAAKKLRDAAGKIKVLLGDYDAMHVRRGDKIKTRKDRFGVDRTLHPHL 284

Query: 964  DRDTRPEFILCRILKWSPPGRTLFIASNERTPGFFSPLALRCTEWGLMTWN*VMYLSCFG 1143
            DRDTRPEFIL RI KW PPGRTLFIASNERTPGFFSPL+ R                   
Sbjct: 285  DRDTRPEFILRRIEKWVPPGRTLFIASNERTPGFFSPLSAR------------------- 325

Query: 1144 YKLAYSSNYSSILDSLIENNYQLFMVERLIMMGAKTFIKTFKEDENDLGLTDDPKKNTKI 1323
            Y+LAYSSNYS ILD L+ENNYQLFM+ERLI+MGAKT+I+TFKEDE  L LTDDPKKNTK 
Sbjct: 326  YRLAYSSNYSKILDPLVENNYQLFMIERLILMGAKTYIRTFKEDETYLSLTDDPKKNTKA 385

Query: 1324 WQKPVYTKD 1350
            WQ PVYT D
Sbjct: 386  WQIPVYTSD 394


>ref|XP_002522160.1| conserved hypothetical protein [Ricinus communis]
            gi|223538598|gb|EEF40201.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 405

 Score =  551 bits (1421), Expect = e-154
 Identities = 283/418 (67%), Positives = 318/418 (76%), Gaps = 3/418 (0%)
 Frame = +1

Query: 106  RPPRSSILFVLISTLACIALLYLASSFFSTNGFXXXXXXXXXXXXXXX---EDRHRHKGP 276
            +P  SS +   +  ++ IALL+L SS  STNGF                  +    H   
Sbjct: 11   KPKLSSPILTTVICISSIALLFLFSSLISTNGFSFSSPNTHNIIVDIIRPTKPTRPHSLH 70

Query: 277  EKYLYWGDKIDCPGKHCQSCEGLGHQESSLRCALEEALFLGRTFVMPSRMCINPIHNKKG 456
            +KYLYWG++IDCPGKHC SCEGLGHQESSLRCALEEA+FL RTFVMPS MCINPIHNKKG
Sbjct: 71   DKYLYWGNRIDCPGKHCDSCEGLGHQESSLRCALEEAIFLNRTFVMPSAMCINPIHNKKG 130

Query: 457  ILHQSSDTNSEERWAASSCAMDSLYDITLISERIPVILDNSKQWHRVLSTSMKLGSRGIA 636
            ILH SS++++EERWAA+SCAMDSLYDI LISE IPVILDNSK W++VLSTSMKLG RGIA
Sbjct: 131  ILHHSSNSSAEERWAANSCAMDSLYDIDLISETIPVILDNSKIWYQVLSTSMKLGDRGIA 190

Query: 637  NVQGISRDDLQSKXXXXXXXXXXRTASPLSWY*SW*SFRFMECKDRNNHSAILLPYSFLP 816
            +V+G+SR DL+            RTASPLSW        FMECKDRNN SAILLPYSFLP
Sbjct: 191  HVEGVSRVDLKENSGYSNLLLINRTASPLSW--------FMECKDRNNRSAILLPYSFLP 242

Query: 817  SMAAKKLRNAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGDDRTLHPHLDRDTRPEFILC 996
            SMAA+KLR+AA+KIK LLGDYDAIHVRRGDKIKTRKDRFG  R+LHPHLDRDTRP+F+L 
Sbjct: 243  SMAAEKLRDAADKIKTLLGDYDAIHVRRGDKIKTRKDRFGVARSLHPHLDRDTRPDFMLL 302

Query: 997  RILKWSPPGRTLFIASNERTPGFFSPLALRCTEWGLMTWN*VMYLSCFGYKLAYSSNYSS 1176
            RI KW PPGRTLFIASNE+TPGFFSPL++R                   YKLAYS NYS 
Sbjct: 303  RIEKWVPPGRTLFIASNEKTPGFFSPLSVR-------------------YKLAYSLNYSW 343

Query: 1177 ILDSLIENNYQLFMVERLIMMGAKTFIKTFKEDENDLGLTDDPKKNTKIWQKPVYTKD 1350
            ILD LIENNYQLFM+ERLIMMGA+TFI+TFKED+ DL LTDDPKKNTK WQ PVYT D
Sbjct: 344  ILDPLIENNYQLFMIERLIMMGARTFIRTFKEDDTDLSLTDDPKKNTKKWQIPVYTMD 401


>ref|XP_003534779.1| PREDICTED: uncharacterized protein LOC100796131 [Glycine max]
          Length = 385

 Score =  547 bits (1409), Expect = e-153
 Identities = 289/428 (67%), Positives = 321/428 (75%)
 Frame = +1

Query: 67   MAIHKTQKGKPQQRPPRSSILFVLISTLACIALLYLASSFFSTNGFXXXXXXXXXXXXXX 246
            MA H+  K KP+    RS IL +L + +  IALL L SS  ST                 
Sbjct: 1    MAFHRASKAKPKSA--RSRIL-LLAAAVTAIALLLLFSSLLSTT---------------V 42

Query: 247  XEDRHRHKGPEKYLYWGDKIDCPGKHCQSCEGLGHQESSLRCALEEALFLGRTFVMPSRM 426
             +     +  EKYLYWG +IDCPGKHC+SCEGLGHQESSLRCALEEALFLGRTFVMPS M
Sbjct: 43   SKANLLQQPFEKYLYWGTRIDCPGKHCRSCEGLGHQESSLRCALEEALFLGRTFVMPSGM 102

Query: 427  CINPIHNKKGILHQSSDTNSEERWAASSCAMDSLYDITLISERIPVILDNSKQWHRVLST 606
            CINPIHNKKGILH S++ +SEE W ASSCAMDSLYD  L+S+ +PVILDNSK+W+RVLST
Sbjct: 103  CINPIHNKKGILH-STNASSEELWDASSCAMDSLYDTELMSDTVPVILDNSKEWYRVLST 161

Query: 607  SMKLGSRGIANVQGISRDDLQSKXXXXXXXXXXRTASPLSWY*SW*SFRFMECKDRNNHS 786
            SMKLG+RG+A+V G+SR +L+            RTASPLSW        FMECKDRNN S
Sbjct: 162  SMKLGARGVAHVGGVSRFELKENSRYSDLLLINRTASPLSW--------FMECKDRNNGS 213

Query: 787  AILLPYSFLPSMAAKKLRNAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGDDRTLHPHLD 966
            AI+LPYSFLPSMAA KLR+AAEKIKALLGDYDAIHVRRGDKIKTRKDRFG  R+LHPHLD
Sbjct: 214  AIMLPYSFLPSMAAGKLRDAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGVVRSLHPHLD 273

Query: 967  RDTRPEFILCRILKWSPPGRTLFIASNERTPGFFSPLALRCTEWGLMTWN*VMYLSCFGY 1146
            RDTRPEF+LCRI KW PPGRTLFIASNERTPGFFSPL+ R                   Y
Sbjct: 274  RDTRPEFMLCRIAKWVPPGRTLFIASNERTPGFFSPLSAR-------------------Y 314

Query: 1147 KLAYSSNYSSILDSLIENNYQLFMVERLIMMGAKTFIKTFKEDENDLGLTDDPKKNTKIW 1326
            +LAYSSNYS ILD LIENNYQLFM+ERLIMMGAKTFI+TFKEDE DL LTDDPKKNTK+W
Sbjct: 315  RLAYSSNYSHILDPLIENNYQLFMIERLIMMGAKTFIRTFKEDETDLSLTDDPKKNTKLW 374

Query: 1327 QKPVYTKD 1350
            Q PVY  D
Sbjct: 375  QIPVYNVD 382


>ref|XP_004144331.1| PREDICTED: uncharacterized protein LOC101219097 [Cucumis sativus]
          Length = 407

 Score =  536 bits (1382), Expect = e-150
 Identities = 276/438 (63%), Positives = 323/438 (73%), Gaps = 8/438 (1%)
 Frame = +1

Query: 67   MAIHKTQKGKPQQRPPRSSILFVLISTLACIALLYLASSFFSTNGFXXXXXXXXXXXXXX 246
            MA  +TQK KP+   PRS ++F  +S L+ IA L+L SS  STNG               
Sbjct: 1    MAFPRTQKPKPK---PRSPLIFFFVS-LSAIAFLFLFSSLISTNGSSSFPSSNSIQKIFR 56

Query: 247  XED--------RHRHKGPEKYLYWGDKIDCPGKHCQSCEGLGHQESSLRCALEEALFLGR 402
             ++        RH     +K+LYWG++IDCPGKHC+SCEGLGHQESSLRCALEEA+FL R
Sbjct: 57   LKNLTQKQRRNRHFFSVNDKFLYWGNRIDCPGKHCESCEGLGHQESSLRCALEEAMFLQR 116

Query: 403  TFVMPSRMCINPIHNKKGILHQSSDTNSEERWAASSCAMDSLYDITLISERIPVILDNSK 582
            TFVMPSRMCINPIHNKKG+LHQS+ ++SEE W A+SCAMDSLYD+ LIS+ +PVILDNSK
Sbjct: 117  TFVMPSRMCINPIHNKKGLLHQSN-SSSEESWEANSCAMDSLYDMDLISDTVPVILDNSK 175

Query: 583  QWHRVLSTSMKLGSRGIANVQGISRDDLQSKXXXXXXXXXXRTASPLSWY*SW*SFRFME 762
             W++VLST MKLG+R + +V+ +SR +L+            RTASPLSW        FME
Sbjct: 176  SWYQVLSTGMKLGARAVGHVEKVSRIELRDSSRYSNLLLINRTASPLSW--------FME 227

Query: 763  CKDRNNHSAILLPYSFLPSMAAKKLRNAAEKIKALLGDYDAIHVRRGDKIKTRKDRFGDD 942
            CKDRNNHSA++LPY FLPSMAA+ LR+AAEKIK LLGDYDAIHVRRGDKIKTRKDRFG D
Sbjct: 228  CKDRNNHSAVMLPYKFLPSMAAENLRDAAEKIKGLLGDYDAIHVRRGDKIKTRKDRFGVD 287

Query: 943  RTLHPHLDRDTRPEFILCRILKWSPPGRTLFIASNERTPGFFSPLALRCTEWGLMTWN*V 1122
            R+LHPHLDRDTRPEF+L RI KW P GRTLFIASNER PGFFSPL+ R            
Sbjct: 288  RSLHPHLDRDTRPEFMLKRIAKWVPAGRTLFIASNERIPGFFSPLSAR------------ 335

Query: 1123 MYLSCFGYKLAYSSNYSSILDSLIENNYQLFMVERLIMMGAKTFIKTFKEDENDLGLTDD 1302
                   YKLAYSSNYS ILD +++NNYQLFM+ERLIM GAKT I+TFKED+ DL LTDD
Sbjct: 336  -------YKLAYSSNYSDILDPVVQNNYQLFMIERLIMAGAKTLIRTFKEDDTDLSLTDD 388

Query: 1303 PKKNTKIWQKPVYTKDTK 1356
            PKKNTK WQ PVYT + +
Sbjct: 389  PKKNTKAWQIPVYTDEER 406