BLASTX nr result

ID: Cephaelis21_contig00015594 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00015594
         (2072 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283370.1| PREDICTED: uncharacterized protein LOC100250...   530   e-148
ref|XP_002331717.1| predicted protein [Populus trichocarpa] gi|2...   461   e-127
ref|XP_004146089.1| PREDICTED: uncharacterized protein LOC101202...   416   e-113
ref|XP_002524524.1| conserved hypothetical protein [Ricinus comm...   411   e-112
ref|XP_003540498.1| PREDICTED: uncharacterized protein LOC100812...   392   e-106

>ref|XP_002283370.1| PREDICTED: uncharacterized protein LOC100250468 [Vitis vinifera]
          Length = 524

 Score =  530 bits (1365), Expect = e-148
 Identities = 293/529 (55%), Positives = 363/529 (68%), Gaps = 7/529 (1%)
 Frame = +2

Query: 137  MAETETMGILDEIQYLASDKLQVVSYKWLSRNFLVSSNDAKRLLQNFVEKHGTGIEVLYT 316
            MA+ ET+GIL+EI+ L SDKLQVVSYKWLSR+FLV+SN AKRLL  FVEKHG G+EV+YT
Sbjct: 1    MADIETLGILEEIEALVSDKLQVVSYKWLSRSFLVTSNVAKRLLHEFVEKHGGGLEVVYT 60

Query: 317  LAGWMKNDPSLYRVRLASKPTLAEAKQEFDENCSVEIYSVQACLPKDPAVLGNAECVQAG 496
            L+GW+KNDP +Y +RL S+P LAEAKQEFD +CSV++YSVQAC+PKDPA L NAE VQA 
Sbjct: 61   LSGWLKNDPPVYHIRLVSEPKLAEAKQEFDGHCSVQVYSVQACIPKDPAALWNAEFVQAE 120

Query: 497  GLFKQPHAADNCLRDNRFCRVSNSFVKRNGGGAQGSSMDAQVKNAGAIGSSAKNFPNQSP 676
             LFKQP A DNCLRDNRFC +SNSFVKR   GA  S++ +Q K  G +G S      Q+ 
Sbjct: 121  ELFKQPFAVDNCLRDNRFCGISNSFVKRIAEGAPVSNVASQPKTTGLLGPSKSISAPQTI 180

Query: 677  TLPQPHLKRVQESG---KLQSPDMVKDVKNKSHITGAREQSSKPAQDKEKVPVLPVNIKK 847
             + QP  ++VQ+S     LQSP +V DVK++S+ T A +Q+SKP  DK K P LP N KK
Sbjct: 181  AVQQPQERKVQQSSPKVSLQSPSVVTDVKDESNGTRAHDQASKPPADKGKAPPLPANKKK 240

Query: 848  VSDNKSSGNKGALTNMWGRASSKLKPECASAETNSTHNSSDAQICAHEEIERESSDDEYQ 1027
              ++KSSG +G+L NMWGRAS K KP CA  +      S++AQICA E +E  SSD++ Q
Sbjct: 241  GQNDKSSGTEGSLANMWGRASVKSKPSCAPVDV----VSAEAQICAREAVEGASSDEDGQ 296

Query: 1028 GVHSKRPSNGGSGRKRKVVFEDSEEEDEYKDAVNLASPDPPKRQPMVASKQTSHSLDLEK 1207
              + KR SNG  GRKR+VVF+ S+EE+E++DAVNLASPDPPK +  + SKQ+   L  +K
Sbjct: 297  DANFKRASNGDGGRKRRVVFDFSDEEEEFEDAVNLASPDPPKGKSCIVSKQSPKPLVPDK 356

Query: 1208 -NMLNFXXXXXXXXXXXXXXADRGTNQILKEESLALSKGDNTKSFSLEKGVNNV-TVGTT 1381
             N+ +               ++R +N   +E+S  LSKG N    S +K    V  +   
Sbjct: 357  INLNSDQQKQDKPKVKEEKSSNRESNLSPREDSSVLSKGKNNGISSSDKIAGGVPEIDVN 416

Query: 1382 LKDRKTDTAPRSPQRRKVLKTRIDERGREVTEVVWEGGEAET-KSDSNTMKKVDNMATSN 1558
             KD+ TD AP SP+RRKV+KTRIDERGREVTEVVWE GEAET K+DSN  KK +N   +N
Sbjct: 417  KKDKVTDAAPNSPKRRKVMKTRIDERGREVTEVVWE-GEAETKKADSNETKKSENSIVTN 475

Query: 1559 TANRPSAIKKSPAVGNNAPLNQV-XXXXXXXXXXXDPKQGNILSFFKRV 1702
             ANR    KKSPAVGN AP N              DPKQGNI+SFFKRV
Sbjct: 476  AANRAPPAKKSPAVGNTAPSNVTGKAGSKKAGNSKDPKQGNIMSFFKRV 524


>ref|XP_002331717.1| predicted protein [Populus trichocarpa] gi|222874323|gb|EEF11454.1|
            predicted protein [Populus trichocarpa]
          Length = 508

 Score =  461 bits (1186), Expect = e-127
 Identities = 280/531 (52%), Positives = 343/531 (64%), Gaps = 13/531 (2%)
 Frame = +2

Query: 149  ETMGILDEIQYLASDKLQVVSYKWLSRNFLVSSNDAKRLLQNFVEKHGTGIEVLYTLAGW 328
            ET+GILDEI+ L SDKLQVVSYKWLSRNF+VSSN AKRLLQ FV   G+G EV+YTL+GW
Sbjct: 2    ETLGILDEIEVLVSDKLQVVSYKWLSRNFMVSSNAAKRLLQEFVNTRGSGFEVVYTLSGW 61

Query: 329  MKNDPSLYRVRLASKPTLAEAKQEFDENCSVEIYSVQACLPKDPAVLGNAECVQAGGLFK 508
            +KN+PS Y +RL S P L EAKQEF+ NCSV++YSVQAC+PKDPA L NAE VQA  LFK
Sbjct: 62   LKNNPSSYHIRLVSGPKLEEAKQEFNGNCSVQVYSVQACIPKDPAALWNAEFVQAEELFK 121

Query: 509  QPHAADNCLRDNRFCRVSNSFVKRN-GGGAQGSSMDAQVKNAGAIGSSAKNFPNQSPTLP 685
            Q    DNCLRDNRFC + NSFVK N  G A   SM+  V               Q+ T P
Sbjct: 122  QSFTVDNCLRDNRFCGILNSFVKYNCDGPAATKSMEIPVIQV-----------CQTITAP 170

Query: 686  QPHLKRVQESGKL--QSPDMVKDVKNKSHITGAREQSSKPAQDKEKVPVLPVNIKK-VSD 856
                 +VQ+S K+   SP++V  VK++ + TG R+ ++K   D+EKV +LP N KK  SD
Sbjct: 171  PSKQTKVQQSPKVGPPSPNLVNSVKSERNGTGVRDLATKQTVDEEKVSLLPANKKKGQSD 230

Query: 857  NKSSGNKGALTNMWGRASSKLKPECASAETNSTH-----NSSDAQICAHEEIERESSDDE 1021
              SSGN G+L N+WGRAS+K KP  A A+ N  H      S++AQI A EEIE  SSDDE
Sbjct: 231  KTSSGNGGSLANLWGRASAKSKPSSAQAD-NDKHIPNPTVSAEAQISACEEIEIGSSDDE 289

Query: 1022 YQGVHSKRPSNGGSGRKRKVVFEDSEEEDEYKDAVNLASPDPPKRQPMVASKQTSHSLDL 1201
             QGV+ KR SNG S RKR+VV + S  +DE++DAVNLASP+ PK        Q+S +L L
Sbjct: 290  AQGVNFKRTSNGDSSRKRRVVLDYS--DDEFEDAVNLASPELPK-------GQSSTALVL 340

Query: 1202 EKNMLNFXXXXXXXXXXXXXXADRGT-NQILKEESLALSKGDNTKSFSLEKGVNNVTVGT 1378
            EK   +F              +  G  NQ+L+++S ++ +G ++K+ SLEK  ++ T   
Sbjct: 341  EKP--HFKKQAEDKPVIKVEKSTEGAPNQLLRDDS-SVGEGIDSKTSSLEKIQSDFTFCD 397

Query: 1379 TLKDRKTDTAPRSPQRRKVLKTRIDERGREVTEVVWEGGEAETK--SDSNTMKKVDNMAT 1552
              KD     AP SP+RRKVLKTRIDERGREVTEVVWEG E ETK     ++ KK +N A 
Sbjct: 398  AQKDTAAGAAPNSPKRRKVLKTRIDERGREVTEVVWEGEETETKKVESQDSKKKAENTAV 457

Query: 1553 SNTA-NRPSAIKKSPAVGNNAPLNQVXXXXXXXXXXXDPKQGNILSFFKRV 1702
            +NT  NR    KKSPA GN AP N             DPKQGNILSFFKRV
Sbjct: 458  TNTVNNRAPLTKKSPAAGNGAPSNPGSKAGNKKGGNKDPKQGNILSFFKRV 508


>ref|XP_004146089.1| PREDICTED: uncharacterized protein LOC101202933 [Cucumis sativus]
            gi|449518067|ref|XP_004166065.1| PREDICTED:
            uncharacterized protein LOC101228942 [Cucumis sativus]
          Length = 520

 Score =  416 bits (1068), Expect = e-113
 Identities = 257/534 (48%), Positives = 332/534 (62%), Gaps = 12/534 (2%)
 Frame = +2

Query: 137  MAETETMGILDEIQYLASDKLQVVSYKWLSRNFLVSSNDAKRLLQNFVEKHGTGIEVLYT 316
            MAE ET+GIL +I+ L +DKLQVVSYKWLSR++L+SS+ AKRLLQ FVEKH +G++V+Y 
Sbjct: 1    MAEIETLGILQDIESLVADKLQVVSYKWLSRSYLISSDTAKRLLQEFVEKHESGLQVVYA 60

Query: 317  LAGWMKNDPSLYRVRLASKPTLAEAKQEFDENCSVEIYSVQACLPKDPAVLGNAECVQAG 496
            L+GW+K DP  Y +RL S   L EAKQ+FD  CS+++YSVQA +PKDPA L NAE VQA 
Sbjct: 61   LSGWLKKDPPSYHIRLVSGSKLPEAKQDFDGTCSIQVYSVQASIPKDPAALWNAEFVQAE 120

Query: 497  GLFKQPHAADNCLRDNRFCRVSNSFVKRNGGGAQGSSMDAQVKNAGAIGSSAKNFPNQSP 676
             LFKQP  ADNCLRDNRFC +SNS+VKRN      S   +Q K+A  + SS K    Q+ 
Sbjct: 121  ELFKQPFTADNCLRDNRFCGISNSYVKRNVDEIPASVAASQPKSAVDLESSKKMTSYQNT 180

Query: 677  TLPQP---HLKRVQESGKLQSPDMVKDVKNKSHITGAREQSSKPAQDKEKVPVLPVNIKK 847
            T+ QP    + +V  +  LQS  +VK+VK++ + T    Q+SKP   KEKV  LP N KK
Sbjct: 181  TVLQPQKSEMPKVSPNVGLQSSTVVKEVKSEGNRTD--HQASKPIAVKEKVASLPTNKKK 238

Query: 848  VSDNKSSGNKG-ALTNMWGR--ASSKLKPECASAE----TNSTHNSSDAQICAHEEIERE 1006
               +K+  + G +L N+WGR    SKL  + A A      N T +S++AQICAHE ++ E
Sbjct: 239  GQGDKTCSSTGSSLANLWGRVPTKSKLGDDHADANRATAANPTVSSAEAQICAHEALQIE 298

Query: 1007 SSDDEYQGVHSKRPSNGGSGRKRKVVFEDSEEEDEYKDAVNLASPDPPKRQPMVASKQTS 1186
            +SDD+ Q V+ KR SN  SGRKR+VVF+ S++E E++DAV+LASP+ PK Q  +  KQ +
Sbjct: 299  NSDDDEQDVNIKRSSN-ESGRKRRVVFDFSDDE-EFEDAVSLASPENPKDQSCLDLKQHT 356

Query: 1187 HSLDLEKNMLNFXXXXXXXXXXXXXXADRGTNQILKEESLALSKGDNTKSFSLEKGVNNV 1366
              L   K  LN                +  T+++  E+SL   K  N    S EK     
Sbjct: 357  -ELPKGKAHLN----NDEQLNGKLKIKEEKTSEL--EQSLVEEKQHNC---STEKNEVCA 406

Query: 1367 TVGTTLK-DRKTDTAPRSPQRRKVLKTRIDERGREVTEVVWEGGEAETKSDSNTMKKV-D 1540
                ++K +   D  P SP+RRKVL+TRID+RGREV EVVWEG E + K D  +  K+ D
Sbjct: 407  HENDSIKVENPVDATPASPKRRKVLRTRIDDRGREVNEVVWEGEEQKQKKDDVSSAKISD 466

Query: 1541 NMATSNTANRPSAIKKSPAVGNNAPLNQVXXXXXXXXXXXDPKQGNILSFFKRV 1702
              A   T NRP A KKSPA+GN      V            PKQGNILSFFKRV
Sbjct: 467  QKAVETTTNRPPAAKKSPALGNGGANPAVKAGAKKPGNAAGPKQGNILSFFKRV 520


>ref|XP_002524524.1| conserved hypothetical protein [Ricinus communis]
            gi|223536198|gb|EEF37851.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 573

 Score =  411 bits (1056), Expect = e-112
 Identities = 247/509 (48%), Positives = 319/509 (62%), Gaps = 18/509 (3%)
 Frame = +2

Query: 149  ETMGILDEIQYLASDKLQVVSYKWLSRNFLVSSNDAKRLLQNFVEKHGTGIEVLYTLAGW 328
            ET+GIL+EI+ L SDKL+VVSYKWLSRNFLVSSNDAKRLLQ F EKH +G+EV+Y L+GW
Sbjct: 2    ETLGILEEIESLVSDKLEVVSYKWLSRNFLVSSNDAKRLLQEFAEKHKSGLEVVYALSGW 61

Query: 329  MKNDPSLYRVRLASKPTLAEAKQEFDENCSVEIYSVQACLPKDPAVLGNAECVQAGGLFK 508
            +KN+P  Y +RL S+P L EAK+EFD NCS+ +YSVQ  +PKDPA L N E VQA  LF+
Sbjct: 62   LKNNPQSYHIRLVSRPKLEEAKKEFDGNCSIHVYSVQPAIPKDPAALWNDEFVQAEELFR 121

Query: 509  QPHAADNCLRDNRFCRVSNSFVKRNGGGAQGSSMDAQVKNAGAIGSSAKNFPNQSPTLPQ 688
            QP+ ADNCLRDNRFC + N FVKRN  G   S+  +Q K+ G    S  N  +++  +P 
Sbjct: 122  QPNVADNCLRDNRFCGILNPFVKRNVAGNPVSNAVSQPKSVGIPEPSKSNSAHENIKVPL 181

Query: 689  PHLKRVQESGKL---QSPDMVKDVKNKSHITGAREQSSKPAQDKEKVPVLPVNIKKVSDN 859
              +K  ++SG +   QS  +VKD+K++SH T   +QSSKP    EK  VLP N KK   +
Sbjct: 182  QQIKD-EQSGPMVGKQSTILVKDIKSESHET--EDQSSKPHACGEK--VLPTNEKKGQGD 236

Query: 860  KSSGNKGALTNMWGRASSKLKPECASAETNSTHN---SSDAQICAHEEIERESSDDEYQG 1030
            KSS    +L N+WGRAS+K K   A    N   N   S++AQ+C+ E IE +SS DE +G
Sbjct: 237  KSS---SSLANLWGRASAKSKLTSAEDNKNLVSNPIASAEAQVCSSEAIEDQSSADEAKG 293

Query: 1031 VHSKRPSNGGSGRKRKVVFEDSEEEDEYKDAVNLASPDPPKRQ--PMVASKQTSHSLDLE 1204
            V+ KR SNG   RKR+VVF+ S  +DEY+DAV+LASP+ PK +   +  S++ + +  +E
Sbjct: 294  VNFKRTSNGEGSRKRRVVFDFS--DDEYEDAVSLASPEAPKEKMNKIFLSEKPNVNGQIE 351

Query: 1205 KNMLNFXXXXXXXXXXXXXXADRGTNQILKEESLALSKGDNTKSFSLEKGVNNVTVGTTL 1384
                                 D+  NQ+ +E+    SK  N+   S EK  + +T G   
Sbjct: 352  DK----------REVKEESSTDKAPNQVPREKISVSSKRFNSNDSSNEKKHSPITGGDGK 401

Query: 1385 KDRKTDTAPRSPQRRKVLKTRIDERGREVTEVVWEGGEAE-TKSDSNTMKKVD------- 1540
             D  T+  P SP+RRKVLKTRIDERGREV EVVWEG + E  K+DSN+ K  D       
Sbjct: 402  ADIVTNDPPHSPKRRKVLKTRIDERGREVNEVVWEGEDTEKIKADSNSPKNADINAPKKA 461

Query: 1541 --NMATSNTANRPSAIKKSPAVGNNAPLN 1621
              N  TS   NR    KKSPAVG+ A  N
Sbjct: 462  ENNAITSTVNNRAPVAKKSPAVGSGASTN 490


>ref|XP_003540498.1| PREDICTED: uncharacterized protein LOC100812372 [Glycine max]
          Length = 518

 Score =  392 bits (1008), Expect = e-106
 Identities = 250/534 (46%), Positives = 327/534 (61%), Gaps = 12/534 (2%)
 Frame = +2

Query: 137  MAETETMGILDEIQYLASDKLQVVSYKWLSRNFLVSSNDAKRLLQNFVEKHGTGIEVLYT 316
            MA+T+T+  + EI+ L SDKLQVVSYKWLSRN++VSS++AKRLLQ FV+KH  G+EV+Y 
Sbjct: 1    MAQTQTLSFIHEIESLVSDKLQVVSYKWLSRNYMVSSDEAKRLLQEFVQKHEGGLEVVYA 60

Query: 317  LAGWMKNDPSLYRVRLASKPTLAEAKQEFDENCSVEIYSVQACLPKDPAVLGNAECVQAG 496
            L+GW+K++   Y VRL + P LAEA+QEFD +CSV+IYSVQA +PKDPAVL NAE +QA 
Sbjct: 61   LSGWLKSNHPSYHVRLVTGPKLAEAQQEFDGDCSVQIYSVQASIPKDPAVLWNAEFIQAE 120

Query: 497  GLFKQPHAADNCLRDNRFCRVSNSFVKRNGGGAQGSSMDAQVKNAGAIGSSAKNFPNQSP 676
             LFKQP + DNCLRDNRFC +SNSFV+RN  G        Q K+ G  G +  +   Q P
Sbjct: 121  ELFKQPSSVDNCLRDNRFCGISNSFVQRNVDGPTVVFAAPQSKSVGE-GPTKSDIVQQPP 179

Query: 677  -TLPQPHLKRVQESGKLQSPDMVKDVKNKSH---ITGAREQSSKPAQDKEKVPVLPVNIK 844
              + +  + +V    K QS  +VK+VK++S+    TG  +  +KP  DKEK P LP   K
Sbjct: 180  KNISRDSIDKVDT--KPQS--VVKEVKSESNGIGNTGVHDNMNKPTADKEKAPPLPTGKK 235

Query: 845  KVSDNKS-SGNKGALTNMWGRASSKLKPECASAETNSTHN----SSDA-QICAHEEIERE 1006
            KV  +KS S N G+L ++WGRAS+K KP  +SAE N+  +    S++A Q  A E  E +
Sbjct: 236  KVQADKSGSVNGGSLASLWGRASAKPKPCSSSAENNNKISNPLVSTEAGQTAACEAEECD 295

Query: 1007 SSDDEYQGVHSKRPSNGGSGRKRKVVFEDSEEEDEYKDAVNLASPDPPKRQPMVASKQTS 1186
            S +D+ Q V  +R SN    RKR+VVF+ S+E+++  D V+LASPD P +Q  + S+Q  
Sbjct: 296  SGNDDNQDVSLRRSSN----RKRRVVFDFSDEDED--DVVSLASPDLPNKQSSLDSRQND 349

Query: 1187 HSLDLEKNMLNFXXXXXXXXXXXXXXA-DRGTNQILKEESLALSKGDNTKSFSLEKGVNN 1363
                 EK  LNF              A ++  +  L+E   A+S+  NT   S EK  + 
Sbjct: 350  KKTS-EKTTLNFDLQEENKSGVKEERATEQKAHLPLRENVSAISRCTNTGKSSSEKLQSG 408

Query: 1364 VTVGTTLKDRKTDTAPRSPQRRKVLKTRIDERGREVTEVVWEGGEAETKS-DSNTMKKVD 1540
                   KD   +  P SP+RRKV+KTRIDERGREVTEVVW+G E E K  D  T KK D
Sbjct: 409  APEVHLNKDSVNNAPPCSPKRRKVMKTRIDERGREVTEVVWDGEETEEKKPDKVTTKKSD 468

Query: 1541 NMATSNTANRPSAIKKSPAVGNNAPLNQVXXXXXXXXXXXDPKQGNILSFFKRV 1702
            + A++   N   A KK PA  N                  DPKQGNILSFFKRV
Sbjct: 469  SNASTKAINSAPATKKPPANSNAIS----GKGGKKAGNSKDPKQGNILSFFKRV 518


Top