BLASTX nr result

ID: Cephaelis21_contig00010012

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00010012
         (1690 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003634173.1| PREDICTED: uncharacterized protein LOC100853...   119   2e-24
ref|XP_002524424.1| conserved hypothetical protein [Ricinus comm...   115   5e-23
ref|XP_002515870.1| conserved hypothetical protein [Ricinus comm...   109   3e-21
ref|XP_002321364.1| predicted protein [Populus trichocarpa] gi|2...    97   1e-17
ref|NP_197838.2| uncharacterized protein [Arabidopsis thaliana] ...    92   3e-16

>ref|XP_003634173.1| PREDICTED: uncharacterized protein LOC100853133 [Vitis vinifera]
          Length = 985

 Score =  119 bits (299), Expect = 2e-24
 Identities = 76/201 (37%), Positives = 109/201 (54%), Gaps = 1/201 (0%)
 Frame = +1

Query: 295 QTEPPEMPNTSQGKRLMFNLMSNVLAELFIMGGDSNGLPKFHPQKCSRKQPSPRFCGPLN 474
           +++PPE      G++  F +MSNVLAELF MG DSN +PK   +K SRKQ +P+ C    
Sbjct: 223 RSQPPET-----GEKEWFGIMSNVLAELFNMG-DSNQIPKLSGKKSSRKQTNPKIC---- 272

Query: 475 SPGFSCIGNEXXXXXXXXXXXXPHAKRESASPVSDDYSGVGLVKELRGDERVKLVENASS 654
                                    ++E   P +   SG   + E++       V+  + 
Sbjct: 273 --------------------LLSSVRQEDEVPATAPSSGDNSLTEMKDSNGE--VKTVNQ 310

Query: 655 GXXXXXXXXDKGCWM-LSGFSRTEVTVIDTSFARWKFEKMLFRKKNVWKVRDKKGKTANF 831
           G        ++ C   LS +SR+EVTVIDTS A WKFEK+LFRKKNVWKVRDKKGK+ + 
Sbjct: 311 GKVDCLDAEEEKCNQDLSAYSRSEVTVIDTSCAVWKFEKLLFRKKNVWKVRDKKGKSRSI 370

Query: 832 TKKERKSTSVDDENVSRGNLK 894
            +K+RK++  D++  +R  +K
Sbjct: 371 GRKKRKASECDEQLEARKKMK 391


>ref|XP_002524424.1| conserved hypothetical protein [Ricinus communis]
           gi|223536308|gb|EEF37959.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 272

 Score =  115 bits (287), Expect = 5e-23
 Identities = 93/304 (30%), Positives = 138/304 (45%), Gaps = 8/304 (2%)
 Frame = +1

Query: 58  MLCSIPTSGKSTSNWLARLRSSKGFPSTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 237
           MLCS+    KS SNWL RLRS+KGFP+T                                
Sbjct: 1   MLCSVSAGTKSGSNWLDRLRSTKGFPATENLDLDNFLSNSSLLNPSISESTLSHNKRV-- 58

Query: 238 XXXXXXXXXXXXXXPETGLQTEPPEMPNTSQGKRLMFNLMSNVLAELFIMGGDSNGLPKF 417
                           T  QT+ P+  ++  G++  F L++NVL +LF MG   +   + 
Sbjct: 59  ----------------TSDQTQFPDT-SSENGEKEWFGLVTNVLCDLFNMGDSQDKNSRL 101

Query: 418 HPQKCSRKQPSPRFCGPLNSPGFSCIGNEXXXXXXXXXXXXPHAKRESASPVS----DDY 585
              K SRKQ +P+F    +     C+                    + A+P S    ++ 
Sbjct: 102 SGTKSSRKQTNPKFFDIESVRKEECV--------------------QVATPASFRSDNNS 141

Query: 586 SGVGLVKELRGDERVKLVENASSGXXXXXXXXDKGCWMLSGFSRTEVTVIDTSFARWKFE 765
           + VG+  +   ++    V+             DK    L G+S++EVTVIDTSF  WKF+
Sbjct: 142 NVVGMNADCFSNDDDNNVDEEKE-----KCSSDKE---LKGYSKSEVTVIDTSFEMWKFD 193

Query: 766 KMLFRKKNVWKVRDKKGKTANFTKKERKSTSVDDENVSRGNL---KKAKF-LDGQCVFSG 933
           K++FR+KN+WKVRDKKGK+ +F+ K+RK   ++   +  GN+   KKAK   D Q   S 
Sbjct: 194 KLVFRRKNIWKVRDKKGKSWSFSSKKRKGNQLESA-IGNGNVGCKKKAKMSSDSQFASSK 252

Query: 934 KGNG 945
           + NG
Sbjct: 253 ESNG 256


>ref|XP_002515870.1| conserved hypothetical protein [Ricinus communis]
           gi|223545025|gb|EEF46539.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 268

 Score =  109 bits (272), Expect = 3e-21
 Identities = 90/296 (30%), Positives = 131/296 (44%), Gaps = 3/296 (1%)
 Frame = +1

Query: 67  SIPTSGKSTSNWLARLRSSKGFPSTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 246
           S+    KS SNWL RLRS+KGFP+T                                   
Sbjct: 4   SVFAGNKSGSNWLDRLRSTKGFPATENLDLDNFLSDPSLPNSESTQSLNRRV-------- 55

Query: 247 XXXXXXXXXXXPETGLQTEPPEMPNTSQGKRLMFNLMSNVLAELFIMGGDSNGLPKFHPQ 426
                        T  QTE P+    + G+R  F +++NVL +LF MG   +   +   +
Sbjct: 56  -------------TSDQTEIPDTLREN-GEREWFGVVTNVLCDLFNMGDSQDKNSRISGK 101

Query: 427 KCSRKQPSPRFCGPLNSPGFSCIGNEXXXXXXXXXXXXPHAKRESASPVSDDYSGV-GLV 603
           K SRKQ +P+F           +  E                  +AS  SD+ S V G+ 
Sbjct: 102 KSSRKQTNPKFFDA------DSVRKEEYVQAAT-----------TASFHSDNNSNVVGMN 144

Query: 604 KELRGDERVKLVENASSGXXXXXXXXDKGCWMLSGFSRTEVTVIDTSFARWKFEKMLFRK 783
            +   D+     ++  +G              L G+S++EVTVIDTSF  WKF+K++FR+
Sbjct: 145 ADCFVDD-----DDEYNGKLDEKKEKSSSDKELKGYSKSEVTVIDTSFEVWKFDKLVFRR 199

Query: 784 KNVWKVRDKKGKTANFTKKERKSTSVDD--ENVSRGNLKKAKFLDGQCVFSGKGNG 945
           K++WKVRDKKGK+ NF  K+RK   ++    N +  + KKAK  D +   S + NG
Sbjct: 200 KSIWKVRDKKGKSWNFASKKRKGNHLESATNNGNVSSKKKAKMSDSEFASSKESNG 255


>ref|XP_002321364.1| predicted protein [Populus trichocarpa] gi|222868360|gb|EEF05491.1|
           predicted protein [Populus trichocarpa]
          Length = 288

 Score = 97.4 bits (241), Expect = 1e-17
 Identities = 91/290 (31%), Positives = 121/290 (41%), Gaps = 8/290 (2%)
 Frame = +1

Query: 58  MLCSIPTSGKSTSNWLARLRSSKGFPSTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 237
           MLCS+ TS KS SNWL RL S+KGF +                                 
Sbjct: 1   MLCSVKTS-KSGSNWLDRLWSNKGFSNNDDDDPSVPNPSSSPITDASNSVINSNSESTHS 59

Query: 238 XXXXXXXXXXXXXXPETGLQTEPPEMPNTSQGKRLMFNLMSNVLAELFIMGGDSNGLP-- 411
                          +  + T      ++S  K L F LM+NVL++LF MGG S+ +   
Sbjct: 60  ESD------------QNKVTTTTTREISSSDNKDLFF-LMNNVLSDLFNMGGCSDPIEGS 106

Query: 412 ---KFHPQKCSRKQPSPRFCGPLNSPGFSCIGNEXXXXXXXXXXXXPHAKRESASPVSDD 582
                  ++  RKQ  P+FC           GN              +    + S  SD 
Sbjct: 107 SRHSRKKERIPRKQTKPKFC--------FVSGNNSSNDSLDCVRKDENVLVATGSLNSDK 158

Query: 583 YSG---VGLVKELRGDERVKLVENASSGXXXXXXXXDKGCWMLSGFSRTEVTVIDTSFAR 753
            S     G+  +   +E   + E    G          G   L G+SR+EVTVIDTS   
Sbjct: 159 NSNNVDCGVDDDDEEEEEEDVEEEKGKGFGV------SGDKELKGYSRSEVTVIDTSCLV 212

Query: 754 WKFEKMLFRKKNVWKVRDKKGKTANFTKKERKSTSVDDENVSRGNLKKAK 903
           WKF+K++FRKKNVWKVRDKKGK+     K+RK   ++  N   G  KKAK
Sbjct: 213 WKFDKLVFRKKNVWKVRDKKGKSWVSGSKKRKVIDLESAN-GNGAKKKAK 261


>ref|NP_197838.2| uncharacterized protein [Arabidopsis thaliana]
           gi|28973694|gb|AAO64164.1| unknown protein [Arabidopsis
           thaliana] gi|29824259|gb|AAP04090.1| unknown protein
           [Arabidopsis thaliana] gi|110736861|dbj|BAF00388.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332005934|gb|AED93317.1| uncharacterized protein
           [Arabidopsis thaliana]
          Length = 334

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 79/250 (31%), Positives = 114/250 (45%), Gaps = 14/250 (5%)
 Frame = +1

Query: 280 PETGLQTEPP--EMPNTSQGKRLMFNLMSNVLAELFIMGGDSNGLPKFHPQKCSRKQPSP 453
           P   + ++P   E P+        + +MS+VL ELF   G S        +K  RKQ +P
Sbjct: 62  PSAPIPSDPELAESPSEEPVPGEWYGVMSDVLFELFNFSGSSKSSTIPGKKKLPRKQSNP 121

Query: 454 RFCGPLNSPGFSCIGNEXXXXXXXXXXXXPHAKRESASPVSDDYSGVGLVKELRGDERVK 633
           R C  L +P    +               P  +  + S     Y+      E+R + R  
Sbjct: 122 RHCS-LETPEDVVV--PLVNQKSDDANCLPSVREFATSSSRSSYNKKPPAPEIR-ERRRS 177

Query: 634 LVENASSGXXXXXXXXDKGCWMLSGFSRTEVTVIDTSFARWKFEKMLFRKKNVWKVRDKK 813
           +VE             +KG   L GFSR+EVTVIDTSF  WK EK++FR++NVWKVR+KK
Sbjct: 178 VVEGDG-----VDEEEEKGEKDLVGFSRSEVTVIDTSFKIWKSEKLVFRRRNVWKVREKK 232

Query: 814 GKTANFT----------KKERKSTSVDDEN--VSRGNLKKAKFLDGQCVFSGKGNGEALK 957
           GK+   +          KK+RK   VDD++  ++R   KK K        + + N E + 
Sbjct: 233 GKSRVVSKLKKLMKKKKKKKRKCDDVDDDDGGIARKKSKKMKISTSVSDNNPRYNVEEIH 292

Query: 958 QEYHHSNQSQ 987
            E   SN S+
Sbjct: 293 DEPESSNVSR 302

