BLASTX nr result

ID: Cephaelis21_contig00025520 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00025520
         (1479 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002328811.1| predicted protein [Populus trichocarpa] gi|2...   173   9e-41
gb|AFN58239.1| trichome branching-like protein [Gossypium arboreum]   167   7e-39
ref|XP_002519367.1| replication factor C / DNA polymerase III ga...   150   1e-33
ref|XP_003528725.1| PREDICTED: uncharacterized protein LOC100814...   139   1e-30
ref|XP_002303217.1| predicted protein [Populus trichocarpa] gi|2...   139   2e-30

>ref|XP_002328811.1| predicted protein [Populus trichocarpa] gi|222839109|gb|EEE77460.1|
            predicted protein [Populus trichocarpa]
          Length = 1197

 Score =  173 bits (439), Expect = 9e-41
 Identities = 124/322 (38%), Positives = 158/322 (49%), Gaps = 6/322 (1%)
 Frame = +3

Query: 531  DPSNLHLKKELTQIRKAARVLRDPGTSSSWRSPLNSARSXXXXXXXXXXXDDFGKQQINY 710
            DPS LHLKKELTQIRKAARVLRDPGTSSSW+SPLNSARS                   + 
Sbjct: 8    DPSRLHLKKELTQIRKAARVLRDPGTSSSWKSPLNSARSAATMAAAA------ASTSASA 61

Query: 711  REYSNGENHSQLPLGFVENNGSSSKFNDFDANGNAKEKEKKVYLCNWKMQKSESERSKQC 890
             ++   EN  Q   G   +N +S+  +    +GN   K+K+V+L NWK QKS SE+S   
Sbjct: 62   WKHFETENAIQNGGGGGSHNNNSAHLDSHFKSGNNHGKDKRVFLYNWKSQKSSSEKSALA 121

Query: 891  GXXXXXXXXXXXXXXXXXXXXXXXXVGDSLSDARIGEIDSKSDTFVSE-KYASLLFKCKD 1067
                                     + DSLSDAR    DSKSDT++ E + A+++F+C+D
Sbjct: 122  ---------RNDADDDYESCSIQGSLDDSLSDARNAG-DSKSDTYLGETRSAAMIFRCRD 171

Query: 1068 TNFKPTXXXXXXXXXXXXXXXXXXXXXXGEKLKEQMMLDR----GPKRAF-QGLGRNDLS 1232
             N                              +++M L R     P      GLGR+D  
Sbjct: 172  ANLVSPSMRRAMGIKKKSKKTNARFDVLSRYQQKEMNLRRLLKGHPSMGLGLGLGRDD-- 229

Query: 1233 NLVDQSDDTEDYCNSEDLRRDSAVSPLLAXXXXXXXXXXXXXXXXXXXXXXXXYSYSTPA 1412
             +V+QSDDTE+Y NSE LR+ S  SPLL                         YS+STPA
Sbjct: 230  -VVEQSDDTEEYSNSEYLRKISGASPLLLKLKHKNRSHSPSKLLRTTRKEDSSYSHSTPA 288

Query: 1413 MSTSSYNRYVARYPSTVGSWDA 1478
            +S SSY++Y  R PS VGSWDA
Sbjct: 289  LSASSYDKYRKRNPSNVGSWDA 310


>gb|AFN58239.1| trichome branching-like protein [Gossypium arboreum]
          Length = 1223

 Score =  167 bits (423), Expect = 7e-39
 Identities = 126/320 (39%), Positives = 153/320 (47%), Gaps = 4/320 (1%)
 Frame = +3

Query: 531  DPSNLHLKKELTQIRKAARVLRDPGTSSSWRSPLNSARSXXXXXXXXXXXDDFGKQQINY 710
            DPS LHLKKELTQIRKAARVLRDPGT+SSW+SP+NS+RS              G + ++ 
Sbjct: 8    DPSRLHLKKELTQIRKAARVLRDPGTTSSWKSPINSSRSVAA----------LGSESLS- 56

Query: 711  REYSNGENHSQLPL--GFVENNGSSSKFNDFDANGNAKEKEKKVYLCNWKMQKSESERSK 884
               SNG  H  L L    VE+NG          N N  EK+K+V+L NW+ QKS S    
Sbjct: 57   --RSNGNAHLDLSLLPFRVESNGHGR-----ITNSNGNEKDKRVFLYNWRSQKSSSVNVD 109

Query: 885  QCGXXXXXXXXXXXXXXXXXXXXXXXXVGDSLSDAR-IGEIDSKSDTFVSE-KYASLLFK 1058
              G                          +SLSDAR  G  DSKSDT + E + AS+LF+
Sbjct: 110  DDGEDDDDFDDGDDGDQSSSWIQGSVD-ENSLSDARKCG--DSKSDTCLGESRSASMLFR 166

Query: 1059 CKDTNFKPTXXXXXXXXXXXXXXXXXXXXXXGEKLKEQMMLDRGPKRAFQGLGRNDLSNL 1238
            C+D N                           +K      +    ++   G+ RN   + 
Sbjct: 167  CRDANL--------VSLVTPSAKRMLGANKNSKKNGSNFDVFSRYEQKKNGVNRN---SS 215

Query: 1239 VDQSDDTEDYCNSEDLRRDSAVSPLLAXXXXXXXXXXXXXXXXXXXXXXXXYSYSTPAMS 1418
            VDQSDDTEDY NSED R+ S  SPLL                         YSYSTPA+S
Sbjct: 216  VDQSDDTEDYSNSEDFRKISGASPLLLKLKPKNWPHPSSRLLKADRKEDSSYSYSTPALS 275

Query: 1419 TSSYNRYVARYPSTVGSWDA 1478
            TSSYN+Y    PS VGSWDA
Sbjct: 276  TSSYNKYFNHNPSVVGSWDA 295


>ref|XP_002519367.1| replication factor C / DNA polymerase III gamma-tau subunit, putative
            [Ricinus communis] gi|223541434|gb|EEF42984.1|
            replication factor C / DNA polymerase III gamma-tau
            subunit, putative [Ricinus communis]
          Length = 1270

 Score =  150 bits (378), Expect = 1e-33
 Identities = 114/309 (36%), Positives = 142/309 (45%), Gaps = 7/309 (2%)
 Frame = +3

Query: 570  IRKAARVLRDPGTSSSWRSPLNSARSXXXXXXXXXXXDDFG--KQQINYREYSNGENHSQ 743
            + KAARVLRDPGT+SSW+SP++S+RS                 KQ  N     NG N + 
Sbjct: 11   VGKAARVLRDPGTTSSWKSPISSSRSAAAATLAAAAAASTSAWKQFDNENVIPNGHNSNS 70

Query: 744  LPLGFVENNGSSSKFNDFDANGNAKEKEKKVYLCNWKMQKSESERSKQCGXXXXXXXXXX 923
                +  NNG                KEK+V+L NWK QKS SE+S              
Sbjct: 71   HMDSYFRNNG----------------KEKRVFLYNWKTQKSSSEKSA---------IARN 105

Query: 924  XXXXXXXXXXXXXXVGDSLSDARIGEIDSKSDTFVSE-KYASLLFKCKDTNF-KPTXXXX 1097
                          V DSLSDAR    DSKSDT++ + + +S++F+C+D N   P+    
Sbjct: 106  DLDEDYESRSVQDSVDDSLSDAR-NAADSKSDTYLGDSRSSSMIFRCRDANLVSPSMRRA 164

Query: 1098 XXXXXXXXXXXXXXXXXXGEKLKE---QMMLDRGPKRAFQGLGRNDLSNLVDQSDDTEDY 1268
                                + KE   + +L   P  A  GLGR D    V+QSDDTEDY
Sbjct: 165  MGIKKKSKKTDTHLDILSRYQQKEINLRRLLKSHPSIAL-GLGREDS---VEQSDDTEDY 220

Query: 1269 CNSEDLRRDSAVSPLLAXXXXXXXXXXXXXXXXXXXXXXXXYSYSTPAMSTSSYNRYVAR 1448
             NSEDLR+ S  SPLL                         Y+YSTPA+STSSYNRY   
Sbjct: 221  SNSEDLRKISGASPLLIKLKHKRWSHSPSKLLRISRKEDSSYTYSTPALSTSSYNRYCNH 280

Query: 1449 YPSTVGSWD 1475
             PSTVGSWD
Sbjct: 281  NPSTVGSWD 289


>ref|XP_003528725.1| PREDICTED: uncharacterized protein LOC100814391 [Glycine max]
          Length = 1237

 Score =  139 bits (351), Expect = 1e-30
 Identities = 113/330 (34%), Positives = 140/330 (42%), Gaps = 17/330 (5%)
 Frame = +3

Query: 537  SNLHLKKELTQIRKAARVLRDPGTSSSWRSPLNSARSXXXXXXXXXXXDDFGKQQINYRE 716
            S LHLKKELTQIRKAARVLRDPGT+SSW+SPL+S+RS                       
Sbjct: 4    SELHLKKELTQIRKAARVLRDPGTTSSWKSPLSSSRSVAAW------------------- 44

Query: 717  YSNGENHSQLPLGFVENNGSSSKFNDFDANG--NAKEKEKKVYLCNWKMQKSESERSKQC 890
                            N+ +S +       G  N  +K+K+V+L NWK  KS SE+    
Sbjct: 45   ---------------NNDTASRRLTTISQLGPNNTNDKDKRVFLYNWKNYKSSSEKYND- 88

Query: 891  GXXXXXXXXXXXXXXXXXXXXXXXXVGDSLSDARIGEIDSKSDTFVSEKYA--------S 1046
                                       DSLSDAR G  DSKSDT+++            S
Sbjct: 89   -------EEEEEEDDDGSSSLLGDRDRDSLSDARNG-CDSKSDTYLAAAVGGGGGGGTRS 140

Query: 1047 LLFKCKDTNFK-----PTXXXXXXXXXXXXXXXXXXXXXXGEKL--KEQMMLDRGPKRAF 1205
             +F+C D N       P                       G+K     + +L+  P   F
Sbjct: 141  SIFRCGDANLVSRRTVPVKKKSKKNNPHFDFLAKYQHHRPGKKFVSSSKALLEGHPSPFF 200

Query: 1206 QGLGRNDLSNLVDQSDDTEDYCNSEDLRRDSAVSPLLAXXXXXXXXXXXXXXXXXXXXXX 1385
                R+D  ++    DDTEDY NSE +R  S  SPLL                       
Sbjct: 201  N---RDD--SVEHSDDDTEDYTNSEGVRPISGTSPLLLKLRQKNWSRSSSKFLRRSRKED 255

Query: 1386 XXYSYSTPAMSTSSYNRYVARYPSTVGSWD 1475
              YSYSTPA+STSSYNRY  RYPST+GSWD
Sbjct: 256  SSYSYSTPALSTSSYNRYGHRYPSTLGSWD 285


>ref|XP_002303217.1| predicted protein [Populus trichocarpa] gi|222840649|gb|EEE78196.1|
            predicted protein [Populus trichocarpa]
          Length = 1241

 Score =  139 bits (349), Expect = 2e-30
 Identities = 113/325 (34%), Positives = 142/325 (43%), Gaps = 9/325 (2%)
 Frame = +3

Query: 531  DPSNLHLKKELTQIRKAARVLRDPGTSSSWRSPLNSARSXXXXXXXXXXXDDFGKQQINY 710
            DPS LHLKKELTQIRKAARVLRDPGT+SSW+S  ++A +               K   N 
Sbjct: 8    DPSRLHLKKELTQIRKAARVLRDPGTTSSWKSARSAAAASTSASAW--------KHFENE 59

Query: 711  REYSNG---ENHSQLPLGFVENNGSSSKFNDFDANGNAKEKEKKVYLCNWKMQKSESERS 881
                NG    +HS        NN S+   + F +  N    +KKV+L NWK QK  SE+S
Sbjct: 60   NAIQNGGTTASHS--------NNSSTHLGSHFKSVLNNNGSDKKVFLYNWKSQKYSSEKS 111

Query: 882  K-QCGXXXXXXXXXXXXXXXXXXXXXXXXVGDSLSDARIGEIDSKSDTFVSEKYASLLFK 1058
                                         VGDS SD  +GE  S           +++F+
Sbjct: 112  ALPRNDADDNCESCSVQESLDDSLSDARNVGDSKSDTYLGETRS----------PAMIFR 161

Query: 1059 CKDTNFKPTXXXXXXXXXXXXXXXXXXXXXXGEKLKEQMMLDR----GPKRAFQ-GLGRN 1223
             +D N                              +++M L R     P      GLGR+
Sbjct: 162  RRDANLVSPSMRRAMGVKKKGKKTNTRLDVLSRYQEKEMNLRRLLKGHPSMGLSLGLGRD 221

Query: 1224 DLSNLVDQSDDTEDYCNSEDLRRDSAVSPLLAXXXXXXXXXXXXXXXXXXXXXXXXYSYS 1403
                +V+QSDDTE+Y NSEDLR+ S  SPLL                         Y +S
Sbjct: 222  ---AIVEQSDDTEEYSNSEDLRKISGASPLLLKLKHKNWSHSPSKFLRTSRKEDSSYCHS 278

Query: 1404 TPAMSTSSYNRYVARYPSTVGSWDA 1478
            TPA+STSS N+Y  R PSTVGSWDA
Sbjct: 279  TPALSTSSCNKYRNRNPSTVGSWDA 303