BLASTX nr result

ID: Cornus23_contig00035339 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00035339
         (808 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010649804.1| PREDICTED: uncharacterized protein LOC100266...   144   6e-32
ref|XP_002314392.2| transcription activation domain-interacting ...   127   7e-27
ref|XP_011005086.1| PREDICTED: uncharacterized protein LOC105111...   124   6e-26
ref|XP_010244658.1| PREDICTED: uncharacterized protein LOC104588...   100   9e-19
ref|XP_010244657.1| PREDICTED: uncharacterized protein LOC104588...   100   9e-19
ref|XP_012069425.1| PREDICTED: uncharacterized protein LOC105631...   100   2e-18
gb|KDP40036.1| hypothetical protein JCGZ_02034 [Jatropha curcas]      100   2e-18
ref|XP_002516852.1| pax transcription activation domain interact...   100   2e-18
ref|XP_012455778.1| PREDICTED: uncharacterized protein LOC105777...    96   3e-17
ref|XP_010040881.1| PREDICTED: uncharacterized protein LOC104429...    95   7e-17
ref|XP_012482073.1| PREDICTED: uncharacterized protein LOC105796...    93   2e-16
gb|KJB28584.1| hypothetical protein B456_005G056800 [Gossypium r...    93   2e-16
ref|XP_012482072.1| PREDICTED: uncharacterized protein LOC105796...    93   2e-16
gb|KHG03195.1| PAX-interacting 1 [Gossypium arboreum]                  92   4e-16
ref|XP_007035445.1| BRCT domain-containing DNA repair protein, p...    92   6e-16
ref|XP_007035444.1| BRCT domain-containing DNA repair protein, p...    92   6e-16
ref|XP_007035443.1| BRCT domain-containing DNA repair protein, p...    92   6e-16
ref|XP_007035442.1| BRCT domain-containing DNA repair protein, p...    92   6e-16
ref|XP_007035440.1| BRCT domain-containing DNA repair protein, p...    92   6e-16
gb|KHG13951.1| PAX-interacting 1 [Gossypium arboreum]                  91   1e-15

>ref|XP_010649804.1| PREDICTED: uncharacterized protein LOC100266667 [Vitis vinifera]
          Length = 1239

 Score =  144 bits (364), Expect = 6e-32
 Identities = 71/124 (57%), Positives = 90/124 (72%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPDCVENIDTQLLLECNDEGIVD 629
           D + FLQ TVPFDDTV +ED FETQ+VNLGGETQVLDDPDC ENI TQLL   +DE +++
Sbjct: 48  DAVQFLQNTVPFDDTVPLEDAFETQLVNLGGETQVLDDPDCTENIRTQLLDGFDDEVVIE 107

Query: 628 SDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSNE 449
           SDGEGTDRTEVL D + +SD +SV+  G  PVD   + + + C+Q ++G   E + L  E
Sbjct: 108 SDGEGTDRTEVLSDNEGLSDDNSVRSIGVFPVDKENVHNVSACEQDEKGSLLEPHPLIGE 167

Query: 448 QCGS 437
           QC +
Sbjct: 168 QCNA 171



 Score =  138 bits (347), Expect = 5e-30
 Identities = 77/155 (49%), Positives = 105/155 (67%), Gaps = 2/155 (1%)
 Frame = -1

Query: 466 NALSNEQCGSGEKFKGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPPC 287
           +ALS+    SGEK K  + D +QFLQ+TVP DDTVPLEDA ETQ++NLG ETQ  DDP C
Sbjct: 29  DALSSPSSLSGEKIKDWNADAVQFLQNTVPFDDTVPLEDAFETQLVNLGGETQVLDDPDC 88

Query: 286 VENMGTQLLVEWDNEVIDDSDGEATTRTEVLG-NQEMSGNDFVKNVHNDLVDPENMLYTT 110
            EN+ TQLL  +D+EV+ +SDGE T RTEVL  N+ +S ++ V+++    VD EN+   +
Sbjct: 89  TENIRTQLLDGFDDEVVIESDGEGTDRTEVLSDNEGLSDDNSVRSIGVFPVDKENVHNVS 148

Query: 109 LCKQNNRGFKAE-SDAHNEQCSSELNVPTATRLDK 8
            C+Q+ +G   E      EQC++E NV T T L++
Sbjct: 149 ACEQDEKGSLLEPHPLIGEQCNAEHNVSTVTPLEQ 183


>ref|XP_002314392.2| transcription activation domain-interacting family protein [Populus
           trichocarpa] gi|550328889|gb|EEF00563.2| transcription
           activation domain-interacting family protein [Populus
           trichocarpa]
          Length = 1102

 Score =  127 bits (320), Expect = 7e-27
 Identities = 70/129 (54%), Positives = 91/129 (70%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPDCVENIDTQLLLECNDEGIVD 629
           +EL FLQ T+ F+DTVRVED FETQVV+LGGETQ LDD D  +N+DTQL+ E     I+D
Sbjct: 49  NELQFLQSTMLFEDTVRVEDAFETQVVDLGGETQALDDLDWFQNVDTQLIDE-----IID 103

Query: 628 SDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSNE 449
           SDGEGTDRTEVLDD +E+SD +S +R     +D   +   +L K G++G   +S+AL++E
Sbjct: 104 SDGEGTDRTEVLDDGNELSDDESGRRGKCESLDGEKIQDTSLSKHGEKGLVEQSDALTDE 163

Query: 448 QCGSGEKFK 422
           Q  SG   K
Sbjct: 164 QHLSGSALK 172



 Score = 94.7 bits (234), Expect = 7e-17
 Identities = 57/125 (45%), Positives = 80/125 (64%), Gaps = 1/125 (0%)
 Frame = -1

Query: 424 KGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPPCVENMGTQLLVEWDN 245
           KGED +ELQFLQST+  +DTV +EDA ETQ+++LG ETQA DD    +N+ TQL+ E   
Sbjct: 44  KGEDANELQFLQSTMLFEDTVRVEDAFETQVVDLGGETQALDDLDWFQNVDTQLIDE--- 100

Query: 244 EVIDDSDGEATTRTEVLGN-QEMSGNDFVKNVHNDLVDPENMLYTTLCKQNNRGFKAESD 68
             I DSDGE T RTEVL +  E+S ++  +    + +D E +  T+L K   +G   +SD
Sbjct: 101 --IIDSDGEGTDRTEVLDDGNELSDDESGRRGKCESLDGEKIQDTSLSKHGEKGLVEQSD 158

Query: 67  AHNEQ 53
           A  ++
Sbjct: 159 ALTDE 163


>ref|XP_011005086.1| PREDICTED: uncharacterized protein LOC105111439 [Populus
           euphratica]
          Length = 1147

 Score =  124 bits (312), Expect = 6e-26
 Identities = 69/129 (53%), Positives = 90/129 (69%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPDCVENIDTQLLLECNDEGIVD 629
           +EL FLQ T+ F+DTVRVED  ETQVV+LGGETQ LDD D  +N+DTQL+ E     I+D
Sbjct: 49  NELQFLQSTMLFEDTVRVEDASETQVVDLGGETQALDDLDWFQNVDTQLIDE-----IID 103

Query: 628 SDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSNE 449
           SDGEGTDRTEVLDD +E+SD +S +R     +D   +   +L K G++G   +S+AL++E
Sbjct: 104 SDGEGTDRTEVLDDGNELSDDESGRRGKCESLDGEKIQDTSLSKHGEKGLVEQSDALTDE 163

Query: 448 QCGSGEKFK 422
           Q  SG   K
Sbjct: 164 QHLSGSALK 172



 Score = 97.1 bits (240), Expect = 1e-17
 Identities = 58/125 (46%), Positives = 81/125 (64%), Gaps = 1/125 (0%)
 Frame = -1

Query: 424 KGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPPCVENMGTQLLVEWDN 245
           KGED +ELQFLQST+  +DTV +EDASETQ+++LG ETQA DD    +N+ TQL+ E   
Sbjct: 44  KGEDANELQFLQSTMLFEDTVRVEDASETQVVDLGGETQALDDLDWFQNVDTQLIDE--- 100

Query: 244 EVIDDSDGEATTRTEVLGN-QEMSGNDFVKNVHNDLVDPENMLYTTLCKQNNRGFKAESD 68
             I DSDGE T RTEVL +  E+S ++  +    + +D E +  T+L K   +G   +SD
Sbjct: 101 --IIDSDGEGTDRTEVLDDGNELSDDESGRRGKCESLDGEKIQDTSLSKHGEKGLVEQSD 158

Query: 67  AHNEQ 53
           A  ++
Sbjct: 159 ALTDE 163


>ref|XP_010244658.1| PREDICTED: uncharacterized protein LOC104588427 isoform X2 [Nelumbo
           nucifera]
          Length = 1149

 Score =  100 bits (250), Expect = 9e-19
 Identities = 56/121 (46%), Positives = 77/121 (63%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPDCVENIDTQLLLECNDEGIVD 629
           D+L   Q TVP DDT+ +E   ETQ+V+L GETQ ++D D +E++ TQLL + + E   D
Sbjct: 48  DDLHAFQNTVPLDDTIPLEIFAETQLVDLEGETQEIEDLDLIEDVKTQLLDDYDKELSFD 107

Query: 628 SDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSNE 449
           SDGEGTDRTE+L D D VSD DS +  G+  V      H+  CKQG +    +S+A  +E
Sbjct: 108 SDGEGTDRTEILTDVDAVSDDDSKRGSGDDSVGSEKRQHSP-CKQGAKDIILDSDASHDE 166

Query: 448 Q 446
           +
Sbjct: 167 E 167



 Score = 95.5 bits (236), Expect = 4e-17
 Identities = 56/144 (38%), Positives = 89/144 (61%), Gaps = 1/144 (0%)
 Frame = -1

Query: 457 SNEQCGSGEKFKGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPPCVEN 278
           S     SGEK + ++ D+L   Q+TVP+DDT+PLE  +ETQ+++L  ETQ  +D   +E+
Sbjct: 32  SPSSVSSGEKLEEDNADDLHAFQNTVPLDDTIPLEIFAETQLVDLEGETQEIEDLDLIED 91

Query: 277 MGTQLLVEWDNEVIDDSDGEATTRTEVLGNQE-MSGNDFVKNVHNDLVDPENMLYTTLCK 101
           + TQLL ++D E+  DSDGE T RTE+L + + +S +D  +   +D V  E   ++  CK
Sbjct: 92  VKTQLLDDYDKELSFDSDGEGTDRTEILTDVDAVSDDDSKRGSGDDSVGSEKRQHSP-CK 150

Query: 100 QNNRGFKAESDAHNEQCSSELNVP 29
           Q  +    +SDA +++   E +VP
Sbjct: 151 QGAKDIILDSDASHDE--EERSVP 172


>ref|XP_010244657.1| PREDICTED: uncharacterized protein LOC104588427 isoform X1 [Nelumbo
           nucifera]
          Length = 1228

 Score =  100 bits (250), Expect = 9e-19
 Identities = 56/121 (46%), Positives = 77/121 (63%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPDCVENIDTQLLLECNDEGIVD 629
           D+L   Q TVP DDT+ +E   ETQ+V+L GETQ ++D D +E++ TQLL + + E   D
Sbjct: 48  DDLHAFQNTVPLDDTIPLEIFAETQLVDLEGETQEIEDLDLIEDVKTQLLDDYDKELSFD 107

Query: 628 SDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSNE 449
           SDGEGTDRTE+L D D VSD DS +  G+  V      H+  CKQG +    +S+A  +E
Sbjct: 108 SDGEGTDRTEILTDVDAVSDDDSKRGSGDDSVGSEKRQHSP-CKQGAKDIILDSDASHDE 166

Query: 448 Q 446
           +
Sbjct: 167 E 167



 Score = 95.5 bits (236), Expect = 4e-17
 Identities = 56/144 (38%), Positives = 89/144 (61%), Gaps = 1/144 (0%)
 Frame = -1

Query: 457 SNEQCGSGEKFKGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPPCVEN 278
           S     SGEK + ++ D+L   Q+TVP+DDT+PLE  +ETQ+++L  ETQ  +D   +E+
Sbjct: 32  SPSSVSSGEKLEEDNADDLHAFQNTVPLDDTIPLEIFAETQLVDLEGETQEIEDLDLIED 91

Query: 277 MGTQLLVEWDNEVIDDSDGEATTRTEVLGNQE-MSGNDFVKNVHNDLVDPENMLYTTLCK 101
           + TQLL ++D E+  DSDGE T RTE+L + + +S +D  +   +D V  E   ++  CK
Sbjct: 92  VKTQLLDDYDKELSFDSDGEGTDRTEILTDVDAVSDDDSKRGSGDDSVGSEKRQHSP-CK 150

Query: 100 QNNRGFKAESDAHNEQCSSELNVP 29
           Q  +    +SDA +++   E +VP
Sbjct: 151 QGAKDIILDSDASHDE--EERSVP 172


>ref|XP_012069425.1| PREDICTED: uncharacterized protein LOC105631840 [Jatropha curcas]
          Length = 1193

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 61/125 (48%), Positives = 80/125 (64%), Gaps = 1/125 (0%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPD-CVENIDTQLLLECNDEGIV 632
           +E+ ++Q TVPFDDTV VED FETQ V+ GGETQVLDDPD  ++++DTQL+ E      +
Sbjct: 48  NEVNWVQNTVPFDDTVPVEDAFETQ-VDFGGETQVLDDPDYLIDHMDTQLMDE------I 100

Query: 631 DSDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSN 452
            SDGEGTD+TEVL D+DE+SD +S KR     VD         C+   +  ++ S    N
Sbjct: 101 YSDGEGTDKTEVLSDSDELSDDESQKRGKCESVD-----GGITCRASLEHKESVSVEQPN 155

Query: 451 EQCGS 437
           E C S
Sbjct: 156 ENCSS 160



 Score = 75.5 bits (184), Expect = 4e-11
 Identities = 54/133 (40%), Positives = 77/133 (57%), Gaps = 2/133 (1%)
 Frame = -1

Query: 424 KGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPP-CVENMGTQLLVEWD 248
           KG D +E+ ++Q+TVP DDTVP+EDA ETQ ++ G ETQ  DDP   +++M TQL+    
Sbjct: 43  KGGDANEVNWVQNTVPFDDTVPVEDAFETQ-VDFGGETQVLDDPDYLIDHMDTQLM---- 97

Query: 247 NEVIDDSDGEATTRTEVLG-NQEMSGNDFVKNVHNDLVDPENMLYTTLCKQNNRGFKAES 71
           +E+   SDGE T +TEVL  + E+S ++  K    + VD       +L  ++      E 
Sbjct: 98  DEIY--SDGEGTDKTEVLSDSDELSDDESQKRGKCESVDGGITCRASL--EHKESVSVEQ 153

Query: 70  DAHNEQCSSELNV 32
              NE CSS  NV
Sbjct: 154 P--NENCSSSFNV 164


>gb|KDP40036.1| hypothetical protein JCGZ_02034 [Jatropha curcas]
          Length = 1160

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 61/125 (48%), Positives = 80/125 (64%), Gaps = 1/125 (0%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPD-CVENIDTQLLLECNDEGIV 632
           +E+ ++Q TVPFDDTV VED FETQ V+ GGETQVLDDPD  ++++DTQL+ E      +
Sbjct: 15  NEVNWVQNTVPFDDTVPVEDAFETQ-VDFGGETQVLDDPDYLIDHMDTQLMDE------I 67

Query: 631 DSDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGFKAESNALSN 452
            SDGEGTD+TEVL D+DE+SD +S KR     VD         C+   +  ++ S    N
Sbjct: 68  YSDGEGTDKTEVLSDSDELSDDESQKRGKCESVD-----GGITCRASLEHKESVSVEQPN 122

Query: 451 EQCGS 437
           E C S
Sbjct: 123 ENCSS 127



 Score = 75.5 bits (184), Expect = 4e-11
 Identities = 54/133 (40%), Positives = 77/133 (57%), Gaps = 2/133 (1%)
 Frame = -1

Query: 424 KGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWDDPP-CVENMGTQLLVEWD 248
           KG D +E+ ++Q+TVP DDTVP+EDA ETQ ++ G ETQ  DDP   +++M TQL+    
Sbjct: 10  KGGDANEVNWVQNTVPFDDTVPVEDAFETQ-VDFGGETQVLDDPDYLIDHMDTQLM---- 64

Query: 247 NEVIDDSDGEATTRTEVLG-NQEMSGNDFVKNVHNDLVDPENMLYTTLCKQNNRGFKAES 71
           +E+   SDGE T +TEVL  + E+S ++  K    + VD       +L  ++      E 
Sbjct: 65  DEIY--SDGEGTDKTEVLSDSDELSDDESQKRGKCESVDGGITCRASL--EHKESVSVEQ 120

Query: 70  DAHNEQCSSELNV 32
              NE CSS  NV
Sbjct: 121 P--NENCSSSFNV 131


>ref|XP_002516852.1| pax transcription activation domain interacting protein, putative
           [Ricinus communis] gi|223543940|gb|EEF45466.1| pax
           transcription activation domain interacting protein,
           putative [Ricinus communis]
          Length = 1178

 Score = 99.8 bits (247), Expect = 2e-18
 Identities = 49/81 (60%), Positives = 67/81 (82%)
 Frame = -1

Query: 793 LQETVPFDDTVRVEDTFETQVVNLGGETQVLDDPDCVENIDTQLLLECNDEGIVDSDGEG 614
           +Q +VPF DTV VED FETQV++L  ETQVLDDPDC E+++TQ++     +G+ +SDGE 
Sbjct: 49  VQNSVPFSDTVAVEDAFETQVIDLCDETQVLDDPDCFEHMETQVI-----DGL-NSDGEE 102

Query: 613 TDRTEVLDDTDEVSDGDSVKR 551
           TD+TEVLDDT+E+SDG+S++R
Sbjct: 103 TDKTEVLDDTNELSDGESLRR 123



 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 57/137 (41%), Positives = 77/137 (56%), Gaps = 3/137 (2%)
 Frame = -1

Query: 490 DQGFKAESNALSNEQCGSGEKFKGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDET 311
           +  F   +  L + Q   GEK  G D    Q +Q++VP  DTV +EDA ETQ+++L DET
Sbjct: 19  NSSFTQSNTQLFDSQIFPGEK--GVDAHAGQLVQNSVPFSDTVAVEDAFETQVIDLCDET 76

Query: 310 QAWDDPPCVENMGTQLLVEWDNEVID--DSDGEATTRTEVLGN-QEMSGNDFVKNVHNDL 140
           Q  DDP C E+M TQ        VID  +SDGE T +TEVL +  E+S  + ++    D 
Sbjct: 77  QVLDDPDCFEHMETQ--------VIDGLNSDGEETDKTEVLDDTNELSDGESLRRGKCDS 128

Query: 139 VDPENMLYTTLCKQNNR 89
           +D EN   T+L   NNR
Sbjct: 129 LDVEN---TSLELTNNR 142


>ref|XP_012455778.1| PREDICTED: uncharacterized protein LOC105777205 isoform X1
           [Gossypium raimondii] gi|763805618|gb|KJB72556.1|
           hypothetical protein B456_011G184800 [Gossypium
           raimondii]
          Length = 1215

 Score = 95.9 bits (237), Expect = 3e-17
 Identities = 54/94 (57%), Positives = 63/94 (67%), Gaps = 10/94 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNLGGETQVL---------DDPDCVENIDTQLL 659
           DEL +LQ TVPFDD  V+VED  ETQ +NLGGETQVL         DD DC EN++TQLL
Sbjct: 49  DELDYLQSTVPFDDYNVQVEDGLETQALNLGGETQVLNFDGETQVLDDLDCFENMETQLL 108

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSV 557
            E ND    DSD EG + TE+LD  DEVS+ + V
Sbjct: 109 DEFNDAIAADSDSEGMEGTEILDQGDEVSNDEIV 142



 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 56/149 (37%), Positives = 74/149 (49%), Gaps = 10/149 (6%)
 Frame = -1

Query: 436 GEKFKGEDVDELQFLQSTVPVDD-TVPLEDASETQMMNLGDETQAW---------DDPPC 287
           G+K   ED DEL +LQSTVP DD  V +ED  ETQ +NLG ETQ           DD  C
Sbjct: 40  GDKADNEDSDELDYLQSTVPFDDYNVQVEDGLETQALNLGGETQVLNFDGETQVLDDLDC 99

Query: 286 VENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGNDFVKNVHNDLVDPENMLYTTL 107
            ENM TQLL E+++ +  DSD E    TE+L   +   ND +        D    L+   
Sbjct: 100 FENMETQLLDEFNDAIAADSDSEGMEGTEILDQGDEVSNDEIVT-----GDCGQFLF--- 151

Query: 106 CKQNNRGFKAESDAHNEQCSSELNVPTAT 20
             Q     +  + + NEQ +S ++  T T
Sbjct: 152 --QKKESLEQHNASTNEQMNSGIHGSTTT 178


>ref|XP_010040881.1| PREDICTED: uncharacterized protein LOC104429753 [Eucalyptus
           grandis]
          Length = 914

 Score = 94.7 bits (234), Expect = 7e-17
 Identities = 77/231 (33%), Positives = 119/231 (51%), Gaps = 13/231 (5%)
 Frame = -1

Query: 808 DELPFLQETVPFDDTVRVEDTFETQVV---------NLGGETQVLDDPDCVENIDTQLLL 656
           DEL +++ T+ FDDTV ++  FETQ++         + GGETQV+D  D + N++TQLLL
Sbjct: 14  DEL-YVESTMRFDDTVPLDGAFETQILDTGGETQLVDFGGETQVIDCEDGIGNVETQLLL 72

Query: 655 E-CNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSVKRFGNHPVDLATMLHATLCKQGDQGF 479
           + C+ +   DSDGE T  TEVL   DEVSDG  + + G    D   M  +   K  D   
Sbjct: 73  DGCDTQVAFDSDGEDTGGTEVLGSDDEVSDG-GLHQEGGCSRDEKKMSCSPFSK--DSET 129

Query: 478 KAESNALSNEQCGSGEKFKGEDVDELQFLQSTVPVDDTVPLEDASETQMMNLGDETQAWD 299
           K +S AL++E C SG   +G        L+++     ++ L  AS   M    +E     
Sbjct: 130 KEQSAALTDEPCSSGSVRRGFTSIRAASLRASGLAARSMFLNGASSASMRQSSEEQDG-- 187

Query: 298 DPPCVENMGTQLLVEW--DNEVIDDS-DGEATTRTEVLGNQEMSGNDFVKN 155
                E+ GT L   +    +V+D   D +A ++   LGN +++G  +V++
Sbjct: 188 -----EDNGTSLGGAYITSEDVVDTMLDEKADSKEVPLGNDKLAGLSYVES 233


>ref|XP_012482073.1| PREDICTED: uncharacterized protein LOC105796804 isoform X2
           [Gossypium raimondii] gi|763761331|gb|KJB28585.1|
           hypothetical protein B456_005G056800 [Gossypium
           raimondii]
          Length = 1057

 Score = 93.2 bits (230), Expect = 2e-16
 Identities = 53/94 (56%), Positives = 63/94 (67%), Gaps = 10/94 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDDT-VRVEDTFETQVVNLG---------GETQVLDDPDCVENIDTQLL 659
           DEL + Q+TVP DD  V VED  E Q++NLG         GETQVLDD DC ENI+TQLL
Sbjct: 52  DELNYPQDTVPLDDNNVAVEDGLEIQILNLGEETQVLDFGGETQVLDDLDCCENIETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSV 557
              +   ++DS+GEGTD TEV DD DEVSD + V
Sbjct: 112 DAFDVSVVLDSEGEGTDGTEVFDDGDEVSDDEVV 145



 Score = 74.7 bits (182), Expect = 7e-11
 Identities = 60/168 (35%), Positives = 84/168 (50%), Gaps = 12/168 (7%)
 Frame = -1

Query: 487 QGFKAESNALSNEQCGSGEKFKGEDVDELQFLQSTVPVDDT-VPLEDASETQMMNLGDET 311
           Q F+ +S    +  CG  +    ED DEL + Q TVP+DD  V +ED  E Q++NLG+ET
Sbjct: 28  QPFEFDSQFPVSPFCGYRDD--NEDDDELNYPQDTVPLDDNNVAVEDGLEIQILNLGEET 85

Query: 310 QAWD---------DPPCVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGN-QEMSGNDFV 161
           Q  D         D  C EN+ TQLL  +D  V+ DS+GE T  TEV  +  E+S ++ V
Sbjct: 86  QVLDFGGETQVLDDLDCCENIETQLLDAFDVSVVLDSEGEGTDGTEVFDDGDEVSDDEVV 145

Query: 160 KNVHNDLVDPENMLYTTLCKQNNRGFKAESDAHNEQC-SSELNVPTAT 20
                  +  E       C+           A  ++C SS ++VPTAT
Sbjct: 146 IGDCGRSIGHEEKESLEQCR-----------ASTDECRSSGIHVPTAT 182


>gb|KJB28584.1| hypothetical protein B456_005G056800 [Gossypium raimondii]
          Length = 1034

 Score = 93.2 bits (230), Expect = 2e-16
 Identities = 53/94 (56%), Positives = 63/94 (67%), Gaps = 10/94 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDDT-VRVEDTFETQVVNLG---------GETQVLDDPDCVENIDTQLL 659
           DEL + Q+TVP DD  V VED  E Q++NLG         GETQVLDD DC ENI+TQLL
Sbjct: 52  DELNYPQDTVPLDDNNVAVEDGLEIQILNLGEETQVLDFGGETQVLDDLDCCENIETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSV 557
              +   ++DS+GEGTD TEV DD DEVSD + V
Sbjct: 112 DAFDVSVVLDSEGEGTDGTEVFDDGDEVSDDEVV 145



 Score = 74.7 bits (182), Expect = 7e-11
 Identities = 60/168 (35%), Positives = 84/168 (50%), Gaps = 12/168 (7%)
 Frame = -1

Query: 487 QGFKAESNALSNEQCGSGEKFKGEDVDELQFLQSTVPVDDT-VPLEDASETQMMNLGDET 311
           Q F+ +S    +  CG  +    ED DEL + Q TVP+DD  V +ED  E Q++NLG+ET
Sbjct: 28  QPFEFDSQFPVSPFCGYRDD--NEDDDELNYPQDTVPLDDNNVAVEDGLEIQILNLGEET 85

Query: 310 QAWD---------DPPCVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGN-QEMSGNDFV 161
           Q  D         D  C EN+ TQLL  +D  V+ DS+GE T  TEV  +  E+S ++ V
Sbjct: 86  QVLDFGGETQVLDDLDCCENIETQLLDAFDVSVVLDSEGEGTDGTEVFDDGDEVSDDEVV 145

Query: 160 KNVHNDLVDPENMLYTTLCKQNNRGFKAESDAHNEQC-SSELNVPTAT 20
                  +  E       C+           A  ++C SS ++VPTAT
Sbjct: 146 IGDCGRSIGHEEKESLEQCR-----------ASTDECRSSGIHVPTAT 182


>ref|XP_012482072.1| PREDICTED: uncharacterized protein LOC105796804 isoform X1
           [Gossypium raimondii] gi|763761329|gb|KJB28583.1|
           hypothetical protein B456_005G056800 [Gossypium
           raimondii]
          Length = 1136

 Score = 93.2 bits (230), Expect = 2e-16
 Identities = 53/94 (56%), Positives = 63/94 (67%), Gaps = 10/94 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDDT-VRVEDTFETQVVNLG---------GETQVLDDPDCVENIDTQLL 659
           DEL + Q+TVP DD  V VED  E Q++NLG         GETQVLDD DC ENI+TQLL
Sbjct: 52  DELNYPQDTVPLDDNNVAVEDGLEIQILNLGEETQVLDFGGETQVLDDLDCCENIETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSV 557
              +   ++DS+GEGTD TEV DD DEVSD + V
Sbjct: 112 DAFDVSVVLDSEGEGTDGTEVFDDGDEVSDDEVV 145



 Score = 74.7 bits (182), Expect = 7e-11
 Identities = 60/168 (35%), Positives = 84/168 (50%), Gaps = 12/168 (7%)
 Frame = -1

Query: 487 QGFKAESNALSNEQCGSGEKFKGEDVDELQFLQSTVPVDDT-VPLEDASETQMMNLGDET 311
           Q F+ +S    +  CG  +    ED DEL + Q TVP+DD  V +ED  E Q++NLG+ET
Sbjct: 28  QPFEFDSQFPVSPFCGYRDD--NEDDDELNYPQDTVPLDDNNVAVEDGLEIQILNLGEET 85

Query: 310 QAWD---------DPPCVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGN-QEMSGNDFV 161
           Q  D         D  C EN+ TQLL  +D  V+ DS+GE T  TEV  +  E+S ++ V
Sbjct: 86  QVLDFGGETQVLDDLDCCENIETQLLDAFDVSVVLDSEGEGTDGTEVFDDGDEVSDDEVV 145

Query: 160 KNVHNDLVDPENMLYTTLCKQNNRGFKAESDAHNEQC-SSELNVPTAT 20
                  +  E       C+           A  ++C SS ++VPTAT
Sbjct: 146 IGDCGRSIGHEEKESLEQCR-----------ASTDECRSSGIHVPTAT 182


>gb|KHG03195.1| PAX-interacting 1 [Gossypium arboreum]
          Length = 1215

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 52/94 (55%), Positives = 62/94 (65%), Gaps = 10/94 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNL---------GGETQVLDDPDCVENIDTQLL 659
           D+L +LQ TVPFDD  V+VED  ETQ +NL          GETQVLDD DC EN++TQLL
Sbjct: 49  DQLDYLQSTVPFDDYNVQVEDGLETQALNLVGETQVLNFDGETQVLDDLDCFENMETQLL 108

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSV 557
            E ND    DSD EG + TE+LD  DEVS+ + V
Sbjct: 109 DEFNDAIAADSDSEGMEGTEILDQGDEVSNDEIV 142



 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 45/100 (45%), Positives = 56/100 (56%), Gaps = 10/100 (10%)
 Frame = -1

Query: 436 GEKFKGEDVDELQFLQSTVPVDD-TVPLEDASETQMMNL---------GDETQAWDDPPC 287
           G+K   ED D+L +LQSTVP DD  V +ED  ETQ +NL           ETQ  DD  C
Sbjct: 40  GDKADNEDSDQLDYLQSTVPFDDYNVQVEDGLETQALNLVGETQVLNFDGETQVLDDLDC 99

Query: 286 VENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGND 167
            ENM TQLL E+++ +  DSD E    TE+L   +   ND
Sbjct: 100 FENMETQLLDEFNDAIAADSDSEGMEGTEILDQGDEVSND 139


>ref|XP_007035445.1| BRCT domain-containing DNA repair protein, putative isoform 6
           [Theobroma cacao] gi|508714474|gb|EOY06371.1| BRCT
           domain-containing DNA repair protein, putative isoform 6
           [Theobroma cacao]
          Length = 1254

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 53/92 (57%), Positives = 61/92 (66%), Gaps = 10/92 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNL---------GGETQVLDDPDCVENIDTQLL 659
           D L +L  + PFDD  V  ED FETQVVN          GGETQVLDD DC EN++TQLL
Sbjct: 52  DGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGD 563
            E +DE  +D+DGEGTD TEVL D DE S+ D
Sbjct: 112 DEFDDEVALDNDGEGTDVTEVLADGDEDSNDD 143



 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 10/101 (9%)
 Frame = -1

Query: 439 SGEKFKGEDVDELQFLQSTVPVDD-TVPLEDA---------SETQMMNLGDETQAWDDPP 290
           SG+K   ED D LQ+L S+ P DD  VP EDA          ETQ++N G ETQ  DD  
Sbjct: 42  SGDKVDNEDDDGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVD 101

Query: 289 CVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGND 167
           C ENM TQLL E+D+EV  D+DGE T  TEVL + +   ND
Sbjct: 102 CFENMETQLLDEFDDEVALDNDGEGTDVTEVLADGDEDSND 142


>ref|XP_007035444.1| BRCT domain-containing DNA repair protein, putative isoform 5
           [Theobroma cacao] gi|508714473|gb|EOY06370.1| BRCT
           domain-containing DNA repair protein, putative isoform 5
           [Theobroma cacao]
          Length = 1035

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 53/92 (57%), Positives = 61/92 (66%), Gaps = 10/92 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNL---------GGETQVLDDPDCVENIDTQLL 659
           D L +L  + PFDD  V  ED FETQVVN          GGETQVLDD DC EN++TQLL
Sbjct: 52  DGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGD 563
            E +DE  +D+DGEGTD TEVL D DE S+ D
Sbjct: 112 DEFDDEVALDNDGEGTDVTEVLADGDEDSNDD 143



 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 10/101 (9%)
 Frame = -1

Query: 439 SGEKFKGEDVDELQFLQSTVPVDD-TVPLEDA---------SETQMMNLGDETQAWDDPP 290
           SG+K   ED D LQ+L S+ P DD  VP EDA          ETQ++N G ETQ  DD  
Sbjct: 42  SGDKVDNEDDDGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVD 101

Query: 289 CVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGND 167
           C ENM TQLL E+D+EV  D+DGE T  TEVL + +   ND
Sbjct: 102 CFENMETQLLDEFDDEVALDNDGEGTDVTEVLADGDEDSND 142


>ref|XP_007035443.1| BRCT domain-containing DNA repair protein, putative isoform 4
           [Theobroma cacao] gi|508714472|gb|EOY06369.1| BRCT
           domain-containing DNA repair protein, putative isoform 4
           [Theobroma cacao]
          Length = 1140

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 53/92 (57%), Positives = 61/92 (66%), Gaps = 10/92 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNL---------GGETQVLDDPDCVENIDTQLL 659
           D L +L  + PFDD  V  ED FETQVVN          GGETQVLDD DC EN++TQLL
Sbjct: 52  DGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGD 563
            E +DE  +D+DGEGTD TEVL D DE S+ D
Sbjct: 112 DEFDDEVALDNDGEGTDVTEVLADGDEDSNDD 143



 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 10/101 (9%)
 Frame = -1

Query: 439 SGEKFKGEDVDELQFLQSTVPVDD-TVPLEDA---------SETQMMNLGDETQAWDDPP 290
           SG+K   ED D LQ+L S+ P DD  VP EDA          ETQ++N G ETQ  DD  
Sbjct: 42  SGDKVDNEDDDGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVD 101

Query: 289 CVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGND 167
           C ENM TQLL E+D+EV  D+DGE T  TEVL + +   ND
Sbjct: 102 CFENMETQLLDEFDDEVALDNDGEGTDVTEVLADGDEDSND 142


>ref|XP_007035442.1| BRCT domain-containing DNA repair protein, putative isoform 3
           [Theobroma cacao] gi|508714471|gb|EOY06368.1| BRCT
           domain-containing DNA repair protein, putative isoform 3
           [Theobroma cacao]
          Length = 1200

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 53/92 (57%), Positives = 61/92 (66%), Gaps = 10/92 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNL---------GGETQVLDDPDCVENIDTQLL 659
           D L +L  + PFDD  V  ED FETQVVN          GGETQVLDD DC EN++TQLL
Sbjct: 52  DGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGD 563
            E +DE  +D+DGEGTD TEVL D DE S+ D
Sbjct: 112 DEFDDEVALDNDGEGTDVTEVLADGDEDSNDD 143



 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 10/101 (9%)
 Frame = -1

Query: 439 SGEKFKGEDVDELQFLQSTVPVDD-TVPLEDA---------SETQMMNLGDETQAWDDPP 290
           SG+K   ED D LQ+L S+ P DD  VP EDA          ETQ++N G ETQ  DD  
Sbjct: 42  SGDKVDNEDDDGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVD 101

Query: 289 CVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGND 167
           C ENM TQLL E+D+EV  D+DGE T  TEVL + +   ND
Sbjct: 102 CFENMETQLLDEFDDEVALDNDGEGTDVTEVLADGDEDSND 142


>ref|XP_007035440.1| BRCT domain-containing DNA repair protein, putative isoform 1
           [Theobroma cacao] gi|590660596|ref|XP_007035441.1| BRCT
           domain-containing DNA repair protein, putative isoform 1
           [Theobroma cacao] gi|508714469|gb|EOY06366.1| BRCT
           domain-containing DNA repair protein, putative isoform 1
           [Theobroma cacao] gi|508714470|gb|EOY06367.1| BRCT
           domain-containing DNA repair protein, putative isoform 1
           [Theobroma cacao]
          Length = 1225

 Score = 91.7 bits (226), Expect = 6e-16
 Identities = 53/92 (57%), Positives = 61/92 (66%), Gaps = 10/92 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDD-TVRVEDTFETQVVNL---------GGETQVLDDPDCVENIDTQLL 659
           D L +L  + PFDD  V  ED FETQVVN          GGETQVLDD DC EN++TQLL
Sbjct: 52  DGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVDCFENMETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGD 563
            E +DE  +D+DGEGTD TEVL D DE S+ D
Sbjct: 112 DEFDDEVALDNDGEGTDVTEVLADGDEDSNDD 143



 Score = 86.7 bits (213), Expect = 2e-14
 Identities = 51/101 (50%), Positives = 61/101 (60%), Gaps = 10/101 (9%)
 Frame = -1

Query: 439 SGEKFKGEDVDELQFLQSTVPVDD-TVPLEDA---------SETQMMNLGDETQAWDDPP 290
           SG+K   ED D LQ+L S+ P DD  VP EDA          ETQ++N G ETQ  DD  
Sbjct: 42  SGDKVDNEDDDGLQYLWSSAPFDDDNVPGEDAFETQVVNFCGETQVLNFGGETQVLDDVD 101

Query: 289 CVENMGTQLLVEWDNEVIDDSDGEATTRTEVLGNQEMSGND 167
           C ENM TQLL E+D+EV  D+DGE T  TEVL + +   ND
Sbjct: 102 CFENMETQLLDEFDDEVALDNDGEGTDVTEVLADGDEDSND 142


>gb|KHG13951.1| PAX-interacting 1 [Gossypium arboreum]
          Length = 1134

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 52/94 (55%), Positives = 63/94 (67%), Gaps = 10/94 (10%)
 Frame = -1

Query: 808 DELPFLQETVPFDDT-VRVEDTFETQVVNLG---------GETQVLDDPDCVENIDTQLL 659
           DEL + Q+TVP DD  V VED  ETQ++NLG         GETQVLDD D  ENI+TQLL
Sbjct: 52  DELNYPQDTVPLDDNNVAVEDGLETQILNLGEETQVLDFGGETQVLDDLDYCENIETQLL 111

Query: 658 LECNDEGIVDSDGEGTDRTEVLDDTDEVSDGDSV 557
              +   ++DS+GEGTD TEV DD D+VSD + V
Sbjct: 112 DAVDVSVVLDSEGEGTDGTEVFDDGDQVSDDEVV 145



 Score = 71.2 bits (173), Expect = 8e-10
 Identities = 56/151 (37%), Positives = 76/151 (50%), Gaps = 12/151 (7%)
 Frame = -1

Query: 436 GEKFKGEDVDELQFLQSTVPVDDT-VPLEDASETQMMNLGDETQAWD---------DPPC 287
           G++   ED DEL + Q TVP+DD  V +ED  ETQ++NLG+ETQ  D         D   
Sbjct: 43  GDRDDNEDDDELNYPQDTVPLDDNNVAVEDGLETQILNLGEETQVLDFGGETQVLDDLDY 102

Query: 286 VENMGTQLLVEWDNEVIDDSDGEATTRTEVLGN-QEMSGNDFVKNVHNDLVDPENMLYTT 110
            EN+ TQLL   D  V+ DS+GE T  TEV  +  ++S ++ V       +  E      
Sbjct: 103 CENIETQLLDAVDVSVVLDSEGEGTDGTEVFDDGDQVSDDEVVIGDCGRSIGHEEKESLE 162

Query: 109 LCKQNNRGFKAESDAHNEQC-SSELNVPTAT 20
            C+           A  E+C SS + VPTAT
Sbjct: 163 QCR-----------ASTEECRSSGIYVPTAT 182

