BLASTX nr result

ID: Phellodendron21_contig00026529 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00026529
         (1307 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006429954.1 hypothetical protein CICLE_v10011020mg [Citrus cl...   659   0.0  
KDO70786.1 hypothetical protein CISIN_1g044070mg, partial [Citru...   655   0.0  
XP_006481687.1 PREDICTED: uncharacterized protein LOC102624787 i...   654   0.0  
XP_006481685.1 PREDICTED: uncharacterized protein LOC102624787 i...   654   0.0  
XP_018848848.1 PREDICTED: pre-mRNA-splicing factor RSE1 [Juglans...   569   0.0  
EOY09620.1 Cleavage and polyadenylation specificity factor (CPSF...   565   0.0  
OAY43074.1 hypothetical protein MANES_08G040000 [Manihot esculenta]   568   0.0  
EOY09618.1 Cleavage and polyadenylation specificity factor (CPSF...   565   0.0  
XP_007029116.2 PREDICTED: pre-mRNA-splicing factor RSE1 isoform ...   564   0.0  
GAV86134.1 CPSF_A domain-containing protein/MMS1_N domain-contai...   562   0.0  
EEF30789.1 spliceosomal protein sap, putative [Ricinus communis]      554   0.0  
XP_007029117.2 PREDICTED: pre-mRNA-splicing factor RSE1 isoform ...   557   0.0  
XP_015582355.1 PREDICTED: LOW QUALITY PROTEIN: pre-mRNA-splicing...   554   0.0  
XP_012090856.1 PREDICTED: pre-mRNA-splicing factor RSE1 [Jatroph...   550   e-180
XP_015898900.1 PREDICTED: uncharacterized protein LOC107432303 i...   549   e-180
OMO68745.1 hypothetical protein COLO4_29436 [Corchorus olitorius]     540   e-177
OMO51866.1 hypothetical protein CCACVL1_29540 [Corchorus capsula...   537   e-176
XP_017610862.1 PREDICTED: pre-mRNA-splicing factor RSE1 isoform ...   537   e-175
KJB36184.1 hypothetical protein B456_006G145300 [Gossypium raimo...   526   e-174
XP_016669400.1 PREDICTED: splicing factor 3B subunit 3-like [Gos...   531   e-173

>XP_006429954.1 hypothetical protein CICLE_v10011020mg [Citrus clementina] ESR43194.1
            hypothetical protein CICLE_v10011020mg [Citrus
            clementina]
          Length = 926

 Score =  659 bits (1700), Expect = 0.0
 Identities = 328/396 (82%), Positives = 345/396 (87%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEE CS AK               RN+HYLAKCVLKGSVVLQVA+GHLRSPTSNDVVF
Sbjct: 2    AVSEEVCSTAKSRSSPSSSSAPAPPPRNIHYLAKCVLKGSVVLQVAHGHLRSPTSNDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWN+KFN +NSQVMGKDLLVV+SDS
Sbjct: 62   GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNDKFNAQNSQVMGKDLLVVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL FCNEMHRFFPVA V LS+PGNSRHQLGR+LAVDSSGCFIAVSAYEDR      
Sbjct: 122  GKLSFLAFCNEMHRFFPVAQVHLSNPGNSRHQLGRMLAVDSSGCFIAVSAYEDRLGLFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDKKICYP ESE DT ASR AQK  ISGT+WSMCFIS DP QPSKEHNPILA
Sbjct: 182  SMSSGSDIIDKKICYPSESEVDTSASRIAQKNRISGTIWSMCFISTDPRQPSKEHNPILA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNRRG L+NELLLVGWNIR+HAISV+    EAGPLAHS+VEVPRSYG+AFVFRIGDAL
Sbjct: 242  IILNRRGALLNELLLVGWNIREHAISVLSCFFEAGPLAHSVVEVPRSYGFAFVFRIGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRDP  PSCVYRTSLNFLP A+EEQNFV+ESCRVHDVDDEGLFNVAACALLEL DYD
Sbjct: 302  LMDLRDPHNPSCVYRTSLNFLPPALEEQNFVDESCRVHDVDDEGLFNVAACALLELRDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDSDSGNAK+PSK+VCSWSWEPETDKIPKMVFC
Sbjct: 362  PMCIDSDSGNAKEPSKHVCSWSWEPETDKIPKMVFC 397


>KDO70786.1 hypothetical protein CISIN_1g044070mg, partial [Citrus sinensis]
          Length = 903

 Score =  655 bits (1689), Expect = 0.0
 Identities = 326/396 (82%), Positives = 344/396 (86%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEE CS AK               RN+HYLAKCVLKGSVVLQVA+GHLRSPTSNDVVF
Sbjct: 2    AVSEEVCSTAKSRSSPSSSSAPASPPRNIHYLAKCVLKGSVVLQVAHGHLRSPTSNDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWN+KFN +NSQVMGKDLLVV+SDS
Sbjct: 62   GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNKKFNAQNSQVMGKDLLVVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL FCNEMHRFFPVA V LS+PGNSRHQLGR+LAVDSSGC IAVSAYEDR      
Sbjct: 122  GKLSFLAFCNEMHRFFPVAQVHLSNPGNSRHQLGRMLAVDSSGCLIAVSAYEDRLGLFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDKKICYP ESE DT ASR AQK +ISGT+WSMCFIS DP QPSKEHNPILA
Sbjct: 182  SMSSGSDIIDKKICYPSESEVDTSASRIAQKNSISGTIWSMCFISTDPRQPSKEHNPILA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNRRG L+NELLLVGWNIR+HAISV+    EAGPLAH +VEVPRSYG+AFVFRIGDAL
Sbjct: 242  IILNRRGALLNELLLVGWNIREHAISVLSCFFEAGPLAHCVVEVPRSYGFAFVFRIGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRDP  PSCVYRTSLNFLP A+EEQNFV+ESCRVHDVDDEGLFNVAACALLEL DYD
Sbjct: 302  LMDLRDPHNPSCVYRTSLNFLPPALEEQNFVDESCRVHDVDDEGLFNVAACALLELRDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDSDSGNAK+PSK+VCSWSWEPETDKIPKMVFC
Sbjct: 362  PMCIDSDSGNAKEPSKHVCSWSWEPETDKIPKMVFC 397


>XP_006481687.1 PREDICTED: uncharacterized protein LOC102624787 isoform X3 [Citrus
            sinensis]
          Length = 1182

 Score =  654 bits (1686), Expect = 0.0
 Identities = 325/396 (82%), Positives = 344/396 (86%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEE CS AK               RN+HYLAKCVLKGSVVLQVA+GHLRSPTSNDVVF
Sbjct: 2    AVSEEVCSTAKSRSSPSSSSAPASPPRNIHYLAKCVLKGSVVLQVAHGHLRSPTSNDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWN+KFN +NSQ+MGKDLLVV+SDS
Sbjct: 62   GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNKKFNAQNSQLMGKDLLVVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL FCNEMHRFFPVA V LS+PGNSRHQLGR+LAVDSSGC IAVSAYEDR      
Sbjct: 122  GKLSFLAFCNEMHRFFPVAQVHLSNPGNSRHQLGRMLAVDSSGCLIAVSAYEDRLGLFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDKKICYP ESE DT ASR AQK +ISGT+WSMCFIS DP QPSKEHNPILA
Sbjct: 182  SMSSGSDIIDKKICYPSESEVDTSASRIAQKNSISGTIWSMCFISTDPRQPSKEHNPILA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNRRG L+NELLLVGWNIR+HAISV+    EAGPLAH +VEVPRSYG+AFVFRIGDAL
Sbjct: 242  IILNRRGALLNELLLVGWNIREHAISVLSCFFEAGPLAHCVVEVPRSYGFAFVFRIGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRDP  PSCVYRTSLNFLP A+EEQNFV+ESCRVHDVDDEGLFNVAACALLEL DYD
Sbjct: 302  LMDLRDPHNPSCVYRTSLNFLPPALEEQNFVDESCRVHDVDDEGLFNVAACALLELRDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDSDSGNAK+PSK+VCSWSWEPETDKIPKMVFC
Sbjct: 362  PMCIDSDSGNAKEPSKHVCSWSWEPETDKIPKMVFC 397


>XP_006481685.1 PREDICTED: uncharacterized protein LOC102624787 isoform X1 [Citrus
            sinensis]
          Length = 1394

 Score =  654 bits (1686), Expect = 0.0
 Identities = 325/396 (82%), Positives = 344/396 (86%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEE CS AK               RN+HYLAKCVLKGSVVLQVA+GHLRSPTSNDVVF
Sbjct: 2    AVSEEVCSTAKSRSSPSSSSAPASPPRNIHYLAKCVLKGSVVLQVAHGHLRSPTSNDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWN+KFN +NSQ+MGKDLLVV+SDS
Sbjct: 62   GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNKKFNAQNSQLMGKDLLVVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL FCNEMHRFFPVA V LS+PGNSRHQLGR+LAVDSSGC IAVSAYEDR      
Sbjct: 122  GKLSFLAFCNEMHRFFPVAQVHLSNPGNSRHQLGRMLAVDSSGCLIAVSAYEDRLGLFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDKKICYP ESE DT ASR AQK +ISGT+WSMCFIS DP QPSKEHNPILA
Sbjct: 182  SMSSGSDIIDKKICYPSESEVDTSASRIAQKNSISGTIWSMCFISTDPRQPSKEHNPILA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNRRG L+NELLLVGWNIR+HAISV+    EAGPLAH +VEVPRSYG+AFVFRIGDAL
Sbjct: 242  IILNRRGALLNELLLVGWNIREHAISVLSCFFEAGPLAHCVVEVPRSYGFAFVFRIGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRDP  PSCVYRTSLNFLP A+EEQNFV+ESCRVHDVDDEGLFNVAACALLEL DYD
Sbjct: 302  LMDLRDPHNPSCVYRTSLNFLPPALEEQNFVDESCRVHDVDDEGLFNVAACALLELRDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDSDSGNAK+PSK+VCSWSWEPETDKIPKMVFC
Sbjct: 362  PMCIDSDSGNAKEPSKHVCSWSWEPETDKIPKMVFC 397


>XP_018848848.1 PREDICTED: pre-mRNA-splicing factor RSE1 [Juglans regia]
          Length = 1381

 Score =  569 bits (1467), Expect = 0.0
 Identities = 280/396 (70%), Positives = 322/396 (81%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEEECS AK                + HYLAKCVLKGSVVLQV YGH+RSPT  DVVF
Sbjct: 2    AVSEEECSSAKSRSSSPASSS------STHYLAKCVLKGSVVLQVLYGHIRSPTYLDVVF 55

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGIVQSVCEQ VFGTIKD+A++PWNEKF+ RN Q++GKDLLVV+SDS
Sbjct: 56   GKETSIELVIIGEDGIVQSVCEQPVFGTIKDIAILPWNEKFHVRNPQMIGKDLLVVISDS 115

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFCNEMHRFFP+ HVQLS+PGNSRHQLGR+LAV++SGCFIA SAYEDR      
Sbjct: 116  GKLSFLTFCNEMHRFFPLTHVQLSNPGNSRHQLGRMLAVNTSGCFIAASAYEDRLALFSI 175

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIID++I YPPE EGD    RS QK +I GT+WSMCFIS+DP QPSKEHNP+LA
Sbjct: 176  SMSNGSDIIDERIIYPPEHEGDGSIGRSIQKNSIRGTIWSMCFISQDPNQPSKEHNPVLA 235

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNRRGE++NELLL+GWN+R+ ++ VI    EAGPLAH+IVEVP SYG+AF+FR+GDAL
Sbjct: 236  ILLNRRGEVMNELLLLGWNMRECSVFVISHCREAGPLAHNIVEVPYSYGFAFLFRVGDAL 295

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD   P CVYRTSLNFLP+++ E N  EESCRVHDVDDEGLFNVAA +LLEL D D
Sbjct: 296  LMDLRDAQNPCCVYRTSLNFLPNSVYEPNLAEESCRVHDVDDEGLFNVAA-SLLELKDCD 354

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID D+GN    +K+VCSWSWEPE  KIP+++FC
Sbjct: 355  PMCIDGDNGNVSSANKHVCSWSWEPEIHKIPRLIFC 390


>EOY09620.1 Cleavage and polyadenylation specificity factor (CPSF) A subunit
            protein isoform 3 [Theobroma cacao]
          Length = 1254

 Score =  565 bits (1456), Expect = 0.0
 Identities = 272/396 (68%), Positives = 321/396 (81%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GSVVLQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPSSSSATASSQGVNYLAKCVLRGSVVLQVAYGHLRSPSSFDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVI+GEDGIV S+CEQ VFGTIKDLA++PWNEK   RN Q+ GKDLL+V+SDS
Sbjct: 62   GKETSIELVIMGEDGIVTSICEQTVFGTIKDLAILPWNEKVCARNPQMRGKDLLIVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFC EMHRFFPVAHVQLSDPGNSRHQLGR+LAVDS+GCFIA SAYEDR      
Sbjct: 122  GKLSFLTFCIEMHRFFPVAHVQLSDPGNSRHQLGRMLAVDSTGCFIATSAYEDRLALFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIID++I YPPE+EG   ++RSAQ+T+I GT+WSMCF+SKD  QP+KEHNP+LA
Sbjct: 182  SMSAGDDIIDERIFYPPENEGSVSSTRSAQRTSIRGTIWSMCFVSKDSFQPNKEHNPVLA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNR+G  +NEL+L+GWNI++ A+ V+   LEAGPLAHSIVEVP S G+AF+ R+GDAL
Sbjct: 242  IVLNRKGNALNELVLLGWNIKERAVYVVSQYLEAGPLAHSIVEVPHSCGFAFLLRVGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDL D   P CVYRT+LNF    +EEQNF+E+S R HDVDDEGLFNVAACALL+L DYD
Sbjct: 302  LMDLSDAHNPHCVYRTTLNFSGHTLEEQNFIEDSFRAHDVDDEGLFNVAACALLQLSDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID DSGN K   K+VCS+SWEP++D+ P+M+FC
Sbjct: 362  PMCIDGDSGNGKFTCKHVCSFSWEPKSDRSPRMIFC 397


>OAY43074.1 hypothetical protein MANES_08G040000 [Manihot esculenta]
          Length = 1386

 Score =  568 bits (1464), Expect = 0.0
 Identities = 277/396 (69%), Positives = 320/396 (80%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEEECS AK                  +YLAKCVL+GSVVLQV YGH RSP+S+D+VF
Sbjct: 2    AVSEEECSNAKSRSSSPSA-------NGAYYLAKCVLRGSVVLQVVYGHFRSPSSSDIVF 54

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVII  DGIV S+CEQ VFGTIKDLAV+PWN+KF+ R+ Q+ GKDLL V+SDS
Sbjct: 55   GKETSIELVIIDADGIVHSICEQPVFGTIKDLAVIPWNDKFHARSPQMQGKDLLAVLSDS 114

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFC+EMHRFFP+ HVQLS+PGNSR QLGR+LAVDSSGCFIA SAY DR      
Sbjct: 115  GKLSFLTFCSEMHRFFPLTHVQLSNPGNSRQQLGRMLAVDSSGCFIATSAYVDRLALFSL 174

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDK+I YPPE+EG T ++R  Q+ +ISGT+WSMCFIS+D  Q SKEHNP+LA
Sbjct: 175  SLSGASDIIDKQIFYPPENEGHTSSTRIIQRPSISGTIWSMCFISRDSSQSSKEHNPVLA 234

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNRRG L+NELLL+GWNIR+  I+VI   +EAGP+AH I+EVP S G+AF+FR+GDAL
Sbjct: 235  IILNRRGALLNELLLLGWNIREQTINVISLYVEAGPIAHDIIEVPHSNGFAFLFRVGDAL 294

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD   PSCVYRTSLNFLP+++EEQ FVEE CRVHDVDD+GLFNVAACALLEL DYD
Sbjct: 295  LMDLRDAHNPSCVYRTSLNFLPASVEEQTFVEEPCRVHDVDDDGLFNVAACALLELRDYD 354

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDS+ GN K  SKYVCSWSWEPE +K P+M+FC
Sbjct: 355  PMCIDSEGGNVKSASKYVCSWSWEPEVNKNPRMIFC 390


>EOY09618.1 Cleavage and polyadenylation specificity factor (CPSF) A subunit
            protein isoform 1 [Theobroma cacao]
          Length = 1391

 Score =  565 bits (1456), Expect = 0.0
 Identities = 272/396 (68%), Positives = 321/396 (81%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GSVVLQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPSSSSATASSQGVNYLAKCVLRGSVVLQVAYGHLRSPSSFDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVI+GEDGIV S+CEQ VFGTIKDLA++PWNEK   RN Q+ GKDLL+V+SDS
Sbjct: 62   GKETSIELVIMGEDGIVTSICEQTVFGTIKDLAILPWNEKVCARNPQMRGKDLLIVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFC EMHRFFPVAHVQLSDPGNSRHQLGR+LAVDS+GCFIA SAYEDR      
Sbjct: 122  GKLSFLTFCIEMHRFFPVAHVQLSDPGNSRHQLGRMLAVDSTGCFIATSAYEDRLALFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIID++I YPPE+EG   ++RSAQ+T+I GT+WSMCF+SKD  QP+KEHNP+LA
Sbjct: 182  SMSAGDDIIDERIFYPPENEGSVSSTRSAQRTSIRGTIWSMCFVSKDSFQPNKEHNPVLA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNR+G  +NEL+L+GWNI++ A+ V+   LEAGPLAHSIVEVP S G+AF+ R+GDAL
Sbjct: 242  IVLNRKGNALNELVLLGWNIKERAVYVVSQYLEAGPLAHSIVEVPHSCGFAFLLRVGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDL D   P CVYRT+LNF    +EEQNF+E+S R HDVDDEGLFNVAACALL+L DYD
Sbjct: 302  LMDLSDAHNPHCVYRTTLNFSGHTLEEQNFIEDSFRAHDVDDEGLFNVAACALLQLSDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID DSGN K   K+VCS+SWEP++D+ P+M+FC
Sbjct: 362  PMCIDGDSGNGKFTCKHVCSFSWEPKSDRSPRMIFC 397


>XP_007029116.2 PREDICTED: pre-mRNA-splicing factor RSE1 isoform X2 [Theobroma cacao]
          Length = 1391

 Score =  564 bits (1454), Expect = 0.0
 Identities = 272/396 (68%), Positives = 321/396 (81%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GSVVLQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPSSSSATASSQGVNYLAKCVLRGSVVLQVAYGHLRSPSSFDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVI+GEDGIV S+CEQ VFGTIKDLA++PWNEK   RN Q+ GKDLL+V+SDS
Sbjct: 62   GKETSIELVIMGEDGIVTSICEQTVFGTIKDLAILPWNEKVCARNPQMRGKDLLIVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFC EMHRFFPVAHVQLSDPGNSRHQLGR+LAVDS+GCFIA SAYEDR      
Sbjct: 122  GKLSFLTFCIEMHRFFPVAHVQLSDPGNSRHQLGRMLAVDSTGCFIATSAYEDRLALFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIID++I YPPE+EG   ++RSAQ+T+I GT+WSMCF+SKD  QP+KEHNP+LA
Sbjct: 182  SMSAGDDIIDERIFYPPENEGSVSSTRSAQRTSIRGTIWSMCFVSKDSFQPNKEHNPVLA 241

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNR+G  +NEL+L+GWNI++ A+ V+   LEAGPLAHSIVEVP S G+AF+ R+GDAL
Sbjct: 242  IVLNRKGNALNELVLLGWNIKERAVYVVSQYLEAGPLAHSIVEVPDSCGFAFLLRVGDAL 301

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDL D   P CVYRT+LNF    +EEQNF+E+S R HDVDDEGLFNVAACALL+L DYD
Sbjct: 302  LMDLSDAHNPHCVYRTTLNFSGHTLEEQNFIEDSFRAHDVDDEGLFNVAACALLQLSDYD 361

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID DSGN K   K+VCS+SWEP++D+ P+M+FC
Sbjct: 362  PMCIDGDSGNGKFTCKHVCSFSWEPKSDRSPRMIFC 397


>GAV86134.1 CPSF_A domain-containing protein/MMS1_N domain-containing protein,
            partial [Cephalotus follicularis]
          Length = 1391

 Score =  562 bits (1448), Expect = 0.0
 Identities = 280/414 (67%), Positives = 326/414 (78%), Gaps = 18/414 (4%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXR-----NVHYLAKCVLKGSVVLQVAYGHLRSPTS 283
            AVSEEECS AK                     +VHYLAKCVLKGSVVLQVAYGHLRS +S
Sbjct: 2    AVSEEECSNAKARSSPPSSSSSAAPQPPSPNGSVHYLAKCVLKGSVVLQVAYGHLRSSSS 61

Query: 284  NDVVFGKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLV 463
            +DVVFGKETSIELVIIGEDG+VQSVCEQ VFGTI+DLA++PWNEKF  RN Q++GKDLLV
Sbjct: 62   SDVVFGKETSIELVIIGEDGVVQSVCEQVVFGTIRDLAIIPWNEKFRARNPQMLGKDLLV 121

Query: 464  VVSDSGKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDS-------------S 604
            V+SDSGKLSFL+FCNEMHRFFPV H+QLS+PGNSRHQLGR+LAVDS             S
Sbjct: 122  VLSDSGKLSFLSFCNEMHRFFPVTHIQLSNPGNSRHQLGRMLAVDSRQAAFVRRRIEIKS 181

Query: 605  GCFIAVSAYEDRXXXXXXXXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMC 784
            GCFIA SAYEDR            DIIDKKI +PPE+EG+   +RS Q+ ++SGT+WS+C
Sbjct: 182  GCFIAASAYEDRLAVFSLSMSVDSDIIDKKIFHPPENEGEASTARSLQRISMSGTIWSLC 241

Query: 785  FISKDPCQPSKEHNPILAIILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIV 964
            FISKD  QPSKE NP+LA++LNR G  +NELLL+GWNI+++A+ VI   +EAGPLAHSIV
Sbjct: 242  FISKDSSQPSKEDNPVLAMLLNRSGAHLNELLLLGWNIKENALHVISHYVEAGPLAHSIV 301

Query: 965  EVPRSYGYAFVFRIGDALLMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDD 1144
            EVP S G+AF+FR+GD LLMDLRD   P CVYRTSLNFLPSA+EEQ+FVEESC+VHDVDD
Sbjct: 302  EVPHSSGFAFLFRVGDVLLMDLRDAENPCCVYRTSLNFLPSAVEEQDFVEESCKVHDVDD 361

Query: 1145 EGLFNVAACALLELGDYDPMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            EGLFNVAACALLEL DYDPM ID +  +A   SK+VCSW WEP+ D+ P+M+FC
Sbjct: 362  EGLFNVAACALLELRDYDPMCIDIEDVSASSISKHVCSWCWEPKCDEYPRMIFC 415


>EEF30789.1 spliceosomal protein sap, putative [Ricinus communis]
          Length = 1220

 Score =  554 bits (1427), Expect = 0.0
 Identities = 277/396 (69%), Positives = 317/396 (80%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEEECS AK                + HYLAKCVL+GSVVLQV YGH RSP+SND+VF
Sbjct: 2    AVSEEECSNAKSRSSSPSASS-----NSAHYLAKCVLRGSVVLQVVYGHFRSPSSNDIVF 56

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGI+QS+CEQ VFGTIKDLAV+PWNEKF TR+ Q+ GKDLL V SDS
Sbjct: 57   GKETSIELVIIGEDGILQSICEQPVFGTIKDLAVIPWNEKFCTRSPQMHGKDLLAVTSDS 116

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL FCNEMHRFFP+ H+QLS+ GNS  QLGRLLAVD+SGCFIA SAY DR      
Sbjct: 117  GKLSFLIFCNEMHRFFPLTHIQLSNSGNSIRQLGRLLAVDTSGCFIATSAYVDRLALFSL 176

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIID++I YPPESEG T  +RS Q+ NISGT+WS+CFIS+D  Q SKEHNP+LA
Sbjct: 177  SITGSSDIIDEQIFYPPESEGHTSFTRSIQRPNISGTIWSICFISRDLSQSSKEHNPVLA 236

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNR  EL+NELLL+ WNIR H I+VI P++EAGP+ H IVEVP S G+AF+FR+GDAL
Sbjct: 237  IILNRSSELLNELLLLEWNIRGHTINVI-PNVEAGPI-HDIVEVPHSNGFAFLFRVGDAL 294

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD  +P  V +TS +FLP+AMEEQNFVE+SCRVHDVDD+ LFNVAACALL+L DYD
Sbjct: 295  LMDLRDAHHPCRVCKTSFSFLPAAMEEQNFVEDSCRVHDVDDDSLFNVAACALLQLQDYD 354

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDS+ G+ K  SKYVCSWSWEPE DK PKM+FC
Sbjct: 355  PMCIDSEGGSVKSTSKYVCSWSWEPEPDKNPKMIFC 390


>XP_007029117.2 PREDICTED: pre-mRNA-splicing factor RSE1 isoform X1 [Theobroma cacao]
          Length = 1401

 Score =  557 bits (1436), Expect = 0.0
 Identities = 273/406 (67%), Positives = 321/406 (79%), Gaps = 10/406 (2%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GSVVLQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPSSSSATASSQGVNYLAKCVLRGSVVLQVAYGHLRSPSSFDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVM----------G 448
            GKETSIELVI+GEDGIV S+CEQ VFGTIKDLA++PWNEK   RN QV           G
Sbjct: 62   GKETSIELVIMGEDGIVTSICEQTVFGTIKDLAILPWNEKVCARNPQVCTETYNGSIMRG 121

Query: 449  KDLLVVVSDSGKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSA 628
            KDLL+V+SDSGKLSFLTFC EMHRFFPVAHVQLSDPGNSRHQLGR+LAVDS+GCFIA SA
Sbjct: 122  KDLLIVISDSGKLSFLTFCIEMHRFFPVAHVQLSDPGNSRHQLGRMLAVDSTGCFIATSA 181

Query: 629  YEDRXXXXXXXXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQ 808
            YEDR            DIID++I YPPE+EG   ++RSAQ+T+I GT+WSMCF+SKD  Q
Sbjct: 182  YEDRLALFSLSMSAGDDIIDERIFYPPENEGSVSSTRSAQRTSIRGTIWSMCFVSKDSFQ 241

Query: 809  PSKEHNPILAIILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGY 988
            P+KEHNP+LAI+LNR+G  +NEL+L+GWNI++ A+ V+   LEAGPLAHSIVEVP S G+
Sbjct: 242  PNKEHNPVLAIVLNRKGNALNELVLLGWNIKERAVYVVSQYLEAGPLAHSIVEVPDSCGF 301

Query: 989  AFVFRIGDALLMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAA 1168
            AF+ R+GDALLMDL D   P CVYRT+LNF    +EEQNF+E+S R HDVDDEGLFNVAA
Sbjct: 302  AFLLRVGDALLMDLSDAHNPHCVYRTTLNFSGHTLEEQNFIEDSFRAHDVDDEGLFNVAA 361

Query: 1169 CALLELGDYDPMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            CALL+L DYDPM ID DSGN K   K+VCS+SWEP++D+ P+M+FC
Sbjct: 362  CALLQLSDYDPMCIDGDSGNGKFTCKHVCSFSWEPKSDRSPRMIFC 407


>XP_015582355.1 PREDICTED: LOW QUALITY PROTEIN: pre-mRNA-splicing factor RSE1
            [Ricinus communis]
          Length = 1317

 Score =  554 bits (1427), Expect = 0.0
 Identities = 277/396 (69%), Positives = 317/396 (80%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEEECS AK                + HYLAKCVL+GSVVLQV YGH RSP+SND+VF
Sbjct: 2    AVSEEECSNAKSRSSSPSASS-----NSAHYLAKCVLRGSVVLQVVYGHFRSPSSNDIVF 56

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGI+QS+CEQ VFGTIKDLAV+PWNEKF TR+ Q+ GKDLL V SDS
Sbjct: 57   GKETSIELVIIGEDGILQSICEQPVFGTIKDLAVIPWNEKFCTRSPQMHGKDLLAVTSDS 116

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL FCNEMHRFFP+ H+QLS+ GNS  QLGRLLAVD+SGCFIA SAY DR      
Sbjct: 117  GKLSFLIFCNEMHRFFPLTHIQLSNSGNSIRQLGRLLAVDTSGCFIATSAYVDRLALFSL 176

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIID++I YPPESEG T  +RS Q+ NISGT+WS+CFIS+D  Q SKEHNP+LA
Sbjct: 177  SITGSSDIIDEQIFYPPESEGHTSFTRSIQRPNISGTIWSICFISRDLSQSSKEHNPVLA 236

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNR  EL+NELLL+ WNIR H I+VI P++EAGP+ H IVEVP S G+AF+FR+GDAL
Sbjct: 237  IILNRSSELLNELLLLEWNIRGHTINVI-PNVEAGPI-HDIVEVPHSNGFAFLFRVGDAL 294

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD  +P  V +TS +FLP+AMEEQNFVE+SCRVHDVDD+ LFNVAACALL+L DYD
Sbjct: 295  LMDLRDAHHPCRVCKTSFSFLPAAMEEQNFVEDSCRVHDVDDDSLFNVAACALLQLQDYD 354

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDS+ G+ K  SKYVCSWSWEPE DK PKM+FC
Sbjct: 355  PMCIDSEGGSVKSTSKYVCSWSWEPEPDKNPKMIFC 390


>XP_012090856.1 PREDICTED: pre-mRNA-splicing factor RSE1 [Jatropha curcas]
          Length = 1386

 Score =  550 bits (1416), Expect = e-180
 Identities = 272/396 (68%), Positives = 313/396 (79%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEEECS AK                  HYLAKCVL+GS VLQV YGH RS +SND++F
Sbjct: 2    AVSEEECSNAKSRSSSPSATL-----NGTHYLAKCVLRGSAVLQVVYGHFRSSSSNDIIF 56

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETS+ELVIIGE+GIV+SVCEQ +FGTIKDLAV+P N K + R+ Q   KDLL VVSDS
Sbjct: 57   GKETSVELVIIGEEGIVESVCEQPIFGTIKDLAVIPSNGKLHARSPQE--KDLLAVVSDS 114

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFCNEM RFFP+  VQLS PGNSRHQLGR+LAVDSSGCFIA SAY D+      
Sbjct: 115  GKLSFLTFCNEMLRFFPLTQVQLSSPGNSRHQLGRMLAVDSSGCFIASSAYVDQLALFSL 174

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  D+IDK+I YPPE+EG T  +RS  K +ISGT+WSMCFIS+D CQ SKEHNP+LA
Sbjct: 175  SVSGGSDLIDKRIFYPPENEGQTSFTRSIHKPSISGTIWSMCFISRDSCQSSKEHNPVLA 234

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            IILNRRG L+NELLL+ WNI +HAI+VI   +EAGP+AH I+EVP S G+AF+FR+GDAL
Sbjct: 235  IILNRRGALLNELLLLEWNIGEHAINVISLYVEAGPIAHDIIEVPHSNGFAFLFRVGDAL 294

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD   P C+YRTSLNFLP+A+EEQNFVEESCRVHDVDD+GLFNVAACALLEL DYD
Sbjct: 295  LMDLRDAHNPCCIYRTSLNFLPTAVEEQNFVEESCRVHDVDDDGLFNVAACALLELRDYD 354

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM IDS+  N K  S Y+CSWSW PE+DK P+M+FC
Sbjct: 355  PMCIDSEGSNIKSTSNYMCSWSWGPESDKNPRMIFC 390


>XP_015898900.1 PREDICTED: uncharacterized protein LOC107432303 isoform X1 [Ziziphus
            jujuba]
          Length = 1387

 Score =  549 bits (1415), Expect = e-180
 Identities = 268/396 (67%), Positives = 312/396 (78%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            AVSEEECS AK                  HYLAKCVL+GSVVLQV YGH+RSP+S DVVF
Sbjct: 2    AVSEEECSSAKSRSSSSASSSN-------HYLAKCVLRGSVVLQVVYGHIRSPSSLDVVF 54

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKE SIELVIIGEDGIVQSV EQ VFGTIKDLA++PWN+KF +RN Q++GKDLL+V+SDS
Sbjct: 55   GKENSIELVIIGEDGIVQSVSEQPVFGTIKDLAILPWNDKFRSRNPQMLGKDLLIVISDS 114

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFL+F NEMHRFFPV  VQLS+PGNSR+QLGR+LAVDSSGCFIA SAYE+R      
Sbjct: 115  GKLSFLSFSNEMHRFFPVTQVQLSNPGNSRNQLGRMLAVDSSGCFIAASAYENRLAMFSV 174

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDKKI YP E+E D   +RS  K +ISGT+WSMCFISKDP QPSK H+P+LA
Sbjct: 175  SVSAGSDIIDKKIMYPSENEADVITARSVHKNSISGTIWSMCFISKDPNQPSKGHDPVLA 234

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNRRG L+ ELLL+GWNIRDH+I ++   +EAGP A+ + EVP  YG+A +FR+GDAL
Sbjct: 235  ILLNRRGALLTELLLLGWNIRDHSICILSQYVEAGPFAYDVAEVPHCYGFAIIFRVGDAL 294

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            +M+LRD   P CVYRT+LNF P+A+EEQNFV+ESCRVHDVDDEGLFNVAACALLEL DYD
Sbjct: 295  IMNLRDAHAPCCVYRTNLNFSPNAVEEQNFVDESCRVHDVDDEGLFNVAACALLELRDYD 354

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID+DS N     K  C+WSWEP   K P+M+FC
Sbjct: 355  PMCIDADSDNLNSTYKRACAWSWEPGNAKNPRMIFC 390


>OMO68745.1 hypothetical protein COLO4_29436 [Corchorus olitorius]
          Length = 1394

 Score =  540 bits (1392), Expect = e-177
 Identities = 265/398 (66%), Positives = 315/398 (79%), Gaps = 2/398 (0%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXX--RNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDV 292
            A+SEEECS AK                 + V+YLAKCVL+GSVVLQVAYGHLRSP+S+DV
Sbjct: 2    ALSEEECSTAKASSSSSSSSSSAAGSSSQGVNYLAKCVLRGSVVLQVAYGHLRSPSSSDV 61

Query: 293  VFGKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVS 472
            VFGKETSIELVIIGEDGI  SVCEQ VFGTIKDLA++PWNEK   +NSQ+ GKDLL+VVS
Sbjct: 62   VFGKETSIELVIIGEDGIATSVCEQTVFGTIKDLAILPWNEKLGAQNSQMHGKDLLIVVS 121

Query: 473  DSGKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXX 652
            DSGKL+FL+FCNEMHRFFPVA+VQLSDPGNSRHQLG++LAVDS+G FIA SA+EDR    
Sbjct: 122  DSGKLAFLSFCNEMHRFFPVANVQLSDPGNSRHQLGKMLAVDSTGSFIATSAHEDRLALF 181

Query: 653  XXXXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPI 832
                    DIID++I YPPE+EG   ++RS Q+T+I GT+WSMCF+SKD  QP +E+NP+
Sbjct: 182  SLSMSAEGDIIDERIFYPPENEGSGTSTRSVQRTSIRGTIWSMCFVSKDSVQPHEENNPV 241

Query: 833  LAIILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGD 1012
            LA++L R+G  +NEL+L+ WNIR+ A+ V+   LEAGPLAHSIVEVP S G+AF+ R GD
Sbjct: 242  LAVVLTRKGNTLNELVLLRWNIRERAVYVLSQYLEAGPLAHSIVEVPHSCGFAFLLRAGD 301

Query: 1013 ALLMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGD 1192
            ALLMDLRD   P CVYRT+LNF    +EEQNF EES R HDVDDEGLFNVAACALL+L D
Sbjct: 302  ALLMDLRDAHNPHCVYRTNLNFSAHTLEEQNFAEESSRAHDVDDEGLFNVAACALLQLSD 361

Query: 1193 YDPMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            YDPM ID DSGN K   KYVCS+SWE ++D+  +M+FC
Sbjct: 362  YDPMCIDGDSGNCKLDCKYVCSFSWETKSDRSARMIFC 399


>OMO51866.1 hypothetical protein CCACVL1_29540 [Corchorus capsularis]
          Length = 1381

 Score =  537 bits (1383), Expect = e-176
 Identities = 263/397 (66%), Positives = 313/397 (78%), Gaps = 2/397 (0%)
 Frame = +2

Query: 122  VSEEECSMAKXXXXXXXXXXXXXXX--RNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVV 295
            +SEEECS AK                 + V+YLAKCVL+GSVVLQVAYGHLRSP+S+DVV
Sbjct: 3    LSEEECSTAKASSSSSSSSSSAAGSSSQGVNYLAKCVLRGSVVLQVAYGHLRSPSSSDVV 62

Query: 296  FGKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSD 475
            FGKETSIELVIIGEDGI  SVCEQ VFGTIKDLA++PWNEK   +NSQ+ GKDLL+V SD
Sbjct: 63   FGKETSIELVIIGEDGIATSVCEQTVFGTIKDLAILPWNEKLGAQNSQMHGKDLLIVASD 122

Query: 476  SGKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXX 655
            SGKL+FL+FCNEMHRFFPVA+VQLSDPGNSRHQLG++LAVDS+G FIA SA+EDR     
Sbjct: 123  SGKLAFLSFCNEMHRFFPVANVQLSDPGNSRHQLGKMLAVDSTGSFIATSAHEDRLALFS 182

Query: 656  XXXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPIL 835
                   DIID+KI YPPE+EG   ++RS Q+T+I GT+WSMCF+SKD  QP +E+NP++
Sbjct: 183  LSMSAEGDIIDEKIFYPPENEGSGSSTRSVQRTSIRGTIWSMCFVSKDSVQPHEENNPVV 242

Query: 836  AIILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDA 1015
            A++L R+G  +NEL+L+ WNIR+ A+ V+   LEAGPLAHSIVEVP S G+AF+ R GDA
Sbjct: 243  AVVLTRKGNTLNELVLLRWNIRERAVYVLSQYLEAGPLAHSIVEVPHSCGFAFLLRAGDA 302

Query: 1016 LLMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDY 1195
            LLMDLRD   P CVYRT+LNF    +EEQNF EES R HDVDDEGLFNVAACALL+L DY
Sbjct: 303  LLMDLRDAHNPHCVYRTNLNFSAHTLEEQNFAEESSRAHDVDDEGLFNVAACALLQLSDY 362

Query: 1196 DPMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            DPM ID DSGN K   KYVCS+SWE ++D+  +M+FC
Sbjct: 363  DPMCIDGDSGNCKLDCKYVCSFSWETKSDRSMRMIFC 399


>XP_017610862.1 PREDICTED: pre-mRNA-splicing factor RSE1 isoform X1 [Gossypium
            arboreum]
          Length = 1387

 Score =  537 bits (1383), Expect = e-175
 Identities = 259/396 (65%), Positives = 313/396 (79%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GS +LQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPASSSATVSSQGVNYLAKCVLRGSAILQVAYGHLRSPSSLDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGI  SVCEQ VFGTIKDLA++PWNEK   +N+Q+ GKDLLV++SDS
Sbjct: 62   GKETSIELVIIGEDGIATSVCEQTVFGTIKDLAILPWNEKVYGQNTQMPGKDLLVIISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFCNEMHRFFPV H+QLSDPGN+R Q+GRLLAVDS+G FIA SAYEDR      
Sbjct: 122  GKLSFLTFCNEMHRFFPVDHIQLSDPGNARDQIGRLLAVDSAGRFIATSAYEDRLAFFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DI+DKKI YPPE+EG   ++R+AQ+T+I GT+WSMCF+SKDP Q +KEHNP+LA
Sbjct: 182  SMSGD-DIVDKKIFYPPENEGSGSSTRNAQRTSIRGTIWSMCFVSKDPIQTNKEHNPVLA 240

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNR+G  +NEL+L+GWN+ +HA+ ++   LEAGPLAHSIVEVP S GYA +FR+GDAL
Sbjct: 241  IVLNRKGNTLNELVLLGWNLSEHAVDILSQYLEAGPLAHSIVEVPHSCGYALLFRVGDAL 300

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD   P CVYRT+L+F     EE   VEE C  H+ DD+GLFNVAACALL+L DYD
Sbjct: 301  LMDLRDARNPHCVYRTTLDFSVHTPEEHICVEELCTAHEFDDDGLFNVAACALLQLSDYD 360

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID +SG+ K   K+VCS+SWEP++D+ P+M+FC
Sbjct: 361  PMCIDGESGSGKTTCKHVCSFSWEPKSDRSPRMIFC 396


>KJB36184.1 hypothetical protein B456_006G145300 [Gossypium raimondii]
          Length = 1171

 Score =  526 bits (1356), Expect = e-174
 Identities = 256/396 (64%), Positives = 312/396 (78%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GS +LQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPASSSATVSSQGVNYLAKCVLRGSAILQVAYGHLRSPSSLDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGIV SVCEQ VFGTIKDLA++PWNEK   +N+Q+ GKDLLVV+SDS
Sbjct: 62   GKETSIELVIIGEDGIVTSVCEQTVFGTIKDLAILPWNEKVYGQNTQMPGKDLLVVISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFCNEMHRFFPV H+QLSDPGN+R Q+GR+LAVDS+G FIA SAYEDR      
Sbjct: 122  GKLSFLTFCNEMHRFFPVDHIQLSDPGNARDQIGRMLAVDSTGRFIATSAYEDRLAFFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DIIDKKI YPPE+EG   ++R+AQ+ ++ GT+WSMCF+SKDP Q +KEHNP+LA
Sbjct: 182  SMSGG-DIIDKKIFYPPENEGSGSSTRNAQRISVRGTIWSMCFVSKDPNQTNKEHNPVLA 240

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNR+G  +NEL+L+GWN+ +HA+ ++   LEAGPLAHSIVEVP S GYA +FR+GDAL
Sbjct: 241  IVLNRKGNTLNELVLLGWNLSEHAVYILSQYLEAGPLAHSIVEVPHSCGYALLFRVGDAL 300

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD   P CVYRT+L+F     EE   VEE C  H+ DD+GLFNVAACALL+L DYD
Sbjct: 301  LMDLRDARNPHCVYRTTLDFSVHTPEEHICVEELCPAHEFDDDGLFNVAACALLQLSDYD 360

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID +SG+ K   K+VCS+SWE ++++ P+++FC
Sbjct: 361  PMCIDGESGSGKTTCKHVCSFSWELKSNRSPRIIFC 396


>XP_016669400.1 PREDICTED: splicing factor 3B subunit 3-like [Gossypium hirsutum]
          Length = 1387

 Score =  531 bits (1368), Expect = e-173
 Identities = 257/396 (64%), Positives = 312/396 (78%)
 Frame = +2

Query: 119  AVSEEECSMAKXXXXXXXXXXXXXXXRNVHYLAKCVLKGSVVLQVAYGHLRSPTSNDVVF 298
            A+SEEECS AK               + V+YLAKCVL+GS +LQVAYGHLRSP+S DVVF
Sbjct: 2    ALSEEECSTAKASSSSPASSSATVSSQGVNYLAKCVLRGSAILQVAYGHLRSPSSLDVVF 61

Query: 299  GKETSIELVIIGEDGIVQSVCEQAVFGTIKDLAVVPWNEKFNTRNSQVMGKDLLVVVSDS 478
            GKETSIELVIIGEDGI  SVCEQ VFGTIKDLA++P NEK   +N+Q+ GKDLLV++SDS
Sbjct: 62   GKETSIELVIIGEDGIATSVCEQTVFGTIKDLAILPCNEKVYGQNTQMPGKDLLVIISDS 121

Query: 479  GKLSFLTFCNEMHRFFPVAHVQLSDPGNSRHQLGRLLAVDSSGCFIAVSAYEDRXXXXXX 658
            GKLSFLTFCNEMHRFFPV H+QLSDPGN+R Q+GR+LAVDS+G FIA SAYEDR      
Sbjct: 122  GKLSFLTFCNEMHRFFPVDHIQLSDPGNARDQIGRMLAVDSTGRFIATSAYEDRLAFFSL 181

Query: 659  XXXXXXDIIDKKICYPPESEGDTCASRSAQKTNISGTVWSMCFISKDPCQPSKEHNPILA 838
                  DI+DKKI YPPE+EG   ++R+AQ+T+I GT+WSMCF+SKDP Q +KEHNP+LA
Sbjct: 182  SMSGD-DIVDKKIFYPPENEGSGSSTRNAQRTSIRGTIWSMCFVSKDPIQTNKEHNPVLA 240

Query: 839  IILNRRGELVNELLLVGWNIRDHAISVILPSLEAGPLAHSIVEVPRSYGYAFVFRIGDAL 1018
            I+LNR+G  +NEL+L+GWN+ +HA+ ++   LEAGPLAHSIVEVP S GYA +FR+GDAL
Sbjct: 241  IVLNRKGNTLNELVLLGWNLSEHAVDILSQYLEAGPLAHSIVEVPHSCGYALLFRVGDAL 300

Query: 1019 LMDLRDPLYPSCVYRTSLNFLPSAMEEQNFVEESCRVHDVDDEGLFNVAACALLELGDYD 1198
            LMDLRD   P CVYRT+L+F     EE   VEE C  H+ DD+GLFNVAACALL+L DYD
Sbjct: 301  LMDLRDARNPHCVYRTTLDFSVHTPEEHICVEELCTAHEFDDDGLFNVAACALLQLSDYD 360

Query: 1199 PMFIDSDSGNAKQPSKYVCSWSWEPETDKIPKMVFC 1306
            PM ID +SG+ K   K+VCS+SWEP++D+ P+M+FC
Sbjct: 361  PMCIDGESGSGKTTCKHVCSFSWEPKSDRSPRMIFC 396

