BLASTX nr result

ID: Rehmannia31_contig00013005 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00013005
         (1245 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011075586.1| uncharacterized protein LOC105160027 [Sesamu...   329   e-106
gb|PIN05142.1| hypothetical protein CDL12_22318 [Handroanthus im...   322   e-103
gb|PIN11462.1| hypothetical protein CDL12_15930 [Handroanthus im...   312   2e-99
gb|PIN21942.1| hypothetical protein CDL12_05355 [Handroanthus im...   286   4e-89
ref|XP_011070952.1| uncharacterized protein LOC105156500 [Sesamu...   268   2e-82
ref|XP_012855239.1| PREDICTED: uncharacterized protein LOC105974...   250   1e-75
ref|XP_012847724.1| PREDICTED: uncharacterized protein LOC105967...   214   4e-62
emb|CDP00415.1| unnamed protein product [Coffea canephora]            213   8e-61
ref|XP_022878128.1| uncharacterized protein LOC111396080 isoform...   203   1e-57
ref|XP_022878129.1| uncharacterized protein LOC111396080 isoform...   202   2e-57
ref|XP_022726069.1| uncharacterized protein LOC111282306 [Durio ...   192   3e-52
ref|XP_022776770.1| uncharacterized protein LOC111318274 [Durio ...   190   2e-51
gb|OMO71971.1| hypothetical protein COLO4_27911 [Corchorus olito...   188   8e-51
gb|EOY01744.1| Uncharacterized protein TCM_011575 isoform 1 [The...   188   1e-50
ref|XP_017971049.1| PREDICTED: uncharacterized protein LOC186102...   187   2e-50
ref|XP_021299624.1| uncharacterized protein LOC110428202 [Herran...   187   3e-50
ref|XP_012478745.1| PREDICTED: uncharacterized protein LOC105794...   186   1e-49
gb|EOY01745.1| Uncharacterized protein TCM_011575 isoform 2 [The...   185   1e-49
ref|XP_010258118.1| PREDICTED: uncharacterized protein LOC104597...   184   2e-49
ref|XP_007045913.2| PREDICTED: uncharacterized protein LOC186102...   184   2e-49

>ref|XP_011075586.1| uncharacterized protein LOC105160027 [Sesamum indicum]
 ref|XP_011075587.1| uncharacterized protein LOC105160027 [Sesamum indicum]
          Length = 423

 Score =  329 bits (843), Expect = e-106
 Identities = 174/314 (55%), Positives = 210/314 (66%), Gaps = 1/314 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 482
            MGVKRPL +E++PELSFKQPKQ D+N K L F  +D  +      +DSPGE KSN CK  
Sbjct: 1    MGVKRPLEQEDLPELSFKQPKQLDNNRK-LTFTAEDFPTHRTTLEVDSPGEVKSNFCKIH 59

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             DGMLENG+T+GAS+A                             K LEA+ PLS VT+S
Sbjct: 60   SDGMLENGDTNGASLA----------------------------DKELEASAPLSLVTSS 91

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S EE+AGNED S L+ FPGY D  I PW+P +Q E+P+I +LN  PRKEVPVGPD+Q EV
Sbjct: 92   SSEEDAGNEDTSILYNFPGYIDFSI-PWRPPQQYEDPYISLLNSSPRKEVPVGPDYQAEV 150

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDV 1022
            P WDPS+  K    SNNF  ++ ++ L G C+IP PG+  SS+D   VGRGRTDCSC D+
Sbjct: 151  PEWDPSSSAKDSLGSNNFVDNE-EQRLMGACVIPMPGLNGSSVDGVTVGRGRTDCSCLDM 209

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1202
            GS+RCVQQHVKEARE LR  IGE  F  LGFY+MGEEVA  W+ EDE +F+ V+F+NPA 
Sbjct: 210  GSMRCVQQHVKEAREKLRETIGEEAFANLGFYDMGEEVAWRWSAEDEHIFHNVIFSNPAS 269

Query: 1203 SGRNFWKFLGFAFP 1244
             GRNFWK L   FP
Sbjct: 270  HGRNFWKHLSVMFP 283


>gb|PIN05142.1| hypothetical protein CDL12_22318 [Handroanthus impetiginosus]
          Length = 434

 Score =  322 bits (826), Expect = e-103
 Identities = 173/314 (55%), Positives = 208/314 (66%), Gaps = 1/314 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 482
            MG+KRPL EE+ PELSFKQPKQ DDN K L FMT+D+ S+   PR+DSPGEAK N CK  
Sbjct: 1    MGIKRPLVEEDFPELSFKQPKQFDDNIK-LTFMTEDIPSRGTTPRVDSPGEAKGNFCKLH 59

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             DGMLENG+  G+S+A  E EG+                             PLS VT S
Sbjct: 60   FDGMLENGDADGSSMADKEFEGSA----------------------------PLSLVTNS 91

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S  EE GNED SFLH FP Y D    PW+     E+P++ +LN  PRKEVPVGPD+Q ++
Sbjct: 92   S-SEEVGNEDTSFLHHFPEYIDFS-RPWRFPGHFEDPYVSLLNGSPRKEVPVGPDYQADL 149

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDV 1022
            P WDP + GK    SN F G+D ++ L GTCIIP P +  S+I ++ VGRGRTDCSC D+
Sbjct: 150  PQWDPHSNGKDLLGSNYFVGNDTEQRLMGTCIIPMPRLNDSNIHEATVGRGRTDCSCLDM 209

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1202
            GS+RCVQQHVKEARE LR  IGE  F +LGFY+MGEEVA  WT EDE +F+EVVF+NPA 
Sbjct: 210  GSMRCVQQHVKEAREKLRETIGEENFTKLGFYDMGEEVASKWTPEDEHIFHEVVFSNPAS 269

Query: 1203 SGRNFWKFLGFAFP 1244
             G +FW+ L  AFP
Sbjct: 270  RGTDFWEHLSAAFP 283


>gb|PIN11462.1| hypothetical protein CDL12_15930 [Handroanthus impetiginosus]
          Length = 434

 Score =  312 bits (800), Expect = 2e-99
 Identities = 171/314 (54%), Positives = 203/314 (64%), Gaps = 1/314 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 482
            MG+KRPL EE+ PELSFKQPKQ DDN K L FMT+D+ S+   PR+DSPGEAK N CK  
Sbjct: 1    MGIKRPLVEEDFPELSFKQPKQFDDNIK-LTFMTEDIPSRGTYPRVDSPGEAKGNFCKLH 59

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             DGMLENG+  G+S+A  E EG+                             PLS VT S
Sbjct: 60   FDGMLENGDADGSSMADKEFEGSA----------------------------PLSLVTNS 91

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S  EE GNED SFLH FP Y D    PW+     E+P++  LN   RKEVPVGPD+Q  +
Sbjct: 92   S-SEEVGNEDTSFLHHFPEYIDFS-RPWRFPGHFEDPYVSSLNGSLRKEVPVGPDYQANL 149

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDV 1022
            P WD  + GK    SN F G+D  + L GTCIIP P +  S+I ++ VGRGRTDCSC D+
Sbjct: 150  PQWDAHSNGKDLLGSNYFLGNDTDQRLMGTCIIPMPRLNDSNIHEATVGRGRTDCSCLDM 209

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1202
            GS+RCVQQHVKEARE LR  IGE  F +LGFY+MGEEVA  WT EDE +F+EVVF+NPA 
Sbjct: 210  GSMRCVQQHVKEAREKLRETIGEENFTKLGFYDMGEEVASKWTPEDEHIFHEVVFSNPAS 269

Query: 1203 SGRNFWKFLGFAFP 1244
             G +FW+ L  AFP
Sbjct: 270  RGTDFWEHLSAAFP 283


>gb|PIN21942.1| hypothetical protein CDL12_05355 [Handroanthus impetiginosus]
          Length = 434

 Score =  286 bits (731), Expect = 4e-89
 Identities = 164/314 (52%), Positives = 196/314 (62%), Gaps = 1/314 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 482
            MG+KRPL EE+ PE SF QPKQPD N K L   T++    TA   +DSPG AK   CK  
Sbjct: 1    MGIKRPLEEED-PEPSFHQPKQPDCNKK-LAICTEESYITTA--GVDSPGGAKGTSCKLQ 56

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             DG LE+G   GASI        GD+E                    LEA+ PLS VT+S
Sbjct: 57   FDGRLEDGERDGASI--------GDKE--------------------LEASAPLSLVTSS 88

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S EE+AGN D S    FP Y D  +P   P EQ E+P+IY+LN  PRKEVP+GPDHQ EV
Sbjct: 89   SSEEDAGNRDTSLRSYFPEYIDYSLPMPLP-EQFEDPYIYLLNSSPRKEVPIGPDHQAEV 147

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDV 1022
            P W+P+   + FS SN  + ++  + L GTCIIP P    S+ID    GRGRTDCSC DV
Sbjct: 148  PLWNPNVGWEDFSNSNFLADNNRDQKLLGTCIIPMPDFNDSNIDGVRAGRGRTDCSCLDV 207

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1202
            GS+RCVQQHVKE RE LR  +G+  FKELGF +MG+EVA  WT +DE++F EVV +NPA 
Sbjct: 208  GSMRCVQQHVKEVRERLRETVGDEKFKELGFCDMGDEVACKWTPDDEQIFLEVVLSNPAS 267

Query: 1203 SGRNFWKFLGFAFP 1244
             GR FWK L  AFP
Sbjct: 268  HGRKFWKHLREAFP 281


>ref|XP_011070952.1| uncharacterized protein LOC105156500 [Sesamum indicum]
          Length = 428

 Score =  268 bits (686), Expect = 2e-82
 Identities = 157/314 (50%), Positives = 188/314 (59%), Gaps = 1/314 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKP- 482
            MG+KRPL EE+ PE SFKQPKQ D N K L   T++  S      +DSPG  KS  CK  
Sbjct: 2    MGMKRPLEEEDFPEPSFKQPKQLDYNKK-LTLNTEE--SHLTTLTVDSPGRTKSIFCKSQ 58

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             DG LENG+ + AS+AG E E +                             PLS VT+S
Sbjct: 59   FDGRLENGDLYSASLAGKEFEPSA----------------------------PLSLVTSS 90

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S EE+  N D S    FP +TD G+P   P +Q E+P+I +LN  PRKEVP+GPDHQ  V
Sbjct: 91   SREEDVVNGDTSVWSNFPAFTDFGLPRRLP-QQFEDPYISLLNSSPRKEVPIGPDHQAGV 149

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDV 1022
            P WDP+A     S  NN      +E L GTCII  P    S+I +  VGRGRTDC C DV
Sbjct: 150  PLWDPNA-----SRDNNR-----EEELMGTCIISMPDANDSTIGEFRVGRGRTDCDCLDV 199

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1202
            GS+RCVQQHV EARE LR  IG+  F+ELGF NMG+EVA  WT E+E++F+EVVF+NP  
Sbjct: 200  GSMRCVQQHVTEAREKLRETIGDENFEELGFSNMGDEVACKWTPEEEQVFHEVVFSNPVS 259

Query: 1203 SGRNFWKFLGFAFP 1244
             GR FWK L  AFP
Sbjct: 260  HGRKFWKHLRVAFP 273


>ref|XP_012855239.1| PREDICTED: uncharacterized protein LOC105974668 [Erythranthe guttata]
 ref|XP_012855240.1| PREDICTED: uncharacterized protein LOC105974668 [Erythranthe guttata]
          Length = 399

 Score =  250 bits (638), Expect = 1e-75
 Identities = 147/315 (46%), Positives = 185/315 (58%), Gaps = 2/315 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 482
            MG+KRPL E + PE SF +P    D +K L+  T+D    T  PR DS G  KSN C+  
Sbjct: 1    MGIKRPLEEADFPEASFGKPI---DYNKKLISCTEDFHITT--PRFDSLGGPKSNICELQ 55

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             DG LE+  T+ AS A  E E                            A+ PLS VT+S
Sbjct: 56   FDGRLEDSETYSASAADKEFE----------------------------ASAPLSLVTSS 87

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S EE+AGN D SF   FPGY D+ IPP +P EQ ++P+I +LN  P+KEVP+GPD+Q EV
Sbjct: 88   S-EEDAGNGDTSFWSYFPGYIDISIPPRRPPEQFDDPYISLLNSSPKKEVPLGPDYQAEV 146

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTD-CSCPD 1019
            P W+ +          N+   + ++ L GTC+IP P +  S+ D   VG GRT  CSC D
Sbjct: 147  PLWEGA----------NYFTDEREQQLMGTCVIPMPDLNDSTSDGVRVGHGRTVVCSCLD 196

Query: 1020 VGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPA 1199
            VGS+RCVQQHVKEARE L   IGE  F +LGF +MG+EVA  WT  DE +F+E+V +NP 
Sbjct: 197  VGSMRCVQQHVKEAREKLLETIGENNFIDLGFCHMGDEVACKWTPADEHVFHEIVLSNPV 256

Query: 1200 YSGRNFWKFLGFAFP 1244
              GRNFWK L  AFP
Sbjct: 257  SHGRNFWKLLRSAFP 271


>ref|XP_012847724.1| PREDICTED: uncharacterized protein LOC105967657 [Erythranthe guttata]
 ref|XP_012847725.1| PREDICTED: uncharacterized protein LOC105967657 [Erythranthe guttata]
          Length = 381

 Score =  214 bits (545), Expect = 4e-62
 Identities = 138/315 (43%), Positives = 174/315 (55%), Gaps = 2/315 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MGVKRP  EEN+PELSF+Q +  D N+K L F  +D  S TA PR   PGE         
Sbjct: 1    MGVKRPFEEENLPELSFEQ-RIEDHNNKKLSFTPEDSPSTTA-PRFHYPGE--------- 49

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
                EN  T GA I         D+E   S                     PLS   ++ 
Sbjct: 50   ---FENCVTDGACIV--------DKESTPSA--------------------PLSLAASNG 78

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             +EE   E+  +    P   D   P   P  Q E+P+IY+LN  PRKE+P+GPDHQ +VP
Sbjct: 79   KQEEE-EEEEEYAGNIPDI-DFRTPSMPPPLQFEDPYIYLLNTPPRKEIPIGPDHQADVP 136

Query: 846  TWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHS-SIDQSMVGRGRTDCSCPDV 1022
             WDP A  K FS        + ++ L G+CI+  PG+  S S D    GRGRTDCSC DV
Sbjct: 137  EWDPFARRKDFS-------DEREQELMGSCIVRPPGLNRSGSTDPFAAGRGRTDCSCMDV 189

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVA-LNWTFEDERLFYEVVFANPA 1199
            GS+RCVQQHV EARE L+  +G+ +F +LGFY+MGEE A   WT ++E+LF+EVVF+NP 
Sbjct: 190  GSMRCVQQHVHEAREKLQETMGDEVFVKLGFYDMGEEAASRKWTPDEEQLFHEVVFSNP- 248

Query: 1200 YSGRNFWKFLGFAFP 1244
              G +FWK LG  FP
Sbjct: 249  --GGDFWKVLGSVFP 261


>emb|CDP00415.1| unnamed protein product [Coffea canephora]
          Length = 467

 Score =  213 bits (543), Expect = 8e-61
 Identities = 130/316 (41%), Positives = 172/316 (54%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLF---MTQDVTSQTAVPRIDSPGEAKSNDC 476
            MGVKRP  EE+    S KQ KQ + ++K   F    + D  SQ +  R          D 
Sbjct: 1    MGVKRPFDEEDFQVSSVKQAKQLEFDNKQTSFSKAFSSDDVSQNSGSR---------GDF 51

Query: 477  KPCDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVT 656
              C    E GN    S + +                           K LE + PLS+VT
Sbjct: 52   DKCQLFKELGNEDSRSASSSA-------------------------EKELETSAPLSWVT 86

Query: 657  TSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQC 836
            +SS EE+AG+ +  ++ LFP Y +   P  +    LE+ +  ++N  PRK++P+GP+HQ 
Sbjct: 87   SSSGEEDAGSGEPFYVSLFPEYFEFNFPR-RTVVHLEDSYSSLINSSPRKQIPIGPNHQA 145

Query: 837  EVPTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            E+P WDP AV       NN    D +E + GTCII      +SS D+  +G+GR DC C 
Sbjct: 146  EIPPWDPQAVETDPLTPNNCVRDDNEEAV-GTCIISASLSSYSSRDEVKIGQGRKDCVCL 204

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GSVRCV+QH+KEARE LR  IG+  F ELGFY+MGEEVA  WT E+ER+F+EVV+ NP
Sbjct: 205  DRGSVRCVRQHIKEAREKLREVIGDEKFLELGFYDMGEEVAAKWTEEEERVFHEVVYYNP 264

Query: 1197 AYSGRNFWKFLGFAFP 1244
               G+NFWK L  AFP
Sbjct: 265  VSLGKNFWKQLAVAFP 280


>ref|XP_022878128.1| uncharacterized protein LOC111396080 isoform X1 [Olea europaea var.
            sylvestris]
          Length = 396

 Score =  203 bits (516), Expect = 1e-57
 Identities = 126/314 (40%), Positives = 168/314 (53%)
 Frame = +3

Query: 303  RMGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKP 482
            +MG KRP  E++ PELSFK  KQ   N+ +L    +D +S     + D+ G A SN C  
Sbjct: 7    KMGTKRPFQEDDFPELSFKHAKQLGFNN-NLNSYFEDFSSCRTSAKSDTVGVAASNFCN- 64

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
                 +    HG S   +          NI E             K LE    LS     
Sbjct: 65   ----YKFDERHGNSDISSY--------SNIDE-------------KELEMGTALS---NG 96

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S +E++  ED      F GY D  +P   P  Q  +P+ ++LN  PRKE+PVG DHQ EV
Sbjct: 97   SIDEDSAFEDRFCWSHFQGYFDFSVPRRLP-VQFRDPYSFLLNCSPRKEIPVGRDHQAEV 155

Query: 843  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDV 1022
            P WDP+A  K  S  + F   + +E L GTCIIP P +    I +     G+ DC+C D 
Sbjct: 156  PPWDPNATWKDTSSPSYFVDDESEEKLMGTCIIPMPNLKFDGIKEV---HGQADCNCLDR 212

Query: 1023 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1202
            GSV+CVQQH KEARE LR  IG+  F ELGF +MGEEVA  WT E++++F++V+++NP  
Sbjct: 213  GSVKCVQQHAKEAREKLRETIGDEKFAELGFLDMGEEVASKWTEEEQQIFHQVIYSNPLL 272

Query: 1203 SGRNFWKFLGFAFP 1244
             G+NFW+ L   FP
Sbjct: 273  LGKNFWRHLSAVFP 286


>ref|XP_022878129.1| uncharacterized protein LOC111396080 isoform X2 [Olea europaea var.
            sylvestris]
          Length = 389

 Score =  202 bits (514), Expect = 2e-57
 Identities = 126/313 (40%), Positives = 167/313 (53%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  E++ PELSFK  KQ   N+ +L    +D +S     + D+ G A SN C   
Sbjct: 1    MGTKRPFQEDDFPELSFKHAKQLGFNN-NLNSYFEDFSSCRTSAKSDTVGVAASNFCN-- 57

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
                +    HG S   +          NI E             K LE    LS     S
Sbjct: 58   ---YKFDERHGNSDISSY--------SNIDE-------------KELEMGTALS---NGS 90

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             +E++  ED      F GY D  +P   P  Q  +P+ ++LN  PRKE+PVG DHQ EVP
Sbjct: 91   IDEDSAFEDRFCWSHFQGYFDFSVPRRLP-VQFRDPYSFLLNCSPRKEIPVGRDHQAEVP 149

Query: 846  TWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPDVG 1025
             WDP+A  K  S  + F   + +E L GTCIIP P +    I +     G+ DC+C D G
Sbjct: 150  PWDPNATWKDTSSPSYFVDDESEEKLMGTCIIPMPNLKFDGIKEV---HGQADCNCLDRG 206

Query: 1026 SVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAYS 1205
            SV+CVQQH KEARE LR  IG+  F ELGF +MGEEVA  WT E++++F++V+++NP   
Sbjct: 207  SVKCVQQHAKEAREKLRETIGDEKFAELGFLDMGEEVASKWTEEEQQIFHQVIYSNPLLL 266

Query: 1206 GRNFWKFLGFAFP 1244
            G+NFW+ L   FP
Sbjct: 267  GKNFWRHLSAVFP 279


>ref|XP_022726069.1| uncharacterized protein LOC111282306 [Durio zibethinus]
          Length = 537

 Score =  192 bits (489), Expect = 3e-52
 Identities = 122/316 (38%), Positives = 168/316 (53%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E + EL FK P+Q D ++K  LF+     S T                KP 
Sbjct: 1    MGFKRPFDDEELQELPFKNPRQFDYSNKLTLFVDTIHCSNTPQ--------------KPH 46

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
               +ENG     S    E  G  + + N+ +             K  E + PLS VT++S
Sbjct: 47   ISEVENGFGKYQSNEAFE-RGALNNDANLVD-------------KDFETSAPLSLVTSTS 92

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             EE+ GN  A+  H+   Y D   P  +    +E+ +  +L+  PR++VP+GP+HQ  VP
Sbjct: 93   SEEDTGNGAAAISHVSSEYFDFFFPR-RTFASVEDAYSLLLDRSPRRQVPLGPNHQANVP 151

Query: 846  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            +W  S + K     N+ S +   D +E L GTC+IP P    S+ +   VG GR DCSC 
Sbjct: 152  SWG-SHIKKDKLAQNDASDTTDNDNEEILMGTCVIPMPDSDLSANNSGKVGAGRADCSCL 210

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GS RCV+QHV EARE LR  +G   F +LGFY+MG++VA  W+ EDE +F EVV++NP
Sbjct: 211  DRGSFRCVRQHVTEAREKLRKSLGHEKFMKLGFYDMGDDVAYKWSEEDEDIFNEVVYSNP 270

Query: 1197 AYSGRNFWKFLGFAFP 1244
            A  G+ FWK L   FP
Sbjct: 271  ASLGKKFWKHLSVVFP 286


>ref|XP_022776770.1| uncharacterized protein LOC111318274 [Durio zibethinus]
 ref|XP_022776771.1| uncharacterized protein LOC111318274 [Durio zibethinus]
          Length = 536

 Score =  190 bits (483), Expect = 2e-51
 Identities = 118/315 (37%), Positives = 161/315 (51%), Gaps = 2/315 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP   E + EL FK P+Q D ++K   F                       D  PC
Sbjct: 1    MGYKRPFDNEELQELPFKNPRQFDYSNKLTQFA----------------------DTIPC 38

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
                +  N     + G   +   D       L+           K  E+  PLS VT +S
Sbjct: 39   SNTPQKPNI-SVVVEGGFCKYQWDEAFESYALN----DVTHLVDKDFESGAPLSLVTRTS 93

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             EE+ G E A+ L + P Y D   P  +    +E+ +  +++  PRK+VP+GP+HQ  VP
Sbjct: 94   SEEDRGTEAAAILPVSPEYFDFDFPR-RMFTPVEDAYYLLVDRSPRKQVPLGPNHQAYVP 152

Query: 846  TWDPSAVGKYFSVSNNF--SGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCPD 1019
            +WD       F+ ++    + +D +E + GTC+IP P    S+ +   VG GRTDCSC D
Sbjct: 153  SWDRHIKKDKFAQNDTLDTTDNDNEEIVMGTCVIPMPDSDLSTNNSGKVGAGRTDCSCLD 212

Query: 1020 VGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPA 1199
             GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV+ANP 
Sbjct: 213  RGSLRCVQQHVMEAREKLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYANPL 272

Query: 1200 YSGRNFWKFLGFAFP 1244
              G+ FWK L   FP
Sbjct: 273  SLGKKFWKHLTVVFP 287


>gb|OMO71971.1| hypothetical protein COLO4_27911 [Corchorus olitorius]
          Length = 505

 Score =  188 bits (477), Expect = 8e-51
 Identities = 120/319 (37%), Positives = 173/319 (54%), Gaps = 6/319 (1%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E + ELSFK P+  D ++K    +TQ V +   +PR ++P +   +     
Sbjct: 1    MGFKRPFDDEELQELSFKNPRHFDYSNK----LTQFVDT---LPRSNTPQKPHLSV---- 49

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
               L++G        G E +   D    +               K  E + PLS VT++S
Sbjct: 50   --ELDDGFRKYQWDKGFETDDLNDVTHPVE--------------KDFETSAPLSLVTSAS 93

Query: 666  HEEEAGNEDASFLH-LFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
             EE+ G   A+ +  + P Y D   P  +P   +E+ +  +L+  PR+++P+GP+HQ  +
Sbjct: 94   SEEDTGTGAAAAVSPVSPEYADFDYPR-RPFTPVEDSYSLLLDRSPRRQIPLGPNHQAYI 152

Query: 843  PTWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYH--SSIDQSMVGRGRTDC 1007
            P+         F+  NN S +   D++  L GTC+IP P      S+I+   VG GRTDC
Sbjct: 153  PSLGRHLKKDMFAHGNNVSDTTDNDYEVKLMGTCVIPMPDSDSDLSAINSGKVGAGRTDC 212

Query: 1008 SCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVF 1187
            SC D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV+
Sbjct: 213  SCLDRGSLRCVQQHVMEAREKLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVY 272

Query: 1188 ANPAYSGRNFWKFLGFAFP 1244
            +NPA  G+ FWK L   FP
Sbjct: 273  SNPASLGKKFWKNLSVVFP 291


>gb|EOY01744.1| Uncharacterized protein TCM_011575 isoform 1 [Theobroma cacao]
          Length = 526

 Score =  188 bits (477), Expect = 1e-50
 Identities = 122/316 (38%), Positives = 168/316 (53%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E + EL FK  +Q D ++K    MTQ   +    PR ++P        KP 
Sbjct: 1    MGFKRPFDDEELQELPFKNLRQFDYSNK----MTQFADT---FPRSNTPQ-------KPH 46

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
               +E+G          E +   D    +               K  E + PLS VT+ S
Sbjct: 47   ISEVEDGFRKYQWDEVFETDALNDVTHFVD--------------KDFETSAPLSLVTSPS 92

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             EE+ G   A+ L + P Y D  +P  +    +E+ +   L+  PR++V +GP+HQ  VP
Sbjct: 93   SEEDTGTGAAAILPVSPEYFDFDLPR-RTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVP 151

Query: 846  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            +W    V KY    ++ S S   D +E + GTC+IP P  Y S+ +   VG GRTDCSC 
Sbjct: 152  SWGRH-VKKYEFAQSDASDSTDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCL 210

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV++NP
Sbjct: 211  DRGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNP 270

Query: 1197 AYSGRNFWKFLGFAFP 1244
            +  G+ FWK L   FP
Sbjct: 271  SSLGKKFWKDLSVVFP 286


>ref|XP_017971049.1| PREDICTED: uncharacterized protein LOC18610283 isoform X2 [Theobroma
            cacao]
          Length = 523

 Score =  187 bits (475), Expect = 2e-50
 Identities = 122/316 (38%), Positives = 168/316 (53%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E + EL FK  +Q D ++K    MTQ   +    PR ++P        KP 
Sbjct: 1    MGFKRPFDDEELQELPFKNLRQFDYSNK----MTQFADT---FPRSNTPQ-------KPH 46

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
               +E+G          E +   D    I               K  E + PLS VT+ S
Sbjct: 47   ISEVEDGFRKYQWDEVFETDALNDVTHFID--------------KDFETSAPLSLVTSPS 92

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             EE+ G   A+ L + P Y D  +P  +    +E+ +   L+  PR++V +GP+HQ  VP
Sbjct: 93   SEEDTGTGAAAILPVSPEYFDFDLPR-RTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVP 151

Query: 846  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            +W    V KY    ++ S S   D ++ + GTC+IP P  Y S+ +   VG GRTDCSC 
Sbjct: 152  SWGRH-VKKYEFAQSDASDSTDNDKEDMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCL 210

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV++NP
Sbjct: 211  DRGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNP 270

Query: 1197 AYSGRNFWKFLGFAFP 1244
            +  G+ FWK L   FP
Sbjct: 271  SSLGKKFWKDLSVVFP 286


>ref|XP_021299624.1| uncharacterized protein LOC110428202 [Herrania umbratica]
          Length = 521

 Score =  187 bits (474), Expect = 3e-50
 Identities = 118/317 (37%), Positives = 166/317 (52%), Gaps = 4/317 (1%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E +  L FK P+Q D ++K   F           PR  +P        KP 
Sbjct: 1    MGFKRPFDDEELQGLPFKNPRQFDYSNKLTQFAD-------TFPRSSTPQ-------KPH 46

Query: 486  DGM-LENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
              + +E+G          E +   D  + +               K  E + PLS VT+ 
Sbjct: 47   ISVEVEDGFRKYQRDEAFETDALNDVTDLVD--------------KDFETSAPLSLVTSP 92

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S EE+ G   A+ L + P Y D  +P  +    +E+ +  +L+  PR++VP+GP+HQ  V
Sbjct: 93   SSEEDTGTGAAAILPVSPEYFDFDLPR-RTFALVEDAYSLLLDRSPRRQVPLGPNHQANV 151

Query: 843  PTWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSC 1013
            P+W    + KY    ++ S S   D +E + G C+IP P  Y S+ +   VG GRTDCSC
Sbjct: 152  PSWGRH-IKKYEFAQSDASDSIDNDKEEMMMGACVIPMPESYLSANNGGKVGAGRTDCSC 210

Query: 1014 PDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFAN 1193
             D GS+RCVQQHV EARE LR  +G   F  LGFY+MGE+VA  W+ EDE +F EVV++N
Sbjct: 211  LDRGSLRCVQQHVMEARERLRKSLGHEKFVRLGFYDMGEDVAYKWSEEDEEIFREVVYSN 270

Query: 1194 PAYSGRNFWKFLGFAFP 1244
            P+  G+ FW+ L   FP
Sbjct: 271  PSSLGKKFWRDLSVVFP 287


>ref|XP_012478745.1| PREDICTED: uncharacterized protein LOC105794229 [Gossypium raimondii]
 ref|XP_012478746.1| PREDICTED: uncharacterized protein LOC105794229 [Gossypium raimondii]
 ref|XP_012478747.1| PREDICTED: uncharacterized protein LOC105794229 [Gossypium raimondii]
 gb|KJB30449.1| hypothetical protein B456_005G144600 [Gossypium raimondii]
 gb|KJB30450.1| hypothetical protein B456_005G144600 [Gossypium raimondii]
          Length = 542

 Score =  186 bits (471), Expect = 1e-49
 Identities = 116/316 (36%), Positives = 161/316 (50%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAV-PRIDSPGEAKSNDCKP 482
            MG KRP   E + EL FK P+Q D+N+K   F      S T   P I    E     C+ 
Sbjct: 1    MGFKRPFDSEELQELPFKHPRQFDNNNKLTQFANTISHSYTHQNPHISVDVEGGFCKCQ- 59

Query: 483  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTS 662
             D   E G             G  D   ++               K  E + PLS +T+ 
Sbjct: 60   WDEAFETG-------------GLNDERPSVD--------------KDFETSAPLSLITSI 92

Query: 663  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 842
            S EE+     A+   + P Y D   P  +    +E+ +  +L+  PRK+VP+GP+HQ  V
Sbjct: 93   SSEEDVDTGPAAISPISPEYFDFDFPR-RTLGPVEDAYSLLLDRSPRKQVPLGPNHQANV 151

Query: 843  PTWDPSAVGKYF--SVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            P+         F  + +++ +   ++E + GTC+IP P    S+ D   VG GRTDCSC 
Sbjct: 152  PSLGRHIKKDKFVQNCASDTNDIGYEEIMMGTCVIPMPDSDLSANDSGKVGAGRTDCSCL 211

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GS+RCV+QHV EARE LR  +G   F +LGFY+MGE+VA  W+ E+E +F EVV++NP
Sbjct: 212  DGGSLRCVRQHVMEAREKLRKSLGHEKFVKLGFYDMGEDVAYKWSEEEEEIFREVVYSNP 271

Query: 1197 AYSGRNFWKFLGFAFP 1244
            A  G+NFWK     FP
Sbjct: 272  ASLGKNFWKHFSMVFP 287


>gb|EOY01745.1| Uncharacterized protein TCM_011575 isoform 2 [Theobroma cacao]
          Length = 527

 Score =  185 bits (470), Expect = 1e-49
 Identities = 120/316 (37%), Positives = 168/316 (53%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E + EL FK  +Q D ++K    MTQ   +    PR ++P +   +     
Sbjct: 1    MGFKRPFDDEELQELPFKNLRQFDYSNK----MTQFADT---FPRSNTPQKPHIS----- 48

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
               +E+G          E +   D    +               K  E + PLS VT+ S
Sbjct: 49   -AEVEDGFRKYQWDEVFETDALNDVTHFVD--------------KDFETSAPLSLVTSPS 93

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             EE+ G   A+ L + P Y D  +P  +    +E+ +   L+  PR++V +GP+HQ  VP
Sbjct: 94   SEEDTGTGAAAILPVSPEYFDFDLPR-RTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVP 152

Query: 846  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            +W    V KY    ++ S S   D +E + GTC+IP P  Y S+ +   VG GRTDCSC 
Sbjct: 153  SWGRH-VKKYEFAQSDASDSTDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCL 211

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV++NP
Sbjct: 212  DRGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNP 271

Query: 1197 AYSGRNFWKFLGFAFP 1244
            +  G+ FWK L   FP
Sbjct: 272  SSLGKKFWKDLSVVFP 287


>ref|XP_010258118.1| PREDICTED: uncharacterized protein LOC104597984 [Nelumbo nucifera]
          Length = 516

 Score =  184 bits (468), Expect = 2e-49
 Identities = 126/321 (39%), Positives = 164/321 (51%), Gaps = 8/321 (2%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPR--IDSPGEAKSNDCK 479
            M  KRP G+E   EL+ K P+Q + +++   F   D+TS   +P+  +   GE+   DC 
Sbjct: 1    MVYKRPFGDEESCELACKHPRQLEYSNQLASFA--DITSYNDMPQNPLSLVGES---DCS 55

Query: 480  P--CDGMLENGNTHGASI-AGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSF 650
               CD  LE+G     SI AG +LE T      IS LS                     +
Sbjct: 56   KGQCDERLESGTITELSIGAGKDLEITAP--VGISSLS---------------------W 92

Query: 651  VTTSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDH 830
             T+S+ EE++ +E    +  FPGY +   P      Q E  +   L++ PRK V VGPDH
Sbjct: 93   ATSSTSEEDSRSEATDRVPFFPGYYEPDYPA-TVLAQSEEIYSSPLDYHPRKLVAVGPDH 151

Query: 831  QCEVPTW---DPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQSMVGRGRT 1001
            Q  VP W   D    G    V          + L GTCIIP P +  S       G GRT
Sbjct: 152  QANVPAWGFQDTHCFGAEVMVPETTD-----DKLMGTCIIPMPDLEQSVYSSDNFGCGRT 206

Query: 1002 DCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEV 1181
             CSCPD GS+RCV+QH+ E RE LR  +G+  F ELGF +MGEEVA NW  E+E+ F+EV
Sbjct: 207  ICSCPDGGSIRCVKQHIMETREKLRETLGQEKFAELGFCDMGEEVARNWNEEEEQSFHEV 266

Query: 1182 VFANPAYSGRNFWKFLGFAFP 1244
            VF+NPA  G+NFW  L   FP
Sbjct: 267  VFSNPASLGKNFWDHLSVVFP 287


>ref|XP_007045913.2| PREDICTED: uncharacterized protein LOC18610283 isoform X1 [Theobroma
            cacao]
          Length = 524

 Score =  184 bits (468), Expect = 2e-49
 Identities = 120/316 (37%), Positives = 168/316 (53%), Gaps = 3/316 (0%)
 Frame = +3

Query: 306  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 485
            MG KRP  +E + EL FK  +Q D ++K    MTQ   +    PR ++P +   +     
Sbjct: 1    MGFKRPFDDEELQELPFKNLRQFDYSNK----MTQFADT---FPRSNTPQKPHIS----- 48

Query: 486  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEATLPLSFVTTSS 665
               +E+G          E +   D    I               K  E + PLS VT+ S
Sbjct: 49   -AEVEDGFRKYQWDEVFETDALNDVTHFID--------------KDFETSAPLSLVTSPS 93

Query: 666  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 845
             EE+ G   A+ L + P Y D  +P  +    +E+ +   L+  PR++V +GP+HQ  VP
Sbjct: 94   SEEDTGTGAAAILPVSPEYFDFDLPR-RTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVP 152

Query: 846  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQSMVGRGRTDCSCP 1016
            +W    V KY    ++ S S   D ++ + GTC+IP P  Y S+ +   VG GRTDCSC 
Sbjct: 153  SWGRH-VKKYEFAQSDASDSTDNDKEDMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCL 211

Query: 1017 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1196
            D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV++NP
Sbjct: 212  DRGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNP 271

Query: 1197 AYSGRNFWKFLGFAFP 1244
            +  G+ FWK L   FP
Sbjct: 272  SSLGKKFWKDLSVVFP 287


Top