BLASTX nr result

ID: Phellodendron21_contig00028192

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00028192
         (811 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KDO79290.1 hypothetical protein CISIN_1g000724mg [Citrus sinensis]    325   4e-99
XP_006425854.1 hypothetical protein CICLE_v10024721mg [Citrus cl...   324   9e-99
KDO79291.1 hypothetical protein CISIN_1g000724mg [Citrus sinensis]    300   3e-90
XP_017621696.1 PREDICTED: uncharacterized protein LOC108465829 [...   253   2e-76
XP_012079205.1 PREDICTED: uncharacterized protein LOC105639683 [...   259   2e-75
XP_016695497.1 PREDICTED: uncharacterized protein LOC107911985 [...   256   1e-74
XP_012489170.1 PREDICTED: uncharacterized protein LOC105802214 [...   256   1e-74
XP_016733314.1 PREDICTED: uncharacterized protein LOC107944009 [...   252   6e-73
XP_010094386.1 hypothetical protein L484_008274 [Morus notabilis...   243   1e-69
XP_007204681.1 hypothetical protein PRUPE_ppa000297mg [Prunus pe...   239   1e-68
XP_018810406.1 PREDICTED: uncharacterized protein LOC108983280 [...   239   2e-68
XP_010647355.1 PREDICTED: uncharacterized protein LOC100853492 [...   232   5e-66
EOX91359.1 Uncharacterized protein TCM_000577 isoform 1 [Theobro...   231   9e-66
EOX91360.1 O-Glycosyl hydrolases family 17 protein, putative iso...   231   9e-66
OAY35420.1 hypothetical protein MANES_12G100500 [Manihot esculenta]   231   2e-65
OAY35419.1 hypothetical protein MANES_12G100500 [Manihot esculenta]   231   2e-65
OMO84185.1 hypothetical protein COLO4_22183 [Corchorus olitorius]     228   2e-64
XP_017983519.1 PREDICTED: uncharacterized protein LOC18611094 is...   227   4e-64
XP_017983515.1 PREDICTED: uncharacterized protein LOC18611094 is...   227   4e-64
XP_007047203.2 PREDICTED: uncharacterized protein LOC18611094 is...   227   4e-64

>KDO79290.1 hypothetical protein CISIN_1g000724mg [Citrus sinensis]
          Length = 1329

 Score =  325 bits (833), Expect = 4e-99
 Identities = 189/319 (59%), Positives = 203/319 (63%), Gaps = 62/319 (19%)
 Frame = -1

Query: 772 YNHRCGLFRGLFHPAKAFNFILVLSCTFFYLATCEPCSIN-------------------- 653
           + +RCGLF+G F        I+VLSCTFFYLATCEPCSIN                    
Sbjct: 17  FYYRCGLFKGFF--------IVVLSCTFFYLATCEPCSINGMQKSVEYKGCGSYGDNQQV 68

Query: 652 ------GMQKSAEY------------NVCG------------------------ALEVSK 599
                 G   S+ Y            NVC                         +LE S 
Sbjct: 69  GFQDIIGDDTSSGYIERSSMTHPKSGNVCSDLNVFCFPSTLPGFLLKEHKLKTDSLETSN 128

Query: 598 LQSGSPLSIGTNQANSGASNRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGN 419
           LQSGSPLSIGTNQ NSG SNRTWLS S  FKLLNGRTI              SIGSD+  
Sbjct: 129 LQSGSPLSIGTNQPNSGPSNRTWLSQSCRFKLLNGRTISCYLSSKETSGELSSIGSDIDK 188

Query: 418 QNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASV 239
           QNGFSS   TLLNQKSKNVSLK +S   K G FD+SS PKVEI+PP LDWGQKYLF  S+
Sbjct: 189 QNGFSSFRRTLLNQKSKNVSLKNSSNLIKPGTFDVSS-PKVEISPPVLDWGQKYLFFPSL 247

Query: 238 AFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHL 59
           AFLTVANSFSDSIL IYEP++TSSQFYPCNSSEILLGPGEVASICFVFLP WLGLSTA L
Sbjct: 248 AFLTVANSFSDSILRIYEPFTTSSQFYPCNSSEILLGPGEVASICFVFLPTWLGLSTARL 307

Query: 58  ILQTSSGGFLVPTKGFGVE 2
           ILQTSSGGFLVPT+GFGVE
Sbjct: 308 ILQTSSGGFLVPTRGFGVE 326


>XP_006425854.1 hypothetical protein CICLE_v10024721mg [Citrus clementina]
           XP_006466635.1 PREDICTED: uncharacterized protein
           LOC102630085 [Citrus sinensis] ESR39094.1 hypothetical
           protein CICLE_v10024721mg [Citrus clementina]
          Length = 1329

 Score =  324 bits (830), Expect = 9e-99
 Identities = 189/316 (59%), Positives = 201/316 (63%), Gaps = 62/316 (19%)
 Frame = -1

Query: 763 RCGLFRGLFHPAKAFNFILVLSCTFFYLATCEPCSIN----------------------- 653
           RCGLF+G F        I+VLSCTFFYLATCEPCSIN                       
Sbjct: 20  RCGLFKGFF--------IVVLSCTFFYLATCEPCSINGMQKSVEYKGCGSYGDNQQVGFQ 71

Query: 652 ---GMQKSAEY------------NVCG------------------------ALEVSKLQS 590
              G   S+ Y            NVC                         +LE S LQS
Sbjct: 72  DIIGDDTSSGYIERSSMTHPKSGNVCSDLNVFCFPSTLPGFLLKEHKLKTDSLETSNLQS 131

Query: 589 GSPLSIGTNQANSGASNRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNG 410
           GSPLSIGTNQ NSG SNRTWLS S  FKLLNGRTI              SIGSD+  QNG
Sbjct: 132 GSPLSIGTNQPNSGPSNRTWLSQSCRFKLLNGRTISCYLSSKETSGELSSIGSDIDKQNG 191

Query: 409 FSSCSPTLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFL 230
           FSS   TLLNQKSKNVSLK +S   K G FD+SS PKVEI+PP LDWGQKYLF  S+AFL
Sbjct: 192 FSSFRRTLLNQKSKNVSLKNSSNLIKPGTFDVSS-PKVEISPPVLDWGQKYLFFPSLAFL 250

Query: 229 TVANSFSDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQ 50
           TVANSFSDSIL IYEP++TSSQFYPCNSSEILLGPGEVASICFVFLP WLGLSTA LILQ
Sbjct: 251 TVANSFSDSILRIYEPFTTSSQFYPCNSSEILLGPGEVASICFVFLPTWLGLSTARLILQ 310

Query: 49  TSSGGFLVPTKGFGVE 2
           TSSGGFLVPT+GFGVE
Sbjct: 311 TSSGGFLVPTRGFGVE 326


>KDO79291.1 hypothetical protein CISIN_1g000724mg [Citrus sinensis]
          Length = 1242

 Score =  300 bits (767), Expect = 3e-90
 Identities = 160/224 (71%), Positives = 171/224 (76%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLSIGTNQANSGASNRTWLSHSGMFKLLNG 494
           C P ++ G           +LE S LQSGSPLSIGTNQ NSG SNRTWLS S  FKLLNG
Sbjct: 17  CFPSTLPGFLLKEHKLKTDSLETSNLQSGSPLSIGTNQPNSGPSNRTWLSQSCRFKLLNG 76

Query: 493 RTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFDI 314
           RTI              SIGSD+  QNGFSS   TLLNQKSKNVSLK +S   K G FD+
Sbjct: 77  RTISCYLSSKETSGELSSIGSDIDKQNGFSSFRRTLLNQKSKNVSLKNSSNLIKPGTFDV 136

Query: 313 SSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEIL 134
           SS PKVEI+PP LDWGQKYLF  S+AFLTVANSFSDSIL IYEP++TSSQFYPCNSSEIL
Sbjct: 137 SS-PKVEISPPVLDWGQKYLFFPSLAFLTVANSFSDSILRIYEPFTTSSQFYPCNSSEIL 195

Query: 133 LGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LGPGEVASICFVFLP WLGLSTA LILQTSSGGFLVPT+GFGVE
Sbjct: 196 LGPGEVASICFVFLPTWLGLSTARLILQTSSGGFLVPTRGFGVE 239


>XP_017621696.1 PREDICTED: uncharacterized protein LOC108465829 [Gossypium
           arboreum]
          Length = 649

 Score =  253 bits (647), Expect = 2e-76
 Identities = 146/312 (46%), Positives = 178/312 (57%), Gaps = 63/312 (20%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVC----------------- 620
           RG+  P KAF F LVLSCT F L TCEPC+++GM K+ EY  C                 
Sbjct: 23  RGMLQPVKAFQFFLVLSCTLFCLITCEPCAVSGMPKTDEYEGCEYYGDAHHVGFQETIID 82

Query: 619 ------------GALEVSKLQSGS----------------------PLSIGTNQANSGAS 542
                         L V ++ S S                       L +  +Q++S +S
Sbjct: 83  STHSQSDMGTFTTRLSVERVCSDSHSFCFPSTLPGFLTEESTLEVGGLEVSRSQSDSASS 142

Query: 541 ------------NRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSC 398
                       N +WLS   MFKLLNGRT+              SI +D  NQN   SC
Sbjct: 143 FAEQSNLRVQASNSSWLSDHSMFKLLNGRTVSCSVYSKAGIHEFSSINTDGANQNDI-SC 201

Query: 397 SPTLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVAN 218
              LL+QKS +V ++KN E +KL  FD  SSP VEI PP +DWG KYLF  SVA+LTVAN
Sbjct: 202 KGPLLSQKSTSVRMEKNKEVTKLSSFDGLSSPNVEINPPIMDWGHKYLFLPSVAYLTVAN 261

Query: 217 SFSDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSG 38
           + +DSILHI+EP+ST+ QFYPCN SE+LLGPGEVASICFVFLPRW+GLS+AHL+LQTSSG
Sbjct: 262 TCNDSILHIHEPFSTNIQFYPCNFSEVLLGPGEVASICFVFLPRWVGLSSAHLVLQTSSG 321

Query: 37  GFLVPTKGFGVE 2
           G LV  +GF VE
Sbjct: 322 GLLVQARGFAVE 333


>XP_012079205.1 PREDICTED: uncharacterized protein LOC105639683 [Jatropha curcas]
           KDP31904.1 hypothetical protein JCGZ_12365 [Jatropha
           curcas]
          Length = 1322

 Score =  259 bits (661), Expect = 2e-75
 Identities = 143/310 (46%), Positives = 176/310 (56%), Gaps = 61/310 (19%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVCGAL-------------- 611
           RGLFH  KAF+F LVLSCT F LATC PC I+GMQK  EY+ CG+               
Sbjct: 29  RGLFHQVKAFHFFLVLSCTLFCLATCGPCLIHGMQKPKEYDGCGSYGDNPAVGFQDINVP 88

Query: 610 EVSKLQSGSPLS-----------------------------------------------I 572
           + S   SGS ++                                               +
Sbjct: 89  DASSYDSGSTVTRISVNSICTDSHSFCFPSTLPGLSSKEYKQKSDALEVSRSQSDSLSSV 148

Query: 571 GTNQANSGASNRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSP 392
           G  Q + GASN++WLS SG+F+LLNG+ I               +     NQN  S+C  
Sbjct: 149 GLTQGSKGASNKSWLSDSGIFELLNGQAITCSLNSMEGVDRLSFMQMGSANQNDLSACGG 208

Query: 391 TLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVANSF 212
           +LL +KS +  L  NSE +K   FD  SSP V+I+PP LDWG K+L+  SVAFLTVAN+ 
Sbjct: 209 SLLIKKSTSCRLNMNSEMTKSSPFDACSSPHVQISPPVLDWGHKHLYVPSVAFLTVANTC 268

Query: 211 SDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGF 32
           +DSILH+YEP+ST+ QFYPCN SE  LGPGE+AS+CFVFLPR+LG S AHLILQTSSGGF
Sbjct: 269 NDSILHVYEPFSTNIQFYPCNFSEFFLGPGEIASLCFVFLPRFLGFSAAHLILQTSSGGF 328

Query: 31  LVPTKGFGVE 2
           LV  KG+ VE
Sbjct: 329 LVQVKGYAVE 338


>XP_016695497.1 PREDICTED: uncharacterized protein LOC107911985 [Gossypium
           hirsutum]
          Length = 1337

 Score =  256 bits (655), Expect = 1e-74
 Identities = 149/312 (47%), Positives = 177/312 (56%), Gaps = 63/312 (20%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVC----------------- 620
           RG+  P KAF F LVLSCT F L TCEPC++NGM K  EY  C                 
Sbjct: 23  RGMIQPVKAFQFFLVLSCTLFCLITCEPCAVNGMPKRDEYEGCEYYGDAHHVGFQETIID 82

Query: 619 ------------GALEVSKLQSGS----------------------PLSIGTNQANSGAS 542
                         L V ++ S S                       L +  +Q++S +S
Sbjct: 83  STHSQSDMGTSTTRLSVERVCSDSHSFCFPSTLPGFLTEESTLEVGGLEVSRSQSDSASS 142

Query: 541 ------------NRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSC 398
                       NR+WLS   MFKLLNGRT+              SI +   NQN   SC
Sbjct: 143 FAEQSNLRVQASNRSWLSDHSMFKLLNGRTVSCSVYSRAGIHEFSSINTGGANQNDI-SC 201

Query: 397 SPTLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVAN 218
              LL+QKS +V +K N E +KL  FD  SSP VEI PP +DWG KYLF  SVA+LTVAN
Sbjct: 202 KGPLLSQKSTSVRMKNNKEVTKLNSFDGLSSPNVEINPPIMDWGHKYLFLPSVAYLTVAN 261

Query: 217 SFSDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSG 38
           + +DSILHI+EP+ST+ QFYPCN SE+LLGPGEVASICFVFLPRW+GLS+AHLILQTSSG
Sbjct: 262 TCNDSILHIHEPFSTNIQFYPCNFSEVLLGPGEVASICFVFLPRWVGLSSAHLILQTSSG 321

Query: 37  GFLVPTKGFGVE 2
           GFLV  +GF VE
Sbjct: 322 GFLVQARGFAVE 333


>XP_012489170.1 PREDICTED: uncharacterized protein LOC105802214 [Gossypium
           raimondii] KJB40249.1 hypothetical protein
           B456_007G053500 [Gossypium raimondii]
          Length = 1337

 Score =  256 bits (655), Expect = 1e-74
 Identities = 149/312 (47%), Positives = 177/312 (56%), Gaps = 63/312 (20%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVC----------------- 620
           RG+  P KAF F LVLSCT F L TCEPC++NGM K  EY  C                 
Sbjct: 23  RGMIQPVKAFQFFLVLSCTLFCLITCEPCAVNGMPKRDEYEGCEYYGDAHHVGFQETIID 82

Query: 619 ------------GALEVSKLQSGS----------------------PLSIGTNQANSGAS 542
                         L V ++ S S                       L +  +Q++S +S
Sbjct: 83  STHSQTDMGTSTTRLSVERVCSDSHSFCFPSTLPGFLTEESTLEVGGLEVSRSQSDSASS 142

Query: 541 ------------NRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSC 398
                       NR+WLS   MFKLLNGRT+              SI +   NQN   SC
Sbjct: 143 FAEQSNLRVQASNRSWLSDHSMFKLLNGRTVSCSVYSRAGIHEFSSINTGGANQNDI-SC 201

Query: 397 SPTLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVAN 218
              LL+QKS +V +K N E +KL  FD  SSP VEI PP +DWG KYLF  SVA+LTVAN
Sbjct: 202 KGPLLSQKSTSVRMKNNKEVTKLSSFDGLSSPNVEINPPIMDWGHKYLFLPSVAYLTVAN 261

Query: 217 SFSDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSG 38
           + +DSILHI+EP+ST+ QFYPCN SE+LLGPGEVASICFVFLPRW+GLS+AHLILQTSSG
Sbjct: 262 TCNDSILHIHEPFSTNIQFYPCNFSEVLLGPGEVASICFVFLPRWVGLSSAHLILQTSSG 321

Query: 37  GFLVPTKGFGVE 2
           GFLV  +GF VE
Sbjct: 322 GFLVQARGFAVE 333


>XP_016733314.1 PREDICTED: uncharacterized protein LOC107944009 [Gossypium
           hirsutum]
          Length = 1313

 Score =  252 bits (643), Expect = 6e-73
 Identities = 145/310 (46%), Positives = 178/310 (57%), Gaps = 63/310 (20%)
 Frame = -1

Query: 742 LFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVC------------------- 620
           +  P KAF F LVLSCT F L TCEPC+++GM K+ EY  C                   
Sbjct: 1   MLQPVKAFQFFLVLSCTLFCLITCEPCAVSGMPKTDEYEGCEYYGDAHHVGFQETIIDST 60

Query: 619 ----------GALEVSKLQSGS----------------------PLSIGTNQANSGAS-- 542
                       L V ++ S S                       L +  +Q++S +S  
Sbjct: 61  HSQSDMGTFTTRLSVERVCSDSHSFCFPSTLPGFLTEESTLEVGGLEVSRSQSDSASSFA 120

Query: 541 ----------NRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSP 392
                     N +WLS   MFKLLNGRT+              SI +D  NQN   SC  
Sbjct: 121 EQSNLRVQASNSSWLSDHSMFKLLNGRTVSCSVYSKAGIHEFPSINTDGANQNDI-SCKG 179

Query: 391 TLLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVANSF 212
            LL+QKS +V ++KN+E +KL  FD  SSP VEI PP +DWG KYLF  SVA+LTVAN+ 
Sbjct: 180 PLLSQKSTSVRMEKNNEVTKLSSFDGLSSPNVEINPPIMDWGHKYLFLPSVAYLTVANTC 239

Query: 211 SDSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGF 32
           +DSILHI+EP+ST+ QFYPCN SE+LLGPGEVASICFVFLPRW+GLS+AHL+LQTSSGGF
Sbjct: 240 NDSILHIHEPFSTNIQFYPCNFSEVLLGPGEVASICFVFLPRWVGLSSAHLVLQTSSGGF 299

Query: 31  LVPTKGFGVE 2
           LV  +GF VE
Sbjct: 300 LVQARGFAVE 309


>XP_010094386.1 hypothetical protein L484_008274 [Morus notabilis] EXB55923.1
           hypothetical protein L484_008274 [Morus notabilis]
          Length = 1329

 Score =  243 bits (619), Expect = 1e-69
 Identities = 134/304 (44%), Positives = 172/304 (56%), Gaps = 55/304 (18%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEY-------------------- 629
           RGLF+ AK F+F +VLSC  F LATC PCS++G Q+SAE+                    
Sbjct: 21  RGLFYGAKIFHFAVVLSCAIFCLATCHPCSMDGKQESAEFDACRSYGDKSNAVFLDINAE 80

Query: 628 -----------NVC------------------------GALEVSKLQSGSPLSIGTNQAN 554
                      ++C                         ALE +     +P+++G+    
Sbjct: 81  YGHPRSYLKIESICTNSHAFCFPSTLPGFSSRDDKLEAAALEAAGSPFDTPINVGSADDT 140

Query: 553 SGASNRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQK 374
               N++W    G FKLLNG  +              SI +D   QN  SSC   LLN+K
Sbjct: 141 KSTMNKSWSMDYGRFKLLNGGVLSCSLNSREGSNKLSSIQTDGAIQNDASSCRRPLLNKK 200

Query: 373 SKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILH 194
             N   ++N E +K G FD+SSS  VEI+P  LDWG K+++  SVAFLTVAN+ ++S+LH
Sbjct: 201 RTNFKAEENLEIAKSGSFDVSSSRHVEISPAILDWGHKHIYFPSVAFLTVANTCNESVLH 260

Query: 193 IYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKG 14
           +YEP+ST SQFYPCN SE L+GPGE ASICFVFLPRWLGLS+AHLILQTSSGGFL+  KG
Sbjct: 261 VYEPFSTDSQFYPCNFSEALVGPGETASICFVFLPRWLGLSSAHLILQTSSGGFLIKAKG 320

Query: 13  FGVE 2
           F +E
Sbjct: 321 FAIE 324


>XP_007204681.1 hypothetical protein PRUPE_ppa000297mg [Prunus persica] ONH96547.1
           hypothetical protein PRUPE_7G136200 [Prunus persica]
          Length = 1328

 Score =  239 bits (611), Expect = 1e-68
 Identities = 140/307 (45%), Positives = 171/307 (55%), Gaps = 58/307 (18%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVCGA-------------LE 608
           RGL HP KA + ++VL+CT FYLATC  CS NGMQ  +EY+ CG+             L 
Sbjct: 25  RGLSHPIKALHVLMVLACTLFYLATCGQCSGNGMQILSEYDACGSYGDNFDVAFADNFLG 84

Query: 607 VSKLQSGSP----------------------------------LSIGTNQANSGAS---- 542
            S L  G P                                  L +  +Q++  +S    
Sbjct: 85  DSTLGCGIPRNPFNIDKICTSSRLFCFPSTLPGFLEHKLKVADLEVSGSQSDDLSSIGST 144

Query: 541 -------NRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLL 383
                  N++W S +GMFKL NG  +              SI +D  N N  SSC   LL
Sbjct: 145 ENIKLANNKSWSSDNGMFKLFNGGIVSCSLNSKAATNEFSSIQTDSANPNDLSSCRGPLL 204

Query: 382 NQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDS 203
            QKS +    KN+E +K   F  SSSP VEI+P  LDW QK ++  S+AFLTVAN+ +DS
Sbjct: 205 YQKSTSFRPNKNTEMTKSNSFSSSSSPHVEISPAVLDWEQKNMYFPSLAFLTVANTCNDS 264

Query: 202 ILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVP 23
           ILH+YEP+ST  QFYPCN SE+LLGPGE ASICFVFLPRWLGLS+AHLILQTSSGGFL+ 
Sbjct: 265 ILHVYEPFSTDIQFYPCNFSEVLLGPGETASICFVFLPRWLGLSSAHLILQTSSGGFLIQ 324

Query: 22  TKGFGVE 2
            KG  VE
Sbjct: 325 AKGVAVE 331


>XP_018810406.1 PREDICTED: uncharacterized protein LOC108983280 [Juglans regia]
          Length = 1337

 Score =  239 bits (610), Expect = 2e-68
 Identities = 142/309 (45%), Positives = 177/309 (57%), Gaps = 60/309 (19%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVCGA---------LEV--- 605
           RGLFH  +AF FI+VLSC  F  ATC P S+NGM K  E++ CG+         L++   
Sbjct: 20  RGLFHLVRAFQFIVVLSCILFCQATCGPSSMNGMLKPVEHDACGSYRDRFDVEFLDIGVG 79

Query: 604 -SKLQSGSPL---SIGT------------------------------------------- 566
            S  Q G P+   +IGT                                           
Sbjct: 80  DSSTQYGKPMTHVNIGTVCTDSRSFCFPSTLPGFSSKEYEHRDAALEASGSQSDCQLPDK 139

Query: 565 NQANSG-ASNRTWLSHSGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPT 389
           +  +SG  SN++W S  GMF+LL G  +              +I +D  NQN FS    +
Sbjct: 140 STRDSGWMSNQSWSSDHGMFELLKGGIVSCSLNSKEDINEVSTIQADSANQNDFSFSRGS 199

Query: 388 LLNQKSKNVSLKKNSERSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFS 209
           L+NQK K+   +++SE +K   FD SSS  VEI P  LDWGQKYL+  S+AFLTVAN+ +
Sbjct: 200 LINQKCKSFRPERSSEVTKTCSFDGSSSFSVEIKPNVLDWGQKYLYLPSLAFLTVANTCN 259

Query: 208 DSILHIYEPYSTSSQFYPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFL 29
           DSILH+YEP+ST  QFYPCNSSE LLGPGEVASICF++ PRWLGLS+AHLILQTSSGGFL
Sbjct: 260 DSILHVYEPFSTDVQFYPCNSSEALLGPGEVASICFIYFPRWLGLSSAHLILQTSSGGFL 319

Query: 28  VPTKGFGVE 2
           V  KGF +E
Sbjct: 320 VHAKGFAIE 328


>XP_010647355.1 PREDICTED: uncharacterized protein LOC100853492 [Vitis vinifera]
          Length = 1348

 Score =  232 bits (592), Expect = 5e-66
 Identities = 124/224 (55%), Positives = 146/224 (65%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLSIGTNQANSGASNRTWLSHSGMFKLLNG 494
           C P ++ G            LEVS+    + L +G+   +  ASN +W S  GMFKLLNG
Sbjct: 122 CFPSTLPGFLTEEHRLTEAVLEVSR-SPDAKLPVGSAVPSKQASNLSWSSDYGMFKLLNG 180

Query: 493 RTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFDI 314
           RT+              S+ +   NQN  SSC   LLNQKS +  L KNSE      FD 
Sbjct: 181 RTVSCSLNYREGVHVMPSLQTRSANQNDLSSCRGPLLNQKSTSSMLNKNSEMKSSSSFDG 240

Query: 313 SSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEIL 134
           SS P+VEI+PP LDWGQKYL+  SVAF+TV N+  DSILH+YEP+ST  QFYPCN SE+ 
Sbjct: 241 SSLPQVEISPPLLDWGQKYLYLPSVAFITVENTCDDSILHVYEPFSTDIQFYPCNFSEVF 300

Query: 133 LGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LGPGEVASICFVFLPRWLG+S+AHLILQTSSGGFLV  KGF VE
Sbjct: 301 LGPGEVASICFVFLPRWLGVSSAHLILQTSSGGFLVQAKGFAVE 344


>EOX91359.1 Uncharacterized protein TCM_000577 isoform 1 [Theobroma cacao]
          Length = 1323

 Score =  231 bits (590), Expect = 9e-66
 Identities = 122/225 (54%), Positives = 153/225 (68%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLS-IGTNQANSGASNRTWLSHSGMFKLLN 497
           C P ++ G          G+LEVS+ QS S  S I  +     A+N++W S+ GMFKLLN
Sbjct: 86  CFPSTLPGFSTEETKLEVGSLEVSRSQSDSASSYIEPSNLRGQANNKSWFSNHGMFKLLN 145

Query: 496 GRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFD 317
           GR +              S  +D  NQN  S C  +L  Q+S NV +K N E +K G FD
Sbjct: 146 GRMVSCSLSSRDGIHEFSSTFTDDANQNDIS-CRGSLQYQESANVRMKNNREVTKSGSFD 204

Query: 316 ISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEI 137
           +SS P V+++PP LDWGQKYLF  SVA+LTVAN+ ++S LH+YEP+ST+ QFYPCN SE+
Sbjct: 205 VSSFPNVDVSPPVLDWGQKYLFLPSVAYLTVANTCNESDLHVYEPFSTNMQFYPCNFSEL 264

Query: 136 LLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LLGPGEVA+ICFVFLPRW+GLS+AHLILQTSSGGFLV  +GF VE
Sbjct: 265 LLGPGEVATICFVFLPRWVGLSSAHLILQTSSGGFLVQARGFAVE 309


>EOX91360.1 O-Glycosyl hydrolases family 17 protein, putative isoform 2,
           partial [Theobroma cacao]
          Length = 1327

 Score =  231 bits (590), Expect = 9e-66
 Identities = 122/225 (54%), Positives = 153/225 (68%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLS-IGTNQANSGASNRTWLSHSGMFKLLN 497
           C P ++ G          G+LEVS+ QS S  S I  +     A+N++W S+ GMFKLLN
Sbjct: 98  CFPSTLPGFSTEETKLEVGSLEVSRSQSDSASSYIEPSNLRGQANNKSWFSNHGMFKLLN 157

Query: 496 GRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFD 317
           GR +              S  +D  NQN  S C  +L  Q+S NV +K N E +K G FD
Sbjct: 158 GRMVSCSLSSRDGIHEFSSTFTDDANQNDIS-CRGSLQYQESANVRMKNNREVTKSGSFD 216

Query: 316 ISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEI 137
           +SS P V+++PP LDWGQKYLF  SVA+LTVAN+ ++S LH+YEP+ST+ QFYPCN SE+
Sbjct: 217 VSSFPNVDVSPPVLDWGQKYLFLPSVAYLTVANTCNESDLHVYEPFSTNMQFYPCNFSEL 276

Query: 136 LLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LLGPGEVA+ICFVFLPRW+GLS+AHLILQTSSGGFLV  +GF VE
Sbjct: 277 LLGPGEVATICFVFLPRWVGLSSAHLILQTSSGGFLVQARGFAVE 321



 Score = 61.2 bits (147), Expect = 6e-07
 Identities = 26/43 (60%), Positives = 32/43 (74%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVC 620
           RG++  AK+F F LVLSCT F L TCEPCS+NG+ K  EY+ C
Sbjct: 11  RGMYQRAKSFLFFLVLSCTLFCLTTCEPCSVNGVPKMEEYDGC 53


>OAY35420.1 hypothetical protein MANES_12G100500 [Manihot esculenta]
          Length = 1302

 Score =  231 bits (588), Expect = 2e-65
 Identities = 115/224 (51%), Positives = 152/224 (67%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLSIGTNQANSGASNRTWLSHSGMFKLLNG 494
           C P +++G+    +     ALE S+    S  S+G  Q + GASNR+W S SGMF+L NG
Sbjct: 58  CFPSTLHGLPSYEQEYKADALEFSRSHPDSLSSVGPTQDSKGASNRSWFSDSGMFELSNG 117

Query: 493 RTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFDI 314
           +T+               + +   NQN FSSC   L+ +KS ++ L  NSE +K     +
Sbjct: 118 QTVSCSLNSIEDINQLLCVQNSSANQNDFSSCGGPLIIKKSASLRLTSNSEVTKSSPLHV 177

Query: 313 SSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEIL 134
           SSSP V+I+PP LDWG+K+L   SVAFLTVAN+ ++S+L++YEP+ST+ QFYPCN S+  
Sbjct: 178 SSSPHVKISPPVLDWGRKHLHFPSVAFLTVANTCNNSLLYVYEPFSTNIQFYPCNHSKFF 237

Query: 133 LGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LGPGEVAS+CFVFLPRWLGLS+AHLILQTSSGGFLV  KG+ +E
Sbjct: 238 LGPGEVASVCFVFLPRWLGLSSAHLILQTSSGGFLVQVKGYALE 281


>OAY35419.1 hypothetical protein MANES_12G100500 [Manihot esculenta]
          Length = 1361

 Score =  231 bits (588), Expect = 2e-65
 Identities = 115/224 (51%), Positives = 152/224 (67%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLSIGTNQANSGASNRTWLSHSGMFKLLNG 494
           C P +++G+    +     ALE S+    S  S+G  Q + GASNR+W S SGMF+L NG
Sbjct: 117 CFPSTLHGLPSYEQEYKADALEFSRSHPDSLSSVGPTQDSKGASNRSWFSDSGMFELSNG 176

Query: 493 RTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFDI 314
           +T+               + +   NQN FSSC   L+ +KS ++ L  NSE +K     +
Sbjct: 177 QTVSCSLNSIEDINQLLCVQNSSANQNDFSSCGGPLIIKKSASLRLTSNSEVTKSSPLHV 236

Query: 313 SSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEIL 134
           SSSP V+I+PP LDWG+K+L   SVAFLTVAN+ ++S+L++YEP+ST+ QFYPCN S+  
Sbjct: 237 SSSPHVKISPPVLDWGRKHLHFPSVAFLTVANTCNNSLLYVYEPFSTNIQFYPCNHSKFF 296

Query: 133 LGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LGPGEVAS+CFVFLPRWLGLS+AHLILQTSSGGFLV  KG+ +E
Sbjct: 297 LGPGEVASVCFVFLPRWLGLSSAHLILQTSSGGFLVQVKGYALE 340



 Score = 67.4 bits (163), Expect = 5e-09
 Identities = 30/45 (66%), Positives = 35/45 (77%)
 Frame = -1

Query: 748 RGLFHPAKAFNFILVLSCTFFYLATCEPCSINGMQKSAEYNVCGA 614
           RGLFH  KAF F LVLSCT F LATC PC ++GMQKS +++ CGA
Sbjct: 30  RGLFHQVKAFQFFLVLSCTIFCLATCGPCLMDGMQKSKKHDGCGA 74


>OMO84185.1 hypothetical protein COLO4_22183 [Corchorus olitorius]
          Length = 1311

 Score =  228 bits (581), Expect = 2e-64
 Identities = 120/233 (51%), Positives = 153/233 (65%), Gaps = 1/233 (0%)
 Frame = -1

Query: 697 CTFFYLATCEPCSINGMQKSAEYNVCGALEVSKLQSGSPLS-IGTNQANSGASNRTWLSH 521
           CT  +L  C P ++ G          G LEVS+ QS S  S +  +      +NR+WLS+
Sbjct: 79  CTNSHLF-CFPSTLPGFSTEESKIEVGGLEVSRSQSHSDTSYVEPSNLRGQGNNRSWLSN 137

Query: 520 SGMFKLLNGRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSE 341
            G+F+LLNGR +              S  +D   QN  SSC     NQKS +V ++ N E
Sbjct: 138 HGVFRLLNGRMVSCSLYSRGGVHEFSSFLTDGATQNDISSCRGPTQNQKSTSVRMENNIE 197

Query: 340 RSKLGYFDISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQF 161
            +K G F++SS P V+I+P  +DWGQKYLF  SVA+LTVAN+ +DSILH+YEP+STS QF
Sbjct: 198 VTKSGSFEVSSLPNVDISPAVMDWGQKYLFLPSVAYLTVANTCNDSILHVYEPFSTSIQF 257

Query: 160 YPCNSSEILLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           YPCN SE+LLGPGEV SICFVFLPRW+G S+AHL+LQTSSGGFLV  +G+ VE
Sbjct: 258 YPCNFSEVLLGPGEVVSICFVFLPRWVGSSSAHLVLQTSSGGFLVQARGYAVE 310


>XP_017983519.1 PREDICTED: uncharacterized protein LOC18611094 isoform X3
           [Theobroma cacao]
          Length = 1319

 Score =  227 bits (578), Expect = 4e-64
 Identities = 121/225 (53%), Positives = 151/225 (67%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLS-IGTNQANSGASNRTWLSHSGMFKLLN 497
           C P ++ G          G+LEVS+ QS S  S I  +     A+N++W S+ GMFKLLN
Sbjct: 86  CFPSTLPGFSTEETKLEVGSLEVSRSQSDSASSYIEPSNLRGQANNKSWFSNHGMFKLLN 145

Query: 496 GRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFD 317
           GR +                 S   NQN  S C  +L  Q+S NV +K N E +K G FD
Sbjct: 146 GRMVSCSLSSRDGIHEF----SSNANQNDIS-CRGSLQYQESANVRMKNNREVTKSGSFD 200

Query: 316 ISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEI 137
           +SS P V+++PP LDWGQKYLF  SVA+LTVAN+ ++S LH+YEP+ST+ QFYPCN SE+
Sbjct: 201 VSSFPNVDVSPPVLDWGQKYLFLPSVAYLTVANTCNESDLHVYEPFSTNMQFYPCNFSEL 260

Query: 136 LLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LLGPGEVA+ICFVFLPRW+GLS+AHLILQTSSGGFLV  +GF VE
Sbjct: 261 LLGPGEVATICFVFLPRWVGLSSAHLILQTSSGGFLVQARGFAVE 305


>XP_017983515.1 PREDICTED: uncharacterized protein LOC18611094 isoform X2
           [Theobroma cacao]
          Length = 1331

 Score =  227 bits (578), Expect = 4e-64
 Identities = 121/225 (53%), Positives = 151/225 (67%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLS-IGTNQANSGASNRTWLSHSGMFKLLN 497
           C P ++ G          G+LEVS+ QS S  S I  +     A+N++W S+ GMFKLLN
Sbjct: 86  CFPSTLPGFSTEETKLEVGSLEVSRSQSDSASSYIEPSNLRGQANNKSWFSNHGMFKLLN 145

Query: 496 GRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFD 317
           GR +                 S   NQN  S C  +L  Q+S NV +K N E +K G FD
Sbjct: 146 GRMVSCSLSSRDGIHEF----SSNANQNDIS-CRGSLQYQESANVRMKNNREVTKSGSFD 200

Query: 316 ISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEI 137
           +SS P V+++PP LDWGQKYLF  SVA+LTVAN+ ++S LH+YEP+ST+ QFYPCN SE+
Sbjct: 201 VSSFPNVDVSPPVLDWGQKYLFLPSVAYLTVANTCNESDLHVYEPFSTNMQFYPCNFSEL 260

Query: 136 LLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LLGPGEVA+ICFVFLPRW+GLS+AHLILQTSSGGFLV  +GF VE
Sbjct: 261 LLGPGEVATICFVFLPRWVGLSSAHLILQTSSGGFLVQARGFAVE 305


>XP_007047203.2 PREDICTED: uncharacterized protein LOC18611094 isoform X1
           [Theobroma cacao]
          Length = 1336

 Score =  227 bits (578), Expect = 4e-64
 Identities = 121/225 (53%), Positives = 151/225 (67%), Gaps = 1/225 (0%)
 Frame = -1

Query: 673 CEPCSINGMQKSAEYNVCGALEVSKLQSGSPLS-IGTNQANSGASNRTWLSHSGMFKLLN 497
           C P ++ G          G+LEVS+ QS S  S I  +     A+N++W S+ GMFKLLN
Sbjct: 86  CFPSTLPGFSTEETKLEVGSLEVSRSQSDSASSYIEPSNLRGQANNKSWFSNHGMFKLLN 145

Query: 496 GRTIXXXXXXXXXXXXXXSIGSDVGNQNGFSSCSPTLLNQKSKNVSLKKNSERSKLGYFD 317
           GR +                 S   NQN  S C  +L  Q+S NV +K N E +K G FD
Sbjct: 146 GRMVSCSLSSRDGIHEF----SSNANQNDIS-CRGSLQYQESANVRMKNNREVTKSGSFD 200

Query: 316 ISSSPKVEITPPELDWGQKYLFSASVAFLTVANSFSDSILHIYEPYSTSSQFYPCNSSEI 137
           +SS P V+++PP LDWGQKYLF  SVA+LTVAN+ ++S LH+YEP+ST+ QFYPCN SE+
Sbjct: 201 VSSFPNVDVSPPVLDWGQKYLFLPSVAYLTVANTCNESDLHVYEPFSTNMQFYPCNFSEL 260

Query: 136 LLGPGEVASICFVFLPRWLGLSTAHLILQTSSGGFLVPTKGFGVE 2
           LLGPGEVA+ICFVFLPRW+GLS+AHLILQTSSGGFLV  +GF VE
Sbjct: 261 LLGPGEVATICFVFLPRWVGLSSAHLILQTSSGGFLVQARGFAVE 305

