BLASTX nr result

ID: Akebia22_contig00023752 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00023752
         (1389 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006374271.1| hypothetical protein POPTR_0015s05570g [Popu...   150   1e-33
ref|XP_002528226.1| hypothetical protein RCOM_0460020 [Ricinus c...   141   6e-31
ref|XP_003620296.1| Fiber expressed protein [Medicago truncatula...   135   4e-29
ref|XP_007152573.1| hypothetical protein PHAVU_004G1414000g [Pha...   132   4e-28
ref|XP_003620292.1| Cotton fiber expressed protein [Medicago tru...   130   1e-27
ref|XP_004163126.1| PREDICTED: uncharacterized LOC101210213 [Cuc...   130   1e-27
ref|XP_004149308.1| PREDICTED: uncharacterized protein LOC101210...   130   1e-27
ref|XP_003534104.2| PREDICTED: rRNA biogenesis protein RRP36-lik...   127   9e-27
ref|XP_004301222.1| PREDICTED: uncharacterized protein LOC101308...   126   3e-26
ref|XP_006599709.1| PREDICTED: uncharacterized protein LOC102664...   125   6e-26
ref|XP_007025771.1| Uncharacterized protein TCM_029972 [Theobrom...   124   8e-26
ref|XP_004294520.1| PREDICTED: uncharacterized protein LOC101300...   124   1e-25
ref|XP_006467883.1| PREDICTED: uncharacterized protein LOC102624...   124   1e-25
gb|AAY85179.1| fiber expressed protein [Gossypium hirsutum]           123   2e-25
gb|AAC33277.1| cotton fiber expressed protein 2 [Gossypium hirsu...   122   4e-25
ref|XP_006841790.1| hypothetical protein AMTR_s00003p00267520 [A...   121   6e-25
ref|XP_002273372.2| PREDICTED: uncharacterized protein LOC100244...   120   1e-24
emb|CAN65070.1| hypothetical protein VITISV_003953 [Vitis vinifera]   118   5e-24
ref|XP_007021270.1| Uncharacterized protein TCM_031316 [Theobrom...   117   9e-24
emb|CBI23474.3| unnamed protein product [Vitis vinifera]              117   9e-24

>ref|XP_006374271.1| hypothetical protein POPTR_0015s05570g [Populus trichocarpa]
            gi|550322028|gb|ERP52068.1| hypothetical protein
            POPTR_0015s05570g [Populus trichocarpa]
          Length = 326

 Score =  150 bits (380), Expect = 1e-33
 Identities = 107/306 (34%), Positives = 144/306 (47%), Gaps = 25/306 (8%)
 Frame = +3

Query: 516  FFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLWISVRS 692
            FF+ SS +  RF+TAIW +KL  LS+GIIST +LFK+AI P + NL +STLP +WIS+R 
Sbjct: 4    FFEISSFNTSRFETAIWAVKLVLLSVGIISTFILFKVAIIPCTFNLILSTLPSVWISLRG 63

Query: 693  WFSPPYLYILVNFIIIGIVASSTFQH-------------KLXXXXXXXXXXXEDEMEIEN 833
            W SPPY+YI+VNFIII IVASSTFQH             KL            D  +  +
Sbjct: 64   WLSPPYIYIIVNFIIITIVASSTFQHPNPDTKLPYSSSKKLKSQNQSSTNHANDLWQEHD 123

Query: 834  RTQPQKQ----SDFDTVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVESETSCLT 1001
              + +KQ      F+  IDSS    S + +  N     +  EK+     K      SCLT
Sbjct: 124  MQEVEKQLGTILSFEIPIDSSQDYYSPDTFLTNSG--KELQEKTNTDPSKDPCPPDSCLT 181

Query: 1002 DSNDITEQKPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDAT 1181
            DS    ++K  +                                  + +     DTL+  
Sbjct: 182  DSAKKQQKKMDMEP--------------------------------LTQEADQQDTLEDA 209

Query: 1182 WKAITEGKGKPLTRHLRKSDTWNVPPRVIEPEPDLKP-------HXXXXXXXXXRRELRK 1340
            W +I E +GK  TR LRK  TW+ PP+V++    +                   RREL+K
Sbjct: 210  WTSIMEKQGKTPTRQLRKIGTWDTPPKVLQKVNGIVTADGGGGCGDDDDPVSWARRELKK 269

Query: 1341 SETFND 1358
            S+TFND
Sbjct: 270  SDTFND 275


>ref|XP_002528226.1| hypothetical protein RCOM_0460020 [Ricinus communis]
            gi|223532362|gb|EEF34159.1| hypothetical protein
            RCOM_0460020 [Ricinus communis]
          Length = 328

 Score =  141 bits (356), Expect = 6e-31
 Identities = 101/309 (32%), Positives = 142/309 (45%), Gaps = 24/309 (7%)
 Frame = +3

Query: 504  MADFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLA-IPYSLNLFISTLPRLWI 680
            MAD F Q+S   R R +  IW  KL  LS GI++ ++ FK+A +P++ +L +STLP LWI
Sbjct: 1    MADLF-QSSIFGRNRLEGIIWGAKLVLLSAGIMAAIIFFKVAMVPFAFDLILSTLPSLWI 59

Query: 681  SVRSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSD 860
            S+ SW SPPY+YI++NFIII I ASS+ QH+                 + +R     ++ 
Sbjct: 60   SLHSWLSPPYIYIVLNFIIIIIAASSSLQHQ--------------NSNLNSRKASSTKTQ 105

Query: 861  FDTVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVE-------------SETSCLT 1001
              T   S S     ++W ++ H + D  EK    +    E             S   C  
Sbjct: 106  SITTDKSQSKYHFHDLWQDDNHDLQD-DEKQAKTSNPSTEPSSPDCGSSQNSHSHDPCQD 164

Query: 1002 DSNDITEQKPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKP-KKHNDTLDA 1178
            D  ++ EQ+P V                             PL   I  P ++  DTL+ 
Sbjct: 165  DVQEVEEQQPLV----------------------------KPLSPDIPNPAEEEEDTLEG 196

Query: 1179 TWKAITEGKGKPLTRHLRKSDTWNVPPRVI---------EPEPDLKPHXXXXXXXXXRRE 1331
            TWK I EGKGK   R L+KS+TW  PPR+          + + D             RRE
Sbjct: 197  TWKLIMEGKGK-AARELKKSETWGTPPRLAVVVQGDGDKDADDDDDGGDPNDPVAWARRE 255

Query: 1332 LRKSETFND 1358
            LRKS+TF+D
Sbjct: 256  LRKSDTFSD 264


>ref|XP_003620296.1| Fiber expressed protein [Medicago truncatula]
            gi|357500037|ref|XP_003620307.1| Fiber expressed protein
            [Medicago truncatula] gi|355495311|gb|AES76514.1| Fiber
            expressed protein [Medicago truncatula]
            gi|355495322|gb|AES76525.1| Fiber expressed protein
            [Medicago truncatula]
          Length = 331

 Score =  135 bits (340), Expect = 4e-29
 Identities = 100/298 (33%), Positives = 142/298 (47%), Gaps = 15/298 (5%)
 Frame = +3

Query: 510  DFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLWISV 686
            D F   SSL   R D ++   KL  +SIGIIST++LFK+AI PY+ +L +STLP+LW S+
Sbjct: 2    DLFRNPSSLKSNRMDQSLGVAKLVLMSIGIISTLILFKVAIIPYTFDLVLSTLPQLWFSI 61

Query: 687  RSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQP-QKQSDF 863
            R+WF+ P+LYI+VNFIII IV SS F HK            +    +E  T P ++++  
Sbjct: 62   RTWFTIPFLYIIVNFIIITIVFSSNFSHK-SNSSITFSDLKQTTTILETTTNPIEQENQT 120

Query: 864  DTVIDSSSIKTSSEIWAENEHIVSDYGEKSLVL-------AKKYVESETSCLTDSNDITE 1022
            +       +    E   ++E  V D  +  L         ++K    E   LTDS+D  +
Sbjct: 121  NEPHQEEKVVEEIEEQEQDEKRVVDVKDSELFCDEFITHPSQKKCSKEDYSLTDSDDKVK 180

Query: 1023 QKPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEG 1202
                   +F                     K  N      +  K  +D+L+ATWKAI EG
Sbjct: 181  DFELFFNKFITDPI--------------QEKRCNDYNSPDSGDKGDDDSLEATWKAIMEG 226

Query: 1203 KGKPLTRHLRKSDTWNVPPRVIEPEPDLK------PHXXXXXXXXXRRELRKSETFND 1358
            + K    +L+KSDTW    R+++ EP                     REL+KSETFND
Sbjct: 227  QEKTKKPYLKKSDTWTA--RIVKAEPFRNNGGCGFGSGDDDPVAWAERELKKSETFND 282


>ref|XP_007152573.1| hypothetical protein PHAVU_004G1414000g [Phaseolus vulgaris]
            gi|561025882|gb|ESW24567.1| hypothetical protein
            PHAVU_004G1414000g [Phaseolus vulgaris]
          Length = 315

 Score =  132 bits (332), Expect = 4e-28
 Identities = 94/293 (32%), Positives = 145/293 (49%), Gaps = 10/293 (3%)
 Frame = +3

Query: 510  DFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLWISV 686
            D F + SS    RF+TA+W  KL  +S+G +S+++LFK+A+ PY+ +L +STLP+ W+SV
Sbjct: 5    DLFQKPSSRKSNRFETAMWAAKLVLMSMGFVSSIVLFKVAVVPYTFHLLLSTLPQFWVSV 64

Query: 687  RSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSD-- 860
            RSW + P+LYI+VNFIII I ASS F  K                     + P+  SD  
Sbjct: 65   RSWLTLPFLYIIVNFIIIIIAASSNFHPK-------------------GNSHPKFFSDSP 105

Query: 861  --FDTVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVESETSCLTDSNDITEQKPS 1034
              FD    +++I  ++    + E   ++  E+     +K VE E           +Q+  
Sbjct: 106  APFDPKHATTTISDTANHPTDPESQTNEPKEE-----EKEVEEE----------QKQQEQ 150

Query: 1035 VSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKP 1214
               +   +K + +T L           P N     +      +DTL+ATWKAI EG+GK 
Sbjct: 151  EKKQEQEQKVAEQTDLLSRDKFMLHPLPENCTNDYLLPDSDGDDTLEATWKAIMEGQGKT 210

Query: 1215 LTRHLRKSDTWNVPPRVIEPEP-----DLKPHXXXXXXXXXRRELRKSETFND 1358
            +   L+KS+TW    R+ + EP     +             ++EL+KS+TFND
Sbjct: 211  IRPQLKKSETWGA--RIAKAEPFGRNGEGDDDDDNDPVAWAKKELKKSDTFND 261


>ref|XP_003620292.1| Cotton fiber expressed protein [Medicago truncatula]
            gi|357500045|ref|XP_003620311.1| Cotton fiber expressed
            protein [Medicago truncatula] gi|355495307|gb|AES76510.1|
            Cotton fiber expressed protein [Medicago truncatula]
            gi|355495326|gb|AES76529.1| Cotton fiber expressed
            protein [Medicago truncatula]
          Length = 316

 Score =  130 bits (328), Expect = 1e-27
 Identities = 101/295 (34%), Positives = 145/295 (49%), Gaps = 12/295 (4%)
 Frame = +3

Query: 510  DFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLWISV 686
            D F   SS    R +T I   KL  +SIGIIST++LFK+AI PY+ +L +STLP+LW S+
Sbjct: 2    DLFQNPSSQKSSRMETPICVAKLVLISIGIISTLILFKVAIIPYTFDLVLSTLPQLWFSI 61

Query: 687  RSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFD 866
            R+WF+ P+LYI+VNFIII IVASS+F                + +E+EN+T    Q    
Sbjct: 62   RTWFTLPFLYIIVNFIIIIIVASSSFSDP-KHTTTSILETTTNPIELENQTNEPHQ---- 116

Query: 867  TVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVE-------SETSCLTDSNDITEQ 1025
               +   ++   E   E + +V D    S +   K++        S+   L DS+D  + 
Sbjct: 117  ---EEKKVEEVEEQEQEEKRVVKD----SELFHNKFITDPIPEKCSKDFYLPDSDDKVKD 169

Query: 1026 KPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGK 1205
                  +F +   S E             K  N   +  +  K  +D+L+ATWKAI E +
Sbjct: 170  FRLFCNKF-IDDPSPE-------------KCCNDYNLPDSGDKGDDDSLEATWKAIMEAQ 215

Query: 1206 GKPLTRHLRKSDTWNVPPRVIEPEPDLKP----HXXXXXXXXXRRELRKSETFND 1358
             K    HL+KS TW    R+++ EP                  +REL+KSETFND
Sbjct: 216  EKTKKPHLKKSGTWTA--RIVKAEPFRNNGGFCGGDDDPVAWAQRELKKSETFND 268


>ref|XP_004163126.1| PREDICTED: uncharacterized LOC101210213 [Cucumis sativus]
          Length = 346

 Score =  130 bits (327), Expect = 1e-27
 Identities = 92/289 (31%), Positives = 139/289 (48%), Gaps = 22/289 (7%)
 Frame = +3

Query: 561  IWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFIII 740
            I ++K+  +S G++S  L  KL++P   +   S +P +W S  SW +PPYLY+L+N III
Sbjct: 11   ILSLKILLISTGLLSMALFLKLSVPLLADFVFSEIPSIWTSFLSWLTPPYLYLLINCIII 70

Query: 741  GIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDF-------------DTVIDS 881
             IVASS  Q  L               +I +       SD               T +  
Sbjct: 71   SIVASSKLQSNLENDPIPETPAPPPVTKISSDYAVCGSSDILNGYSSYNANQNVVTKVSD 130

Query: 882  SSIKTSSEIWAE-NEHIVSDY---GEKSLVLAKKYVESETSCLTDSNDITEQKPSVSARF 1049
              I  S+E++    E  VS+    GE   ++A K  + E+S L+   +   +K S+   F
Sbjct: 131  LEIDDSNEVYGRIEESRVSEMEKKGENDSMIAMKGGD-ESSVLSSITNTLPRKDSIGVLF 189

Query: 1050 AVRKSSTETQLRLG-RSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRH 1226
            + ++       R G R   +S+    PL  G++KPK+  DTL+ TW+ ITEG+  PLTRH
Sbjct: 190  SNKEEKPPVSSRFGQRKFVKSSPEGKPL--GVSKPKR-QDTLENTWRKITEGRSMPLTRH 246

Query: 1227 LRKSDTW----NVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFNDG 1361
            LRKSDTW       P ++  +P   P           + ++KSETF +G
Sbjct: 247  LRKSDTWESHGRKAPTMVVEDPATPP----------SKVMKKSETFKEG 285


>ref|XP_004149308.1| PREDICTED: uncharacterized protein LOC101210213 [Cucumis sativus]
          Length = 306

 Score =  130 bits (327), Expect = 1e-27
 Identities = 92/289 (31%), Positives = 139/289 (48%), Gaps = 22/289 (7%)
 Frame = +3

Query: 561  IWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFIII 740
            I ++K+  +S G++S  L  KL++P   +   S +P +W S  SW +PPYLY+L+N III
Sbjct: 11   ILSLKILLISTGLLSMALFLKLSVPLLADFVFSEIPSIWTSFLSWLTPPYLYLLINCIII 70

Query: 741  GIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDF-------------DTVIDS 881
             IVASS  Q  L               +I +       SD               T +  
Sbjct: 71   SIVASSKLQSNLENDPIPETPAPPPVTKISSDYAVCGSSDILNGYSSYNANQNVVTKVSD 130

Query: 882  SSIKTSSEIWAE-NEHIVSDY---GEKSLVLAKKYVESETSCLTDSNDITEQKPSVSARF 1049
              I  S+E++    E  VS+    GE   ++A K  + E+S L+   +   +K S+   F
Sbjct: 131  LEIDDSNEVYGRIEESRVSEMEKKGENDSMIAMKGGD-ESSVLSSITNTLPRKDSIGVLF 189

Query: 1050 AVRKSSTETQLRLG-RSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRH 1226
            + ++       R G R   +S+    PL  G++KPK+  DTL+ TW+ ITEG+  PLTRH
Sbjct: 190  SNKEEKPPVSSRFGQRKFVKSSPEGKPL--GVSKPKR-QDTLENTWRKITEGRSMPLTRH 246

Query: 1227 LRKSDTW----NVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFNDG 1361
            LRKSDTW       P ++  +P   P           + ++KSETF +G
Sbjct: 247  LRKSDTWESHGRKAPTMVVEDPATPP----------SKVMKKSETFKEG 285


>ref|XP_003534104.2| PREDICTED: rRNA biogenesis protein RRP36-like [Glycine max]
          Length = 298

 Score =  127 bits (320), Expect = 9e-27
 Identities = 91/287 (31%), Positives = 133/287 (46%), Gaps = 1/287 (0%)
 Frame = +3

Query: 501  AMADFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLW 677
            A+ D F   SSL   +F+T++W  KL  +S+G+IST++L K+AI PY+ +L +STLP+  
Sbjct: 2    AVMDLFQNPSSLKSNKFETSVWIAKLVLMSMGVISTLVLLKVAIVPYTFHLLLSTLPQFC 61

Query: 678  ISVRSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQS 857
            +SVRSW + P+LYI+VNFIII I ASS F  K                   +   P+  S
Sbjct: 62   VSVRSWLTLPFLYIIVNFIIITIAASSNFPPK-------------------SNNHPKTFS 102

Query: 858  DFDTVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVESETSCLTDSNDITEQKPSV 1037
            D      + S   S    +EN+       EK +   ++ V  E        ++ E+    
Sbjct: 103  DPKHTTTTISDTVSHPTESENQTNEPKEEEKEVQQQQQEVVEEEE---PEEEVVEESGLT 159

Query: 1038 SARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPL 1217
              +F +R                     N            +DTL+ATW+AI EG+GK +
Sbjct: 160  YDKFMIRPLL-----------------ENCTNDYFLPDSDGDDTLEATWRAIMEGQGKTM 202

Query: 1218 TRHLRKSDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
               L+KSDTW       EP                ++EL+KS+TFND
Sbjct: 203  KPQLKKSDTWGARIAKAEPFHRNGEGGDDDPVAWAQKELKKSDTFND 249


>ref|XP_004301222.1| PREDICTED: uncharacterized protein LOC101308473 [Fragaria vesca
            subsp. vesca]
          Length = 353

 Score =  126 bits (316), Expect = 3e-26
 Identities = 100/310 (32%), Positives = 145/310 (46%), Gaps = 35/310 (11%)
 Frame = +3

Query: 534  LSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLWISVRSW-FSPP 707
            L R R +TA+W  KL  L  GI+ST++ FK+ I PY LNL +STLP LW S+RS+  SP 
Sbjct: 5    LKRARLETAVWAGKLLLLCAGIVSTMVFFKVVIIPYLLNLTLSTLPDLWTSLRSYCLSPL 64

Query: 708  YLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXE--DEMEIENRTQPQKQSDFDTVIDS 881
            Y+YI+VNFIII I ASS FQ++               + +     T    + D+ TVID 
Sbjct: 65   YIYIIVNFIIIIIAASSIFQNQKQKHPSSVPSSYNSVNNLNHAKTTSIDHEDDYHTVIDG 124

Query: 882  SSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVESETSCLTDSNDITEQKPSVSARFAVRK 1061
            ++   S++I   +    +       +L  ++          SN++         + A  +
Sbjct: 125  TAEYNSNKIQTSSHSENATISPLPSLLTHQF----------SNEMISWHDIHIVQQASTE 174

Query: 1062 SSTETQLRLGRSMA-----RSAKPSNPLGIGIAKPKKHND-TLDATWKAITEGKGKPLTR 1223
               E+    G + A      S+K  + L    A  ++  D TLD TW AI E +GKP+  
Sbjct: 175  EGDESITVSGNTTAPSPEEHSSKEDSCLTEKPATEEEAEDNTLDTTWNAIMERQGKPICN 234

Query: 1224 HLRKSDTWNVPPRV-------IEPEPDL------------------KPHXXXXXXXXXRR 1328
            HL+KS+TW+ PPR        I P  +L                    +         R+
Sbjct: 235  HLKKSETWDTPPRTTGLVRSSIIPARELVQEDHVLDDDHHQDINNNNDNVVDDRVAWARK 294

Query: 1329 ELRKSETFND 1358
            EL+KSETFND
Sbjct: 295  ELKKSETFND 304


>ref|XP_006599709.1| PREDICTED: uncharacterized protein LOC102664248 [Glycine max]
          Length = 305

 Score =  125 bits (313), Expect = 6e-26
 Identities = 96/288 (33%), Positives = 142/288 (49%), Gaps = 5/288 (1%)
 Frame = +3

Query: 510  DFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAI-PYSLNLFISTLPRLWISV 686
            D F   SSL   R +TA+W  KL  +S+G+IST++L K+AI PY+ +L +STLP+  +SV
Sbjct: 5    DLFQNPSSLKSNRSETAMWIAKLVLMSMGVISTLVLLKVAIVPYTFHLLLSTLPQFCVSV 64

Query: 687  RSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFD 866
            RSW + P+LYI+VNFIII I ASS F  K                   +   P       
Sbjct: 65   RSWLTLPFLYIIVNFIIITIAASSNFPPK----------------TFSDSPAPSDPKHTT 108

Query: 867  TVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVESETSCLTDSNDITEQKPSVSAR 1046
            TVI  ++   +     E+E+  ++  E+     +K VE E   L       EQ+  V   
Sbjct: 109  TVISDTANHPT-----ESENQTNEPKEE-----EKEVEEEEEEL-------EQEEVVEEE 151

Query: 1047 FAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRH 1226
                +   E+ L   + M + +  S      +      +DTL+ATW+AI EG+GK +   
Sbjct: 152  LEQEEVIEESGLTYDKFMIQPSLESCTNDY-VLPDSDGDDTLEATWRAIMEGQGKTMKPQ 210

Query: 1227 LRKSDTWNVPPRVIEPEPDLK----PHXXXXXXXXXRRELRKSETFND 1358
            L+KSDTW    R+ + EP  +               ++EL+KS+TFND
Sbjct: 211  LKKSDTWGA--RIAKAEPFHRNGEGGGGDDDPVAWAQKELKKSDTFND 256


>ref|XP_007025771.1| Uncharacterized protein TCM_029972 [Theobroma cacao]
            gi|508781137|gb|EOY28393.1| Uncharacterized protein
            TCM_029972 [Theobroma cacao]
          Length = 332

 Score =  124 bits (312), Expect = 8e-26
 Identities = 81/289 (28%), Positives = 136/289 (47%), Gaps = 21/289 (7%)
 Frame = +3

Query: 555  TAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFI 734
            T I ++K+ F+SIG+++     K+++P  L   +S  P LW + RSW  PPYLY+++N I
Sbjct: 6    TLILSLKVLFISIGMLAIAFGLKVSVPLVLEFSVSQAPLLWSTFRSWLKPPYLYVIINGI 65

Query: 735  IIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFDTVIDSSSIKTSSEIWA 914
            II I ASS F HK              ++ ++    P  + +  + +D   +++S+ ++ 
Sbjct: 66   IITIAASSRFNHKTGEKDQKEQMQQRSKISVDQ--GPAFEDEMKSGLDFGVVESSALVYE 123

Query: 915  ENE------------------HIVSDYGEKSLVLAKKYV---ESETSCLTDSNDITEQKP 1031
            + +                    V D G++  +   +++     ++S +   + +  +KP
Sbjct: 124  QEQRGEEVETRGFEEESNAAVEDVGDGGDEFAISKSEWIPPRRMDSSEIPSDSLLPTEKP 183

Query: 1032 SVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGK 1211
              ++RF  RK                A P     + +AKPK+H +TL+ TWK ITEG+  
Sbjct: 184  PAASRFGHRKPV-------------RANPEGGRALRVAKPKRH-ETLENTWKMITEGRAM 229

Query: 1212 PLTRHLRKSDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
            PLTRHL+KSDTW    R +  E                  ++KSETF D
Sbjct: 230  PLTRHLKKSDTWENHGRDVNVE-----------ALADSPLMKKSETFRD 267


>ref|XP_004294520.1| PREDICTED: uncharacterized protein LOC101300989 [Fragaria vesca
            subsp. vesca]
          Length = 352

 Score =  124 bits (311), Expect = 1e-25
 Identities = 94/307 (30%), Positives = 135/307 (43%), Gaps = 43/307 (14%)
 Frame = +3

Query: 567  TIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFIIIGI 746
            ++K+  +S G++S  +  KL++P + ++  S LP LW S+ SW  PPYLYIL+N III I
Sbjct: 9    SLKIVLISTGVVSMAVALKLSVPVAADIVASQLPSLWSSLLSWLRPPYLYILINCIIISI 68

Query: 747  VASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFDTVIDSSSIKTSSEIWAENEH 926
            VASS                 ED  + E    P  +      +  S+ +   +  A N  
Sbjct: 69   VASSKLH---------SAPKPEDHHDSEIIIPPPPEMAVVPPVKISAAEARPDYAAYNGV 119

Query: 927  IVSDYGEKSLVLAK--------------KYVESETSCLT-------DSNDIT-------- 1019
             +SDYG  S VL K              + +E E   +          N +T        
Sbjct: 120  ALSDYGYDSHVLPKDSDSYGGAVAAETVEVLEPENEMIKVLAEVNGGDNAVTAASRPARS 179

Query: 1020 --------------EQKPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKK 1157
                           QKP VS+RFA RKS               A P     +G+++PK+
Sbjct: 180  GLQRKDSMEFWLNENQKPPVSSRFAHRKSI-------------RASPEGGKSLGVSRPKR 226

Query: 1158 HNDTLDATWKAITEGKGKPLTRHLRKSDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELR 1337
            H DTL++TWK IT+G+  PLTRHL+KSDTW        P P +             + + 
Sbjct: 227  H-DTLESTWKTITDGRSVPLTRHLKKSDTWERRMDQNTPPPPMAD-----------KMMH 274

Query: 1338 KSETFND 1358
            KSETF++
Sbjct: 275  KSETFHE 281


>ref|XP_006467883.1| PREDICTED: uncharacterized protein LOC102624543 [Citrus sinensis]
          Length = 308

 Score =  124 bits (310), Expect = 1e-25
 Identities = 87/270 (32%), Positives = 136/270 (50%), Gaps = 4/270 (1%)
 Frame = +3

Query: 561  IWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFIII 740
            I  +K+ F+S G++S  L  K ++P+ ++  +S  P +W S  SW  PPYLYI++N III
Sbjct: 8    ILPLKVFFISTGVLSIALFCKSSVPFVMDFSVSRAPVIWSSFVSWLKPPYLYIIINAIII 67

Query: 741  GIVASS-TFQHKLXXXXXXXXXXXEDEMEIENRTQ--PQKQSDFDTVIDSSSIKTSSEIW 911
             I ASS  +Q+             E EM+ E +      ++ +  TV +  S+  S +  
Sbjct: 68   IIAASSHLYQNDHVPSTDSTPSDVEYEMKYEQQQMIVVAEEENKATVFEEKSVVVSGD-- 125

Query: 912  AENEHIVSDYGEKSLVLAKKYVESETSCLTDSNDIT-EQKPSVSARFAVRKSSTETQLRL 1088
             + +  V +YG+ +     +  +S    LTD + +  E+KP VSARF  RK         
Sbjct: 126  -DAQVEVGNYGDAAPWTPPQRTDS-LEILTDFHLLAEEEKPLVSARFGHRKPI------- 176

Query: 1089 GRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRHLRKSDTWNVPPRVI 1268
                   + P +   + + KP++H +TL+ TWK ITEG+  PLTRH++KSDTW    R +
Sbjct: 177  ------KSSPEDGKKLRVTKPRRH-ETLENTWKTITEGRAMPLTRHMKKSDTWENHGRQV 229

Query: 1269 EPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
              +P L               ++KSETF D
Sbjct: 230  NVDPLL---------------VKKSETFKD 244


>gb|AAY85179.1| fiber expressed protein [Gossypium hirsutum]
          Length = 331

 Score =  123 bits (308), Expect = 2e-25
 Identities = 95/295 (32%), Positives = 140/295 (47%), Gaps = 27/295 (9%)
 Frame = +3

Query: 555  TAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFI 734
            T I ++K+  +S GI+  VL  K+++P  L   +S  P  W   RSW  PPYLY+++N I
Sbjct: 6    TWILSLKVFLISTGILGIVLGLKISVPLVLEFSVSQAPLWWSGFRSWLKPPYLYVVINGI 65

Query: 735  IIGIVASSTFQHKLXXXXXXXXXXXEDEME-------IENRTQPQKQSDFDTVIDSSSIK 893
            II I ASS F               +D+ME       I    QP  + +  +  DS +++
Sbjct: 66   IITIAASSRFNQN---------NGEKDQMEQMQPRPKISADQQPMVEYETKSGWDSDAVE 116

Query: 894  TSSEIWAEN---EHIVSDYGEKSLVLAKK-------YVESETSCL----TDSNDI----- 1016
            +S  ++ EN   E + +   E+   +A K       +V S++  +    TDS++I     
Sbjct: 117  SSDFVYEENQRGEEVETRVSEEESNVAVKDDRDGNEFVISKSEWIPPSRTDSSEIPLDAL 176

Query: 1017 -TEQKPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAI 1193
              ++KP+ S+RF  RK                A P     +  AKPK+H +TL+ TWK I
Sbjct: 177  LIQEKPAPSSRFGHRKPV-------------KANPEGGRALKAAKPKRH-ETLENTWKMI 222

Query: 1194 TEGKGKPLTRHLRKSDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
            TEGK  PL+RHL+KSDTW    R I  E                  ++KSETF D
Sbjct: 223  TEGKSMPLSRHLKKSDTWENHGRDINME-----------ALTSSPLMKKSETFRD 266


>gb|AAC33277.1| cotton fiber expressed protein 2 [Gossypium hirsutum]
          Length = 275

 Score =  122 bits (306), Expect = 4e-25
 Identities = 93/295 (31%), Positives = 140/295 (47%), Gaps = 27/295 (9%)
 Frame = +3

Query: 555  TAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFI 734
            T I ++K+  +S GI+  VL  K+++P  L   +S  P  W   RSW  PPYLY+++N I
Sbjct: 6    TWILSLKVFLISTGILGIVLGLKISVPLVLEFSVSQAPLWWSGFRSWLKPPYLYVVINGI 65

Query: 735  IIGIVASSTFQHKLXXXXXXXXXXXEDEME-------IENRTQPQKQSDFDTVIDSSSIK 893
            II I ASS F               +D+ME       I    QP  + +  +  DS +++
Sbjct: 66   IITIAASSRFNQN---------NGEKDQMEQMQPRPKISADQQPMVEYETKSGWDSDAVE 116

Query: 894  TSSEIWAEN---EHIVSDYGEKSLVLA-------KKYVESETSCL----TDSNDI----- 1016
            +S  ++ EN   E + +   E+   +A        ++V S++  +    TDS++I     
Sbjct: 117  SSDFVYEENQRGEEVETRVSEEESNVAVEDDRDGNEFVISKSEWIPPSRTDSSEIPLDAL 176

Query: 1017 -TEQKPSVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAI 1193
              ++KP+ S+RF  RK                  P     + +AKPK+H +TL+ TWK I
Sbjct: 177  LIQEKPAPSSRFGHRKPV-------------KVNPEGGRALKVAKPKRH-ETLENTWKMI 222

Query: 1194 TEGKGKPLTRHLRKSDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
            TEGK  PL+RHL+KSDTW    R I  E                  ++KSETF D
Sbjct: 223  TEGKSMPLSRHLKKSDTWENHGRDINVE-----------ALTSSPLMKKSETFRD 266


>ref|XP_006841790.1| hypothetical protein AMTR_s00003p00267520 [Amborella trichopoda]
            gi|548843811|gb|ERN03465.1| hypothetical protein
            AMTR_s00003p00267520 [Amborella trichopoda]
          Length = 381

 Score =  121 bits (304), Expect = 6e-25
 Identities = 80/257 (31%), Positives = 129/257 (50%), Gaps = 3/257 (1%)
 Frame = +3

Query: 501  AMADFFFQNSSLSRKRFDTAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWI 680
            +++D  F       K   + I T+K+ FLS+ ++ST +  K ++P  +      +P L  
Sbjct: 56   SLSDSGFFGDIFRTKMAKSVILTLKVLFLSMAVVSTAIFLKFSLPVLMEFMAYEVPILCD 115

Query: 681  SVRSWFSPPYLYILVNFIIIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSD 860
             ++ W +PPYLYI++N III I ASS+FQ K            +   EI N         
Sbjct: 116  QIKGWLTPPYLYIVINCIIITIAASSSFQQK-PEENNQEPLLDQKRAEIRNDYGIPAIHS 174

Query: 861  FDTVIDSSSIKTSSEIWAENEHIVSDYGEKSLVLAKKYVESETSCLTDSND---ITEQKP 1031
             +  I +   + S EI A  +   ++  E  L   + +    +  + DS +   +TE+KP
Sbjct: 175  PEFEIPTEIKRPSFEITATKKSS-NEEEEDFLSRGEWWPTRRSFPMVDSLENSCVTEEKP 233

Query: 1032 SVSARFAVRKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGK 1211
             VS RF+ RKS             +++      G+G+++PK+H +TL++TWK IT+G+  
Sbjct: 234  PVSVRFSHRKS------------VKASPEGGRAGLGVSRPKRH-ETLESTWKTITDGRAI 280

Query: 1212 PLTRHLRKSDTWNVPPR 1262
            PL RHL+KSDTW    R
Sbjct: 281  PLARHLKKSDTWETHGR 297


>ref|XP_002273372.2| PREDICTED: uncharacterized protein LOC100244739 [Vitis vinifera]
          Length = 330

 Score =  120 bits (302), Expect = 1e-24
 Identities = 85/281 (30%), Positives = 136/281 (48%), Gaps = 13/281 (4%)
 Frame = +3

Query: 555  TAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFI 734
            T + ++K++ +SIG++S  ++ + +IP      +S +P +W S RSW  PPYLY+++N I
Sbjct: 10   TWLLSLKVALISIGVVSMAVILRQSIPVISEFAVSGVPVMWSSFRSWLKPPYLYVIINGI 69

Query: 735  IIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFDTVIDSSSIKTSSEIWA 914
            II I ASS F H++           + E E    T   K       +++  +    E+  
Sbjct: 70   IITIAASSKF-HRMNHQDRPETAAKDPEDERTEFTYEMKNPPEFVGLETPIVYEQREVTI 128

Query: 915  ENEHIVSD----YGEKSLVLAKK---------YVESETSCLTDSNDITEQKPSVSARFAV 1055
                 V+D      E+ LV+++           +  E+S +     ++ +KP VS+RF  
Sbjct: 129  SEVKNVADSPVVEDEEELVISRSTTPPLPPSPLLRRESSEIPLECLLSTEKPLVSSRFGH 188

Query: 1056 RKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRHLRK 1235
            RK            +  S + S    + ++KPK+H +TL+ TWK IT+G+  PLTRHL+K
Sbjct: 189  RK-----------PIKASPEASGGKVLRVSKPKRH-ETLENTWKMITDGRHMPLTRHLKK 236

Query: 1236 SDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
            SDTW    R I    D  P           + + KSETF D
Sbjct: 237  SDTWENHGRHIVVREDSSP-----------QRVNKSETFKD 266


>emb|CAN65070.1| hypothetical protein VITISV_003953 [Vitis vinifera]
          Length = 407

 Score =  118 bits (296), Expect = 5e-24
 Identities = 84/281 (29%), Positives = 134/281 (47%), Gaps = 13/281 (4%)
 Frame = +3

Query: 555  TAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFI 734
            T + ++K++ +SIG++S  ++ + +IP      +S +P  W S RSW  PPYLY+++N I
Sbjct: 87   TWLLSLKVALISIGVVSMAVILRQSIPVISEFAVSGVPVXWSSFRSWLKPPYLYVIINGI 146

Query: 735  IIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFDTVIDSSSIKTSSEIWA 914
            II I ASS F H++           + E E    T   K       +++  +    E+  
Sbjct: 147  IITIAASSKF-HRMNHQDRPETAAKDPEDERTEFTYEMKNPPEFVGLETPIVYEQREVTI 205

Query: 915  ENEHIVSD----YGEKSLVLAKK---------YVESETSCLTDSNDITEQKPSVSARFAV 1055
                 V+D      E+ L +++           +  E+S +     ++ +KP VS+RF  
Sbjct: 206  SEVKXVADSPVVEDEEELXISRSTTPPLPPSPLLRRESSEIPLECLLSTEKPLVSSRFGH 265

Query: 1056 RKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRHLRK 1235
            RK            +  S + S    + ++KPK+H +TL+ TWK IT+G+  PLTRHL+K
Sbjct: 266  RK-----------PIKASPEASGGKVLRVSKPKRH-ETLENTWKMITDGRHMPLTRHLKK 313

Query: 1236 SDTWNVPPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
            SDTW    R I    D  P           + + KSETF D
Sbjct: 314  SDTWENHGRHIVVREDSSP-----------QRVNKSETFKD 343


>ref|XP_007021270.1| Uncharacterized protein TCM_031316 [Theobroma cacao]
            gi|508720898|gb|EOY12795.1| Uncharacterized protein
            TCM_031316 [Theobroma cacao]
          Length = 318

 Score =  117 bits (294), Expect = 9e-24
 Identities = 85/275 (30%), Positives = 130/275 (47%), Gaps = 11/275 (4%)
 Frame = +3

Query: 567  TIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFIIIGI 746
            ++K + +S GIIS  + FK+++P    L  S +P  +  V S+  PPYLY+L+N III I
Sbjct: 7    SLKTALISTGIISIAIFFKVSLPLVSELLTSGIPSTYSLVLSFLRPPYLYLLINCIIISI 66

Query: 747  VASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQS-DFDTVIDSSSIKTSSEIWAENE 923
            VASS  QHK               +++ +     + S   DT       +  S +    E
Sbjct: 67   VASSKLQHKAETQQSPSPEIVLPAVKVSSEVYSSEYSYGSDTSARVVVAEDLSTVEESKE 126

Query: 924  HIVSDYGEKSLVLAKKYV----------ESETSCLTDSNDITEQKPSVSARFAVRKSSTE 1073
             +V D G++     K  +          ES    ++  N+   +KP VS RF  RK+   
Sbjct: 127  AVVVDGGDEEEEQVKVVMSLPPPPPARSESMELVMSLLNEKAGEKPPVSKRFGQRKAVKA 186

Query: 1074 TQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRHLRKSDTWNV 1253
                           S    + ++KPK+H DTL++TWK ITEG+  PLTRHL+KSDTW  
Sbjct: 187  A--------------SEGKALRVSKPKRH-DTLESTWKTITEGRPMPLTRHLKKSDTWE- 230

Query: 1254 PPRVIEPEPDLKPHXXXXXXXXXRRELRKSETFND 1358
              +  + +P+  P             ++KS+TFN+
Sbjct: 231  --QRAQKDPNAPP-------PPLPHTVKKSDTFNE 256


>emb|CBI23474.3| unnamed protein product [Vitis vinifera]
          Length = 304

 Score =  117 bits (294), Expect = 9e-24
 Identities = 84/280 (30%), Positives = 135/280 (48%), Gaps = 14/280 (5%)
 Frame = +3

Query: 555  TAIWTIKLSFLSIGIISTVLLFKLAIPYSLNLFISTLPRLWISVRSWFSPPYLYILVNFI 734
            T + ++K++ +SIG++S  ++ + +IP      +S +P +W S RSW  PPYLY+++N I
Sbjct: 10   TWLLSLKVALISIGVVSMAVILRQSIPVISEFAVSGVPVMWSSFRSWLKPPYLYVIINGI 69

Query: 735  IIGIVASSTFQHKLXXXXXXXXXXXEDEMEIENRTQPQKQSDFDTVIDSSSIKTSSEIWA 914
            II I ASS F H++           + E E    T   K       +++  +    E+  
Sbjct: 70   IITIAASSKF-HRMNHQDRPETAAKDPEDERTEFTYEMKNPPEFVGLETPIVYEQREVTI 128

Query: 915  ENEHIVSD----YGEKSLVLAKK---------YVESETSCLTDSNDITEQKPSVSARFAV 1055
                 V+D      E+ LV+++           +  E+S +     ++ +KP VS+RF  
Sbjct: 129  SEVKNVADSPVVEDEEELVISRSTTPPLPPSPLLRRESSEIPLECLLSTEKPLVSSRFGH 188

Query: 1056 RKSSTETQLRLGRSMARSAKPSNPLGIGIAKPKKHNDTLDATWKAITEGKGKPLTRHLRK 1235
            RK            +  S + S    + ++KPK+H +TL+ TWK IT+G+  PLTRHL+K
Sbjct: 189  RK-----------PIKASPEASGGKVLRVSKPKRH-ETLENTWKMITDGRHMPLTRHLKK 236

Query: 1236 SDTWNVPPRVIEPEPDLKPHXXXXXXXXXRREL-RKSETF 1352
            SDTW    R I    D  P          + EL R+ E F
Sbjct: 237  SDTWENHGRHIVVREDSSPQRLRKDPSLSQDELNRRVEAF 276

