BLASTX nr result

ID: Catharanthus22_contig00013312 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00013312
         (820 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004245868.1| PREDICTED: uncharacterized protein LOC101261...   308   1e-81
ref|XP_006358582.1| PREDICTED: uncharacterized protein LOC102600...   305   1e-80
ref|XP_002282079.1| PREDICTED: uncharacterized protein LOC100243...   301   2e-79
gb|EOY24083.1| Uncharacterized protein isoform 1 [Theobroma cacao]    294   2e-77
ref|XP_006485398.1| PREDICTED: uncharacterized protein LOC102629...   291   1e-76
ref|XP_006436791.1| hypothetical protein CICLE_v10031769mg [Citr...   291   1e-76
ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago ...   273   7e-71
ref|XP_004511767.1| PREDICTED: uncharacterized protein LOC101498...   272   9e-71
ref|XP_002528762.1| conserved hypothetical protein [Ricinus comm...   272   9e-71
gb|EOY24085.1| Uncharacterized protein isoform 3 [Theobroma cacao]    271   2e-70
ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784...   271   2e-70
ref|XP_002326500.1| predicted protein [Populus trichocarpa]           270   5e-70
ref|XP_006436792.1| hypothetical protein CICLE_v10031769mg [Citr...   268   1e-69
ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779...   268   2e-69
ref|XP_006368453.1| hypothetical protein POPTR_0001s02940g [Popu...   266   7e-69
gb|ESW28629.1| hypothetical protein PHAVU_002G004700g [Phaseolus...   265   1e-68
gb|EOY24084.1| Uncharacterized protein isoform 2 [Theobroma cacao]    263   6e-68
ref|XP_004148518.1| PREDICTED: uncharacterized protein LOC101208...   263   7e-68
gb|EMJ12217.1| hypothetical protein PRUPE_ppa020378mg [Prunus pe...   259   8e-67
ref|XP_004301705.1| PREDICTED: uncharacterized protein LOC101313...   242   1e-61

>ref|XP_004245868.1| PREDICTED: uncharacterized protein LOC101261143 [Solanum
           lycopersicum]
          Length = 398

 Score =  308 bits (790), Expect = 1e-81
 Identities = 143/270 (52%), Positives = 195/270 (72%)
 Frame = +2

Query: 8   GRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVMENH 187
           G LC+ED +IN   K LSKL+EG +C+ +AQY C+GTG  WV+ ++LW  +++ K+M+ +
Sbjct: 125 GNLCVEDSNINETAKKLSKLVEGLLCEEHAQYSCTGTGTIWVQGNQLWEKVNESKIMDEY 184

Query: 188 GLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKHXX 367
           GL+E  Y HA  RA+EA RK+L+TR+N    EE  CP LLV  Y P+SC I++WIL+H  
Sbjct: 185 GLNEAVYAHAMKRAMEALRKVLETRLNDHGIEELKCPPLLVLHYTPVSCRIQRWILEHAL 244

Query: 368 XXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWVVA 547
                     GC+   LK+RRR++LSV+AE +YN+ CD+LEE A+ +R + G+ E WVVA
Sbjct: 245 LLVPACALLLGCVFTLLKLRRRYHLSVKAEHIYNEACDVLEEKAMSARSMTGEHEPWVVA 304

Query: 548 SWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSSGK 727
           S LRD++L+PKERKDP LW+KVE+ V+EDSRL+RYPKM+KGE KVVWEWQVEGS+SSSGK
Sbjct: 305 SLLRDHLLSPKERKDPMLWKKVEQLVQEDSRLERYPKMVKGECKVVWEWQVEGSLSSSGK 364

Query: 728 IKNAEKTKWNSGGTMNSTANQQHWEPKDAE 817
            K A++ +  SG   N +  Q++W  K  E
Sbjct: 365 RKKAKEIRLASGQHTNLSTQQRNWPWKAKE 394


>ref|XP_006358582.1| PREDICTED: uncharacterized protein LOC102600075 [Solanum tuberosum]
          Length = 397

 Score =  305 bits (781), Expect = 1e-80
 Identities = 143/270 (52%), Positives = 191/270 (70%)
 Frame = +2

Query: 8   GRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVMENH 187
           G LC+ED  IN A K LSKL+EG +C+ + QY C+GTG  WV+ ++LW  +++ K+M+ +
Sbjct: 124 GNLCVEDSSINEAAKKLSKLVEGLLCEGHTQYSCTGTGTVWVQGNQLWEKVNESKIMDEY 183

Query: 188 GLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKHXX 367
           GL E  Y HA  RA+EA RK+L+TR+N    EE  CP LLV  Y P+SC I+QWIL H  
Sbjct: 184 GLSEAVYAHAMKRAMEALRKVLETRLNDHGIEELKCPPLLVLHYTPVSCRIQQWILDHAL 243

Query: 368 XXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWVVA 547
                     GC+   LK RRR+ LSV+AE++YN+ CD+LEE A+ +R + G+ E WVVA
Sbjct: 244 LLVPACALLLGCVFTLLKFRRRYYLSVKAEQIYNEACDVLEEKAVSARSMTGEHEPWVVA 303

Query: 548 SWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSSGK 727
           S LRD++L+PKERKDP LW+KVE+ V+EDSRL+RYPKM+KGE KVVWEWQVEGS+SSSGK
Sbjct: 304 SLLRDHLLSPKERKDPMLWKKVEQLVQEDSRLERYPKMVKGECKVVWEWQVEGSLSSSGK 363

Query: 728 IKNAEKTKWNSGGTMNSTANQQHWEPKDAE 817
            K A++ +  SG   + +  Q++W  K  E
Sbjct: 364 RKKAKEIRLASGQHTDLSPQQRNWPWKAKE 393


>ref|XP_002282079.1| PREDICTED: uncharacterized protein LOC100243743 [Vitis vinifera]
           gi|297742158|emb|CBI33945.3| unnamed protein product
           [Vitis vinifera]
          Length = 383

 Score =  301 bits (770), Expect = 2e-79
 Identities = 147/260 (56%), Positives = 185/260 (71%), Gaps = 1/260 (0%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHG+LCIED DIN   K L+  IE  VC+ YAQ+LC GTG+ WV++DE+W  +D+ K+ME
Sbjct: 110 KHGKLCIEDGDINETAKKLANRIETHVCEGYAQFLC-GTGSVWVQEDEVWNDVDELKMME 168

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N GL+    MH K RA+E    LL+T+IN    +E  CP LL + YKP SC ++QWI  H
Sbjct: 169 NLGLENAIDMHTKQRAMEMIDGLLETKINHRGIKELKCPNLLAEHYKPFSCRVQQWISNH 228

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G I++  ++R+R NLS RAEELYNQICDILEENA+ ++G +G+GE WV
Sbjct: 229 ALVLMPICGLLVGSILLLRRIRQRRNLSARAEELYNQICDILEENAMMTKGGDGEGEPWV 288

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           V SWLRD++L PKERKDP LW KVEE V+EDSRLDRYPK++KGESKVVWEWQVEGS+SS 
Sbjct: 289 VVSWLRDHLLLPKERKDPLLWRKVEELVQEDSRLDRYPKLVKGESKVVWEWQVEGSLSSR 348

Query: 722 -GKIKNAEKTKWNSGGTMNS 778
             K + A K K + G  +NS
Sbjct: 349 LRKKREASKLKPSGGTNINS 368


>gb|EOY24083.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 388

 Score =  294 bits (753), Expect = 2e-77
 Identities = 141/269 (52%), Positives = 184/269 (68%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           +HG+LC+ED+DIN   K  SK +E R+C+AYAQ LC GT   W R+ ++W  LD H++M+
Sbjct: 112 RHGKLCVEDKDINETAKKFSKWLEVRLCEAYAQSLCYGTVTVWAREHDIWNDLDGHELMQ 171

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N G D  TY++AK R +E   KLL+TRIN+   +E  CP+ L + YKP +C IRQ I  H
Sbjct: 172 NFGPDNATYLYAKRRVMETIVKLLETRINSHGIQEVKCPDSLAEYYKPFTCRIRQLISNH 231

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G  ++   V ++  LS R EELY+Q+CD+LEE ALRS+ VNG GE WV
Sbjct: 232 ALIIVPVCAGLVGFAMLFWNVHQKRCLSARVEELYHQVCDMLEEKALRSKSVNGGGESWV 291

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VASWLRD++L P+ERKDP LW+KVEE V+EDSR+DRYPK++KGESKVVWEWQVEGS+SSS
Sbjct: 292 VASWLRDHLLFPRERKDPHLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQVEGSLSSS 351

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWEPK 808
              K  E+    S G +N+  NQ   + K
Sbjct: 352 RMRKKGEEVTLKSVGGINTNLNQSDHKVK 380


>ref|XP_006485398.1| PREDICTED: uncharacterized protein LOC102629601 isoform X1 [Citrus
           sinensis]
          Length = 396

 Score =  291 bits (746), Expect = 1e-76
 Identities = 137/272 (50%), Positives = 186/272 (68%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHG+LC+ED DIN     LS+ +E R+C+AYAQ+LC GTG+ WV ++++W  L+ H++M+
Sbjct: 115 KHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMK 174

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
              LD   Y++ K R +E   + L++R N+   +E  CPELL + YKP+SC I QW+  H
Sbjct: 175 IFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTH 234

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       GC+++  KV RR   ++R EELY+Q+C+ILEENAL S+ VNG+ E WV
Sbjct: 235 ALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWV 294

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L PKERKDP +W+KVEE V+EDSR+D+YPK++KGESKVVWEWQVEGS+SSS
Sbjct: 295 VASRLRDHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQVEGSLSSS 354

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWEPKDAE 817
              K  E +KW S    +    QQ   P  AE
Sbjct: 355 KMRKKGEASKWRSAEGRDMKFGQQQ-SPLKAE 385


>ref|XP_006436791.1| hypothetical protein CICLE_v10031769mg [Citrus clementina]
           gi|568863995|ref|XP_006485399.1| PREDICTED:
           uncharacterized protein LOC102629601 isoform X2 [Citrus
           sinensis] gi|557538987|gb|ESR50031.1| hypothetical
           protein CICLE_v10031769mg [Citrus clementina]
          Length = 391

 Score =  291 bits (746), Expect = 1e-76
 Identities = 137/272 (50%), Positives = 186/272 (68%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHG+LC+ED DIN     LS+ +E R+C+AYAQ+LC GTG+ WV ++++W  L+ H++M+
Sbjct: 115 KHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMK 174

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
              LD   Y++ K R +E   + L++R N+   +E  CPELL + YKP+SC I QW+  H
Sbjct: 175 IFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTH 234

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       GC+++  KV RR   ++R EELY+Q+C+ILEENAL S+ VNG+ E WV
Sbjct: 235 ALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWV 294

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L PKERKDP +W+KVEE V+EDSR+D+YPK++KGESKVVWEWQVEGS+SSS
Sbjct: 295 VASRLRDHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQVEGSLSSS 354

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWEPKDAE 817
              K  E +KW S    +    QQ   P  AE
Sbjct: 355 KMRKKGEASKWRSAEGRDMKFGQQQ-SPLKAE 385


>ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago truncatula]
           gi|355512772|gb|AES94395.1| hypothetical protein
           MTR_5g014010 [Medicago truncatula]
          Length = 374

 Score =  273 bits (697), Expect = 7e-71
 Identities = 130/261 (49%), Positives = 179/261 (68%), Gaps = 2/261 (0%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHG LC+ED DIN + + ++  +E  +C  YAQ+LCSGTG+ WV  D+LW Y++    +E
Sbjct: 102 KHGNLCVEDGDINDSARKIADTVERHLCGEYAQFLCSGTGSIWVHDDDLWNYIEP---VE 158

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N       Y + K +A +   KLL+ R+     +EF CP+ LV+QYKP +C +RQWI +H
Sbjct: 159 NVKEGNALYNYTKQKAFDMMDKLLEMRLTTHGMKEFKCPDSLVEQYKPYACRLRQWITQH 218

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       GC+++   VRR+  +S R EELYN++C+ILEENAL S+ VNG+ E WV
Sbjct: 219 ILVVLPICAMLVGCMILFWNVRRKLRVSRRVEELYNKVCEILEENALTSKSVNGECEPWV 278

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L P+ERKDP LW+KVEE V+EDSR+DRYPK++KGESKVVWEWQVEGS+S++
Sbjct: 279 VASRLRDHLLLPRERKDPLLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQVEGSLSAT 338

Query: 722 GKI--KNAEKTKWNSGGTMNS 778
             +  ++A KT  N    +NS
Sbjct: 339 KMLTKRDASKTMVNRNTELNS 359


>ref|XP_004511767.1| PREDICTED: uncharacterized protein LOC101498686 [Cicer arietinum]
          Length = 391

 Score =  272 bits (696), Expect = 9e-71
 Identities = 130/260 (50%), Positives = 176/260 (67%), Gaps = 2/260 (0%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHG LC+ED DIN + + + + +E  +C  YAQYLCSGTG+ WV  D+LW Y +    + 
Sbjct: 119 KHGNLCVEDGDINESARKIVEKVEHHLCGEYAQYLCSGTGSIWVHDDDLWNYFEP---VG 175

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N   D   Y + K +A +   KLL+ R+N+   +EF CP+LLV+ YK  +C  RQWI +H
Sbjct: 176 NVKEDNALYKYTKQKAFDTMDKLLEMRLNSHGMKEFKCPDLLVEHYKSYACRFRQWITQH 235

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       GC ++    RR+  +S R EELYN++C+ILEENAL S+ VNG+ E WV
Sbjct: 236 IIVVLPICAMLVGCTILFTNARRKLRMSRRVEELYNKVCEILEENALTSKSVNGECEPWV 295

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L P+ERKDP LW+KVEE V+EDSR+DRYPK++KGESKVVWEWQVEGS+S+S
Sbjct: 296 VASRLRDHLLLPRERKDPLLWKKVEELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSAS 355

Query: 722 GKI--KNAEKTKWNSGGTMN 775
             +  ++A KT+ N    +N
Sbjct: 356 KMMTKRDASKTRINGNVDLN 375


>ref|XP_002528762.1| conserved hypothetical protein [Ricinus communis]
           gi|223531765|gb|EEF33584.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 373

 Score =  272 bits (696), Expect = 9e-71
 Identities = 129/253 (50%), Positives = 176/253 (69%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KH  +CIED DIN   K LS+ +E  +C+AYAQYLC G G  W + +++W  LD H++ME
Sbjct: 102 KHRNICIEDGDINERAKKLSEWVENHLCEAYAQYLCDGIGTIWFQDNDIWYDLDGHQLME 161

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N   D  TY++AK +A+E   +LL+ R N+   +E  CP+L+ + YKP +C  RQWI  H
Sbjct: 162 NFQPDNATYIYAKRKAMEMIVRLLEIRTNSHGNKELKCPDLVAEHYKPFTCRFRQWISNH 221

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G +++  K++RR  LS R EELY+Q+C++LEENAL S+  NG+ + WV
Sbjct: 222 AFVIASLCSLVVGAVLLLRKLQRRWYLSARGEELYHQVCEVLEENALMSKQSNGECDSWV 281

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L PKERKDP LW++VE+ V+EDSR+DRYPK++KGESKVVWEWQVEGS  SS
Sbjct: 282 VASQLRDHLLLPKERKDPVLWKRVEQLVQEDSRVDRYPKLVKGESKVVWEWQVEGS-WSS 340

Query: 722 GKIKNAEKTKWNS 760
           G+I+  E +K  S
Sbjct: 341 GRIRKKEASKLKS 353


>gb|EOY24085.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 350

 Score =  271 bits (694), Expect = 2e-70
 Identities = 126/232 (54%), Positives = 164/232 (70%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           +HG+LC+ED+DIN   K  SK +E R+C+AYAQ LC GT   W R+ ++W  LD H++M+
Sbjct: 112 RHGKLCVEDKDINETAKKFSKWLEVRLCEAYAQSLCYGTVTVWAREHDIWNDLDGHELMQ 171

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N G D  TY++AK R +E   KLL+TRIN+   +E  CP+ L + YKP +C IRQ I  H
Sbjct: 172 NFGPDNATYLYAKRRVMETIVKLLETRINSHGIQEVKCPDSLAEYYKPFTCRIRQLISNH 231

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G  ++   V ++  LS R EELY+Q+CD+LEE ALRS+ VNG GE WV
Sbjct: 232 ALIIVPVCAGLVGFAMLFWNVHQKRCLSARVEELYHQVCDMLEEKALRSKSVNGGGESWV 291

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQ 697
           VASWLRD++L P+ERKDP LW+KVEE V+EDSR+DRYPK++KGESKVVWEWQ
Sbjct: 292 VASWLRDHLLFPRERKDPHLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQ 343


>ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784375 isoform X1 [Glycine
           max]
          Length = 381

 Score =  271 bits (694), Expect = 2e-70
 Identities = 131/268 (48%), Positives = 183/268 (68%), Gaps = 3/268 (1%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           +HG LC ED DIN + + L + +E  +C+ YAQ+LC+GTG  WV +D+LW Y +    + 
Sbjct: 108 RHGNLCAEDGDINESARKLLERVEHHLCEKYAQFLCTGTGIIWVHEDDLWNYFEP---VG 164

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGR-FEEFNCPELLVKQYKPISCCIRQWILK 358
           N  +D   Y + K RA+E   KLL+TR+N+    +EF CP+ L + YKP +CCIRQWI +
Sbjct: 165 NVKVDNALYNYTKQRAVETMGKLLETRLNSSHGMKEFKCPDQLAEHYKPYTCCIRQWISQ 224

Query: 359 HXXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVW 538
           H            GC  +   VR++ ++S R EELY+++C+ILE+NAL S+  NG+ E W
Sbjct: 225 HILVVLPICAMLVGCTALCWNVRQKLSMSRRVEELYDKVCEILEDNALTSKSANGECEPW 284

Query: 539 VVASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISS 718
           VVAS LRD++L P+ERK+P LW+K+EE V+EDSR+DRYPK++KGESKVVWEWQVEGS+S+
Sbjct: 285 VVASRLRDHLLLPRERKNPLLWKKLEELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSA 344

Query: 719 S--GKIKNAEKTKWNSGGTMNSTANQQH 796
           S   K ++A KT  N    +N   +QQH
Sbjct: 345 SKMKKRRDASKTMVNESTDLN---HQQH 369


>ref|XP_002326500.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  270 bits (690), Expect = 5e-70
 Identities = 131/272 (48%), Positives = 178/272 (65%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KH   CIED D+    K L + +E  +C+AYA +LC GTG  WV++D++   LD H+++E
Sbjct: 108 KHRNTCIEDGDVYERAKKLLEGVENHLCEAYADFLCYGTGIMWVQEDDILNDLDGHQLLE 167

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N+  D   Y++ K++A+E   + L+TR N    +EF CP+LLV+ YKP +C +RQWI +H
Sbjct: 168 NYSSDNPVYVYTKMKAMETISEELQTRTNPNGKKEFKCPDLLVEHYKPFTCHLRQWISEH 227

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G   +  K+RRR  LS R EELY+Q+CDILEE AL S+ VN + E WV
Sbjct: 228 ALVIVPVCALVVGFAFLVWKIRRRWYLSTRGEELYHQVCDILEERALMSKRVNAECEPWV 287

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L+PKERKD  LW+KVE+ V EDSR+DRYPK++KGESKVVWEWQVEGS+SS 
Sbjct: 288 VASRLRDHLLSPKERKDFVLWKKVEDLVREDSRVDRYPKLVKGESKVVWEWQVEGSLSSG 347

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWEPKDAE 817
              K  E +K  S   +    +++  E K  E
Sbjct: 348 RMRKKVESSKLKSNDGVKENFDKERHELKPGE 379


>ref|XP_006436792.1| hypothetical protein CICLE_v10031769mg [Citrus clementina]
           gi|568863997|ref|XP_006485400.1| PREDICTED:
           uncharacterized protein LOC102629601 isoform X3 [Citrus
           sinensis] gi|557538988|gb|ESR50032.1| hypothetical
           protein CICLE_v10031769mg [Citrus clementina]
          Length = 359

 Score =  268 bits (686), Expect = 1e-69
 Identities = 120/232 (51%), Positives = 166/232 (71%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHG+LC+ED DIN     LS+ +E R+C+AYAQ+LC GTG+ WV ++++W  L+ H++M+
Sbjct: 115 KHGKLCVEDGDINETAGRLSRWVENRLCRAYAQFLCDGTGSIWVEENDIWNDLEGHELMK 174

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
              LD   Y++ K R +E   + L++R N+   +E  CPELL + YKP+SC I QW+  H
Sbjct: 175 IFELDNPVYLYTKKRTMETVGRYLESRTNSYGMKELKCPELLAEHYKPLSCRIHQWVSTH 234

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       GC+++  KV RR   ++R EELY+Q+C+ILEENAL S+ VNG+ E WV
Sbjct: 235 ALIIVPVCSLLVGCLLLLWKVHRRRYFAIRVEELYHQVCEILEENALMSKSVNGECEPWV 294

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQ 697
           VAS LRD++L PKERKDP +W+KVEE V+EDSR+D+YPK++KGESKVVWEWQ
Sbjct: 295 VASRLRDHLLLPKERKDPVIWKKVEELVQEDSRVDQYPKLLKGESKVVWEWQ 346


>ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779650 [Glycine max]
          Length = 377

 Score =  268 bits (684), Expect = 2e-69
 Identities = 128/260 (49%), Positives = 177/260 (68%), Gaps = 2/260 (0%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           +HG LC+ED DIN + + L + +E  +C+ YAQ+LC+GTG  WVR+D+LW Y +    + 
Sbjct: 103 RHGNLCVEDGDINESARKLLERVEHHLCEEYAQFLCTGTGTIWVREDDLWNYFEP---VG 159

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGR-FEEFNCPELLVKQYKPISCCIRQWILK 358
           N  +D   Y + K +A E   KLL TR+N+    +EF CP+ L + YK  +CCIRQWI +
Sbjct: 160 NVKVDNALYKYTKQKAFETMGKLLDTRLNSSHGMKEFKCPDQLAEHYKSYACCIRQWISQ 219

Query: 359 HXXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVW 538
           H            GC  +   VR++  +S R EELYN++C+ILEENAL S+  NG+ E W
Sbjct: 220 HILVVLPICAMLVGCTALFWSVRQKLCMSRRIEELYNKVCEILEENALTSKSANGECEPW 279

Query: 539 VVASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSIS- 715
           VV+S LRD++L P+ERK+P LW+KVE+ V+EDSR+DRYPK++KGESKVVWEWQVEGS+S 
Sbjct: 280 VVSSRLRDHLLLPRERKNPLLWKKVEKMVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSF 339

Query: 716 SSGKIKNAEKTKWNSGGTMN 775
           S  K ++A KT+ N    +N
Sbjct: 340 SKMKRRDASKTRVNESTDLN 359


>ref|XP_006368453.1| hypothetical protein POPTR_0001s02940g [Populus trichocarpa]
           gi|550346367|gb|ERP65022.1| hypothetical protein
           POPTR_0001s02940g [Populus trichocarpa]
          Length = 384

 Score =  266 bits (680), Expect = 7e-69
 Identities = 129/269 (47%), Positives = 176/269 (65%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KH   CIED D+    K L + +E  +C+AYA +LC GTG  WV++D++   LD H++++
Sbjct: 108 KHRNTCIEDGDVYERAKKLLEGVENHLCEAYADFLCYGTGIMWVQEDDILNDLDGHQLLK 167

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N+  D   Y + K++A+E   + L+TR N    +EF CP+LLV+ YKP +C +RQWI +H
Sbjct: 168 NYSSDNPVYAYTKMKAMETISEELQTRTNPNGKKEFKCPDLLVEHYKPFTCHLRQWISEH 227

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G   +  K+RRR  LS R EELY+Q+CDILEE AL S+ VN + E WV
Sbjct: 228 ALVIVPVCALVVGFAFLVWKIRRRWYLSTRGEELYHQVCDILEERALMSKRVNAECEPWV 287

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L+PKERKD  LW+KVE+ V EDSR+DRYPK++KGESKVVWEWQVEGS+SS 
Sbjct: 288 VASRLRDHLLSPKERKDFVLWKKVEDLVREDSRVDRYPKLVKGESKVVWEWQVEGSLSSG 347

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWEPK 808
              K  E +K  S   +    +++  E K
Sbjct: 348 RMRKKVESSKLKSNDGVKENFDKERHELK 376


>gb|ESW28629.1| hypothetical protein PHAVU_002G004700g [Phaseolus vulgaris]
          Length = 383

 Score =  265 bits (678), Expect = 1e-68
 Identities = 130/272 (47%), Positives = 181/272 (66%), Gaps = 3/272 (1%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           +HG LC+ED DI+ + + + + +E  +C+ YAQ+LCSGTG  WV +D LW +      +E
Sbjct: 110 RHGNLCVEDGDISQSARKIVERVERHLCEGYAQFLCSGTGPMWVPEDVLWNHFQP---VE 166

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGR-FEEFNCPELLVKQYKPISCCIRQWILK 358
           N  +D   + + K RA+E   KLL+TR+N     +EF CP+LL   YKP +CCIRQW+ +
Sbjct: 167 NVKVDNALHNYTKQRAVETMGKLLETRLNNSHGMKEFKCPDLLAVHYKPYTCCIRQWVSQ 226

Query: 359 HXXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVW 538
           H            GCI +   +RR+ ++S R EELY+++C+ILE+NAL S+  NG+ E W
Sbjct: 227 HILVVLPICAMLVGCITLFWSIRRKLSMSRRVEELYDKVCEILEDNALTSKSANGECEPW 286

Query: 539 VVASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISS 718
            VAS LRD++L P+ERK+P LW KVEE V+EDSR+D YPK++KGESKVVWEWQVEGS+S 
Sbjct: 287 FVASRLRDHLLLPRERKNPLLWRKVEELVQEDSRIDCYPKLVKGESKVVWEWQVEGSLSV 346

Query: 719 S--GKIKNAEKTKWNSGGTMNSTANQQHWEPK 808
           S   K ++A KT+ N    +N    +QH E K
Sbjct: 347 SKMKKRRDASKTRINESMDLN---QRQHPEVK 375


>gb|EOY24084.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 378

 Score =  263 bits (672), Expect = 6e-68
 Identities = 131/269 (48%), Positives = 174/269 (64%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           +HG+LC+ED+DIN   K  SK +E R+C+AYAQ LC GT   W R+ ++W  LD H++M+
Sbjct: 112 RHGKLCVEDKDINETAKKFSKWLEVRLCEAYAQSLCYGTVTVWAREHDIWNDLDGHELMQ 171

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           N G D  TY++AK R +E   KLL+TRIN+   +E  CP+ L + YKP +C IRQ I  H
Sbjct: 172 NFGPDNATYLYAKRRVMETIVKLLETRINSHGIQEVKCPDSLAEYYKPFTCRIRQLISNH 231

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G  ++   V ++  LS R EELY+Q+CD+LEE ALRS+ VNG GE WV
Sbjct: 232 ALIIVPVCAGLVGFAMLFWNVHQKRCLSARVEELYHQVCDMLEEKALRSKSVNGGGESWV 291

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VASWLRD++L P+ERKDP LW+KVEE V+EDSR+DRYPK++K          VEGS+SSS
Sbjct: 292 VASWLRDHLLFPRERKDPHLWKKVEELVQEDSRVDRYPKLVK----------VEGSLSSS 341

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWEPK 808
              K  E+    S G +N+  NQ   + K
Sbjct: 342 RMRKKGEEVTLKSVGGINTNLNQSDHKVK 370


>ref|XP_004148518.1| PREDICTED: uncharacterized protein LOC101208017 [Cucumis sativus]
          Length = 404

 Score =  263 bits (671), Expect = 7e-68
 Identities = 129/267 (48%), Positives = 178/267 (66%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           KHGRLCIED  IN A   LS+ +E  +C+A A++LC G G  WV+++++W  LD  +++E
Sbjct: 124 KHGRLCIEDGVINEAVNKLSEWLESHLCEANAKFLCDGIGIVWVKENDIWDDLDGKELVE 183

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           + G D  T M+AK +A+E    LL+TR N+   +E  CP+LL + YKP +C IR W+L+H
Sbjct: 184 SIGSDNTTLMYAKSKALETIGGLLQTRQNSLGIKELKCPDLLAESYKPFTCRIRHWVLQH 243

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       GC  +  K+ RR  L+ RAE+LYNQ+C+ILEENAL S   +G  E WV
Sbjct: 244 AFVVLPVFLLLVGCTWLLWKLYRRQYLTNRAEDLYNQVCEILEENALTSTRNSGQCESWV 303

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD++L P+ER++P LW+KVEE V+EDSR+DRYP+++KG+ K VWEWQVEGS+SSS
Sbjct: 304 VASRLRDHLLLPRERRNPLLWKKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS 363

Query: 722 GKIKNAEKTKWNSGGTMNSTANQQHWE 802
            K K A K+        NS +    W+
Sbjct: 364 MKKKLASKS--------NSASKSNFWK 382


>gb|EMJ12217.1| hypothetical protein PRUPE_ppa020378mg [Prunus persica]
          Length = 380

 Score =  259 bits (662), Expect = 8e-67
 Identities = 126/253 (49%), Positives = 168/253 (66%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           K G+LCIED DIN   K L++ +E R+C A AQ+LC GT   WV ++++W  LDK +++E
Sbjct: 110 KRGKLCIEDGDINETAKKLAERVEIRLCGALAQFLCYGTETIWVEENDIWNDLDKRELLE 169

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
            H  D   YM+ K R +E   ++L TR ++   +E  CP++L + YKP SC IRQWI +H
Sbjct: 170 -HVPDNAIYMYTKERTMETVNRMLDTRTSSRGVKELKCPDMLAEHYKPFSCRIRQWISEH 228

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G   I  K+ RR  LS R +ELY Q+C++LEE A  S+ VN + E WV
Sbjct: 229 ALLILRVCALLVGSTFILWKLHRRRCLSTRVDELYQQVCEVLEEKAFMSKSVNSECEPWV 288

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD +L PKERKDP LW+KVEE V+EDS +D YPK++KGESKVVWEWQVEGS+SSS
Sbjct: 289 VASRLRDRLLLPKERKDPVLWKKVEELVQEDSHVDCYPKLVKGESKVVWEWQVEGSLSSS 348

Query: 722 GKIKNAEKTKWNS 760
            +++  E +K  S
Sbjct: 349 RRMRRGEDSKLKS 361


>ref|XP_004301705.1| PREDICTED: uncharacterized protein LOC101313902 [Fragaria vesca
           subsp. vesca]
          Length = 384

 Score =  242 bits (618), Expect = 1e-61
 Identities = 124/266 (46%), Positives = 168/266 (63%), Gaps = 1/266 (0%)
 Frame = +2

Query: 2   KHGRLCIEDRDINAAEKTLSKLIEGRVCQAYAQYLCSGTGASWVRKDELWTYLDKHKVME 181
           K G+LC+ED  I      L++++E R+C+AYA+YLC G G  WV ++++W   DK+++ E
Sbjct: 102 KRGKLCVEDGVIQDTATKLAEMVEIRLCEAYAEYLCHGIGTIWVEQNDIWNDFDKNELAE 161

Query: 182 NHGLDEITYMHAKLRAIEASRKLLKTRINAGRFEEFNCPELLVKQYKPISCCIRQWILKH 361
           + G D   +M+AK RA+E    +L  R N+   +EF CP++L + YKP SC I QWI +H
Sbjct: 162 HVGSDNAIFMYAKQRAMEMISGMLDRRTNSLGVQEFKCPDMLAEHYKPSSCRISQWISEH 221

Query: 362 XXXXXXXXXXXXGCIVITLKVRRRHNLSVRAEELYNQICDILEENALRSRGVNGDGEVWV 541
                       G  ++  K  +R  LS R +E+Y QIC+ LEE AL +R  N   E WV
Sbjct: 222 ALLIFPVCAALLGITLLLRKFHQRQYLSTRVDEVYQQICEELEEKALMNR-ANEKCEPWV 280

Query: 542 VASWLRDYVLTPKERKDPFLWEKVEEFVEEDSRLDRYPKMIKGESKVVWEWQVEGSISSS 721
           VAS LRD +L+ KERK+P LW+KVEE V EDSR+D YP ++KGESKVVWEWQVEGS+SSS
Sbjct: 281 VASTLRDSLLSLKERKNPELWKKVEELVREDSRMDCYPTLVKGESKVVWEWQVEGSLSSS 340

Query: 722 GKIKNAEKTKWNSG-GTMNSTANQQH 796
            K +  E +K  S  GT  S     H
Sbjct: 341 RKSRKGEASKLKSSKGTERSPDQHLH 366


Top