BLASTX nr result

ID: Sinomenium22_contig00008927 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00008927
         (2041 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007209905.1| hypothetical protein PRUPE_ppa004205mg [Prun...   336   2e-89
ref|XP_004299321.1| PREDICTED: uncharacterized protein LOC101293...   320   1e-84
ref|XP_002281450.1| PREDICTED: uncharacterized protein LOC100263...   301   1e-78
ref|XP_002274465.2| PREDICTED: uncharacterized protein LOC100250...   300   2e-78
emb|CAN60165.1| hypothetical protein VITISV_040087 [Vitis vinifera]   299   4e-78
ref|XP_003522999.1| PREDICTED: uncharacterized protein LOC100793...   298   8e-78
ref|XP_006483051.1| PREDICTED: uncharacterized protein LOC102614...   290   1e-75
gb|AHB59599.1| putative MYB-related protein 12 [Arachis hypogaea]     289   4e-75
ref|XP_006438810.1| hypothetical protein CICLE_v10031172mg [Citr...   286   2e-74
ref|XP_007138262.1| hypothetical protein PHAVU_009G193800g [Phas...   285   6e-74
ref|XP_004499488.1| PREDICTED: uncharacterized protein LOC101494...   285   6e-74
ref|XP_007037501.1| Uncharacterized protein isoform 1 [Theobroma...   283   2e-73
ref|XP_007037503.1| Uncharacterized protein isoform 3 [Theobroma...   281   8e-73
ref|XP_004241596.1| PREDICTED: uncharacterized protein LOC101258...   281   1e-72
ref|XP_006354761.1| PREDICTED: uncharacterized protein LOC102579...   280   2e-72
ref|XP_007032692.1| Uncharacterized protein TCM_018715 [Theobrom...   278   5e-72
ref|XP_007045913.1| Uncharacterized protein isoform 2 [Theobroma...   278   5e-72
ref|XP_007045912.1| Uncharacterized protein isoform 1 [Theobroma...   278   5e-72
ref|XP_004138186.1| PREDICTED: uncharacterized protein LOC101205...   276   3e-71
ref|XP_006431271.1| hypothetical protein CICLE_v10011783mg [Citr...   275   8e-71

>ref|XP_007209905.1| hypothetical protein PRUPE_ppa004205mg [Prunus persica]
            gi|462405640|gb|EMJ11104.1| hypothetical protein
            PRUPE_ppa004205mg [Prunus persica]
          Length = 523

 Score =  336 bits (862), Expect = 2e-89
 Identities = 185/438 (42%), Positives = 250/438 (57%), Gaps = 7/438 (1%)
 Frame = +3

Query: 201  PFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVY 380
            P  F PE   R L + +D+YSF  D PPRK VSIG +HQA+VP+WG +G  NNS+ LD  
Sbjct: 111  PDCFNPERPIRTLAQSEDIYSFLLDHPPRKSVSIGPEHQAEVPLWGAQGNNNNSNNLDTS 170

Query: 381  DPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSI 560
            +        + SN  L+D+  + ++GTCV+PMP S+     G   G  RTDC C D  S+
Sbjct: 171  E--------AVSNSDLEDE--KRLMGTCVIPMPDSDLSADTGCIAGIGRTDCSCEDEDSV 220

Query: 561  RCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGK 740
            RCVRQHI EAREKL +++G +RF  LGF DMGE VA++W+EE+E++FH+VV S+PASLGK
Sbjct: 221  RCVRQHILEAREKLIKTIGPKRFEELGFSDMGEQVAQRWSEEEEQLFHQVVFSNPASLGK 280

Query: 741  NFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ-XXXXXXXXX 917
            NFWD+L  VFPSRTK E+VSYYFN F+L +RA QNR DP+N+DSDNDEWQ          
Sbjct: 281  NFWDNLSTVFPSRTKKEIVSYYFNVFMLVKRAGQNRYDPINVDSDNDEWQGSNDYGDNQL 340

Query: 918  XXXXXXXXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLENK 1097
                     ++SP   +   Y + + +                       F    +    
Sbjct: 341  AVTEDEDSVVESPICQNVPGYYQSWKD-NLQEYDEEVVDDTCDDNVNVDMFGGGTKQILD 399

Query: 1098 QTRKLPGNCNSMP------PYEHNFEEDNDFQDDSCTSYECQPSRAESCGPVVVAAGSQG 1259
            +   L  NC++ P          + + D + QDDSCTS++              AA  + 
Sbjct: 400  RCYGLVDNCSTCPIAQLQDKISWDEKGDQEVQDDSCTSFD------------AAAASQEN 447

Query: 1260 RRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTCHM 1439
            +  +E  NH        GF G +N G HEY +E CD  IWD GY++  +  VD +PTC+M
Sbjct: 448  QLKSEEGNH-----WSGGFNGSSNRGDHEYVLEPCDTKIWDAGYMTCPENKVDFLPTCNM 502

Query: 1440 IKEVFGEEAWNSKERDGR 1493
            I+EVFG+E+WN K RDG+
Sbjct: 503  IEEVFGKESWNYKARDGK 520


>ref|XP_004299321.1| PREDICTED: uncharacterized protein LOC101293785 [Fragaria vesca
            subsp. vesca]
          Length = 533

 Score =  320 bits (821), Expect = 1e-84
 Identities = 182/441 (41%), Positives = 244/441 (55%), Gaps = 8/441 (1%)
 Frame = +3

Query: 195  FCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLD 374
            + P YF PE   R L   +D+YSF  D  PRK  SIG +HQA +P WG  G+ N S    
Sbjct: 116  YFPEYFNPERPIRTLAS-EDIYSFLLDHSPRKSASIGPEHQAVIPPWGAHGVNNTSS--- 171

Query: 375  VYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMG 554
                 ++HL TS S    D ++ + M+GTCV+PMP+SE  T     VG  RTDC C D  
Sbjct: 172  -----SSHLDTSQSVVDSDLENEKRMMGTCVIPMPNSELSTDCESIVGRGRTDCSCEDRA 226

Query: 555  SIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASL 734
            SIRCVRQHI EAREKL +++G ERFA LGF DMGE VA KW++ +E++FH+VV S+PASL
Sbjct: 227  SIRCVRQHILEAREKLIKNIGPERFAELGFCDMGEQVAEKWSDYEEKLFHQVVFSNPASL 286

Query: 735  GKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ-XXXXXXX 911
             KNFWD L  VFP RTKME+VSYYFN F+LR+RA QNR DP+N+DSDNDEW+        
Sbjct: 287  DKNFWDSLSAVFPLRTKMEIVSYYFNVFMLRKRARQNRYDPVNVDSDNDEWEGSTVHGDN 346

Query: 912  XXXXXXXXXXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLE 1091
                       + SP   ++  +++ +                             +++ 
Sbjct: 347  EPGVTDDDDSVVDSPGYQNDPGFIKSWGG-DMQEYDEDVVDDACDNVNVDIYGGSGKQIS 405

Query: 1092 NKQTRKLPGNCNSMP--PYEHNF----EEDNDFQDDSCTSYECQPSRAESCGPVVVAAGS 1253
            ++    L  N  S P   ++ N     + D + QDDSCTS+E               A  
Sbjct: 406  DRCPGNLVSNGGSSPIVQFQKNIAWDEKGDQEVQDDSCTSFEAG------------VASQ 453

Query: 1254 QGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTC 1433
              +  +E+ +H         F G +  G HEY +E CD  +WD GY +  K  VD +PTC
Sbjct: 454  DNQLRSENGDHWEV----GCFNGTSKLGDHEYVLEPCDAKVWDAGYSTCRKNKVDFLPTC 509

Query: 1434 HMIKEVFGEEAWNS-KERDGR 1493
            +MI+EVFG+++WNS K RDG+
Sbjct: 510  NMIEEVFGKDSWNSYKARDGK 530


>ref|XP_002281450.1| PREDICTED: uncharacterized protein LOC100263964 [Vitis vinifera]
          Length = 521

 Score =  301 bits (770), Expect = 1e-78
 Identities = 180/442 (40%), Positives = 226/442 (51%), Gaps = 7/442 (1%)
 Frame = +3

Query: 189  LSFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDC 368
            +S  P YF  +   R   + DD Y    D+PPRK V IGSDHQ DVP W  +GI ++ D 
Sbjct: 110  VSLFPEYFSSDSPVRASNDSDDYYLSLLDYPPRKSVPIGSDHQVDVPAWS-QGIMDSLDY 168

Query: 369  LDVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLD 548
            L+  +        SG    + +   + +IGTCV+PMP SE   +D   VG  RTDC C D
Sbjct: 169  LETSEQVIFSPQASGLELSVGNIDEKRLIGTCVMPMPKSEPFCNDAV-VGNGRTDCSCHD 227

Query: 549  MGSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPA 728
             GS RCVRQHI EAREKLR +LG ERF  LGFHDMGE VA KWNEE+E++FHEVV S+P 
Sbjct: 228  RGSYRCVRQHIAEAREKLRGTLGEERFVKLGFHDMGEEVAEKWNEEEEQLFHEVVFSNPV 287

Query: 729  SLGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEW--QXXXX 902
            SLGKNFWD+L +VFPSRT  E+VSYYFN F+LR+RAEQNR DP NIDSDNDEW       
Sbjct: 288  SLGKNFWDNLSLVFPSRTTREIVSYYFNVFMLRKRAEQNRYDPENIDSDNDEWPETDDYC 347

Query: 903  XXXXXXXXXXXXXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDE 1082
                          ++SP   ++ +Y     +                         +  
Sbjct: 348  NDEHEMTEEDEDSVVESPIYQEDPSYNPCHADDKRKYEDIGDGTHGDNENVNYGSGMDIL 407

Query: 1083 RLENKQTRKLPGNCNS-----MPPYEHNFEEDNDFQDDSCTSYECQPSRAESCGPVVVAA 1247
             +    T KL  N  S     +     + + D+  +D SCTS                  
Sbjct: 408  DISESCTDKLLNNSGSDSICQLSDVPWDGKGDHGIKDGSCTS------------------ 449

Query: 1248 GSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVP 1427
                  GA+S+    K             G H Y +E CD  +WD GYV+ SK  VDL+ 
Sbjct: 450  ---SNTGADSQRTQAKA----------GNGDHWYALEPCDAKVWDAGYVTCSKTKVDLLS 496

Query: 1428 TCHMIKEVFGEEAWNSKERDGR 1493
            TC MI+EVFG      K  DG+
Sbjct: 497  TCSMIEEVFGAGTGTYKGADGQ 518


>ref|XP_002274465.2| PREDICTED: uncharacterized protein LOC100250913 [Vitis vinifera]
          Length = 550

 Score =  300 bits (768), Expect = 2e-78
 Identities = 177/453 (39%), Positives = 241/453 (53%), Gaps = 18/453 (3%)
 Frame = +3

Query: 192  SFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCL 371
            S  P YF+     R + + +D+YS   D  PR+ V +G DHQA+VPVW L+ +KN  D L
Sbjct: 100  SLSPEYFESYLPRRTVAQFEDIYSSLLDCSPRRQVPVGPDHQANVPVWSLQKVKNRLDKL 159

Query: 372  DVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDM 551
            +  +   +   +  S+  +D ++ E  +GTCV+PMP       +G K G  RTDC CLD 
Sbjct: 160  ETSNRYISSSQSMVSDQTVDGENEERWMGTCVIPMPEENLSAENGVKTGDGRTDCGCLDN 219

Query: 552  GSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPAS 731
             SIRCVRQH+ EAREKLR++LG E+F  LGF DMGE VA KW+EE+E+ FHEVV SHPAS
Sbjct: 220  DSIRCVRQHVMEAREKLRKTLGQEKFMELGFCDMGEEVALKWHEEEEQAFHEVVFSHPAS 279

Query: 732  LGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ-XXXXXX 908
            LG+NFW+HL   F  R K ELVSYYFN F+LR+RA QNR + + IDSD+DEW        
Sbjct: 280  LGQNFWEHLSATFSYRAKQELVSYYFNVFMLRQRAAQNRSNFLYIDSDDDEWHGNNRSLN 339

Query: 909  XXXXXXXXXXXXIQSPADLDESAY------VEHFNEVXXXXXXXXXXXXXXXXXXXXXXF 1070
                        I+S +D    AY       E  ++                       F
Sbjct: 340  EVGTAEEEDDSGIESLSDQHNHAYHEEEPHEEDDDDDDDDDDDEEDDDKDDSDFDGDGGF 399

Query: 1071 AEDERLENKQTRKLPG----NCNSMPPYEHNFE-------EDNDFQDDSCTSYECQPSRA 1217
             +D++   K+   +      + N   P   N +       ED   QDDSC S+ECQP+ A
Sbjct: 400  GDDKQGATKEDGMVHNGKLLDYNMFDPVARNMDKVPDSNGEDFSVQDDSCMSFECQPNVA 459

Query: 1218 ESCGPVVVAAGSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVS 1397
              C P    A  Q   GA      +++  H   +G +      Y +E  +  +WD  Y +
Sbjct: 460  NPCAPSDPEASVQ-ESGARIT---QQKSFHGDDDGSSTRVDPGYLLEPSETKVWDGRYWT 515

Query: 1398 GSKGDVDLVPTCHMIKEVFGEEAWNSKERDGRA 1496
            GS   VDL+PTC+MI+E+FG    NSK +D ++
Sbjct: 516  GSINGVDLLPTCNMIEEIFGLGTPNSKTKDDKS 548


>emb|CAN60165.1| hypothetical protein VITISV_040087 [Vitis vinifera]
          Length = 605

 Score =  299 bits (765), Expect = 4e-78
 Identities = 177/453 (39%), Positives = 240/453 (52%), Gaps = 18/453 (3%)
 Frame = +3

Query: 192  SFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCL 371
            S  P YF+     R + + +D+YS   D  PR+ V +G DHQA+VPVW L+ +KN  D L
Sbjct: 155  SLSPEYFESYLPRRTVAQFEDIYSSLLDCSPRRQVPVGPDHQANVPVWSLQKVKNRLDKL 214

Query: 372  DVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDM 551
            +  +   +   +  S+  +D ++ E  +GTCV+PMP       +G K G  RTDC CLD 
Sbjct: 215  ETSNRYISSSQSMVSDQTVDGENEERWMGTCVIPMPEENLSAENGVKTGDGRTDCGCLDN 274

Query: 552  GSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPAS 731
             SIRCVRQH+ EAREKLR++LG E+F  LGF DMGE VA KW+EE+E+ FHEVV SHPAS
Sbjct: 275  DSIRCVRQHVMEAREKLRKTLGQEKFMELGFCDMGEEVALKWHEEEEQAFHEVVFSHPAS 334

Query: 732  LGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ-XXXXXX 908
            LG+NFW+HL   F  R K ELVSYYFN F+LR+RA QNR + + IDSD+DEW        
Sbjct: 335  LGQNFWEHLSATFSYRAKQELVSYYFNVFMLRQRAAQNRSNFLYIDSDDDEWHGNNRSLN 394

Query: 909  XXXXXXXXXXXXIQSPADLDESAY------VEHFNEVXXXXXXXXXXXXXXXXXXXXXXF 1070
                        I+S +D    AY       E  ++                       F
Sbjct: 395  EVGTAEEEDDSGIESLSDQHNHAYHEEEPHEEDDDDDDDDDDDEEDDDKDDSDFDGDGGF 454

Query: 1071 AEDERLENKQTRKLPG----NCNSMPPYEHNFE-------EDNDFQDDSCTSYECQPSRA 1217
             +D+    K+   +      + N   P   N +       ED   QDDSC S+ECQP+ A
Sbjct: 455  GDDKLGATKEDGMVHNGKLLDYNMFDPVARNMDKVPDSNGEDFSVQDDSCMSFECQPNVA 514

Query: 1218 ESCGPVVVAAGSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVS 1397
              C P    A  Q   GA      +++  H   +G +      Y +E  +  +WD  Y +
Sbjct: 515  NPCAPSDPEASVQ-ESGARIT---QQKSFHGDDDGSSTRVDPGYLLEPSETKVWDGRYWT 570

Query: 1398 GSKGDVDLVPTCHMIKEVFGEEAWNSKERDGRA 1496
            GS   VDL+PTC+MI+E+FG    NSK +D ++
Sbjct: 571  GSINGVDLLPTCNMIEEIFGLGTPNSKTKDDKS 603


>ref|XP_003522999.1| PREDICTED: uncharacterized protein LOC100793553 [Glycine max]
          Length = 522

 Score =  298 bits (762), Expect = 8e-78
 Identities = 171/432 (39%), Positives = 236/432 (54%), Gaps = 8/432 (1%)
 Frame = +3

Query: 189  LSFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNN--- 359
            LS  P YF PE   R L  ++D+YS   +  PRK VS+GSDHQADVP W + G  N    
Sbjct: 111  LSLFPEYFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGSDHQADVPAWDILGATNRPNA 170

Query: 360  SDCLDVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCC 539
            SD + V D    H+          D++ + ++GTCV+PMP  E L+S+ ++VG A TDC 
Sbjct: 171  SDAVSVSDFTVGHI----------DETEKRLMGTCVIPMPQME-LSSNDDEVGKASTDCS 219

Query: 540  CLDMGSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLS 719
            C D GS+RCVRQHI E REK  ++ G+E+F  LGF +MGE VA  W+ EDE++FHEVV +
Sbjct: 220  CEDQGSMRCVRQHIAEEREKHIKTFGVEKFTELGFTNMGEQVAENWSAEDEQLFHEVVFN 279

Query: 720  HPASLGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXX 899
            +P SL KNFW++L + FPSRTK E+VSYYFN F+L+RRAEQNR D ++IDSDNDEWQ   
Sbjct: 280  NPVSLDKNFWNYLSIAFPSRTKKEIVSYYFNVFMLQRRAEQNRNDLLSIDSDNDEWQ-GS 338

Query: 900  XXXXXXXXXXXXXXXIQSPADLDESAYVEHFN---EVXXXXXXXXXXXXXXXXXXXXXXF 1070
                            +SP   DE+   +  N   +                        
Sbjct: 339  EGNDIATREEDEDSVAESPVCHDETCMADCHNNDLQAYNEYAADETCAANETVDFTNKNI 398

Query: 1071 AEDERLENKQTRKLPGNCNSMPPYEHNFEE--DNDFQDDSCTSYECQPSRAESCGPVVVA 1244
             +D + +  +     G+    P  +  +++  D   ++DSCTS +            V  
Sbjct: 399  DDDSQYDPIEMHHSSGSPLIQPQDQPIWQDSCDGKVKEDSCTSSD------------VGV 446

Query: 1245 AGSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLV 1424
            A  + +   E+ +H         + G NN  S  Y +EHCD  +WD G+VS SK  +D V
Sbjct: 447  ASQETKVNTENGDH-----WCGNYNGVNNGYSQGYVLEHCDAKVWDSGFVSCSKNKIDFV 501

Query: 1425 PTCHMIKEVFGE 1460
            PTC+MI+EVFG+
Sbjct: 502  PTCNMIEEVFGD 513


>ref|XP_006483051.1| PREDICTED: uncharacterized protein LOC102614272 [Citrus sinensis]
          Length = 541

 Score =  290 bits (743), Expect = 1e-75
 Identities = 169/430 (39%), Positives = 242/430 (56%), Gaps = 7/430 (1%)
 Frame = +3

Query: 219  EHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYDPDNAH 398
            ++  R  V  +D YS   D  PRK V +G +HQA +P W     KN  D       +N+ 
Sbjct: 117  DYPRRTFVPFEDSYSSLLDRSPRKQVPLGPNHQAILPSWDRSMGKNILDGKATLRGNNSL 176

Query: 399  LHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSIRCVRQH 578
            +H  GS+  +D+D+ E+ +GTC++PMP S +   + ++VG    DC CLD GSIRCV+QH
Sbjct: 177  VHL-GSHNVVDNDNEEKWMGTCIIPMPDSNSFAHNIDQVGRGIMDCDCLDEGSIRCVQQH 235

Query: 579  ITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGKNFWDHL 758
            + EAREKL +SLG E+F  LG  DMGE V+ KW+EE+E+VFHEVV S+P SLG+NFW  L
Sbjct: 236  VMEAREKLLKSLGHEKFVKLGLCDMGEEVSCKWSEEEEQVFHEVVYSNPFSLGRNFWKQL 295

Query: 759  YMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ-XXXXXXXXXXXXXXX 935
              VFPSRTK E+VSYYFN FVLRRRA QNR D + IDSD+DEW                 
Sbjct: 296  SAVFPSRTKKEIVSYYFNVFVLRRRAVQNRSDLLEIDSDDDEWHGGYGGSDEIRISEEDE 355

Query: 936  XXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLENKQTRKL- 1112
               I+SP D + +   E  ++                           + + +    K  
Sbjct: 356  DSAIESPVDQENADCGEDSSDEDDDDGGDSDGDVGDGGGEVTGETCGTDHVSDTNIAKSF 415

Query: 1113 -PGNCNSMPPYEHNFEED--NDF--QDDSCTSYECQPSRAESCGPVVVAAGSQGRRGAES 1277
              G  +++ P+      D  +DF  +D+SCTS+E QP  ++SCG +  A   Q   G  +
Sbjct: 416  DEGGFDAVVPHMDKIPGDAGDDFNVEDESCTSFEFQPDMSDSCGAIDAAHALQ-LSGVRT 474

Query: 1278 ENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTCHMIKEVFG 1457
            E+    + LH   +G+N+   H   ++ CD  +WD  Y+S  KG V+L+PTC++I+E+FG
Sbjct: 475  EH---GKALHGRLDGYNDLVGHMNLLDSCDAKVWDARYLSPIKG-VELLPTCNIIEEIFG 530

Query: 1458 EEAWNSKERD 1487
            +  W++K R+
Sbjct: 531  QGTWDTKTRN 540


>gb|AHB59599.1| putative MYB-related protein 12 [Arachis hypogaea]
          Length = 538

 Score =  289 bits (739), Expect = 4e-75
 Identities = 165/430 (38%), Positives = 238/430 (55%), Gaps = 10/430 (2%)
 Frame = +3

Query: 201  PFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVY 380
            P YF PE   R L  ++D+YS   + PPRKLVS+G++HQAD+PVW         D     
Sbjct: 125  PEYFSPEKPFRTLARYEDIYSILIENPPRKLVSMGANHQADIPVW---------DSSVAI 175

Query: 381  DPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSI 560
            D  NA    S   + + D+  + ++GTC++PMP  E L+SD + VG  RT+C C D GSI
Sbjct: 176  DRPNASEDVSNLGFPIGDEDEKRLMGTCIIPMPQME-LSSDNDDVGKGRTNCWCEDRGSI 234

Query: 561  RCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGK 740
            RCVRQHI E RE+L +  G E+F  LGF+DMGE VA KW+ E+ER+FHEVV ++P SLGK
Sbjct: 235  RCVRQHIAEERERLLKEFGHEKFDELGFNDMGERVAEKWSAEEERLFHEVVFNNPVSLGK 294

Query: 741  NFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXXXXX 920
            NFW +L +  PSR+K E+VSYYFN F+LR+RAEQNR D ++IDSDNDEWQ          
Sbjct: 295  NFWHYLSIALPSRSKKEIVSYYFNVFMLRKRAEQNRNDALSIDSDNDEWQGSDGIDIATR 354

Query: 921  XXXXXXXXIQSPADLDESAYVE-HFN-EVXXXXXXXXXXXXXXXXXXXXXXFAEDERLEN 1094
                    + SP D ++  +   H N +V                         D+  ++
Sbjct: 355  EEDEDDSVVDSPVDQNDIGFTSCHENDQVDYDDEFAADEICAVNGTVDLTKRNIDDEDDS 414

Query: 1095 KQ-----TRKLPGN--CNSMPPYEHNF-EEDNDFQDDSCTSYECQPSRAESCGPVVVAAG 1250
            K       R    N  C  + P++    ++D + +D++CT  +           VV +  
Sbjct: 415  KYDAVSVARSTVPNRFCPPIQPHDQTIHKDDENVKDETCTFSDA----------VVSSQE 464

Query: 1251 SQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPT 1430
            ++ + GAE +     Q   +  E  +N  S+ + +E CD  +WD  ++S SK  +D +PT
Sbjct: 465  TRAKSGAEGD-----QWCGNYNEVASNGYSNGHVLEPCDAKVWDPAFLSCSKSKIDFLPT 519

Query: 1431 CHMIKEVFGE 1460
            C+MI+E+FG+
Sbjct: 520  CNMIEEIFGD 529


>ref|XP_006438810.1| hypothetical protein CICLE_v10031172mg [Citrus clementina]
            gi|557541006|gb|ESR52050.1| hypothetical protein
            CICLE_v10031172mg [Citrus clementina]
          Length = 541

 Score =  286 bits (733), Expect = 2e-74
 Identities = 168/430 (39%), Positives = 240/430 (55%), Gaps = 7/430 (1%)
 Frame = +3

Query: 219  EHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYDPDNAH 398
            ++  R  V  +D YS   D  PRK V +G +HQA +P W     KN  D       +N+ 
Sbjct: 117  DYPRRTFVPFEDSYSSLLDRSPRKQVPLGPNHQAILPSWDRSMGKNILDGKATLRGNNSL 176

Query: 399  LHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSIRCVRQH 578
             H  GS+  +D+D+ E+ +GTC++PMP S +   + ++VG    DC CLD GSIRCV+QH
Sbjct: 177  DHL-GSHNVVDNDNEEKWMGTCIIPMPDSNSFAHNIDQVGRGIMDCDCLDEGSIRCVQQH 235

Query: 579  ITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGKNFWDHL 758
            + EAREKL +SLG E+F  LG  DMGE V+ KW+EE+E+VFHEVV S+P SLG+NFW  L
Sbjct: 236  VMEAREKLLKSLGHEKFVKLGLCDMGEEVSCKWSEEEEQVFHEVVYSNPFSLGRNFWKQL 295

Query: 759  YMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ-XXXXXXXXXXXXXXX 935
              VFPSRTK E+VSYYFN FVLRRRA QNR D + IDSD+DEW                 
Sbjct: 296  SAVFPSRTKKEIVSYYFNVFVLRRRAVQNRSDLLEIDSDDDEWHGGYGGSDEIRISEEDE 355

Query: 936  XXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLENKQTRKL- 1112
               I+SP D + +   E  ++                           + + +    K  
Sbjct: 356  DSAIESPVDQENADCGEDSSDEDDDDGGDSDGDVGDGGGEVTGETCGTDHVSDTNIAKSF 415

Query: 1113 -PGNCNSMPPYEHNFEED--NDF--QDDSCTSYECQPSRAESCGPVVVAAGSQGRRGAES 1277
              G  +++ P+      D  +DF  +D+SCTS+E QP  ++SCG +      Q   G  +
Sbjct: 416  DEGGFDAVVPHMDKIPGDAGDDFNVEDESCTSFEFQPDMSDSCGAIDAEHALQ-LSGVRT 474

Query: 1278 ENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTCHMIKEVFG 1457
            E+    + LH   +G+N+   H   ++ CD  +WD  Y+S  KG V+L+PTC++I+E+FG
Sbjct: 475  EH---GKALHGRLDGYNDLVGHMNLLDSCDAKVWDARYLSPIKG-VELLPTCNIIEEIFG 530

Query: 1458 EEAWNSKERD 1487
            +  W++K R+
Sbjct: 531  QGTWDTKTRN 540


>ref|XP_007138262.1| hypothetical protein PHAVU_009G193800g [Phaseolus vulgaris]
            gi|561011349|gb|ESW10256.1| hypothetical protein
            PHAVU_009G193800g [Phaseolus vulgaris]
          Length = 522

 Score =  285 bits (729), Expect = 6e-74
 Identities = 169/431 (39%), Positives = 237/431 (54%), Gaps = 7/431 (1%)
 Frame = +3

Query: 189  LSFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDC 368
            LS  P YF PE   R L  ++D+YS   +  PRK VS+G++HQADVP           DC
Sbjct: 111  LSLFPEYFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGANHQADVPAL---------DC 161

Query: 369  LDVYDPDNAHLHTSGSNYKLDD--DSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCC 542
            L   +  N     S +++ + D  ++ ++++GTCV+P+P  E L+S  ++VG  RT+C C
Sbjct: 162  LGATNKSNVSASDSDTDFTVGDRDETEKKLLGTCVIPLPQME-LSSCDDEVGKGRTECNC 220

Query: 543  LDMGSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSH 722
             D GS+RCVRQHI E R+KL ++ G E+F  LGF +MGE VA KW+ EDE++FHEVV ++
Sbjct: 221  EDQGSMRCVRQHIAEERDKLLKTFGPEKFTELGFTNMGEQVAEKWSVEDEQLFHEVVFNN 280

Query: 723  PASLGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXX 902
            PASL KNFW++L + FPSRTK E+VSYYFN F+LRRRAEQNR D +NIDSDNDEWQ    
Sbjct: 281  PASLDKNFWNYLSIAFPSRTKKEIVSYYFNVFMLRRRAEQNRNDLLNIDSDNDEWQ-GSD 339

Query: 903  XXXXXXXXXXXXXXIQSPADLDESAYVE-HFNEVXXXXXXXXXXXXXXXXXXXXXXFAED 1079
                           +SP   DES   + H N++                         D
Sbjct: 340  SNDIATREEDEDSVAESPVCQDESCMADCHDNDLQTYDEYAADETCAANETVDFTSRNID 399

Query: 1080 ERLENKQTR-KLPGNCNSM-PPYEHNFEE--DNDFQDDSCTSYECQPSRAESCGPVVVAA 1247
            +  +         G C  + PP +  +++  D   +DDSCTS +               A
Sbjct: 400  DGSKYDPVELHHSGRCPLIQPPDQPVWQDSCDEKVKDDSCTSSD------------TGVA 447

Query: 1248 GSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVP 1427
              Q +   E+ +H         + G +N  +  Y +E CD  +WD G+VS SK  +D +P
Sbjct: 448  SQQTKVNTENGDH-----WCGNYNGVSNGYNQGYVLEPCDAKVWDSGFVSCSKNKMDFLP 502

Query: 1428 TCHMIKEVFGE 1460
            TC+MI+EVFG+
Sbjct: 503  TCNMIEEVFGD 513


>ref|XP_004499488.1| PREDICTED: uncharacterized protein LOC101494171 isoform X1 [Cicer
            arietinum] gi|502126914|ref|XP_004499489.1| PREDICTED:
            uncharacterized protein LOC101494171 isoform X2 [Cicer
            arietinum]
          Length = 533

 Score =  285 bits (729), Expect = 6e-74
 Identities = 166/427 (38%), Positives = 230/427 (53%), Gaps = 7/427 (1%)
 Frame = +3

Query: 201  PFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVY 380
            P YF PE   R L  ++D+YS   +  PRK VS+G++HQADVP WG             Y
Sbjct: 122  PIYFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGANHQADVPPWGFSRAS--------Y 173

Query: 381  DPDNAHLHTSGSNYKL--DDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMG 554
             P +A    S SN+     D++ + ++GTC++PMP  E LTS  +KVG  RTDC C+D  
Sbjct: 174  VP-HASGTVSDSNFTAWNRDEAEKRLMGTCIIPMPEME-LTSIDQKVGKGRTDCSCVDRE 231

Query: 555  SIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASL 734
            S+RCVRQHI E REKL +S+G E+F  LGF DMGE VA KW+ EDE +FH+VV ++PASL
Sbjct: 232  SMRCVRQHIMEEREKLLKSIGFEKFTELGFADMGEQVAEKWSAEDEHLFHKVVFNNPASL 291

Query: 735  GKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXXX 914
             +NFW++L +VFPSRTK E+VSYYFN F+LR+RAEQNR   +N DSDNDEWQ        
Sbjct: 292  NRNFWNYLSIVFPSRTKKEIVSYYFNVFMLRKRAEQNRNHLLNADSDNDEWQ-GNDENEI 350

Query: 915  XXXXXXXXXXIQSPA---DLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDER 1085
                       + P    D + +    H  E                         +D +
Sbjct: 351  STHDEDDDSVTEYPICQDDCNNNCNDNHLEEYDDEFAADETFTVKGTMDCTKRNIGDDSK 410

Query: 1086 LENKQTRKLPGNCNSMPPYEHNFEE--DNDFQDDSCTSYECQPSRAESCGPVVVAAGSQG 1259
             ++       G+    P  +H +++  D   + DS TS++   +  E    + V +GS  
Sbjct: 411  YDHVGMHNSNGSPLIQPQDQHVWQDSCDEKVKGDSYTSHDIGVASRE----IKVKSGS-- 464

Query: 1260 RRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTCHM 1439
                     H     +    G+++  S  Y +E CD  +WD G+VS SK  +D +PTC M
Sbjct: 465  -------GDHWSSNYNGVSNGYSHGYSQGYVLEPCDAPVWDSGFVSCSKNKIDFLPTCSM 517

Query: 1440 IKEVFGE 1460
            I+EVFG+
Sbjct: 518  IEEVFGD 524


>ref|XP_007037501.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590668470|ref|XP_007037502.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508774746|gb|EOY22002.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508774747|gb|EOY22003.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 515

 Score =  283 bits (724), Expect = 2e-73
 Identities = 168/487 (34%), Positives = 240/487 (49%), Gaps = 11/487 (2%)
 Frame = +3

Query: 33   SECTERVVSDIATDSVGADKEFNPNXXXXXXXXXXXXXXXXXXXXXXXXXXQLSFCPFYF 212
            +EC E++ + I T   G  ++F  N                           +      F
Sbjct: 58   TECDEKLANAIDTKHPGNAEDFEANVPSCIAISSLGTCCTGEEDSWPEEPLHIPSFAECF 117

Query: 213  KPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYDPDN 392
             PE Q R     DD+YS   + PPRK V  G ++QAD+P W  +  +N S+  D  +   
Sbjct: 118  HPERQVRTSARWDDIYSILLECPPRKQVLAGPNYQADIPEWDSQVARNTSNDTDASE--- 174

Query: 393  AHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSIRCVR 572
                T+   Y+       +++GTC++PMP+ E    D +KVG+ R+DC C D  S+RCVR
Sbjct: 175  ----TAADRYE------NKLMGTCIIPMPAFECSAYD-DKVGSGRSDCSCEDKDSVRCVR 223

Query: 573  QHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGKNFWD 752
            QHI EARE+LR+SLG E+F  LGF DMGE+V  KW+EE+E++FH+VV S+PASLG+NFWD
Sbjct: 224  QHIMEAREELRKSLGHEKFVELGFCDMGELVTMKWSEEEEQLFHKVVFSNPASLGRNFWD 283

Query: 753  HLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXXXXXXXXX 932
             L  V+P RTK ++VSYYFN F+LR+R+EQNR + M+IDSDNDEWQ              
Sbjct: 284  SLVSVYPYRTKEDIVSYYFNVFMLRKRSEQNRCESMSIDSDNDEWQGTDDSGNNEVGFSD 343

Query: 933  XXXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLENKQT--R 1106
                    + + +  +  H ++                        +  +  +  +T   
Sbjct: 344  EDEDSVIESPICQEDFDNHRSQEAGLCVFDEDIADETCDNHSIDFGSRGDATKVSETYSE 403

Query: 1107 KLPGNCNSMPPYE---------HNFEEDNDFQDDSCTSYECQPSRAESCGPVVVAAGSQG 1259
            KL  +C S P  +            +E+ + QD SCTS +   +  E+  PV        
Sbjct: 404  KLFSSCGSDPTAQLHGKTLKDTQGEQEEREVQDYSCTSSDTGAASHET--PV-------- 453

Query: 1260 RRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTCHM 1439
                   N            G NN GSH Y +E CD  +WD GY +  K  +D +PTC M
Sbjct: 454  -------NADNADQWQGNLNGLNNGGSHGYVLEPCDTKVWDAGYPTCQKNKIDFLPTCSM 506

Query: 1440 IKEVFGE 1460
            I+EVFG+
Sbjct: 507  IEEVFGD 513


>ref|XP_007037503.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508774748|gb|EOY22004.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 490

 Score =  281 bits (719), Expect = 8e-73
 Identities = 160/428 (37%), Positives = 225/428 (52%), Gaps = 11/428 (2%)
 Frame = +3

Query: 210  FKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYDPD 389
            F PE Q R     DD+YS   + PPRK V  G ++QAD+P W  +  +N S+  D  +  
Sbjct: 92   FHPERQVRTSARWDDIYSILLECPPRKQVLAGPNYQADIPEWDSQVARNTSNDTDASE-- 149

Query: 390  NAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSIRCV 569
                 T+   Y+       +++GTC++PMP+ E    D +KVG+ R+DC C D  S+RCV
Sbjct: 150  -----TAADRYE------NKLMGTCIIPMPAFECSAYD-DKVGSGRSDCSCEDKDSVRCV 197

Query: 570  RQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGKNFW 749
            RQHI EARE+LR+SLG E+F  LGF DMGE+V  KW+EE+E++FH+VV S+PASLG+NFW
Sbjct: 198  RQHIMEAREELRKSLGHEKFVELGFCDMGELVTMKWSEEEEQLFHKVVFSNPASLGRNFW 257

Query: 750  DHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXXXXXXXX 929
            D L  V+P RTK ++VSYYFN F+LR+R+EQNR + M+IDSDNDEWQ             
Sbjct: 258  DSLVSVYPYRTKEDIVSYYFNVFMLRKRSEQNRCESMSIDSDNDEWQGTDDSGNNEVGFS 317

Query: 930  XXXXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLENKQT-- 1103
                     + + +  +  H ++                        +  +  +  +T  
Sbjct: 318  DEDEDSVIESPICQEDFDNHRSQEAGLCVFDEDIADETCDNHSIDFGSRGDATKVSETYS 377

Query: 1104 RKLPGNCNSMPPYE---------HNFEEDNDFQDDSCTSYECQPSRAESCGPVVVAAGSQ 1256
             KL  +C S P  +            +E+ + QD SCTS +   +  E+  PV       
Sbjct: 378  EKLFSSCGSDPTAQLHGKTLKDTQGEQEEREVQDYSCTSSDTGAASHET--PV------- 428

Query: 1257 GRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVPTCH 1436
                    N            G NN GSH Y +E CD  +WD GY +  K  +D +PTC 
Sbjct: 429  --------NADNADQWQGNLNGLNNGGSHGYVLEPCDTKVWDAGYPTCQKNKIDFLPTCS 480

Query: 1437 MIKEVFGE 1460
            MI+EVFG+
Sbjct: 481  MIEEVFGD 488


>ref|XP_004241596.1| PREDICTED: uncharacterized protein LOC101258762 isoform 1 [Solanum
            lycopersicum] gi|460391983|ref|XP_004241597.1| PREDICTED:
            uncharacterized protein LOC101258762 isoform 2 [Solanum
            lycopersicum]
          Length = 546

 Score =  281 bits (718), Expect = 1e-72
 Identities = 169/445 (37%), Positives = 229/445 (51%), Gaps = 15/445 (3%)
 Frame = +3

Query: 207  YFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYDP 386
            Y+  +   R ++   +VYS  F+ PPRK V IG D QA++P WG    KN S      + 
Sbjct: 118  YYSSDPPFRVVIHPMEVYSPLFNNPPRKSVPIGPDFQAELPEWGAYDSKNISVKESTQES 177

Query: 387  DNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSIRC 566
             N       S++    D   ++ GTC++PMP  E+     E VG  R  C C D GS  C
Sbjct: 178  SNLPSQALESDFVDHHDEENKLAGTCIIPMPKLESPADHEENVGAGRIGCSCGDAGSFGC 237

Query: 567  VRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGKNF 746
            VR HI EAREKL+ +LG E F  LG +DMGE+VA KW+EE+E +FHEVV S+PA+LGKNF
Sbjct: 238  VRLHIMEAREKLKAALGEETFVRLGVYDMGEIVAEKWSEEEEELFHEVVFSNPAALGKNF 297

Query: 747  WDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ---XXXXXXXXX 917
            WDHL + FPSR+K +LVSYYFN F+LR+RA+QNR DP NIDSDNDEWQ            
Sbjct: 298  WDHLAVEFPSRSKRDLVSYYFNVFILRKRAKQNRFDPSNIDSDNDEWQEIDDDVVATGAQ 357

Query: 918  XXXXXXXXXIQSPA-----------DLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXX 1064
                     ++SP              ++ AY E    V                     
Sbjct: 358  MTDDDEDSVVESPIYQNYPGHNEIYVTEKQAYDEEAG-VATLEDYQTINFCRRKVLSDVS 416

Query: 1065 XFAEDERLENKQTRKLPGNCNSMPPYEHNFEEDNDFQDDSCTSYECQPSRAESCGPVVVA 1244
                DE ++N  +     N   +  +  N   ++D +D+SCT+                A
Sbjct: 417  KACPDELIDNNSS--CGHNIQPLDRHHSNEVGNHDVEDNSCTT---------------DA 459

Query: 1245 AGSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDL-GYVSGSKGDVDL 1421
            AG+         +  +    H  F G      H++ +E  +G  WD+ GY+S  K +VDL
Sbjct: 460  AGASSDTPQVKTDDCKHWASH--FAGVGIDSGHDFVMEPSNGKEWDMGGYLSCPKNEVDL 517

Query: 1422 VPTCHMIKEVFGEEAWNSKERDGRA 1496
            +PTC MI+EVFG+EAW+SK RDG +
Sbjct: 518  LPTCSMIEEVFGDEAWSSKHRDGHS 542


>ref|XP_006354761.1| PREDICTED: uncharacterized protein LOC102579656 isoform X1 [Solanum
            tuberosum] gi|565376542|ref|XP_006354762.1| PREDICTED:
            uncharacterized protein LOC102579656 isoform X2 [Solanum
            tuberosum]
          Length = 545

 Score =  280 bits (715), Expect = 2e-72
 Identities = 170/451 (37%), Positives = 227/451 (50%), Gaps = 21/451 (4%)
 Frame = +3

Query: 207  YFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYDP 386
            Y+  +   R ++   +VYS   + PPRK V IG D QA++P WG    KN S      + 
Sbjct: 118  YYNTDPSFRVVIHPMEVYSPLLNNPPRKSVPIGPDFQAELPEWGAYDCKNISMKESTQES 177

Query: 387  DNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMGSIRC 566
             N       S +    D   ++ GTC++PMP  E      E VG  +  C C D GS  C
Sbjct: 178  PNLPSQALESGFVDHHDEENKLAGTCIIPMPKLELPADHEENVGAGKIGCSCEDAGSFGC 237

Query: 567  VRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASLGKNF 746
            VR HI EAREKL+ +LG E F  LG +DMGE+VA KW++E+E +FHEVV S+PA+LGKNF
Sbjct: 238  VRLHIMEAREKLKAALGEETFVRLGVYDMGEIVAAKWSDEEEELFHEVVFSNPAALGKNF 297

Query: 747  WDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXXXXXXX 926
            W+HL + FPSR+K +LVSYYFN F+LR+RA+QNR DP NIDSDNDEWQ            
Sbjct: 298  WEHLAVEFPSRSKRDLVSYYFNVFILRKRAKQNRFDPSNIDSDNDEWQEIDDDVVATG-- 355

Query: 927  XXXXXXIQSPADLDESAYVEH--FNEVXXXXXXXXXXXXXXXXXXXXXXFAEDERLENKQ 1100
                       D DE + VE   +                         F ED R  N  
Sbjct: 356  -------AQMTDEDEDSMVESPIYQNYPGHNEIYVTEKQAYDEEAGVATF-EDYRTINFC 407

Query: 1101 TRKLPGNCNSMPPYE---------HNFEE----------DNDFQDDSCTSYECQPSRAES 1223
             RK+  + +   P E         HN +           + D +D+SCT+          
Sbjct: 408  RRKVLSDASKACPDELIDNNSSCGHNIQPLDRHHSNEVGNPDVEDNSCTT---------- 457

Query: 1224 CGPVVVAAGSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGS 1403
                  AAG+         +  +    H  F G      H++ +E  +G  WD GY+S +
Sbjct: 458  -----DAAGASSETPQVKTDDCKHWASH--FAGVGIGSVHDFVMEPSNGKEWDTGYLSCA 510

Query: 1404 KGDVDLVPTCHMIKEVFGEEAWNSKERDGRA 1496
            K +VDL+PTC MI+EVFG+EAW+SK RDG +
Sbjct: 511  KNEVDLLPTCSMIEEVFGDEAWSSKNRDGHS 541


>ref|XP_007032692.1| Uncharacterized protein TCM_018715 [Theobroma cacao]
            gi|508711721|gb|EOY03618.1| Uncharacterized protein
            TCM_018715 [Theobroma cacao]
          Length = 481

 Score =  278 bits (712), Expect = 5e-72
 Identities = 168/422 (39%), Positives = 221/422 (52%), Gaps = 17/422 (4%)
 Frame = +3

Query: 15   DGGFNKSECTERVVSDIATDSVGADKEFNPNXXXXXXXXXXXXXXXXXXXXXXXXXXQLS 194
            +G F++  C  +V+S       GA+KE+  +                           L 
Sbjct: 61   EGRFDEDPCN-KVLS-------GANKEYETSASCSVPHFWWVNSNGIDADTESEVAVHLP 112

Query: 195  FCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLD 374
              P YF   HQ R  +  D++YS      PRKLVSIG +HQA++P W  +G+K++SDC D
Sbjct: 113  LFPEYFASGHQIRAFLHADEIYSSILS--PRKLVSIGPEHQANIPEWRQQGLKSSSDCPD 170

Query: 375  VYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDG-EKVGTARTDCCCLDM 551
              DP      +  S    DDD  ++M+GTCV+PMP SE       E VG  R DC CLD 
Sbjct: 171  TSDPQVPLKSSCASLMVDDDDDQKKMMGTCVIPMPDSETTAKFCCEDVGH-RIDCECLDQ 229

Query: 552  GSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPAS 731
            GSIRC+RQH+TEARE LR++LG E F  LGF D GE +A++W EE+E  F  VVL++P S
Sbjct: 230  GSIRCIRQHVTEARENLRKNLGPELFGELGFCDTGEELAKRWPEEEELAFQNVVLTNPVS 289

Query: 732  LGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXX 911
            LGKNFWDHL  VFPS +K +LVSYYFN F+LR+RAEQNR+DP+NIDSD+DEWQ       
Sbjct: 290  LGKNFWDHLPAVFPSHSKRDLVSYYFNVFMLRKRAEQNRVDPVNIDSDDDEWQ----TAE 345

Query: 912  XXXXXXXXXXXIQSPADLDESAYVEH-----FNEVXXXXXXXXXXXXXXXXXXXXXXFAE 1076
                       ++SP+D   SA+ EH      +E                         +
Sbjct: 346  CGIPAEDDDSVVESPSDQGTSAHFEHNHVEDCHEYIEDDDEDGVDSSGNVVADICRAATD 405

Query: 1077 DE------RLENKQTRKLPGNCNS-----MPPYEHNFEEDNDFQDDSCTSYECQPSRAES 1223
            +E       +         GN +S         + N E+D D QDDSCTSYE Q  + + 
Sbjct: 406  EEDEGDIDEISGPHVENFIGNYDSCDFQLSSKVQGNNEDDYDIQDDSCTSYEYQREKVDC 465

Query: 1224 CG 1229
            CG
Sbjct: 466  CG 467


>ref|XP_007045913.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508709848|gb|EOY01745.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 527

 Score =  278 bits (712), Expect = 5e-72
 Identities = 166/439 (37%), Positives = 226/439 (51%), Gaps = 7/439 (1%)
 Frame = +3

Query: 189  LSFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDC 368
            L   P YF  +   R     +D YS   D  PR+ V +G +HQA+VP WG    K     
Sbjct: 106  LPVSPEYFDFDLPRRTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVPSWGRHVKKYEFAQ 165

Query: 369  LDVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLD 548
             D  D               D+D  E M+GTCV+PMP S    ++  KVG  RTDC CLD
Sbjct: 166  SDASD-------------STDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCLD 212

Query: 549  MGSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPA 728
             GS+RCV+QH+ EARE+LR+SLG E+F  LGF+DMGE VA KW+EEDE +F EVV S+P+
Sbjct: 213  RGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNPS 272

Query: 729  SLGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXX 908
            SLGK FW  L +VFPSR+K ELVSYYFN F+L+RRA QNR   ++IDSD+DEW       
Sbjct: 273  SLGKKFWKDLSVVFPSRSKRELVSYYFNVFILQRRAVQNRSSMLDIDSDDDEWHGSQQAY 332

Query: 909  XXXXXXXXXXXXIQSPADLDESAYVEH---FNEVXXXXXXXXXXXXXXXXXXXXXXFAED 1079
                        I+S AD ++ A  E     ++                       +  +
Sbjct: 333  EVQDSDEDEDSAIESLADQEDLANREGECLQDDDDDDDDDDESDVGDGSCALTREDYGVN 392

Query: 1080 ERLENKQTRKLPGN----CNSMPPYEHNFEEDNDFQDDSCTSYECQPSRAESCGPVVVAA 1247
              LE    +    +    C           ED + QDDSC S+E QP+  +S   +   A
Sbjct: 393  HLLEGHVAKSFDESRFDPCFQQTNKVSGIGEDFNVQDDSCMSFEFQPNMVDSLSVIDTKA 452

Query: 1248 GSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVP 1427
             S    G +++N      L    +G ++   H Y  + CD  IWD  Y +     +DL P
Sbjct: 453  NSH-VNGVKTDN-----CLRGRLDGSSDLAHHVYLFDSCDTKIWDTRYPTAPTKGIDLQP 506

Query: 1428 TCHMIKEVFGEEAWNSKER 1484
            TC++I+E+FG++  ++K R
Sbjct: 507  TCNIIEEIFGQDTRDNKTR 525


>ref|XP_007045912.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508709847|gb|EOY01744.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 526

 Score =  278 bits (712), Expect = 5e-72
 Identities = 166/439 (37%), Positives = 226/439 (51%), Gaps = 7/439 (1%)
 Frame = +3

Query: 189  LSFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDC 368
            L   P YF  +   R     +D YS   D  PR+ V +G +HQA+VP WG    K     
Sbjct: 105  LPVSPEYFDFDLPRRTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVPSWGRHVKKYEFAQ 164

Query: 369  LDVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLD 548
             D  D               D+D  E M+GTCV+PMP S    ++  KVG  RTDC CLD
Sbjct: 165  SDASD-------------STDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCLD 211

Query: 549  MGSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPA 728
             GS+RCV+QH+ EARE+LR+SLG E+F  LGF+DMGE VA KW+EEDE +F EVV S+P+
Sbjct: 212  RGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNPS 271

Query: 729  SLGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXX 908
            SLGK FW  L +VFPSR+K ELVSYYFN F+L+RRA QNR   ++IDSD+DEW       
Sbjct: 272  SLGKKFWKDLSVVFPSRSKRELVSYYFNVFILQRRAVQNRSSMLDIDSDDDEWHGSQQAY 331

Query: 909  XXXXXXXXXXXXIQSPADLDESAYVEH---FNEVXXXXXXXXXXXXXXXXXXXXXXFAED 1079
                        I+S AD ++ A  E     ++                       +  +
Sbjct: 332  EVQDSDEDEDSAIESLADQEDLANREGECLQDDDDDDDDDDESDVGDGSCALTREDYGVN 391

Query: 1080 ERLENKQTRKLPGN----CNSMPPYEHNFEEDNDFQDDSCTSYECQPSRAESCGPVVVAA 1247
              LE    +    +    C           ED + QDDSC S+E QP+  +S   +   A
Sbjct: 392  HLLEGHVAKSFDESRFDPCFQQTNKVSGIGEDFNVQDDSCMSFEFQPNMVDSLSVIDTKA 451

Query: 1248 GSQGRRGAESENHHRKQPLHSGFEGFNNAGSHEYGIEHCDGVIWDLGYVSGSKGDVDLVP 1427
             S    G +++N      L    +G ++   H Y  + CD  IWD  Y +     +DL P
Sbjct: 452  NSH-VNGVKTDN-----CLRGRLDGSSDLAHHVYLFDSCDTKIWDTRYPTAPTKGIDLQP 505

Query: 1428 TCHMIKEVFGEEAWNSKER 1484
            TC++I+E+FG++  ++K R
Sbjct: 506  TCNIIEEIFGQDTRDNKTR 524


>ref|XP_004138186.1| PREDICTED: uncharacterized protein LOC101205795 [Cucumis sativus]
            gi|449477160|ref|XP_004154947.1| PREDICTED:
            uncharacterized LOC101205795 [Cucumis sativus]
          Length = 520

 Score =  276 bits (705), Expect = 3e-71
 Identities = 170/449 (37%), Positives = 237/449 (52%), Gaps = 19/449 (4%)
 Frame = +3

Query: 207  YFKP-EHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKNNSDCLDVYD 383
            +F P  HQ R L   +++YS   D  P+K VSIG +HQA VP W    +           
Sbjct: 117  FFNPVNHQRRILTYCEEIYSLLLDHAPQKSVSIGPEHQAIVPPWRPREV----------- 165

Query: 384  PDNAHLHTSGSNYKLD---DDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDCCCLDMG 554
              +  LH  GS+ K +   D+  + + GTCV+PMP  ++  S G++VG+ R  C C D G
Sbjct: 166  --DVILHAPGSDSKSNFTGDEYEKRLTGTCVIPMPDVDSSISSGQEVGSGRAACSCEDCG 223

Query: 555  SIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVLSHPASL 734
            S+ CV  HI EARE+L+ S+G +RFA LGF +MGE +A+KW+EE+ER+F+EVV S+P S+
Sbjct: 224  SVGCVSTHIAEAREQLKSSIGPDRFADLGFSEMGEQLAQKWSEEEERLFYEVVFSNPVSM 283

Query: 735  GKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQXXXXXXXX 914
            GKNFW  L +VF S++K E+VSYYFN F+LRRRAEQNR D +NIDSDNDEW         
Sbjct: 284  GKNFWSDLSVVFASKSKREIVSYYFNVFMLRRRAEQNRCDSLNIDSDNDEW--------- 334

Query: 915  XXXXXXXXXXIQSPADLDESAYVEHFNEVXXXXXXXXXXXXXXXXXXXXXXFAE---DER 1085
                           D +     E  + V                      + E   DER
Sbjct: 335  --------PGTDDYGDNEPGMTEEDDDSVVESPLHDIGSCFDRSREDELQEYDEDIADER 386

Query: 1086 LENKQTRKLP---GNCNSMPPYEHNFEE-----DNDFQDDSCTSYECQPSRAESCGPVVV 1241
             ++ ++  +     NC S P  +          D++ QDDSCTS +  P+          
Sbjct: 387  FDDDESGGIGNCFNNCGSSPTLQEKIPHDERGGDHEVQDDSCTSSDTCPA---------- 436

Query: 1242 AAGSQGRRGAESENHHRKQPLHSGFEGFNNAG--SHEYGI--EHCDGVIWDLGYVSGSKG 1409
                   +   ++  H  Q L S F G NN     HE     EHCD  +WD+GY++ SK 
Sbjct: 437  ------TQVLPAKTEHCDQWL-SSFTGPNNGVGLGHEPSSVQEHCDAKVWDVGYLTCSKS 489

Query: 1410 DVDLVPTCHMIKEVFGEEAWNSKERDGRA 1496
            +VD +PT  MI+EVFG+++ N K RDG++
Sbjct: 490  EVDFLPTSSMIEEVFGDDSSNYKARDGKS 518


>ref|XP_006431271.1| hypothetical protein CICLE_v10011783mg [Citrus clementina]
           gi|557533328|gb|ESR44511.1| hypothetical protein
           CICLE_v10011783mg [Citrus clementina]
          Length = 430

 Score =  275 bits (702), Expect = 8e-71
 Identities = 146/298 (48%), Positives = 183/298 (61%), Gaps = 3/298 (1%)
 Frame = +3

Query: 6   SDGDGGFNKSECTER---VVSDIATDSVGADKEFNPNXXXXXXXXXXXXXXXXXXXXXXX 176
           SDGDG  N + C ++   +   +A  S G +KEF                          
Sbjct: 48  SDGDGEQNINRCRDQGRFLFGPVAEVSNGTEKEFE--IGSDCISPFLWANGLFAEGDANS 105

Query: 177 XXXQLSFCPFYFKPEHQTRGLVEHDDVYSFPFDFPPRKLVSIGSDHQADVPVWGLEGIKN 356
               LS  P YF  EHQ R  ++ D++YS   D PP K VSIG ++QADVP W L+G KN
Sbjct: 106 EEVYLSLFPEYFATEHQIRTFLQSDEIYSSHLDHPPVKSVSIGPEYQADVPEWCLQGSKN 165

Query: 357 NSDCLDVYDPDNAHLHTSGSNYKLDDDSGEEMIGTCVLPMPSSEALTSDGEKVGTARTDC 536
           +   LD  D        SGS   +DDD GE+++GTCV+ MP S    +   +    R DC
Sbjct: 166 SLAHLDGSDRQVRLERLSGSCLVVDDDQGEKLLGTCVISMPDSAPSANYYSQSLVTRNDC 225

Query: 537 CCLDMGSIRCVRQHITEAREKLRESLGLERFAVLGFHDMGEVVARKWNEEDERVFHEVVL 716
            CLD GSIRCVRQH+ EAREKLR +LG + F  LGFH+MGE V++ W +E+E  FHEVV 
Sbjct: 226 ECLDKGSIRCVRQHVMEAREKLRVNLGHKIFEELGFHEMGEEVSKNWTKEEENKFHEVVS 285

Query: 717 SHPASLGKNFWDHLYMVFPSRTKMELVSYYFNAFVLRRRAEQNRLDPMNIDSDNDEWQ 890
           S+P S+GKNFWD L +VFPSRTK ELVSYYFN F+L++RAEQNR DP+NIDSD+DEWQ
Sbjct: 286 SYPVSMGKNFWDRLSLVFPSRTKNELVSYYFNVFILQKRAEQNRFDPLNIDSDDDEWQ 343

