BLASTX nr result

ID: Paeonia25_contig00020351 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia25_contig00020351
         (2256 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266...   468   e-129
ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prun...   414   e-112
ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prun...   414   e-112
emb|CBI40243.3| unnamed protein product [Vitis vinifera]              408   e-111
ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citr...   401   e-109
ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Popu...   392   e-106
gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis]     389   e-105
ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Popu...   385   e-104
ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cuc...   374   e-100
ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207...   374   e-100
ref|XP_007010268.1| Enhancer of polycomb-like transcription fact...   372   e-100
ref|XP_007010267.1| Enhancer of polycomb-like transcription fact...   372   e-100
ref|XP_002532013.1| conserved hypothetical protein [Ricinus comm...   369   3e-99
ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutr...   326   2e-86
ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phas...   320   2e-84
ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597...   318   5e-84
ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597...   316   3e-83
ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263...   309   4e-81
ref|NP_196087.1| Enhancer of polycomb-like transcription factor ...   302   4e-79
ref|XP_002873159.1| hypothetical protein ARALYDRAFT_908352 [Arab...   296   4e-77

>ref|XP_002265036.1| PREDICTED: uncharacterized protein LOC100266152 [Vitis vinifera]
          Length = 791

 Score =  468 bits (1205), Expect = e-129
 Identities = 281/622 (45%), Positives = 372/622 (59%), Gaps = 25/622 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVF---GVIKGV-DGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNS 633
            MPSVGMRRT RVF      KG   GARVLRSGRRLW +SGEGKL +  D   WF+L+ NS
Sbjct: 1    MPSVGMRRTTRVFVPKTAAKGAAGGARVLRSGRRLWPDSGEGKLTRDAD---WFRLLHNS 57

Query: 634  GGGVRNYKG------NGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKFG 795
            GGG     G      NGWHE  SKQ+V ++D +   +V +      +   D  +   ++G
Sbjct: 58   GGGGGGAGGGGGLKENGWHEVNSKQEVDDVDAE--VAVSESRNVAGKCGDDQGSDYSRWG 115

Query: 796  NVYTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFV 975
             VY+R+ KRSD+K+   L   E +RG EDK FGI F R+++ K+       S + G V V
Sbjct: 116  IVYSRRTKRSDSKS---LLSPEKKRGFEDKRFGIRFSRKQRRKRME----ESEEGGYVCV 168

Query: 976  RKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSS 1155
                    M+ V + SS +   RFT+FLNSILGYM+R+ V+L  L  FL  + + D FSS
Sbjct: 169  E-------MVTVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLTWEPMMDAFSS 221

Query: 1156 HGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLHM 1335
            HG+ F R P    S GICKIFGAR+F P F+VDFSA+P CFMY+HSSMLLR     F+ +
Sbjct: 222  HGVRFLRDPPCARSFGICKIFGARRFIPLFSVDFSAVPSCFMYLHSSMLLRFGCLPFVLV 281

Query: 1336 TFLMGLDTDSK----TMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSMLNG---------- 1473
               M + ++ +    +                + +   +++  KR ML            
Sbjct: 282  NNSMSVCSNGEEPIDSEENLLCIPSKKDHFGSKSITLENDNSGKRRMLQPTIGTSRFSGR 341

Query: 1474 NLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVS 1653
            N Q+RNG+N               V N S + +++SN GALVS+     N  I FSS V 
Sbjct: 342  NAQWRNGVNSRSIQKRRSSQRSRRVRNPSLVGIHKSN-GALVSDFITNRNKGIPFSSVVY 400

Query: 1654 NHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE- 1830
            N + RRS    S  N++E+KST  +  E+I+S CCS NIL+VESD+CF RE GA++MLE 
Sbjct: 401  NQELRRSARHASATNIRELKSTSVVVKEEIDSVCCSANILIVESDRCF-RENGANVMLEV 459

Query: 1831 SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNK 2010
            S S +WFI VKK+G  +YSHKA+  MR        Y  +  T +MIW  ++ WKLEFPN+
Sbjct: 460  SASKEWFIAVKKDGSMKYSHKAEKDMR--------YASNRHTHAMIWNGEDGWKLEFPNR 511

Query: 2011 RDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVS 2190
            +DW+IFKEL+KECC+RNV A +VK IPVP + EV+DYGD    PF RPD+YI  K DEVS
Sbjct: 512  QDWMIFKELYKECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYIAFKNDEVS 571

Query: 2191 RALARRTSNYDMDGEDQEWLNR 2256
            RA+A+ T++YDMD ED+EWL +
Sbjct: 572  RAMAKTTASYDMDSEDEEWLKK 593


>ref|XP_007221419.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica]
            gi|462418131|gb|EMJ22618.1| hypothetical protein
            PRUPE_ppa001422mg [Prunus persica]
          Length = 832

 Score =  414 bits (1063), Expect = e-112
 Identities = 266/628 (42%), Positives = 356/628 (56%), Gaps = 31/628 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFG---VIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDE-WFKLIDNS 633
            MPSV MRRT RVFG   V  GVDGARVLRSGRRLW ES E KL++  +GDE W KL+ + 
Sbjct: 54   MPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMKSH 113

Query: 634  GG----GVRNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKFGNV 801
             G    G+ + K  G ++ GS ++   +    +   P+ ++ +     D +   +++G V
Sbjct: 114  AGESVVGLNHKKWAGANQVGSPRRNTPVLKTSLVKKPQSNELLA----DLLKEHKRYGIV 169

Query: 802  YTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRK 981
            YTRKRKR+ A   +FLG  E   G +D+M+G  F RR++ KK+  +     D    FV  
Sbjct: 170  YTRKRKRASA---SFLGNVEKENGSDDRMYGRRFARRQRMKKSKEL-----DSHPGFVCP 221

Query: 982  YVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHG 1161
             V     L  +V+SS A       FL S+L YM RA++ L E S FL  + I  +F+S+G
Sbjct: 222  EV-----LCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFLALEPIGSIFASYG 276

Query: 1162 IHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLHMTF 1341
            I FSR  +    SG+CK+FGA QF P F+VDFSA+P CFM+M +SM LR       H+T 
Sbjct: 277  IQFSRDRSCTRRSGVCKLFGAEQFIPLFSVDFSAVPGCFMFMQTSMHLRFR----CHLTV 332

Query: 1342 LMGLDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPC--------KRSMLNGNL------ 1479
               +D                     + +  GD+D           R  L+ ++      
Sbjct: 333  NNLIDGHENG----------------EFIDQGDDDDDGEKVDFIENRHALHSSVRVPKLA 376

Query: 1480 ----QYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSP 1647
                QYRNGL                  N S + + R  NGALVS L  I  + + FSS 
Sbjct: 377  CRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSL-RKPNGALVSELISIRKNGLPFSSV 435

Query: 1648 VSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIML 1827
             S H  R+SVS    GNLK    T   S  D++ST CS NIL  E DKC+ RE+GA++ML
Sbjct: 436  ESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILFTELDKCY-REDGATVML 494

Query: 1828 E-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTED----NDWK 1992
            E S S +W +VVKKNG TRY+HKA+ VMRPC        ++  TQ++IW+ D    N+WK
Sbjct: 495  EMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCS-------KNRITQAIIWSADSNGDNNWK 547

Query: 1993 LEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHL 2172
            LEFPN+ DW IFK+L+KEC +R V A  +K IPVP +REV  Y DS+S  F RP+SYI+L
Sbjct: 548  LEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLFDRPESYIYL 607

Query: 2173 KGDEVSRALARRTSNYDMDGEDQEWLNR 2256
              DEVSRA+A+RT+NYDMD +D+EWL +
Sbjct: 608  NDDEVSRAMAKRTANYDMDSDDEEWLKK 635


>ref|XP_007221418.1| hypothetical protein PRUPE_ppa001422mg [Prunus persica]
            gi|462418130|gb|EMJ22617.1| hypothetical protein
            PRUPE_ppa001422mg [Prunus persica]
          Length = 768

 Score =  414 bits (1063), Expect = e-112
 Identities = 266/628 (42%), Positives = 356/628 (56%), Gaps = 31/628 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFG---VIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDE-WFKLIDNS 633
            MPSV MRRT RVFG   V  GVDGARVLRSGRRLW ES E KL++  +GDE W KL+ + 
Sbjct: 54   MPSVEMRRTTRVFGMGMVKGGVDGARVLRSGRRLWPESSESKLERARNGDEDWLKLMKSH 113

Query: 634  GG----GVRNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKFGNV 801
             G    G+ + K  G ++ GS ++   +    +   P+ ++ +     D +   +++G V
Sbjct: 114  AGESVVGLNHKKWAGANQVGSPRRNTPVLKTSLVKKPQSNELLA----DLLKEHKRYGIV 169

Query: 802  YTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRK 981
            YTRKRKR+ A   +FLG  E   G +D+M+G  F RR++ KK+  +     D    FV  
Sbjct: 170  YTRKRKRASA---SFLGNVEKENGSDDRMYGRRFARRQRMKKSKEL-----DSHPGFVCP 221

Query: 982  YVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHG 1161
             V     L  +V+SS A       FL S+L YM RA++ L E S FL  + I  +F+S+G
Sbjct: 222  EV-----LCFSVESSWAQGYWAGRFLYSVLVYMTRASLGLTEFSEFLALEPIGSIFASYG 276

Query: 1162 IHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLHMTF 1341
            I FSR  +    SG+CK+FGA QF P F+VDFSA+P CFM+M +SM LR       H+T 
Sbjct: 277  IQFSRDRSCTRRSGVCKLFGAEQFIPLFSVDFSAVPGCFMFMQTSMHLRFR----CHLTV 332

Query: 1342 LMGLDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPC--------KRSMLNGNL------ 1479
               +D                     + +  GD+D           R  L+ ++      
Sbjct: 333  NNLIDGHENG----------------EFIDQGDDDDDGEKVDFIENRHALHSSVRVPKLA 376

Query: 1480 ----QYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSP 1647
                QYRNGL                  N S + + R  NGALVS L  I  + + FSS 
Sbjct: 377  CRSTQYRNGLTSRGIQKRRSSLRRRRSRNPSLVSL-RKPNGALVSELISIRKNGLPFSSV 435

Query: 1648 VSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIML 1827
             S H  R+SVS    GNLK    T   S  D++ST CS NIL  E DKC+ RE+GA++ML
Sbjct: 436  ESKHMLRKSVSLSLAGNLKAESLTIEGSKRDLDSTSCSANILFTELDKCY-REDGATVML 494

Query: 1828 E-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTED----NDWK 1992
            E S S +W +VVKKNG TRY+HKA+ VMRPC        ++  TQ++IW+ D    N+WK
Sbjct: 495  EMSSSGEWLLVVKKNGLTRYTHKAEKVMRPCS-------KNRITQAIIWSADSNGDNNWK 547

Query: 1993 LEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHL 2172
            LEFPN+ DW IFK+L+KEC +R V A  +K IPVP +REV  Y DS+S  F RP+SYI+L
Sbjct: 548  LEFPNRCDWAIFKDLYKECSDRVVPAPAIKFIPVPGVREVPGYADSHSTLFDRPESYIYL 607

Query: 2173 KGDEVSRALARRTSNYDMDGEDQEWLNR 2256
              DEVSRA+A+RT+NYDMD +D+EWL +
Sbjct: 608  NDDEVSRAMAKRTANYDMDSDDEEWLKK 635


>emb|CBI40243.3| unnamed protein product [Vitis vinifera]
          Length = 734

 Score =  408 bits (1049), Expect = e-111
 Identities = 240/552 (43%), Positives = 328/552 (59%), Gaps = 15/552 (2%)
 Frame = +1

Query: 646  RNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKFGNVYTRKRKRS 825
            R  + NGWHE  SKQ+V ++D +   +V +      +   D  +   ++G VY+R+ KRS
Sbjct: 11   RRCRLNGWHEVNSKQEVDDVDAE--VAVSESRNVAGKCGDDQGSDYSRWGIVYSRRTKRS 68

Query: 826  DAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKYVSHSVML 1005
            D+K+   L   E +RG EDK FGI F R+++ K+       S + G V V        M+
Sbjct: 69   DSKS---LLSPEKKRGFEDKRFGIRFSRKQRRKRME----ESEEGGYVCVE-------MV 114

Query: 1006 AVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGIHFSRGPT 1185
             V + SS +   RFT+FLNSILGYM+R+ V+L  L  FL  + + D FSSHG+ F R P 
Sbjct: 115  TVVIDSSRSGRCRFTSFLNSILGYMRRSRVRLWGLYEFLTWEPMMDAFSSHGVRFLRDPP 174

Query: 1186 HLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLHMTFLMGLDTDS 1365
               S GICKIFGAR+F P F+VDFSA+P CFMY+HSSMLLR     F+ +   M + ++ 
Sbjct: 175  CARSFGICKIFGARRFIPLFSVDFSAVPSCFMYLHSSMLLRFGCLPFVLVNNSMSVCSNG 234

Query: 1366 K----TMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSMLNG----------NLQYRNGLNX 1503
            +    +                + +   +++  KR ML            N Q+RNG+N 
Sbjct: 235  EEPIDSEENLLCIPSKKDHFGSKSITLENDNSGKRRMLQPTIGTSRFSGRNAQWRNGVNS 294

Query: 1504 XXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSVSS 1683
                          V N S + +++SN GALVS+     N  I FSS V N + RRS   
Sbjct: 295  RSIQKRRSSQRSRRVRNPSLVGIHKSN-GALVSDFITNRNKGIPFSSVVYNQELRRSARH 353

Query: 1684 CSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVV 1860
             S  N++E+KST  +  E+I+S CCS NIL+VESD+CF RE GA++MLE S S +WFI V
Sbjct: 354  ASATNIRELKSTSVVVKEEIDSVCCSANILIVESDRCF-RENGANVMLEVSASKEWFIAV 412

Query: 1861 KKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELH 2040
            KK+G  +YSHKA+  MR        Y  +  T +MIW  ++ WKLEFPN++DW+IFKEL+
Sbjct: 413  KKDGSMKYSHKAEKDMR--------YASNRHTHAMIWNGEDGWKLEFPNRQDWMIFKELY 464

Query: 2041 KECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVSRALARRTSNY 2220
            KECC+RNV A +VK IPVP + EV+DYGD    PF RPD+YI  K DEVSRA+A+ T++Y
Sbjct: 465  KECCDRNVEAPSVKIIPVPGVHEVTDYGDYKGDPFSRPDTYIAFKNDEVSRAMAKTTASY 524

Query: 2221 DMDGEDQEWLNR 2256
            DMD ED+EWL +
Sbjct: 525  DMDSEDEEWLKK 536


>ref|XP_006436656.1| hypothetical protein CICLE_v10030776mg [Citrus clementina]
            gi|568878428|ref|XP_006492195.1| PREDICTED:
            uncharacterized protein LOC102612244 [Citrus sinensis]
            gi|557538852|gb|ESR49896.1| hypothetical protein
            CICLE_v10030776mg [Citrus clementina]
          Length = 758

 Score =  401 bits (1031), Expect = e-109
 Identities = 253/603 (41%), Positives = 347/603 (57%), Gaps = 6/603 (0%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFK--LID--NS 633
            MPSVGMRRT RVFGV+KGVDGARVLRSGRRLW +SG+GKL++ N GD+W+   +I+  N 
Sbjct: 1    MPSVGMRRTTRVFGVVKGVDGARVLRSGRRLWPDSGDGKLRRTNYGDDWYHHPVINKKNG 60

Query: 634  GGGVRNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKFGNVYTRK 813
            G G    K NGW       +V   + D  K V    K    +K      D  +G VY+RK
Sbjct: 61   GPGGPKCKPNGWAAHLDDLKVYA-NNDEKKEVKMCKKVKEELK----GADLMYGIVYSRK 115

Query: 814  RKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKYVSH 993
            RKR+D +    L         E K +GI F RR++ KK+                K V  
Sbjct: 116  RKRNDGEKSKIL---------EKKKYGIQFSRRQRRKKSE---------------KIVPF 151

Query: 994  SVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGIHFS 1173
            SV   V ++SS +  L   +FL+S+LG M+RATV+L  L++FLLS++I+ VFS  GI FS
Sbjct: 152  SVF-GVGLESSSSGFL--VSFLSSVLGCMRRATVELPRLASFLLSETISGVFSLRGIRFS 208

Query: 1174 RGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLHMTFLMGL 1353
              P  +  +G+C+IFG  Q  P F++DFSA+P CFMY+H  ML+R  R   ++ +     
Sbjct: 209  WDPP-IARTGMCRIFGTMQLIPMFSLDFSAVPSCFMYIHHCMLVRFMRPPSVNSSASEDD 267

Query: 1354 DTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSMLNG-NLQYRNGLNXXXXXXXXXX 1530
             ++ + +                +         + S L   N+QYR+ LN          
Sbjct: 268  SSEEEDVDYVCESKTVTPVVDNSVNKVALHPSVRSSKLAARNVQYRSSLNSRAIQKRRSS 327

Query: 1531 XXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSVSSCSVGNLKEV 1710
                   N S L  ++  +GALVS+L      +I  SS VS  K R S+   SV ++KEV
Sbjct: 328  LRRRRARNPS-LIGSQKASGALVSDLTSCRKSSIPSSSAVSKSKLRSSLQHSSVLSIKEV 386

Query: 1711 KSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVVKKNGWTRYS 1887
             ST      D++ +CC  +ILV+ESD+C  R EGA+++LE S S +W +VVKK+G TRYS
Sbjct: 387  SSTVDSLMLDLDRSCCCVSILVMESDRCC-RVEGANVILEMSHSKEWHLVVKKDGETRYS 445

Query: 1888 HKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELHKECCERNVT 2067
             KAQ +MRP          +  T +++W  D++WKLEF N++DWL FK+L+KEC +RN  
Sbjct: 446  FKAQRIMRPSSF-------NRFTHAILWAGDDNWKLEFSNRQDWLNFKDLYKECSDRNAQ 498

Query: 2068 ATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVSRALARRTSNYDMDGEDQEW 2247
             +  K IP+P + EV  Y DSN+VPF RPDSYI +  DEVSRALA+RT+NYDMD ED+EW
Sbjct: 499  VSVSKVIPIPGVYEVLGYEDSNTVPFCRPDSYISVNVDEVSRALAKRTANYDMDSEDEEW 558

Query: 2248 LNR 2256
            L +
Sbjct: 559  LKK 561


>ref|XP_002315450.2| hypothetical protein POPTR_0010s24240g [Populus trichocarpa]
            gi|550330500|gb|EEF01621.2| hypothetical protein
            POPTR_0010s24240g [Populus trichocarpa]
          Length = 777

 Score =  392 bits (1006), Expect = e-106
 Identities = 249/625 (39%), Positives = 352/625 (56%), Gaps = 28/625 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLI------- 624
            MPSVG+RRT RVFGVIKGVDGARVLRSGRRLW ESG+GKL++ NDGDEW+  I       
Sbjct: 1    MPSVGLRRTTRVFGVIKGVDGARVLRSGRRLWQESGDGKLRRSNDGDEWYHTIIKNDNYQ 60

Query: 625  ---DNSGGGVRNYKGNGW-HEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKF 792
                N    ++  + +GW H+   K+ +      GV       K + R+K      ++KF
Sbjct: 61   TKNQNKNSDLKYKENSGWAHDDKLKKDL------GVVIAIAAPKRIKRVK-----SEKKF 109

Query: 793  GNVYTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVF 972
            G VY RKRKR        LGG++     EDK FGI F RR++    S+    S  +    
Sbjct: 110  GIVYRRKRKR--------LGGEKSEDS-EDKKFGIQFSRRQRR---SLDDESSESL---- 153

Query: 973  VRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFS 1152
                V    ++ +    S +++   + FL+S+L Y+KR  + L EL+ FLLS+ I+ VF+
Sbjct: 154  ----VCTPELVVLVEDFSSSSSNGLSCFLSSVLRYIKRVNLSLSELADFLLSEPISSVFA 209

Query: 1153 SHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLH 1332
            S+G+HF+R  +  +  GICK FG RQ  P F+VDFS++P CF++MH S+ +R +    + 
Sbjct: 210  SNGLHFARDLS-ADRIGICKFFGTRQLLPMFSVDFSSIPSCFVHMHLSLFVRFKFLSPIP 268

Query: 1333 MTFLMGLDTDSKTMXXXXXXXXXXXXXXR-----QLVAWGDED----------PCKRSML 1467
            +   +  D +   +              +     ++ A  + D            + S L
Sbjct: 269  VNNSLDEDDEDDDVMMSGSKVDQSCTTMKTDFALKITAVPEIDNSGSKAVVHPSVRASKL 328

Query: 1468 NG-NLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSS 1644
             G + QYRNGLN                 N +   +++++ GALVS+L       I FSS
Sbjct: 329  AGRSTQYRNGLNSRGIQKRRSSLRRGRPRNSAIAGLHKAS-GALVSDLISSRRKGIPFSS 387

Query: 1645 PVSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIM 1824
             VS +K RRSV S    N+KE+ S      +D+  + CS NILV ESD+C+ R EGA++M
Sbjct: 388  VVSKNKLRRSVRSSPAANIKEMNSAAVGVKKDMNMSSCSANILVSESDRCY-RIEGATVM 446

Query: 1825 LE-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEF 2001
             E + S +W +VVKK+G TRY+H AQ  MR C   +N +     T  +IWT D++WKLEF
Sbjct: 447  FEFTGSREWVLVVKKDGLTRYTHLAQKSMRTC--ASNRF-----THDIIWTGDDNWKLEF 499

Query: 2002 PNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGD 2181
            PN++DW IFKEL+KEC + NV A+  K I VP +REV  Y +    PF+RP +YI  + D
Sbjct: 500  PNRQDWFIFKELYKECSDCNVPASVSKVISVPGVREVLGYENGGGAPFLRPYAYISSEND 559

Query: 2182 EVSRALARRTSNYDMDGEDQEWLNR 2256
            EV+RALAR T++YDMD ED+EWL +
Sbjct: 560  EVARALARSTASYDMDSEDEEWLKK 584


>gb|EXC25392.1| hypothetical protein L484_016774 [Morus notabilis]
          Length = 795

 Score =  389 bits (1000), Expect = e-105
 Identities = 258/627 (41%), Positives = 340/627 (54%), Gaps = 30/627 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGGV 645
            MPSVGMRRT RVFGV+KGVDGARVLRSGRRLW +SGE KL++ +D  +WFK+    G G 
Sbjct: 1    MPSVGMRRTTRVFGVVKGVDGARVLRSGRRLWPDSGEVKLRRHSDVYDWFKI--GKGDGG 58

Query: 646  RNYKGNGW-HEFGSKQQ----VAEMDTDGVKSVPKLSKTVPRIKIDPVAG----DRKFGN 798
              Y  NGW H   SK +    VAE+        PK       + +D   G    DR FG 
Sbjct: 59   LGYDSNGWAHNTNSKPKKTPPVAEI------KAPKPEDNNRGVGVDLAHGGRRPDRMFGL 112

Query: 799  VYTRKRKRSDAK-------NPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTS---IVRAG 948
            VY+RKRK    +       N   LGG  G+R      +G  FVRR++ K  S      A 
Sbjct: 113  VYSRKRKNLAVRSSGNASVNSETLGGSVGKR------YGRRFVRRQRRKLNSGESFAVAD 166

Query: 949  SHDMGAVFVRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLS 1128
              D    F     + S +++V   SS          L SIL Y+ RA ++L +L AFL+S
Sbjct: 167  DSDSRLEF-----TPSEVVSVVFGSSMDRNFYAVGVLCSILVYLTRARLRLTDLFAFLVS 221

Query: 1129 KSITDVFSSHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLR 1308
            + I+ V SS GI+       +     CK+FGA +F P F VDFSA+PLCFM+MHS M  R
Sbjct: 222  EPISRVNSSCGINIFLDHPSIKRFASCKLFGAPEFVPLFCVDFSAIPLCFMHMHSCMFFR 281

Query: 1309 LERQLFLHMTFLMG----------LDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPCKR 1458
             +RQ  L     +           L +  K                   +A        +
Sbjct: 282  YKRQPSLAGNNEIDEMISDDEEDQLSSPGKDALESKPLLSAEANHSENRLASNPSFKASK 341

Query: 1459 SMLNGNLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGF 1638
                 N QYRNGL                  N S   V + NN AL+S+L     +++  
Sbjct: 342  FACRSN-QYRNGLISRGIQKRRSSLRRRKARNPSLCGVQKPNN-ALLSDLVSFRKNSVSL 399

Query: 1639 SSPVSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGAS 1818
            S   SN+K RRS+ S S   LKEV ST   S +D++ST C  N+L++E +KC+ RE G S
Sbjct: 400  SL-TSNNKLRRSLRSNSARKLKEVSSTVADSTQDMDSTSCCANVLIIEPEKCY-REGGFS 457

Query: 1819 IMLESCS-NQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKL 1995
            I+LES     W I VKK+G T+++HKA+ VMRPC   +N +     T  ++WT D+ WKL
Sbjct: 458  IVLESSPLGGWLIAVKKDGSTKFTHKAEKVMRPC--SSNRF-----THDIMWTADDGWKL 510

Query: 1996 EFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLK 2175
            EFPN++DWLIFK+L++EC +RN+ A  VK +P+P + EVS  GDS+   F RPDSYI +K
Sbjct: 511  EFPNRKDWLIFKDLYQECSDRNMLAPGVKVVPIPGVNEVSQKGDSHCTLFRRPDSYISVK 570

Query: 2176 GDEVSRALARRTSNYDMDGEDQEWLNR 2256
             DE+ RAL R+TSNYDMD ED+EWLN+
Sbjct: 571  DDELCRALKRKTSNYDMDLEDEEWLNK 597


>ref|XP_002311034.2| hypothetical protein POPTR_0008s02470g [Populus trichocarpa]
            gi|550332250|gb|EEE88401.2| hypothetical protein
            POPTR_0008s02470g [Populus trichocarpa]
          Length = 774

 Score =  385 bits (990), Expect = e-104
 Identities = 246/624 (39%), Positives = 338/624 (54%), Gaps = 27/624 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLI------- 624
            MPSVG+RRT RVF V+KGVDGARVLRSGRRLW ESG+GKL++ +DGDE ++ I       
Sbjct: 1    MPSVGLRRTTRVFSVVKGVDGARVLRSGRRLWPESGDGKLRRSSDGDELYQTIIKNTNNH 60

Query: 625  ---DNSGGGVRNYKGNGW-HEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKF 792
                NS   ++  + NGW H+   K+        G+       K + R+K +      KF
Sbjct: 61   IKNQNSNSNLKYKENNGWTHDVKLKKD------RGIVIAIAAPKKIKRVKSEK----EKF 110

Query: 793  GNVYTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVF 972
            G VY+RKRKR        LGG++     EDK FGI F RR++       R GS    ++ 
Sbjct: 111  GIVYSRKRKR--------LGGEKSENP-EDKKFGIQFSRRQRR------REGSESQESLV 155

Query: 973  VRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFS 1152
                   +  L   V+   ++    + FL+S+LG+  R ++ L EL+ FLLS  I+ VF+
Sbjct: 156  C------TPQLVALVEGCSSSNGWLSCFLSSVLGHAMRVSLSLSELADFLLSDPISSVFA 209

Query: 1153 SHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLH 1332
            S+G+HF R     +  GICK F  RQ  P F+VDFSA+P CF +MH S+ ++      + 
Sbjct: 210  SNGLHFVRDLPS-DRIGICKFFETRQLLPMFSVDFSAIPSCFAFMHLSLFVKFRCLSLIP 268

Query: 1333 MTFLMGLDTDSKTMXXXXXXXXXXXXXXRQ------LVAWGDEDPCK--------RSMLN 1470
            +   +  D D   +                      +V   D   C+         S L 
Sbjct: 269  VNNSVDGDDDDDEIMSESKGDQSCTSTKTDFTQKITVVPKTDSYGCRVVLHPSVRASKLT 328

Query: 1471 G-NLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSP 1647
            G N Q+RNGLN                 N S   ++++N GALVS+L       I FSS 
Sbjct: 329  GRNTQHRNGLNSRGIQKRRSSLRRGRPRNSSIGGLHKAN-GALVSDLISSRKIGIPFSSV 387

Query: 1648 VSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIML 1827
            VS  K RRS+ S    ++KE+        + +  + CS NIL+ E+D+C+ R EGA++ML
Sbjct: 388  VSKEKLRRSIQSSPAASIKELNCAAVGVKKGMNLSSCSANILITETDRCY-RIEGATVML 446

Query: 1828 E-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFP 2004
            E + S +W +VVKKNG TRYSH AQ +MR C         +  T  +IW  D++WKLEFP
Sbjct: 447  EFTDSKEWVLVVKKNGLTRYSHLAQKIMRTC-------VSNRFTHDIIWNGDDNWKLEFP 499

Query: 2005 NKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDE 2184
            N++DW IFKEL+KEC + NV A+  K IPVP +R V D GD  S PF RP +YI    DE
Sbjct: 500  NRQDWFIFKELYKECSDHNVPASVSKAIPVPGVRGVLDNGDCGSAPFSRPYAYISSNNDE 559

Query: 2185 VSRALARRTSNYDMDGEDQEWLNR 2256
            V+RAL+R T++YDMD ED+EWL +
Sbjct: 560  VARALSRSTASYDMDSEDEEWLKK 583


>ref|XP_004166800.1| PREDICTED: uncharacterized LOC101207239 [Cucumis sativus]
          Length = 819

 Score =  374 bits (960), Expect = e-100
 Identities = 243/637 (38%), Positives = 345/637 (54%), Gaps = 42/637 (6%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGGV 645
            MPS GMRRTR VFG++KG DGARVLRSGRRLW ESGE KLKK  D  +W+ +ID  G G 
Sbjct: 1    MPS-GMRRTR-VFGLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGG 58

Query: 646  RNYKGN---GWHEFGSKQ-------QVAEMDTDGVKSVPKLSKTVPRIKIDPVAG--DRK 789
             +  G     W +  + +        + E D   V  VP+  K  PRI  D  +   DR 
Sbjct: 59   GSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRM 118

Query: 790  FGNVYTRKRKRSDAKNPNFLGGKEGRRGLE-DKMFGIHFVRRKKSKKTSIVR----AGSH 954
            FG VY+RKRKR   ++       E    L  D+MFG+ F+RR++S+KT +      AG  
Sbjct: 119  FGKVYSRKRKRGRLEDGEVFDEMESDNVLSGDRMFGLRFIRRQRSRKTDVEHWESTAGGR 178

Query: 955  DMGAVFVRKYVSH------SVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSA 1116
                 F R+ + H      ++    +V   C     F+ F+ ++L + K   + + + SA
Sbjct: 179  TSNLHFHRQRILHPRDCALTIFAGSSVDGGC-----FSDFILTVLRHFKSPGLSVAKFSA 233

Query: 1117 FLLSKSITDVFSSHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSS 1296
            FLLS  I +VF+  G+ F +G       G+  IFG+RQ  P F +DFSA+PL FM+++S 
Sbjct: 234  FLLSNPINEVFALKGMRFLQGYPPTGCCGMFAIFGSRQSIPMFHLDFSAIPLPFMFLYSE 293

Query: 1297 MLLRLER----QLFLHMTFLMGLDTDSKT-MXXXXXXXXXXXXXXRQLVAWGDEDPCKRS 1461
            M LR+ R     ++ +    + + +DS+                 R+ +A+  + P  RS
Sbjct: 294  MFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLERKPMAFLFDRPKTRS 353

Query: 1462 MLNGN----------LQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLF 1611
            + + +          +QYRNG +                 + S   + +S     V ++ 
Sbjct: 354  VSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLSAMQKSIGPLAVDDV- 412

Query: 1612 GIMNDAIGFSSPVSNHKCRRSVSSC---SVGNLKEVKSTWTISGEDIESTCCSGNILVVE 1782
                  +G S P S   C R  SS    S G ++E  ST   S  D++S+CC  NIL+VE
Sbjct: 413  -----KLGVSFP-SGASCNRHKSSAVRDSAGRIRETNSTALRSAMDVDSSCCKANILIVE 466

Query: 1783 SDKCFHREEGASIMLE-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQ 1959
            +DKC  REEGA+I+LE S S +W +VVKK+G TRY+HKA+ VM+P     N +     T 
Sbjct: 467  ADKCL-REEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPS--SCNRF-----TH 518

Query: 1960 SMIWTEDNDWKLEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSV 2139
            +++W+ DN WKLEFPN+RDW IFK+L+KEC +RN+     K IPVP + EV DY DS+  
Sbjct: 519  AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578

Query: 2140 PFMRPDSYIHLKGDEVSRALARRTSNYDMDGEDQEWL 2250
             F RPD+YI +  DEV RA+ + T+NYDMD ED+EWL
Sbjct: 579  SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWL 615


>ref|XP_004140897.1| PREDICTED: uncharacterized protein LOC101207239 [Cucumis sativus]
          Length = 819

 Score =  374 bits (960), Expect = e-100
 Identities = 243/637 (38%), Positives = 345/637 (54%), Gaps = 42/637 (6%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGGV 645
            MPS GMRRTR VFG++KG DGARVLRSGRRLW ESGE KLKK  D  +W+ +ID  G G 
Sbjct: 1    MPS-GMRRTR-VFGLVKGSDGARVLRSGRRLWPESGEVKLKKSKDASDWYPIIDGRGNGG 58

Query: 646  RNYKGN---GWHEFGSKQ-------QVAEMDTDGVKSVPKLSKTVPRIKIDPVAG--DRK 789
             +  G     W +  + +        + E D   V  VP+  K  PRI  D  +   DR 
Sbjct: 59   GSGHGRLHGKWTQVRNVKPKRVVVVNIREDDDACVVKVPEPVKVFPRIGNDDKSSGVDRM 118

Query: 790  FGNVYTRKRKRSDAKNPNFLGGKEGRRGLE-DKMFGIHFVRRKKSKKTSIVR----AGSH 954
            FG VY+RKRKR   ++       E    L  D+MFG+ F+RR++S+KT +      AG  
Sbjct: 119  FGKVYSRKRKRGRLEDGEVFDEMESDNVLSGDRMFGLRFIRRQRSRKTDVEHWESTAGGR 178

Query: 955  DMGAVFVRKYVSH------SVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSA 1116
                 F R+ + H      ++    +V   C     F+ F+ ++L + K   + + + SA
Sbjct: 179  TSNLHFHRQRILHPRDCALTIFAGSSVDGGC-----FSDFILTVLRHFKSPGLSVAKFSA 233

Query: 1117 FLLSKSITDVFSSHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSS 1296
            FLLS  I +VF+  G+ F +G       G+  IFG+RQ  P F +DFSA+PL FM+++S 
Sbjct: 234  FLLSNPINEVFALKGMRFLQGYPPTGCCGMFAIFGSRQSIPMFHLDFSAIPLPFMFLYSE 293

Query: 1297 MLLRLER----QLFLHMTFLMGLDTDSKT-MXXXXXXXXXXXXXXRQLVAWGDEDPCKRS 1461
            M LR+ R     ++ +    + + +DS+                 R+ +A+  + P  RS
Sbjct: 294  MFLRVTRIQARLVYNNNQLDVDISSDSEEDSVEELHVPSPVSSLERKPMAFLFDRPKTRS 353

Query: 1462 MLNGN----------LQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLF 1611
            + + +          +QYRNG +                 + S   + +S     V ++ 
Sbjct: 354  VSHPSVRATRLGTRTMQYRNGFSSRGIRKRRSSLRIRRPRSHSLAAMQKSIGPLAVDDV- 412

Query: 1612 GIMNDAIGFSSPVSNHKCRRSVSSC---SVGNLKEVKSTWTISGEDIESTCCSGNILVVE 1782
                  +G S P S   C R  SS    S G ++E  ST   S  D++S+CC  NIL+VE
Sbjct: 413  -----KLGVSFP-SGASCNRHKSSAVRDSAGRIRETNSTALGSAMDVDSSCCKANILIVE 466

Query: 1783 SDKCFHREEGASIMLE-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQ 1959
            +DKC  REEGA+I+LE S S +W +VVKK+G TRY+HKA+ VM+P     N +     T 
Sbjct: 467  ADKCL-REEGANIVLEFSASCEWLLVVKKDGSTRYTHKAERVMKPS--SCNRF-----TH 518

Query: 1960 SMIWTEDNDWKLEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSV 2139
            +++W+ DN WKLEFPN+RDW IFK+L+KEC +RN+     K IPVP + EV DY DS+  
Sbjct: 519  AILWSIDNGWKLEFPNRRDWFIFKDLYKECSDRNIPCLIAKAIPVPRVSEVPDYVDSSGA 578

Query: 2140 PFMRPDSYIHLKGDEVSRALARRTSNYDMDGEDQEWL 2250
             F RPD+YI +  DEV RA+ + T+NYDMD ED+EWL
Sbjct: 579  SFQRPDTYISVNDDEVCRAMTKSTANYDMDSEDEEWL 615


>ref|XP_007010268.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 2 [Theobroma cacao] gi|508727181|gb|EOY19078.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 2 [Theobroma cacao]
          Length = 784

 Score =  372 bits (955), Expect = e-100
 Identities = 244/611 (39%), Positives = 348/611 (56%), Gaps = 14/611 (2%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKK-GNDGDEWFKLIDNSGGG 642
            MPSVGMRRT RVF ++K  + ARVLRSGRRLW +SGE K K+  N+GDE + L+  +   
Sbjct: 1    MPSVGMRRTTRVFRMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKAPKS 60

Query: 643  VRNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAG--DRKFGNVYTRKR 816
              N  G      G  +++      G +  P+      +      +G  D+ FG VYTRKR
Sbjct: 61   EVN--GVAAEVSGRPKRL------GNEENPRKQSRKMKAGAFNTSGSVDKMFGIVYTRKR 112

Query: 817  KRSDAKNPNFLGGK-EGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKYVSH 993
            KR+  +N +  G   +G  G            +K S++ +I    +++         V  
Sbjct: 113  KRNGVQNGHLSGNSGQGNYG------------KKISRRQAIENRNTNED--------VEE 152

Query: 994  SVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGIHFS 1173
              M +  V++       F+ FL  +LGY+KRA V+L EL+AFL+S+ I+ V+SS+G++F 
Sbjct: 153  PKMFSFVVENGDCNGC-FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSVYSSNGVNFF 211

Query: 1174 RGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLER-QLFLHMTFLMG 1350
             GP   N +GICK FGA+   P F++DFSA+P  F+YMH S +LRL+R Q+    +  + 
Sbjct: 212  WGPR--NRTGICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVLRLKRIQIVPVNSDEIV 269

Query: 1351 LDTDSKTMXXXXXXXXXXXXXXRQLVAWGD-------EDPCKRSMLNG-NLQYRNGLNXX 1506
             D++                     V   +           + S L G N Q RNGL+  
Sbjct: 270  SDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVLHPSVRASKLTGRNAQCRNGLSSR 329

Query: 1507 XXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSVSSC 1686
                           N S + ++++N GAL+S+L     + I FSS VS +K R SV + 
Sbjct: 330  SIQKRRSSLRRRRARNPSIVGIHKAN-GALMSDLISSRRNGIPFSSVVSKNKLRSSVRNS 388

Query: 1687 SVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVVK 1863
            SV N+ +V S+ +   ++++S+ CS NILV+E+D+C+ REEGA + LE S S +W +VVK
Sbjct: 389  SVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCY-REEGAIVTLELSASREWLLVVK 447

Query: 1864 KNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELHK 2043
            K   T+++ KA   MRP     N +     T ++IWT D++WKLEFPN++DW+IFK+L+K
Sbjct: 448  KGSSTKFACKADKFMRPS--SCNRF-----THAIIWTGDDNWKLEFPNRQDWIIFKDLYK 500

Query: 2044 ECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVSRALARRTSNYD 2223
            EC ERNV A+TVK IPVP + EV  Y D  SVPF RPD YI L GDEVSRALA+RT+NYD
Sbjct: 501  ECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGDEVSRALAKRTANYD 560

Query: 2224 MDGEDQEWLNR 2256
            MD ED+EWL +
Sbjct: 561  MDSEDEEWLKK 571


>ref|XP_007010267.1| Enhancer of polycomb-like transcription factor protein, putative
            isoform 1 [Theobroma cacao] gi|508727180|gb|EOY19077.1|
            Enhancer of polycomb-like transcription factor protein,
            putative isoform 1 [Theobroma cacao]
          Length = 767

 Score =  372 bits (955), Expect = e-100
 Identities = 244/611 (39%), Positives = 348/611 (56%), Gaps = 14/611 (2%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKK-GNDGDEWFKLIDNSGGG 642
            MPSVGMRRT RVF ++K  + ARVLRSGRRLW +SGE K K+  N+GDE + L+  +   
Sbjct: 1    MPSVGMRRTTRVFRMVKSSEVARVLRSGRRLWPDSGEAKPKRLANEGDENYNLMKKAPKS 60

Query: 643  VRNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAG--DRKFGNVYTRKR 816
              N  G      G  +++      G +  P+      +      +G  D+ FG VYTRKR
Sbjct: 61   EVN--GVAAEVSGRPKRL------GNEENPRKQSRKMKAGAFNTSGSVDKMFGIVYTRKR 112

Query: 817  KRSDAKNPNFLGGK-EGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKYVSH 993
            KR+  +N +  G   +G  G            +K S++ +I    +++         V  
Sbjct: 113  KRNGVQNGHLSGNSGQGNYG------------KKISRRQAIENRNTNED--------VEE 152

Query: 994  SVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGIHFS 1173
              M +  V++       F+ FL  +LGY+KRA V+L EL+AFL+S+ I+ V+SS+G++F 
Sbjct: 153  PKMFSFVVENGDCNGC-FSNFLILVLGYVKRAEVRLSELAAFLMSQPISSVYSSNGVNFF 211

Query: 1174 RGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLER-QLFLHMTFLMG 1350
             GP   N +GICK FGA+   P F++DFSA+P  F+YMH S +LRL+R Q+    +  + 
Sbjct: 212  WGPR--NRTGICKFFGAKDSIPLFSLDFSAVPRYFLYMHYSKVLRLKRIQIVPVNSDEIV 269

Query: 1351 LDTDSKTMXXXXXXXXXXXXXXRQLVAWGD-------EDPCKRSMLNG-NLQYRNGLNXX 1506
             D++                     V   +           + S L G N Q RNGL+  
Sbjct: 270  SDSEEDEPCVTSVVDVCKSTSGNAAVEIDNLGSKVVLHPSVRASKLTGRNAQCRNGLSSR 329

Query: 1507 XXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSVSSC 1686
                           N S + ++++N GAL+S+L     + I FSS VS +K R SV + 
Sbjct: 330  SIQKRRSSLRRRRARNPSIVGIHKAN-GALMSDLISSRRNGIPFSSVVSKNKLRSSVRNS 388

Query: 1687 SVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVVK 1863
            SV N+ +V S+ +   ++++S+ CS NILV+E+D+C+ REEGA + LE S S +W +VVK
Sbjct: 389  SVANVSDVGSSISDLMQNVDSSQCSANILVIEADRCY-REEGAIVTLELSASREWLLVVK 447

Query: 1864 KNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELHK 2043
            K   T+++ KA   MRP     N +     T ++IWT D++WKLEFPN++DW+IFK+L+K
Sbjct: 448  KGSSTKFACKADKFMRPS--SCNRF-----THAIIWTGDDNWKLEFPNRQDWIIFKDLYK 500

Query: 2044 ECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVSRALARRTSNYD 2223
            EC ERNV A+TVK IPVP + EV  Y D  SVPF RPD YI L GDEVSRALA+RT+NYD
Sbjct: 501  ECSERNVPASTVKAIPVPGVHEVPGYEDRRSVPFCRPDFYISLDGDEVSRALAKRTANYD 560

Query: 2224 MDGEDQEWLNR 2256
            MD ED+EWL +
Sbjct: 561  MDSEDEEWLKK 571


>ref|XP_002532013.1| conserved hypothetical protein [Ricinus communis]
            gi|223528325|gb|EEF30368.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 781

 Score =  369 bits (948), Expect = 3e-99
 Identities = 234/622 (37%), Positives = 340/622 (54%), Gaps = 25/622 (4%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLI------- 624
            MPSVGMRR+ RVFGV+KGVDGARVLRSGRRL   +GE K K+ NDGDEW   +       
Sbjct: 1    MPSVGMRRSTRVFGVVKGVDGARVLRSGRRLLIGAGENKFKRANDGDEWLHTMIKNHHHN 60

Query: 625  DNSGGGVRNYKGNGWHEFGSKQQVAEMDTD---------GVKSVPKLSKTVPRIKIDPVA 777
             N+   ++  K NGW +  ++  V+++  +         G  +  +++K V        +
Sbjct: 61   HNNSPIMKCNKENGWTQ--TQTHVSKLKKERPSPVALGVGAGAGNEVAKKVND------S 112

Query: 778  GDRKFGNVYTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHD 957
            G++ +G VY+RKR+R    +   + G+       +K FGI F RR++ +        S +
Sbjct: 113  GNKMWGIVYSRKRRRMSGIDKLEILGR-------NKKFGIQFSRRQRRRVLKDNEVESFE 165

Query: 958  MGAVFVRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSI 1137
                          +L + V  SC+++    +FL+ +LGY++R  + + EL  FLLS+S+
Sbjct: 166  ------------PALLGIIVDGSCSSSGLAASFLHLVLGYIRRTNLSIAELVPFLLSESV 213

Query: 1138 TDVFSSHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLER 1317
               F+S G+ F +  T  N +GICKIFG     P F++DFSA+P CF+ MH  +  R++ 
Sbjct: 214  KCAFASDGLRFLQDTT-ANRNGICKIFGGMSTVPIFSLDFSAVPFCFLCMHLRLAFRVKC 272

Query: 1318 QLFLHMTFLMGLDTDSKTMXXXXXXXXXXXXXXRQLVAW---GDEDPCKRSMLNGNL--- 1479
              F  +   +  D+  + +                 +     G +     S++   L   
Sbjct: 273  LSFEPVNNSLDEDSSQEVISESEEDHSCGLVRTDTFLLTDNSGGKVSLHPSLIASKLAGR 332

Query: 1480 --QYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVS 1653
              QYRN LN                 N S + ++++N GALVS+L     + I FS+ VS
Sbjct: 333  HSQYRNVLNSRGIQKRRSAFRRRRARNPSGVGIHKAN-GALVSDLISSRKNGIPFSTVVS 391

Query: 1654 NHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE- 1830
              K RRS+      NLKEV  T   +   ++S+ CS N+LV+ESD+C+ R  GA++ LE 
Sbjct: 392  KDKLRRSLRLTPAANLKEVNPTAVQTSRVMDSSSCSANLLVIESDRCY-RMVGATVALEI 450

Query: 1831 SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNK 2010
            S   +W +VVKK+G TR +H AQ  MRPC         +  T  +IWT D+ WKLEFPN+
Sbjct: 451  SDLKEWVLVVKKDGLTRCTHLAQKSMRPCSS-------NRITHDVIWTGDDSWKLEFPNR 503

Query: 2011 RDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVS 2190
            +DWLIFK+L+KEC +RNV A   K IPVP +REV  Y DS+S+PF R D+YI    DEV 
Sbjct: 504  QDWLIFKDLYKECYDRNVPAPISKAIPVPGVREVLGYEDSSSLPFSRQDAYISFNNDEVV 563

Query: 2191 RALARRTSNYDMDGEDQEWLNR 2256
            RAL +RT+NYDMD ED+EWL +
Sbjct: 564  RALTKRTANYDMDCEDEEWLKK 585


>ref|XP_006398922.1| hypothetical protein EUTSA_v10012741mg [Eutrema salsugineum]
            gi|557100012|gb|ESQ40375.1| hypothetical protein
            EUTSA_v10012741mg [Eutrema salsugineum]
          Length = 777

 Score =  326 bits (836), Expect = 2e-86
 Identities = 224/613 (36%), Positives = 316/613 (51%), Gaps = 16/613 (2%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGND--GDEWFKLIDNSGG 639
            MPSVGMRRT RVFGV+K  DGARVLRSGRR+W    E K+K+ +D    +W  L  + G 
Sbjct: 1    MPSVGMRRTTRVFGVVKAADGARVLRSGRRIWPNVDEPKVKRAHDVVDRDWNCLNPSKGK 60

Query: 640  GVR----NYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKI--DPVAGDRKFGNV 801
            G +       G G      ++  +E D   +    +  + V   +   D    D+ FG V
Sbjct: 61   GNKVSGGRSNGAGSRPCSPREISSEKDDKEIDFPVRKRRKVATAEAVGDEKTVDKLFGVV 120

Query: 802  YTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRK 981
            Y+RKRKR        L G+      E+ +  + F  R+K     +V              
Sbjct: 121  YSRKRKR--------LSGQSSDNRSEEPLRSLKFYCRRKRLSDRVVSPRR---------- 162

Query: 982  YVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHG 1161
               +  ++ + V +SC  +  F+T    ++ Y++R  + L  L++F LS+ I DVF+ HG
Sbjct: 163  --LYGPVITLTVDASCEESW-FSTVFVLVMRYVRRGQLGLSSLASFFLSQPINDVFADHG 219

Query: 1162 IHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLFLHMT- 1338
            + F   P  L+S G+CK FGA    P F+ DF+A+P CFM MH ++ LR+  + F  +  
Sbjct: 220  VRFLAEPP-LSSRGVCKFFGALNCLPLFSADFNAIPRCFMDMHFTLFLRVVPRSFAFVKK 278

Query: 1339 FLMGLDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSMLNG-NLQYRNGLNXXXXX 1515
             L  L+   +                R  V  G       S L G N QYR  L      
Sbjct: 279  SLYLLNNPVEESDSESEIVLSEPCNPRNGVVVGLHPSVTASKLTGGNAQYRGSLGFHSIQ 338

Query: 1516 XXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSV---SSC 1686
                        NLS   V++ +NG  VS L G   +    ++ VS+ K R SV   SS 
Sbjct: 339  KRRSSLRRRRARNLSH-GVHKPHNGTPVSELSGNWKNR---TTSVSSRKLRSSVLNNSSP 394

Query: 1687 SVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVVK 1863
            S   +  +    T   E+++S CCS NILV+ SD+C  REEG  +MLE S S +WF+V+K
Sbjct: 395  SSNGISTISKPRT--KEELDSLCCSANILVIGSDRCT-REEGCGVMLEFSSSKEWFVVIK 451

Query: 1864 KNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELHK 2043
            K+G  RY H+A+  MRPC    N +     TQS++W  DNDWKLEF +K+DWL FKE++ 
Sbjct: 452  KDGAIRYRHRARKTMRPC--SCNRF-----TQSIVWLGDNDWKLEFCDKQDWLGFKEIYN 504

Query: 2044 ECCERNVTATTVKNIPVPEIREVSDYGD--SNSVPFMRPDSYIHLKGDEVSRALARRTSN 2217
            EC ERN+     K IP+P +REVS Y +  ++   F+ P  YI +K DEV+RA+AR  + 
Sbjct: 505  ECYERNILEQNAKVIPIPGVREVSGYSEDIADFPSFVMPVPYISVKEDEVTRAMARNIAI 564

Query: 2218 YDMDGEDQEWLNR 2256
            YDMD ED+EWL R
Sbjct: 565  YDMDSEDEEWLER 577


>ref|XP_007145542.1| hypothetical protein PHAVU_007G247300g [Phaseolus vulgaris]
            gi|561018732|gb|ESW17536.1| hypothetical protein
            PHAVU_007G247300g [Phaseolus vulgaris]
          Length = 734

 Score =  320 bits (820), Expect = 2e-84
 Identities = 223/621 (35%), Positives = 314/621 (50%), Gaps = 24/621 (3%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGGV 645
            MP+ GMRRT RVFG+ KG D ARVLRSGRRLW +SGE K K+ +DGDEW           
Sbjct: 1    MPAAGMRRTTRVFGM-KGADTARVLRSGRRLWPDSGEVKTKRSSDGDEWAV--------- 50

Query: 646  RNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAGDRKFGNVYTRKRKRS 825
                        +  + A+MD          +   PR                T K KR 
Sbjct: 51   ------------TPAKAAKMD----------AVMTPR---------------GTAKGKRQ 73

Query: 826  DAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKYVSHSVML 1005
            +A         + R    D+ FGI +VRR+K  K    + GS        R       +L
Sbjct: 74   EAV-------VDARDSTVDRRFGIVYVRRRKGLK----KEGSR-------RSVEVSRCVL 115

Query: 1006 AVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGIHFSRGPT 1185
            +V V      +  F   L S++ Y KR  V  ++LS F +S ++  VF+S G+ F +GP 
Sbjct: 116  SVVVSRCAGKSALFLRLLASVVRYAKRVRVSPRKLSGFFMSGAVNGVFASQGMQFVKGPP 175

Query: 1186 HLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLR-LERQLFLHM------TFL 1344
             +NS GIC+ FG  +F P F+VDFSA+PLCF Y+HS+M  + + R LFL        + +
Sbjct: 176  AVNS-GICQFFGVTEFVPLFSVDFSAVPLCFEYLHSAMFFKSMLRSLFLVCNPINVRSDV 234

Query: 1345 MGLDTDSKTMXXXXXXXXXXXXXXRQL----------VAWGDEDPCKRSMLNG------N 1476
              +++D   +               +L          +   D    + S+ +       N
Sbjct: 235  EDMESDDDLLEYQNEKQISSNTFKGELSETVTVTSDVIEINDVLSLQSSVKSTTRAAGRN 294

Query: 1477 LQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSN 1656
             QYRN LN                 N S   + R  NGA+   L G       FS   S+
Sbjct: 295  GQYRNMLNSRGIQKRRSSLRKRKARNPSMGGLRR--NGAVAFELTGGRKGNNQFSGVTSS 352

Query: 1657 HKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-S 1833
             + R   +  + G+LKE  S    S E +  + CS N+LV E  +C HR EGA + LE S
Sbjct: 353  KRLRSLANGSTTGSLKEASSAIVDSKERLGLSSCSANLLVSEIHQC-HRVEGAIVTLEMS 411

Query: 1834 CSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKR 2013
             S +W + VKK+  TR + KA+ VMRPC   +N +     T +++++ DN WKLEF N++
Sbjct: 412  ASKEWLLTVKKDELTRSTFKAEKVMRPC--SSNRF-----THAIMYSLDNGWKLEFTNRQ 464

Query: 2014 DWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIHLKGDEVSR 2193
            DW +FK+L+K+C +RN+ +T  K IPVP +REVS Y +SNS PF RPD+YI + GDE++R
Sbjct: 465  DWNVFKDLYKKCSDRNIPSTAAKFIPVPGVREVSSYAESNSFPFHRPDTYISVFGDELTR 524

Query: 2194 ALARRTSNYDMDGEDQEWLNR 2256
            A+AR T+NYDMD ED+EWL +
Sbjct: 525  AMARTTANYDMDSEDEEWLKK 545


>ref|XP_006360531.1| PREDICTED: uncharacterized protein LOC102597035 isoform X2 [Solanum
            tuberosum]
          Length = 779

 Score =  318 bits (816), Expect = 5e-84
 Identities = 234/627 (37%), Positives = 321/627 (51%), Gaps = 32/627 (5%)
 Frame = +1

Query: 466  MPSVG-MRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGG 642
            MPSVG MRRT R+FG        RVLRSGRRL T    G+ K+   GDEW  L+DN GGG
Sbjct: 1    MPSVGGMRRTTRIFGT-------RVLRSGRRLSTP---GEAKRAKHGDEWIGLLDNVGGG 50

Query: 643  ----VRNYKGNGW--HEFGSKQQVAEMDTD-GVKSVPKL-SKTVPRIK-IDPVAG-DRKF 792
                    K NGW   E     +  EMD D   KS+ +L S   P ++ I P +  DR +
Sbjct: 51   GAADATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMW 110

Query: 793  GNVYTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVF 972
            G VYTRKRKR            +G+   + + +G  FVR+KK +       G  + G V 
Sbjct: 111  GLVYTRKRKR-------VADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDLGKSEDGQV- 162

Query: 973  VRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFS 1152
                 S  +++   V +S  +    +  LN IL Y++R+TV L+++  F+ SK + DV S
Sbjct: 163  -----SSGIVI---VNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPLRDVNS 214

Query: 1153 SHGIHFSRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLF-L 1329
              GI   + P  + + G C I G R   P F +DFS +P  F+Y+HSS+LLR     + L
Sbjct: 215  LQGILLFKTPRKIKT-GACVISGVRCSVPVFTLDFSTVPCFFLYLHSSLLLRFVPMSYAL 273

Query: 1330 HMTFLMGLD-----TDSKTMXXXXXXXXXXXXXXRQ----LVAWGDEDPCKRSMLNG--- 1473
             M   + +D      D + +               Q    +VA G  D  K  ++N    
Sbjct: 274  VMQPTVAIDEVTVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAYDSKKIEVVNPTVG 333

Query: 1474 -------NLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAI 1632
                   +LQ RN  N              +       +  ++  G L S+      D +
Sbjct: 334  LPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSS-----FGTQNATGVLTSDRLRFRRDGL 388

Query: 1633 GFSSPVSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEG 1812
             FSS   +++ R S    S  ++KE+KS      ++IEST CS N+LV+E DKC+ REEG
Sbjct: 389  RFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPDKCY-REEG 447

Query: 1813 ASIMLE-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDW 1989
            A I +E S + QW + VK  G  R++   + VMRPC         +  T  +IW  DN W
Sbjct: 448  AVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSS-------NRVTHDIIWVGDNGW 500

Query: 1990 KLEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYIH 2169
            KLEFP ++DWLIFKEL+K C +RNV    V  IPVP +REVS Y +SN   F RP SYI 
Sbjct: 501  KLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYIT 560

Query: 2170 LKGDEVSRALARRTSNYDMDGEDQEWL 2250
            +K DE++RALAR T+NYDMDG+D+EWL
Sbjct: 561  VKDDELARALARSTANYDMDGDDEEWL 587


>ref|XP_006360530.1| PREDICTED: uncharacterized protein LOC102597035 isoform X1 [Solanum
            tuberosum]
          Length = 781

 Score =  316 bits (810), Expect = 3e-83
 Identities = 234/628 (37%), Positives = 320/628 (50%), Gaps = 33/628 (5%)
 Frame = +1

Query: 466  MPSVG-MRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGG 642
            MPSVG MRRT R+FG        RVLRSGRRL T    G+ K+   GDEW  L+DN GGG
Sbjct: 1    MPSVGGMRRTTRIFGT-------RVLRSGRRLSTP---GEAKRAKHGDEWIGLLDNVGGG 50

Query: 643  ----VRNYKGNGW--HEFGSKQQVAEMDTD-GVKSVPKL-SKTVPRIK-IDPVAG-DRKF 792
                    K NGW   E     +  EMD D   KS+ +L S   P ++ I P +  DR +
Sbjct: 51   GAADATRCKKNGWLKKEVALNLEADEMDIDVDSKSMDELESPEAPVVETISPNSNIDRMW 110

Query: 793  GNVYTRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVF 972
            G VYTRKRKR            +G+   + + +G  FVR+KK +       G  + G V 
Sbjct: 111  GLVYTRKRKR-------VADSVKGKVLTDVRRYGKQFVRKKKVRSAYAKDLGKSEDGQV- 162

Query: 973  VRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFS 1152
                 S  +++   V +S  +    +  LN IL Y++R+TV L+++  F+ SK + DV S
Sbjct: 163  -----SSGIVI---VNTSYGSGYWVSCLLNCILMYLRRSTVSLQQIFGFINSKPLRDVNS 214

Query: 1153 SHGIHFSRGPTHLN-SSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQLF- 1326
              GI   +  T     +G C I G R   P F +DFS +P  F+Y+HSS+LLR     + 
Sbjct: 215  LQGILLFKDQTPRKIKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSSLLLRFVPMSYA 274

Query: 1327 LHMTFLMGLD-----TDSKTMXXXXXXXXXXXXXXRQ----LVAWGDEDPCKRSMLNG-- 1473
            L M   + +D      D + +               Q    +VA G  D  K  ++N   
Sbjct: 275  LVMQPTVAIDEVTVTNDKEIVSCLSPVTQSELDVNTQSGLDVVAPGAYDSKKIEVVNPTV 334

Query: 1474 --------NLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDA 1629
                    +LQ RN  N              +       +  ++  G L S+      D 
Sbjct: 335  GLPKLAARHLQPRNSRNIQKRRSSLRSMRGRHSS-----FGTQNATGVLTSDRLRFRRDG 389

Query: 1630 IGFSSPVSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREE 1809
            + FSS   +++ R S    S  ++KE+KS      ++IEST CS N+LV+E DKC+ REE
Sbjct: 390  LRFSSRTPHYELRSSRQKTSTPSVKELKSALVGLTQNIESTSCSANVLVIEPDKCY-REE 448

Query: 1810 GASIMLE-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDND 1986
            GA I +E S + QW + VK  G  R++   + VMRPC         +  T  +IW  DN 
Sbjct: 449  GAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSS-------NRVTHDIIWVGDNG 501

Query: 1987 WKLEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPDSYI 2166
            WKLEFP ++DWLIFKEL+K C +RNV    V  IPVP +REVS Y +SN   F RP SYI
Sbjct: 502  WKLEFPIRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVREVSGYAESNPPEFARPVSYI 561

Query: 2167 HLKGDEVSRALARRTSNYDMDGEDQEWL 2250
             +K DE++RALAR T+NYDMDG+D+EWL
Sbjct: 562  TVKDDELARALARSTANYDMDGDDEEWL 589


>ref|XP_004243418.1| PREDICTED: uncharacterized protein LOC101263728 [Solanum
            lycopersicum]
          Length = 790

 Score =  309 bits (791), Expect = 4e-81
 Identities = 225/631 (35%), Positives = 314/631 (49%), Gaps = 36/631 (5%)
 Frame = +1

Query: 466  MPSVG-MRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGNDGDEWFKLIDNSGGG 642
            MPSVG MRRT R+FG        RVLRSGRRL T     + K+   GDEW  L+DN GGG
Sbjct: 1    MPSVGGMRRTTRIFGT-------RVLRSGRRLSTSF---EAKRAKHGDEWIGLLDNVGGG 50

Query: 643  ------VRNYKGNGW--HEFGSKQQVAEMDTDGVKSVPKLSKTVPRIKIDPVAG----DR 786
                      K  GW   E     +  EM+ D         +TV    +D V+     DR
Sbjct: 51   GGAAADATRCKKKGWLKKEVALNLEADEMNIDVDSKSMDEQETVEAPVVDTVSPKSYIDR 110

Query: 787  KFGNVYTRKRKRSDAKNPNFLGGKEGRRGLEDKM-FGIHFVRRKKSKKTSIVRAGSHDMG 963
             +G VYTRKRKR D K  + + GK     L D M +G  F+R+KK +      +   + G
Sbjct: 111  MWGLVYTRKRKRVDLKRHDSVRGKV----LTDVMRYGKQFIRKKKHRSAYAKDSDKSEDG 166

Query: 964  AVFVRKYVSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITD 1143
                 ++ S  V+    V +S  +    +  LN +L Y++R+TV L+++  F+ SK + D
Sbjct: 167  -----QFSSDIVI----VNTSYGSGYWVSCLLNCMLMYLRRSTVSLQQIFGFINSKPLRD 217

Query: 1144 VFSSHGIHFSRGPTHLN-SSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRLERQ 1320
            V+S  GI   +  T     +G C I G R   P F +DFS +P  F+Y+HSS+LLR    
Sbjct: 218  VWSLQGILLLKDQTSRKIKTGACVISGVRCSVPVFTLDFSTVPCFFLYLHSSLLLRFVPM 277

Query: 1321 LFL----------HMTFLMGLDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSMLN 1470
             +            +T    ++  S                   +VA G  D  K  ++N
Sbjct: 278  SYALVMQPTVAIDEVTVTNDMELVSCLTPVTLSELDVNTQSGHDVVAPGAYDSKKIEVVN 337

Query: 1471 G----------NLQYRNGLNXXXXXXXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIM 1620
                       +LQ RN  N              +       +  ++ +G L S+     
Sbjct: 338  TTVGLPKSTARHLQPRNSRNIQKRRSSLRSMRGRHSS-----FGTQNASGVLTSDRLRFR 392

Query: 1621 NDAIGFSSPVSNHKCRRSVSSCSVGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFH 1800
             D + FSS   +++ R S    S+ ++KE+KS      ++IE+  CS NILV E DKC+ 
Sbjct: 393  RDGLRFSSRTPHYELRSSRQKTSMPSVKELKSALVRLTQNIETASCSANILVTEPDKCY- 451

Query: 1801 REEGASIMLE-SCSNQWFIVVKKNGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTE 1977
            REEGA I +E S + QW + VK  G  R++   + VMRPC         +  T  +IW  
Sbjct: 452  REEGAVIGMELSAAKQWILAVKIGGVRRFNLTTEKVMRPCSS-------NRVTHDLIWVG 504

Query: 1978 DNDWKLEFPNKRDWLIFKELHKECCERNVTATTVKNIPVPEIREVSDYGDSNSVPFMRPD 2157
            D+ WKLEFP+++DWLIFKEL+K C +RNV    V  IPVP + EVS Y +SN   F RP 
Sbjct: 505  DSGWKLEFPDRQDWLIFKELYKGCSDRNVQPPAVSIIPVPGVSEVSGYAESNPPFFARPV 564

Query: 2158 SYIHLKGDEVSRALARRTSNYDMDGEDQEWL 2250
            SYI +K DE++RALAR T+NYDMDG+D+EWL
Sbjct: 565  SYITVKDDELARALARSTANYDMDGDDEEWL 595


>ref|NP_196087.1| Enhancer of polycomb-like transcription factor protein [Arabidopsis
            thaliana] gi|7413529|emb|CAB86009.1| putative protein
            [Arabidopsis thaliana] gi|332003387|gb|AED90770.1|
            Enhancer of polycomb-like transcription factor protein
            [Arabidopsis thaliana]
          Length = 766

 Score =  302 bits (774), Expect = 4e-79
 Identities = 221/613 (36%), Positives = 307/613 (50%), Gaps = 16/613 (2%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGN-----DGDEWFKLIDN 630
            MPSVGMRRT RVFGV+K  DGARVLRSGRR+W   GE K+++ +     D D   K   N
Sbjct: 1    MPSVGMRRTTRVFGVVKAADGARVLRSGRRIWPNVGEPKVRRAHDVVDRDCDSVLK-NQN 59

Query: 631  SGGGVRNYKGNGWHEFGSKQQVAEMDTDGVKSVPKLSKTVPRIK--IDPVAGDRKFGNVY 804
               G +   G    +  S +QV+    D V   P   +   R +   D    D+ FG VY
Sbjct: 60   KSKGNKVSSGKSNSQPCSPKQVSSEKEDKVDDFPVTKRRKVRNEGVGDEKTVDKMFGIVY 119

Query: 805  TRKRKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKY 984
            +RKRKR        L         E+ +  + F RR++     +                
Sbjct: 120  SRKRKR--------LCEPSSSDRSEEPLRSLKFYRRRRKLSQRV---------------- 155

Query: 985  VSHSVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGI 1164
               S +L + V  SC     F T     + Y++R  ++L  L++F LS+ I  VF+ HG+
Sbjct: 156  ---SSVLTLTVDWSCEDCW-FLTVFGLAMRYIRREELRLSSLASFFLSQPINQVFADHGV 211

Query: 1165 HF-SRGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLR-LERQLFLHMT 1338
             F  R P  L+S G+CK FGA    P F+ DF+ +P  FM MH ++ +R L R  F    
Sbjct: 212  RFLVRSP--LSSRGVCKFFGAMSCLPLFSADFAVIPRWFMDMHFTLFVRVLPRSFFFVEK 269

Query: 1339 FLMGLDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSML-NGNLQYRNGLNXXXXX 1515
             L  L+   +                R  V  G     + S L  GN QYR  L      
Sbjct: 270  SLYLLNNPIEESDSESELALPEPCTPRNGVVVGLHPSVRASKLTGGNAQYRGNLGSHSFQ 329

Query: 1516 XXXXXXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSV--SSCS 1689
                        NLS    ++ NNG  V ++ G   +    ++ VS+ K R SV  +S  
Sbjct: 330  KRRSSLRRRRARNLS-HNAHKLNNGTPVFDISGSRKNR---TAAVSSKKLRSSVLSNSSP 385

Query: 1690 VGNLKEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVVKK 1866
            V N   +    T + E+++S CCS NIL++ SD+C  REEG S+MLE S S +WF+V+KK
Sbjct: 386  VSNGISI-IPMTKTKEELDSICCSANILMIHSDRC-TREEGFSVMLEASSSKEWFLVIKK 443

Query: 1867 NGWTRYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELHKE 2046
            +G  RYSH AQ  MRP       +  +  T + +W   ++WKLEF +++DWL FK+++KE
Sbjct: 444  DGAIRYSHMAQRTMRP-------FSSNRITHATVWMGGDNWKLEFCDRQDWLGFKDIYKE 496

Query: 2047 CCERNVTATTVKNIPVPEIREVSDYGD--SNSVPFMRPD-SYIHLKGDEVSRALARRTSN 2217
            C ERN+   +VK IP+P +REV  Y +   N   F RP  SYI +  DEVSRA+AR  + 
Sbjct: 497  CYERNLLEQSVKVIPIPGVREVCGYAEYIDNFPSFSRPPVSYISVNEDEVSRAMARSIAL 556

Query: 2218 YDMDGEDQEWLNR 2256
            YDMD ED+EWL R
Sbjct: 557  YDMDSEDEEWLER 569


>ref|XP_002873159.1| hypothetical protein ARALYDRAFT_908352 [Arabidopsis lyrata subsp.
            lyrata] gi|297318996|gb|EFH49418.1| hypothetical protein
            ARALYDRAFT_908352 [Arabidopsis lyrata subsp. lyrata]
          Length = 766

 Score =  296 bits (757), Expect = 4e-77
 Identities = 214/608 (35%), Positives = 300/608 (49%), Gaps = 11/608 (1%)
 Frame = +1

Query: 466  MPSVGMRRTRRVFGVIKGVDGARVLRSGRRLWTESGEGKLKKGND--GDEWFKLIDNSGG 639
            MPSVGMRRT RVFGV+K  DGARVLRSGRR+W   GE K+++ +D    +   ++ N   
Sbjct: 1    MPSVGMRRTTRVFGVVKAADGARVLRSGRRIWPNVGEPKVRRAHDVVDRDCDSVLKNQNK 60

Query: 640  GVRNYKGNGWHEFGSKQQVAEMDTDGVKSVP--KLSKTVPRIKIDPVAGDRKFGNVYTRK 813
               N       +  S +QV+    D V   P  K  K       D    D+ FG VY+RK
Sbjct: 61   TKGNKVSGSNSQPCSPRQVSSEKEDKVDDFPVRKRRKVRNEGVGDEKTVDKMFGIVYSRK 120

Query: 814  RKRSDAKNPNFLGGKEGRRGLEDKMFGIHFVRRKKSKKTSIVRAGSHDMGAVFVRKYVSH 993
            RKR    + +           E  +  + F RR++     +                   
Sbjct: 121  RKRLSEPSSD---------RSEVPLRSLKFYRRRRRLSQRV------------------- 152

Query: 994  SVMLAVAVQSSCATTLRFTTFLNSILGYMKRATVKLKELSAFLLSKSITDVFSSHGIHFS 1173
            S +L + V  SC      + F    + Y +R  ++L  L+ F LS+ I  VF+ HG+ F 
Sbjct: 153  SSVLTLTVDWSCEDCWLLSVF-GLAMRYTRREELRLSSLADFFLSQPINQVFADHGVRFL 211

Query: 1174 RGPTHLNSSGICKIFGARQFSPSFAVDFSALPLCFMYMHSSMLLRL-ERQLFLHMTFLMG 1350
              P  L+S G+CK FGA    P F+ DF+ +P  FM M  ++  R+  R  F     L  
Sbjct: 212  LKPP-LSSRGVCKFFGAMNCLPLFSADFAVIPQWFMDMQFTLFRRVAPRSFFFVEKSLYL 270

Query: 1351 LDTDSKTMXXXXXXXXXXXXXXRQLVAWGDEDPCKRSML-NGNLQYRNGLNXXXXXXXXX 1527
            L+   +                R     G     + S L  GN QYR  L          
Sbjct: 271  LNNPIEESDSEPELALPEPCTPRNGGVVGLHPSVRASKLTGGNAQYRGNLGSHSFQKRRS 330

Query: 1528 XXXXXNVGNLSCLYVNRSNNGALVSNLFGIMNDAIGFSSPVSNHKCRRSV--SSCSVGNL 1701
                    NLS    ++ NNG  V ++ G   +    ++ VS+ K R SV  +S  V N 
Sbjct: 331  SLRRRRARNLS-HNAHKLNNGTPVFDISGSRKNR---TAAVSSRKLRSSVLSNSSPVSNG 386

Query: 1702 KEVKSTWTISGEDIESTCCSGNILVVESDKCFHREEGASIMLE-SCSNQWFIVVKKNGWT 1878
              +    T + E+++S CCS NIL++ SD+C  REEG ++MLE S S +WF+V+KK+G  
Sbjct: 387  ISI-IPLTKTKEELDSLCCSANILMIHSDRC-TREEGFAVMLEASSSKEWFLVIKKDGAI 444

Query: 1879 RYSHKAQTVMRPCPHGTNGYYRDMATQSMIWTEDNDWKLEFPNKRDWLIFKELHKECCER 2058
            RYSH+AQ  MRPC         +  T + +W   ++WKLEF +++DWL FK+++KEC ER
Sbjct: 445  RYSHRAQRTMRPCS-------CNRITHATVWMGGDNWKLEFCDRQDWLGFKDIYKECYER 497

Query: 2059 NVTATTVKNIPVPEIREVSDYGD--SNSVPFMRPDSYIHLKGDEVSRALARRTSNYDMDG 2232
            NV   +VK IP+P +REV  Y +   N   F RP SYI +  DEVSRA+AR  + YDMD 
Sbjct: 498  NVLEQSVKVIPIPGVREVCGYAEYIDNFPSFSRPVSYISVNEDEVSRAMARGIALYDMDS 557

Query: 2233 EDQEWLNR 2256
            ED+EWL R
Sbjct: 558  EDEEWLER 565


Top