BLASTX nr result

ID: Mentha29_contig00009141 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00009141
         (2069 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU39705.1| hypothetical protein MIMGU_mgv1a004483mg [Mimulus...   438   e-120
ref|XP_006472760.1| PREDICTED: transcriptional regulator ATRX ho...   308   8e-81
ref|XP_006434168.1| hypothetical protein CICLE_v10000938mg [Citr...   306   3e-80
ref|XP_002284460.1| PREDICTED: uncharacterized protein LOC100259...   306   3e-80
ref|XP_002300995.2| hypothetical protein POPTR_0002s08550g [Popu...   302   3e-79
ref|XP_006351897.1| PREDICTED: lisH domain-containing protein C1...   300   1e-78
ref|XP_003522580.1| PREDICTED: transcriptional regulator ATRX ho...   298   8e-78
ref|XP_002516334.1| conserved hypothetical protein [Ricinus comm...   296   2e-77
ref|XP_007019032.1| Uncharacterized protein isoform 2 [Theobroma...   293   2e-76
ref|XP_006581582.1| PREDICTED: DNA ligase 1-like isoform X2 [Gly...   290   1e-75
ref|XP_003527934.1| PREDICTED: DNA ligase 1-like isoform X1 [Gly...   290   1e-75
ref|XP_007019033.1| Uncharacterized protein isoform 3 [Theobroma...   288   5e-75
ref|XP_007222349.1| hypothetical protein PRUPE_ppa004840mg [Prun...   283   2e-73
ref|XP_004500560.1| PREDICTED: transcriptional regulator ATRX ho...   283   3e-73
ref|XP_004169339.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   281   8e-73
ref|XP_004145363.1| PREDICTED: uncharacterized protein LOC101217...   281   8e-73
ref|XP_004290855.1| PREDICTED: uncharacterized protein LOC101302...   275   6e-71
ref|XP_006434169.1| hypothetical protein CICLE_v10000938mg [Citr...   274   1e-70
ref|XP_007137404.1| hypothetical protein PHAVU_009G124200g [Phas...   273   3e-70
ref|XP_006578974.1| PREDICTED: transcriptional regulator ATRX ho...   270   2e-69

>gb|EYU39705.1| hypothetical protein MIMGU_mgv1a004483mg [Mimulus guttatus]
          Length = 525

 Score =  438 bits (1127), Expect = e-120
 Identities = 252/441 (57%), Positives = 294/441 (66%), Gaps = 19/441 (4%)
 Frame = +2

Query: 38   MAEGDGEKGAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 217
            MAE +GEK   E QLE AV +RLQHFKDQADSLTLESVRRLLEKDLGLEK ALDAHKRFI
Sbjct: 1    MAE-EGEKQGIEQQLEHAVCSRLQHFKDQADSLTLESVRRLLEKDLGLEKFALDAHKRFI 59

Query: 218  RHYLEKIMDGADESNSSPATVNM-EGGVLLSXXXXXXXXXXXXANSESKKASTGNKETME 394
            RHYLEK M+ AD+        N  E  V LS            +N++ KK+STG++E ME
Sbjct: 60   RHYLEKKMEDADDCKPETEKENENEKDVHLSKEDATILPKQNESNNDLKKSSTGDEEMME 119

Query: 395  DSPIMGVLTPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574
            DSPIMGVLTPKSE+G Q  LSES I+KAILERADH  ANS+ ++L GVRRLLEEDLGLDK
Sbjct: 120  DSPIMGVLTPKSEIGAQGPLSESRIEKAILERADHFLANSENLTLAGVRRLLEEDLGLDK 179

Query: 575  NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKV----NXXXXXXXXXXXX 742
            N LD +K  IS+Q+D VL            + +  +S KSKKV    +            
Sbjct: 180  NDLDPFKKFISQQIDQVLNPPKATKSVKNVKKKTSESLKSKKVKTVSSEEGSESLPSESD 239

Query: 743  XXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK-----------QIEED 889
                K K +KE+  RKN KK EQP+KR++    D+D+S KKP K             EED
Sbjct: 240  EMEDKVKSKKESASRKNSKKLEQPKKRKS----DLDVSAKKPSKLQKRQKEEDNDSKEED 295

Query: 890  NNSDEGGSISEDGQSQLS---LEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYK 1060
            NNS E GS+SEDGQSQ S   LEKPA RKEK  P YGK+VENLKSIIKACGMS+PP IYK
Sbjct: 296  NNSGEDGSLSEDGQSQSSVEKLEKPAQRKEKPVPAYGKKVENLKSIIKACGMSIPPVIYK 355

Query: 1061 KVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXX 1240
            K K VPD+KRE ++++ELEGIL REGLSKNP+EKEIKDC+K+KE A+ELEGID       
Sbjct: 356  KAKQVPDNKREAVIIQELEGILLREGLSKNPSEKEIKDCKKRKETARELEGIDMSNIISS 415

Query: 1241 XXXXXXXXXVAPERPVVRAKK 1303
                      AP +P  RAKK
Sbjct: 416  SRRRSTFSFGAPAKPEARAKK 436


>ref|XP_006472760.1| PREDICTED: transcriptional regulator ATRX homolog [Citrus sinensis]
          Length = 497

 Score =  308 bits (788), Expect = 8e-81
 Identities = 178/431 (41%), Positives = 252/431 (58%), Gaps = 15/431 (3%)
 Frame = +2

Query: 65   AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 244
            + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE  ALD HK+FI+  L + MD
Sbjct: 19   SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78

Query: 245  GADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTP 424
            GA   ++S  +       + S            +  + K+    N E MEDSP++G++T 
Sbjct: 79   GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138

Query: 425  KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574
              +           G +   SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK
Sbjct: 139  NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198

Query: 575  NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSR---KSKKVNXXXXXXXXXXXXX 745
             TLD++K +IS+++D VL            + + +K     K+K+V+             
Sbjct: 199  FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258

Query: 746  XXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 919
               + K RK+   +  ++  E  +KR+  E       +KK K  K   EDNN  E GS+S
Sbjct: 259  EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318

Query: 920  EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099
            +DG SQ S EKP  +K  S P YGKRVE+LK++IK+CGMS+PP++YKKVK  P++KRE  
Sbjct: 319  DDGHSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCGMSIPPSVYKKVKQAPENKREAQ 378

Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPE 1279
            L+KELEGILSREGLS NP+EKEIK+ +KKKERA+ELEGID                V P 
Sbjct: 379  LIKELEGILSREGLSSNPSEKEIKEVKKKKERARELEGIDMSNIVSSSRRRSATSFVPPP 438

Query: 1280 RPVVRAKKYKG 1312
            +P +  +   G
Sbjct: 439  KPKIPDESESG 449


>ref|XP_006434168.1| hypothetical protein CICLE_v10000938mg [Citrus clementina]
            gi|557536290|gb|ESR47408.1| hypothetical protein
            CICLE_v10000938mg [Citrus clementina]
          Length = 497

 Score =  306 bits (783), Expect = 3e-80
 Identities = 177/431 (41%), Positives = 252/431 (58%), Gaps = 15/431 (3%)
 Frame = +2

Query: 65   AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 244
            + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE  ALD HK+FI+  L + MD
Sbjct: 19   SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78

Query: 245  GADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTP 424
            GA   ++S  +       + S            +  + K+    N E MEDSP++G++T 
Sbjct: 79   GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138

Query: 425  KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574
              +           G +   SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK
Sbjct: 139  NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198

Query: 575  NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSR---KSKKVNXXXXXXXXXXXXX 745
             TLD++K +IS+++D VL            + + +K     K+K+V+             
Sbjct: 199  FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258

Query: 746  XXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 919
               + K RK+   +  ++  E  +KR+  E       +KK K  K   EDNN  E GS+S
Sbjct: 259  EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318

Query: 920  EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099
            +DG+SQ S EKP  +K  S P YGKRVE+LK++IK+C MS+PP++YKKVK  P++KRE  
Sbjct: 319  DDGRSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCAMSIPPSVYKKVKQAPENKREAQ 378

Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPE 1279
            L+KELEGILSREGLS NP+EKEIK+ +KKKERA+ELEGID                V P 
Sbjct: 379  LIKELEGILSREGLSSNPSEKEIKEVKKKKERARELEGIDMSNIVSSSRRRSATSFVPPP 438

Query: 1280 RPVVRAKKYKG 1312
            +P +  +   G
Sbjct: 439  KPKIPDESESG 449


>ref|XP_002284460.1| PREDICTED: uncharacterized protein LOC100259114 [Vitis vinifera]
            gi|302141832|emb|CBI19035.3| unnamed protein product
            [Vitis vinifera]
          Length = 502

 Score =  306 bits (783), Expect = 3e-80
 Identities = 189/444 (42%), Positives = 260/444 (58%), Gaps = 19/444 (4%)
 Frame = +2

Query: 17   QLPPAELMAEGDGEKGA-FELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLA 193
            ++  +E + +G  E+    E Q++ A+S+R+ HFK+QADSLT E VRRLLEKDLGLE  A
Sbjct: 4    EMQDSEPITKGTEEEAQEIESQIKAAMSSRVGHFKEQADSLTFEGVRRLLEKDLGLETYA 63

Query: 194  LDAHKRFIRHYLEKIMDGADESNSSPATVNMEG-GVLLSXXXXXXXXXXXXANSESKKAS 370
            LD HKRF++ +L + ++ A + N S  +    G  V  +            +  + K+ S
Sbjct: 64   LDVHKRFVKQFLLECINAAADDNPSKKSGETRGKNVCSTKGEAAEPPETVKSKKDVKEPS 123

Query: 371  TGNKETMEDSPIMGVLT----PKSEVGTQSSL------SESTIKKAILERADHLQANSDK 520
            +G++E +E SP++G++T     KSE             SESTI+KAI +RA + +A S+ 
Sbjct: 124  SGDEEKIEGSPVLGLMTGQKIAKSETEETQGKENKEVPSESTIRKAIRKRASYFKAKSEN 183

Query: 521  ISLGGVRRLLEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVK----SR 688
            I++ GVRR+LEEDL LDK TLD YK  IS Q+D VL            +    K    SR
Sbjct: 184  ITMAGVRRVLEEDLKLDKKTLDPYKKFISEQLDEVLKSPQVSKPTTGVKKGSPKKNSHSR 243

Query: 689  KSKKVNXXXXXXXXXXXXXXXXKEKLRKEA--GLRKNIKKFEQPRKRRNSENADMDISRK 862
             S+K +                 +   K A  G  +N +   + RKR  +E       R 
Sbjct: 244  ASRKTSSEGSSESLESESDEEEVKPKTKMAPKGKTQNSEDLRK-RKRPVTETKMPSKKRS 302

Query: 863  KPKKQIEEDNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMS 1039
            K  + + EDN+ +++ G++S+DG SQ S EKP  RKE SAP YGKRVENLKSIIK+C MS
Sbjct: 303  KTAETVSEDNSDAEDSGNVSDDGHSQSSSEKPVKRKEVSAPAYGKRVENLKSIIKSCAMS 362

Query: 1040 VPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219
            VPP++YK+VK  P++KRE  L+KELE ILS+EGLSKNP+EK+IK+ RKKKERAKELEGID
Sbjct: 363  VPPSVYKRVKQAPENKREAHLIKELEEILSKEGLSKNPSEKDIKEVRKKKERAKELEGID 422

Query: 1220 XXXXXXXXXXXXXXXXVAPERPVV 1291
                            VAP +P +
Sbjct: 423  TSNIVLSSRRRSTRSFVAPPKPKI 446


>ref|XP_002300995.2| hypothetical protein POPTR_0002s08550g [Populus trichocarpa]
            gi|550344567|gb|EEE80268.2| hypothetical protein
            POPTR_0002s08550g [Populus trichocarpa]
          Length = 476

 Score =  302 bits (774), Expect = 3e-79
 Identities = 185/420 (44%), Positives = 248/420 (59%), Gaps = 13/420 (3%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E Q++ A+ +R+ HFK QADSLT E VRRLLEKDLGL+KLALD HKRF++  L + +DGA
Sbjct: 25   ESQVKEAMLSRVSHFKKQADSLTFEGVRRLLEKDLGLDKLALDVHKRFVKQCLFECLDGA 84

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL---- 418
               N+S  + +     + S              +  K+  + ++E MEDSP+MG+L    
Sbjct: 85   VTDNASKDSGDTVEKHVDSPKEVTESPERRDLKNNIKEPCSEDEEKMEDSPVMGLLSGQK 144

Query: 419  TPKSEV-GTQSSL-----SESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580
            T KS+   TQ++      SE +IKKA++ RA +++ANS++I++ G+RRLLEEDL LDK +
Sbjct: 145  TTKSKAKDTQANEVKEVPSEGSIKKAMMRRASYIKANSEEITMAGLRRLLEEDLKLDKFS 204

Query: 581  LDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXXXKE 760
            LD YK  IS+Q+D V               ED + +  KK                    
Sbjct: 205  LDPYKKFISKQLDEVSSRESADSSDKESEEEDEEVKPKKK-------------------- 244

Query: 761  KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRK--KPKKQIEEDNNSDE-GGSISEDGQ 931
                + G+ + ++  E  +KRR +E      + K  KP +   EDN+  E  G+ SED  
Sbjct: 245  ----KIGVERKMQNSEGSKKRRRTEKETKVSANKRIKPLETAAEDNSDSEVSGNASEDNN 300

Query: 932  SQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVKE 1111
            S  S EKP  +KE S P YGKRVE+LKS+IK+CGMSVPP+IYKKVK  P++KRE  L+KE
Sbjct: 301  SPSSAEKPVKKKEASTPAYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPENKREARLIKE 360

Query: 1112 LEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPERPVV 1291
            LE ILSREGLS NP+EKEIK+ RK+KERAKELEGID                VAP +P V
Sbjct: 361  LEEILSREGLSSNPSEKEIKEVRKRKERAKELEGIDLSNIVTTSRRRSATSFVAPPKPKV 420


>ref|XP_006351897.1| PREDICTED: lisH domain-containing protein C1711.05-like [Solanum
            tuberosum]
          Length = 476

 Score =  300 bits (769), Expect = 1e-78
 Identities = 192/426 (45%), Positives = 254/426 (59%), Gaps = 12/426 (2%)
 Frame = +2

Query: 44   EGDGEKGAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRH 223
            E + EK   E+++E A+ +R+QHFK+ ADS TLE VRRL+E+DL LEK ALD HKR I+ 
Sbjct: 4    EVNEEKQGIEVKIEEALRSRIQHFKENADSFTLERVRRLIEEDLELEKYALDVHKRSIKL 63

Query: 224  YLEKIMDGA-DESNSSPATVNMEGGVLLSXXXXXXXXXXXX-ANSESKKASTGNKETMED 397
             LEK+M+ A D+ +   +  N+E    L+                +  K    ++  M+D
Sbjct: 64   ILEKLMENAADDGDPKDSQENLEKDASLTKQEKEVLESPKKQVIKKDIKEPAFDEAEMDD 123

Query: 398  SPIMGVLTPKSE-VGTQS-SLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLD 571
            SPIMGV++ KSE V  QS   SES+IKKAI ERA H + NS+ I+L GVRRLLEEDLGL+
Sbjct: 124  SPIMGVMSSKSESVDAQSVKASESSIKKAIWERAAHFRDNSESITLAGVRRLLEEDLGLE 183

Query: 572  KNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXX 751
            KNTLDA+K  I  Q+D VL            +S + KS+ +KK +               
Sbjct: 184  KNTLDAFKKFIQIQIDEVLTPSEAPKSSSVKKSPEKKSKTAKK-SGENSNSFSSKRKHIA 242

Query: 752  XKEKLRKEAGLRKNIKKFE--QPRKRRNSENADMDISRKKPKKQIEEDNNS-DEGGSISE 922
             K K RK +  ++ ++K E  + RK+ NSE+      +K+  K + ++N+  D   S SE
Sbjct: 243  EKVKSRKSSAAKETVEKSEGLKKRKKPNSEDNVPAKKQKEVSKNLSDENSDGDTDKSDSE 302

Query: 923  DGQSQLSLEKPAPRKE-----KSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDK 1087
            DGQS  S E  + +K+      +  GYGKRVE+LKSI KACGMSV P+IYK+ K V DDK
Sbjct: 303  DGQSGSSAEIISAKKKVVKGASANTGYGKRVEHLKSIFKACGMSVAPSIYKRAKQVSDDK 362

Query: 1088 RETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXX 1267
            RE  L+KELE ILS EGLS NPTEKEIK+ +K+K+ AKELEGID                
Sbjct: 363  REGFLIKELEKILSAEGLSTNPTEKEIKEVKKRKQTAKELEGIDLSNIVSNTRRRSTTSF 422

Query: 1268 VAPERP 1285
            VAP RP
Sbjct: 423  VAPPRP 428


>ref|XP_003522580.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Glycine
            max]
          Length = 490

 Score =  298 bits (762), Expect = 8e-78
 Identities = 177/401 (44%), Positives = 242/401 (60%), Gaps = 18/401 (4%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI+  L K ++G 
Sbjct: 16   ESQIETAMRSRVSHFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFIKQCLLKCLEGV 75

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430
             + +     ++ + G   S             + ++K     ++E MEDSP++G+L  + 
Sbjct: 76   GDDDGPK--ISGKEGEKGSSIQESEEPKEECESKDAKDLCPEDEEKMEDSPVLGLLKEQK 133

Query: 431  EV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580
                        GT+   SE+ IKKA+ +R+ +++AN++KI++ G+RRLLEEDL LDK T
Sbjct: 134  RAKLETKDDKGNGTKVVPSEALIKKAVRKRSSYIKANAEKITMAGLRRLLEEDLKLDKFT 193

Query: 581  LDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVNXXXXXXXXXXX 739
            LD YK  +S+Q+D VL            +          V  + S + N           
Sbjct: 194  LDPYKKFVSQQLDEVLTSSEVPEPAKNAKKIVKKKPDTKVTKKVSSEENSDTSDKETDEE 253

Query: 740  XXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSI 916
                 + K RK+   +  +K   QP+KR+  E+      R KP K   EDN+ +++ G  
Sbjct: 254  ESEEDEVKPRKKILPKGKVKTSVQPKKRKGEESDLSSKKRVKPAKAASEDNSDAEDNGKN 313

Query: 917  SEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRET 1096
            SED QS  S EKP+ +KE S P YGKRVE+LKS+IKACGMSVPP IYKKVK VP++KRE 
Sbjct: 314  SEDDQSHSSPEKPSKKKEVSNPVYGKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKREG 373

Query: 1097 ILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219
             L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID
Sbjct: 374  QLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414


>ref|XP_002516334.1| conserved hypothetical protein [Ricinus communis]
            gi|223544564|gb|EEF46081.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 517

 Score =  296 bits (759), Expect = 2e-77
 Identities = 175/419 (41%), Positives = 249/419 (59%), Gaps = 12/419 (2%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E Q++ A+ +R+ +F +Q++SLT E VRRLLEKDLGL++ ALD HKRF++  L + +DG 
Sbjct: 26   ESQIKDAMRSRVNYFNEQSNSLTFEGVRRLLEKDLGLQEYALDVHKRFVKQCLLQCLDGD 85

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLT--- 421
            + S  S  T   E G                +    K+  + ++E  E+SP+MG+LT   
Sbjct: 86   NASKDSGETD--EKGSRSIKGEATESPEGHESKDHIKEPCSEDEEKTEESPVMGLLTGKK 143

Query: 422  -PKSEVG---TQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTLDA 589
             PKSE      + + +ES IKKA+ +RA +++ANSDK+++ G+RRLLEEDL LDK+ LD 
Sbjct: 144  TPKSETDKTLVKEAPTESIIKKALSKRASYIKANSDKVTMAGLRRLLEEDLRLDKHALDP 203

Query: 590  YKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXXXKEKLR 769
            YK  IS Q+D VL               + + + SKK+                 +++++
Sbjct: 204  YKKFISAQLDEVLQSSEVSEPKKKSVKTNSQGKASKKMRTEESSDSSGKEMDTEDEDEVK 263

Query: 770  KEAGLRKNIKKFE----QPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSISEDGQS 934
             +  +  N K       + RKR   E       R KP +++ ED++ +++ G+ SEDG+S
Sbjct: 264  PKKKIAPNKKMINSEGSKKRKRFEKETKVTSKKRVKPTEKVAEDSSDAEDSGNASEDGRS 323

Query: 935  QLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVKEL 1114
            Q S EKP  +KE   P YGKRVE+LKS+IK+CGMSVPP +YKKVK VP++KRE  L+KEL
Sbjct: 324  QSSAEKPVKKKEAPTPVYGKRVEHLKSVIKSCGMSVPPVVYKKVKQVPENKREAQLIKEL 383

Query: 1115 EGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPERPVV 1291
            E ILS+EGLS NP+EKEIK+ RK+KERAKELEGID                V P +P +
Sbjct: 384  EEILSKEGLSSNPSEKEIKEVRKRKERAKELEGIDMSNIVSSSRRRSATSYVPPPKPKI 442


>ref|XP_007019032.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508724360|gb|EOY16257.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 523

 Score =  293 bits (750), Expect = 2e-76
 Identities = 178/445 (40%), Positives = 252/445 (56%), Gaps = 38/445 (8%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E ++  A+ +R+ HFK+QADSLT E VRRLLEKDLGLE  ALD HKRF++  L K +DG 
Sbjct: 29   ESRITTAMRSRVGHFKEQADSLTFEGVRRLLEKDLGLETFALDVHKRFVKQCLLKCLDGG 88

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL---- 418
            D+ ++  ++       L +            +  + K+A + ++E +EDSP++G+L    
Sbjct: 89   DDDDAPKSSGETGEKNLSTTTEVTESPKGRQSKKDVKEAFSEDEEKLEDSPVLGLLTGHK 148

Query: 419  -----TPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTL 583
                 T ++E      + ESTIKKAI +RA +++ANS+K+++ G+RRLLEEDL LDK+TL
Sbjct: 149  TTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKDTL 208

Query: 584  DAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVK----SRKSKKVNXXXXXXXXXXXXXXX 751
            D YK  I+ Q+D VL            +  ++K    S+ SKK +               
Sbjct: 209  DPYKKFITEQLDEVLKSREVSAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSESDEE 268

Query: 752  XKE----------------------KLRKEAGLRKNIKKFEQPRKRR-NSENADMDISRK 862
              E                      K +K+   +  IK  E  +KR+   + A+M   ++
Sbjct: 269  EGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMPSKKR 328

Query: 863  KPKKQIEEDNNSD--EGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGM 1036
                +   D+NSD  + GS+S+D +S+ S  K   RKE S P YGK VE+LKS+IK+CGM
Sbjct: 329  SKHAESISDDNSDAEDSGSVSDDNRSRSSAAKAVKRKETSTPVYGKHVEHLKSVIKSCGM 388

Query: 1037 SVPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGI 1216
            SVPP IYK+VK VP++ RE  L+KELE ILS+EGLS NP+EKEIK+ RK+KERAKELEGI
Sbjct: 389  SVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKELEGI 448

Query: 1217 DXXXXXXXXXXXXXXXXVAPERPVV 1291
            D                VAP +P +
Sbjct: 449  DTSNIVLSSRRRSTTSFVAPPKPKI 473


>ref|XP_006581582.1| PREDICTED: DNA ligase 1-like isoform X2 [Glycine max]
          Length = 486

 Score =  290 bits (743), Expect = 1e-75
 Identities = 177/412 (42%), Positives = 243/412 (58%), Gaps = 19/412 (4%)
 Frame = +2

Query: 41   AEGDGEKGAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 217
            +EG  +K    E Q+E A+ +R+  FK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI
Sbjct: 5    SEGTAKKEEILESQIETAMRSRVSLFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64

Query: 218  RHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMED 397
            +  L K ++G  + +   A ++ + G   +               ++K     ++E MED
Sbjct: 65   KQCLLKCLEGVGDDDG--AKISGKEGEKGTSTQESEEPKEECEAKDAKDLCPEDEEKMED 122

Query: 398  SPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRL 547
            SP++G+L  +             GT+    E+ IKKA+ +R+ +++AN++KI++ G+RRL
Sbjct: 123  SPVLGLLKEQKRAKLETKDDKGNGTKVVPIEALIKKAVRKRSSYIKANAEKITMAGLRRL 182

Query: 548  LEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVN 706
            LEEDL LDK TLD YK  +S+Q+D VL            +          V  + S + N
Sbjct: 183  LEEDLKLDKFTLDPYKKFVSQQLDEVLASSEVPKPSNNAKKIVKKKPDTKVTKKVSSEEN 242

Query: 707  XXXXXXXXXXXXXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEE 886
                            + K RK+   +  +K   QP+KR+  E       R KP K   E
Sbjct: 243  SDTSDKETDEEESEEDEVKPRKKIVPKGKVKTSVQPKKRKGEETDLSSKKRVKPAKATSE 302

Query: 887  DNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKK 1063
            DN+ +++ G  SED QS  S EKP+ +KE S P YGK VE+LKS+IKACGMSVPP IYKK
Sbjct: 303  DNSDAEDDGKNSEDDQSSSSPEKPSKKKEVSTPVYGKHVEHLKSVIKACGMSVPPVIYKK 362

Query: 1064 VKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219
            VK VP++KRE  L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID
Sbjct: 363  VKQVPENKREEQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414


>ref|XP_003527934.1| PREDICTED: DNA ligase 1-like isoform X1 [Glycine max]
          Length = 488

 Score =  290 bits (743), Expect = 1e-75
 Identities = 177/412 (42%), Positives = 243/412 (58%), Gaps = 19/412 (4%)
 Frame = +2

Query: 41   AEGDGEKGAF-ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFI 217
            +EG  +K    E Q+E A+ +R+  FK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI
Sbjct: 5    SEGTAKKEEILESQIETAMRSRVSLFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFI 64

Query: 218  RHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMED 397
            +  L K ++G  + +   A ++ + G   +               ++K     ++E MED
Sbjct: 65   KQCLLKCLEGVGDDDG--AKISGKEGEKGTSTQESEEPKEECEAKDAKDLCPEDEEKMED 122

Query: 398  SPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRL 547
            SP++G+L  +             GT+    E+ IKKA+ +R+ +++AN++KI++ G+RRL
Sbjct: 123  SPVLGLLKEQKRAKLETKDDKGNGTKVVPIEALIKKAVRKRSSYIKANAEKITMAGLRRL 182

Query: 548  LEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVN 706
            LEEDL LDK TLD YK  +S+Q+D VL            +          V  + S + N
Sbjct: 183  LEEDLKLDKFTLDPYKKFVSQQLDEVLASSEVPKPSNNAKKIVKKKPDTKVTKKVSSEEN 242

Query: 707  XXXXXXXXXXXXXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEE 886
                            + K RK+   +  +K   QP+KR+  E       R KP K   E
Sbjct: 243  SDTSDKETDEEESEEDEVKPRKKIVPKGKVKTSVQPKKRKGEETDLSSKKRVKPAKATSE 302

Query: 887  DNN-SDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKK 1063
            DN+ +++ G  SED QS  S EKP+ +KE S P YGK VE+LKS+IKACGMSVPP IYKK
Sbjct: 303  DNSDAEDDGKNSEDDQSSSSPEKPSKKKEVSTPVYGKHVEHLKSVIKACGMSVPPVIYKK 362

Query: 1064 VKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219
            VK VP++KRE  L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID
Sbjct: 363  VKQVPENKREEQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGID 414


>ref|XP_007019033.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508724361|gb|EOY16258.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 521

 Score =  288 bits (738), Expect = 5e-75
 Identities = 178/445 (40%), Positives = 252/445 (56%), Gaps = 38/445 (8%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E ++  A+ +R+ HFK+QADSLT E VRRLLEKDLGLE  ALD HKRF++  L K +DG 
Sbjct: 29   ESRITTAMRSRVGHFKEQADSLTFEGVRRLLEKDLGLETFALDVHKRFVKQCLLKCLDGG 88

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL---- 418
            D+ ++  ++       L +            +  + K+A + ++E +EDSP++G+L    
Sbjct: 89   DDDDAPKSSGETGEKNLSTTTEVTESPKGRQSKKDVKEAFSEDEEKLEDSPVLGLLTGHK 148

Query: 419  -----TPKSEVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTL 583
                 T ++E      + ESTIKKAI +RA +++ANS+K+++ G+RRLLEEDL LDK+TL
Sbjct: 149  TTKTETMETETKENKDVFESTIKKAIKKRASYVEANSEKVTMAGLRRLLEEDLKLDKDTL 208

Query: 584  DAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVK----SRKSKKVNXXXXXXXXXXXXXXX 751
            D YK  I+ Q+D VL            +  ++K    S+ SKK +               
Sbjct: 209  DPYKKFITEQLDEVLKSREVSAPASVVKKNNLKKNSQSKASKKASKKLSSASSGSESDEE 268

Query: 752  XKE----------------------KLRKEAGLRKNIKKFEQPRKRR-NSENADMDISRK 862
              E                      K +K+   +  IK  E  +KR+   + A+M   ++
Sbjct: 269  EGEEEEDEDEDEDVDEEEEEEEEEVKPKKKISAKGKIKNSEGLKKRKIPKKEAEMPSKKR 328

Query: 863  KPKKQIEEDNNSD--EGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGM 1036
                +   D+NSD  + GS+S+D +S+ S  K   RKE S P YGK VE+LKS+IK+CGM
Sbjct: 329  SKHAESISDDNSDAEDSGSVSDDNRSRSSAAK--ARKETSTPVYGKHVEHLKSVIKSCGM 386

Query: 1037 SVPPNIYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGI 1216
            SVPP IYK+VK VP++ RE  L+KELE ILS+EGLS NP+EKEIK+ RK+KERAKELEGI
Sbjct: 387  SVPPAIYKRVKQVPENNREAQLIKELEEILSKEGLSSNPSEKEIKEVRKRKERAKELEGI 446

Query: 1217 DXXXXXXXXXXXXXXXXVAPERPVV 1291
            D                VAP +P +
Sbjct: 447  DTSNIVLSSRRRSTTSFVAPPKPKI 471


>ref|XP_007222349.1| hypothetical protein PRUPE_ppa004840mg [Prunus persica]
            gi|462419285|gb|EMJ23548.1| hypothetical protein
            PRUPE_ppa004840mg [Prunus persica]
          Length = 489

 Score =  283 bits (725), Expect = 2e-73
 Identities = 169/421 (40%), Positives = 248/421 (58%), Gaps = 16/421 (3%)
 Frame = +2

Query: 77   QLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGADE 256
            Q++ A+ +R+ +FK+Q+DSLT E VRRLLEKDLGLE  ALD HKRF++ +L + ++GA +
Sbjct: 25   QIKDAMRSRVPYFKEQSDSLTFEGVRRLLEKDLGLETFALDVHKRFVKEHLVECLEGAGD 84

Query: 257  SNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVL----TP 424
             N+S ++   +   ++             +N + K+  + ++E MEDSP+MG+L    T 
Sbjct: 85   DNTSKSSGETDEKSIIKGEAAESPEGYK-SNKDVKETYSEDEEKMEDSPVMGLLAGNKTA 143

Query: 425  KS------EVGTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNTLD 586
            KS         ++ + SE+ IK A+ +R  +++ANS+KI++ G+RRLLEEDL L+K TLD
Sbjct: 144  KSGTEETKSTKSKKAPSETVIKSALRKRVSYIKANSEKITMAGLRRLLEEDLKLEKYTLD 203

Query: 587  AYKNLISRQVDLVLXXXXXXXXXXXXRS--EDVKSRKSKKVNXXXXXXXXXXXXXXXXKE 760
              K  I+  +D VL            ++  + V+ + S KV                  E
Sbjct: 204  PCKKFINEHLDKVLESCEISEPAPVKKNVKKSVQRKASTKVRSDESSGSSDNESDEEEDE 263

Query: 761  KLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKK----QIEEDNNSDEGGSISEDG 928
               +   + K   +     K+R     + +IS KK  K    + E+ ++++  G++SED 
Sbjct: 264  VKPRNKSVPKGKMQNSNDLKKRKRMANETNISGKKRIKPSETEPEDKSDAEVSGNVSEDD 323

Query: 929  QSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETILVK 1108
            +SQ S EKP  +KE S P YGKRVE+L+S+IKACGMSV P++YKKVK VP+ KRE  L+K
Sbjct: 324  RSQSSAEKPVKKKEVSTPAYGKRVEHLRSVIKACGMSVAPSVYKKVKQVPESKREAHLIK 383

Query: 1109 ELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAPERPV 1288
            ELE ILS+EGLS +PTEKEIK+ +KKKERAKELEGID                V P +P 
Sbjct: 384  ELEEILSKEGLSAHPTEKEIKEVKKKKERAKELEGIDMSNIVTSSRRRSTTSFVPPPKPK 443

Query: 1289 V 1291
            +
Sbjct: 444  I 444


>ref|XP_004500560.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Cicer
            arietinum] gi|502130188|ref|XP_004500561.1| PREDICTED:
            transcriptional regulator ATRX homolog isoform X2 [Cicer
            arietinum]
          Length = 497

 Score =  283 bits (723), Expect = 3e-73
 Identities = 175/428 (40%), Positives = 244/428 (57%), Gaps = 18/428 (4%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E Q++ A+ +R+ HFK QADSLT E VRRLLEKDLG E+ +LD+HKRFI+  LEK ++  
Sbjct: 16   ESQIQTAMLSRVPHFKQQADSLTFEGVRRLLEKDLGFEEYSLDSHKRFIKQCLEKCLEEV 75

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430
             + ++S  +   E                    S+ +K  T ++E MEDSP++G+L  + 
Sbjct: 76   GDDDASKMSGEEEEK---GESTQEVEGKKEEHQSKDEKDLTEDEEKMEDSPVLGLLKEQK 132

Query: 431  EVGTQSSLSEST----------IKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580
             V  ++  +E            IKKAI++R+ +L+AN+D++++ G+RRLLEEDL LDK +
Sbjct: 133  RVKNETKKAEGNGKKVVPNEALIKKAIIKRSSYLKANADEVTVAGLRRLLEEDLKLDKFS 192

Query: 581  LDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXXXKE 760
            LD +K  I +Q+D VL            + + VK +   KV                 +E
Sbjct: 193  LDPFKKFIRQQLDEVLMSSEVLEPAKSAK-KIVKKKPDSKVTKKVSTEENSDTSDKVSEE 251

Query: 761  KLRKEAGLRKNIKKFEQ------PRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSIS 919
            +  +E  ++   K   +      P+KR+  E       R KP K+  EDN+ +++GG  S
Sbjct: 252  EESQEDEVKPKKKSVPKGKASVGPKKRKGEEIKSPSKKRAKPDKEASEDNSDAEDGGKNS 311

Query: 920  EDGQSQLSLEKPAPRKEKSAPG-YGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRET 1096
            ED QS  S E    +K+ S P  Y KRVE+LKS+IKACGMSVPP IYKKVK VP++KRE 
Sbjct: 312  EDDQSHSSAENTTQKKQVSTPVVYSKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKREG 371

Query: 1097 ILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXXXXXXXXXXXXVAP 1276
             L+KELE ILSREGLS NP+EKEIK+ ++KKERAKELEGID                 AP
Sbjct: 372  QLIKELEEILSREGLSSNPSEKEIKEVKRKKERAKELEGIDMSNIVSSTRRRATTSFAAP 431

Query: 1277 ERPVVRAK 1300
              P  + K
Sbjct: 432  PPPKPKPK 439


>ref|XP_004169339.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101229552
            [Cucumis sativus]
          Length = 488

 Score =  281 bits (719), Expect = 8e-73
 Identities = 171/400 (42%), Positives = 235/400 (58%), Gaps = 17/400 (4%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E ++  A+ +R+ HFK+QADSLT E VRRLLEKDL +E   LD HKR+++  L K ++  
Sbjct: 23   ETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCMETYTLDVHKRYVKQCLVKCLEAD 82

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430
             E N S  +  + G   ++            +   +K+    ++E MEDSP+MG+LT +S
Sbjct: 83   LEDNVSKDS-ELTGRKSVNKEEAPESPEGHQSKKGAKEPCLEDEEKMEDSPVMGLLTGRS 141

Query: 431  EVGTQSS-------------LSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLD 571
                +S               SESTI KAI +R  +L+ANS+K+++ GVRRLLE+DL L 
Sbjct: 142  TKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLT 201

Query: 572  KNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXX 751
            KN LD+ K  IS+QV+ +L             +E V + KS K                 
Sbjct: 202  KNVLDSCKKFISQQVEEILTSCEA--------AEQVSNLKSPKKISKESSYSTEGSSSEE 253

Query: 752  XKEKLR--KEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNNSDEGG-SISE 922
              +++   K    +  I    + +KR+ S    +   ++    Q   D +SDEGG ++SE
Sbjct: 254  ENDEVNPGKTNATKGRIPDANETKKRKRSTKKTVSAQKQSKHVQDTSDEDSDEGGGNVSE 313

Query: 923  DGQSQLSLEKPAPRK-EKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099
            DG+S  S EKP  ++   S P YGKRVE+LKS+IK+CGMSVPP+IYKKVK  P+ KRE+ 
Sbjct: 314  DGRSGSSNEKPVKKEVSSSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQ 373

Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219
            L+KELEGILSREGLS N TEKEIK+ +KKKERAKELEGID
Sbjct: 374  LIKELEGILSREGLSANSTEKEIKEVKKKKERAKELEGID 413


>ref|XP_004145363.1| PREDICTED: uncharacterized protein LOC101217045 [Cucumis sativus]
          Length = 488

 Score =  281 bits (719), Expect = 8e-73
 Identities = 171/400 (42%), Positives = 235/400 (58%), Gaps = 17/400 (4%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E ++  A+ +R+ HFK+QADSLT E VRRLLEKDL +E   LD HKR+++  L K ++  
Sbjct: 23   ETKIHNAMRSRISHFKEQADSLTFEGVRRLLEKDLCMETYTLDVHKRYVKQCLVKCLEAD 82

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430
             E N S  +  + G   ++            +   +K+    ++E MEDSP+MG+LT +S
Sbjct: 83   LEDNVSKDS-ELTGRKSVNKEEAPESPEGHQSKKGAKEPCLEDEEKMEDSPVMGLLTGRS 141

Query: 431  EVGTQSS-------------LSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLD 571
                +S               SESTI KAI +R  +L+ANS+K+++ GVRRLLE+DL L 
Sbjct: 142  TKNVESDGIKGIKGKDDKDVPSESTIMKAIRKRTSYLKANSEKVTMAGVRRLLEDDLKLT 201

Query: 572  KNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXXXXXXXXXXXXX 751
            KN LD+ K  IS+QV+ +L             +E V + KS K                 
Sbjct: 202  KNVLDSCKKFISQQVEEILTSCEA--------AEQVSNLKSPKKISKESSYSTEGSSSEE 253

Query: 752  XKEKLR--KEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNNSDEGG-SISE 922
              +++   K    +  I    + +KR+ S    +   ++    Q   D +SDEGG ++SE
Sbjct: 254  ENDEVNPGKTNATKGRIPDSNETKKRKRSTKKTVSAQKQSKHVQDTSDEDSDEGGGNVSE 313

Query: 923  DGQSQLSLEKPAPRK-EKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099
            DG+S  S EKP  ++   S P YGKRVE+LKS+IK+CGMSVPP+IYKKVK  P+ KRE+ 
Sbjct: 314  DGRSGSSNEKPVKKEVSSSTPVYGKRVEHLKSVIKSCGMSVPPSIYKKVKQAPESKRESQ 373

Query: 1100 LVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGID 1219
            L+KELEGILSREGLS N TEKEIK+ +KKKERAKELEGID
Sbjct: 374  LIKELEGILSREGLSANSTEKEIKEVKKKKERAKELEGID 413


>ref|XP_004290855.1| PREDICTED: uncharacterized protein LOC101302129 [Fragaria vesca
            subsp. vesca]
          Length = 490

 Score =  275 bits (703), Expect = 6e-71
 Identities = 167/438 (38%), Positives = 258/438 (58%), Gaps = 17/438 (3%)
 Frame = +2

Query: 29   AELMAEGDGEKGAFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHK 208
            +E   + + E G  E ++  A+ AR+ HFK+Q+DSLT  +VRR+LEKDLGLE  ALDAHK
Sbjct: 8    SEAPMKKEEETGDMESKILEAMKARVPHFKEQSDSLTFVNVRRVLEKDLGLEPSALDAHK 67

Query: 209  RFIRHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKET 388
             F++ +L K ++GA E N+S ++   +   L+             +N + K+ S+ ++E 
Sbjct: 68   GFVKEHLLKCLEGAGEDNNSKSSGQTDEKSLIKGEATGSTEGHQ-SNKDMKETSSADEEK 126

Query: 389  MEDSPIMGVLTPKSEV-----GTQSSLS-----ESTIKKAILERADHLQANSDKISLGGV 538
            +EDSP   +LT          G++SS +     E+ IK A+ +R  +++AN +K+++G +
Sbjct: 127  VEDSPASELLTEHKTAKVKAEGSKSSNNKKAPTEAMIKSALGKRGSYIKANIEKLTMGEL 186

Query: 539  RRLLEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKS---KKVNX 709
            RR+LE+DL LD  +LD +K  I++Q+D VL            +    K ++    ++++ 
Sbjct: 187  RRVLEKDLKLDTYSLDPFKKFINQQLDEVLESCVDPEPVKNVKKNVKKPQRKPTPEEISE 246

Query: 710  XXXXXXXXXXXXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQI--- 880
                           + K RK++  +  ++  +  +KR++    + +IS KK  K +   
Sbjct: 247  ESSGPANSGTDEEEDEVKPRKKSVTKGKMQNSDGLKKRKSLAK-ETNISGKKRIKSLKAD 305

Query: 881  -EEDNNSDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIY 1057
             EE +++ +  ++SED  S+ S EKP  +KE S P YGKRVE+L+S+IKACGMSVPP+IY
Sbjct: 306  SEEKSDAKDSENVSEDEDSKSSAEKPVKKKEVSTPAYGKRVEHLRSVIKACGMSVPPSIY 365

Query: 1058 KKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXXXX 1237
            KKVK VP++KRE  L+KELE IL REGLS +PTEKEIK+ +KKKE+AKELEGID      
Sbjct: 366  KKVKQVPENKREAQLIKELEDILGREGLSSSPTEKEIKEVKKKKEKAKELEGIDMSNIVT 425

Query: 1238 XXXXXXXXXXVAPERPVV 1291
                      V P +P +
Sbjct: 426  SSRRRSTTSFVPPPKPKI 443


>ref|XP_006434169.1| hypothetical protein CICLE_v10000938mg [Citrus clementina]
            gi|557536291|gb|ESR47409.1| hypothetical protein
            CICLE_v10000938mg [Citrus clementina]
          Length = 451

 Score =  274 bits (701), Expect = 1e-70
 Identities = 158/381 (41%), Positives = 227/381 (59%), Gaps = 15/381 (3%)
 Frame = +2

Query: 65   AFELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMD 244
            + E Q++ A+ +R+ HFK+QADSLT E VRRL+EKDLGLE  ALD HK+FI+  L + MD
Sbjct: 19   SIEPQIKAAMISRVSHFKEQADSLTFEGVRRLIEKDLGLETHALDVHKKFIKQCLLECMD 78

Query: 245  GADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTP 424
            GA   ++S  +       + S            +  + K+    N E MEDSP++G++T 
Sbjct: 79   GAGGVSASKDSAESAKENVSSTKEEEKSPEGYQSAKDVKEPCPENYEKMEDSPVLGLMTG 138

Query: 425  KSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDK 574
              +           G +   SES IKKAI +RA +++ N +K+++ G+RR+LEEDL LDK
Sbjct: 139  NKKTKFETEEAQGDGNKEDPSESAIKKAIRKRAAYIKTNIEKVTMAGLRRILEEDLKLDK 198

Query: 575  NTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSR---KSKKVNXXXXXXXXXXXXX 745
             TLD++K +IS+++D VL            + + +K     K+K+V+             
Sbjct: 199  FTLDSFKKMISQELDEVLKSSEVLEPSTVEKKKSLKKNYQSKAKEVSSEGSSDSSDGEVD 258

Query: 746  XXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPK--KQIEEDNNSDEGGSIS 919
               + K RK+   +  ++  E  +KR+  E       +KK K  K   EDNN  E GS+S
Sbjct: 259  EEDEMKPRKKIVSKGKVQNNEGLKKRKRPEKETKASIKKKTKAVKIASEDNNDAESGSVS 318

Query: 920  EDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRETI 1099
            +DG+SQ S EKP  +K  S P YGKRVE+LK++IK+C MS+PP++YKKVK  P++KRE  
Sbjct: 319  DDGRSQSSSEKPIKKKVVSTPAYGKRVEHLKTVIKSCAMSIPPSVYKKVKQAPENKREAQ 378

Query: 1100 LVKELEGILSREGLSKNPTEK 1162
            L+KELEGILSREGLS NP+EK
Sbjct: 379  LIKELEGILSREGLSSNPSEK 399


>ref|XP_007137404.1| hypothetical protein PHAVU_009G124200g [Phaseolus vulgaris]
            gi|561010491|gb|ESW09398.1| hypothetical protein
            PHAVU_009G124200g [Phaseolus vulgaris]
          Length = 493

 Score =  273 bits (697), Expect = 3e-70
 Identities = 174/438 (39%), Positives = 245/438 (55%), Gaps = 22/438 (5%)
 Frame = +2

Query: 38   MAEGDGE--KGA-FELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHK 208
            MAE   E  KG   E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HK
Sbjct: 1    MAEDSEEMKKGENIESQIETAMLSRVSHFKEQSDSLTFEGVRRLLEKDLGLEECALDVHK 60

Query: 209  RFIRHYLEKIMDGADESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKET 388
            RFI+  L + ++G  +   +   ++ + G   +               + K     ++E 
Sbjct: 61   RFIKQCLLECLEGVGDD--AGPRISEKAGEEGAGTLEPDEPKEKCELKDEKDLCPEDEEK 118

Query: 389  MEDSPIMGVLTPKSEV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGV 538
            MEDSP++G+L  +             G +   SE+ + KA+ +R+ +++AN++ I++ G+
Sbjct: 119  MEDSPVLGLLKEQKRAKLETKDDKGNGNKVVPSEALVMKAVKKRSSYIKANAETITMAGL 178

Query: 539  RRLLEEDLGLDKNTLDAYKNLISRQVDLVLXXXXXXXXXXXXRSEDVKSRKSKKVNXXXX 718
            RRLLE+DL LDK TLD YK  IS+Q+D VL            + + VK +   KV     
Sbjct: 179  RRLLEDDLKLDKFTLDLYKKFISQQLDEVLASSVVSEPAKNAK-KIVKKKPDTKVTKKVS 237

Query: 719  XXXXXXXXXXXXKEKLRKEAGLRKNIK-----KFEQPRKRRNSENADMDISRKKPKKQI- 880
                         E   +E  ++   K     K + P + +  +  + D+S KK  K   
Sbjct: 238  SEENSDTSDKEIDEDESQEDEVKPMKKVVPKGKAQTPVQSKKRKGEETDLSSKKRMKPAK 297

Query: 881  ---EEDNNSDEGGSISEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPN 1051
               EE +++++ G  SED QS  S EKP+ +KE S P YGKRVE LKS+IKACGM VPP+
Sbjct: 298  AASEEISDAEDSGKNSEDDQSHSSSEKPSKKKEVSTPVYGKRVETLKSVIKACGMGVPPS 357

Query: 1052 IYKKVKGVPDDKRETILVKELEGILSREGLSKNPTEKEIKDCRKKKERAKELEGIDXXXX 1231
            IYKK+K V ++KRE  L+KELE ILSREGLS NP+EKEIK+ ++KK RAKELEGID    
Sbjct: 358  IYKKIKQVSENKREGQLIKELEEILSREGLSSNPSEKEIKEVKRKKARAKELEGIDVSNI 417

Query: 1232 XXXXXXXXXXXXVAPERP 1285
                        +AP  P
Sbjct: 418  VSSSRRRSTSSYIAPPPP 435


>ref|XP_006578974.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Glycine
            max]
          Length = 408

 Score =  270 bits (690), Expect = 2e-69
 Identities = 163/382 (42%), Positives = 225/382 (58%), Gaps = 18/382 (4%)
 Frame = +2

Query: 71   ELQLERAVSARLQHFKDQADSLTLESVRRLLEKDLGLEKLALDAHKRFIRHYLEKIMDGA 250
            E Q+E A+ +R+ HFK+Q+DSLT E VRRLLEKDLGLE+ ALD HKRFI+  L K ++G 
Sbjct: 16   ESQIETAMRSRVSHFKEQSDSLTFEGVRRLLEKDLGLEEYALDVHKRFIKQCLLKCLEGV 75

Query: 251  DESNSSPATVNMEGGVLLSXXXXXXXXXXXXANSESKKASTGNKETMEDSPIMGVLTPKS 430
             + +     ++ + G   S             + ++K     ++E MEDSP++G+L  + 
Sbjct: 76   GDDDGPK--ISGKEGEKGSSIQESEEPKEECESKDAKDLCPEDEEKMEDSPVLGLLKEQK 133

Query: 431  EV----------GTQSSLSESTIKKAILERADHLQANSDKISLGGVRRLLEEDLGLDKNT 580
                        GT+   SE+ IKKA+ +R+ +++AN++KI++ G+RRLLEEDL LDK T
Sbjct: 134  RAKLETKDDKGNGTKVVPSEALIKKAVRKRSSYIKANAEKITMAGLRRLLEEDLKLDKFT 193

Query: 581  LDAYKNLISRQVDLVLXXXXXXXXXXXXRS-------EDVKSRKSKKVNXXXXXXXXXXX 739
            LD YK  +S+Q+D VL            +          V  + S + N           
Sbjct: 194  LDPYKKFVSQQLDEVLTSSEVPEPAKNAKKIVKKKPDTKVTKKVSSEENSDTSDKETDEE 253

Query: 740  XXXXXKEKLRKEAGLRKNIKKFEQPRKRRNSENADMDISRKKPKKQIEEDNN-SDEGGSI 916
                 + K RK+   +  +K   QP+KR+  E+      R KP K   EDN+ +++ G  
Sbjct: 254  ESEEDEVKPRKKILPKGKVKTSVQPKKRKGEESDLSSKKRVKPAKAASEDNSDAEDNGKN 313

Query: 917  SEDGQSQLSLEKPAPRKEKSAPGYGKRVENLKSIIKACGMSVPPNIYKKVKGVPDDKRET 1096
            SED QS  S EKP+ +KE S P YGKRVE+LKS+IKACGMSVPP IYKKVK VP++KRE 
Sbjct: 314  SEDDQSHSSPEKPSKKKEVSNPVYGKRVEHLKSVIKACGMSVPPVIYKKVKQVPENKREG 373

Query: 1097 ILVKELEGILSREGLSKNPTEK 1162
             L+KELE ILSREGLS NP+EK
Sbjct: 374  QLIKELEEILSREGLSSNPSEK 395

