BLASTX nr result

ID: Paeonia24_contig00012836 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00012836
         (2323 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268...   594   e-167
emb|CBI27315.3| unnamed protein product [Vitis vinifera]              579   e-162
ref|XP_002312290.1| hypothetical protein POPTR_0008s09730g [Popu...   550   e-153
ref|XP_007227250.1| hypothetical protein PRUPE_ppa017973mg [Prun...   528   e-147
ref|XP_002512056.1| conserved hypothetical protein [Ricinus comm...   524   e-146
ref|XP_007045807.1| Histone-lysine N-methyltransferase ATX1, put...   498   e-138
ref|XP_007045808.1| Histone-lysine N-methyltransferase ATX1, put...   494   e-137
ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628...   488   e-135
ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, part...   456   e-125
ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305...   441   e-121
ref|XP_006599179.1| PREDICTED: uncharacterized protein LOC100802...   416   e-113
gb|EXB44873.1| hypothetical protein L484_026455 [Morus notabilis]     412   e-112
ref|XP_006415912.1| hypothetical protein EUTSA_v10006590mg [Eutr...   410   e-111
gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thal...   406   e-110
gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis ...   406   e-110
ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana] g...   406   e-110
ref|NP_173957.2| DNA binding protein [Arabidopsis thaliana] gi|3...   406   e-110
ref|XP_007156394.1| hypothetical protein PHAVU_003G282800g [Phas...   403   e-109
ref|XP_004239457.1| PREDICTED: uncharacterized protein LOC101261...   402   e-109
ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp....   401   e-109

>ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268093 [Vitis vinifera]
          Length = 1242

 Score =  594 bits (1531), Expect = e-167
 Identities = 320/615 (52%), Positives = 410/615 (66%), Gaps = 24/615 (3%)
 Frame = -2

Query: 1929 ELSVCHDASDVNLRDKVGYAGILQKEINTKIRESIICGNSGVGCVPE-----------EI 1783
            E S+          +++     L ++ N  + ESII  N G  C+ +           EI
Sbjct: 628  EQSISSKMDGAEAGNQISDVAPLTRKYNGLLSESIIYRNFGDDCILDAYPTVGPLLAAEI 687

Query: 1782 LQMSSSENIPNKKVAVDAEARFPGQLHGLCTENTTPNPKAVFYN-DPVISHNKSFSCASE 1606
             Q+SSS + P+KKV    E +  GQ + L TE    NP+ VF N  PV S N+ F C S+
Sbjct: 688  HQVSSSASSPDKKVLFSPEVKLEGQHYNLNTEKIALNPEGVFCNMAPVSSQNQEFICTSK 747

Query: 1605 NKDTADFLSPPVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEV 1426
              D   F  P V RVE     +  +L +QQNLVK   S    QK GT+  +   S+A EV
Sbjct: 748  YDDPYIFFYPSVLRVESCQAYIDKKLVEQQNLVKLNRS---VQKGGTSFGENNMSNAEEV 804

Query: 1425 HTDSDLKPHMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNR 1246
               ++LK H+   + +DL    +LVGCYVHPMPVLSV LNT++ EI ICVLCGL+VD + 
Sbjct: 805  QAGTNLKAHIKMEVKHDLVGNTELVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDT 864

Query: 1245 ALFIYKLSIEGPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSV 1066
             LFIYK++I+ P    P+F G+  +  P  KD  G +V LDR GLQFTPDGQ LVLLNS+
Sbjct: 865  ILFIYKVTIKEPRLQSPTFVGYTPIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSI 924

Query: 1065 KAPYCREQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHL 886
            K PYCREQKI CLCSAC  +CFE+NA+KIVQ+KLG+++V+ KLKT +SV C+LVCEPNHL
Sbjct: 925  KTPYCREQKIPCLCSACKLECFEENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHL 984

Query: 885  IAVEDGGRLRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFG 706
            +AVE+ GRL +WVMNSTWS  TE+F+IP  DC+S  I+ELK+IPK + LV+GH+GFG+F 
Sbjct: 985  VAVEESGRLHVWVMNSTWSVQTEDFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFS 1044

Query: 705  LWDISNRILISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSE 544
            LWDIS RILIS+F++PS+S+ +F+PISL  ++S+  +S+      HIN      K+W S+
Sbjct: 1045 LWDISQRILISRFAMPSISIFEFIPISLFSFQSEVPLSSNPDVDLHINKIMAATKMWFSK 1104

Query: 543  H-----CMKLSLKDMAIWLLVST-GPNCKALEKYESDCQLNESGCWRLALLVKNRVILGS 382
            H      + L  + +A+WLLVST   +    +   +DCQ N  G WRLALLVKN VILGS
Sbjct: 1105 HNENYTFLPLGGESIAVWLLVSTLSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGS 1164

Query: 381  PLDPRAVAIGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGV 202
             LDPRA AIGAS GHGII T DGLVYMWEL TG +LG+LHYFK GGVSCI TD +S+S V
Sbjct: 1165 ALDPRAAAIGASAGHGIIGTHDGLVYMWELSTGTKLGSLHYFK-GGVSCIATD-DSRSDV 1222

Query: 201  LAVAGDGGKLLIYLH 157
             AVAGDGG+LL+YLH
Sbjct: 1223 FAVAGDGGQLLVYLH 1237



 Score = 76.6 bits (187), Expect = 5e-11
 Identities = 45/110 (40%), Positives = 62/110 (56%)
 Frame = -2

Query: 2241 DDINRSVHIRDVRSPVRMLSENSRTEREEKMHISIKDPESFVPSFEHITSVVPDSYEDDQ 2062
            D+ + +  I +V SP  +L+ENS  E EEK+    +D +  +PSFEH+ SV+PDS+EDDQ
Sbjct: 451  DENSGACPIVNVASPALVLAENSPVEMEEKVQTFRRDFDPVIPSFEHVKSVIPDSFEDDQ 510

Query: 2061 CEQHVISQVPLSFXXXXXXXXXXXXXETFAPDNLRLFGSIKARKELSVCH 1912
            C  H  +  PL F             +T A D L  F ++ A KE SVCH
Sbjct: 511  C-GHDSANGPLLFSDIAGADQASFDKDTCACDTLGQFINVDAWKESSVCH 559


>emb|CBI27315.3| unnamed protein product [Vitis vinifera]
          Length = 1177

 Score =  579 bits (1493), Expect = e-162
 Identities = 342/772 (44%), Positives = 444/772 (57%), Gaps = 87/772 (11%)
 Frame = -2

Query: 2211 DVRSPVRMLSENSRTEREEKMHISIKDPESFVPSFEHITSVVPDSYEDDQCEQHVISQVP 2032
            D  S    +  NS  E EEK+    +D +  +PSFEH+ SV+PDS+EDDQC  H  +  P
Sbjct: 442  DENSGACPIVNNSPVEMEEKVQTFRRDFDPVIPSFEHVKSVIPDSFEDDQCG-HDSANGP 500

Query: 2031 LSFXXXXXXXXXXXXXETFAPDNLRLF--------------------------------- 1951
            L F             +T A D L  F                                 
Sbjct: 501  LLFSDIAGADQASFDKDTCACDTLGQFINVDAWKESSVCHVETGERKDGFSCSKANVASK 560

Query: 1950 --------GSIKARKELSVCHDASDVNLRDKVGYAGILQKEINTK--------------- 1840
                    G +   +E ++    S  N +  V    I ++ I++K               
Sbjct: 561  LDENSIHHGILSVEREKTLLDYTSGANTKCMVSSVQISEQSISSKMDGAEAGNQISDVAP 620

Query: 1839 --------IRESIICGNSGVGCVPE-----------EILQMSSSENIPNKKVAVDAEARF 1717
                    + ESII  N G  C+ +           EI Q+SSS + P+KKV    E + 
Sbjct: 621  LTRKYNGLLSESIIYRNFGDDCILDAYPTVGPLLAAEIHQVSSSASSPDKKVLFSPEVKL 680

Query: 1716 PGQLHGLCTENTTPNPKAVFYNDPVISHNKSFSCASENKDTADFLSPPVSRVEKSNNGVG 1537
             GQ + L TE    NP+               SC +                      + 
Sbjct: 681  EGQHYNLNTEKIALNPEE--------------SCQAY---------------------ID 705

Query: 1536 HELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNNDLKDFVK 1357
             +L +QQNLVK   S    QK GT+  +   S+A EV   ++LK H+   + +DL    +
Sbjct: 706  KKLVEQQNLVKLNRS---VQKGGTSFGENNMSNAEEVQAGTNLKAHIKMEVKHDLVGNTE 762

Query: 1356 LVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGCPSFAGHA 1177
            LVGCYVHPMPVLSV LNT++ EI ICVLCGL+VD +  LFIYK++I+ P    P+F G+ 
Sbjct: 763  LVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQSPTFVGYT 822

Query: 1176 SLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSACTSDCFE 997
             +  P  KD  G +V LDR GLQFTPDGQ LVLLNS+K PYCREQKI CLCSAC  +CFE
Sbjct: 823  PIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCSACKLECFE 882

Query: 996  DNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNSTWSAPTE 817
            +NA+KIVQ+KLG+++V+ KLKT +SV C+LVCEPNHL+AVE+ GRL +WVMNSTWS  TE
Sbjct: 883  ENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMNSTWSVQTE 942

Query: 816  EFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIPSLSVVQF 637
            +F+IP  DC+S  I+ELK+IPK + LV+GH+GFG+F LWDIS RILIS+F++PS+S+ +F
Sbjct: 943  DFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAMPSISIFEF 1002

Query: 636  LPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEH-----CMKLSLKDMAIWLLVST 490
            +PISL  ++S+  +S+      HIN      K+W S+H      + L  + +A+WLLVST
Sbjct: 1003 IPISLFSFQSEVPLSSNPDVDLHINKIMAATKMWFSKHNENYTFLPLGGESIAVWLLVST 1062

Query: 489  -GPNCKALEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITTSDG 313
               +    +   +DCQ N  G WRLALLVKN VILGS LDPRA AIGAS GHGII T DG
Sbjct: 1063 LSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGHGIIGTHDG 1122

Query: 312  LVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKLLIYLH 157
            LVYMWEL TG +LG+LHYFK GGVSCI TD +S+S V AVAGDGG+LL+YLH
Sbjct: 1123 LVYMWELSTGTKLGSLHYFK-GGVSCIATD-DSRSDVFAVAGDGGQLLVYLH 1172


>ref|XP_002312290.1| hypothetical protein POPTR_0008s09730g [Populus trichocarpa]
            gi|222852110|gb|EEE89657.1| hypothetical protein
            POPTR_0008s09730g [Populus trichocarpa]
          Length = 1312

 Score =  550 bits (1417), Expect = e-153
 Identities = 299/612 (48%), Positives = 383/612 (62%), Gaps = 21/612 (3%)
 Frame = -2

Query: 1926 LSVCHDASDVNLRDKVGYAGILQKEINTKIRESIICGNSGVGCVPE--------EILQMS 1771
            LS      +V  R KV  A    ++ N    ESIIC N     +PE        E+ QMS
Sbjct: 700  LSTAQVTKNVYTRKKVSKAASSTRKCNASFSESIICRNLRDDSIPETTRTLLNSEMFQMS 759

Query: 1770 SSENIPNKKVAVDAEARFPGQLHGLCTENTTPNPKAVFYND-PVISHNKSFSCASENKDT 1594
            SS + P+K     +E     QL+G+  + TT NP  +  +  P +S  ++FS AS  KD 
Sbjct: 760  SSVDKPHKNAIFGSEPMVGDQLNGMQIDETTSNPNPLSESKLPFVSQTQTFSGASMGKDA 819

Query: 1593 ADFLSPPVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDS 1414
            ++  +  VS++E+ +      L   QN            +LGT      TS   EV T+S
Sbjct: 820  SNLFAATVSKIEEPHAYSEGRLVVSQNTSDTNGPPVLSAELGTAFSCYNTSSVKEVQTNS 879

Query: 1413 DLKPHMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFI 1234
            DLK H N   NN+L+   +LVGCY+HPMPVLS+ + TK  EI +C LCG +VD NR LF+
Sbjct: 880  DLKLHRNLKHNNELEGNFELVGCYLHPMPVLSLLVVTKGDEINVCALCGHLVDKNRTLFL 939

Query: 1233 YKLSIEGPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPY 1054
            YKL+IE    G PSF GH S+TFP S D FGR+  L+RSGLQ TPDGQ LVLL S+K PY
Sbjct: 940  YKLAIEETRTGNPSFVGHTSVTFPFSTDIFGRETALERSGLQLTPDGQNLVLLGSMKTPY 999

Query: 1053 CREQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVE 874
            CRE +  CLCS C+ +C E + VKIVQVK GY++V+ KL T +S+ C+LVCEPNHLIA  
Sbjct: 1000 CREGRTDCLCSTCSLNCSEQSTVKIVQVKTGYVSVLVKLSTFDSMQCILVCEPNHLIAAG 1059

Query: 873  DGGRLRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDI 694
            + GRL LW MNS WSAPTEEF+I  +DC+S  I+ELK++P  + +V+G+NGFG+F +WD+
Sbjct: 1060 ESGRLHLWTMNSAWSAPTEEFIISANDCISPCIVELKRVPNCASVVVGNNGFGEFTVWDV 1119

Query: 693  SNRILISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMK 532
            S R+ +++ S PS S  QF PIS   W+      +YS+  E I+      KLW SE+   
Sbjct: 1120 SRRMFMARVSSPSASACQFFPISSFTWQRVVHGFHYSTVEEQIDGIVDATKLWFSENSEY 1179

Query: 531  LSL-----KDMAIWLLVSTGPNCKALEKY-ESDCQLNESGCWRLALLVKNRVILGSPLDP 370
             SL     +D+AIWLLVST P     E Y  SDC +N  G WRLALLVKN +ILG  LDP
Sbjct: 1180 YSLPPLDGEDIAIWLLVSTIPELDTQEDYISSDCGINPVGWWRLALLVKNMLILGKALDP 1239

Query: 369  RAVAIGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVA 190
            RA AIG+S G+GII T DGLVYMWE  TG  LGTLH+F+G  VSCI TD  SK GV++VA
Sbjct: 1240 RAAAIGSSSGNGIIGTFDGLVYMWEFTTGTRLGTLHHFEGESVSCIATD-NSKPGVISVA 1298

Query: 189  GDGGKLLIYLHS 154
            GD G+LL+Y  S
Sbjct: 1299 GDKGQLLVYRRS 1310


>ref|XP_007227250.1| hypothetical protein PRUPE_ppa017973mg [Prunus persica]
            gi|462424186|gb|EMJ28449.1| hypothetical protein
            PRUPE_ppa017973mg [Prunus persica]
          Length = 1170

 Score =  528 bits (1361), Expect = e-147
 Identities = 295/578 (51%), Positives = 378/578 (65%), Gaps = 16/578 (2%)
 Frame = -2

Query: 1839 IRESIICGNSGVGCVPE-----------EILQMSSSENIPNKKVAVDAEARFPGQLHGLC 1693
            + ESIIC NSG  C+PE           E LQM SS++   K  +  AEA+       L 
Sbjct: 599  LSESIICRNSGDICLPESYPSAETLLALETLQMGSSDDNLYKD-SFCAEAKTVEHSSCLN 657

Query: 1692 TENTTPNPKAVFYND-PVISHNKSFSCASENKDTADFLSPPVSRVEKSNNGVGHELAKQQ 1516
             +  + N K +     P +   ++   AS+ KDT   L   VSR+E   N V  ++   +
Sbjct: 658  ADKPSVNSKGLLNGHCPAVLQEQALVGASKEKDTLCSLDLSVSRLE---NHVDKDVVGHE 714

Query: 1515 NLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNNDLKDFVKLVGCYVH 1336
            NL++   ++   QK GT L      D N V   SD KPH  + LNN+L   ++ VG Y H
Sbjct: 715  NLLEPNDTETS-QKQGTGLMH----DPNSVPHSSDSKPHSME-LNNELTGSLEFVGRYSH 768

Query: 1335 PMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGCPSFAGHASLTFPIS 1156
              PVLSV L+ K  EI +CVLCG +VD + +LFIYK++IE P  GCPSF GH S+T PI 
Sbjct: 769  QNPVLSVLLSAKGTEIYVCVLCGPLVDKDGSLFIYKVAIEEPRVGCPSFVGHTSVTLPIR 828

Query: 1155 KDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSACTSDCFEDNAVKIV 976
            KD FGR + L+RS LQFTPDGQ LVLL+S+K PYCR+  IHCLCS CTS+C E+N VKIV
Sbjct: 829  KDYFGR-IALERSSLQFTPDGQYLVLLDSIKTPYCRQGSIHCLCSTCTSNCSEENTVKIV 887

Query: 975  QVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNSTWSAPTEEFVIPIS 796
            QV+LGY++ +A LK  +S+ C+LVCEPN+L+AV + GRL LWVMNSTWSA  E FV+P  
Sbjct: 888  QVRLGYVSKVASLKAVDSLECILVCEPNNLVAVGESGRLHLWVMNSTWSAQIENFVLPAE 947

Query: 795  DCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIPSLSVVQFLPISLID 616
            DC+S  I+ELK+IP  +H+V+GHNGFG+F LWDIS  IL+S+FS  S S+ QF+P+SL  
Sbjct: 948  DCISPGIVELKRIPNCTHIVVGHNGFGEFSLWDISKCILVSRFSAASSSICQFVPVSLFT 1007

Query: 615  WKSKGLVSNYSSAGEHINKLWLSEHCMKLSL--KDMAIWLLVSTGPNCKALEKYES-DCQ 445
            W+ K  VS+YS   EHIN+L  +    + SL  +D+A+WLLVS+  +  A + Y S DC 
Sbjct: 1008 WRIKCPVSSYSDIEEHINELVAATSNNQFSLEGEDIAVWLLVSSSSDSDAQQDYVSDDCD 1067

Query: 444  LNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITTSDGLVYMWELYTGVELGTL 265
             N  G WRLAL+VKN VI GS LDPRA  IGAS G GI  T DGLVYMWEL TG + G +
Sbjct: 1068 SNPMGRWRLALMVKNMVIFGSALDPRAAVIGASAGQGICGTCDGLVYMWELSTGNKFGAM 1127

Query: 264  HYFKGGGVSCIVTDEESKS-GVLAVAGDGGKLLIYLHS 154
            H+FKGG VSCI TD+   S G +AVAGD  +LL++LHS
Sbjct: 1128 HHFKGGSVSCIATDDSRPSPGAVAVAGD-NQLLVFLHS 1164


>ref|XP_002512056.1| conserved hypothetical protein [Ricinus communis]
            gi|223549236|gb|EEF50725.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1246

 Score =  524 bits (1350), Expect = e-146
 Identities = 304/660 (46%), Positives = 390/660 (59%), Gaps = 9/660 (1%)
 Frame = -2

Query: 2124 SFVPSFEHITSVVPDSYEDDQCEQHVISQVPLSFXXXXXXXXXXXXXETFAPDNLRLFGS 1945
            S V S  H  S++ D ++ DQC     + V                    +  N  +   
Sbjct: 523  SVVRSCGHTNSIILDKFDGDQCLGASAASVEA-----------LGSSLQLSRTNTLVKDG 571

Query: 1944 IKARKELSVCHDASDVNLRDKVGYAGILQKEINTKIRESIICGNSGVGCVPEEILQMSSS 1765
                  +S       V  R KV  A    ++ N  + +S+ C      C+ E    +  S
Sbjct: 572  ASEISNISSSQVPEKVYTRRKVLNAEPTARKHNPPLLKSLGCRRLSDACILETTGTLLDS 631

Query: 1764 ENIPNKKVAVDAEARFPGQLHGLCTENTTPNPKAVFYNDP-VISHNKSFSCASENKDTAD 1588
            E   +K      +AR    LH L T+ T  N      +   + S  ++  CA E  DT++
Sbjct: 632  EPFNDKNEVFYEDARVGRNLHVLPTDKTAVNSNPALESPVHITSVTQANICALEGHDTSN 691

Query: 1587 FLSPPVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDL 1408
             + P +S VEK  +     L   +N +       Q +  G   FDK TS A E   +S++
Sbjct: 692  IVVPSMSDVEKPLH-FEERLVGLKNTLDINGLGSQEEGKG---FDK-TSSAQE--GNSEI 744

Query: 1407 KPHMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYK 1228
                N  L N+L   V+ +GCY HPMPVLS+ +  K +EI ICVLCGL+V+ +R LF+YK
Sbjct: 745  MRQWNSELTNELDGIVEFLGCYFHPMPVLSLLVRRKGNEIYICVLCGLLVEKDRTLFLYK 804

Query: 1227 LSIEGPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCR 1048
            L+IEGP  GCP F GH S+T+P S   FGR++  +RSGLQ TPDGQCLVLL S +AP CR
Sbjct: 805  LAIEGPRIGCPCFIGHTSVTWPSSTGIFGREISFERSGLQLTPDGQCLVLLGSTRAPCCR 864

Query: 1047 EQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDG 868
            E ++ CLCSAC SDCF  N VKIVQVK GY++V+ KLKT++S+ C+LVCEP+HL+A  + 
Sbjct: 865  EGRLECLCSACASDCFGSNGVKIVQVKAGYVSVLVKLKTNDSLQCILVCEPDHLVAAGEN 924

Query: 867  GRLRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISN 688
             RL LW MNS WSAPTEEF I  +D  S  IMELK+IPK + LVIGH+GFG+F LWDIS 
Sbjct: 925  SRLHLWTMNSVWSAPTEEFTIQSNDYTSPCIMELKRIPKCTSLVIGHDGFGEFTLWDISK 984

Query: 687  RILISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHINKL-----WLSEHCMKLSL 523
            RI +SKFS PS SV QF PISL  W+ +    +YS+   H+N+L       S H +  SL
Sbjct: 985  RIFVSKFSSPSNSVHQFSPISLFHWQREVHGLSYSNVEAHVNRLMDATKMFSGHSINHSL 1044

Query: 522  --KDMAIWLLVSTGPNCKALEKY-ESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIG 352
              +D+AIW LVST P+  AL  Y  S  Q+N  G WRLALL+KN +ILGS LDPRA AIG
Sbjct: 1045 PHEDIAIWFLVSTAPDSDALHDYGSSHSQINPVGYWRLALLMKNSLILGSALDPRAAAIG 1104

Query: 351  ASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKL 172
             S GHGII T DGLVYMWEL TG +LGTLH FKGG  SCI TD +S SGVLA+A D G++
Sbjct: 1105 TSAGHGIIGTLDGLVYMWELLTGKKLGTLHKFKGGSASCIATD-DSGSGVLAIADDKGEI 1163


>ref|XP_007045807.1| Histone-lysine N-methyltransferase ATX1, putative isoform 1
            [Theobroma cacao] gi|508709742|gb|EOY01639.1|
            Histone-lysine N-methyltransferase ATX1, putative isoform
            1 [Theobroma cacao]
          Length = 1329

 Score =  498 bits (1282), Expect = e-138
 Identities = 286/615 (46%), Positives = 373/615 (60%), Gaps = 32/615 (5%)
 Frame = -2

Query: 1899 VNLRDKVGYAGILQKEINTKIRESIICGNSGVGCVPE-------EILQMS--SSENIPNK 1747
            V  R KV       ++    + ESII  N+G    P         ++  S  SS+  P  
Sbjct: 715  VYTRKKVSKQAYSTRKYTGPLSESIIYRNTGDDYAPNVSATTGISLVSKSCHSSDEKPCN 774

Query: 1746 KVAVDAEARFPGQLHGLCTENTTPNPKAVFYNDPVI--SHNKSFSCASENKDTADFLSPP 1573
            +   DA     GQ +GL  E TT N K    N P +  + N+   CAS+ KD +  L P 
Sbjct: 775  RDICDATDMLEGQSYGLPVEKTTTNCKPEMSNMPPVLSNRNQKLVCASKAKDASYLLVPS 834

Query: 1572 VSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHMN 1393
            VS           E  + ++ V+        Q   T++FD   S A EV   SD+    +
Sbjct: 835  VSLERGFQENCHKERLEHRSTVE-NGCPASCQNQVTSVFDTNRSKAREVQGSSDVNHCRD 893

Query: 1392 KVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEG 1213
              LN DL+  V LVG Y HP+P+ SV+L TK +EI ICVLCGL+VD +R LF+Y++SIE 
Sbjct: 894  VELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICVLCGLLVDKDRTLFLYRVSIEE 953

Query: 1212 PGAGCPSFAGHASLTFPISKDAFGRKVVL----------DRSGLQFTPDGQCLVLLNSVK 1063
            P  GCPSF G+ S+T   S+ +FG ++            +R GLQFTPDGQCLVLL+ +K
Sbjct: 954  PSIGCPSFVGYTSVTLTFSEVSFGGRICCNSSAIFIIDSERCGLQFTPDGQCLVLLDGIK 1013

Query: 1062 APYCREQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLI 883
             PYCRE  I C+CS C+S C  +N VKIVQV  GY++++AKL+T  SV C+LVCE N+L+
Sbjct: 1014 TPYCREGIIDCICSICSSGCSNENGVKIVQVNHGYVSLVAKLETVESVQCILVCENNYLV 1073

Query: 882  AVEDGGRLRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGL 703
            A    GRL LWVMNSTWSA TEEF++P  DC+S  ++ELK+IPK + LVIGHNG G+F +
Sbjct: 1074 AAGTSGRLHLWVMNSTWSAWTEEFILPAGDCLSPCVVELKRIPKCARLVIGHNGIGEFVV 1133

Query: 702  WDISNRILISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEH 541
            WDI  R+++S+FS     + QFLPISL  W+    V +Y+     I+      K+  SEH
Sbjct: 1134 WDILKRLILSRFSASGNPIKQFLPISLFSWQP---VFSYADMNGRIDEIFTTTKILFSEH 1190

Query: 540  --CM--KLSLKDMAIWLLVSTGPNCK-ALEKYESDCQLNESGCWRLALLVKNRVILGSPL 376
              C    L  +D+A+WLL+ST  + +   E+  S+CQ N +  WRLALLVK+RVILGS L
Sbjct: 1191 KDCFFPPLEGEDIALWLLLSTVSDFEDQYERLPSNCQANPARSWRLALLVKDRVILGSTL 1250

Query: 375  DPRAVAIGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLA 196
            DPRA AIGAS  HGII   DGLVYMWEL TG  LG LH+FKGG VSCI TD + +  V+A
Sbjct: 1251 DPRAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLHHFKGGSVSCIATD-DLRPDVVA 1309

Query: 195  VAGDGGKLLIYLHSR 151
            VA D G+LLIYLHS+
Sbjct: 1310 VAADDGQLLIYLHSQ 1324



 Score = 64.7 bits (156), Expect = 2e-07
 Identities = 67/274 (24%), Positives = 107/274 (39%), Gaps = 12/274 (4%)
 Frame = -2

Query: 2322 PLLXXXXXXXXXXXKPSETSPRVVKYRDDINRSVHIRDVRSPVRMLSENSRTEREEKMHI 2143
            PLL            P +  P VV  R +   + H+ ++ S   +L+E +  E++ +MH 
Sbjct: 440  PLLKESSKKKREIINPYKVLPHVVNSRVNNIETNHLLNLPSSAIILTEEAHAEQDRRMHT 499

Query: 2142 SIKDPESFVPSFEHITSVVPDSYEDDQCEQHVISQVPLSFXXXXXXXXXXXXXETFAPDN 1963
               D  S VP+ EH+ SV+ DS+EDDQ   HV  Q  +SF             +T+  + 
Sbjct: 500  QSIDHGSVVPNLEHVNSVILDSFEDDQGGDHVAKQA-VSFSKSVEVDQTSFNKDTYHSNI 558

Query: 1962 LRLFGSIKARKELSVCHDASDVNLRDKVGYAGILQKEINTKIRE-----SIICGNSGVGC 1798
                 SI  ++E S C D    N         I  KE+N  + +      I    S  G 
Sbjct: 559  QEQLVSINVKQETSDCCDEISEN------QDTICHKEVNMALNKKPHGSDITMSESASGH 612

Query: 1797 VPEEILQMSSSENIPNKKVAVDAEARFPGQLHGLCTENTTPNPKAVFYNDPVISHNKSFS 1618
            V   ++  + SE+I                  G+C  N   N   +  +        + +
Sbjct: 613  V--SLIMKAFSEDI-----------------QGVCV-NLDENSADIENHSMEKKPKNALN 652

Query: 1617 CASENKDTADF-------LSPPVSRVEKSNNGVG 1537
            CA  N+D  DF       +S  V   + + +G+G
Sbjct: 653  CAKVNRDFYDFQLDANNHVSAAVDTNDNNPSGIG 686


>ref|XP_007045808.1| Histone-lysine N-methyltransferase ATX1, putative isoform 2
            [Theobroma cacao] gi|590698910|ref|XP_007045809.1|
            Histone-lysine N-methyltransferase ATX1, putative isoform
            2 [Theobroma cacao] gi|508709743|gb|EOY01640.1|
            Histone-lysine N-methyltransferase ATX1, putative isoform
            2 [Theobroma cacao] gi|508709744|gb|EOY01641.1|
            Histone-lysine N-methyltransferase ATX1, putative isoform
            2 [Theobroma cacao]
          Length = 1128

 Score =  494 bits (1273), Expect = e-137
 Identities = 284/605 (46%), Positives = 369/605 (60%), Gaps = 22/605 (3%)
 Frame = -2

Query: 1899 VNLRDKVGYAGILQKEINTKIRESIICGNSGVGCVPE-------EILQMS--SSENIPNK 1747
            V  R KV       ++    + ESII  N+G    P         ++  S  SS+  P  
Sbjct: 530  VYTRKKVSKQAYSTRKYTGPLSESIIYRNTGDDYAPNVSATTGISLVSKSCHSSDEKPCN 589

Query: 1746 KVAVDAEARFPGQLHGLCTENTTPNPKAVFYNDPVI--SHNKSFSCASENKDTADFLSPP 1573
            +   DA     GQ +GL  E TT N K    N P +  + N+   CAS+ KD +  L P 
Sbjct: 590  RDICDATDMLEGQSYGLPVEKTTTNCKPEMSNMPPVLSNRNQKLVCASKAKDASYLLVPS 649

Query: 1572 VSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHMN 1393
            VS           E  + ++ V+        Q   T++FD   S A EV   SD+    +
Sbjct: 650  VSLERGFQENCHKERLEHRSTVE-NGCPASCQNQVTSVFDTNRSKAREVQGSSDVNHCRD 708

Query: 1392 KVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEG 1213
              LN DL+  V LVG Y HP+P+ SV+L TK +EI ICVLCGL+VD +R LF+Y++SIE 
Sbjct: 709  VELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICVLCGLLVDKDRTLFLYRVSIEE 768

Query: 1212 PGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIH 1033
            P  GCPSF G+ S+T   S+      +  +R GLQFTPDGQCLVLL+ +K PYCRE  I 
Sbjct: 769  PSIGCPSFVGYTSVTLTFSE------IDSERCGLQFTPDGQCLVLLDGIKTPYCREGIID 822

Query: 1032 CLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRL 853
            C+CS C+S C  +N VKIVQV  GY++++AKL+T  SV C+LVCE N+L+A    GRL L
Sbjct: 823  CICSICSSGCSNENGVKIVQVNHGYVSLVAKLETVESVQCILVCENNYLVAAGTSGRLHL 882

Query: 852  WVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILIS 673
            WVMNSTWSA TEEF++P  DC+S  ++ELK+IPK + LVIGHNG G+F +WDI  R+++S
Sbjct: 883  WVMNSTWSAWTEEFILPAGDCLSPCVVELKRIPKCARLVIGHNGIGEFVVWDILKRLILS 942

Query: 672  KFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEH--CM--KLSL 523
            +FS     + QFLPISL  W+    V +Y+     I+      K+  SEH  C    L  
Sbjct: 943  RFSASGNPIKQFLPISLFSWQP---VFSYADMNGRIDEIFTTTKILFSEHKDCFFPPLEG 999

Query: 522  KDMAIWLLVSTGPNCK-ALEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIGAS 346
            +D+A+WLL+ST  + +   E+  S+CQ N +  WRLALLVK+RVILGS LDPRA AIGAS
Sbjct: 1000 EDIALWLLLSTVSDFEDQYERLPSNCQANPARSWRLALLVKDRVILGSTLDPRAAAIGAS 1059

Query: 345  DGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKLLI 166
              HGII   DGLVYMWEL TG  LG LH+FKGG VSCI TD + +  V+AVA D G+LLI
Sbjct: 1060 FDHGIIGRDDGLVYMWELSTGTRLGVLHHFKGGSVSCIATD-DLRPDVVAVAADDGQLLI 1118

Query: 165  YLHSR 151
            YLHS+
Sbjct: 1119 YLHSQ 1123



 Score = 62.8 bits (151), Expect = 7e-07
 Identities = 55/202 (27%), Positives = 85/202 (42%), Gaps = 5/202 (2%)
 Frame = -2

Query: 2322 PLLXXXXXXXXXXXKPSETSPRVVKYRDDINRSVHIRDVRSPVRMLSENSRTEREEKMHI 2143
            PLL            P +  P VV  R +   + H+ ++ S   +L+E +  E++ +MH 
Sbjct: 273  PLLKESSKKKREIINPYKVLPHVVNSRVNNIETNHLLNLPSSAIILTEEAHAEQDRRMHT 332

Query: 2142 SIKDPESFVPSFEHITSVVPDSYEDDQCEQHVISQVPLSFXXXXXXXXXXXXXETFAPDN 1963
               D  S VP+ EH+ SV+ DS+EDDQ   HV  Q  +SF             +T+  + 
Sbjct: 333  QSIDHGSVVPNLEHVNSVILDSFEDDQGGDHVAKQA-VSFSKSVEVDQTSFNKDTYHSNI 391

Query: 1962 LRLFGSIKARKELSVCHDASDVNLRDKVGYAGILQKEINTKIRE-----SIICGNSGVGC 1798
                 SI  ++E S C D    N         I  KE+N  + +      I    S  G 
Sbjct: 392  QEQLVSINVKQETSDCCDEISEN------QDTICHKEVNMALNKKPHGSDITMSESASGH 445

Query: 1797 VPEEILQMSSSENIPNKKVAVD 1732
            V   ++  + SE+I    V +D
Sbjct: 446  V--SLIMKAFSEDIQGVCVNLD 465


>ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628159 [Citrus sinensis]
          Length = 1252

 Score =  488 bits (1257), Expect = e-135
 Identities = 285/600 (47%), Positives = 364/600 (60%), Gaps = 22/600 (3%)
 Frame = -2

Query: 1884 KVGYAGILQKEINTKIRESIICGN-----------SGVGCVPEEILQMSSSENIPNKKVA 1738
            KV     L K+ +    ESIIC N           +    +  EI QM SS+  P ++  
Sbjct: 665  KVSKRAPLMKKFDGPFSESIICRNFIDDHVAKQQHTAETLLASEISQMRSSDYKPRRE-N 723

Query: 1737 VDAEARFPGQLHGLCTENTTPNPKAVFYNDPVISHNKSFSCASENKDTADFLSPPVSRVE 1558
             DA AR   ++   C               PVIS N +  CA+++KD  +   P    ++
Sbjct: 724  FDA-ARDLLEVKSCCL--------------PVISKNSTVFCATKDKDFHNSFDPSTLHMK 768

Query: 1557 KSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNN 1378
                  G EL +Q N  +F SS    QK   +  +  +S+A E    SDLK   N    N
Sbjct: 769  NLKANSGKELDEQLNFAEFNSSVVS-QKQEISGCEYTSSNAKESQVSSDLKLQKNVECIN 827

Query: 1377 DLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGC 1198
            +L     L+GCY  P+P+LSV L+T   +I +CV CG +VD  R LFIY + I+ P  G 
Sbjct: 828  ELAGTFDLMGCYFFPLPILSVLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGN 887

Query: 1197 PSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSA 1018
            PS  GH S+  P  KD FGR++ L+RS   FTPDGQ LVLL+S+K PYCRE +  CLCS 
Sbjct: 888  PSCVGHTSVMLPFLKDNFGREIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCST 947

Query: 1017 CTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNS 838
            CTS   ++NAVKIV+VK GY++V+AKLKTD+ V C+LVCEP HLIAV + G+L LW MNS
Sbjct: 948  CTSHRLDENAVKIVKVKPGYVSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNS 1007

Query: 837  TWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIP 658
            +WSA  EE +IPI+DC+   I+E+K+IPK + LV+GHNGFG+FG+WDIS R+L+S+FS  
Sbjct: 1008 SWSAQVEECIIPINDCIYPCIVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAA 1067

Query: 657  SLSVVQFLPISLIDWKSKGLVSNYSS--AGEHINKLWLSEHCMKLSL-----KDMAIWLL 499
              S+ QF PI+L  W+  G VS  +S            S+H  K S      +D AIWLL
Sbjct: 1068 RASIYQFFPINLFSWQRNGSVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLL 1127

Query: 498  VSTGPNCKALEKYES-DCQLNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITT 322
            VST  +  A     S DCQ N    WRLALLVKNRVILGSPLDPRA AIGAS G GII T
Sbjct: 1128 VSTISDSDAQHNCMSRDCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGT 1187

Query: 321  SDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAG---DGGKLLIYLHSR 151
            +DGLVY WEL +G +LG LH+FKGG VSCI TD +S    LAVAG   DGG+LL+YLH++
Sbjct: 1188 NDGLVYAWELSSGNKLGILHHFKGGTVSCIATD-DSGLQALAVAGDGPDGGQLLVYLHAQ 1246


>ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, partial [Citrus clementina]
            gi|557540080|gb|ESR51124.1| hypothetical protein
            CICLE_v10033741mg, partial [Citrus clementina]
          Length = 1177

 Score =  456 bits (1174), Expect = e-125
 Identities = 264/563 (46%), Positives = 338/563 (60%), Gaps = 19/563 (3%)
 Frame = -2

Query: 1884 KVGYAGILQKEINTKIRESIICGN-----------SGVGCVPEEILQMSSSENIPNKKVA 1738
            KV     L K+ +    ESIIC N           +    +  EI QM SS+  P ++  
Sbjct: 632  KVSKRAPLMKKFDGPFSESIICRNFIDDHVAKQQHTAETLLASEISQMRSSDYKPQRE-N 690

Query: 1737 VDAEARFPGQLHGLCTENTTPNPKAVFYNDPVISHNKSFSCASENKDTADFLSPPVSRVE 1558
             DA AR   ++   C               PVIS N +  CA+++KD  +   P    ++
Sbjct: 691  FDA-ARDLLEVKSCCL--------------PVISKNSTVFCATKDKDFHNSFDPSTLHMK 735

Query: 1557 KSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNN 1378
            KS    G EL +Q N  +F SS    QK   +  +  +S+A E    SDLK   N    N
Sbjct: 736  KSKANSGKELDEQLNFAEFNSSVVS-QKQEISGCEYTSSNAKESQVSSDLKLQKNVECIN 794

Query: 1377 DLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGC 1198
            +L     L+GCY  P+P+LSV L+T   +I +CV CG +VD  R LFIY + I+ P  G 
Sbjct: 795  ELAGTFDLMGCYFFPLPILSVLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGN 854

Query: 1197 PSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSA 1018
            PS  GH S+  P  KD FGR++ L+RS   FTPDGQ LVLL+S+K PYCRE +  CLCS 
Sbjct: 855  PSCVGHTSVMLPFLKDNFGREIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCST 914

Query: 1017 CTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNS 838
            CTS   ++NAVKIV+V  GY++V+AKLKTD+ V C+LVCEP HLIAV + G+L LW MNS
Sbjct: 915  CTSHRLDENAVKIVKVNPGYVSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNS 974

Query: 837  TWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIP 658
            +WSA  EE +IPI+DC+   I+E+K+IPK + LV+GHNGFG+FG+WDIS R+L+S+FS  
Sbjct: 975  SWSAQVEECIIPINDCIYPCIVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAA 1034

Query: 657  SLSVVQFLPISLIDWKSKGLVSNYSS--AGEHINKLWLSEHCMKLSL-----KDMAIWLL 499
              S+ QF PI+L  W+  G VS  +S            S+H  K S      +D AIWLL
Sbjct: 1035 RASIYQFFPINLFSWQRNGSVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLL 1094

Query: 498  VSTGPNCKALEKYES-DCQLNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITT 322
            VST  +  A     S DCQ N    WRLALLVKNRVILGSPLDPRA AIGAS G GII T
Sbjct: 1095 VSTISDSDAQHNCMSRDCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGT 1154

Query: 321  SDGLVYMWELYTGVELGTLHYFK 253
            +DGLVY WEL +G +LG LH+FK
Sbjct: 1155 NDGLVYAWELSSGNKLGILHHFK 1177


>ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305752 [Fragaria vesca
            subsp. vesca]
          Length = 1259

 Score =  441 bits (1133), Expect = e-121
 Identities = 251/564 (44%), Positives = 354/564 (62%), Gaps = 5/564 (0%)
 Frame = -2

Query: 1839 IRESIICGNSGVGCVPE-EILQMSSSENIPNKKVAVDAEARFPGQLHGLCTENTTPNPKA 1663
            + E+IIC N+     P  E LQ+ S ++  NK+ ++ AEAR  G    L  +  + N K+
Sbjct: 705  VSETIICKNNVPETYPSTETLQVGSDDS-SNKRDSICAEARIVGH-SSLNAKEPSMNSKS 762

Query: 1662 VFYND-PVISHNKSFSCASENKDTADFLSPPVSRVEKSNNGVGHELAKQQNLVKFKSSDP 1486
            V     P +   ++       KDT+      VS +E   N V  ++   +NL++F  S+ 
Sbjct: 763  VINGICPAVLQGQALLVGE--KDTSYSSDLSVSHLE---NQVDKKVVGNENLLQFIDSET 817

Query: 1485 QFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLN 1306
               K G +     + D N +   S+ KPH  K  NN L   ++ VGCY  P+PVLSV L+
Sbjct: 818  S-HKQGPSF----SYDPNSIPFSSNTKPH-KKEHNNGLAGILEFVGCYTQPVPVLSVLLS 871

Query: 1305 TKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGCPSFAGHASLTFPISKDAFGRKVVL 1126
            TK   I + VLCGL+V  + +LFIYK++IE P  G  S  GH SLT P   D +   + L
Sbjct: 872  TKGRYIYVSVLCGLLVGKDVSLFIYKVAIEEPMVGHSSLVGHTSLTLPDLTD-YSNGMAL 930

Query: 1125 DRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVM 946
            +R  LQF PDGQCLVLL+ ++ P+CR+ K HCLC+ C S C E++AVKIVQVKLGY++++
Sbjct: 931  ERFCLQFIPDGQCLVLLDKIRTPFCRQGKTHCLCTTCASSCSEEDAVKIVQVKLGYVSLV 990

Query: 945  AKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNSTWSAPTEEFVIPISDCMSSHIMEL 766
             +LK   S  C+LVCEPN+L++V   GRL LWVM+STWSA  E  V+P  DC+S  +++L
Sbjct: 991  TRLKAAQSQRCILVCEPNNLVSVGKSGRLHLWVMDSTWSAQMEYIVMPSEDCISPGVVDL 1050

Query: 765  KKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIPSLSVVQFLPISLIDWKSKGLVSNY 586
            K+IP  +HL++GHNG+G+F LWDI+  I +S+FS PS S+ QF+PISL  W+     S++
Sbjct: 1051 KRIPNCTHLIVGHNGYGEFSLWDITKCIFVSRFSAPSGSICQFVPISLFAWQMNFHASSH 1110

Query: 585  SSAGEHINKLW--LSEHCMKLSLKDMAIWLLVSTGPNCKALEKYE-SDCQLNESGCWRLA 415
                EH+N++   +S+       +D+AI LLV +  +  A   YE  +C  N  G WRLA
Sbjct: 1111 FEMEEHVNQMMASISKTLSSYEGEDVAICLLVLSS-DSDAQHDYELGNCHPNPVGRWRLA 1169

Query: 414  LLVKNRVILGSPLDPRAVAIGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSC 235
            L+VKN VILG+ LD RA  IGAS G GI  T DGLVY WEL +G +LGT+H+FKGG VSC
Sbjct: 1170 LMVKNIVILGTALDSRASVIGASAGQGICGTCDGLVYTWELSSGTKLGTMHHFKGGSVSC 1229

Query: 234  IVTDEESKSGVLAVAGDGGKLLIY 163
            I ++++S+SG +A+AGD  ++L+Y
Sbjct: 1230 I-SNDDSRSGAVAIAGD-NQVLVY 1251


>ref|XP_006599179.1| PREDICTED: uncharacterized protein LOC100802319 isoform X2 [Glycine
            max]
          Length = 1115

 Score =  416 bits (1070), Expect = e-113
 Identities = 272/741 (36%), Positives = 391/741 (52%), Gaps = 19/741 (2%)
 Frame = -2

Query: 2322 PLLXXXXXXXXXXXKPSETSPRVVKYRDDINRSVHIRDVRSPVRMLSENSRTEREEKMHI 2143
            PLL           +PS+  P  V  +D+  +  +  DV     +++E +  E+ +K+H 
Sbjct: 429  PLLRTVSTDKEFTVRPSDMLPCQVNSKDE--QKGYSVDVLPSDVIMTEAAHGEQGQKIH- 485

Query: 2142 SIKDPESFVPSFEHITSVVPDSYEDDQCEQHVISQVPLSFXXXXXXXXXXXXXETFAPDN 1963
               D  S  P+FEH+ S+VPDS+E  QC+ +  +Q  LS                    +
Sbjct: 486  GCTDSHSNTPNFEHMRSIVPDSFEYSQCDDYKTNQEILSSDIVEAGRSSFNKEMC----S 541

Query: 1962 LRLFGSIKARKELSVCHDASDVNLRDKVGYAGILQKEINTKIRESIICGNSGVGCVPEEI 1783
             +L G       ++ CH AS ++ +D      +                     C+PE +
Sbjct: 542  QQLLGHDLTNGTIT-CH-ASGLDFKDMPQNCDV---------------------CIPESV 578

Query: 1782 LQMSSSENIPNKKVAVDAEARFPGQLHGLCTENTTPNPKAVFYNDPVISHNKSFSCASE- 1606
            L   S +++   + + DA           C  +   NP  VF +    S  K    A + 
Sbjct: 579  LDDMSPKDLIIYERSDDA-----------CL-HVKENPAHVFLS----SVQKDLPTAQDF 622

Query: 1605 -NKDTADF-LSPPVSRVEK---SNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATS 1441
               DTA   +  P  R +     +N V       QNL  F   +  F   GT        
Sbjct: 623  TGDDTAGLCVQTPQIRSDVLGGHSNLVDPNPTSSQNLTLFADENKCF---GTK------- 672

Query: 1440 DANEVHTDSDLKPHMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLV 1261
               EV   S+  P  N+ L N+L   VK VG Y+HPMPV S+FL+T++ EI +CVLCG +
Sbjct: 673  ---EVQLISEPMPLQNQELKNNLGSSVKFVGRYLHPMPVSSLFLSTREDEIHVCVLCGYL 729

Query: 1260 VDSNRALFIYKLSIEGPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLV 1081
                R LF YK++I  P  GCPS   H+S+  P  K  F ++ +++RSG+Q TP GQ +V
Sbjct: 730  TGQYRTLFTYKVAIAEPTLGCPSVMAHSSILLPDPKHNFIKETMVERSGVQLTPGGQYVV 789

Query: 1080 LLNSVKAPYCREQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVC 901
            L+ S+K P CRE KI C CS C S C E NA+KIVQV+ GY++V+  L+T ++VHC+LVC
Sbjct: 790  LIGSIKTPNCREGKIDCHCSTCKSVCSEKNALKIVQVEHGYVSVVTTLETVDNVHCILVC 849

Query: 900  EPNHLIAVEDGGRLRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNG 721
            EPN L++V + G+L++WVMNS WS   E F+IP    +S  IMELK++PK +HLV+GHN 
Sbjct: 850  EPNRLVSVGESGKLQVWVMNSKWSEKIEYFIIPADGSVSPGIMELKRVPKCTHLVVGHNS 909

Query: 720  FGDFGLWDISNRILISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHINK------ 559
             G+F LWDI+    ++ FS     V +F PISL  W++KG   +  +  E  +K      
Sbjct: 910  RGEFSLWDIAKCNCVTSFSALKSPVNEFFPISLFQWQTKGSGFSNVNIEEQADKLLEATN 969

Query: 558  LWLSEH---CMKLSL-KDMAIWLLVSTGPNCKALEKY---ESDCQLNESGCWRLALLVKN 400
            LW SE    C    + +D+A+WL VST  +  +   +    S   ++ +  WRLALL+KN
Sbjct: 970  LWYSEQRDICWFSPIEEDVAMWLFVSTTSDLDSCHNHVSTSSSYDIHTARSWRLALLMKN 1029

Query: 399  RVILGSPLDPRAVAIGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDE 220
             +I GSPLD R    G S G+GII+TSDG+VYMWEL  G +L TLH+F+ G V+C+ TD+
Sbjct: 1030 SIIFGSPLDLRTSGNGVSCGYGIISTSDGVVYMWELSKGSKLDTLHHFQDGNVTCVATDD 1089

Query: 219  ESKSGVLAVAGDGGKLLIYLH 157
                G L VAG  G+LL+YLH
Sbjct: 1090 --SRGALGVAGGRGELLLYLH 1108


>gb|EXB44873.1| hypothetical protein L484_026455 [Morus notabilis]
          Length = 1147

 Score =  412 bits (1058), Expect = e-112
 Identities = 209/408 (51%), Positives = 269/408 (65%), Gaps = 11/408 (2%)
 Frame = -2

Query: 1359 KLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGCPSFAGH 1180
            +L+GCY+HP+PVLS+ + T   +I ICVLCGL V+ +R LFIYK++ + P  G PSF GH
Sbjct: 729  ELIGCYLHPLPVLSLLVCTTGEDIHICVLCGLRVNKDRTLFIYKIATQEPRVGYPSFVGH 788

Query: 1179 ASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSACTSDCF 1000
             S+T P  KD FG+++ L+RSGLQ+TP GQ LVLL+ ++ PYCR+  I CLC AC S  F
Sbjct: 789  TSVTLPSLKDYFGKEIALERSGLQYTPGGQYLVLLDCIRTPYCRQGTIPCLCPACASGSF 848

Query: 999  EDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNSTWSAPT 820
            E++AVKIV+VKLGY++V+ KLKT  S+ CVLVCEPNHL+AV + GRL LWVMN  WSA T
Sbjct: 849  EEDAVKIVEVKLGYVSVVVKLKTLESLQCVLVCEPNHLVAVGESGRLHLWVMNPAWSAQT 908

Query: 819  EEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIPSLSVVQ 640
            E+F++P +D +S  I+ELK+IPK   LV+GHNGFG+F                   S+ +
Sbjct: 909  EQFILPANDLVSPGIVELKRIPKCVRLVVGHNGFGEF-------------------SLCE 949

Query: 639  FLPISLIDWKSKGLVSNYSSAGEHINK------LWLSEHCMKLSL----KDMAIWLLVST 490
            F P++L  WK KG      +   H+N+      +W SE     SL    +++A+WLLVS 
Sbjct: 950  FFPVALFGWKKKGHSFGDCNVHGHVNRMMAATNMWFSEQTNDDSLPLLEEEIAVWLLVSV 1009

Query: 489  GPNCKALEKYES-DCQLNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITTSDG 313
              +      Y S D      G WRLALLVKN VILG  LDP A AIGAS GHGII T DG
Sbjct: 1010 PSDSDDHHDYTSGDYHTKSVGWWRLALLVKNMVILGGALDPSAEAIGASAGHGIIGTCDG 1069

Query: 312  LVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKLL 169
            LVY+WE+ TG +LGTLH+F+G  VSCI TD+  K  V    G+G  LL
Sbjct: 1070 LVYIWEMSTGTKLGTLHHFRGSSVSCIATDDSKKGAVAISGGEGWSLL 1117


>ref|XP_006415912.1| hypothetical protein EUTSA_v10006590mg [Eutrema salsugineum]
            gi|557093683|gb|ESQ34265.1| hypothetical protein
            EUTSA_v10006590mg [Eutrema salsugineum]
          Length = 1207

 Score =  410 bits (1053), Expect = e-111
 Identities = 226/487 (46%), Positives = 304/487 (62%), Gaps = 12/487 (2%)
 Frame = -2

Query: 1581 SPPVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKP 1402
            S P S+VE     +G  L  Q   V   SS     K  T+  + + S+  E   +S+LK 
Sbjct: 727  SLPASKVENVQAHIGEALGIQ---VSEPSSTKSPNKENTS--ENSISNVPEFPVNSNLKL 781

Query: 1401 HMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLS 1222
            + +  +NN+++  V+L+G Y HPMPV +V L    +EI ICVL     D    LF+YK+S
Sbjct: 782  NRDVKINNEMEKTVELLGYYFHPMPVSTVSLQYVGNEIYICVLSFATEDRVSTLFMYKIS 841

Query: 1221 IEGPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQ 1042
             + P  G PS  GH     PI  D  GR   L+RS L FTPDGQ L+   ++K PYCR++
Sbjct: 842  AKSPTRGFPSVVGHTPAILPIVDDKSGRNRTLERSYLHFTPDGQHLIFTGNIKTPYCRQR 901

Query: 1041 KIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGR 862
            +I CLC  CTS  FE+NAV+IV+VK GY++++ KL+  +SV CV+VC+PN+LIAV   G 
Sbjct: 902  EIDCLCLTCTSASFEENAVRIVEVKAGYVSLVTKLQAVDSVQCVVVCDPNYLIAVVKSGN 961

Query: 861  LRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRI 682
            L  W MNS W   TEEFVI  + C+SS I+ELKKIPK  HL+IGHNG G+F +WDIS R 
Sbjct: 962  LIAWAMNSDWRGSTEEFVILANPCISSCIVELKKIPKCPHLIIGHNGIGEFTIWDISKRS 1021

Query: 681  LISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMKLSL- 523
            L+S+F  PS  + +F+P SL  W +   V N+S+  +H++      KLW S+     +L 
Sbjct: 1022 LVSRFVSPSNLIFEFIPTSLFAWHT---VHNHSTIEDHVDVILAATKLWFSKGVNNKTLV 1078

Query: 522  ----KDMAIWLLVSTGPNCKAL-EKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVA 358
                +D AIWLLVST P+  A+ ++ ES  +     CWRLALLV+N+VILGS LDPRA  
Sbjct: 1079 PAEVEDTAIWLLVSTDPDPDAICDRVESPAR-----CWRLALLVRNQVILGSQLDPRADV 1133

Query: 357  IGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGG 178
             G   GHG+  T DG VYMW+L TG +LG+LH FKG GVSCI +D+   SG + +A + G
Sbjct: 1134 AGTVSGHGVAGTLDGHVYMWDLSTGTKLGSLHDFKGQGVSCISSDD---SGNICIASEDG 1190

Query: 177  KLLIYLH 157
            +LL+Y H
Sbjct: 1191 QLLVYCH 1197


>gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thaliana]
          Length = 1196

 Score =  406 bits (1044), Expect = e-110
 Identities = 225/485 (46%), Positives = 303/485 (62%), Gaps = 12/485 (2%)
 Frame = -2

Query: 1575 PVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHM 1396
            P S+ E     +G EL  Q +  +  S++ Q+++   N  +K TS   E    S+LK + 
Sbjct: 726  PASKFEDCQANIGEELGIQVS--EPPSTESQYKE---NTSEKCTS-VQEFPASSNLKLNR 779

Query: 1395 NKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIE 1216
            +  +NN+++  V+L+GCY HPMPV SV L T  +EI I VL     D  R LF+YK+S E
Sbjct: 780  DVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVRTLFMYKMSAE 839

Query: 1215 GPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKI 1036
             P  G PS  GH     PI  D       L+ S L FTPDG  L+L  ++K PYCR+++ 
Sbjct: 840  APSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNIKTPYCRKRET 899

Query: 1035 HCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLR 856
             C C  CTS CFE+NAV+IVQVK G+++++ KL+ D+SV CV+VC+PN+LIA    G L 
Sbjct: 900  DCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNLIAAVKSGNLI 959

Query: 855  LWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILI 676
            +W MNS WS PTEE+VI  + C+SS IMELKKIPK  HLVIGHNG G+F +WDIS R L+
Sbjct: 960  VWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFTIWDISKRSLV 1019

Query: 675  SKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMKLSL--- 523
            S+F  PS  + +F+P SL  W     V ++S+  ++++      KLW S+     +L   
Sbjct: 1020 SRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKLWFSKGVNNKTLVPA 1076

Query: 522  --KDMAIWLLVSTGPNCKA-LEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIG 352
              KD AIWLLVST  +  A  ++ ES  +     CWRLALLVK+++ILGS LDPRA   G
Sbjct: 1077 EVKDTAIWLLVSTDLDSDAKCDRVESPVR-----CWRLALLVKDQLILGSQLDPRADVAG 1131

Query: 351  ASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKL 172
               GHG+  T DGLVYMW+L TG +LG+LH FKG  VSCI TD+   S  + +A + G+L
Sbjct: 1132 TISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD---SRNICIASEDGQL 1188

Query: 171  LIYLH 157
            L+Y H
Sbjct: 1189 LVYCH 1193


>gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis thaliana]
          Length = 554

 Score =  406 bits (1044), Expect = e-110
 Identities = 225/485 (46%), Positives = 303/485 (62%), Gaps = 12/485 (2%)
 Frame = -2

Query: 1575 PVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHM 1396
            P S+ E     +G EL  Q +  +  S++ Q+++   N  +K TS   E    S+LK + 
Sbjct: 84   PASKFEDCQANIGEELGIQVS--EPPSTESQYKE---NTSEKCTS-VQEFPASSNLKLNR 137

Query: 1395 NKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIE 1216
            +  +NN+++  V+L+GCY HPMPV SV L T  +EI I VL     D  R LF+YK+S E
Sbjct: 138  DVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVRTLFMYKMSAE 197

Query: 1215 GPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKI 1036
             P  G PS  GH     PI  D       L+ S L FTPDG  L+L  ++K PYCR+++ 
Sbjct: 198  APSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNIKTPYCRKRET 257

Query: 1035 HCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLR 856
             C C  CTS CFE+NAV+IVQVK G+++++ KL+ D+SV CV+VC+PN+LIA    G L 
Sbjct: 258  DCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNLIAAVKSGNLI 317

Query: 855  LWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILI 676
            +W MNS WS PTEE+VI  + C+SS IMELKKIPK  HLVIGHNG G+F +WDIS R L+
Sbjct: 318  VWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFTIWDISKRSLV 377

Query: 675  SKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMKLSL--- 523
            S+F  PS  + +F+P SL  W     V ++S+  ++++      KLW S+     +L   
Sbjct: 378  SRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKLWFSKGVNNKTLVPA 434

Query: 522  --KDMAIWLLVSTGPNCKA-LEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIG 352
              KD AIWLLVST  +  A  ++ ES  +     CWRLALLVK+++ILGS LDPRA   G
Sbjct: 435  EVKDTAIWLLVSTDLDSDAKCDRVESPVR-----CWRLALLVKDQLILGSQLDPRADVAG 489

Query: 351  ASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKL 172
               GHG+  T DGLVYMW+L TG +LG+LH FKG  VSCI TD+   S  + +A + G+L
Sbjct: 490  TISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD---SRNICIASEDGQL 546

Query: 171  LIYLH 157
            L+Y H
Sbjct: 547  LVYCH 551


>ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana]
            gi|332192557|gb|AEE30678.1| DNA binding protein
            [Arabidopsis thaliana]
          Length = 1194

 Score =  406 bits (1044), Expect = e-110
 Identities = 225/485 (46%), Positives = 303/485 (62%), Gaps = 12/485 (2%)
 Frame = -2

Query: 1575 PVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHM 1396
            P S+ E     +G EL  Q +  +  S++ Q+++   N  +K TS   E    S+LK + 
Sbjct: 724  PASKFEDCQANIGEELGIQVS--EPPSTESQYKE---NTSEKCTS-VQEFPASSNLKLNR 777

Query: 1395 NKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIE 1216
            +  +NN+++  V+L+GCY HPMPV SV L T  +EI I VL     D  R LF+YK+S E
Sbjct: 778  DVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVRTLFMYKMSAE 837

Query: 1215 GPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKI 1036
             P  G PS  GH     PI  D       L+ S L FTPDG  L+L  ++K PYCR+++ 
Sbjct: 838  APSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNIKTPYCRKRET 897

Query: 1035 HCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLR 856
             C C  CTS CFE+NAV+IVQVK G+++++ KL+ D+SV CV+VC+PN+LIA    G L 
Sbjct: 898  DCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNLIAAVKSGNLI 957

Query: 855  LWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILI 676
            +W MNS WS PTEE+VI  + C+SS IMELKKIPK  HLVIGHNG G+F +WDIS R L+
Sbjct: 958  VWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFTIWDISKRSLV 1017

Query: 675  SKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMKLSL--- 523
            S+F  PS  + +F+P SL  W     V ++S+  ++++      KLW S+     +L   
Sbjct: 1018 SRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKLWFSKGVNNKTLVPA 1074

Query: 522  --KDMAIWLLVSTGPNCKA-LEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIG 352
              KD AIWLLVST  +  A  ++ ES  +     CWRLALLVK+++ILGS LDPRA   G
Sbjct: 1075 EVKDTAIWLLVSTDLDSDAKCDRVESPVR-----CWRLALLVKDQLILGSQLDPRADVAG 1129

Query: 351  ASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKL 172
               GHG+  T DGLVYMW+L TG +LG+LH FKG  VSCI TD+   S  + +A + G+L
Sbjct: 1130 TISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD---SRNICIASEDGQL 1186

Query: 171  LIYLH 157
            L+Y H
Sbjct: 1187 LVYCH 1191


>ref|NP_173957.2| DNA binding protein [Arabidopsis thaliana]
            gi|332192556|gb|AEE30677.1| DNA binding protein
            [Arabidopsis thaliana]
          Length = 1189

 Score =  406 bits (1044), Expect = e-110
 Identities = 225/485 (46%), Positives = 303/485 (62%), Gaps = 12/485 (2%)
 Frame = -2

Query: 1575 PVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQKLGTNLFDKATSDANEVHTDSDLKPHM 1396
            P S+ E     +G EL  Q +  +  S++ Q+++   N  +K TS   E    S+LK + 
Sbjct: 719  PASKFEDCQANIGEELGIQVS--EPPSTESQYKE---NTSEKCTS-VQEFPASSNLKLNR 772

Query: 1395 NKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIE 1216
            +  +NN+++  V+L+GCY HPMPV SV L T  +EI I VL     D  R LF+YK+S E
Sbjct: 773  DVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVRTLFMYKMSAE 832

Query: 1215 GPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKI 1036
             P  G PS  GH     PI  D       L+ S L FTPDG  L+L  ++K PYCR+++ 
Sbjct: 833  APSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNIKTPYCRKRET 892

Query: 1035 HCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLR 856
             C C  CTS CFE+NAV+IVQVK G+++++ KL+ D+SV CV+VC+PN+LIA    G L 
Sbjct: 893  DCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNLIAAVKSGNLI 952

Query: 855  LWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILI 676
            +W MNS WS PTEE+VI  + C+SS IMELKKIPK  HLVIGHNG G+F +WDIS R L+
Sbjct: 953  VWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFTIWDISKRSLV 1012

Query: 675  SKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMKLSL--- 523
            S+F  PS  + +F+P SL  W     V ++S+  ++++      KLW S+     +L   
Sbjct: 1013 SRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDNVDMILAATKLWFSKGVNNKTLVPA 1069

Query: 522  --KDMAIWLLVSTGPNCKA-LEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIG 352
              KD AIWLLVST  +  A  ++ ES  +     CWRLALLVK+++ILGS LDPRA   G
Sbjct: 1070 EVKDTAIWLLVSTDLDSDAKCDRVESPVR-----CWRLALLVKDQLILGSQLDPRADVAG 1124

Query: 351  ASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKL 172
               GHG+  T DGLVYMW+L TG +LG+LH FKG  VSCI TD+   S  + +A + G+L
Sbjct: 1125 TISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD---SRNICIASEDGQL 1181

Query: 171  LIYLH 157
            L+Y H
Sbjct: 1182 LVYCH 1186


>ref|XP_007156394.1| hypothetical protein PHAVU_003G282800g [Phaseolus vulgaris]
            gi|561029748|gb|ESW28388.1| hypothetical protein
            PHAVU_003G282800g [Phaseolus vulgaris]
          Length = 1211

 Score =  403 bits (1036), Expect = e-109
 Identities = 258/697 (37%), Positives = 373/697 (53%), Gaps = 38/697 (5%)
 Frame = -2

Query: 2133 DPESFVPSFEHITSVVPDSYEDDQCEQHVISQVPLSFXXXXXXXXXXXXXETFAPDNLRL 1954
            DP S   + EH+  +VPDS+E  +C+ +  +Q  LS                      R 
Sbjct: 542  DPHSNTLNSEHMKCIVPDSFEYSECDDYKTNQEKLSSDLAEAG---------------RS 586

Query: 1953 FGSIKARKELSVCHDASDVNLRDKVGYAGI----LQKEINTKIRESIICGNS-------- 1810
              +I+   +  + HD    N+  K   +GI      +  +  I ES++   S        
Sbjct: 587  SFNIEMGSQQLLGHDMP--NITSKTHASGIDFEDSPRNFDVCIPESVLDDMSPKDQVNSE 644

Query: 1809 -------GVGCVPEEILQMSSSENIPNKKVAVDAEAR-FPGQLHGLCTENTTPNPKAVFY 1654
                   GV   P  +    + ++ P  +      +  F G    L T         +  
Sbjct: 645  RRDDDYSGVKENPAHVSLSPAQKDFPTAQDFTGGVSNAFSGDKFKLVTTQMYTTKDTLHS 704

Query: 1653 NDPV-ISHNKSFSCASENKDTADFLSPPVSR---VEKSNNGVGHELAKQQNLVKFKSSDP 1486
            ++ + IS++    C  ++       +P   R   +E SN  V   LA  QN  +F   + 
Sbjct: 705  SEIILISNSNDKPCEPDDAAGLCVQTPQTCRDVLIEHSNI-VEQSLAPSQNPTQFAEENK 763

Query: 1485 QFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLN 1306
             F   GT           E    S+  P  N+ L ++L   VK VGCY+HPMPV S+FL+
Sbjct: 764  CF---GTK----------EAQLISEPMPLQNEELKSNLGSSVKFVGCYLHPMPVSSLFLS 810

Query: 1305 TKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGCPSFAGHASLTFPISKDAFGRKVVL 1126
            TK+ E+ ICVLCG + D  R LF YK++I  P  G PS   H+S+  P  K  F ++ ++
Sbjct: 811  TKEDEVHICVLCGHLTDQYRTLFTYKVAITEPTLGYPSVMAHSSILLPDPKHNFIKETMV 870

Query: 1125 DRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSACTSDCFEDNAVKIVQVKLGYITVM 946
            +RSG+Q TP GQ +VL+ S+KAP CRE KI C CS CTS  +E NA+KIVQV+ GY++V+
Sbjct: 871  ERSGVQLTPGGQYIVLIGSIKAPNCREGKIDCSCSTCTSVFYEKNALKIVQVEHGYVSVV 930

Query: 945  AKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNSTWSAPTEEFVIPISD-CMSSHIME 769
              L+T ++VHC+LVCEPN L++V + G+L +WVMNS WS  TE F+IP  D   S  I+E
Sbjct: 931  TTLETADNVHCILVCEPNRLVSVGESGKLEVWVMNSKWSEKTEHFIIPTDDGSASPGIVE 990

Query: 768  LKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIPSLSVVQFLPISLIDWKSKGLVSN 589
            LKK+PKS+HLV+GHN +G+F LWDI+    +++FS     + +F PISL  W++KG   +
Sbjct: 991  LKKVPKSTHLVVGHNSYGEFSLWDIAKCNCVARFSAIKSPINEFFPISLFQWQTKGSGFS 1050

Query: 588  YSSAGEHINKL------WLS---EHCMKLSLKD-MAIWLLVSTGPN---CKALEKYESDC 448
            Y+S  E  +KL      W S   E      L++ +A+WL VST  +   C       S  
Sbjct: 1051 YASMEEQADKLLKATNSWYSQQRETSWPSPLEENVAMWLFVSTYSDQDCCHNPTSTSSSF 1110

Query: 447  QLNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITTSDGLVYMWELYTGVELGT 268
             ++ +  WRLAL++KN +  GSPL+ R   IG S G+GII T++G+VYMWEL  G +L T
Sbjct: 1111 DIHTARSWRLALMMKNSINFGSPLNLRTCGIGVSSGYGIIGTTEGVVYMWELSKGSKLYT 1170

Query: 267  LHYFKGGGVSCIVTDEESKSGVLAVAGDGGKLLIYLH 157
            LH F+ G V+C+ TD  +  G L VAG GG+LL+YLH
Sbjct: 1171 LHQFQDGNVACVATD--NSRGALGVAG-GGQLLLYLH 1204


>ref|XP_004239457.1| PREDICTED: uncharacterized protein LOC101261411 [Solanum
            lycopersicum]
          Length = 1523

 Score =  402 bits (1033), Expect = e-109
 Identities = 237/587 (40%), Positives = 332/587 (56%), Gaps = 24/587 (4%)
 Frame = -2

Query: 1848 NTKIRESIICGNSGVGCVPEE-----------ILQMSSSENIPNKKVAVDAEARFPGQ-L 1705
            +  + ESIIC +     VPE             LQ SSS+    ++     E    G+ L
Sbjct: 943  HVSLSESIICRDFRDDSVPESNADIKAMHTSHFLQGSSSKQCQIEQSISTDEPHIEGRSL 1002

Query: 1704 HGLCTENTTPNPKAVFYNDPVISHNKSFSCASENKDTADFLSPPVSRVEKSNNGVGHELA 1525
            +    E +T    A FY     S ++      ++  T+ FL    S    S   +   L+
Sbjct: 1003 NFYTKERSTSTNGAPFYLASR-SQDEEMDQMLDHIQTSKFLD---STATNSEGNLTKMLS 1058

Query: 1524 KQQNLVKFKSS--DPQFQKLGTNLFDKATSDANEVHTDSDLKPHMNKVLNNDLKDFVKLV 1351
            + Q  V+F     D Q QK+   +F   T++  E + +++++   +    ++    +K++
Sbjct: 1059 RDQQSVRFTGHLLDKQNQKI---IFSADTTEKKENNENANMEAQQDLKSESERSGVLKVI 1115

Query: 1350 GCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLSIEGPGAGCPSFAGHASL 1171
              Y HPMP+ SV L  +++++ ICVLCG  +  +R +F+YK  +EG   GCPSF G  S+
Sbjct: 1116 AGYAHPMPISSVLLRRQENDLYICVLCGQPLHEDRTIFMYKAPLEGEEKGCPSFIGQVSI 1175

Query: 1170 TFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQKIHCLCSACTSDCFEDN 991
             F  S  AF   + LD + +Q TP GQ LVL NSV AP CRE  I C CS C  + FE+N
Sbjct: 1176 RFQFSDGAFRGDIELDSAAVQLTPFGQSLVLFNSVIAPSCREGDIKCQCSLCALNIFEEN 1235

Query: 990  AVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGRLRLWVMNSTWSAPTEEF 811
            AVKI+Q++ GY++++ KLKT   V C+LVC P+HL+AVE+ G+L +WVMN+ WSA TE+ 
Sbjct: 1236 AVKIMQIRNGYLSLITKLKTTLRVCCILVCPPDHLVAVEESGKLYVWVMNTNWSAETEKR 1295

Query: 810  VIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRILISKFSIPSLSVVQFLP 631
             +   DC     M+LK+IP S+ LV+G+NGFG+F LWDI   +L+S FS  S SV Q LP
Sbjct: 1296 CLLPPDCPPFSTMKLKRIPNSASLVLGYNGFGEFRLWDIKKCMLVSNFSAASTSVFQCLP 1355

Query: 630  ISLIDWKSK-----GLVSNYSSAGEHINKLWLSEHC-----MKLSLKDMAIWLLVSTGPN 481
            +SL  W+ K     G+     +    + K+   E C       L  KD+AIW+L+ST P+
Sbjct: 1356 VSLFSWQRKFTAPAGVTEEIINEITDVTKMSFLEKCDNRPFCLLEDKDVAIWVLISTAPD 1415

Query: 480  CKALEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVAIGASDGHGIITTSDGLVYM 301
              +     SD Q +    WRLALLV N +I+G+ LDPRA AIG S GHGII  SDGLVY 
Sbjct: 1416 SNSSAYQSSDQQTDPDHWWRLALLVNNTMIMGNSLDPRATAIGYSAGHGIIGRSDGLVYT 1475

Query: 300  WELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGGKLLIYL 160
            WEL TG  L TLH+FK   VS IV+D  S   V A+A DGG+LL+YL
Sbjct: 1476 WELTTGKRLQTLHHFKDAAVSSIVSDNSSHRAV-AIASDGGQLLVYL 1521


>ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297339249|gb|EFH69666.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1194

 Score =  401 bits (1031), Expect = e-109
 Identities = 224/487 (45%), Positives = 296/487 (60%), Gaps = 14/487 (2%)
 Frame = -2

Query: 1575 PVSRVEKSNNGVGHELAKQQNLVKFKSSDPQFQK--LGTNLFDKATSDANEVHTDSDLKP 1402
            P S+ E     +G  L  Q        S+P   K     N  +K TS   E    S+L+ 
Sbjct: 724  PASKFEDCQANIGEALGIQV-------SEPPSTKSQCKENTSEKRTS-VQEFPASSNLEI 775

Query: 1401 HMNKVLNNDLKDFVKLVGCYVHPMPVLSVFLNTKDHEILICVLCGLVVDSNRALFIYKLS 1222
            + +  +NN++   V+L+GCY HPMPV SV L +  +EI ICVL     D  R LF+YK+S
Sbjct: 776  NRDVKINNEMGKTVELLGCYFHPMPVSSVLLKSAGNEIYICVLSFATEDRVRTLFMYKMS 835

Query: 1221 IEGPGAGCPSFAGHASLTFPISKDAFGRKVVLDRSGLQFTPDGQCLVLLNSVKAPYCREQ 1042
             + P  G PS  GH     PI  D  G    L+ S L FTPDG  L+L+ ++K PYCR++
Sbjct: 836  AKAPSKGFPSIIGHTPAILPIVDDKSGGNRTLEISNLHFTPDGLHLILIGNIKTPYCRKR 895

Query: 1041 KIHCLCSACTSDCFEDNAVKIVQVKLGYITVMAKLKTDNSVHCVLVCEPNHLIAVEDGGR 862
            +  C C  CTS CFE+NAV+IVQVK G+++++ KL+ D+SV CV+VC+PN+LIA    G 
Sbjct: 896  ETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNLIAAVKSGN 955

Query: 861  LRLWVMNSTWSAPTEEFVIPISDCMSSHIMELKKIPKSSHLVIGHNGFGDFGLWDISNRI 682
            L +W MNS WS  TEE VI  + C+SS IMELKKIPK  HLVIGHNG G+F +WDIS R 
Sbjct: 956  LIVWAMNSHWSGSTEESVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFTIWDISKRS 1015

Query: 681  LISKFSIPSLSVVQFLPISLIDWKSKGLVSNYSSAGEHIN------KLWLSEHCMKLSL- 523
            L+S+F  PS  + +F+P SL  W     V ++S+  +H++      KLW S+     +L 
Sbjct: 1016 LVSRFVSPSNLIFEFIPTSLFAWHP---VHSHSTIEDHVDMILAATKLWFSKGINNKTLV 1072

Query: 522  ----KDMAIWLLVSTGPNCKA-LEKYESDCQLNESGCWRLALLVKNRVILGSPLDPRAVA 358
                KD AIWLLVST     A  ++ ES  +     CWRLALLVKN++ILG+ LDPRA  
Sbjct: 1073 PAEVKDTAIWLLVSTDLESDAKCDRVESPAR-----CWRLALLVKNQLILGNQLDPRADV 1127

Query: 357  IGASDGHGIITTSDGLVYMWELYTGVELGTLHYFKGGGVSCIVTDEESKSGVLAVAGDGG 178
             G   GHG+  T DGLVYMW+L TG +LG+LH FKG  VSCI TD+   S  + +A + G
Sbjct: 1128 AGTISGHGVAGTLDGLVYMWDLSTGAKLGSLHDFKGQRVSCISTDD---SRNICIASEDG 1184

Query: 177  KLLIYLH 157
            +LL+Y H
Sbjct: 1185 QLLVYCH 1191

