BLASTX nr result

ID: Mentha29_contig00011160 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00011160
         (3786 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU32492.1| hypothetical protein MIMGU_mgv1a0001072mg, partia...   545   e-152
emb|CBI18961.3| unnamed protein product [Vitis vinifera]              238   2e-59
ref|XP_006357327.1| PREDICTED: uncharacterized protein LOC102595...   220   3e-54
ref|XP_006357328.1| PREDICTED: uncharacterized protein LOC102595...   218   2e-53
ref|XP_007221926.1| hypothetical protein PRUPE_ppa000052mg [Prun...   216   8e-53
gb|EXB28444.1| Zinc finger CCCH domain-containing protein 7 [Mor...   215   1e-52
ref|XP_007019228.1| Zinc finger C-x8-C-x5-C-x3-H type family pro...   215   1e-52
ref|XP_007019227.1| Zinc finger C-x8-C-x5-C-x3-H type family pro...   215   1e-52
ref|XP_007019226.1| Zinc finger C-x8-C-x5-C-x3-H type family pro...   215   1e-52
ref|XP_006434296.1| hypothetical protein CICLE_v10000009mg [Citr...   214   2e-52
ref|XP_004237575.1| PREDICTED: uncharacterized protein LOC101244...   213   4e-52
ref|XP_006472862.1| PREDICTED: uncharacterized protein At1g21580...   211   3e-51
ref|XP_002520303.1| protein with unknown function [Ricinus commu...   207   2e-50
ref|XP_004292729.1| PREDICTED: uncharacterized protein LOC101310...   198   2e-47
ref|XP_006593806.1| PREDICTED: uncharacterized protein LOC100788...   186   5e-44
ref|XP_007161425.1| hypothetical protein PHAVU_001G067600g [Phas...   184   2e-43
ref|XP_007161424.1| hypothetical protein PHAVU_001G067600g [Phas...   184   2e-43
gb|EPS73988.1| hypothetical protein M569_00768, partial [Genlise...   184   4e-43
ref|XP_006596227.1| PREDICTED: uncharacterized protein At1g21580...   178   2e-41
ref|XP_002302217.2| zinc finger family protein [Populus trichoca...   177   3e-41

>gb|EYU32492.1| hypothetical protein MIMGU_mgv1a0001072mg, partial [Mimulus guttatus]
          Length = 1562

 Score =  545 bits (1405), Expect = e-152
 Identities = 397/1024 (38%), Positives = 526/1024 (51%), Gaps = 42/1024 (4%)
 Frame = +1

Query: 841  ADENLSSNKSASYGDRLGFNQHVNIASNYIHDKTSVYDAEEGEILPTDQIIELDSDRDHR 1020
            A E L  N+S  +GD L FNQH    + ++ +  SV   E+  I   DQ   L +D    
Sbjct: 371  AHETLLVNESTGHGDSLYFNQHEKNVNKFVENGASVLRTEKRAIWSIDQYTRLGTDE--- 427

Query: 1021 VHDYSEHPDVSVPSEDIVSIKNHDPAGSRETRISEVHE-DNFFSENHKLVPSPKSACEVM 1197
                                            +SE HE D+    + +L+  PK      
Sbjct: 428  -------------------------------YLSEFHESDSSLESDSRLLQRPKC----- 451

Query: 1198 KMTEDAVVSDVGFKDANSGGFLPNQCNSLQIILAAKDSKSLEG----------------- 1326
                  + SDVG   ANS    PNQ N+ QI + A + +S +G                 
Sbjct: 452  -----VISSDVGSAYANSEELHPNQVNTPQIPVHAVEVRSSDGVVRDCSNVNIGITTRSD 506

Query: 1327 ----------QAVKVSSPDNMGTLLSHNIDSVEGGEICFYNDNLFPKGITDHETCLSVDN 1476
                        V+VS PD +G   S ++DSVEG           P  I++ ETC  +D 
Sbjct: 507  VPSADCKRISAQVEVSFPDALGEKSSTDVDSVEGS----------PTAISNVETCSHMDY 556

Query: 1477 SSKVIRKRKHAGDQLGLLSGTKATVNVRTVRLRSI---VTRCMAEDIVPAMDGNLVGEKD 1647
            SSK+IRKRK      G+ S    + NV     RSI   V   ++ + +P ++ +L+ E D
Sbjct: 557  SSKIIRKRKARSAPEGVYS---LSTNVLVGTGRSIGGEVASLLSNNHIPDVEVDLLVEND 613

Query: 1648 SCKEDTNLQQGSSGVKDSLLEVHLSANGSFPGNQKKQTLCSPRXXXXXXXEDDTTAGGMS 1827
            +C ED    +  S V+D+ LEV   ANG     +KK+    PR       +DD  AGG++
Sbjct: 614  TCNEDDFFNKELSEVEDTTLEVDSGANGLCFNYRKKKKGSCPRSNLISPLKDDPVAGGVT 673

Query: 1828 NP---LILEQHFFRPSEWEAEQSGNTPPASITISQCETTGVEGEVGINDSSVVSLDENML 1998
            +    L+L     + +EW AE+  ++P  S + ++C  + +E +V   +  V  LD+N+ 
Sbjct: 674  SDCSGLVLRS--IKLAEWGAERREDSPSGSTSTTECAISDMEDKVVCENFYVADLDKNLS 731

Query: 1999 DVDRSLGKDDLALNASGL-LCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSS 2175
            DV++     D A+ A+ L  C +R G + S+S ++ LAS FDM SCM+SPEEL   SD S
Sbjct: 732  DVNKLYTAGDQAVIANSLSSCGDRTGVAASNSHEDLLASGFDMGSCMSSPEELLSYSDLS 791

Query: 2176 FLENTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQSNIGIPQSLL 2355
            F  N    AC   NE   +                        AA   SQ+N  +  SLL
Sbjct: 792  FSRNV---ACQTKNEEFMKK-----------------------AAADNSQTNGKLSPSLL 825

Query: 2356 KDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRT 2535
            + +S++VKK   +H KLT SKNQ T  ++K  P                           
Sbjct: 826  EGSSKMVKKSNFVHGKLTMSKNQPT--VSKASP--------------------------- 856

Query: 2536 VNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVPGSS-S 2712
                  V EPK +PS +P S+ T  AR++ SSYIRKGNSLVR  S +     G  GS  S
Sbjct: 857  ------VTEPKSQPSLLPPSHETKLARNMQSSYIRKGNSLVRNPSSTGATPTGYHGSGCS 910

Query: 2713 VYRLSHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHL 2892
            VYRL+ CT+  KN  A+D    D +A T+ R  ++ TS   K   LNH+        C+ 
Sbjct: 911  VYRLTTCTDNLKNSQASDSEIDDVNASTLLRIKEVHTSAFPKEPPLNHT--------CNS 962

Query: 2893 EEPLSLSNPHINDCPPRT-LDVTKERIRSSVVSECQTDSVINSDSQSTARDGNSEKKIIY 3069
             + LS     + D P  + LD   E I+SS V EC+TD V N D QS    GN EKKI+Y
Sbjct: 963  GDSLS-----VGDTPRNSGLD---ETIKSSAVPECRTDPVSNPDGQSKLA-GNLEKKILY 1013

Query: 3070 VKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHATVS 3249
            VKRRSNQL+AA +S D S+ G D T++ LSDGYYKS+ NQL+RASS+NHV K +A+  + 
Sbjct: 1014 VKRRSNQLIAASSSIDTSIPGADKTQASLSDGYYKSKKNQLVRASSENHVKKEDANVNLL 1073

Query: 3250 GLVPQSVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSK 3429
             L P + +P+TS R  SGFAKSCR+SKFSSVWKL D QSSEK+KNS+ PRKVWPH F  K
Sbjct: 1074 RLAPHTNLPRTSKRPVSGFAKSCRHSKFSSVWKLHDKQSSEKHKNSVVPRKVWPHLFPWK 1133

Query: 3430 RAAYWRSLM--LGIKP---SLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSL 3594
            RA Y R+ M  LG KP   SLS  SQKLL+SRKRGAIYTRS+HGYSL+MSKVLSVG SSL
Sbjct: 1134 RATYLRNFMHALGAKPNSSSLSTTSQKLLLSRKRGAIYTRSTHGYSLRMSKVLSVGASSL 1193

Query: 3595 KWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGER 3774
            KWSKSIE++S+                     G V IA++SRNHVSR           ER
Sbjct: 1194 KWSKSIERNSKMANEEATRAVAAAEKKKKEETGAVPIATRSRNHVSR-----------ER 1242

Query: 3775 IFRI 3786
            IFRI
Sbjct: 1243 IFRI 1246



 Score =  137 bits (346), Expect = 3e-29
 Identities = 90/216 (41%), Positives = 131/216 (60%), Gaps = 16/216 (7%)
 Frame = +1

Query: 148 NRQPEFAEDDLSLGRFSGRLG--VSKGEFIR---KQRLQKKNTLLKTPLGKVRSKHNGGS 312
           N++PEF +D++ LGRF+GRLG  V   EF+R   K++LQKKN + + P+GK  SKH+G  
Sbjct: 1   NKEPEFMQDNMRLGRFAGRLGRRVHNEEFVRSNKKRKLQKKNAIHRIPVGKDCSKHSGPP 60

Query: 313 KARHFTKDSNGGSFKVKEK-GYGKMQTRMEYEREREQSPMELAISFKSNALVAKAIQVPS 489
           K +HF K+ + G    KEK G+ ++QTR+  ++EREQSPMELAISFKSNALVAKAI  PS
Sbjct: 61  KNQHFKKNLSSGISGCKEKEGFQQLQTRIADKKEREQSPMELAISFKSNALVAKAILAPS 120

Query: 490 NPSTELCEKSNSVKERLGCHVSALPAVKS---VVVS-------DIQSDSWGTSKEHMDKX 639
            P+      S++V  +   ++S  P+ KS   VV +       D++S+S  TS E +D+ 
Sbjct: 121 GPAVRSVIDSSNVNNKTVYNMSDSPSAKSSNGVVKTHCLTHGLDLRSESHRTSIEVLDEA 180

Query: 640 XXXXXXXXXXNCAGDLVEMALENACLKVKTLKGSTM 747
                     + A    E A++N   +   L+ ST+
Sbjct: 181 SVSGSGFVGVDGAISFGENAIKNEPPRCMNLQSSTV 216


>emb|CBI18961.3| unnamed protein product [Vitis vinifera]
          Length = 2149

 Score =  238 bits (606), Expect = 2e-59
 Identities = 222/634 (35%), Positives = 299/634 (47%), Gaps = 42/634 (6%)
 Frame = +1

Query: 2011 SLGKDDLALNASGLL--CSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLE 2184
            +L KDD     S  L   ++ NG S ++S DE + S  D  S M SPE L +      L+
Sbjct: 1252 TLMKDDKQPTVSNYLSIAADGNGVSPTNSNDELMQSLPDTLSNMASPETLPLIPGLHTLD 1311

Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTI-SHGKCSGAAVS--KSQSNIGIPQSL- 2352
             T+ S   + ++  C  D   +EK +    + + +H  CS ++ S  K    IG   S+ 
Sbjct: 1312 -TELSVEQISDQKGCGDDRKSDEKPMVDCGSVLFAHNSCSQSSESNFKLDDAIGSDNSIN 1370

Query: 2353 -------LKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSS-HA 2508
                    +DT +    +  I  +L  SKN + + + +V P      L+NS+K  SS H 
Sbjct: 1371 GKTVQPSSQDTKRTTHSVNLISGELNGSKNHLNNLVPRVFPAPSSFFLANSKKTASSTHI 1430

Query: 2509 TKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSP---SD 2679
             K RTW RT  S++++ +P     P PQ       +   +SYIRKGNSLVRK +P     
Sbjct: 1431 AKPRTWYRTGASSSSLKKPLSIAFP-PQRQLKKIGKVQGTSYIRKGNSLVRKPAPVAVIP 1489

Query: 2680 DASHGVPGSSSVYRL--SHCTNTRK---NDLATDYM-------TGDADAPTVKRRGQIST 2823
              SHG+  SSSVYRL  S     RK   ++  TD +       TG  DAP          
Sbjct: 1490 QGSHGL--SSSVYRLNPSGVDEMRKRTGSESRTDVIDPSNRSSTGATDAP---------- 1537

Query: 2824 SVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTD 3003
            S   +   L +S    KCT              I+  P  + D  K    SS  +E QT 
Sbjct: 1538 SERPQTPPLPYSTKLPKCTT-------------ISSVPMSSEDGAK----SSGSTENQTG 1580

Query: 3004 SVINSDSQSTARDGNSE----KKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYY 3171
             + N +SQS   DGNSE    K++ YVKR+SNQLVAA N  DMS+   D T +  SD   
Sbjct: 1581 LINNLESQSVLNDGNSESSKLKRVTYVKRKSNQLVAASNPHDMSVQNADKTPALSSDDDG 1640

Query: 3172 KSRGNQLLRASSKNHVAKGNAHATVSGLVPQSVVPKTSTRRQSG--FAKSCRYSKFSSVW 3345
             +   Q                       P+ V  K+S++R S    +K+   SKFS VW
Sbjct: 1641 SNSEGQ---------------------RPPKLVSSKSSSKRPSDKVLSKTREPSKFSLVW 1679

Query: 3346 KLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----LGIKPSLSNISQKLLVSR 3510
             L+ +QSSEK  NS+  + V P  F  KRA YWRS M     +    SLS IS+KLL+ R
Sbjct: 1680 TLRGAQSSEKDGNSVHSQGVLPSLFPWKRATYWRSFMHNPASIPNSTSLSMISRKLLLLR 1739

Query: 3511 KRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXX 3690
            KR  +YTRS+ G+SL+ SKVL VGGSSLKWSKSIE+ S+                     
Sbjct: 1740 KRDTVYTRSTGGFSLRKSKVLGVGGSSLKWSKSIERQSKKANEEATLAVAAVERKKREQN 1799

Query: 3691 GCVSIAS--KSRNHVSRKWVLSVKLRPGERIFRI 3786
            G  S+ S  +SRNH SR           ERIFR+
Sbjct: 1800 GAASVISETESRNHSSR-----------ERIFRV 1822


>ref|XP_006357327.1| PREDICTED: uncharacterized protein LOC102595922 isoform X1 [Solanum
            tuberosum]
          Length = 1952

 Score =  220 bits (561), Expect = 3e-54
 Identities = 208/616 (33%), Positives = 288/616 (46%), Gaps = 25/616 (4%)
 Frame = +1

Query: 2014 LGKDDLALNASGL--LCSERNGPSGSDSGDESLASSF-DMRSCMTSPEELHVNSDSSFLE 2184
            L +DD+ L A  L    ++ +  S     D S   SF D+ +   S E +  +S SS + 
Sbjct: 1044 LDRDDMPLLADNLSLFANKVSVKSMESVPDMSPLVSFPDLTNSSVSEEPIDKSSMSSEIV 1103

Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQS-NIGIPQSLLKD 2361
              KA    +D   I   DNI + +   +D      G+ S   V      N+       ++
Sbjct: 1104 IEKALR--VDENSITAYDNISSSEKTSSD--AFEFGRSSDHKVGGDPLVNVSTVALSSQN 1159

Query: 2362 TSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRTVN 2541
            T +  K +     K     NQ + A  +VL V+ P +    R +      K  TW RT N
Sbjct: 1160 TVKSSKNVSSQGWKPNLGANQQSPAGPRVLSVR-PSSFITPRNV--PVPKKPLTWHRTGN 1216

Query: 2542 STAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVPG-SSSVY 2718
            S+++V     + S +P  +  S   +   SYIRKGNSLVR  SP      G    SSS Y
Sbjct: 1217 SSSSVVGRGSQMSALPPQSHLSKDTAKVGSYIRKGNSLVRNPSPVGSVPKGYHAPSSSTY 1276

Query: 2719 RL--SHCTNTRKNDLATDYMTGDADA---------------PTVKRRGQISTSVMTKAVT 2847
            RL  S   + R+       +TG                   PT        T V T +  
Sbjct: 1277 RLNSSGVNDLRRKCENRAEITGSPSCRGTPEVNAPSERPKTPTQSESFSCITLVSTSSPV 1336

Query: 2848 LNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQ 3027
             +H GN    T     +P+ +++  +       L  ++    SS V ECQ     +S SQ
Sbjct: 1337 EDHPGNGSIATN---SDPMEVTDNIL------ALKPSEHPSTSSAVPECQIGLGGDSGSQ 1387

Query: 3028 STARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASS 3207
            +T  +G+S+K I+YVK+RSNQL+AA +    S           SDGYYK R NQL+RAS 
Sbjct: 1388 NTLDEGSSKKNIVYVKQRSNQLLAASDKTQTS-----------SDGYYKRRKNQLIRASG 1436

Query: 3208 KNHVAKGNAHATVSGLVPQSVVP-KTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKN 3384
             NH+ +         +  +++VP +  T+R +G AK+ + SKFS VWKL D+QSS KY  
Sbjct: 1437 NNHMKQRI-------VTTKTIVPFQRGTKRLNGLAKTSKLSKFSLVWKLGDTQSSRKYGG 1489

Query: 3385 SLGPRKVWPHFFSSKRAAYWRSLMLGIKPS--LSNISQKLLVSRKRGAIYTRSSHGYSLK 3558
            ++   K+WP+ F  KRA+Y RS  L   PS   S I +KLL+S+KR  IYTRS HG SL+
Sbjct: 1490 TVEYEKLWPYLFPWKRASYRRSF-LSSSPSDNSSIIRRKLLLSKKRETIYTRSIHGLSLR 1548

Query: 3559 MSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRK 3738
             SKVLSV GSSLKWSKSIE+ S+                     G  +  S S N+VSR 
Sbjct: 1549 RSKVLSVSGSSLKWSKSIEQRSKKATEEAALAVAAVDKRKRGQYG-FNADSMSGNNVSR- 1606

Query: 3739 WVLSVKLRPGERIFRI 3786
                      ERIFRI
Sbjct: 1607 ----------ERIFRI 1612



 Score = 67.0 bits (162), Expect = 6e-08
 Identities = 45/109 (41%), Positives = 67/109 (61%), Gaps = 4/109 (3%)
 Frame = +1

Query: 190 RFSGRLGVSKGEFIR---KQRLQKKNTLLKTPLGKVRSKHNGGSKARHFTKDSNGGSFKV 360
           RFS RL V K E  R   K+++QKK+ LL+   GK  ++      +R+   D + G+ + 
Sbjct: 251 RFSNRLRVDKEEIHRSPQKKQVQKKSALLRIQCGKANNR------SRNQDHDLSSGAVRG 304

Query: 361 KEKG-YGKMQTRMEYEREREQSPMELAISFKSNALVAKAIQVPSNPSTE 504
           K+K  + +++ R+E   ERE S MEL +SFKSNALVAKAI  PS+ + +
Sbjct: 305 KQKDVFERLERRVE---EREGSQMELDVSFKSNALVAKAIMTPSSSAID 350


>ref|XP_006357328.1| PREDICTED: uncharacterized protein LOC102595922 isoform X2 [Solanum
            tuberosum]
          Length = 1946

 Score =  218 bits (555), Expect = 2e-53
 Identities = 208/615 (33%), Positives = 282/615 (45%), Gaps = 24/615 (3%)
 Frame = +1

Query: 2014 LGKDDLALNASGL--LCSERNGPSGSDSGDESLASSF-DMRSCMTSPEELHVNSDSSFLE 2184
            L +DD+ L A  L    ++ +  S     D S   SF D+ +   S E +  +S SS + 
Sbjct: 1044 LDRDDMPLLADNLSLFANKVSVKSMESVPDMSPLVSFPDLTNSSVSEEPIDKSSMSSEIV 1103

Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQS-NIGIPQSLLKD 2361
              KA    +D   I   DNI + +   +D      G+ S   V      N+       ++
Sbjct: 1104 IEKALR--VDENSITAYDNISSSEKTSSD--AFEFGRSSDHKVGGDPLVNVSTVALSSQN 1159

Query: 2362 TSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRTVN 2541
            T +  K +     K     NQ + A  +VL V+ P +    R +      K  TW RT N
Sbjct: 1160 TVKSSKNVSSQGWKPNLGANQQSPAGPRVLSVR-PSSFITPRNV--PVPKKPLTWHRTGN 1216

Query: 2542 STAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVPG-SSSVY 2718
            S+++V     + S +P  +  S   +   SYIRKGNSLVR  SP      G    SSS Y
Sbjct: 1217 SSSSVVGRGSQMSALPPQSHLSKDTAKVGSYIRKGNSLVRNPSPVGSVPKGYHAPSSSTY 1276

Query: 2719 RL--SHCTNTRKNDLATDYMTGDADA---------------PTVKRRGQISTSVMTKAVT 2847
            RL  S   + R+       +TG                   PT        T V T +  
Sbjct: 1277 RLNSSGVNDLRRKCENRAEITGSPSCRGTPEVNAPSERPKTPTQSESFSCITLVSTSSPV 1336

Query: 2848 LNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQ 3027
             +H GN    T     +P+ +++  +       L  ++    SS V ECQ     +S SQ
Sbjct: 1337 EDHPGNGSIATN---SDPMEVTDNIL------ALKPSEHPSTSSAVPECQIGLGGDSGSQ 1387

Query: 3028 STARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASS 3207
            +T  +G+S+K I+YVK+RSNQL+AA +    S           SDGYYK R NQL+RAS 
Sbjct: 1388 NTLDEGSSKKNIVYVKQRSNQLLAASDKTQTS-----------SDGYYKRRKNQLIRASG 1436

Query: 3208 KNHVAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNS 3387
             NH+ +            + V  KT    Q G AK+ + SKFS VWKL D+QSS KY  +
Sbjct: 1437 NNHMKQ------------RIVTTKTIVPFQRGLAKTSKLSKFSLVWKLGDTQSSRKYGGT 1484

Query: 3388 LGPRKVWPHFFSSKRAAYWRSLMLGIKPS--LSNISQKLLVSRKRGAIYTRSSHGYSLKM 3561
            +   K+WP+ F  KRA+Y RS  L   PS   S I +KLL+S+KR  IYTRS HG SL+ 
Sbjct: 1485 VEYEKLWPYLFPWKRASYRRSF-LSSSPSDNSSIIRRKLLLSKKRETIYTRSIHGLSLRR 1543

Query: 3562 SKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKW 3741
            SKVLSV GSSLKWSKSIE+ S+                     G  +  S S N+VSR  
Sbjct: 1544 SKVLSVSGSSLKWSKSIEQRSKKATEEAALAVAAVDKRKRGQYG-FNADSMSGNNVSR-- 1600

Query: 3742 VLSVKLRPGERIFRI 3786
                     ERIFRI
Sbjct: 1601 ---------ERIFRI 1606



 Score = 67.0 bits (162), Expect = 6e-08
 Identities = 45/109 (41%), Positives = 67/109 (61%), Gaps = 4/109 (3%)
 Frame = +1

Query: 190 RFSGRLGVSKGEFIR---KQRLQKKNTLLKTPLGKVRSKHNGGSKARHFTKDSNGGSFKV 360
           RFS RL V K E  R   K+++QKK+ LL+   GK  ++      +R+   D + G+ + 
Sbjct: 251 RFSNRLRVDKEEIHRSPQKKQVQKKSALLRIQCGKANNR------SRNQDHDLSSGAVRG 304

Query: 361 KEKG-YGKMQTRMEYEREREQSPMELAISFKSNALVAKAIQVPSNPSTE 504
           K+K  + +++ R+E   ERE S MEL +SFKSNALVAKAI  PS+ + +
Sbjct: 305 KQKDVFERLERRVE---EREGSQMELDVSFKSNALVAKAIMTPSSSAID 350


>ref|XP_007221926.1| hypothetical protein PRUPE_ppa000052mg [Prunus persica]
            gi|462418862|gb|EMJ23125.1| hypothetical protein
            PRUPE_ppa000052mg [Prunus persica]
          Length = 2092

 Score =  216 bits (549), Expect = 8e-53
 Identities = 206/659 (31%), Positives = 296/659 (44%), Gaps = 62/659 (9%)
 Frame = +1

Query: 1996 LDVDRSLGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSS 2175
            ++ D    KD L   ++ LL    +    + + +E + S  D  S   SPE    +    
Sbjct: 1133 MESDHVSVKDSLPFASNRLLLCANDNEVSTTNSNEGVESVPDTLSDTGSPET-STDVPGV 1191

Query: 2176 FLENTKASACLLDNEMICQSDNIFNEKHVF---------------ADPNTISHGKCSGAA 2310
             +     S   + +   C  D     K V                   N  SH    G  
Sbjct: 1192 QMRTCSPSVIKISDGKDCGDDQKLGLKSVVEVGCSASARNSLSECTKSNLTSHPVTEGGQ 1251

Query: 2311 VSKSQSNIGIPQSLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRK 2490
             S     + +P   +K T+  +  L    S++   KNQ+  A  +++P       S S+K
Sbjct: 1252 -SVMGKTVALPLQDIKKTAHGLN-LVTAESRV---KNQLGQATRRIVPGHSYSVFSTSKK 1306

Query: 2491 LYSS-HATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVP--------SSYIRK 2643
              SS H  K RTW R  N++A+       P+ +P S+     R++P        +SY+RK
Sbjct: 1307 TGSSTHMAKPRTWHRNGNASASSL-----PASMPFSSTVPPQRNLPQKDGKLQSNSYVRK 1361

Query: 2644 GNSLVRKDSPS---DDASHGVPGSSSVYRLSHC-TNTRKNDLATDYMTGDADAPTVKRRG 2811
            GNSLVRK  P      +SHG   SS+VYRL+    +  K +  ++      + P++ R G
Sbjct: 1362 GNSLVRKPVPVAALPQSSHGF--SSAVYRLNSLGIDGLKKNAGSESRVDVKNPPSLMRTG 1419

Query: 2812 QISTSVMTKAVTLNHSGNSLKC--------TPCHLEEPLSLSNPHINDCPPRTLDVTKER 2967
            +++         L +      C        T   L EPL LS  +++D P   L+    +
Sbjct: 1420 EMNAPFDRPRPPLPNGAKLSTCDAISLGVCTSSQLAEPL-LSGENMSD-PMNCLETKDAK 1477

Query: 2968 I---RSSVVSECQTD--SVINS-DSQSTARDGNSE----KKIIYVKRRSNQLVAACNSGD 3117
            I    S V SE Q +     NS ++Q+   DGNS     K I+YVK + NQLVA+ +  D
Sbjct: 1478 IVVNDSLVTSETQENHSGPFNSLENQTELHDGNSAPSNTKNIVYVKHKLNQLVASSSPCD 1537

Query: 3118 MSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHAT---------VSGLVPQSV 3270
            + +   D  +    DGYYK R NQL+R SS+ H  +    +          VS +VP  +
Sbjct: 1538 LPVHNTDKIQHSSFDGYYKRRKNQLIRTSSEGHAKQAVITSNDNLNSQVQKVSKIVPSRI 1597

Query: 3271 VPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRS 3450
              K   R Q   AK+ +  K S VW  + +QSS    +S   +KV PH F  KRA +WR+
Sbjct: 1598 YGKK--RSQKVIAKTSKTGKHSLVWTPRGTQSSNNDGDSFDHQKVLPHLFPWKRARHWRT 1655

Query: 3451 LMLGIKP-----SLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIE 3615
             M          S S IS+KLL+SR+R  +YTRS+HG+SL+M KVLSVGGSSLKWSKSIE
Sbjct: 1656 SMQSQASNFKYSSASTISKKLLLSRRRDTVYTRSTHGFSLRMYKVLSVGGSSLKWSKSIE 1715

Query: 3616 KSSRXXXXXXXXXXXXXXXXXXXXXG--CVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
              S+                     G  CVS  SK RN++S           G+RIFRI
Sbjct: 1716 NRSKKANEEATRAVAAVEKKKREHSGAACVSSGSKFRNNIS-----------GKRIFRI 1763


>gb|EXB28444.1| Zinc finger CCCH domain-containing protein 7 [Morus notabilis]
          Length = 2046

 Score =  215 bits (547), Expect = 1e-52
 Identities = 202/642 (31%), Positives = 286/642 (44%), Gaps = 48/642 (7%)
 Frame = +1

Query: 2005 DRSLGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLE 2184
            D  L KDD   N    L S  N  SG+ S DE++    D  +  +SP+      D +  +
Sbjct: 1164 DNLLVKDDFP-NLPNYLSSP-NDCSGATSTDEAMDFVPDSPTMTSSPQTSLDVPDVNMSD 1221

Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKC----------SGAAVSKSQSNI 2334
             T  S     +  IC+ D    +K +    + +S  K           S +A    Q+  
Sbjct: 1222 VTSVSQI---SNQICREDEKLVQKSLDDKGSEVSAQKSFSQCTKSNLTSDSATECDQAIG 1278

Query: 2335 GIPQSL-LKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHAT 2511
            G    L L+D     + +     +    KNQ+  A+++  P +    L+  +K  +S   
Sbjct: 1279 GKTAPLSLQDCRSTSRGVNIESVESNEQKNQLDQAVSRTFPGRSSFRLTTFKKRANSTHA 1338

Query: 2512 KSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVP--------SSYIRKGNSLVRKD 2667
              RTW R VNS+A        P     S    + + +P        +SY+RKGNSLVRK 
Sbjct: 1339 NPRTWHRNVNSSACAL-----PGSKTFSKNVPSQKQLPERDEKVQSTSYVRKGNSLVRKP 1393

Query: 2668 SPSDDASHGVPGSSSVYRLSHC-TNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAV 2844
            SP+   S G P  S VYRL+   ++  K  + +D      +   + R G+   S      
Sbjct: 1394 SPTAALSQGPPSFSPVYRLNSAGSDELKRSIESDNRVSLGNTHDLSRVGETKASCNNPGP 1453

Query: 2845 TLNHSGNSLK---------CTPCHLEEPLSLSNPHINDCPPRTLD--VTKERIRSSVVSE 2991
                SG+ L          CT       LS      N  P  + +   T   +  S+ SE
Sbjct: 1454 LPIQSGSKLPNSVAISPGDCTASPSAGLLSNDRCETNSDPISSTENNETPNLVEDSLTSE 1513

Query: 2992 C------QTDSVINSDSQSTARDGNSE-KKIIYVKRRSNQLVAACNSGDMSMLGVDNTRS 3150
                   Q +S+ N    S A   +S  K+I+YVKR+SNQLVA  NS        D  ++
Sbjct: 1514 AFENQNGQLNSLDNQTELSNANLASSNMKQIVYVKRKSNQLVATSNS-----TSADKIQT 1568

Query: 3151 QLSDGYYKSRGNQLLRASSKNHVAKG---NAHATVSGLVPQSVVPKTSTRR-QSGFAKSC 3318
              SDGYYK + NQL+R S ++H  +    + +  +   +   V+P  S RR      K+ 
Sbjct: 1569 SSSDGYYKRKKNQLIRTSLESHTKQPVMPDDNFNLGVQMTLGVIPNRSKRRGHKVVPKTF 1628

Query: 3319 RYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLG----IKPSLSNI 3486
            + S  S VW L  ++S++    SL  +KV+PH F  KR  YWRS ML      K S   I
Sbjct: 1629 KRSTNSLVWTLCSTESTKVNSGSLYHQKVFPHLFPWKRTTYWRSFMLNSNLIYKSSSLAI 1688

Query: 3487 SQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR--XXXXXXXXXXX 3660
            S+KLL+SRKR  +YTRS +G+SL+ SKVLSVGG+SLKWSKS+E  S+             
Sbjct: 1689 SKKLLLSRKRDTLYTRSLNGFSLRKSKVLSVGGASLKWSKSLENRSKKVNEEATLAVVAV 1748

Query: 3661 XXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
                       C+S  SKSRNH SR           ERIFRI
Sbjct: 1749 DKKKREQKEATCISSGSKSRNHSSR-----------ERIFRI 1779


>ref|XP_007019228.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 3
            [Theobroma cacao] gi|508724556|gb|EOY16453.1| Zinc finger
            C-x8-C-x5-C-x3-H type family protein, putative isoform 3
            [Theobroma cacao]
          Length = 1935

 Score =  215 bits (547), Expect = 1e-52
 Identities = 220/660 (33%), Positives = 302/660 (45%), Gaps = 71/660 (10%)
 Frame = +1

Query: 2020 KDDLALNASGLLCS-ERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTKA 2196
            KDDL      L+   + N  S ++S DE + +  D+ S + SP     N D+  +    A
Sbjct: 1162 KDDLPSALISLVFGVDANEVSATNSNDEVMPAP-DIVSDVGSP----YNHDNFVIS---A 1213

Query: 2197 SACLLDNEMICQSDNIFNEKHVFAD-------PNTISHGKCSGAAVSKSQSNIGIPQS-- 2349
            S C      +CQ     +EK  F D       P     G  S A VS SQ +  I +S  
Sbjct: 1214 STC---KAPLCQQ----SEKQAFGDEKFSDDKPMAEGAGNVS-ALVSYSQHSRTILKSND 1265

Query: 2350 LLKDTSQVVKK--LYPIH-SKLTWSKNQVTSA-------IAKVLPVQHPQNLS-----NS 2484
             ++    V  K  L P H SK T S N ++ A       ++ V+P  +P   S     + 
Sbjct: 1266 AIQTNQSVAGKEVLLPSHDSKNTNSPNSISGATRRRKNPLSHVVPKSYPTRSSFVFSASK 1325

Query: 2485 RKLYSSHATKSRTWCRTVNSTAAVA---EPKIEPSPIPQSNGTSAARSVPSSYIRKGNSL 2655
                S++ TK RTW RT NS+A+     +P    +P+ +     AA     SYIRKGNSL
Sbjct: 1326 NTTPSTNITKPRTWHRTNNSSASPLSGNKPSSSANPLQRQMPKKAAFFQSPSYIRKGNSL 1385

Query: 2656 VRKD---SPSDDASHGVPGSSSVYRLS--------HCTNTRKNDLATDYMTGDADA---- 2790
            VRK          SH +  SSSVYR++          T       A D  TG A+A    
Sbjct: 1386 VRKPVAVPALPQGSHSL--SSSVYRMNPGVVDEVKKGTGPNSRVGAVDLRTGGANASFER 1443

Query: 2791 PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERI 2970
            PT      +S        T N  G   +CT   L EP       I+DC    ++      
Sbjct: 1444 PTTPPLSSVSK---VPNCTSNSPG---ECTSSPLAEP------SISDCCETAINHASSME 1491

Query: 2971 RSSVVSEC-----------QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAAC 3105
             + V++             Q  SV N +  +   + N    + K++ YVK +SNQLVA  
Sbjct: 1492 INDVLNSPEDGLKTFETLNQNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATS 1551

Query: 3106 NSGDMSMLGVDNTR--SQLSDGYYKSRGNQLLRASSKNHVAKG-----NAHATVSGLVPQ 3264
              G  S+L  D  +  S  SDGYYK   NQL+R + ++H+ +      N   +V  +  +
Sbjct: 1552 ECGRTSILNADKNQNFSAPSDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAK 1611

Query: 3265 SVVPKTSTRRQSG--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAA 3438
             +  +T  +RQS     K+ + SKFS VW L  ++ S+   NSL   KV P  F  KR  
Sbjct: 1612 VMPSRTVGKRQSNKVVGKTHKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMT 1671

Query: 3439 YWRSLMLG----IKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSK 3606
            YWRS  L        SLS IS+K+L+SRKR  +YTRS +G+S++ SKV SVGGSSLKWSK
Sbjct: 1672 YWRSFKLNSVSSCNSSLSTISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSK 1731

Query: 3607 SIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
            SIE++SR                     G VS   K R++   K V   +LRPGERIFRI
Sbjct: 1732 SIERNSRKANEEATLAVAEAERKKREQKGTVSRTGK-RSYSCHKVVHGTELRPGERIFRI 1790


>ref|XP_007019227.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 2
            [Theobroma cacao] gi|508724555|gb|EOY16452.1| Zinc finger
            C-x8-C-x5-C-x3-H type family protein, putative isoform 2
            [Theobroma cacao]
          Length = 1962

 Score =  215 bits (547), Expect = 1e-52
 Identities = 220/660 (33%), Positives = 302/660 (45%), Gaps = 71/660 (10%)
 Frame = +1

Query: 2020 KDDLALNASGLLCS-ERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTKA 2196
            KDDL      L+   + N  S ++S DE + +  D+ S + SP     N D+  +    A
Sbjct: 1162 KDDLPSALISLVFGVDANEVSATNSNDEVMPAP-DIVSDVGSP----YNHDNFVIS---A 1213

Query: 2197 SACLLDNEMICQSDNIFNEKHVFAD-------PNTISHGKCSGAAVSKSQSNIGIPQS-- 2349
            S C      +CQ     +EK  F D       P     G  S A VS SQ +  I +S  
Sbjct: 1214 STC---KAPLCQQ----SEKQAFGDEKFSDDKPMAEGAGNVS-ALVSYSQHSRTILKSND 1265

Query: 2350 LLKDTSQVVKK--LYPIH-SKLTWSKNQVTSA-------IAKVLPVQHPQNLS-----NS 2484
             ++    V  K  L P H SK T S N ++ A       ++ V+P  +P   S     + 
Sbjct: 1266 AIQTNQSVAGKEVLLPSHDSKNTNSPNSISGATRRRKNPLSHVVPKSYPTRSSFVFSASK 1325

Query: 2485 RKLYSSHATKSRTWCRTVNSTAAVA---EPKIEPSPIPQSNGTSAARSVPSSYIRKGNSL 2655
                S++ TK RTW RT NS+A+     +P    +P+ +     AA     SYIRKGNSL
Sbjct: 1326 NTTPSTNITKPRTWHRTNNSSASPLSGNKPSSSANPLQRQMPKKAAFFQSPSYIRKGNSL 1385

Query: 2656 VRKD---SPSDDASHGVPGSSSVYRLS--------HCTNTRKNDLATDYMTGDADA---- 2790
            VRK          SH +  SSSVYR++          T       A D  TG A+A    
Sbjct: 1386 VRKPVAVPALPQGSHSL--SSSVYRMNPGVVDEVKKGTGPNSRVGAVDLRTGGANASFER 1443

Query: 2791 PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERI 2970
            PT      +S        T N  G   +CT   L EP       I+DC    ++      
Sbjct: 1444 PTTPPLSSVSK---VPNCTSNSPG---ECTSSPLAEP------SISDCCETAINHASSME 1491

Query: 2971 RSSVVSEC-----------QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAAC 3105
             + V++             Q  SV N +  +   + N    + K++ YVK +SNQLVA  
Sbjct: 1492 INDVLNSPEDGLKTFETLNQNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATS 1551

Query: 3106 NSGDMSMLGVDNTR--SQLSDGYYKSRGNQLLRASSKNHVAKG-----NAHATVSGLVPQ 3264
              G  S+L  D  +  S  SDGYYK   NQL+R + ++H+ +      N   +V  +  +
Sbjct: 1552 ECGRTSILNADKNQNFSAPSDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAK 1611

Query: 3265 SVVPKTSTRRQSG--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAA 3438
             +  +T  +RQS     K+ + SKFS VW L  ++ S+   NSL   KV P  F  KR  
Sbjct: 1612 VMPSRTVGKRQSNKVVGKTHKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMT 1671

Query: 3439 YWRSLMLG----IKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSK 3606
            YWRS  L        SLS IS+K+L+SRKR  +YTRS +G+S++ SKV SVGGSSLKWSK
Sbjct: 1672 YWRSFKLNSVSSCNSSLSTISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSK 1731

Query: 3607 SIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
            SIE++SR                     G VS   K R++   K V   +LRPGERIFRI
Sbjct: 1732 SIERNSRKANEEATLAVAEAERKKREQKGTVSRTGK-RSYSCHKVVHGTELRPGERIFRI 1790


>ref|XP_007019226.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1
            [Theobroma cacao] gi|508724554|gb|EOY16451.1| Zinc finger
            C-x8-C-x5-C-x3-H type family protein, putative isoform 1
            [Theobroma cacao]
          Length = 2110

 Score =  215 bits (547), Expect = 1e-52
 Identities = 220/660 (33%), Positives = 302/660 (45%), Gaps = 71/660 (10%)
 Frame = +1

Query: 2020 KDDLALNASGLLCS-ERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTKA 2196
            KDDL      L+   + N  S ++S DE + +  D+ S + SP     N D+  +    A
Sbjct: 1162 KDDLPSALISLVFGVDANEVSATNSNDEVMPAP-DIVSDVGSP----YNHDNFVIS---A 1213

Query: 2197 SACLLDNEMICQSDNIFNEKHVFAD-------PNTISHGKCSGAAVSKSQSNIGIPQS-- 2349
            S C      +CQ     +EK  F D       P     G  S A VS SQ +  I +S  
Sbjct: 1214 STC---KAPLCQQ----SEKQAFGDEKFSDDKPMAEGAGNVS-ALVSYSQHSRTILKSND 1265

Query: 2350 LLKDTSQVVKK--LYPIH-SKLTWSKNQVTSA-------IAKVLPVQHPQNLS-----NS 2484
             ++    V  K  L P H SK T S N ++ A       ++ V+P  +P   S     + 
Sbjct: 1266 AIQTNQSVAGKEVLLPSHDSKNTNSPNSISGATRRRKNPLSHVVPKSYPTRSSFVFSASK 1325

Query: 2485 RKLYSSHATKSRTWCRTVNSTAAVA---EPKIEPSPIPQSNGTSAARSVPSSYIRKGNSL 2655
                S++ TK RTW RT NS+A+     +P    +P+ +     AA     SYIRKGNSL
Sbjct: 1326 NTTPSTNITKPRTWHRTNNSSASPLSGNKPSSSANPLQRQMPKKAAFFQSPSYIRKGNSL 1385

Query: 2656 VRKD---SPSDDASHGVPGSSSVYRLS--------HCTNTRKNDLATDYMTGDADA---- 2790
            VRK          SH +  SSSVYR++          T       A D  TG A+A    
Sbjct: 1386 VRKPVAVPALPQGSHSL--SSSVYRMNPGVVDEVKKGTGPNSRVGAVDLRTGGANASFER 1443

Query: 2791 PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERI 2970
            PT      +S        T N  G   +CT   L EP       I+DC    ++      
Sbjct: 1444 PTTPPLSSVSK---VPNCTSNSPG---ECTSSPLAEP------SISDCCETAINHASSME 1491

Query: 2971 RSSVVSEC-----------QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAAC 3105
             + V++             Q  SV N +  +   + N    + K++ YVK +SNQLVA  
Sbjct: 1492 INDVLNSPEDGLKTFETLNQNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATS 1551

Query: 3106 NSGDMSMLGVDNTR--SQLSDGYYKSRGNQLLRASSKNHVAKG-----NAHATVSGLVPQ 3264
              G  S+L  D  +  S  SDGYYK   NQL+R + ++H+ +      N   +V  +  +
Sbjct: 1552 ECGRTSILNADKNQNFSAPSDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAK 1611

Query: 3265 SVVPKTSTRRQSG--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAA 3438
             +  +T  +RQS     K+ + SKFS VW L  ++ S+   NSL   KV P  F  KR  
Sbjct: 1612 VMPSRTVGKRQSNKVVGKTHKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMT 1671

Query: 3439 YWRSLMLG----IKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSK 3606
            YWRS  L        SLS IS+K+L+SRKR  +YTRS +G+S++ SKV SVGGSSLKWSK
Sbjct: 1672 YWRSFKLNSVSSCNSSLSTISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSK 1731

Query: 3607 SIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
            SIE++SR                     G VS   K R++   K V   +LRPGERIFRI
Sbjct: 1732 SIERNSRKANEEATLAVAEAERKKREQKGTVSRTGK-RSYSCHKVVHGTELRPGERIFRI 1790


>ref|XP_006434296.1| hypothetical protein CICLE_v10000009mg [Citrus clementina]
            gi|557536418|gb|ESR47536.1| hypothetical protein
            CICLE_v10000009mg [Citrus clementina]
          Length = 2165

 Score =  214 bits (545), Expect = 2e-52
 Identities = 195/591 (32%), Positives = 284/591 (48%), Gaps = 53/591 (8%)
 Frame = +1

Query: 2014 LGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTK 2193
            L + DL+  A   L ++ +G S ++S DE +   FD  S + SPE L      + L N +
Sbjct: 1213 LERGDLS-RAYRALVADGDGVSTTNSYDEMM--EFDSISELGSPEILSTVPVMNAL-NHE 1268

Query: 2194 ASACLLDNEMICQSDNIFNEKHV-------FADPNTISHGKCS-------GAAVSKSQSN 2331
            ASA  + NE +C+ + I +E+ V        A  +   H K +        +A   +Q  
Sbjct: 1269 ASASQISNEKVCRIEKIPSEEPVDEGFFNLSAHTSPSEHAKINLKLDDMLESAHLVAQRT 1328

Query: 2332 IGIPQSLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSH-- 2505
            + +P   +KDT      L P+  +    K+Q +  ++++ P +     + SR L SS   
Sbjct: 1329 VSLPAQDVKDTGLT---LNPMSGETNGKKHQASHCVSRIHPRRSSSVFTASRDLASSTRT 1385

Query: 2506 --ATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPS- 2676
               T+ RTW RT +S+A+ A       P         A+    SYIRKGNSLVRK +P  
Sbjct: 1386 TCTTRPRTWHRTESSSASPAPGNKSLLPPQNQLPKKVAKYQSMSYIRKGNSLVRKPAPVA 1445

Query: 2677 --DDASHGVPGSSSVYRLS-----HCTNTRKNDLATDYMTGDA-----DAPTVKRRGQIS 2820
                 SHG+  +SSVY L+         TR ++   D +   +     +AP  + R   +
Sbjct: 1446 AVSQISHGL--TSSVYWLNSSGIGESKKTRGSEGGADVVDPPSFLRGVNAPLERPR---T 1500

Query: 2821 TSVMTKAVTLNHSGNSL-KCTPCHLEEPLSLSNPHINDCPPRTLDVTKE------RIRSS 2979
              +   A   NH+ +S    T   + EPL            + +++  E       +  S
Sbjct: 1501 PPLPVVAKVPNHATSSTGDYTSSPVAEPLPNGCSETKSDTQKLMEINDELNFSNAALNIS 1560

Query: 2980 VVSECQTDSVINSDSQSTARDG----NSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR 3147
                 QT SV   +SQ    DG    ++ K+I Y+KR+SNQL+AA N   +S+   D T+
Sbjct: 1561 KTPVNQTGSVNGLESQGELNDGTLCTSNVKRITYLKRKSNQLIAASNGCSLSVQNPDKTQ 1620

Query: 3148 SQLSDGYYKSRGNQLLRASSKNH----VAKGNAHATVSGLVPQSVVPKTSTRRQSGFA-- 3309
            S  SDGYYK R NQL+R   ++H    V+  +   T  G      + + S   QS  A  
Sbjct: 1621 STASDGYYKRRKNQLIRTPLESHINQTVSLADGSFTSEGEKCAKDIFRRSDMSQSYKAVK 1680

Query: 3310 KSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----LGIKPS 3474
            K C+  +FS VW L   QSS+   + L   KV P  F  KR  YWR  +     +    S
Sbjct: 1681 KICKPIRFSLVWTLNSMQSSKSDDHFLYRGKVLPSLFPWKRTLYWRRFVQDPVSISNNSS 1740

Query: 3475 LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 3627
            LS IS+KLL+ RKR  +YTRS+HG+SL+  KVLSVGGSSLKWSKSIE  S+
Sbjct: 1741 LSAISRKLLLLRKRDTVYTRSNHGFSLRKYKVLSVGGSSLKWSKSIENRSK 1791


>ref|XP_004237575.1| PREDICTED: uncharacterized protein LOC101244480 [Solanum
            lycopersicum]
          Length = 1167

 Score =  213 bits (543), Expect = 4e-52
 Identities = 203/603 (33%), Positives = 280/603 (46%), Gaps = 31/603 (5%)
 Frame = +1

Query: 1912 TISQCETTGVEGEVGINDSSVVSLDENMLDVDRS-----LGKDDLALNASGL--LCSERN 2070
            ++   ET  +   V  +  S V +D+ +           L +DD+ L A  L    ++ +
Sbjct: 226  SVVSIETLKMADRVSDDQGSSVGIDQKLAPESHESCHYVLDRDDMPLLADNLSLFANKVS 285

Query: 2071 GPSGSDSGDESLASSF-DMRSCMTSPEELHVNSDSSFLENTKASACLLDNEMICQSDNIF 2247
              S     D S   SF D+ +C  S E +  +S SS +   KA    +D       DNI 
Sbjct: 286  VKSMESVPDMSPLLSFPDLTNCSVSEEPIDKSSVSSEIVIEKALR--VDENSRTAYDNIS 343

Query: 2248 NEKHVFADP---NTISHGKCSGAAVSKSQSNIGIPQSLLKDTSQVVKKLYPIHSKLTWSK 2418
            +     +D    +  S  K  G  V    +     Q+ +K +  V  + +    K     
Sbjct: 344  SSVKTSSDAFEFDRSSDHKVGGNPVVNINTVALSSQNTVKSSKNVSSQGW----KPNLGA 399

Query: 2419 NQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRTVNSTAAVAEPKIEPSPIPQSN 2598
            NQ   A ++VL V+ P +    R +      K  TW RT NS ++V     + + +P  +
Sbjct: 400  NQQIPAGSRVLSVR-PSSFITPRNV--PVPKKPLTWHRTGNSFSSVVGRGSQMNSLPPQS 456

Query: 2599 GTSAARSVPSSYIRKGNSLVRKDSPSDDASHGV-PGSSSVYRL--SHCTNTRKNDLATDY 2769
              S   +   SYIRKGNSLVR  SP      G    SSS YRL  S   + R+       
Sbjct: 457  HLSKDTAKVGSYIRKGNSLVRNPSPVGSLPKGYHASSSSTYRLNSSGVNDLRRKCENRAE 516

Query: 2770 MTGDADA---------------PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPL 2904
            +TG                   PT        T + T +  ++H GN    T     +P+
Sbjct: 517  ITGSPSCRGTPEVNAPSERPKTPTQSESFSCVTLMSTSSPVVDHPGNGDIATN---SDPM 573

Query: 2905 SLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQSTARDGNSEKKIIYVKRRS 3084
             +++ +I    P  L  T     SS V ECQ     +S SQ+T  +G+S K I+YVK+RS
Sbjct: 574  EVTD-NILALKPSELPST-----SSAVLECQIGLGGDSGSQNTLDEGSSRKVIVYVKQRS 627

Query: 3085 NQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHATVSGLVPQ 3264
            NQLVAA +    S           SDGYYK R NQL+RAS  N + +    AT   +VP 
Sbjct: 628  NQLVAASDKTQTS-----------SDGYYKRRKNQLIRASGNNQMKQ--RVATTKNIVPF 674

Query: 3265 SVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYW 3444
                      Q G AK+ + SKFS VWKL D+QSS KY  ++   K+WP  F  KRA+Y 
Sbjct: 675  ----------QRGLAKTSKLSKFSLVWKLGDTQSSRKYGGTVEYEKLWPFLFPWKRASYR 724

Query: 3445 RSLMLGIKPS--LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEK 3618
            R+  L   PS   S I +KLL+S+KR  IYTRS HG SL+ SKVLSV GSSLKWSKSIE+
Sbjct: 725  RNF-LSSSPSDNSSIIRRKLLLSKKRETIYTRSIHGLSLRRSKVLSVSGSSLKWSKSIEQ 783

Query: 3619 SSR 3627
             S+
Sbjct: 784  RSK 786


>ref|XP_006472862.1| PREDICTED: uncharacterized protein At1g21580-like [Citrus sinensis]
          Length = 2164

 Score =  211 bits (536), Expect = 3e-51
 Identities = 189/591 (31%), Positives = 279/591 (47%), Gaps = 53/591 (8%)
 Frame = +1

Query: 2014 LGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTK 2193
            L + DL+  A   L ++ +G S ++S DE +   FD  S + SPE L      + L N +
Sbjct: 1212 LERGDLS-RAYRALVADGDGVSTTNSYDEMM--EFDSISELGSPEILSTVPVMNAL-NHE 1267

Query: 2194 ASACLLDNEMICQSDNIFNEKHV-------FADPNTISHGKCS-------GAAVSKSQSN 2331
            ASA  + NE +C+ + I +E+ V        A  +   H K +        +A   +Q  
Sbjct: 1268 ASASQISNEKVCRIEKIPSEEPVDEGFFNLSAHTSPSEHAKINLKLDDMLESAHLVAQRT 1327

Query: 2332 IGIPQSLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSH-- 2505
            + +P   +KDT      L P+  +    K+Q +  ++++ P +     + SR L SS   
Sbjct: 1328 VSLPAQDVKDTGLT---LNPMSGETNGKKHQASHCVSRIHPRRSSSVFTASRDLASSTRT 1384

Query: 2506 --ATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPS- 2676
               T+ RTW RT +S+A+ A       P         A+    SYIRKGNSLVRK +P  
Sbjct: 1385 TCTTRPRTWHRTESSSASPAPGNKSLLPPQNQLPKKVAKFQSMSYIRKGNSLVRKPAPVA 1444

Query: 2677 --DDASHGVPGSSSVYRLS-----HCTNTRKNDLATDYMTGDA-----DAPTVKRRGQIS 2820
                 SHG+  +SSVY L+         TR ++   D +   +     +AP  + R   +
Sbjct: 1445 AVSQVSHGL--TSSVYWLNSSGIGESKKTRGSEGGADVVDPTSFLRGVNAPLERPR---T 1499

Query: 2821 TSVMTKAVTLNHSGNSL-KCTPCHLEEPLSLSNPHINDCPPRTLDVTKE------RIRSS 2979
              +   A   NH+ +S    T   + EPL            + +++  E       +  S
Sbjct: 1500 PPLPVVAKVPNHATSSTGDYTSSPVAEPLPNGCSETKSDTQKLMEINDELNFSNAALNIS 1559

Query: 2980 VVSECQTDSVINSDSQSTARDG----NSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR 3147
                 QT SV   +SQ    DG    ++ K+I Y+KR+SNQL+AA N   +S+   D T+
Sbjct: 1560 KTPVNQTGSVNGLESQGELNDGTLCTSNVKRITYLKRKSNQLIAASNGCSLSVQNPDKTQ 1619

Query: 3148 SQLSDGYYKSRGNQLLRASSKNHV------AKGNAHATVSGLVPQSVVPKTSTRRQSGFA 3309
            S  SDGYYK R NQL+R   ++ +      A G+  +               ++      
Sbjct: 1620 STASDGYYKRRKNQLIRTPLESQINQTVSLADGSFTSEGEKCAKDIFTRSDMSQSYKAVK 1679

Query: 3310 KSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----LGIKPS 3474
            K C+  +FS VW L   QSS+   + L   KV P  F  KR  YWR  +     +    S
Sbjct: 1680 KICKPIRFSLVWTLNSMQSSKSDDHFLYRGKVLPSLFPWKRTLYWRRFVQDPVSISNNSS 1739

Query: 3475 LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 3627
            LS IS+KLL+ RKR  +YTRS+HG+SL+  KVLSVGGSSLKWSKSIE  S+
Sbjct: 1740 LSAISRKLLLLRKRDTVYTRSNHGFSLRKYKVLSVGGSSLKWSKSIENRSK 1790


>ref|XP_002520303.1| protein with unknown function [Ricinus communis]
            gi|223540522|gb|EEF42089.1| protein with unknown function
            [Ricinus communis]
          Length = 2030

 Score =  207 bits (528), Expect = 2e-50
 Identities = 215/711 (30%), Positives = 305/711 (42%), Gaps = 69/711 (9%)
 Frame = +1

Query: 1861 PSEWEAEQSGNTPPASITISQCETTGVEGEVGINDSSVVSLDENMLDVDRSLGKDDLALN 2040
            P+ +E E+   T P    IS  +   +  E G  +   V   E  L VD    +      
Sbjct: 1018 PTGFEGEKIAGTTPVMAGISH-QNNSIHAESGEGEKMDVDAVEEQLIVDSGTSQCQCPSE 1076

Query: 2041 ASGLLCSER------------NGPSGSDSGDESLASSFDMRSCM------TSPEELHVNS 2166
               L   ER            +  +G  S   +L   F +R C       TS E + +  
Sbjct: 1077 VQSLNSDERMPVVNVEDENCLDAKNGLPSASNNL---FSLRDCNGTSTTDTSGEAMVLVP 1133

Query: 2167 DSSFLENTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQSNIGIPQ 2346
            D+  L N      L D   I QS     +     +          G+ +S   S   I +
Sbjct: 1134 DT--LPNMDYQETLPDAPSILQSSLSIKQAGGNDEILLGMSATQGGSGISAVTSGSLITE 1191

Query: 2347 SLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQN-------------LSNSR 2487
                + +        + S+ T S  Q  +A++K +  +   +             L+++ 
Sbjct: 1192 DHAVENANSFGGKATLPSQDTKSSTQTLNAMSKEISGRKSHHNIAAYPGRSSFVFLASTS 1251

Query: 2488 KLYSSHATKSRTWCRTVNSTA-AVAEPKIEPSPIPQSNGT--SAARSVPSSYIRKGNSLV 2658
               S+H +K RTW RT +S A A+   K+  S +P          +   +SYIRKGNSLV
Sbjct: 1252 TAPSNHISKPRTWHRTDSSFAPALPGNKVFSSTVPTKCQLPKKVTKFHNTSYIRKGNSLV 1311

Query: 2659 RKDS---PSDDASHGVPGSSSVYRLSHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSV 2829
            RK +        SHG+  S+     S     +KN   TD  TG AD P   + G  ++  
Sbjct: 1312 RKPTLVAAQPLGSHGLSSSAYWLNSSGKYEVKKN---TDTRTGVADPPNFVKSGVGASFE 1368

Query: 2830 MTKAVTL-------NHSGNSL-KCTPCHLEEPLSL------SNPHINDCPPRTLDVTKER 2967
              +   L       NH  NS+  C    L E L +      S+P  +      L  +++ 
Sbjct: 1369 RPRTPPLPSSTKISNHPTNSMGDCLSSPLVERLHICAAEAASDPVTSTESNDVLKSSEDT 1428

Query: 2968 IRSSVVSECQTDSVINSDSQSTARDGNS----EKKIIYVKRRSNQLVAACNSGDMSMLGV 3135
            ++ S     QT  + N D ++   DGN+     K I YVKR+SNQL+A  N   +SM   
Sbjct: 1429 VKVSEKHMFQTGQINNLDCETEQNDGNAVSSNAKSIKYVKRKSNQLIATSNPCSLSMKNS 1488

Query: 3136 DNTRSQLSDGYYKSRGNQLLRASSKNH----VAKGNAHATVSGLVPQSVVPKTS-TRRQS 3300
             +T +  SDGYYK R NQL+R S +NH     +  +      G    ++    S T+R+S
Sbjct: 1489 HSTAALPSDGYYKRRKNQLIRTSVENHEKPTASMPDESVNTEGQALHNITSGRSLTKRRS 1548

Query: 3301 G--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----L 3459
                AK+ + SKFSSVW L  +QS +   +SL  +KV P     KRA  WRS +     +
Sbjct: 1549 RKVVAKTRKPSKFSSVWTLHSAQSLKDDSHSLHSQKVLPQLLPWKRATSWRSFIPSSAAI 1608

Query: 3460 GIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXX 3639
             I  S S IS+KLL+ RKR  +YTRS HGYSL+ SKVLSVGGSSLKWSKSIE+ S+    
Sbjct: 1609 SINGSSSLISRKLLLLRKRDTVYTRSKHGYSLRKSKVLSVGGSSLKWSKSIERQSKKANE 1668

Query: 3640 XXXXXXXXXXXXXXXXXGC--VSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
                             G   V   +K+RN  SR           ERIFRI
Sbjct: 1669 EATLAVAEAERKKRERFGASHVDTGTKNRNSSSR-----------ERIFRI 1708


>ref|XP_004292729.1| PREDICTED: uncharacterized protein LOC101310670 [Fragaria vesca
            subsp. vesca]
          Length = 1908

 Score =  198 bits (503), Expect = 2e-47
 Identities = 196/635 (30%), Positives = 295/635 (46%), Gaps = 38/635 (5%)
 Frame = +1

Query: 1996 LDVDRSLGKDDLALNASGLLC-SERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDS 2172
            +++D    KD L    S LL  ++ N  + ++S DE + S  D  S   +PE     +D+
Sbjct: 975  MELDYLCVKDKLPFVPSCLLSIAKGNEVTATNSIDEGMKSVPDTLSDTGTPETSTSITDA 1034

Query: 2173 SFLENTKASACLLDNEMICQSDNIFNEKHVFADP-NTISHGKCSGAAVSKSQSNIGIPQS 2349
              L    +   + D E +C  D  F  K   A   N  S  K +    + ++ +  +   
Sbjct: 1035 HLLICNPSVVKMFD-EKVCGDDQKFELKSEVASAGNFFSETKTNLTLDNVTEGHQSVTGK 1093

Query: 2350 LLKDTSQVVKKL-YPIH--SKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSS-HATKS 2517
             +    Q  KK  + +H  S  +  K+Q+  A  K++P       + S+K  SS H +K 
Sbjct: 1094 TVPLKLQESKKTSHGLHLLSAESALKSQLGQATHKIVPGHPYPTFTTSQKTTSSTHISKP 1153

Query: 2518 RTWCRTVNSTAAVAEPKIEPSP--IPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASH 2691
            RTW R  NS+A+       P    +PQ NG   +    +SY+RKGN+LVR+ +       
Sbjct: 1154 RTWHRNANSSASPLHASTLPPQRQLPQRNGKFES----NSYVRKGNTLVRRPASVAAVPQ 1209

Query: 2692 GVPG-SSSVYRL--SHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSG 2862
               G +SSVY+L  S    ++KN   +D      +  ++ R G+I          L    
Sbjct: 1210 SSQGLNSSVYQLNISGIDGSKKN-AGSDGRVDIKNPSSLMRTGKIIAPSDRPTAPLPSEV 1268

Query: 2863 NSLKCTPCHLEEPLSLSNPHINDC------PPRTLDV------TKERIRSSVVSECQTDS 3006
                     L  P  ++ P ++D       P    D+       K+ + +S   E  +  
Sbjct: 1269 KMYTSAAISLGTPSQVAEPPLSDFFGTKSDPMNCSDMKDAEGSVKDLLATSDPPEHHSGP 1328

Query: 3007 VINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGN 3186
            V NS   S A   ++ KK+IYVKR+ NQLVA+ N  D+S+   DN  +Q SDGYYK R +
Sbjct: 1329 VTNSHDGSLA--SSNVKKVIYVKRKLNQLVASSNPSDLSVHNADN--NQPSDGYYKRRKH 1384

Query: 3187 QLLRASSKNH------VAKGNAHATVSGLVPQSVVP-KTSTRRQSGFAKSCRYSKFSSVW 3345
            QL+R+S +++      +   N ++ V   +   V+P +T  +++S  A +    K S VW
Sbjct: 1385 QLIRSSLESNGKDTVLLPTDNLNSRVQKAL--KVIPSRTFNKKRSLKAVARTGKKNSLVW 1442

Query: 3346 KLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLGIKP-----SLSNISQKLLVSR 3510
                +QSS    +S   +KV PH F  KRA  WR++M          S S IS+KLL+SR
Sbjct: 1443 TPSGTQSSNNNGSSFDHQKVLPHLFPWKRARSWRTVMQTQASNFNYSSSSTISKKLLLSR 1502

Query: 3511 KRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXX 3690
             R  +YTRS+HG+SL+  KVLSVGGSSLKWSKSIE  S+                     
Sbjct: 1503 MRDTVYTRSTHGFSLRKYKVLSVGGSSLKWSKSIESRSK-------------KVNEEATR 1549

Query: 3691 GCVSIASKSRNHVSRKWV---LSVKLRPGERIFRI 3786
                +A K R H         L ++  PG+RIFRI
Sbjct: 1550 AVAEVAKKKREHNGATCASSGLKIRNSPGKRIFRI 1584


>ref|XP_006593806.1| PREDICTED: uncharacterized protein LOC100788859 [Glycine max]
          Length = 2025

 Score =  186 bits (473), Expect = 5e-44
 Identities = 200/712 (28%), Positives = 311/712 (43%), Gaps = 73/712 (10%)
 Frame = +1

Query: 1870 WEAEQSGNTPPASITISQCETTGVEGEVGINDSSVV---SLDENMLDVDRSLG-KDDLAL 2037
            W ++    +P   +     + + +EGE   N + +V   ++  ++L V    G K DL  
Sbjct: 1004 WHSDIVSFSPCEDLAFPNVQFSSLEGECKENTTPIVPTSNIQTDILAVGNIAGEKTDLQA 1063

Query: 2038 NASGLLCSE---RNGPSGSDSGDES----LASSFDMRSCMTSPEELHVN-SDSSFLENTK 2193
                    E   R+  +  +  D +    L + +++ SC  S +E+  N S+   +E+  
Sbjct: 1064 VEENYQYREHVQRSPRADMEPNDHNMKNDLLAQWNLMSCPASGDEVTTNNSNDEVIEDAP 1123

Query: 2194 ASACLLDNEMICQSDN-------IFNEKHVFA---DPNTIS----HGKCSGAAVSKSQSN 2331
              + +    M+ +  +         N++++F    +P+ IS        + +++ +++ N
Sbjct: 1124 GLSDMFSQGMVSEVPDRRVLEFTAINDENIFGVQENPDNISMVGHDSNLNTSSIQQTKKN 1183

Query: 2332 IG----------IPQSLLKDTSQVVKKL--YPIHSK---LTWSKNQVTSAIAKVLPVQHP 2466
            +           I +  + + SQV  K+    ++S    L+ +KNQ  S I K  P  H 
Sbjct: 1184 MKSDHAIEHSNLITKKTMSEQSQVSSKVTTQALNSYCFGLSGTKNQSGSIIPKTFP-GHS 1242

Query: 2467 QNLSNSRKLYSSHATKSRTWCRTVNSTAAVAEPKIEPS--------PIPQSNGTSAARSV 2622
               S +    S H +K RTW RT N+  A + P+I+PS        PI +  G       
Sbjct: 1243 FTFSKT-SASSPHVSKPRTWHRTGNNPPA-SLPRIKPSLGTVPPKKPILEMKGNFQN--- 1297

Query: 2623 PSSYIRKGNSLVRKDSPSDDASHGVPGSSSVYRLSHCTNTRKNDLATDYMTGDADAPTVK 2802
             +SY+RKGNSLVRK +P     H     SSV + S   +     + +       D     
Sbjct: 1298 -TSYVRKGNSLVRKPTPVSTLPH----ISSVNQTSLGIDEIPKSIKSGGRADVTDKQMYL 1352

Query: 2803 RRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTK------- 2961
            R G  + +   +   L     S + T   L EP S        C     D+ K       
Sbjct: 1353 RTGA-TNAPQQRTPPLPIDTKSEENTSSSLVEPPS------GGCCENASDLRKFIETDNI 1405

Query: 2962 ------ERIRSSVVSECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNS 3111
                  + ++     E Q     N DSQ  A DGN    + K+I+Y+K ++NQLVA  NS
Sbjct: 1406 APNSSEDALKHYETLENQPGPSDNGDSQGEAIDGNVFPLNTKRIVYIKPKTNQLVATSNS 1465

Query: 3112 GDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHV------AKGNAHATVSGLVPQSVV 3273
             D+S+   DN ++  SDGYYK R NQL+R + ++H+      +   A++   G       
Sbjct: 1466 CDVSVSTDDNLQTAFSDGYYKRRKNQLIRTTFESHINQTVAMSNNTAYSGGQGTSNALCN 1525

Query: 3274 PKTSTRRQSGFAKS-CRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRS 3450
             + S RR     +S C+ S+ S VW L    SSE  ++S   ++  P  F  KR  +  S
Sbjct: 1526 RRFSKRRTHKVGRSSCKRSRASLVWTLCSKNSSENDRDSQHYQRALPQLFPWKRPTFASS 1585

Query: 3451 LMLGIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRX 3630
            L      SLS IS+KLL  RKR  +YTRS HG+SL+ S+VL VGG SLKWSKSIEK S+ 
Sbjct: 1586 LN---NSSLSAISKKLLQLRKRDTVYTRSIHGFSLQKSRVLGVGGCSLKWSKSIEKKSKL 1642

Query: 3631 XXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786
                                  V I+S+S+               GERIFRI
Sbjct: 1643 ANEEATLAVAAVERKRREQKNAVCISSQSKTADC----------AGERIFRI 1684


>ref|XP_007161425.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris]
            gi|561034889|gb|ESW33419.1| hypothetical protein
            PHAVU_001G067600g [Phaseolus vulgaris]
          Length = 1984

 Score =  184 bits (468), Expect = 2e-43
 Identities = 188/622 (30%), Positives = 286/622 (45%), Gaps = 42/622 (6%)
 Frame = +1

Query: 2035 LNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLEN-TKASACLL 2211
            LN    L +++N  S   SGDE   S+        S +EL V++  +  +  ++  A  +
Sbjct: 1061 LNVKNDLLAQQNLMSCPASGDEVTTSN--------SNDELIVDAPGALSDIFSQGMASEV 1112

Query: 2212 DNEMICQSDNIFNEKHVFADPNT--ISHGKCSGAAVSKSQSNIGIPQSLLKDTSQVVKK- 2382
             +  + +   I +E     + NT  +   K +G +      N+ I +++  ++SQV  K 
Sbjct: 1113 PDRRVLELTAINDENICGVEENTSSVQEMKQNGRSDHAFGHNMMIKKTI-SESSQVSSKV 1171

Query: 2383 ----LYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSR---KLYSSHATKSRTWCRTVN 2541
                L      L+ +KNQ  S I K  P  H    S S       S+H +K RTW RT N
Sbjct: 1172 TTQALNSYRFGLSGTKNQSGSVIPKTFP-GHSLTFSRSETKSSASSTHVSKPRTWHRTGN 1230

Query: 2542 STAAVAEPKIE-----PS--PIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVP 2700
               ++  P+I      PS  PI +  G        +SY+RKGNSLVRK +P       +P
Sbjct: 1231 PPISL--PRINSVGTIPSKRPILERKGNFQN----TSYVRKGNSLVRKPTPVS----ALP 1280

Query: 2701 GSSSVYRLSHC-----TNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSGN 2865
              SSV + S       +   K++   D      + P   R G   +    +   L  +  
Sbjct: 1281 QISSVNQSSSLGFDDVSKGTKSESRVDL----TNQPMYLRAGATYSQQRQRTPPLPINTK 1336

Query: 2866 SLKCTPCHLEEPLSLSNPHINDCPPRTLDV-------TKERIRSSVVSECQTDSVINSDS 3024
            S + T   L EP S  +      P   +++       +++ ++   + E Q   + N +S
Sbjct: 1337 SEENTSSSLVEPPSGGSCENVSDPTSFIEINNNVRNSSEDTLKHYEIPENQPVPLDNGES 1396

Query: 3025 QSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQL 3192
            Q  A +GN    + K+I+Y+K ++NQLVA  NS D+S+   DN ++  SD YYK R NQL
Sbjct: 1397 QVEANNGNPLSLNTKRIVYIKPKTNQLVATSNSCDVSVPADDNGQTAFSDAYYKRRKNQL 1456

Query: 3193 LRASSKNH------VAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKS-CRYSKFSSVWKL 3351
            +R + ++H      V  G A++   G        + S +R +   +S C+ S+ S VW L
Sbjct: 1457 VRTTFESHNNQTAIVPNGKANSDGQGTSNALCNRRFSKKRLNKVGRSSCKRSRASLVWTL 1516

Query: 3352 QDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRGAIYT 3531
                SSE  +NS   +KV P  F  KRA +  S       S+S IS+KLL  RKR  +YT
Sbjct: 1517 CSKSSSENDRNSRHYQKVLPQLFPWKRATFASSFN---SSSVSAISKKLLQLRKRDTVYT 1573

Query: 3532 RSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIAS 3711
            RS HG+SL  S+VL VGG SLKWSKSIEK+S+                       V I+S
Sbjct: 1574 RSKHGFSLWKSRVLGVGGCSLKWSKSIEKNSKQANEEATLAVAAVEKKKREQKNAVCISS 1633

Query: 3712 KS-RNHVSRKWVLSVKLRPGER 3774
            +S R  + R   +  ++ P  R
Sbjct: 1634 QSKRERIFRFGSVRYRMDPSRR 1655


>ref|XP_007161424.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris]
            gi|561034888|gb|ESW33418.1| hypothetical protein
            PHAVU_001G067600g [Phaseolus vulgaris]
          Length = 1979

 Score =  184 bits (468), Expect = 2e-43
 Identities = 188/622 (30%), Positives = 286/622 (45%), Gaps = 42/622 (6%)
 Frame = +1

Query: 2035 LNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLEN-TKASACLL 2211
            LN    L +++N  S   SGDE   S+        S +EL V++  +  +  ++  A  +
Sbjct: 1061 LNVKNDLLAQQNLMSCPASGDEVTTSN--------SNDELIVDAPGALSDIFSQGMASEV 1112

Query: 2212 DNEMICQSDNIFNEKHVFADPNT--ISHGKCSGAAVSKSQSNIGIPQSLLKDTSQVVKK- 2382
             +  + +   I +E     + NT  +   K +G +      N+ I +++  ++SQV  K 
Sbjct: 1113 PDRRVLELTAINDENICGVEENTSSVQEMKQNGRSDHAFGHNMMIKKTI-SESSQVSSKV 1171

Query: 2383 ----LYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSR---KLYSSHATKSRTWCRTVN 2541
                L      L+ +KNQ  S I K  P  H    S S       S+H +K RTW RT N
Sbjct: 1172 TTQALNSYRFGLSGTKNQSGSVIPKTFP-GHSLTFSRSETKSSASSTHVSKPRTWHRTGN 1230

Query: 2542 STAAVAEPKIE-----PS--PIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVP 2700
               ++  P+I      PS  PI +  G        +SY+RKGNSLVRK +P       +P
Sbjct: 1231 PPISL--PRINSVGTIPSKRPILERKGNFQN----TSYVRKGNSLVRKPTPVS----ALP 1280

Query: 2701 GSSSVYRLSHC-----TNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSGN 2865
              SSV + S       +   K++   D      + P   R G   +    +   L  +  
Sbjct: 1281 QISSVNQSSSLGFDDVSKGTKSESRVDL----TNQPMYLRAGATYSQQRQRTPPLPINTK 1336

Query: 2866 SLKCTPCHLEEPLSLSNPHINDCPPRTLDV-------TKERIRSSVVSECQTDSVINSDS 3024
            S + T   L EP S  +      P   +++       +++ ++   + E Q   + N +S
Sbjct: 1337 SEENTSSSLVEPPSGGSCENVSDPTSFIEINNNVRNSSEDTLKHYEIPENQPVPLDNGES 1396

Query: 3025 QSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQL 3192
            Q  A +GN    + K+I+Y+K ++NQLVA  NS D+S+   DN ++  SD YYK R NQL
Sbjct: 1397 QVEANNGNPLSLNTKRIVYIKPKTNQLVATSNSCDVSVPADDNGQTAFSDAYYKRRKNQL 1456

Query: 3193 LRASSKNH------VAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKS-CRYSKFSSVWKL 3351
            +R + ++H      V  G A++   G        + S +R +   +S C+ S+ S VW L
Sbjct: 1457 VRTTFESHNNQTAIVPNGKANSDGQGTSNALCNRRFSKKRLNKVGRSSCKRSRASLVWTL 1516

Query: 3352 QDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRGAIYT 3531
                SSE  +NS   +KV P  F  KRA +  S       S+S IS+KLL  RKR  +YT
Sbjct: 1517 CSKSSSENDRNSRHYQKVLPQLFPWKRATFASSFN---SSSVSAISKKLLQLRKRDTVYT 1573

Query: 3532 RSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIAS 3711
            RS HG+SL  S+VL VGG SLKWSKSIEK+S+                       V I+S
Sbjct: 1574 RSKHGFSLWKSRVLGVGGCSLKWSKSIEKNSKQANEEATLAVAAVEKKKREQKNAVCISS 1633

Query: 3712 KS-RNHVSRKWVLSVKLRPGER 3774
            +S R  + R   +  ++ P  R
Sbjct: 1634 QSKRERIFRFGSVRYRMDPSRR 1655


>gb|EPS73988.1| hypothetical protein M569_00768, partial [Genlisea aurea]
          Length = 694

 Score =  184 bits (466), Expect = 4e-43
 Identities = 143/432 (33%), Positives = 212/432 (49%), Gaps = 2/432 (0%)
 Frame = +1

Query: 2497 SSHATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSS-YIRKGNSLVRKDSP 2673
            S+   KSRTW R+ N +  VAEP+ + S +P  + +    ++PS+ YIR+GNSLVR  SP
Sbjct: 27   SACIAKSRTWHRSGNVSVVVAEPRSQSSTLPVVHKSKMTENMPSTAYIRQGNSLVRNPSP 86

Query: 2674 SDDASHGVPGS-SSVYRLSHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTL 2850
            S      +  S  SVY+L+      KN         + DA ++     ++ S   K++ L
Sbjct: 87   SGVFPPAIRSSVKSVYKLASLE--AKNTQQFKGKVHEVDASSLLAVKPVTVS---KSLAL 141

Query: 2851 NHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQS 3030
            N +  ++ C         SL   +++ C  RT D  K+   + +V +C +DS  N++   
Sbjct: 142  NRNVKAVNCP-----SDKSLLTRNVSPC--RTSDALKDTKETILVPKCPSDSRDNAECLH 194

Query: 3031 TARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSK 3210
            T  +   +K+I+Y+KR+ NQLVAA  S   S++ +D ++  L+DGYYKS+ NQL+R SS 
Sbjct: 195  TPEE-EQKKEIVYIKRKRNQLVAASTSTSRSVVQLDKSKVSLTDGYYKSKKNQLVRKSSS 253

Query: 3211 NHVAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSL 3390
            NH  K  A    S ++P   + K STR+ S  +K       + VW L  ++S +      
Sbjct: 254  NH-TKRRASLNFSKVLPLKTITKPSTRQMSTLSK-------AFVWNLHATESPK------ 299

Query: 3391 GPRKVWPHFFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKV 3570
              RKV P     KRA +WRS    +  ++    + L  SRKRG +Y RSSHGYSLKMS V
Sbjct: 300  -TRKVLPLIVPWKRATHWRSCKYAL--NIRQNVRALPTSRKRGTVYLRSSHGYSLKMSGV 356

Query: 3571 LSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLS 3750
             SV   SLK     +  S                       C S A+  +N+     V  
Sbjct: 357  RSVAECSLKGQNPPDMKSE---------------NTNEGDACTSTATMEQNNDEGPAVHM 401

Query: 3751 VKLRPGERIFRI 3786
                  ER+FRI
Sbjct: 402  PSTSRRERVFRI 413


>ref|XP_006596227.1| PREDICTED: uncharacterized protein At1g21580-like [Glycine max]
          Length = 1672

 Score =  178 bits (451), Expect = 2e-41
 Identities = 158/477 (33%), Positives = 224/477 (46%), Gaps = 34/477 (7%)
 Frame = +1

Query: 2299 SGAAVSKSQSNIGIPQSLLKDTSQVVKK-----LYPIHSKLTWSKNQVTSAIAKVLPVQH 2463
            SG A+  S     I +  + + SQV  +     L      L+ +KNQ  S I K  P  H
Sbjct: 1156 SGHAIEHSNL---ITKKTMSEPSQVSSRVTTQALNSYRFGLSGTKNQSGSVIPKTFP-GH 1211

Query: 2464 PQNLSNSRKLYSSHATKSRTWCRTVN---STAAVAEPKIEPSPIPQSNGTSAARSVPSSY 2634
                S +    S H +K RTW RT N   ++    +P +E  P  +    +      +SY
Sbjct: 1212 SFTFSKA-SASSPHVSKPRTWLRTGNIPPTSVLRIKPSVETVPPKRPILETKGNFQNTSY 1270

Query: 2635 IRKGNSLVRKDSPSDDASHGVPGSSSVYRLSHC-TNTRKNDLATDYMTGDADAPTVKRRG 2811
            +RKGNSLVRK +P       +P  SSV + S    +     + +       D P   + G
Sbjct: 1271 VRKGNSLVRKPTPVST----LPQISSVNQTSSLGIDEIPKSIKSGRRADGTDKPMYLKTG 1326

Query: 2812 QISTSVM-TKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTK--------- 2961
             I+     T  + ++        T        SL  P    C     DV K         
Sbjct: 1327 AINAPQQRTPPLPID--------TKLEENRSSSLVEPPSGGCCENASDVRKFIETDNIAP 1378

Query: 2962 ----ERIRSSVVSECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGD 3117
                + ++     E Q+    N +SQ  A DGN    + K+I+Y+K ++NQLVA  NS D
Sbjct: 1379 NSSEDALKHCETPENQSGPSDNGESQGEANDGNVFPLNTKRIVYIKPKTNQLVATSNSYD 1438

Query: 3118 MSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNH----VAKGNAHATVSGLVPQSVV--PK 3279
            +S+   DN ++  SDGYYK R NQL+R + ++H    VA  N  A   G    + +   +
Sbjct: 1439 VSVSTDDNLQTAFSDGYYKRRKNQLVRTTIESHINQTVAMPNNTANSDGQGTSNALCNRR 1498

Query: 3280 TSTRRQSGFAKSC-RYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM 3456
             S +R     +S  + S+ S VW L    SSE  ++S   ++  P  F  KRAA+  SL 
Sbjct: 1499 FSKKRTHKVGRSSFKRSRASLVWTLCSKNSSENDRDSRHYQRALPLLFPWKRAAFASSLN 1558

Query: 3457 LGIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 3627
                 SLS IS+KLL  RKR  +YTRS HG+SL+ S+VL VGG SLKWSKSIEK+S+
Sbjct: 1559 ---NSSLSAISKKLLQLRKRDTVYTRSIHGFSLRKSRVLGVGGCSLKWSKSIEKNSK 1612


>ref|XP_002302217.2| zinc finger family protein [Populus trichocarpa]
            gi|550344506|gb|EEE81490.2| zinc finger family protein
            [Populus trichocarpa]
          Length = 2120

 Score =  177 bits (449), Expect = 3e-41
 Identities = 211/672 (31%), Positives = 294/672 (43%), Gaps = 70/672 (10%)
 Frame = +1

Query: 1981 LDENMLDVDRSLGKDDLALNASGLLCSERN------GPSGSDSGDESLASSFDMRSCMTS 2142
            LDEN+  +D   G    A N S  + +  +      G S ++SGDE +    +  S   S
Sbjct: 1161 LDENIPSIDVDDGGFHGAKNDSPCMSNNPSSFGDGFGVSFTNSGDELVEIVPETLSDRGS 1220

Query: 2143 PEELHVNSDSSFLENTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKS 2322
            PE L     +S  +N+         E I ++D+      + A+   I+ G  S  ++S S
Sbjct: 1221 PETLPDVMGTSLSKNSV--------EKIHENDD-----KIPAERPVINVGSDSSMSISSS 1267

Query: 2323 QSN---IGIPQSLLKDTSQVVKK--LYPIHSKLTWS------------KNQVTSAIAKVL 2451
            Q+    + +  ++ +D     K   L    SK+T              KN  +  I+K+ 
Sbjct: 1268 QNAKVVLNLDHAVERDQLLTGKTGHLPSQDSKITTQMPNAKSGDLYGKKNHSSHPISKIY 1327

Query: 2452 PVQHPQNLSNSRK-LYSSHATKSRTWCRTVN-STAAVAEPKIEPSPIPQSN--GTSAARS 2619
              +     S S+    SS  +K+RTW R  N S +A    K   S +P          +S
Sbjct: 1328 SGRSSFVFSASKSSASSSRISKTRTWHRNDNCSDSAPPSNKAFSSTVPAQRLFPRKGDKS 1387

Query: 2620 VPSSYIRKGNSLVRKDSPSDDASHGVPGSSSVYRL-SHCTNTRKNDLATDYMTGDADAPT 2796
              +SYIRKGNSLVRK +    +      SSSVY+L S  T+  K    +D     AD   
Sbjct: 1388 QRTSYIRKGNSLVRKPTSVAQSPGPHALSSSVYQLNSSGTDEPKKSAGSDSRIDLADPLN 1447

Query: 2797 VKRRGQISTS--------VMTKAVTLNHSGNSL--KCTPCHLEEPLSLSNPHI------- 2925
            V R G +  S        + + +   N + NSL  + +    E   SL    +       
Sbjct: 1448 VLRTGGMDASFEKPRTPSLSSVSKISNRASNSLGGRASSPLAEHLHSLCTETVTVPAKLL 1507

Query: 2926 --NDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQSTARDGNSE-----KKIIYVKRRS 3084
              ND P  + DV K  I  S ++  Q   + N +  S   DGN+      K + YVKR+S
Sbjct: 1508 ESNDVPKSSDDVLK--ISGSPIT--QNSQISNLECHSDTNDGNTVALANGKSLTYVKRKS 1563

Query: 3085 NQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHATVSGLVPQ 3264
            NQLVA+ N    S+    NT S   D YYK R NQL+R S ++ + K  A      L  +
Sbjct: 1564 NQLVASSNPCASSVQNAHNTSS---DSYYKRRKNQLIRTSLESQI-KQTASIPDESLNSE 1619

Query: 3265 SVVPKTS-------TRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFS 3423
                  S        R++    K+C+ SK S VW L  +Q S+   +S    KV PH F 
Sbjct: 1620 GQTALNSFSRNFSKRRQRKVVTKTCKPSKLSLVWTLHGAQLSKNDGDSSHCGKVLPHLFP 1679

Query: 3424 SKRAAYWRSLM-----LGIKPSLSNISQ----KLLVSRKRGAIYTRSSHGYSLKMSKVLS 3576
             KRA Y RS +     +    SLS I      KLL+ RKR   YTRS HG+SL+ SKVLS
Sbjct: 1680 WKRATYRRSSLPNSSSISDHSSLSTIGYNNWWKLLLLRKRNTEYTRSKHGFSLRKSKVLS 1739

Query: 3577 VGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIA--SKSRNHVSRKWVLS 3750
            VGGSSLKWSKSIEK S+                     G   +A  +KSRN +SR     
Sbjct: 1740 VGGSSLKWSKSIEKHSKKANEEATLAVAAAERKKREQRGAAHVACPTKSRN-ISR----- 1793

Query: 3751 VKLRPGERIFRI 3786
                  ERIFR+
Sbjct: 1794 ------ERIFRV 1799

