BLASTX nr result

ID: Mentha28_contig00007371 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00007371
         (2343 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU22288.1| hypothetical protein MIMGU_mgv1a000316mg [Mimulus...   649   0.0  
ref|XP_006347374.1| PREDICTED: uncharacterized protein LOC102596...   518   e-144
gb|EPS74726.1| hypothetical protein M569_00028, partial [Genlise...   490   e-135
ref|XP_002268966.1| PREDICTED: uncharacterized protein LOC100247...   490   e-135
ref|XP_004242147.1| PREDICTED: uncharacterized protein LOC101249...   486   e-134
ref|XP_007026078.1| Homeodomain-like superfamily protein, putati...   468   e-129
ref|XP_002518479.1| conserved hypothetical protein [Ricinus comm...   457   e-126
ref|XP_006389624.1| hypothetical protein POPTR_0021s00740g [Popu...   457   e-126
ref|XP_006428336.1| hypothetical protein CICLE_v10010907mg [Citr...   455   e-125
ref|XP_006480350.1| PREDICTED: uncharacterized protein LOC102624...   454   e-125
ref|XP_007026080.1| Homeodomain-like superfamily protein, putati...   446   e-122
ref|XP_004295271.1| PREDICTED: uncharacterized protein LOC101297...   442   e-121
ref|XP_007213734.1| hypothetical protein PRUPE_ppa000251mg [Prun...   437   e-119
ref|XP_007026079.1| Homeodomain-like superfamily protein, putati...   436   e-119
gb|EXC05724.1| hypothetical protein L484_011305 [Morus notabilis]     423   e-115
ref|XP_006594422.1| PREDICTED: uncharacterized protein LOC102661...   407   e-111
ref|XP_006597583.1| PREDICTED: uncharacterized protein LOC100794...   407   e-110
ref|XP_004486161.1| PREDICTED: uncharacterized protein LOC101502...   397   e-108
emb|CBI23241.3| unnamed protein product [Vitis vinifera]              394   e-107
ref|XP_002887874.1| DNA binding protein [Arabidopsis lyrata subs...   382   e-103

>gb|EYU22288.1| hypothetical protein MIMGU_mgv1a000316mg [Mimulus guttatus]
          Length = 1264

 Score =  649 bits (1675), Expect = 0.0
 Identities = 405/787 (51%), Positives = 471/787 (59%), Gaps = 14/787 (1%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFPL +S CSAESDGQGE ENTP D             PKKT+A  LLEK KN+PV  
Sbjct: 517  EPLFPLHSSPCSAESDGQGEIENTPQDSNRIISCS-----PKKTMAAALLEKTKNEPVAL 571

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            VPKEIA+LAQRFWPLFNPALYP KPPPA++  RVLFTDAEDELLALGLMEYN DWKAIQ+
Sbjct: 572  VPKEIAKLAQRFWPLFNPALYPHKPPPASLTIRVLFTDAEDELLALGLMEYNNDWKAIQK 631

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCKSRHQIFVRQKNR+SSKAP NPIKAVR IKNSPL+ EEIARIE+GLK+FKLD++S
Sbjct: 632  RFLPCKSRHQIFVRQKNRSSSKAPGNPIKAVRTIKNSPLSSEEIARIEMGLKRFKLDWIS 691

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            +WRFF+PYRDPSLLPRQWRIA GTQKSYK DATK AKRRLY L+RK             +
Sbjct: 692  IWRFFVPYRDPSLLPRQWRIACGTQKSYKSDATKNAKRRLYALKRKTSKPSTSNRHSSTE 751

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENNASSSFPTLLPSQKDNFGYK 900
            KE DS+DNA+EET  GDNH+ KEDEAYVHEAFLADW P NN SSS PT LPS  +N   K
Sbjct: 752  KEDDSTDNAVEET-KGDNHLRKEDEAYVHEAFLADWRPNNNVSSSLPTSLPSH-ENSQAK 809

Query: 901  DTQPPIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSF 1080
            D QP I   S AASRP++S V LRPYR R+ NNARLVKLAPGLPPVNLP SVR+MSQS F
Sbjct: 810  DIQPQIISNSPAASRPANSQVILRPYRTRRPNNARLVKLAPGLPPVNLPASVRIMSQSDF 869

Query: 1081 INSQA---AKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQ 1251
             +SQA   AK S N    AG + EN+           V SSAK  P   + V +T S+++
Sbjct: 870  KSSQAVASAKISVNTSRMAGAVVENR-----------VASSAKSVPSTSNSVCITASNKR 918

Query: 1252 RNQSDVATNRCTVERGDSDLQMHPLLFQAPQDG---------HLXXXXXXXXXXXXXGKQ 1404
                +          GDS LQMHPLLFQ+PQ+          +               +Q
Sbjct: 919  VEVPE--------RGGDSVLQMHPLLFQSPQNASSIMPYYPVNSTTSTSSSFTFFSGKQQ 970

Query: 1405 PQLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPN 1584
            P+LSL LFHNPR I+DAVNFLS SSK P +  A++ GVDFHPLLQR+D+   D+ +A   
Sbjct: 971  PKLSLGLFHNPRHIKDAVNFLSMSSKTPPQENASSLGVDFHPLLQRSDD--IDTASA--- 1025

Query: 1585 GKLPSIAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTS 1764
               PSIA S +                       S GTK +SL  + NELDLN   SFTS
Sbjct: 1026 ---PSIAESSR--------------------LERSSGTKVASLKGKVNELDLNFHPSFTS 1062

Query: 1765 KNQEGAESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVA 1944
             N + +ES N                        DSSK             NS +  +V 
Sbjct: 1063 -NSKHSESPN------------------------DSSK-------------NSGETRMVK 1084

Query: 1945 SRNRGSRKVSDNM-HDESLPEIVMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ 2121
            SR +GSRK SD    +ES+ EIVM                                   Q
Sbjct: 1085 SRTKGSRKCSDIAGSNESIQEIVMEQEELSDSEEEFGENVEFECEEMADSEGDSLSDSEQ 1144

Query: 2122 VVNVPNE-EVDLDIEEGRVLNSQNEYGSNACSTSEACSNGLDMVEKGKPKALPLNLNSCP 2298
            +V++ +E E+D+DI+                +TSE   N        KPK L LNLNS P
Sbjct: 1145 IVDLQDEDEMDVDID----------------NTSEKVIN-------VKPKILSLNLNSFP 1181

Query: 2299 PVSPYSN 2319
            P+SP  N
Sbjct: 1182 PLSPNPN 1188


>ref|XP_006347374.1| PREDICTED: uncharacterized protein LOC102596887 [Solanum tuberosum]
          Length = 1436

 Score =  518 bits (1333), Expect = e-144
 Identities = 339/818 (41%), Positives = 445/818 (54%), Gaps = 42/818 (5%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENT--PPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPV 174
            +PLFP++N   +AE DG+    +   PP           +R  KKT+A  L+EKAK Q V
Sbjct: 567  KPLFPVQNIHFTAEPDGRASLYSNVVPPSSSI-------SRKSKKTLAAVLVEKAKQQAV 619

Query: 175  TSVPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAI 354
             SVP EIA+LAQRF+PLFNPALYP KPPPA +ANR+LFTDAEDELLALGLMEYNTDWKAI
Sbjct: 620  ASVPNEIAKLAQRFYPLFNPALYPHKPPPAMVANRLLFTDAEDELLALGLMEYNTDWKAI 679

Query: 355  QQRFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDF 534
            QQR+LPCKS+HQIFVRQKNR+SSKAP+NPIKAVRR+KNSPLT EE+ARIE GLK FKLD+
Sbjct: 680  QQRYLPCKSKHQIFVRQKNRSSSKAPDNPIKAVRRMKNSPLTAEEVARIEEGLKVFKLDW 739

Query: 535  MSVWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXX 714
            MSVW+F +PYRDPSLLPRQWR A GTQKSY  DA+KKAKRRLYE  RK            
Sbjct: 740  MSVWKFIVPYRDPSLLPRQWRTAIGTQKSYISDASKKAKRRLYESERKKLKSGALETWHI 799

Query: 715  XD-KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPE----------NNASSSFP 861
               K+ D +D+AIEE     N  D+ +EAYVHEAFLADW P           +N +   P
Sbjct: 800  SSRKKDDVADSAIEE-----NCTDRNEEAYVHEAFLADWRPAISSIQVNHSMSNPAEKIP 854

Query: 862  TL---------LPSQKDNFGYKDTQPPIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVK 1014
             L         +  + +N G ++ Q  I  +   + R S++    R    RK NN +LVK
Sbjct: 855  PLQLLGVESSQVAEKMNNNGSRNWQSQISNEFPVSLRSSETESFSRGNGARKFNNGQLVK 914

Query: 1015 LAPGLPPVNLPPSVRVMSQSSF----INSQAAKDSGNIPSNAGL--MAENQSLHAG---S 1167
            LAPGLPPVNLPPSVRVMSQS+F    + +      G+  +  G+   A  ++ +A    +
Sbjct: 915  LAPGLPPVNLPPSVRVMSQSAFKSYHVGTYPRAFGGDASTGDGVRDSAAPKTANAAKPYT 974

Query: 1168 NMHLGVGSSAKFGPMRKDHVHVTTSSQQRNQSDVATNRCTVERGDSDLQMHPLLFQAPQD 1347
            N  +  GS +          +++  + Q  +        T E+ +S L+MHPLLF+AP+D
Sbjct: 975  NYFVKDGSFSSSAGRN----NISNQNLQETRLSKDNKNVTDEKDESGLRMHPLLFRAPED 1030

Query: 1348 GHL------XXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAAT 1509
            G L                   G QP  +LSLFH+PR+    VNFL KSS P +K  + +
Sbjct: 1031 GPLPYNQSNSSFSTSSSFNFFSGCQP--NLSLFHHPRQSAHTVNFLDKSSNPGDK-TSIS 1087

Query: 1510 SGVDFHPLLQRTDNEGAD-SLAAHPNGKLPSIAASRQGCAPIQKHPSSTTKPSVDGISSA 1686
            SG DFHPLLQRTD+   D  +A+       +   SR  C  +Q         +VD  S+ 
Sbjct: 1088 SGFDFHPLLQRTDDANCDLEVASAVTRPSCTSETSRGWCTQVQN--------AVDSSSNV 1139

Query: 1687 SMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNAAQRNTRRSLGAPIPGVIESKNTK 1866
            +    +S + +  NE+DL + LSFTS  Q+   SR  A R   RS          +  ++
Sbjct: 1140 ACSIPSSPMGK-SNEVDLEMHLSFTSSKQKAIGSRGVADRFMGRS---------PTSASR 1189

Query: 1867 DSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLPEIVMXXXXXXXXXXX 2046
            D +   +  P+      +S     + S +  +    D++ D+SL EIVM           
Sbjct: 1190 DQNPLNNGTPNRTTQHSDSGATARILSSDEETGNGVDDLEDQSLVEIVMEQEELSDSEEE 1249

Query: 2047 XXXXXXXXXXXXXXXXXXXXXXXXQVVNVPNEEVDL----DIEEGRVLNSQNEYGSNACS 2214
                                    ++ N  NEE+D     D  +  V N+      N+CS
Sbjct: 1250 IGESVEFECEEMEDSEGEEIFESEEITNDENEEMDKVALDDSYDQHVPNTHGNSKGNSCS 1309

Query: 2215 TSEACSNGLDMVEKGKPKALPLNLNSCPPVSPYSNPKN 2328
             +E  +   D     +P +L LN N   PVSP   PK+
Sbjct: 1310 ITEDHATRFDKATNDQPSSLCLNSNPPRPVSPQVKPKS 1347


>gb|EPS74726.1| hypothetical protein M569_00028, partial [Genlisea aurea]
          Length = 1049

 Score =  490 bits (1261), Expect = e-135
 Identities = 305/659 (46%), Positives = 379/659 (57%), Gaps = 18/659 (2%)
 Frame = +1

Query: 7    LFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTSVP 186
            LFP  +S  SAES+ +GE +N  PD          + +PKK++A TLLEKAK QP+  VP
Sbjct: 469  LFPFHSSSGSAESENRGEIDNNSPD----------SDLPKKSMAATLLEKAKTQPIYLVP 518

Query: 187  KEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQRF 366
            K+IA+LAQRF P FNP+LYP KPPPA +ANRVLFT+ EDELLA+GLMEYNTDWKAIQQRF
Sbjct: 519  KDIAKLAQRFLPFFNPSLYPHKPPPAPLANRVLFTEVEDELLAMGLMEYNTDWKAIQQRF 578

Query: 367  LPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMSVW 546
            LPCKSRHQIFVRQKNRASSKAPENPIKAVRR+K SPLT EEIARIE GLK FKLD++S+W
Sbjct: 579  LPCKSRHQIFVRQKNRASSKAPENPIKAVRRMKTSPLTPEEIARIEAGLKMFKLDWISIW 638

Query: 547  RFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXDKE 726
             F LP+RDP+LLPRQWRIA GTQKSYK DA  KAKRRL ELRRK             DKE
Sbjct: 639  SFLLPHRDPALLPRQWRIALGTQKSYKSDAKTKAKRRLNELRRKASKPSHSSLYSPSDKE 698

Query: 727  GDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENNASSSFPTLLPSQKDNFGYKDT 906
            G SSDNA EE N    H D +DEAYVHEAFL+DW P NN  S F   +    +       
Sbjct: 699  GYSSDNASEEANRLRKHSDNDDEAYVHEAFLSDWRPNNNVPSIFYASMQPGMNTASGSGQ 758

Query: 907  QPPIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFIN 1086
               + + +++A R +   +   P+R R++N+AR+VKLAP LPPVNLPPSVR++SQS F  
Sbjct: 759  NRLLNYPASSALRYTQ--IYPWPHRGRRKNSARVVKLAPDLPPVNLPPSVRIISQSVFQR 816

Query: 1087 SQA---AKDSGNIP-SNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQR 1254
             QA   AK S NI  SN G +A      +GS+                     T  +   
Sbjct: 817  DQAAASAKASVNIQGSNYGTVANGARDDSGSS---------------------TKCAANC 855

Query: 1255 NQSDVATNRCTVERGDSDLQMHPLLFQAPQDGHLXXXXXXXXXXXXXGKQPQLSLSLFHN 1434
              S   +     E GD DL+MHPL F++PQD H               +   LSLSLFH+
Sbjct: 856  QPSSNGSGVVIPETGDRDLEMHPLFFRSPQDAH----------WPYYPQNSGLSLSLFHH 905

Query: 1435 PRRIRD-AVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAAS 1611
            PR ++D A++FL+    PP      +SGV FHPLLQ   N+  ++  A     +P+ A  
Sbjct: 906  PRHLQDPAMSFLNHGKCPP------SSGVVFHPLLQ--SNKAVETGTAR---AVPTTA-- 952

Query: 1612 RQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGA--- 1782
                                         K +S S +GNELDL+I LS   +N+E     
Sbjct: 953  -----------------------------KTASRSSKGNELDLDIHLSVLPENRESTLQK 983

Query: 1783 ----------ESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSD 1929
                      ++  AA R    +   P   V+E +   DS  +     +  C E+  S+
Sbjct: 984  PVAAAVAGRDDNNEAASREMNDATSFP-DIVMEQEELSDSEDEYGENVEFECEEMADSE 1041


>ref|XP_002268966.1| PREDICTED: uncharacterized protein LOC100247051 [Vitis vinifera]
          Length = 1514

 Score =  490 bits (1261), Expect = e-135
 Identities = 339/867 (39%), Positives = 443/867 (51%), Gaps = 97/867 (11%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFP  +    AE+ G+      PP           ++ PKKT+A  L+E  K Q V  
Sbjct: 561  EPLFPFPSFQSLAEASGEVSRGTMPPATNMELVSSSSHQPPKKTLAAALVESTKKQSVAL 620

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            V KEI +LAQ+F+PLFN AL+P KPPP  +ANRVLFTD+EDELLA+GLMEYN+DWKAIQQ
Sbjct: 621  VHKEIVKLAQKFFPLFNSALFPHKPPPTPVANRVLFTDSEDELLAMGLMEYNSDWKAIQQ 680

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCK++HQIFVRQKNR SSKAP+NPIKAVRR+K SPLT EE  RI+ GL+ FKLD+MS
Sbjct: 681  RFLPCKTKHQIFVRQKNRCSSKAPDNPIKAVRRMKTSPLTAEEKERIQEGLRVFKLDWMS 740

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYEL-RRKGXXXXXXXXXXXX 717
            +W+F +P+RDPSLLPRQWRIA G QKSYK D  KK KRRLYEL RRK             
Sbjct: 741  IWKFIVPHRDPSLLPRQWRIAHGIQKSYKKDTAKKEKRRLYELNRRKSKAAAGPIWETVS 800

Query: 718  DKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENNA--SSSFP---------- 861
            +KE   ++NA+EE  SGD+ +D +DEAYVHEAFLADW P N +  SS  P          
Sbjct: 801  EKEEYQTENAVEEGKSGDDDMDNDDEAYVHEAFLADWRPGNTSLISSELPFSNVTEKYLH 860

Query: 862  TLLPSQKDNFGYKDTQ---------------------------------PPIFFKSAAAS 942
            +  PSQ+     + T                                  P +   +++  
Sbjct: 861  SDSPSQEGTHVREWTSIHGSGEFRPQNVHALEFPAASNYFQNPHMFSHFPHVRNSTSSTM 920

Query: 943  RPSDSLVNL-----------RPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINS 1089
             PS  + +L           RPYRVR+ ++A  VKLAP LPPVNLPPSVR++SQS+ + S
Sbjct: 921  EPSQPVSDLTLKSSKSQFCLRPYRVRRNSSAHQVKLAPDLPPVNLPPSVRIISQSA-LKS 979

Query: 1090 QAAKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQ-QRNQSD 1266
              +  S  I +  G+           NM   + + AK G          TSS  + N +D
Sbjct: 980  YQSGVSSKISATGGIGGTGT-----ENMVPRLSNIAKSGTSHSAKARQNTSSPLKHNITD 1034

Query: 1267 VATNRCTV--------ERG-DSDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGK 1401
                R           ERG +SDL MHPLLFQA +DG L                   G 
Sbjct: 1035 PHAQRSRALKDKFAMEERGIESDLHMHPLLFQASEDGRLPYYPFNCSHGPSNSFSFFSGN 1094

Query: 1402 QPQLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHP 1581
            Q Q++LSLFHNP +    VN   KS K   K +  + G+DFHPLLQR+D+   D + + P
Sbjct: 1095 QSQVNLSLFHNPHQANPKVNSFYKSLK--SKESTPSCGIDFHPLLQRSDDIDNDLVTSRP 1152

Query: 1582 NGKLP-SIAASRQGCAPIQ-KHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLS 1755
             G+L   + + R   A +Q    +  T+P V+     S GTK S L    NELDL I LS
Sbjct: 1153 TGQLSFDLESFRGKRAQLQNSFDAVLTEPRVNSAPPRS-GTKPSCLDGIENELDLEIHLS 1211

Query: 1756 FTSKNQEGAESRNAAQRNTRRSLGAPIPG-VIESKNTKDSSKKRD------SAPDAICNE 1914
             TSK ++   S N  + N R+S      G  +E++N+     ++       S+P  +  +
Sbjct: 1212 STSKTEKVVGSTNVTENNQRKSASTLNSGTAVEAQNSSSQYHQQSDHRPSVSSPLEVRGK 1271

Query: 1915 LNSSDIPLVASRNRGSRKVSDNMHDESLPEIVMXXXXXXXXXXXXXXXXXXXXXXXXXXX 2094
            L S    LV   N     + DN+ D+SLPEIVM                           
Sbjct: 1272 LISGACALVLPSN----DILDNIGDQSLPEIVMEQEELSDSDEEIGEHVEFECEEMADSE 1327

Query: 2095 XXXXXXXXQVVNVPNE------------EVDLDIEE---GRVLNSQNEYGSNACSTSEAC 2229
                    Q+V++ ++            +VD D E+    R+ N Q+       STS   
Sbjct: 1328 GEESSDSEQIVDLQDKVVPIVEMEKLVPDVDFDNEQCEPRRIDNPQSNDCITKDSTSPVR 1387

Query: 2230 SNGLDMVEKGKPKALPLNLNSCPPVSP 2310
                      +  +  L+LNSCPP  P
Sbjct: 1388 LGSTGQERDTRCSSSWLSLNSCPPGCP 1414


>ref|XP_004242147.1| PREDICTED: uncharacterized protein LOC101249932 [Solanum
            lycopersicum]
          Length = 1418

 Score =  486 bits (1252), Expect = e-134
 Identities = 327/823 (39%), Positives = 437/823 (53%), Gaps = 47/823 (5%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGE--TENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPV 174
            +PLFP++N   +AE DG+    + + PP           ++  KKT+A  L+EKAK Q V
Sbjct: 544  KPLFPVQNIHFTAEPDGRASLYSNSVPPSSSI-------SQKSKKTLAAVLVEKAKQQAV 596

Query: 175  TSVPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAI 354
             SVP EIA+LAQRF+PLFNPALYP KPPPA +ANRVLFTDAEDELLALGLMEYNTDWKAI
Sbjct: 597  ASVPNEIAKLAQRFYPLFNPALYPHKPPPAMVANRVLFTDAEDELLALGLMEYNTDWKAI 656

Query: 355  QQRFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDF 534
            QQR+LPCKS+HQIFVRQKNR+SSKAP+NPIKAVRR+KNSPLT EE+ARIE GLK FKLD+
Sbjct: 657  QQRYLPCKSKHQIFVRQKNRSSSKAPDNPIKAVRRMKNSPLTAEEVARIEEGLKVFKLDW 716

Query: 535  MSVWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXX 714
            MSVW+F +PYRDPSLLPRQWR A GTQKSY  DA+KKAKRRLYE  RK            
Sbjct: 717  MSVWKFIVPYRDPSLLPRQWRTAIGTQKSYISDASKKAKRRLYESERKKLKSGASETWHI 776

Query: 715  XDKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPE----------NNASSSFPT 864
              ++ + +  A       DN  D+ +EAYVHEAFLADW P           +N +   P 
Sbjct: 777  SSRKNEGNCGA-------DNCTDRNEEAYVHEAFLADWRPSVSSIQVNHSMSNLAEKIPP 829

Query: 865  L---------LPSQKDNFGYKDTQPPIFFKSAAASRPSDSLVNLRPY----------RVR 987
            L         +  + +N G ++ Q  I  +   + R   SL +  P+          R++
Sbjct: 830  LQLLGVESSQVAEKMNNSGSRNWQSHISNEFPVSRR--YSLHHCTPFFSLRSSCVFLRLQ 887

Query: 988  KQNNARLVKLAPGLPPVNLPPSVRVMSQSSFIN---SQAAKDSGNIPSNAGLMAENQSLH 1158
                + LVKLAPGLPPVNLPPSVRVMSQS+F +       +  G   S    + +N    
Sbjct: 888  TFCISILVKLAPGLPPVNLPPSVRVMSQSAFKSYHVGTCPRAFGGDASTGDGVRDNAVPK 947

Query: 1159 AGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRNQSDVA--TNRCTVERGDSDLQMHPLLF 1332
              +          K GP+         S+Q   ++ ++      T E+ +S L+MHPLLF
Sbjct: 948  TANAAKPCTNYFVKDGPLSSSAGRNNISNQNLQETRLSKDNKNVTEEKDESGLRMHPLLF 1007

Query: 1333 QAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNFLSKSSKPPEK 1494
            +AP+DG                     G QP  +LSLFH+P +    VNFL KSS P +K
Sbjct: 1008 RAPEDGPFPHYQSNSSFSTSSSFNFFSGCQP--NLSLFHHPHQSAHTVNFLDKSSNPGDK 1065

Query: 1495 NAAATSGVDFHPLLQRTDNEGAD-SLAAHPNGKLPSIAASRQGCAPIQKHPSSTTKPSVD 1671
              + +SG DFHPLLQR D+   D  +A+       +   SR  C  +Q         +VD
Sbjct: 1066 -TSMSSGFDFHPLLQRIDDANCDLEVASTVTRPSCTSETSRGWCTQVQN--------AVD 1116

Query: 1672 GISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNAAQRNTRRSLGAPIPGVIE 1851
              S+ +    +S + +  NELDL + LSFT   Q+   SR  A R   RS          
Sbjct: 1117 SSSNVACAIPSSPMGK-SNELDLEMHLSFTCSKQKAIGSRGVADRFMERS---------P 1166

Query: 1852 SKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLPEIVMXXXXXX 2031
            +  ++D +   +  P+      +S     + S +  +    D++ D+SL EIVM      
Sbjct: 1167 TSASRDQNPLNNGTPNRTTQHSDSGATARILSSDEETGNGVDDLEDQSLIEIVMEQEELS 1226

Query: 2032 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXQVVNVPNEEVD-LDIEEGRVLNSQNEYGS-- 2202
                                         ++ N  NEE+D + +E+  V +    +G+  
Sbjct: 1227 DSEEEIGESVEFECEEMEDSEGEEIFESEEITNDENEEMDKVALEDSYVQHVPYTHGNSK 1286

Query: 2203 -NACSTSEACSNGLDMVEKGKPKALPLNLNSCPPVSPYSNPKN 2328
             N+CS +E+ +   D     +P +L LN N  PP +  S  K+
Sbjct: 1287 GNSCSITESHATRFDKATDDQPSSLYLNSN--PPRTVSSQVKS 1327


>ref|XP_007026078.1| Homeodomain-like superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508781444|gb|EOY28700.1| Homeodomain-like
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 1463

 Score =  468 bits (1204), Expect = e-129
 Identities = 301/730 (41%), Positives = 402/730 (55%), Gaps = 59/730 (8%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFPL       E++ +    +  P              PKKT+A TL+EK K Q V  
Sbjct: 547  EPLFPLPCFPSEVEANNEALRGSALP-AGSTVPSSVCQPPPKKTLAATLVEKTKKQSVAV 605

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            VPK+I +LAQRF+PLFNP L+P KPPP  +ANRVLFTDAEDELLALG+MEYN+DWKAIQQ
Sbjct: 606  VPKDITKLAQRFFPLFNPVLFPHKPPPVAVANRVLFTDAEDELLALGIMEYNSDWKAIQQ 665

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            R+LPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT EE+  I+ GLK +KLD+MS
Sbjct: 666  RYLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTAEELQGIQEGLKVYKLDWMS 725

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW+F +P+RDPSLLPRQWRIA GTQKSYK DATKK KRRLYE  R+             D
Sbjct: 726  VWKFIVPHRDPSLLPRQWRIALGTQKSYKQDATKKEKRRLYESERRKRKAALTNWQHVSD 785

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENN--ASSSFPTL--------- 867
            KE   ++    E  SGD+ ID  DE+YVHE FLADW P  +   SS  P L         
Sbjct: 786  KEDCQAEYTGGENCSGDDDIDNVDESYVHEGFLADWRPGTSKLISSERPCLNIRNKNLPG 845

Query: 868  ---------LPSQKDNF---------GYKDTQPPIFFKS----------AAASRP----- 948
                     +  Q +N+         G+    P    +S          + A +P     
Sbjct: 846  DMSTEEGTHVTEQSNNYVSAVIRPLTGHMQGSPHALNQSQHPYATSHHASNALQPTHPVP 905

Query: 949  ------SDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQAAKDSG 1110
                  S S + LRPYR RK NN RLVKLAP LPPVNLPPSVRV+S+S+   +Q    + 
Sbjct: 906  NMIWNASKSQIYLRPYRSRKSNNLRLVKLAPDLPPVNLPPSVRVISESALKTNQCGAYTK 965

Query: 1111 NIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRNQSDVATNRCTV 1290
               +  G++            H     + K         ++T+S  +  +S V  N+   
Sbjct: 966  VSATGDGVVDAGIGNTVSPFSHSAKALANKRHKSNPTRANITSSLSE--ESGVVKNKSVA 1023

Query: 1291 ERGD--SDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNPRRI 1446
            E     +DLQMHPLLFQAP+DG +                   G QPQL+LSLF+NP++ 
Sbjct: 1024 EERSTHTDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQLNLSLFYNPQQT 1083

Query: 1447 RDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCA 1626
              +V  L++S K  + + + + G+DFHPLLQRTD+  ++ +       L S+    +  A
Sbjct: 1084 NHSVESLTRSLKMKD-SVSISCGIDFHPLLQRTDDTNSELVTECSTASL-SVNLDGKSVA 1141

Query: 1627 PIQKHPSSTTK-PSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNAAQ 1803
            P   +PS+  +  SV   S  +  ++ SS + + NELDL I LS  S  +  A S +AA 
Sbjct: 1142 PC--NPSNAVQMKSVAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTKENAALSGDAAT 1199

Query: 1804 RNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNM 1983
             +   ++      ++ S+N  ++     S+ +   +   +S IP     ++ + +  D+ 
Sbjct: 1200 HHKNSAV-----SLLNSQNAAETRDTTHSSGNKFVSGARASTIP-----SKTTGRYMDDT 1249

Query: 1984 HDESLPEIVM 2013
             D+S  EIVM
Sbjct: 1250 SDQSHLEIVM 1259


>ref|XP_002518479.1| conserved hypothetical protein [Ricinus communis]
            gi|223542324|gb|EEF43866.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1399

 Score =  457 bits (1177), Expect = e-126
 Identities = 309/733 (42%), Positives = 397/733 (54%), Gaps = 62/733 (8%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLF L      AE++G+    NTPP            + PKKT+A +++E  K Q V  
Sbjct: 500  EPLFQLPRFPSVAEANGEVSKGNTPP-AVSSVPSTPGQQPPKKTLAASIVENVKKQSVAL 558

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            VPK+I++LAQRF  LFNPAL+P KPPPA ++NR+LFTD+EDELLALG+MEYNTDWKAIQQ
Sbjct: 559  VPKDISKLAQRFLQLFNPALFPHKPPPAAVSNRILFTDSEDELLALGMMEYNTDWKAIQQ 618

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT EEI  I+ GL+  K D+MS
Sbjct: 619  RFLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTAEEIESIQEGLRVLKHDWMS 678

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYEL-RRKGXXXXXXXXXXXX 717
            V RF +P+RDPSLLPRQWRIA GTQ+SYKLDA KK KRR+YE  RR+             
Sbjct: 679  VCRFIVPHRDPSLLPRQWRIALGTQRSYKLDAAKKEKRRIYESNRRRCKTADLANWQQVS 738

Query: 718  DKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPE--NNASSSFPTL-------- 867
            DKE +  D+   E NSGD+++D  +EAYVH+AFLADW P+  N  SS  P L        
Sbjct: 739  DKEDNQVDSTGGENNSGDDYVDNPNEAYVHQAFLADWRPDASNLISSEHPCLNLRDKNFL 798

Query: 868  ---LP---------SQKDNF-GYKDTQPPIFFK---SAAASRPSDSLVNLRPYRVRKQNN 999
               LP         S  DN  G+   +  +      S  +   + S   L PY  R+ + 
Sbjct: 799  TGALPREGTRIKNQSHIDNMHGFPYARYSVHLNHQVSDTSQGAAKSQFYLWPYWTRRTDG 858

Query: 1000 ARLVKLAPGLPPVNLPPSVRVMSQSSFINSQAAKDSGNIPSNAGLMA----ENQSLHAGS 1167
            A LVKLAP LPPVNLPP+VRV+SQ++F ++Q A     +P+  G       EN       
Sbjct: 859  AHLVKLAPDLPPVNLPPTVRVISQTAFKSNQCAVPI-KVPALGGTSGDARKENIVPQPAV 917

Query: 1168 NMHLGVGSSAKFGPMRKDHV--HVTTS------SQQRNQSDVATNRCTV-ERG-DSDLQM 1317
              +L   S A     +++ V   +TTS      S    +S +  + C   ERG +SDLQM
Sbjct: 918  VANLRSTSLAMTKRDKRNQVGDKITTSCPEEFTSSHPEESAILHDTCAAEERGTESDLQM 977

Query: 1318 HPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNFLSKSS 1479
            HPLLFQ+P+DG L                     QPQL+LSLFH+ R     V+  +KSS
Sbjct: 978  HPLLFQSPEDGRLSYYPLSCSTGASSSFTFFSANQPQLNLSLFHSSRPANHTVDCFNKSS 1037

Query: 1480 KPPEKNAAATSGVDFHPLLQRTDNEGAD---------------SLAAHPNGKLPSIAASR 1614
            K  E + +A+ G+DFHPLLQR + E  D                 +A P   L ++    
Sbjct: 1038 KTGE-STSASCGIDFHPLLQRAEEENIDFATSCSIAHQYVCLGGKSAQPQNPLGAV---- 1092

Query: 1615 QGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRN 1794
            Q  +P+   PS+T             G+K  S   + NELDL I LS  S  ++   SR+
Sbjct: 1093 QTKSPVNSGPSTT-------------GSKPPSSIEKANELDLEIHLSSMSAVEKTRGSRD 1139

Query: 1795 AAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVS 1974
                N       P      S NT D  K  D+               +    N  +R   
Sbjct: 1140 VGASNQLE----PSTSAPNSGNTIDKDKSADA---------------IAVQSNNDARCDM 1180

Query: 1975 DNMHDESLPEIVM 2013
            ++  D++ PEIVM
Sbjct: 1181 EDKGDQAPPEIVM 1193


>ref|XP_006389624.1| hypothetical protein POPTR_0021s00740g [Populus trichocarpa]
            gi|550312453|gb|ERP48538.1| hypothetical protein
            POPTR_0021s00740g [Populus trichocarpa]
          Length = 1441

 Score =  457 bits (1177), Expect = e-126
 Identities = 299/698 (42%), Positives = 383/698 (54%), Gaps = 67/698 (9%)
 Frame = +1

Query: 121  PKKTVATTLLEKAKNQPVTSVPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAE 300
            PKKT+A +++E  K Q V  VPK+I++LAQRF+PLFNP L+P KPPPA +ANRVLFTD+E
Sbjct: 530  PKKTLAASIVESTKKQSVALVPKDISKLAQRFFPLFNPVLFPHKPPPAAVANRVLFTDSE 589

Query: 301  DELLALGLMEYNTDWKAIQQRFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLT 480
            DELLALG+MEYNTDWKAIQQRFLPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT
Sbjct: 590  DELLALGIMEYNTDWKAIQQRFLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLT 649

Query: 481  LEEIARIELGLKKFKLDFMSVWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRL 660
             EE  RI+ GL+ +KLD++SVW+F +P+RDPSLLPRQ RIA GTQKSYK DA KK KRR+
Sbjct: 650  TEETERIQEGLRVYKLDWLSVWKFVVPHRDPSLLPRQLRIALGTQKSYKQDAAKKEKRRI 709

Query: 661  YELRRKGXXXXXXXXXXXXDKE---------------GDSSDNAIEETNSGDNHIDKEDE 795
             E R++             DKE                + +D   +  +SGD+ +D  +E
Sbjct: 710  SEARKRSRTTELSNWKPASDKEFNVLPNVIKCFDWVQDNQADRTGKGNSSGDDCVDNVNE 769

Query: 796  AYVHEAFLADWMP---------------------ENNASSSFPTL-------LPSQKDNF 891
            AYVH+AFL+DW P                      NN     P L       LP    + 
Sbjct: 770  AYVHQAFLSDWRPGSSGLISSDTISREDQNTREHPNNCRPGEPQLWIDNMNGLPYGSSSH 829

Query: 892  GY--------KDTQPPIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLP 1047
             Y         +T  P +  S  +   S   ++LRPYR RK +   LV+LAP LPPVNLP
Sbjct: 830  HYPLAHAKPSPNTMLPNYQISNMSVSISKPQIHLRPYRSRKTDGVHLVRLAPDLPPVNLP 889

Query: 1048 PSVRVMSQSSFINSQAAKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHV 1227
             SVRV+SQS+F  +Q         S        ++  A    H+G   +      R+D  
Sbjct: 890  RSVRVISQSAFERNQCGSSIKVSTSGIRTGDAGKNNIAAQLPHIGNLRTPSSVDSRRDKT 949

Query: 1228 -----HVTTSSQQRNQSDVATNRCTV-ERG-DSDLQMHPLLFQAPQDGHL------XXXX 1368
                 HVT S  +  QS +  N CT  ERG DSDLQMHPLLFQAP+ G L          
Sbjct: 950  NQAADHVTDSHPE--QSAIVHNVCTAEERGTDSDLQMHPLLFQAPEGGCLPYLPLSCSSG 1007

Query: 1369 XXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTD 1548
                     G QPQL+LSLFHNP +    V+  +KSSK  +  +A+ S +DFHPLLQRTD
Sbjct: 1008 TSSSFSFFSGNQPQLNLSLFHNPLQANHVVDGFNKSSKSKDSTSASCS-IDFHPLLQRTD 1066

Query: 1549 NEGADSLAAHPNGKLPSIAASRQG-CAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQG 1725
             E  + + A  N   P+      G  A  Q H  +    S       ++  K SS + + 
Sbjct: 1067 EENNNLVMACSN---PNQFVCLSGESAQFQNHFGAVQNKSFVNNIPIAVDPKHSSSNEKA 1123

Query: 1726 NELDLNIQLSFTSKNQEGAESRNAAQRNTRRS-LGAPIPG-VIESKNTKDSSKKRDSAPD 1899
            N+LDL+I LS  S  +    SR+    N  RS    P  G  +E+        + +  P 
Sbjct: 1124 NDLDLDIHLSSNSAKEVSERSRDVGANNQPRSTTSEPKSGRRMETCKINSPRDQHNEHPT 1183

Query: 1900 AICNELNSSDIPLVASRNRGSRKVSDNMHDESLPEIVM 2013
               N ++ +D   V S N  +  + D + D+S PEIVM
Sbjct: 1184 VHSNLVSGADASPVQSNNVSTCNM-DVVGDQSHPEIVM 1220


>ref|XP_006428336.1| hypothetical protein CICLE_v10010907mg [Citrus clementina]
            gi|557530393|gb|ESR41576.1| hypothetical protein
            CICLE_v10010907mg [Citrus clementina]
          Length = 1424

 Score =  455 bits (1171), Expect = e-125
 Identities = 301/735 (40%), Positives = 400/735 (54%), Gaps = 64/735 (8%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFP  +     E++ +     T P            + PK+++A  L+E  K Q V  
Sbjct: 529  EPLFPFPSFASLIEANSEVYKGRTLPSANTITSSPS-RQPPKRSLAAALVESTKKQSVAL 587

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            V KEI++LA+RF+PLFNP+L+P KPPP ++ANRVLFTDAEDELLALG+MEYNTDWKAIQQ
Sbjct: 588  VTKEISKLARRFFPLFNPSLFPHKPPPPSVANRVLFTDAEDELLALGMMEYNTDWKAIQQ 647

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT +EI  I+ GLK FKLD+MS
Sbjct: 648  RFLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTAKEIECIQEGLKVFKLDWMS 707

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW+F +P+RDPSLL RQWRIA GTQK YK DA KK KRRLYEL+R+             D
Sbjct: 708  VWKFVVPHRDPSLLRRQWRIALGTQKCYKQDANKKEKRRLYELKRRCKTADLANWHLDSD 767

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMP--ENNASSSFP----------- 861
            KE +++   I   N  D +I+   E YVHE FLADW P   N  SS  P           
Sbjct: 768  KEVENAGGVI---NGADGYIENTQEGYVHEGFLADWRPGVYNQGSSGNPCINLGDKHPSC 824

Query: 862  -------TLLPSQKDNFGYKDTQPPI---------------FFKS--------------- 930
                   T +  + +NF      PP                 + S               
Sbjct: 825  GILLREGTHIGEEPNNFVSDGAHPPTNNMHEHPYALNRSQDLYPSHLTHVRHDVLNSMQP 884

Query: 931  -----AAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQA 1095
                   AS+ S S V L PYR R+ NNA LVKLAP LPPVNLPPSVRV+ QS+F   ++
Sbjct: 885  NHPVPNMASKTSKSQVCLPPYRARRSNNAHLVKLAPDLPPVNLPPSVRVIPQSAF---KS 941

Query: 1096 AKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRNQSDVAT 1275
             +   ++  +A   AE+ + H+GS  HL        G  +++ V    ++    +S V  
Sbjct: 942  VQRGSSVKVSA---AESNAGHSGS-QHL-----VTAGRDKRNTVTENVANSHLEESHVQE 992

Query: 1276 NRCTVERGDSDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNP 1437
             R T    + DLQMHPLLFQAP+DGHL                   G QPQL+LSLFHNP
Sbjct: 993  ERGT----EPDLQMHPLLFQAPEDGHLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNP 1048

Query: 1438 RRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQ 1617
            R++  A++  +KS K  E + + +  +DFHPLL+RT+    ++L   P+    S+ + R+
Sbjct: 1049 RQLSHALSCFNKSLKTKE-STSGSCVIDFHPLLKRTE-VANNNLVTTPSNARISVGSERK 1106

Query: 1618 GCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNA 1797
                     +  +K SV     A+  +  SS++ + NELDL I LS +S  +    +R  
Sbjct: 1107 SDQHKNPFDALQSKTSVSNGPFAA-NSVPSSINEKSNELDLEIHLSSSSAKERALGNREM 1165

Query: 1798 AQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVS- 1974
            A  N  +S+       + +   K  ++  D+      +     +   VAS    S + + 
Sbjct: 1166 APHNLMQSM------TVANSGDKTVTQNNDN-----LHYQYGENYSQVASNGHFSVQTTG 1214

Query: 1975 --DNMHDESLPEIVM 2013
              D++ D S PEIVM
Sbjct: 1215 NIDDIGDHSHPEIVM 1229


>ref|XP_006480350.1| PREDICTED: uncharacterized protein LOC102624036 isoform X1 [Citrus
            sinensis] gi|568853408|ref|XP_006480351.1| PREDICTED:
            uncharacterized protein LOC102624036 isoform X2 [Citrus
            sinensis]
          Length = 1424

 Score =  454 bits (1169), Expect = e-125
 Identities = 301/735 (40%), Positives = 399/735 (54%), Gaps = 64/735 (8%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFP  +     E++ +     T P            + PK+++A  L+E  K Q V  
Sbjct: 529  EPLFPFPSFASLIEANSEVYKGRTLPSANTITSSPS-RQPPKRSLAAALVESTKKQSVAL 587

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            V KEI++LA+RF+PLFNP+L+P KPPP ++ANRVLFTDAEDELLALG+MEYNTDWKAIQQ
Sbjct: 588  VTKEISKLARRFFPLFNPSLFPHKPPPPSVANRVLFTDAEDELLALGMMEYNTDWKAIQQ 647

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT +EI  I+ GLK FKLD+MS
Sbjct: 648  RFLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTAKEIECIQEGLKVFKLDWMS 707

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW+F +P+RDPSLL RQWRIA GTQK YK DA KK KRRLYEL+R+             D
Sbjct: 708  VWKFVVPHRDPSLLRRQWRIALGTQKCYKQDANKKEKRRLYELKRRCKTADLANWHLDSD 767

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMP--ENNASSSFP----------- 861
            KE +++   I   N  D +I+   E YVHE FLADW P   N  SS  P           
Sbjct: 768  KEVENAGGVI---NGADGYIENTQEGYVHEGFLADWRPGVYNQGSSGNPCINLGDKHPSC 824

Query: 862  -------TLLPSQKDNFGYKDTQPPI---------------FFKS--------------- 930
                   T +  + +NF      PP                 + S               
Sbjct: 825  GILLREGTHIGEEPNNFVSDGAHPPTNNMHEHPYALNRSQDLYPSHLTHVRHDVLNSMQP 884

Query: 931  -----AAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQA 1095
                   AS+ S S V L PYR R+ NNA LVKLAP LPPVNLPPSVRV+ QS+F   ++
Sbjct: 885  NHPVPNMASKTSKSQVCLPPYRARRSNNAHLVKLAPDLPPVNLPPSVRVIPQSAF---KS 941

Query: 1096 AKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRNQSDVAT 1275
             +   ++  +A   AE+ + H+GS  HL        G  +++ V    ++    +S V  
Sbjct: 942  VQRGSSVKVSA---AESNAGHSGS-QHL-----VTAGRDKRNTVTENVANSHLEESHVQE 992

Query: 1276 NRCTVERGDSDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNP 1437
             R T      DLQMHPLLFQAP+DGHL                   G QPQL+LSLFHNP
Sbjct: 993  ERGT----QPDLQMHPLLFQAPEDGHLPYYPLNCSASTSSSFSFFSGNQPQLNLSLFHNP 1048

Query: 1438 RRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQ 1617
            R++  A++  +KS K  E + + +  +DFHPLL+RT+    ++L   P+    S+ + R+
Sbjct: 1049 RQLSHALSCFNKSLKTKE-STSGSCVIDFHPLLKRTE-VANNNLVTTPSNARISVGSERK 1106

Query: 1618 GCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNA 1797
                     +  +K SV     A+  +  SS++ + NELDL I LS +S  +    +R  
Sbjct: 1107 SDQHKNPFDALQSKTSVSNGPFAA-NSVPSSINEKSNELDLEIHLSSSSAKERALGNREM 1165

Query: 1798 AQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVS- 1974
            A  N  +S+       + +   K  ++  D+      +     +   VAS    S + + 
Sbjct: 1166 APHNLMQSM------TVANSGDKTVTQNNDN-----LHYQYGENYSQVASNGHFSVQTTG 1214

Query: 1975 --DNMHDESLPEIVM 2013
              D++ D S PEIVM
Sbjct: 1215 NIDDIGDHSHPEIVM 1229


>ref|XP_007026080.1| Homeodomain-like superfamily protein, putative isoform 3 [Theobroma
            cacao] gi|508781446|gb|EOY28702.1| Homeodomain-like
            superfamily protein, putative isoform 3 [Theobroma cacao]
          Length = 1402

 Score =  446 bits (1147), Expect = e-122
 Identities = 282/681 (41%), Positives = 384/681 (56%), Gaps = 10/681 (1%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFPL       E++ +    +  P              PKKT+A TL+EK K Q V  
Sbjct: 547  EPLFPLPCFPSEVEANNEALRGSALP-AGSTVPSSVCQPPPKKTLAATLVEKTKKQSVAV 605

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            VPK+I +LAQRF+PLFNP L+P KPPP  +ANRVLFTDAEDELLALG+MEYN+DWKAIQQ
Sbjct: 606  VPKDITKLAQRFFPLFNPVLFPHKPPPVAVANRVLFTDAEDELLALGIMEYNSDWKAIQQ 665

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            R+LPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT EE+  I+ GLK +KLD+MS
Sbjct: 666  RYLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTAEELQGIQEGLKVYKLDWMS 725

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW+F +P+RDPSLLPRQWRIA GTQKSYK DATKK KRRLYE  R+             D
Sbjct: 726  VWKFIVPHRDPSLLPRQWRIALGTQKSYKQDATKKEKRRLYESERRKRKAALTNWQHVSD 785

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENNASSSFPTLLPSQKDNFGYK 900
            KE +   +  E++N+  + + +    ++  +      P     S  P        N   +
Sbjct: 786  KEAEEGTHVTEQSNNYVSAVIRPLTGHMQGS------PHALNQSQHPYATSHHASN-ALQ 838

Query: 901  DTQP-PIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSS 1077
             T P P    +A     S S + LRPYR RK NN RLVKLAP LPPVNLPPSVRV+S+S+
Sbjct: 839  PTHPVPNMIWNA-----SKSQIYLRPYRSRKSNNLRLVKLAPDLPPVNLPPSVRVISESA 893

Query: 1078 FINSQAAKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRN 1257
               +Q    +    +  G++            H     + K         ++T+S  +  
Sbjct: 894  LKTNQCGAYTKVSATGDGVVDAGIGNTVSPFSHSAKALANKRHKSNPTRANITSSLSE-- 951

Query: 1258 QSDVATNRCTVERGD--SDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQL 1413
            +S V  N+   E     +DLQMHPLLFQAP+DG +                   G QPQL
Sbjct: 952  ESGVVKNKSVAEERSTHTDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQL 1011

Query: 1414 SLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKL 1593
            +LSLF+NP++   +V  L++S K  + + + + G+DFHPLLQRTD+  ++ +       L
Sbjct: 1012 NLSLFYNPQQTNHSVESLTRSLKMKD-SVSISCGIDFHPLLQRTDDTNSELVTECSTASL 1070

Query: 1594 PSIAASRQGCAPIQKHPSSTTK-PSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKN 1770
             S+    +  AP   +PS+  +  SV   S  +  ++ SS + + NELDL I LS  S  
Sbjct: 1071 -SVNLDGKSVAPC--NPSNAVQMKSVAQCSPFATRSRPSSPNEKANELDLEIHLSSLSTK 1127

Query: 1771 QEGAESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASR 1950
            +  A S +AA  +   ++      ++ S+N  ++     S+ +   +   +S IP     
Sbjct: 1128 ENAALSGDAATHHKNSAV-----SLLNSQNAAETRDTTHSSGNKFVSGARASTIP----- 1177

Query: 1951 NRGSRKVSDNMHDESLPEIVM 2013
            ++ + +  D+  D+S  EIVM
Sbjct: 1178 SKTTGRYMDDTSDQSHLEIVM 1198


>ref|XP_004295271.1| PREDICTED: uncharacterized protein LOC101297625 [Fragaria vesca
            subsp. vesca]
          Length = 1378

 Score =  442 bits (1136), Expect = e-121
 Identities = 296/722 (40%), Positives = 377/722 (52%), Gaps = 51/722 (7%)
 Frame = +1

Query: 1    EPLFPLRN------SLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAK 162
            EPLFPL N      + C   S       N  P           ++ PKK++A  ++E  K
Sbjct: 512  EPLFPLLNFPLRDQANCEVVSGVGSSAVNGSP--------CSPSQPPKKSLAAAIVESTK 563

Query: 163  NQPVTSVPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTD 342
             Q V  VP+EIA LAQRF+PLFNPALYP KPPPA + NRVLFTDAEDELLALGLMEYNTD
Sbjct: 564  KQSVALVPREIANLAQRFYPLFNPALYPHKPPPAAVTNRVLFTDAEDELLALGLMEYNTD 623

Query: 343  WKAIQQRFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKF 522
            WKAIQQRFLPCK++HQI+VRQKNR SS+APEN IKAVRR+K SPLT EEI+ IE GLK +
Sbjct: 624  WKAIQQRFLPCKTKHQIYVRQKNRCSSRAPENSIKAVRRMKTSPLTAEEISCIEEGLKAY 683

Query: 523  KLDFMSVWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYEL-RRKGXXXXXX 699
            K D M+VW+F +P+RDPSLLPRQWR A GTQKSYKLD  KK KRRLY+L RR+       
Sbjct: 684  KYDLMAVWKFVVPHRDPSLLPRQWRTALGTQKSYKLDEAKKEKRRLYDLKRRENKKADMS 743

Query: 700  XXXXXXDKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMP-----ENN------- 843
                  +KE   ++ +  E NS D  +D   E YVHEAFLADW P     E N       
Sbjct: 744  SWQSSYEKEDCQAEKSCGENNSADGPMDNAGETYVHEAFLADWRPGTSSGERNPHPGIDG 803

Query: 844  ----------------ASSSFPTLLPSQKDNFGYKDTQPPIFFKSAAASRPSDSLVNLRP 975
                            ++S +P    S     G   +         + S  S S      
Sbjct: 804  HKEAPHSQTGNMHQFPSASKYPQNPSSHMTGVGQYASSATKLSHPVSTSSTSGSQFCYPT 863

Query: 976  YRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQAAKDSGNIPSNAGLMAENQSL 1155
            ++ R+   A LVKLAP LPPVNLPPSVRV+SQS+F  +     S    +  GL A  +  
Sbjct: 864  HQARRTTGAHLVKLAPDLPPVNLPPSVRVVSQSAFKGNVRGTTSHVAGAGGGLGATKE-- 921

Query: 1156 HAGSNMHLGVGSSAKFGPM----RKDHVHVTTSSQQRNQSDVATNRCTVERG---DSDLQ 1314
                N    VG S  F  +     K      + ++ R +   +     VE+G    SDLQ
Sbjct: 922  ----NAVSQVGRSGTFNSVAARQNKSQYAKESVTKLRPEETNSFKEKRVEKGGDTGSDLQ 977

Query: 1315 MHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNFLSKS 1476
            MHPLLFQ P+DG L                   G QPQL L+L H+P +     N +   
Sbjct: 978  MHPLLFQPPEDGRLPYYPLNCSTSNSGSYSFLSGNQPQLHLTLLHDPHQ----ENQVDGP 1033

Query: 1477 SKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHPSSTT 1656
             +  +++   + G+DFHPL+QRT+N   +S+A       P    SR       +HPS + 
Sbjct: 1034 VRTLKESNVISRGIDFHPLMQRTEN--VNSVAVTKCSTAPLAVGSR------VQHPSKSF 1085

Query: 1657 KPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNAAQRNTRRSLGAPI 1836
            +  V   + A       S    G ELDL I LS TS+ ++  +SR  +  N  +S  AP 
Sbjct: 1086 QTEVPEATGAK-----PSPDEGGIELDLEIHLSSTSRKEKTLKSREVSHHNLVKSRTAPG 1140

Query: 1837 PG---VIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLPEI 2007
             G   + +S N+       +S+  A  ++  S    LV   N  SR   D M D S P+I
Sbjct: 1141 TGTTMIAQSVNSPIYIHAENSS--ASSSKFVSGSNTLVIPSNNMSRYNPDEMGDPSQPDI 1198

Query: 2008 VM 2013
             M
Sbjct: 1199 EM 1200


>ref|XP_007213734.1| hypothetical protein PRUPE_ppa000251mg [Prunus persica]
            gi|462409599|gb|EMJ14933.1| hypothetical protein
            PRUPE_ppa000251mg [Prunus persica]
          Length = 1395

 Score =  437 bits (1123), Expect = e-119
 Identities = 296/724 (40%), Positives = 377/724 (52%), Gaps = 53/724 (7%)
 Frame = +1

Query: 1    EPLFPLRN-SLCS-----AESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAK 162
            EPLFPL N  LC+     A S       N  P            + PKK++A T++E  K
Sbjct: 535  EPLFPLPNFPLCAQANFEAVSGSGSSVSNVAPSSSS-------QQPPKKSLAATIVESTK 587

Query: 163  NQPVTSVPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTD 342
             Q V  VP+EI++LAQ F+PLFNPAL+P KPPP  +ANRVLFTDAEDELLALGLMEYN D
Sbjct: 588  KQSVAIVPREISKLAQIFFPLFNPALFPHKPPPGNMANRVLFTDAEDELLALGLMEYNMD 647

Query: 343  WKAIQQRFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKF 522
            WKAIQQRFLPCKS  QIFVRQKNR SSKAPENPIKAVRR+KNSPLT EE+A I+ GLK +
Sbjct: 648  WKAIQQRFLPCKSERQIFVRQKNRCSSKAPENPIKAVRRMKNSPLTAEELACIQEGLKAY 707

Query: 523  KLDFMSVWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXX 702
            K D+MS+W+F +P+RDP+LLPRQWRIA GTQKSYKLD  KK KRRLYE +R+        
Sbjct: 708  KYDWMSIWQFIVPHRDPNLLPRQWRIALGTQKSYKLDEAKKEKRRLYESKRRKHKSSDLS 767

Query: 703  XXXXXDKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMP-----ENNASSS--FP 861
                  ++ D         NS D   D   E YVHEAFLADW P     E N  S     
Sbjct: 768  SWQNSSEKEDCQAEKSGGENSADGFTDNAGETYVHEAFLADWRPGTSSGERNLHSGTLSQ 827

Query: 862  TLLPSQKDNFGYKDT----------QPPIFFK----------------SAAASRPSDSLV 963
              +    + FG+K+           Q P                    S   S    S  
Sbjct: 828  EAIREWANVFGHKEAPRTQTVSKYQQSPSLITGFRHFASGTTQTNHSVSHMTSNAFKSQF 887

Query: 964  NLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQAAKDSGNIPSNAG---L 1134
            N R YR R+ N A+LVKLAP LPPVNLPPSVR++SQS+F  S     S    S  G    
Sbjct: 888  NYRRYRARRTNGAQLVKLAPELPPVNLPPSVRIVSQSAFRGSLCGISSTVSASGVGSGSS 947

Query: 1135 MAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRNQSDVATNRCTVERG---DS 1305
              +N          LG+  +      +      + ++ +   S +  ++C VE G   DS
Sbjct: 948  ATDNLFSKFSQVGRLGISDAITSRQNKTHSPKDSVATLRPEDSRIVKDKC-VEEGRDTDS 1006

Query: 1306 DLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSLFHNPRRIRDAVNFL 1467
            DL MHPLLFQAP+DG L                     QPQL+LSLFHNP +    V+  
Sbjct: 1007 DLHMHPLLFQAPEDGRLPYYPLNCSNRNSSTFSFLSANQPQLNLSLFHNPHQ-GSHVDCF 1065

Query: 1468 SKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIAASRQGCAPIQKHPS 1647
             KS K     + A   +DFHPL+QRTD              + S+  +    AP+    +
Sbjct: 1066 DKSLKTSNSTSRA---IDFHPLMQRTD-------------YVSSVPVTTCSTAPLS---N 1106

Query: 1648 STTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAESRNAAQRNTRRS-L 1824
            ++  P +      ++GT     + + NELDL I LS TS+ +   + R+    N+ +S  
Sbjct: 1107 TSQTPLLGNTDPQALGT-----NEKANELDLEIHLSSTSEKENFLKRRDVGVHNSVKSRT 1161

Query: 1825 GAPIPGVIESKNTKDSSKKRDSA-PDAICNELNSSDIPLVASRNRGSRKVSDNMHDESLP 2001
             AP  G I      + S  + +       +E  S  + LV   N  SR  +D+  ++S P
Sbjct: 1162 TAPDSGTIMITQCANGSLYQHAENSSGSGSEPVSGGLTLVIPSNILSRYNADDTGEQSQP 1221

Query: 2002 EIVM 2013
            +I M
Sbjct: 1222 DIEM 1225


>ref|XP_007026079.1| Homeodomain-like superfamily protein, putative isoform 2 [Theobroma
            cacao] gi|508781445|gb|EOY28701.1| Homeodomain-like
            superfamily protein, putative isoform 2 [Theobroma cacao]
          Length = 1374

 Score =  436 bits (1120), Expect = e-119
 Identities = 276/680 (40%), Positives = 374/680 (55%), Gaps = 9/680 (1%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFPL       E++ +    +  P              PKKT+A TL+EK K Q V  
Sbjct: 547  EPLFPLPCFPSEVEANNEALRGSALP-AGSTVPSSVCQPPPKKTLAATLVEKTKKQSVAV 605

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            VPK+I +LAQRF+PLFNP L+P KPPP  +ANRVLFTDAEDELLALG+MEYN+DWKAIQQ
Sbjct: 606  VPKDITKLAQRFFPLFNPVLFPHKPPPVAVANRVLFTDAEDELLALGIMEYNSDWKAIQQ 665

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            R+LPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT EE+  I+ GLK +KLD+MS
Sbjct: 666  RYLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTAEELQGIQEGLKVYKLDWMS 725

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW+F +P+RDPSLLPRQWRIA GTQKSYK DATKK KRRLYE  R+             D
Sbjct: 726  VWKFIVPHRDPSLLPRQWRIALGTQKSYKQDATKKEKRRLYESERRKRKAALTNWQHVSD 785

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENNASSSFPTLLPSQKDNFGYK 900
            KE +   +  E++N+  + + +    ++  +      P     S  P        N   +
Sbjct: 786  KEAEEGTHVTEQSNNYVSAVIRPLTGHMQGS------PHALNQSQHPYATSHHASN-ALQ 838

Query: 901  DTQP-PIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSS 1077
             T P P    +A     S S + LRPYR RK NN RLVKLAP LPPVNLPPSVRV+S+S+
Sbjct: 839  PTHPVPNMIWNA-----SKSQIYLRPYRSRKSNNLRLVKLAPDLPPVNLPPSVRVISESA 893

Query: 1078 FINSQAAKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQRN 1257
               +Q    +    +  G++            H     + K         ++T+S  +  
Sbjct: 894  LKTNQCGAYTKVSATGDGVVDAGIGNTVSPFSHSAKALANKRHKSNPTRANITSSLSE-- 951

Query: 1258 QSDVATNRCTVERGD--SDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQL 1413
            +S V  N+   E     +DLQMHPLLFQAP+DG +                   G QPQL
Sbjct: 952  ESGVVKNKSVAEERSTHTDLQMHPLLFQAPEDGQVPYYPLNCGTGASSSFSFFSGNQPQL 1011

Query: 1414 SLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKL 1593
            +LSLF+NP++   +V  L++S K  + + + + G+DFHPLLQRTD+  ++          
Sbjct: 1012 NLSLFYNPQQTNHSVESLTRSLKMKD-SVSISCGIDFHPLLQRTDDTNSE---------- 1060

Query: 1594 PSIAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQ 1773
              +  S   C+P                   +  ++ SS + + NELDL I LS  S  +
Sbjct: 1061 --LMKSVAQCSPF------------------ATRSRPSSPNEKANELDLEIHLSSLSTKE 1100

Query: 1774 EGAESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRN 1953
              A S +AA  +   ++      ++ S+N  ++     S+ +   +   +S IP     +
Sbjct: 1101 NAALSGDAATHHKNSAV-----SLLNSQNAAETRDTTHSSGNKFVSGARASTIP-----S 1150

Query: 1954 RGSRKVSDNMHDESLPEIVM 2013
            + + +  D+  D+S  EIVM
Sbjct: 1151 KTTGRYMDDTSDQSHLEIVM 1170


>gb|EXC05724.1| hypothetical protein L484_011305 [Morus notabilis]
          Length = 1423

 Score =  423 bits (1088), Expect = e-115
 Identities = 286/686 (41%), Positives = 369/686 (53%), Gaps = 56/686 (8%)
 Frame = +1

Query: 124  KKTVATTLLEKAKNQPVTSVPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAED 303
            KKT+A TL+E  K Q +  VP+ I++L++RF+PLFNPAL+P K PP  +  RVLFTD+ED
Sbjct: 572  KKTLAATLVESTKKQSIALVPRNISKLSERFFPLFNPALFPHKAPPPGVLKRVLFTDSED 631

Query: 304  ELLALGLMEYNTDWKAIQQRFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTL 483
            ELLALG+MEYNTDWKAIQ+RFLPCKS+HQIFVRQKNR SSKAPENPIKAVRR+K SPLT 
Sbjct: 632  ELLALGMMEYNTDWKAIQERFLPCKSKHQIFVRQKNRCSSKAPENPIKAVRRMKTSPLTA 691

Query: 484  EEIARIELGLKKFKLDFMSVWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLY 663
            EE+A I+ GLK +K D+MSVW F +P+RDPSLLPRQWRIA GTQKSYKLD  KK KRRLY
Sbjct: 692  EEMACIQEGLKVYKYDWMSVWLFTVPHRDPSLLPRQWRIALGTQKSYKLDGEKKEKRRLY 751

Query: 664  EL-RRKGXXXXXXXXXXXXDKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPEN 840
            EL RRK             D + ++S       N+ D  ID   +AYVHEAFLADW P +
Sbjct: 752  ELSRRKCKSSATASWQNKADLQVENSGGG---NNNADGSIDNSGKAYVHEAFLADWRPSD 808

Query: 841  NASSS---------FPTLLPSQKDNFGY------------------KDTQPPIFF----K 927
             +  S           TL P Q  N+ Y                  K   P   F     
Sbjct: 809  PSGHSSLDIARNPHSGTLSPEQLHNYVYGKAPQTIGGYMQQFSSTSKYQHPSFHFAGVRH 868

Query: 928  SAAASRPSDSLV------------NLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQ 1071
            S A +   +SLV              RPYR RK N   LV+LAP LPPVNLPPSVRV+S 
Sbjct: 869  SGANTFEPNSLVPNTMQSTLKSQFYFRPYRARKSNGMHLVRLAPDLPPVNLPPSVRVVSL 928

Query: 1072 SSFINSQAAKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMRKDHVHVTTSSQQ 1251
                 S     +G +  +A    EN           G+    K    + +  +    S  
Sbjct: 929  RG--ASTPVSAAGGVTGDA--EKENLMSRIPLAGRSGITHVTKSRENKSNASNDCPISSI 984

Query: 1252 RNQSDVATNRCTVERG--DSDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQP 1407
              +S +  + C  + G  DSDLQMHPLLFQAP+DG L                   G QP
Sbjct: 985  AEESRIIKDTCAEDDGNIDSDLQMHPLLFQAPEDGRLPYYPLNCSPSNSSSFSFFSGNQP 1044

Query: 1408 QLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNG 1587
            QL LSL HNPR+  + V   +KS +  + + +++ G+DFHPLLQRTD         + +G
Sbjct: 1045 QLHLSLLHNPRQ-ENLVGSFTKSLQLKD-STSSSYGIDFHPLLQRTD---------YVHG 1093

Query: 1588 KLPSIAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSK 1767
             L  +    Q  + +   P +T+K                    + NELDL I +S  S+
Sbjct: 1094 DLIDV----QTESLVNADPHTTSK-----------------FVEKANELDLEIHISSASR 1132

Query: 1768 NQEGAESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKK----RDSAPDAICNELNSSDIP 1935
             +EG+ +RN    N  RS     P    +  T++S++      +S+P  I   ++     
Sbjct: 1133 -KEGSWNRNETAHNPVRS-ATNAPNSEFTSKTQNSNRSLYLHNESSPSNISRPVSGGHSS 1190

Query: 1936 LVASRNRGSRKVSDNMHDESLPEIVM 2013
            ++   N G  +  D+M D+S PEIVM
Sbjct: 1191 VLPGDNIG--RYVDDMGDQSHPEIVM 1214


>ref|XP_006594422.1| PREDICTED: uncharacterized protein LOC102661544 isoform X1 [Glycine
            max] gi|571499167|ref|XP_006594423.1| PREDICTED:
            uncharacterized protein LOC102661544 isoform X2 [Glycine
            max] gi|571499169|ref|XP_006594424.1| PREDICTED:
            uncharacterized protein LOC102661544 isoform X3 [Glycine
            max] gi|571499171|ref|XP_006594425.1| PREDICTED:
            uncharacterized protein LOC102661544 isoform X4 [Glycine
            max]
          Length = 1406

 Score =  407 bits (1047), Expect = e-111
 Identities = 295/745 (39%), Positives = 380/745 (51%), Gaps = 74/745 (9%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFP+ + +  AE++G+  +  T              + PKKT+A  L+E  K Q +  
Sbjct: 514  EPLFPVSSPV--AEANGE-ISRGTISRAVNAVSPSTGKQRPKKTLAAMLVESTKKQSIAL 570

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            V KE+A+LAQRF  LFNPAL+P KPPPA + NR+LFTD+EDELLALG+MEYNTDWKAIQQ
Sbjct: 571  VQKEVAKLAQRFLALFNPALFPHKPPPAAVVNRILFTDSEDELLALGIMEYNTDWKAIQQ 630

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCK++HQIFVRQKNR SSKA ENPIKAVRR+K SPLT EEIA I+ GLK +K D+  
Sbjct: 631  RFLPCKTKHQIFVRQKNRCSSKASENPIKAVRRMKTSPLTAEEIACIQEGLKLYKCDWTL 690

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW++ +P+RDPSLLPRQWRIA GTQKSYK+DA+K+ KRRLYE  R+             D
Sbjct: 691  VWQYIVPHRDPSLLPRQWRIALGTQKSYKIDASKREKRRLYESNRR-KSKALESWRAISD 749

Query: 721  KEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPE-----------------NNAS 849
            KE   ++ A      G   +  E   YVH+AFLADW P+                 N A 
Sbjct: 750  KEDCDAEIA------GSECMYSEVVPYVHQAFLADWRPDTSTLTYPERISTTSGEGNVAH 803

Query: 850  SSFPT----------------LLPSQKDN---------------------FGYKDTQPPI 918
            ++F                   +P Q  N                      G K     I
Sbjct: 804  NAFSQEDIQFYRGTHDYGLSGKVPHQNGNQSALPSVSKLPQPFHTMSDLRNGMKGVPSTI 863

Query: 919  FFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQAA 1098
              K       S S    RPYR R+ +NA LVKLAP LPPVNLPPSVRV+SQ++F   Q  
Sbjct: 864  NPKKPVFDVTSSSKYYCRPYRSRRAHNAHLVKLAPDLPPVNLPPSVRVVSQTAFKGFQCG 923

Query: 1099 KDSGNIPSNAGLMAENQSLHAGSNMH----LGVGSSAKFGPMRKDHVHVTTSSQQRNQSD 1266
                + P  AG+ A  +   A    H      V       P  +D V   T SQ      
Sbjct: 924  TSKVH-PPGAGVAACRKDYSASQTPHGEKSENVHPVKGARPTLEDSV---TGSQLERSET 979

Query: 1267 VATNRCTVERGD-SDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSLSL 1425
            V       E+G  +DLQMHPLLFQ  +DG+                    G QPQL+LSL
Sbjct: 980  VEGESLVAEKGTRTDLQMHPLLFQVTEDGNAPYCPLKFSSGTSSSFSFFSGSQPQLNLSL 1039

Query: 1426 FHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPSIA 1605
            FH+ ++ +  ++  +KS K  + +   + G+DFHPLLQ++D+                  
Sbjct: 1040 FHSSQQ-QSHIDCANKSLKSKD-STLRSGGIDFHPLLQKSDD-----------------T 1080

Query: 1606 ASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEGAE 1785
             S      IQ  P S     V  I++ S G     L+ + NELDL I LS  S  ++  +
Sbjct: 1081 QSPTSFDAIQ--PESLVNSGVQAIANRSSG-----LNDKSNELDLEIHLSSVSGREKSVK 1133

Query: 1786 SRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAP---------DAICNELNSSDIPL 1938
            SR   Q      +G+     I   + K    + D+AP          A   EL SS  PL
Sbjct: 1134 SR---QLKAHDPVGSKKTVAISGTSMK---PQEDTAPYCQHGVENLSAGSCELASS-APL 1186

Query: 1939 VASRNRGSRKVSDNMHDESLPEIVM 2013
            V S +  +R   D++ D+S PEIVM
Sbjct: 1187 VVSSDNITRYDVDDIGDQSHPEIVM 1211


>ref|XP_006597583.1| PREDICTED: uncharacterized protein LOC100794351 isoform X1 [Glycine
            max] gi|571517713|ref|XP_006597584.1| PREDICTED:
            uncharacterized protein LOC100794351 isoform X2 [Glycine
            max]
          Length = 1403

 Score =  407 bits (1045), Expect = e-110
 Identities = 292/745 (39%), Positives = 389/745 (52%), Gaps = 74/745 (9%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLF   + +  AE++G+  +  T              + PKKT+A  L+E  K Q +  
Sbjct: 510  EPLFTFSSPV--AEANGE-ISRGTISRAVNAVSTSTRQQRPKKTLAAMLVESTKKQSIAL 566

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            V KE+A+LAQRF  LFNPAL+P KPPPA + NR+LFTD+EDELLALG+MEYNTDWKAIQQ
Sbjct: 567  VQKEVAKLAQRFLALFNPALFPHKPPPAAVVNRILFTDSEDELLALGIMEYNTDWKAIQQ 626

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCKS+HQIFVRQKN  SSKA ENPIKAVRR+K SPLT EEIA I+ GLK +K D+  
Sbjct: 627  RFLPCKSKHQIFVRQKNHCSSKALENPIKAVRRMKTSPLTAEEIACIQEGLKIYKCDWTL 686

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXD 720
            VW++ +P+RDPSLLPRQWRIA GTQKSYK+DA+K+ KRRLYE  R+             D
Sbjct: 687  VWQYIVPHRDPSLLPRQWRIALGTQKSYKIDASKREKRRLYESNRR-KLKALESWRAISD 745

Query: 721  KEGDSSDNAIEETNSGDNHID-KEDEAYVHEAFLADWMPENNASSSFPTLLP-------- 873
            KE   ++ A      G   +D  E   YVH+AFLADW P + ++ ++P  +         
Sbjct: 746  KEDCDAEIA------GSECMDYSEVVPYVHQAFLADWRP-HTSTLTYPECISTTSREGNV 798

Query: 874  -----SQKDNFGYKDTQ------------------------PPIFF-------------- 924
                 SQKD   Y+ T                         P +F               
Sbjct: 799  AHNAFSQKDIQFYRGTHDYGLSGKVPLENGNQSALPSVSKLPQLFHTTSDLRNGMKGAPS 858

Query: 925  ----KSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQ 1092
                K       S S    RPYR R+ +NA LVKLAPGLPPVNLPPSVR++SQ++F   Q
Sbjct: 859  TINPKKPVFDVTSSSKYYCRPYRSRRAHNAHLVKLAPGLPPVNLPPSVRIVSQTAFKGFQ 918

Query: 1093 AAKDSGNIPSNAGLMA------ENQSLHA--GSNMHLGVGSSAKFGPMRKDHVHVTTSSQ 1248
                  ++P  AG+ A       +Q+ H     N+H   G+     P  +D V   T SQ
Sbjct: 919  CGTSKVHLP-GAGVAACRKDNSSSQTPHGEKSENVHPVKGAR----PTLEDSV---TGSQ 970

Query: 1249 QRNQSDVATNRCTVERG-DSDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQP 1407
                  V       E+G  SDLQMHPLLFQ  +DG++                   G QP
Sbjct: 971  LGRSDTVEDGSLVAEKGTSSDLQMHPLLFQVTEDGNVPYYPLKFSSGTSSSFSFFSGSQP 1030

Query: 1408 QLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNG 1587
            QL+LSLFH+ ++ +  ++  +KS K  + +   + G+DFHPLLQ++D+            
Sbjct: 1031 QLNLSLFHSSQQ-QSHIDCANKSLKLKD-STLRSGGIDFHPLLQKSDD------------ 1076

Query: 1588 KLPSIAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSK 1767
                   S      IQ  P S     V  I+S S G     L+ + NELDL I LS  S 
Sbjct: 1077 -----TQSPTSFDAIQ--PESLVNSGVQAIASRSSG-----LNDKSNELDLEIHLSSVSG 1124

Query: 1768 NQEGAESRNAAQRN---TRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPL 1938
             ++  +SR     +   +++++      +   ++T    ++      A   EL SS  PL
Sbjct: 1125 REKSVKSRQLKAHDPVGSKKTVAISGTAMKPQEDTAPYCQQGVENLSAGSCELASS-APL 1183

Query: 1939 VASRNRGSRKVSDNMHDESLPEIVM 2013
            V   +  +R   D++ D+S PEIVM
Sbjct: 1184 VVPNDNITRYDVDDIGDQSHPEIVM 1208


>ref|XP_004486161.1| PREDICTED: uncharacterized protein LOC101502269 isoform X1 [Cicer
            arietinum] gi|502079123|ref|XP_004486162.1| PREDICTED:
            uncharacterized protein LOC101502269 isoform X2 [Cicer
            arietinum]
          Length = 1417

 Score =  397 bits (1021), Expect = e-108
 Identities = 280/738 (37%), Positives = 370/738 (50%), Gaps = 67/738 (9%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFP  +S+  A ++    +  T              + P+KT+A  L++  K Q V  
Sbjct: 500  EPLFPFSSSVAGANNE---VSSGTISGVNSTVSSSPGKKKPRKTLAAMLVDSTKKQSVAL 556

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            VPK++A L QRF   FNPAL+P KPPPA + NR+LFTD+EDELLALG+MEYNTDWKAIQQ
Sbjct: 557  VPKKVANLTQRFLAFFNPALFPHKPPPAAVVNRILFTDSEDELLALGIMEYNTDWKAIQQ 616

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLP KS+HQIFVRQKNR SSK+ +NPIKAVRR+K SPLT EEIA I  GLK +K D+MS
Sbjct: 617  RFLPSKSKHQIFVRQKNRCSSKSSDNPIKAVRRMKTSPLTAEEIACIHEGLKHYKSDWMS 676

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYELRR---KGXXXXXXXXXX 711
            VW++ +P+RDP LLPRQWR+A GTQKSYKLD  KK KRRLYE ++   K           
Sbjct: 677  VWQYIVPHRDPFLLPRQWRVALGTQKSYKLDEGKKEKRRLYESQKRKLKATATAIECWQP 736

Query: 712  XXDKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMP------------------- 834
              DKE   ++ A        + +D  D  YVH+AFLADW P                   
Sbjct: 737  IPDKEDCEAEIA--------DGMDYSDVPYVHQAFLADWRPDTSTLNYSERISSTSLEVN 788

Query: 835  ---------------------------ENNASSSFPT-----LLPSQKDNF--GYKDTQP 912
                                       +N    +FP+     LL      F  G K T  
Sbjct: 789  LGHDAISQDIQLYRGINNYGLSGNVQHQNGNQPAFPSAYKLPLLFHSTSGFRSGMKGTPS 848

Query: 913  PIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSSFINSQ 1092
                K+      S S    RPYR R+ N ARLVKLAP LPPVNLPPSVRV+S+++F    
Sbjct: 849  ATIPKNPVFGATSSSKYYCRPYRARRANTARLVKLAPDLPPVNLPPSVRVVSETAF-KGF 907

Query: 1093 AAKDSGNIPSNAGLMAENQSLHAGSNMH---LGVGSSAKFGPMRKDHVHVTTSSQQRNQS 1263
                S N P   G+    +   A    H   +G+   A    M KD V       Q  +S
Sbjct: 908  PCGTSKNFPPGGGVTDVRKDNSASQIPHGEKIGIDHRAGARSMPKDSV----VGSQVERS 963

Query: 1264 DVATNRCTV--ERGDSDLQMHPLLFQAPQDGHL------XXXXXXXXXXXXXGKQPQLSL 1419
            + A  R  V  +   +DLQMHPLLFQ  ++G                     G+QPQL+L
Sbjct: 964  ETAEGRSVVAEKAAHADLQMHPLLFQVTEEGQTPYYPFKFSSGPSSSFSFFSGRQPQLNL 1023

Query: 1420 SLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPS 1599
            SLF +  + +  ++  +KS K  + ++    G+DFHPLLQ++++  A S           
Sbjct: 1024 SLFSSSLQ-QGHIDRANKSLK-SKNSSLRLGGIDFHPLLQKSNDTQAQS----------- 1070

Query: 1600 IAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEG 1779
                  G   IQ       +  V+         ++S L+ + NELDL+I L   S+  + 
Sbjct: 1071 ------GSDDIQ------AESLVNNSGVPDTTDRSSGLNDKSNELDLDIHLCSVSEGDKS 1118

Query: 1780 AESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRG 1959
             +SR   + +   S    I         ++ S  R       C EL S+D PLVA  +  
Sbjct: 1119 MKSRQLKEHDPIASCETAINAPYCQHGGRNPSPSR-------C-ELASND-PLVAPEDNI 1169

Query: 1960 SRKVSDNMHDESLPEIVM 2013
            +R   D++ D+S P IVM
Sbjct: 1170 TRYDVDDVGDQSHPGIVM 1187


>emb|CBI23241.3| unnamed protein product [Vitis vinifera]
          Length = 1445

 Score =  394 bits (1013), Expect = e-107
 Identities = 205/360 (56%), Positives = 250/360 (69%), Gaps = 1/360 (0%)
 Frame = +1

Query: 1    EPLFPLRNSLCSAESDGQGETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTS 180
            EPLFP  +    AE+ G+      PP           ++ PKKT+A  L+E  K Q V  
Sbjct: 488  EPLFPFPSFQSLAEASGEVSRGTMPPATNMELVSSSSHQPPKKTLAAALVESTKKQSVAL 547

Query: 181  VPKEIAELAQRFWPLFNPALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQ 360
            V KEI +LAQ+F+PLFN AL+P KPPP  +ANRVLFTD+EDELLA+GLMEYN+DWKAIQQ
Sbjct: 548  VHKEIVKLAQKFFPLFNSALFPHKPPPTPVANRVLFTDSEDELLAMGLMEYNSDWKAIQQ 607

Query: 361  RFLPCKSRHQIFVRQKNRASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMS 540
            RFLPCK++HQIFVRQKNR SSKAP+NPIKAVRR+K SPLT EE  RI+ GL+ FKLD+MS
Sbjct: 608  RFLPCKTKHQIFVRQKNRCSSKAPDNPIKAVRRMKTSPLTAEEKERIQEGLRVFKLDWMS 667

Query: 541  VWRFFLPYRDPSLLPRQWRIASGTQKSYKLDATKKAKRRLYEL-RRKGXXXXXXXXXXXX 717
            +W+F +P+RDPSLLPRQWRIA G QKSYK D  KK KRRLYEL RRK             
Sbjct: 668  IWKFIVPHRDPSLLPRQWRIAHGIQKSYKKDTAKKEKRRLYELNRRKSKAAAGPIWETVS 727

Query: 718  DKEGDSSDNAIEETNSGDNHIDKEDEAYVHEAFLADWMPENNASSSFPTLLPSQKDNFGY 897
            +KE   ++NA+EE  SGD+ +D +DEAYVHEAFLADW PE   +    +  P  +++   
Sbjct: 728  EKEEYQTENAVEEGKSGDDDMDNDDEAYVHEAFLADWRPEGTHNPHMFSHFPHVRNS--T 785

Query: 898  KDTQPPIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPPVNLPPSVRVMSQSS 1077
              T  P    S    + S S   LRPYRVR+ ++A  VKLAP LPPVNLPPSVR++SQS+
Sbjct: 786  SSTMEPSQPVSDLTLKSSKSQFCLRPYRVRRNSSAHQVKLAPDLPPVNLPPSVRIISQSA 845



 Score = 60.8 bits (146), Expect = 3e-06
 Identities = 79/312 (25%), Positives = 107/312 (34%), Gaps = 15/312 (4%)
 Frame = +1

Query: 1420 SLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGADSLAAHPNGKLPS 1599
            +LFHNP +    VN   KS K   K +  + G+DFHPLLQR+D+   D            
Sbjct: 850  NLFHNPHQANPKVNSFYKSLK--SKESTPSCGIDFHPLLQRSDDIDND------------ 895

Query: 1600 IAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDLNIQLSFTSKNQEG 1779
                      +    +  T+P V+     S GTK S L    NELDL I LS TSK ++ 
Sbjct: 896  ----------LNSFDAVLTEPRVNSAPPRS-GTKPSCLDGIENELDLEIHLSSTSKTEKV 944

Query: 1780 AESRNAAQRNTRRSLGAPIPGVIESKNTKDSSKKRDSAPDAICNELNSSDIPLVASRNRG 1959
              S N                                        L S    LV   N  
Sbjct: 945  VGSTN----------------------------------------LISGACALVLPSN-- 962

Query: 1960 SRKVSDNMHDESLPEIVMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQVVNVPN 2139
               + DN+ D+SLPEIVM                                   Q+V++ +
Sbjct: 963  --DILDNIGDQSLPEIVMEQEELSDSDEEIGEHVEFECEEMADSEGEESSDSEQIVDLQD 1020

Query: 2140 E------------EVDLDIEE---GRVLNSQNEYGSNACSTSEACSNGLDMVEKGKPKAL 2274
            +            +VD D E+    R+ N Q+       STS             +  + 
Sbjct: 1021 KVVPIVEMEKLVPDVDFDNEQCEPRRIDNPQSNDCITKDSTSPVRLGSTGQERDTRCSSS 1080

Query: 2275 PLNLNSCPPVSP 2310
             L+LNSCPP  P
Sbjct: 1081 WLSLNSCPPGCP 1092


>ref|XP_002887874.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata]
            gi|297333715|gb|EFH64133.1| DNA binding protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1257

 Score =  382 bits (980), Expect = e-103
 Identities = 262/691 (37%), Positives = 347/691 (50%), Gaps = 45/691 (6%)
 Frame = +1

Query: 55   GETENTPPDXXXXXXXXXXNRMPKKTVATTLLEKAKNQPVTSVPKEIAELAQRFWPLFNP 234
            GE  N P             +  KKT+A  L+E A+ Q V  V K+IA+LA+RF PLF  
Sbjct: 447  GEIVNNPLSSPSSSKSPSGQQQSKKTLAAILVESAQKQSVALVHKDIAKLAKRFLPLFKV 506

Query: 235  ALYPRKPPPATIANRVLFTDAEDELLALGLMEYNTDWKAIQQRFLPCKSRHQIFVRQKNR 414
            +LYP KPP A +ANRVLFTDAEDELLALG+MEYN+DWKAI+QRFLPCK  HQI+VRQKNR
Sbjct: 507  SLYPHKPPHAAVANRVLFTDAEDELLALGIMEYNSDWKAIKQRFLPCKGEHQIYVRQKNR 566

Query: 415  ASSKAPENPIKAVRRIKNSPLTLEEIARIELGLKKFKLDFMSVWRFFLPYRDPSLLPRQW 594
             SSKAPENPIKAV R+K+SPLT EEI RI+ GLK FK D+ SVW+F +PYRDPS LPRQW
Sbjct: 567  RSSKAPENPIKAVLRMKSSPLTPEEIVRIQEGLKYFKYDWTSVWKFVVPYRDPSSLPRQW 626

Query: 595  RIASGTQKSYKLDATKKAKRRLYELRRKGXXXXXXXXXXXXDKEGDSSDNAIEETNSGDN 774
            R A G QKSYKLDA KK KRRLY+ +RK             D+ G S  N   E + GD 
Sbjct: 627  RTALGIQKSYKLDAVKKEKRRLYDTKRK---FREQQASAKEDRHGASKAN---EYHVGDE 680

Query: 775  HIDKEDEAYVHEAFLADWMP------ENNASSSFPTLLPSQKDNF---------GYKDTQ 909
             ++   EAY+HE FLADW P       + +  SF        D           G K+++
Sbjct: 681  LVESSGEAYLHEGFLADWRPGMPTLFYSTSMHSFDKAKDVPGDRHESVQTCIVEGSKNSE 740

Query: 910  ------------------PPIFFKSAAASRPSDSLVNLRPYRVRKQNNARLVKLAPGLPP 1035
                              P     S  A   S + +  RPYR RK  N  +V+LAP LPP
Sbjct: 741  LGGAQILTCTQRLAPSFIPLYHHTSGTAPGASKASIITRPYRSRKLFNRSVVRLAPDLPP 800

Query: 1036 VNLPPSVRVMSQSSFINSQAAKDSGNIPSNAGLMAENQSLHAGSNMHLGVGSSAKFGPMR 1215
            +NLP SVRV+SQS F  +Q+   S       G+   ++    G              P  
Sbjct: 801  LNLPSSVRVISQSVFAKNQSETSSKTCIIKGGMSDVSRRGILGIETPCFSADGDNNVPPN 860

Query: 1216 KDHVHVTTSSQQRNQSDVATNRCTVERGDSDLQMHPLLFQAPQDGHL-----XXXXXXXX 1380
            +  V +       + S +          DSDLQMHPLLF+ P+ G +             
Sbjct: 861  EKVVDLQEDVPAESSSGMGE-----RSNDSDLQMHPLLFRTPEHGQITCYPASRDPGGSS 915

Query: 1381 XXXXXGKQPQLSLSLFHNPRRIRDAVNFLSKSSKPPEKNAAATSGVDFHPLLQRTDNEGA 1560
                   +PQL LSLF++P++I  + + L K+S P E   A      FHPLLQRT++E  
Sbjct: 916  FSFFPDNRPQL-LSLFNSPKQINHSADQLHKNSSPNEHETAQGDSC-FHPLLQRTEHE-- 971

Query: 1561 DSLAAHPNGKLPSIAASRQGCAPIQKHPSSTTKPSVDGISSASMGTKASSLSRQGNELDL 1740
             S      G L      +     +Q    +  K  + G +  S+  K  S S+    ++L
Sbjct: 972  TSYLISRRGNLDPGIGKKDKLCQLQDSSCAVEKTLIPGRNDVSL--KPFSSSKHSKNVNL 1029

Query: 1741 NIQLSFTSKNQEGAESRNAAQRN-------TRRSLGAPIPGVIESKNTKDSSKKRDSAPD 1899
            +I LS +S         +AA  +       T+ + G+ +PG         S+   D+   
Sbjct: 1030 DIYLSSSSSKVNNCGRVSAANISEAPDICMTQCNDGSEVPG---------STAPSDTISR 1080

Query: 1900 AICNELNSSDIPLVASRNRGSRKVSDNMHDE 1992
             I    + S++ +V  +   S    + M +E
Sbjct: 1081 CIDEMADQSNLGIVMEQEELSDSDEEMMEEE 1111

