BLASTX nr result

ID: Rehmannia26_contig00004618 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00004618
         (2611 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI15707.3| unnamed protein product [Vitis vinifera]              938   0.0  
ref|XP_004235273.1| PREDICTED: uncharacterized protein LOC101245...   934   0.0  
ref|XP_003532495.1| PREDICTED: uncharacterized protein LOC100816...   909   0.0  
ref|XP_006580618.1| PREDICTED: uncharacterized protein LOC100786...   902   0.0  
ref|XP_002511132.1| zinc finger protein, putative [Ricinus commu...   901   0.0  
ref|XP_004503774.1| PREDICTED: MORC family CW-type zinc finger p...   894   0.0  
gb|EXB54890.1| hypothetical protein L484_008818 [Morus notabilis]     885   0.0  
gb|EPS74601.1| hypothetical protein M569_00153, partial [Genlise...   884   0.0  
ref|XP_002280533.2| PREDICTED: uncharacterized protein LOC100266...   881   0.0  
ref|XP_004161374.1| PREDICTED: uncharacterized LOC101222073 [Cuc...   881   0.0  
gb|EMJ10266.1| hypothetical protein PRUPE_ppa025644mg [Prunus pe...   880   0.0  
ref|XP_004138252.1| PREDICTED: uncharacterized protein LOC101222...   877   0.0  
ref|XP_006436890.1| hypothetical protein CICLE_v10030712mg [Citr...   875   0.0  
gb|EOY22565.1| MORC family CW-type zinc finger protein 4, putati...   874   0.0  
gb|EOY22563.1| MORC family CW-type zinc finger protein 4, putati...   874   0.0  
ref|XP_002321726.2| hypothetical protein POPTR_0015s11300g [Popu...   861   0.0  
ref|XP_002318152.1| hypothetical protein POPTR_0012s10460g [Popu...   850   0.0  
ref|XP_006584700.1| PREDICTED: uncharacterized protein LOC100816...   830   0.0  
ref|XP_002864076.1| hypothetical protein ARALYDRAFT_495140 [Arab...   826   0.0  
ref|XP_004298543.1| PREDICTED: uncharacterized protein LOC101292...   820   0.0  

>emb|CBI15707.3| unnamed protein product [Vitis vinifera]
          Length = 830

 Score =  938 bits (2424), Expect = 0.0
 Identities = 483/778 (62%), Positives = 580/778 (74%), Gaps = 44/778 (5%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVA-----EASAKQFWKAGDYE 2363
            +GL  VLP+GFLDPLP ++ P+   + +      A+ ++ A     E S K FWKAG+YE
Sbjct: 78   DGLGIVLPLGFLDPLPPEEPPALVPKAVTSPTAVAQRSSTANRNLVEQSCKLFWKAGEYE 137

Query: 2362 GAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDM 2183
            GAPGG +DSS GG+DHVRVHPKFLHSNATSHKW LGAFAELLDNSLDE+CNGATYVNVDM
Sbjct: 138  GAPGGDFDSSAGGLDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEICNGATYVNVDM 197

Query: 2182 VKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGAD 2003
            ++++KDG++MLLIEDNGGGMDP+KMR CMSLGYSAKSK+ +TIGQYGNGFKTSTMRLGAD
Sbjct: 198  LENKKDGNRMLLIEDNGGGMDPEKMRQCMSLGYSAKSKIANTIGQYGNGFKTSTMRLGAD 257

Query: 2002 VIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDW 1823
            VIVFSRC GKDG+  TQSIGLLSYTFL+STGKEDIVVPM+DYE+ G++WNK++RSSA+DW
Sbjct: 258  VIVFSRCCGKDGKSPTQSIGLLSYTFLRSTGKEDIVVPMIDYEKGGREWNKMIRSSASDW 317

Query: 1822 NRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIR 1643
            N+NVETI QWSPFSSE DLLRQFN +++ GTRIIIYNLWEDD G LELDFDTDP DIQIR
Sbjct: 318  NKNVETIMQWSPFSSELDLLRQFNFIKEHGTRIIIYNLWEDDPGQLELDFDTDPKDIQIR 377

Query: 1642 GVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVND 1463
            GVNRDEK+I MAK+FPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG+DVEHHNVVND
Sbjct: 378  GVNRDEKNIQMAKQFPNSRHFLTYRHSLRSYASILYLRLPPGFRIILRGKDVEHHNVVND 437

Query: 1462 MMMSQEITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRV 1283
            MMM+QE+TYRPQP  DG+PKD NMVA+VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+
Sbjct: 438  MMMTQEVTYRPQPSADGVPKDLNMVAVVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWRL 497

Query: 1282 WHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYA 1103
            W+  GSDGRGVIGVLEANFVEPAHDKQGFERT VL+RLETRL+Q+QKTYW++ CHKIGYA
Sbjct: 498  WNAAGSDGRGVIGVLEANFVEPAHDKQGFERTIVLSRLETRLLQMQKTYWTTYCHKIGYA 557

Query: 1102 PRRSK---NVHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRTL 932
            PRR+K   N   RE SPD  P++P   KK + +S   +T  ++              R L
Sbjct: 558  PRRNKKLINESARETSPDYLPQTPSQPKKKVGAS-SGKTPLSNLDKHASHSNHKQGGREL 616

Query: 931  RKRPIYVDQ------------------------------SSSSEEDVRDNGRQNHTPRKQ 842
             + P  V Q                              SS S EDV D+      P ++
Sbjct: 617  ERTPETVYQSHGNGHASSKQEKRTHMPTRPRKEQSSLVPSSPSAEDVDDDDVPAVLPERE 676

Query: 841  TNG-----SSSRKMFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXX 677
             NG     S +   FG+DG Q  +  +S+    N +   +                    
Sbjct: 677  ANGRVHKASHANNSFGEDGHQISTRSQSKGDDVNGNSNSL-------------------- 716

Query: 676  XXXXXXSYVLAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKY 497
                     L QL+EEN +L++RLKRKE + +  L  DL+KE+E+ K LE +LQEA +K 
Sbjct: 717  --------ALEQLREENCELKERLKRKEGDTVVALRGDLEKEREKCKLLETELQEARQKI 768

Query: 496  EELSKEQESLIDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISS-NGK 326
            E+++KEQ+SLIDIFSEER RRD+EEENLRKKL++ASNTIQ+LL+++R LEK+ + NGK
Sbjct: 769  EDMNKEQDSLIDIFSEERDRRDIEEENLRKKLREASNTIQELLERVRVLEKMKTPNGK 826


>ref|XP_004235273.1| PREDICTED: uncharacterized protein LOC101245442 [Solanum
            lycopersicum]
          Length = 834

 Score =  934 bits (2413), Expect = 0.0
 Identities = 495/784 (63%), Positives = 571/784 (72%), Gaps = 53/784 (6%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNA---VAEASAKQFWKAGDYEGA 2357
            + +N  LP+GFLDPLP      P+   L L  P    N    V+ +S KQFWKAGDYEG+
Sbjct: 68   QDVNFALPLGFLDPLPP-----PKEPPLPLPAPPNGSNTDLGVSGSSCKQFWKAGDYEGS 122

Query: 2356 PGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVK 2177
                    +GG+DHVRVHPKFLHSNATSHKWVLGA AELLDNSLDEV NGATYVN+DMVK
Sbjct: 123  SSASSVLKSGGIDHVRVHPKFLHSNATSHKWVLGALAELLDNSLDEVSNGATYVNIDMVK 182

Query: 2176 SQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVI 1997
            ++KDGS+MLLIEDNGGGMDP++MRHCMSLGYS KSK+ DTIGQYGNGFKTSTMRLGADVI
Sbjct: 183  NKKDGSRMLLIEDNGGGMDPERMRHCMSLGYSVKSKMADTIGQYGNGFKTSTMRLGADVI 242

Query: 1996 VFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNR 1817
            VFSR  GK G+  TQSIGLLSYTFL++ G EDIVVPMLDYE+R + W++I+RSS+ DW++
Sbjct: 243  VFSRSDGKPGKSPTQSIGLLSYTFLRNKGMEDIVVPMLDYEKR-EGWDRIIRSSSDDWDK 301

Query: 1816 NVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGV 1637
            N+ETI +WSPFSSEADLLRQFN M+ QGTRI++YNLWEDDQGLLELDFD DPHDIQIRGV
Sbjct: 302  NLETIIEWSPFSSEADLLRQFNPMKGQGTRIVVYNLWEDDQGLLELDFDADPHDIQIRGV 361

Query: 1636 NRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMM 1457
            NRDE+SI MAK++PNS+HFLTYRHSLRSYA+ILYLR+ P FRIILRG+DVEHHN+VNDMM
Sbjct: 362  NRDERSIQMAKQYPNSRHFLTYRHSLRSYASILYLRVAPGFRIILRGKDVEHHNIVNDMM 421

Query: 1456 MSQEITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWH 1277
            M+QE+TYRP PG DG+PKD+NMVA V +GFVKDAK+HIDVQGFNVYHKNRLIKPFWR+WH
Sbjct: 422  MTQEVTYRPMPGADGVPKDSNMVATVKIGFVKDAKSHIDVQGFNVYHKNRLIKPFWRLWH 481

Query: 1276 PPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPR 1097
             PGSDGRGVIGVLEANFVEPAHDKQGFERTTVL+RLE RL+Q+QKTYWS+ CHKIGYAPR
Sbjct: 482  APGSDGRGVIGVLEANFVEPAHDKQGFERTTVLSRLEARLVQMQKTYWSTLCHKIGYAPR 541

Query: 1096 RSKNVHEREISPDSFP--RSPPLKKKSIASSDKTQTRFA--------------------- 986
            R+K    RE SPD +P   S P    S  SS+K     A                     
Sbjct: 542  RNKKAIAREDSPD-YPSSASQPKHNSSAKSSEKIYPSSASQSKHNSSAKSSEKIYPSSAS 600

Query: 985  -------------SXXXXXXXXXXXGDVRTLRKRPIYVDQSSSSEEDVRDN--------- 872
                         S           G +R  R  P  ++ SSS+E+D  D+         
Sbjct: 601  QSKHNSSAKSSEKSNVDGHLNGKHDGKIRRSRNIPSSLEPSSSAEDDSDDDVQVVLPKNK 660

Query: 871  --GRQNHTPRKQTNGSSSRKMFGKDGSQ---NPSGLRSREAAKNNSPAEMPARTTRXXXX 707
              G QNHT              GKDG +   +P G   R A +  SP     R TR    
Sbjct: 661  PVGHQNHTN-------------GKDGPRVMHSPPGFGQRVAEQVCSPGGNLKRVTRSSRS 707

Query: 706  XXXXXXXXXXXXXXXXSYVLAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLE 527
                               L QLK EN +L++RL+RKEEEILGDLL DLQ E+ERSKSLE
Sbjct: 708  KGDADENEGMLPDNLTE-SLEQLKAENHELKERLRRKEEEILGDLLRDLQHERERSKSLE 766

Query: 526  AQLQEATRKYEELSKEQESLIDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLE 347
            AQLQE+TRK EEL+KEQESLIDIF+EERQRRD+EEENLRKKLKDASNT+Q+LLDK++ LE
Sbjct: 767  AQLQESTRKLEELNKEQESLIDIFTEERQRRDMEEENLRKKLKDASNTVQELLDKVQVLE 826

Query: 346  KISS 335
            K  S
Sbjct: 827  KTRS 830


>ref|XP_003532495.1| PREDICTED: uncharacterized protein LOC100816702 isoform X1 [Glycine
            max]
          Length = 820

 Score =  909 bits (2350), Expect = 0.0
 Identities = 474/764 (62%), Positives = 564/764 (73%), Gaps = 29/764 (3%)
 Frame = -1

Query: 2530 AEGLNAVLPVGFLDPLPTKQTPSPRNER-LCLEFP------SARVNAVAEAS---AKQFW 2381
            +E    VLPVGFL PLP    P P     L L  P      ++RVNA    S   +KQFW
Sbjct: 69   SEAGGVVLPVGFLTPLPPAPVPVPPPAAVLSLPAPEWASNSASRVNASKSFSLNSSKQFW 128

Query: 2380 KAGDYEGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGAT 2201
            KAGDY+GAP G   SS  GMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEVCNGAT
Sbjct: 129  KAGDYDGAPLGGSGSSTVGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVCNGAT 188

Query: 2200 YVNVDMVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTST 2021
            YVNVDM+ ++KDG++MLL+EDNGGGMDP+KMR CMSLGYS KSK+ +TIGQYGNGFKTST
Sbjct: 189  YVNVDMLINKKDGTRMLLVEDNGGGMDPEKMRQCMSLGYSMKSKMANTIGQYGNGFKTST 248

Query: 2020 MRLGADVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVR 1841
            MRLGADVIVFSR  GKDG+ STQSIGLLSYTFL+STGKEDIVVPMLDYERRGQ+WNKI+R
Sbjct: 249  MRLGADVIVFSRYPGKDGKSSTQSIGLLSYTFLRSTGKEDIVVPMLDYERRGQEWNKIIR 308

Query: 1840 SSATDWNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDP 1661
            +S  DWN+NVETI QWSPFS+EADLL QFN ++D GTR+IIYNLWEDDQG LELDFD DP
Sbjct: 309  TSLDDWNKNVETIVQWSPFSNEADLLLQFNLVKDHGTRVIIYNLWEDDQGQLELDFDEDP 368

Query: 1660 HDIQIRGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEH 1481
            HDIQIRGVNRDEK+I M+K+FPNS+HFLTYRHSLRSY +ILYLR+P  FRIILRG+D+ H
Sbjct: 369  HDIQIRGVNRDEKNIQMSKEFPNSRHFLTYRHSLRSYTSILYLRLPSGFRIILRGKDILH 428

Query: 1480 HNVVNDMMMSQEITYRPQPGVDG-IPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRL 1304
            HN+VNDMMMSQE+TYRPQ GVDG +PKD+NMVA+VT+GFVKDA  H+DV GFNVYHKNRL
Sbjct: 429  HNIVNDMMMSQEVTYRPQAGVDGLLPKDSNMVAVVTIGFVKDAVHHVDVSGFNVYHKNRL 488

Query: 1303 IKPFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSN 1124
            IKPFWR+W+P GS GRGVIGVLEANFVEPAHDKQGFERT VL+RLE++LIQ+QK YWS+N
Sbjct: 489  IKPFWRIWNPAGSGGRGVIGVLEANFVEPAHDKQGFERTLVLSRLESKLIQMQKKYWSTN 548

Query: 1123 CHKIGYAPRRSK----NVHEREISPDSFPRSPPLKKKSIASSDKT-------------QT 995
            CHKIGYA  RSK    +  ++E SPD FP S   K+K     DK              Q 
Sbjct: 549  CHKIGYASNRSKIQIRDYADKEASPDYFPESSQSKRKYSTMDDKATPLTSDKLRSHSDQK 608

Query: 994  RFASXXXXXXXXXXXGDVRTLRKRPIYVDQSSSSEEDVRDNGRQNHTPRKQTNG-SSSRK 818
            R                  + R+R   + + SSS+++V +       P+K+T   S++ K
Sbjct: 609  RIQKQTDKYIAYKNGQSSVSPRRRMQSLSEQSSSDDEVSE-----VLPKKKTQKISTAEK 663

Query: 817  MFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQL 638
             F K+             +++ +     ++ TR                       L QL
Sbjct: 664  SFEKENG----------CSQDTTSRGKSSQYTRGSKLEGKSVNDGEQPPSDNDLLTLEQL 713

Query: 637  KEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDI 458
            K+EN +L++RL+RKEE+ILG +L DLQ EK+R KSLE QL +A +K EEL+ EQE+LID+
Sbjct: 714  KKENRELKERLQRKEEDILGQVLQDLQHEKDRCKSLETQLIDAEKKLEELNNEQETLIDV 773

Query: 457  FSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISSNGK 326
            F+EER RRD EE+ LR KL++ASNTI++LLDK R LE+ SS+GK
Sbjct: 774  FAEERDRRDAEEKKLRNKLEEASNTIRELLDKTRKLERKSSSGK 817


>ref|XP_006580618.1| PREDICTED: uncharacterized protein LOC100786679 [Glycine max]
          Length = 826

 Score =  902 bits (2330), Expect = 0.0
 Identities = 471/761 (61%), Positives = 564/761 (74%), Gaps = 26/761 (3%)
 Frame = -1

Query: 2530 AEGLNAVLPVGFLDPLPTKQTPSPRNER-LCLEFPSARVNAVAEA----------SAKQF 2384
            +E    VLPVGFL PLP    P+P     L L  P    N+ A            S+KQF
Sbjct: 74   SEAGGVVLPVGFLTPLPPAPAPTPPPTAVLSLPAPEWASNSTASRANASKSLSLNSSKQF 133

Query: 2383 WKAGDYEGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGA 2204
            WKAGDY+GAP G   SS  GMDHVRVHPKFLHSNATSHKW LGA AELLDNSLDEVC+GA
Sbjct: 134  WKAGDYDGAPLGGSGSSTVGMDHVRVHPKFLHSNATSHKWALGALAELLDNSLDEVCSGA 193

Query: 2203 TYVNVDMVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTS 2024
            TYVNVDM+ ++KDG++MLLIEDNGGGMDP+KMR CMSLGYS KSK+ +TIGQYGNGFKTS
Sbjct: 194  TYVNVDMLTNKKDGTRMLLIEDNGGGMDPEKMRQCMSLGYSVKSKMANTIGQYGNGFKTS 253

Query: 2023 TMRLGADVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIV 1844
            TMRLGADVIVFSR  GKD + S+QSIGLLSYTFL+STGKEDIVVPMLDYERRGQ+WNKI+
Sbjct: 254  TMRLGADVIVFSRYPGKDMKSSSQSIGLLSYTFLRSTGKEDIVVPMLDYERRGQEWNKII 313

Query: 1843 RSSATDWNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTD 1664
            R+S  DW++NVETI QWSPFS+EADLLRQFN ++D GTR+IIYNLWEDDQG LELDFD D
Sbjct: 314  RTSLDDWDKNVETIVQWSPFSNEADLLRQFNLVKDHGTRVIIYNLWEDDQGQLELDFDED 373

Query: 1663 PHDIQIRGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVE 1484
            PHDIQIRGVNRDEK+I MAK+FPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG+D+ 
Sbjct: 374  PHDIQIRGVNRDEKNIQMAKEFPNSRHFLTYRHSLRSYASILYLRLPPGFRIILRGKDIL 433

Query: 1483 HHNVVNDMMMSQEITYRPQPGVDG-IPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNR 1307
            HHN+VNDMMMSQE+TYRPQ GVDG +PKD+NMVA+VT+GFVKDA  HIDV GFNVYHKNR
Sbjct: 434  HHNIVNDMMMSQEVTYRPQAGVDGLLPKDSNMVAVVTIGFVKDAVHHIDVSGFNVYHKNR 493

Query: 1306 LIKPFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSS 1127
            LIKPFWR+W+P GS GRGVIGVLEANFVEPAHDKQGFERT VL+RLE++LIQ+QK YWS+
Sbjct: 494  LIKPFWRIWNPAGSGGRGVIGVLEANFVEPAHDKQGFERTLVLSRLESKLIQMQKKYWST 553

Query: 1126 NCHKIGYAPRRSK----NVHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXX 959
            NC+KIGYA  RSK    +  ++E S D FP S   K+K  +++D       S        
Sbjct: 554  NCYKIGYASNRSKIQIRDSADKEASADYFPESSQSKRK-YSTTDGKAPPLTSDKLHSYSN 612

Query: 958  XXXGDVRTLRKRPIYVDQS---------SSSEEDVRDNGRQNHTPRKQTNG-SSSRKMFG 809
                  +T + R     QS         SSSE+   D+      P+K+T   S++ K F 
Sbjct: 613  QKRIQKQTEKYRVYINGQSSVSPKRKVQSSSEQSSSDDEVSEVLPKKKTQKLSTAEKSF- 671

Query: 808  KDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQLKEE 629
                +N +G      ++  +     ++ TR                       L Q K+E
Sbjct: 672  ----ENENGCFQHTTSRGKT-----SQYTRGSKLEGKDVNGGEQPLSDKDLLTLEQFKKE 722

Query: 628  NLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIFSE 449
            N +L++RL+RKEE+ILG++LH LQ EK+R KSLE QL +A +K EEL+ EQE+LID+F+E
Sbjct: 723  NRELKERLQRKEEDILGEVLHALQHEKDRCKSLETQLIDAEKKLEELNNEQETLIDVFAE 782

Query: 448  ERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISSNGK 326
            ER RRD EE+ LR KL++ASNTI++LL+K+R LE+ SS+GK
Sbjct: 783  ERDRRDAEEKKLRNKLEEASNTIKELLEKIRKLERKSSSGK 823


>ref|XP_002511132.1| zinc finger protein, putative [Ricinus communis]
            gi|223550247|gb|EEF51734.1| zinc finger protein, putative
            [Ricinus communis]
          Length = 816

 Score =  901 bits (2328), Expect = 0.0
 Identities = 485/758 (63%), Positives = 550/758 (72%), Gaps = 27/758 (3%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPR-------NERLCLEFPSARVNAVAEASAKQFWKAGD 2369
            E L  VLPVGFL PL   Q P+         N+ +CL           + S KQFWKAGD
Sbjct: 74   EELGVVLPVGFLAPL--NQVPAEAMLTTVQGNDNVCL----------IDQSCKQFWKAGD 121

Query: 2368 YEGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNV 2189
            YEGAP G WD S GGMDHVRVHPKFLHSNATSHKW LGAFAELLDN+LDEVC GATYVN+
Sbjct: 122  YEGAPCGDWDLSTGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNALDEVCYGATYVNI 181

Query: 2188 DMVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLG 2009
            DM+ + KDGS+MLLIEDNGGGMDPDKMR CMSLGYSAKSK+ +TIGQYGNGFKTSTMRLG
Sbjct: 182  DMLANWKDGSRMLLIEDNGGGMDPDKMRQCMSLGYSAKSKVANTIGQYGNGFKTSTMRLG 241

Query: 2008 ADVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSAT 1829
            ADVIVFSRC GKDG+  TQSIGLLSYTFL+STGKEDIVVPMLDYER+GQ+WNK++RSS+ 
Sbjct: 242  ADVIVFSRCPGKDGKSPTQSIGLLSYTFLRSTGKEDIVVPMLDYERKGQEWNKMIRSSSG 301

Query: 1828 DWNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQ 1649
            DWNRNVETI QWSPFSSEADLLRQFN M D GTRI+IYNLWEDD+G LELDFDTDPHDIQ
Sbjct: 302  DWNRNVETIVQWSPFSSEADLLRQFNLMSDHGTRIVIYNLWEDDEGSLELDFDTDPHDIQ 361

Query: 1648 IRGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVV 1469
            +RGVNRDEK+I MAK+FPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG+DVEHHN+V
Sbjct: 362  LRGVNRDEKNIQMAKEFPNSRHFLTYRHSLRSYASILYLRLPPCFRIILRGKDVEHHNIV 421

Query: 1468 NDMMMSQEITYRPQPGVDGIPKDTN---MVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIK 1298
            NDMM+SQEITYRPQ   DG+ KD N   M AIVT+GFVKDAK HIDVQGFNVYHKNRLIK
Sbjct: 422  NDMMLSQEITYRPQ-SADGVAKDFNLNHMAAIVTIGFVKDAKHHIDVQGFNVYHKNRLIK 480

Query: 1297 PFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCH 1118
            PFWR+W+  GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RL+Q+QKTYWS+NCH
Sbjct: 481  PFWRLWNAAGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLVQMQKTYWSTNCH 540

Query: 1117 KIGYAPRRSKNVHEREI----SPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXX 950
            KIGYAPRR+K           SPD    S   KK S A   K  +  +            
Sbjct: 541  KIGYAPRRNKRFINESTDGGSSPDYSQVSSQSKKYS-ALRGKGLSSLSDKFYSHANQNGG 599

Query: 949  GDVRTLRK--RPIYVD-----QSSSSEEDVRDNGRQNHTPRKQTNGSSSRKMFGKDGSQN 791
                T  K   P Y +       S   +    +GR+ H     +   SS  +   D +  
Sbjct: 600  KRSDTFAKNGNPAYANGHVSSNGSDGTKTSTGSGRKTH-----SKAPSSPSLHDVDDNDA 654

Query: 790  PSGLRSRE----AAKNNSPAEMPAR--TTRXXXXXXXXXXXXXXXXXXXXSYVLAQLKEE 629
               L +R+      + +SP E   +   TR                      +  +LK+E
Sbjct: 655  HIALPTRQDGLHMVRLSSPLEDTTQQAVTRSQSKAGKVDNSQHVLPESDLCNI-NELKQE 713

Query: 628  NLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIFSE 449
            N +LR+RLK++E E  G+++H     K   KSLE QLQEA +K EEL+KEQESLIDIFSE
Sbjct: 714  NQELRERLKKREAEFQGEMMHGSMCNK--CKSLEIQLQEAQQKIEELNKEQESLIDIFSE 771

Query: 448  ERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISS 335
            ER RRD EEENLRKK KDASNTIQQLLDK+R LEK+ S
Sbjct: 772  ERDRRDKEEENLRKKYKDASNTIQQLLDKVRLLEKMKS 809


>ref|XP_004503774.1| PREDICTED: MORC family CW-type zinc finger protein 4-like [Cicer
            arietinum]
          Length = 823

 Score =  894 bits (2311), Expect = 0.0
 Identities = 471/767 (61%), Positives = 559/767 (72%), Gaps = 39/767 (5%)
 Frame = -1

Query: 2509 LPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAEA----------SAKQFWKAGDYEG 2360
            LP+GFL PLP +Q PS  +  L L  P    N                 KQFWKAGDY+G
Sbjct: 82   LPIGFLSPLPPQQ-PSSSDAVLYLPAPEWASNTTPNRFNKSVNFTLQGCKQFWKAGDYDG 140

Query: 2359 APGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMV 2180
             P   +++S  GMDHVRVHPKFLHSNATSHKW LGAFAELLDN+LDEVCNGATYVNVDM+
Sbjct: 141  PPARAFETSTVGMDHVRVHPKFLHSNATSHKWALGAFAELLDNALDEVCNGATYVNVDML 200

Query: 2179 KSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADV 2000
             S+KDGS+MLL+EDNGGGMDPDK+R CMSLGYS KSK+ +TIGQYGNGFKTSTMRLGADV
Sbjct: 201  VSKKDGSRMLLVEDNGGGMDPDKIRQCMSLGYSVKSKIANTIGQYGNGFKTSTMRLGADV 260

Query: 1999 IVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWN 1820
            IVFSRC GKDG+RSTQSIGLLSYTFL++T KEDIVVPMLDYER GQ WNK++R+S  DWN
Sbjct: 261  IVFSRCQGKDGKRSTQSIGLLSYTFLRNTRKEDIVVPMLDYERDGQGWNKMLRTSLDDWN 320

Query: 1819 RNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRG 1640
             NVETI QWSPFS EADL RQFN ++DQGTR+IIYNLWEDDQG LELDFD DP+DIQIRG
Sbjct: 321  NNVETIVQWSPFSDEADLRRQFNLLKDQGTRVIIYNLWEDDQGQLELDFDEDPNDIQIRG 380

Query: 1639 VNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDM 1460
            VNRDEK I MAK FPNS HFLTYRHSLRSYA+ILYLR P  FRIILRG+DV HHN+VNDM
Sbjct: 381  VNRDEKHIKMAKDFPNSTHFLTYRHSLRSYASILYLRFPRGFRIILRGKDVLHHNIVNDM 440

Query: 1459 MMSQEITYRPQPGV-DGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRV 1283
            MMSQE+TYRPQ GV DGI KD+NMVAIVT+GFVKDA  HIDV GFNVYHKNRLIKPFWR+
Sbjct: 441  MMSQEVTYRPQSGVADGILKDSNMVAIVTIGFVKDAVHHIDVSGFNVYHKNRLIKPFWRI 500

Query: 1282 WHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYA 1103
            W+P GS GRGVIGVLEANFVEPAHDKQGFERT VL+RLE RLIQ+QK+YW SNCHKIGYA
Sbjct: 501  WNPAGSGGRGVIGVLEANFVEPAHDKQGFERTLVLSRLEQRLIQMQKSYWGSNCHKIGYA 560

Query: 1102 PRRSK----NVHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRT 935
              R K    +   ++ SPD  P S   K+K  A++ K     A+              R 
Sbjct: 561  SNRIKKHISSSAGKDASPDPVPESSQSKRKYSATNGK-----ATPLASDELNSHSKQKRI 615

Query: 934  LRKRPIYV------DQSSSSEEDVRDNGRQNH---TPRKQTNGSSSR-----KMFG---- 809
             ++   YV      DQSSS +    D+    +    P+ QT G S +     K FG    
Sbjct: 616  RKETERYVEYTNGRDQSSSEQSSYADDDSDQNDDVLPKNQTKGDSRKTRIFEKSFGNKNV 675

Query: 808  --KDGSQNPSGLRSREA----AKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVL 647
              +D +     LRS++     AK+ +  E P   +                        L
Sbjct: 676  SFQDSTSREKALRSKQGSMQEAKDVNDCEQPLSDSSS----------------------L 713

Query: 646  AQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESL 467
             QL++EN +L++RL+++EEEILG++L  L+ EK++ KS+E +L++A +K EE++KEQE+L
Sbjct: 714  EQLRKENRELKERLEKREEEILGEVLQALRDEKDKCKSVETRLRDAEQKIEEMNKEQETL 773

Query: 466  IDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISSNGK 326
            ID+FSEER RR+ EE+NLRKKL++ASNTIQ+LL+K+R LE+ SS+GK
Sbjct: 774  IDVFSEERDRRNAEEKNLRKKLQEASNTIQELLEKVRLLERKSSSGK 820


>gb|EXB54890.1| hypothetical protein L484_008818 [Morus notabilis]
          Length = 788

 Score =  885 bits (2288), Expect = 0.0
 Identities = 463/757 (61%), Positives = 538/757 (71%), Gaps = 29/757 (3%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAEASAKQFWKAGDYEGAPGG 2348
            EGL   LP+ F DPL       PR   L       +V + +    KQFWKAGDY GAP G
Sbjct: 70   EGLEVGLPIEFADPLRPLAVAEPRESDL-------KVASSSLQGCKQFWKAGDYLGAPCG 122

Query: 2347 VWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVKSQK 2168
             WDSS+GGMDHVRVHPKFLHSNATSHKW LGAFAELLDN+LDEVC GAT+VN+DM+ ++K
Sbjct: 123  DWDSSSGGMDHVRVHPKFLHSNATSHKWSLGAFAELLDNALDEVCYGATFVNIDMIVNKK 182

Query: 2167 DGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVIVFS 1988
            DGS MLLIEDNGGGMDPDKMRHCMSLGYS KSK+ +TIGQYGNGFKTSTMRLGADVIVFS
Sbjct: 183  DGSNMLLIEDNGGGMDPDKMRHCMSLGYSVKSKIANTIGQYGNGFKTSTMRLGADVIVFS 242

Query: 1987 RCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNRNVE 1808
            RC GK+G+R TQSIGLLSYTFL+STGKEDIVVPMLDYE  G  W K++RSS  DWN+NVE
Sbjct: 243  RCRGKEGKRPTQSIGLLSYTFLRSTGKEDIVVPMLDYESEGGRWRKMIRSSLGDWNKNVE 302

Query: 1807 TIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGVNRD 1628
            TI QWSPFS+EADLL QF+ M D GTRIIIYNLWEDDQ   ELDF  D HDIQIRGVNRD
Sbjct: 303  TILQWSPFSTEADLLHQFSVMNDHGTRIIIYNLWEDDQRYSELDFGADQHDIQIRGVNRD 362

Query: 1627 EKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMMMSQ 1448
            EK+I MAKKFPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG+DVEHHN+VNDMM S+
Sbjct: 363  EKNIQMAKKFPNSRHFLTYRHSLRSYASILYLRLPPGFRIILRGKDVEHHNIVNDMMYSE 422

Query: 1447 EITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPG 1268
            ++TYRPQ G DGIPKDTNM+A+VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+W+  G
Sbjct: 423  KVTYRPQHGADGIPKDTNMMAVVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWRLWNAAG 482

Query: 1267 SDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPRRSK 1088
            SDGRGVIGVLEANFVEPAHDKQGFERTT LARLE RLIQ+QKTYW   CH IGYAPRR++
Sbjct: 483  SDGRGVIGVLEANFVEPAHDKQGFERTTALARLEARLIQMQKTYWRDKCHVIGYAPRRNE 542

Query: 1087 NVH--EREISPDSFPRSPPLKKKSIASSDKTQT------------------RFASXXXXX 968
             V    +E SPD    +P  K K   +S++ +T                  + AS     
Sbjct: 543  KVSSGNKETSPDYLSETPASKGKGARTSNRKETPISDKTHSHNQTHQRQASKGASTVNGY 602

Query: 967  XXXXXXGDVRTLRKRPI-----YVDQSSSSE----EDVRDNGRQNHTPRKQTNGSSSRKM 815
                  GD   +   PI      + +SS S      D  D+   N  P+KQ N S+S+K 
Sbjct: 603  GKHVSSGDDDDVGNTPISGKRKRISKSSFSPADYISDESDDEMHNSVPKKQANCSNSKKT 662

Query: 814  FGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQLK 635
             G     +   LR+                                         L +L+
Sbjct: 663  PGTGHVSSSLELRA-----------------------------------------LERLR 681

Query: 634  EENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIF 455
            EEN +L++RLK+ E  IL DL  +LQ EK +  SLE +L++A  K EEL+KEQ++LID+F
Sbjct: 682  EENYELKERLKKSEGTILADLRRELQDEKGKCNSLETELKQAHHKIEELNKEQDNLIDLF 741

Query: 454  SEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEK 344
            +EER RRD EE NLRKKL+DAS+TI++L DK+R LEK
Sbjct: 742  AEERDRRDDEERNLRKKLQDASHTIEELRDKVRLLEK 778


>gb|EPS74601.1| hypothetical protein M569_00153, partial [Genlisea aurea]
          Length = 764

 Score =  884 bits (2285), Expect = 0.0
 Identities = 471/752 (62%), Positives = 557/752 (74%), Gaps = 26/752 (3%)
 Frame = -1

Query: 2512 VLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAEASAKQFWKAGDYEGAPGGVWDSS 2333
            VLP GFLDPLP+KQ      +++     S    A+A  S +QFWKAGD++ + GG W S 
Sbjct: 46   VLPPGFLDPLPSKQ------KQVMAVSCSESSFALAAVSDRQFWKAGDFDDSSGGDW-SY 98

Query: 2332 NGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVKSQKDGSKM 2153
            +GGMDHVRVHP+FLHSNATSHKWVLGAFAELLDN+LDE+ NGATYV VDMV++ KD SKM
Sbjct: 99   SGGMDHVRVHPRFLHSNATSHKWVLGAFAELLDNALDEIRNGATYVAVDMVQNMKDDSKM 158

Query: 2152 LLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVIVFSRCSGK 1973
            LLIEDNGGGM PDKMRHCMSLGYSAKSK+ +TIGQYGNGFKTSTMRLGADVIVFSRC G 
Sbjct: 159  LLIEDNGGGMSPDKMRHCMSLGYSAKSKMANTIGQYGNGFKTSTMRLGADVIVFSRCRGV 218

Query: 1972 DGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNRNVETIAQW 1793
            DG R TQ+IG+LSYTFL+ST KEDIVVPMLDYE+RGQ W+KI++S+A DW  NV+ I  W
Sbjct: 219  DGERPTQTIGMLSYTFLRSTRKEDIVVPMLDYEKRGQAWSKIMKSTAGDWENNVDAIVNW 278

Query: 1792 SPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGVNRDEKSID 1613
            SPF+SE +LL+QF+ +++QGTRIIIYNLWED++G LELDFD + HDIQIRGVNRD+KSI+
Sbjct: 279  SPFTSEENLLQQFDHVKNQGTRIIIYNLWEDEEGQLELDFDANEHDIQIRGVNRDDKSIE 338

Query: 1612 MAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMMMSQEITYR 1433
            MA++FPNS+HFLTYRHSLRSYAAILYLRIPP FRI LRG+DV HHN+V+DMMMSQEITYR
Sbjct: 339  MAQRFPNSRHFLTYRHSLRSYAAILYLRIPPQFRITLRGKDVAHHNIVSDMMMSQEITYR 398

Query: 1432 PQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPGSDGRG 1253
            PQPG +GIPK +NMVA+VTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPGSDGRG
Sbjct: 399  PQPGSEGIPKSSNMVAVVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPGSDGRG 458

Query: 1252 VIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPRRSK----- 1088
            VIGVLEANFVEPAHDKQGFERTT+LARLE+RLIQ+QK+YW++NCHKIGYAPR SK     
Sbjct: 459  VIGVLEANFVEPAHDKQGFERTTILARLESRLIQMQKSYWTTNCHKIGYAPRCSKKDLTN 518

Query: 1087 -NVHERE-------------ISPDS-FPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXX 953
             NV                 +S DS F  S    + +     KT    AS          
Sbjct: 519  GNVSTVSSFSDFFADNTSFFLSVDSPFETSTTRARAANKGRAKTADLPASVRDKVSKKSG 578

Query: 952  XGDVRTLRKRP-----IYVDQSSSSEEDVRDNGRQNHTPRKQTNGSSSRKMFGKDGSQNP 788
              D  T RKRP     + VD S SS+ED            +++N   SR       +  P
Sbjct: 579  KDDTTTPRKRPSSAARLAVDGSGSSDED------------RESNSGGSRP---SPSAAAP 623

Query: 787  SGLRSREAAKNN-SPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQLKEENLQLRD 611
               +S  + K   SPA  P                         S  LA+++EEN  L+ 
Sbjct: 624  RVKKSEASEKRRVSPASTP-----------PAFNDDEPSPLNPHSNALARIEEENQLLKL 672

Query: 610  RLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIFSEERQRRD 431
            RL++KEEEILGD+L DLQ EKE+ +SLE QL+ ATRKYEELSKEQE LI++FSEER+RRD
Sbjct: 673  RLQKKEEEILGDVLVDLQSEKEKCESLETQLELATRKYEELSKEQECLINLFSEERKRRD 732

Query: 430  VEEENLRKKLKDASNTIQQLLDKLRSLEKISS 335
             +E  LR K+K+AS+TI++LL K+R LE+ +S
Sbjct: 733  ADEALLRDKIKEASSTIEELLKKIRMLERKNS 764


>ref|XP_002280533.2| PREDICTED: uncharacterized protein LOC100266246 [Vitis vinifera]
          Length = 2234

 Score =  881 bits (2277), Expect = 0.0
 Identities = 451/709 (63%), Positives = 536/709 (75%), Gaps = 39/709 (5%)
 Frame = -1

Query: 2335 SNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVKSQKDGSK 2156
            S GG+DHVRVHPKFLHSNATSHKW LGAFAELLDNSLDE+CNGATYVNVDM++++KDG++
Sbjct: 1551 SVGGLDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEICNGATYVNVDMLENKKDGNR 1610

Query: 2155 MLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVIVFSRCSG 1976
            MLLIEDNGGGMDP+KMR CMSLGYSAKSK+ +TIGQYGNGFKTSTMRLGADVIVFSRC G
Sbjct: 1611 MLLIEDNGGGMDPEKMRQCMSLGYSAKSKIANTIGQYGNGFKTSTMRLGADVIVFSRCCG 1670

Query: 1975 KDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNRNVETIAQ 1796
            KDG+  TQSIGLLSYTFL+STGKEDIVVPM+DYE+ G++WNK++RSSA+DWN+NVETI Q
Sbjct: 1671 KDGKSPTQSIGLLSYTFLRSTGKEDIVVPMIDYEKGGREWNKMIRSSASDWNKNVETIMQ 1730

Query: 1795 WSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGVNRDEKSI 1616
            WSPFSSE DLLRQFN +++ GTRIIIYNLWEDD G LELDFDTDP DIQIRGVNRDEK+I
Sbjct: 1731 WSPFSSELDLLRQFNFIKEHGTRIIIYNLWEDDPGQLELDFDTDPKDIQIRGVNRDEKNI 1790

Query: 1615 DMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMMMSQEITY 1436
             MAK+FPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG+DVEHHNVVNDMMM+QE+TY
Sbjct: 1791 QMAKQFPNSRHFLTYRHSLRSYASILYLRLPPGFRIILRGKDVEHHNVVNDMMMTQEVTY 1850

Query: 1435 RPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPGSDGR 1256
            RPQP  DG+PKD NMVA+VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+W+  GSDGR
Sbjct: 1851 RPQPSADGVPKDLNMVAVVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWRLWNAAGSDGR 1910

Query: 1255 GVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPRRSK---N 1085
            GVIGVLEANFVEPAHDKQGFERT VL+RLETRL+Q+QKTYW++ CHKIGYAPRR+K   N
Sbjct: 1911 GVIGVLEANFVEPAHDKQGFERTIVLSRLETRLLQMQKTYWTTYCHKIGYAPRRNKKLIN 1970

Query: 1084 VHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRTLRKRPIYVDQ 905
               RE SPD  P++P   KK + +S   +T  ++              R L + P  V Q
Sbjct: 1971 ESARETSPDYLPQTPSQPKKKVGAS-SGKTPLSNLDKHASHSNHKQGGRELERTPETVYQ 2029

Query: 904  ------------------------------SSSSEEDVRDNGRQNHTPRKQTNG-----S 830
                                          SS S EDV D+      P ++ NG     S
Sbjct: 2030 SHGNGHASSKQEKRTHMPTRPRKEQSSLVPSSPSAEDVDDDDVPAVLPEREANGRVHKAS 2089

Query: 829  SSRKMFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYV 650
             +   FG+DG Q  +  +S+    N +   +                             
Sbjct: 2090 HANNSFGEDGHQISTRSQSKGDDVNGNSNSL----------------------------A 2121

Query: 649  LAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQES 470
            L QL+EEN +L++RLKRKE + +  L  DL+KE+E+ K LE +LQEA +K E+++KEQ+S
Sbjct: 2122 LEQLREENCELKERLKRKEGDTVVALRGDLEKEREKCKLLETELQEARQKIEDMNKEQDS 2181

Query: 469  LIDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISS-NGK 326
            LIDIFSEER RRD+EEENLRKKL++ASNTIQ+LL+++R LEK+ + NGK
Sbjct: 2182 LIDIFSEERDRRDIEEENLRKKLREASNTIQELLERVRVLEKMKTPNGK 2230


>ref|XP_004161374.1| PREDICTED: uncharacterized LOC101222073 [Cucumis sativus]
          Length = 794

 Score =  881 bits (2276), Expect = 0.0
 Identities = 470/745 (63%), Positives = 540/745 (72%), Gaps = 19/745 (2%)
 Frame = -1

Query: 2521 LNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAE--------ASAKQFWKAGDY 2366
            L  V P+GFL P    +    ++    +  PSA    V E        ++ KQFWKAGDY
Sbjct: 80   LEVVKPLGFLAPASLDE----KHSMAVILPPSAEAGTVQETGTSKANGSACKQFWKAGDY 135

Query: 2365 EGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVD 2186
            EGAP   W+S++GGMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEV +GAT+VN+D
Sbjct: 136  EGAPCSNWESTSGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVSSGATHVNID 195

Query: 2185 MVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGA 2006
            M+ ++KD +KMLLIEDNGGGM P+KMRHCMSLGYS K+KL DTIGQYGNGFKTSTMRLGA
Sbjct: 196  MLVNKKDRTKMLLIEDNGGGMSPEKMRHCMSLGYSEKTKLADTIGQYGNGFKTSTMRLGA 255

Query: 2005 DVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATD 1826
            DVIVFSRC G+ G+  TQSIGLLSYTFL+STGKEDIVVPMLDYER+G +W KIVRSS  D
Sbjct: 256  DVIVFSRCCGQYGKSGTQSIGLLSYTFLRSTGKEDIVVPMLDYERKGGEWVKIVRSSLND 315

Query: 1825 WNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQI 1646
            WN+NV+T+ QWSPF++EA+LLRQF  M+D GTRIIIYNLWEDDQG LELDFDTDPHDIQI
Sbjct: 316  WNKNVDTVVQWSPFANEAELLRQFYMMKDHGTRIIIYNLWEDDQGQLELDFDTDPHDIQI 375

Query: 1645 RGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVN 1466
            RGVNRDEKSI MAKKFPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG DVEHHN+VN
Sbjct: 376  RGVNRDEKSIQMAKKFPNSRHFLTYRHSLRSYASILYLRLPPCFRIILRGRDVEHHNIVN 435

Query: 1465 DMMMSQEITYRPQPGVDG---IPKDTN---MVAIVTVGFVKDAKAHIDVQGFNVYHKNRL 1304
            DMM+SQE+TYRPQPG DG   + KDTN   MVA+VT+GFVKDAK HIDVQGFNVYHKNRL
Sbjct: 436  DMMISQEVTYRPQPGADGAGTVGKDTNVILMVAVVTIGFVKDAKHHIDVQGFNVYHKNRL 495

Query: 1303 IKPFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSN 1124
            IKPFWR+W+  GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RLIQ+QKTYW S 
Sbjct: 496  IKPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWCSY 555

Query: 1123 CHKIGYAPRR----SKNVHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXX 956
            CHKIGYAPRR    +    +RE SPD +   PP + K          R            
Sbjct: 556  CHKIGYAPRRIDKPNSRTPDRESSPDDYSSQPPPQSK----------RRVLPLVGRNPIN 605

Query: 955  XXGDVRTLRKRPIYVDQSSSSEEDVRDNGRQNHTPRKQTNGSSSRKMFGKDGSQNPSGLR 776
               D    R RP   +  S S  +VR +         Q NG+ +    G D S       
Sbjct: 606  MTPDSEKSRTRPSSSEPPSPSGLEVRVDNHHGG----QANGTGNETFHGNDVSMR----- 656

Query: 775  SREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQLKEENLQLRDRLKRK 596
              +A+ N   ++  A+                        ++L QLKEEN +L++RLKRK
Sbjct: 657  -MKASSNGGVSQ--AQQGGLAKPKGGDTNDSERSPSSSDLHMLQQLKEENEELKERLKRK 713

Query: 595  EEEILGDLLHDLQKEKERS-KSLEAQLQEATRKYEELSKEQESLIDIFSEERQRRDVEEE 419
            E +        LQ E+ER  KSLE+QL  A  K EELSKEQESLIDIFSEER RR+ EE 
Sbjct: 714  EADH-----GKLQDERERRCKSLESQLTAAELKIEELSKEQESLIDIFSEERDRRETEEH 768

Query: 418  NLRKKLKDASNTIQQLLDKLRSLEK 344
            NLRKKLK+ASNTIQ+LLDK++ LEK
Sbjct: 769  NLRKKLKEASNTIQELLDKIQILEK 793


>gb|EMJ10266.1| hypothetical protein PRUPE_ppa025644mg [Prunus persica]
          Length = 790

 Score =  880 bits (2275), Expect = 0.0
 Identities = 460/747 (61%), Positives = 548/747 (73%), Gaps = 16/747 (2%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVN------AVAEASAKQFWKAGDY 2366
            +G+  VLPVGFL PLP      P      L  P A V        V+    KQFWKAGDY
Sbjct: 74   DGMGVVLPVGFLSPLP------PEGVAPMLPAPDAAVTRVENTGVVSRPRCKQFWKAGDY 127

Query: 2365 EGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVD 2186
            EGAP G W+S+ GGMDHVRVHPKFLHSNATSHKW LGAFAELLDN+LDEVC+GATYVN+D
Sbjct: 128  EGAPCGNWESTAGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNALDEVCSGATYVNLD 187

Query: 2185 MVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGA 2006
            MV+++KD S+MLLIEDNGGGMDPDKMRHCMSLGYSAKSK+ +TIGQYGNGFKTSTMRLGA
Sbjct: 188  MVENKKDQSRMLLIEDNGGGMDPDKMRHCMSLGYSAKSKVANTIGQYGNGFKTSTMRLGA 247

Query: 2005 DVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATD 1826
            DVIVFSRC+GKDG+ +TQSIGLLSYTFLKSTGKEDIVVPMLDYERR   W+K++RSS +D
Sbjct: 248  DVIVFSRCNGKDGKSATQSIGLLSYTFLKSTGKEDIVVPMLDYERRQGAWSKMLRSSLSD 307

Query: 1825 WNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQI 1646
            WN+NVETI QWSPFS+E DL  QF  M++ GTRIIIYNLWEDD+G LELDFD DPHDIQI
Sbjct: 308  WNKNVETIVQWSPFSNEEDLHHQFYMMKNHGTRIIIYNLWEDDEGQLELDFDADPHDIQI 367

Query: 1645 RGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVN 1466
            RGVNR EK+I MAK+FPNS+ FLTYRHSLRSYA+ILYLR+P NFRIILRG+DVEHHN+VN
Sbjct: 368  RGVNRAEKNIQMAKEFPNSRFFLTYRHSLRSYASILYLRLPHNFRIILRGKDVEHHNIVN 427

Query: 1465 DMMMSQEITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWR 1286
            DMMMSQ++TYRPQPG DGIPK++NMVA+VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR
Sbjct: 428  DMMMSQQVTYRPQPGADGIPKESNMVAVVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWR 487

Query: 1285 VWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGY 1106
            +W+  GSDGRGVIG+LEANFVEPAHDKQGFERTTVL+RLE +LI +QK YW++ CHKIGY
Sbjct: 488  LWNAAGSDGRGVIGLLEANFVEPAHDKQGFERTTVLSRLEGKLITMQKNYWNNYCHKIGY 547

Query: 1105 APRRSKNVHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRTLRK 926
            APRR+K +     S D           S   + + QT   S                   
Sbjct: 548  APRRNKKLING--SADKAYHGHATVHISDEGTHRVQTPTKSGE----------------- 588

Query: 925  RPIYVDQSSSSE----EDVRDNGRQNHTPRKQTNGSSSRKMFGKDG------SQNPSGLR 776
                   SSSSE         + ++N +  K   G +++K  GKDG      S     ++
Sbjct: 589  -----GSSSSSEPSPPSTAMSDKQENDSSHK---GFATKKDSGKDGQGVTRLSSCTEDIQ 640

Query: 775  SREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQLKEENLQLRDRLKRK 596
            S++         +P   ++                      V+ QL++EN +L++RL++K
Sbjct: 641  SQQDCWPCGGTNLPTSRSKSKGNNVNGDCSVLEGDLG----VVEQLRKENCELKERLEKK 696

Query: 595  EEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIFSEERQRRDVEEEN 416
            +     DLL DLQ E++R KSLE QLQ A +K  E++KEQ++LI IFSEER+RRD EE N
Sbjct: 697  DGAASADLLQDLQCERDRCKSLETQLQAAQQKIVEMNKEQDTLIGIFSEERERRDNEEAN 756

Query: 415  LRKKLKDASNTIQQLLDKLRSLEKISS 335
            LRKKL+DASNTI++LLDK+R+LE + S
Sbjct: 757  LRKKLQDASNTIEELLDKVRALEVMKS 783


>ref|XP_004138252.1| PREDICTED: uncharacterized protein LOC101222073 [Cucumis sativus]
          Length = 824

 Score =  877 bits (2266), Expect = 0.0
 Identities = 472/765 (61%), Positives = 545/765 (71%), Gaps = 39/765 (5%)
 Frame = -1

Query: 2521 LNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAE--------ASAKQFWKAGDY 2366
            L  V P+GFL P    +    ++    +  PSA    V E        ++ KQFWKAGDY
Sbjct: 80   LEVVKPLGFLAPASLDE----KHSMAVILPPSAEAGTVQETGTSKANGSACKQFWKAGDY 135

Query: 2365 EGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVD 2186
            EGAP   W+S++GGMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEV +GAT+VN+D
Sbjct: 136  EGAPCSNWESTSGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVSSGATHVNID 195

Query: 2185 MVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGA 2006
            M+ ++KD +KMLLIEDNGGGM P+KMRHCMSLGYS K+KL DTIGQYGNGFKTSTMRLGA
Sbjct: 196  MLVNKKDRTKMLLIEDNGGGMSPEKMRHCMSLGYSEKTKLADTIGQYGNGFKTSTMRLGA 255

Query: 2005 DVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATD 1826
            DVIVFSRC G+ G+  TQSIGLLSYTFL+STGKEDIVVPMLDYER+G +W KIVRSS  D
Sbjct: 256  DVIVFSRCCGQYGKSGTQSIGLLSYTFLRSTGKEDIVVPMLDYERKGGEWVKIVRSSLND 315

Query: 1825 WNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQI 1646
            WN+NV+T+ QWSPF++EA+LLRQF  M+D GTRIIIYNLWEDDQG LELDFDTDPHDIQI
Sbjct: 316  WNKNVDTVVQWSPFANEAELLRQFYMMKDHGTRIIIYNLWEDDQGQLELDFDTDPHDIQI 375

Query: 1645 RGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVN 1466
            RGVNRDEKSI MAKKFPNS+HFLTYRHSLRSYA+ILYLR+PP FRIILRG DVEHHN+VN
Sbjct: 376  RGVNRDEKSIQMAKKFPNSRHFLTYRHSLRSYASILYLRLPPCFRIILRGRDVEHHNIVN 435

Query: 1465 DMMMSQEITYRPQPGVDG---IPKDTN---MVAIVTVGFVKDAKAHIDVQGFNVYHKNRL 1304
            DMM+SQE+TYRPQPG DG   + KDTN   MVA+VT+GFVKDAK HIDVQGFNVYHKNRL
Sbjct: 436  DMMISQEVTYRPQPGADGAGTVGKDTNVILMVAVVTIGFVKDAKHHIDVQGFNVYHKNRL 495

Query: 1303 IKPFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSN 1124
            IKPFWR+W+  GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RLIQ+QKTYW S 
Sbjct: 496  IKPFWRLWNASGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKTYWCSY 555

Query: 1123 CHKIGYAPRR----SKNVHEREISPDSFPRSPPLKKKSIASS-----------DKTQTRF 989
            CHKIGYAPRR    +    +RE SPD +   PP + K  ++S            K   +F
Sbjct: 556  CHKIGYAPRRIDKPNSRTPDRESSPDDYSSQPPPQSKKKSTSFGGTKPDKIYLGKETEKF 615

Query: 988  AS---------XXXXXXXXXXXGDVRTLRKRPIYVDQSSSSEEDVRDNGRQNHTPRKQTN 836
                                   D    R RP   +  S S  +VR +         Q N
Sbjct: 616  QKTKDFRYGNMHSSKEKNGSMTPDSEKSRTRPSSSEPPSPSGLEVRVDNHHG----GQAN 671

Query: 835  GSSSRKMFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXS 656
            G+ +    G D S         +A+ N   ++  A+                        
Sbjct: 672  GTGNETFHGNDVSMR------MKASSNGGVSQ--AQQGGLAKPKGGDTNDSERSPSSSDL 723

Query: 655  YVLAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKE-RSKSLEAQLQEATRKYEELSKE 479
            ++L QLKEEN +L++RLKRKE +        LQ E+E R KSLE+QL  A  K EELSKE
Sbjct: 724  HMLQQLKEENEELKERLKRKEAD-----HGKLQDERERRCKSLESQLTAAELKIEELSKE 778

Query: 478  QESLIDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEK 344
            QESLIDIFSEER RR+ EE NLRKKLK+ASNTIQ+LLDK++ LEK
Sbjct: 779  QESLIDIFSEERDRRETEEHNLRKKLKEASNTIQELLDKIQILEK 823


>ref|XP_006436890.1| hypothetical protein CICLE_v10030712mg [Citrus clementina]
            gi|557539086|gb|ESR50130.1| hypothetical protein
            CICLE_v10030712mg [Citrus clementina]
          Length = 826

 Score =  875 bits (2262), Expect = 0.0
 Identities = 469/776 (60%), Positives = 550/776 (70%), Gaps = 45/776 (5%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAEASAKQFWKAGDYEGAPGG 2348
            + L  VLPVGFL+PLP         ERL     + +  +V   S KQFWKAGDYEGAP G
Sbjct: 72   QDLEVVLPVGFLEPLPAP-------ERLPGAAGNDKAVSVGLQSCKQFWKAGDYEGAPSG 124

Query: 2347 VWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVKSQK 2168
             W+ S GGMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEVCNGATY N+DM+ ++K
Sbjct: 125  GWEFSTGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVCNGATYSNIDMLINRK 184

Query: 2167 DGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVIVFS 1988
            DGS+MLLIEDNGGGM+PDKMRHCMSLGYSAKSK  +TIGQYGNGFKTSTMRLGADVIVFS
Sbjct: 185  DGSRMLLIEDNGGGMNPDKMRHCMSLGYSAKSKAANTIGQYGNGFKTSTMRLGADVIVFS 244

Query: 1987 RCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNRNVE 1808
             C GKDG+  T+SIGLLSYTFL+S GKEDIVVPMLDYE   Q+W KI+RSS  DWNRNVE
Sbjct: 245  CCCGKDGKSPTRSIGLLSYTFLRSAGKEDIVVPMLDYEGSQQEWKKIIRSSLDDWNRNVE 304

Query: 1807 TIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGVNRD 1628
            TI QWSPFSSEADLL QFN M+D GTRIIIYNLWEDDQGLLELDFD+D HDIQ+RGVNRD
Sbjct: 305  TIVQWSPFSSEADLLHQFNLMKDHGTRIIIYNLWEDDQGLLELDFDSDKHDIQLRGVNRD 364

Query: 1627 EKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMMMSQ 1448
            E++I MA+ +PNS+HFLTYRHSLRSYA+ILYLR+PP FRII+RG+DVEHHN+VNDMM+S+
Sbjct: 365  EQNIKMAQHYPNSRHFLTYRHSLRSYASILYLRLPPGFRIIIRGKDVEHHNIVNDMMLSK 424

Query: 1447 EITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPG 1268
            ++TYRPQPG  GIP D +M   VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+W+  G
Sbjct: 425  KVTYRPQPGASGIPTDLHMAVDVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWRLWNASG 484

Query: 1267 SDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPRR-- 1094
            SDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RLIQ+QK YW++NCH+IGYAPRR  
Sbjct: 485  SDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLIQMQKDYWNNNCHEIGYAPRRYK 544

Query: 1093 --SKNVHEREISPDSFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRTLRKRP 920
               K+ ++REIS     +S P + K   SS   + +  S            D + L +  
Sbjct: 545  KYIKDSYDREISS---KKSYPSRHKITDSSHSDKHQLHS-----NQRWEGKDSKRLPEAS 596

Query: 919  IYVDQS---------------------------SSSEEDVRDNGRQNHTPRKQTNGSSSR 821
             Y D+                            S S ED  D+        +  NGSS +
Sbjct: 597  NYGDRKGHESSKGKYKMKTPVKYREGASVSEPLSPSAEDASDDDMHVMVTARGANGSSQK 656

Query: 820  -----KMFGKDG--SQNPSGL-----RSREAAKNNSPAE--MPARTTRXXXXXXXXXXXX 683
                 K  GKDG    +PS         ++ A   S     MP+++              
Sbjct: 657  ILAAEKSLGKDGLHRTHPSACLVDSESQQDGASGGSSVRPFMPSQSKGSEVNYPEHFLSD 716

Query: 682  XXXXXXXXSYVLAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATR 503
                       L QLK+EN +L+ RL++KE E        LQ+E+ER +SLEAQL+   +
Sbjct: 717  CSLGAN-----LGQLKQENHELKKRLEKKEGE--------LQEERERCRSLEAQLKVMQQ 763

Query: 502  KYEELSKEQESLIDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISS 335
              EEL+KEQESLIDIF+EER RR+ EEENLRKK+KDAS+TIQ LLDK++ LEK+ +
Sbjct: 764  TIEELNKEQESLIDIFAEERDRREREEENLRKKIKDASDTIQDLLDKIKLLEKMKT 819


>gb|EOY22565.1| MORC family CW-type zinc finger protein 4, putative isoform 3
            [Theobroma cacao]
          Length = 828

 Score =  874 bits (2257), Expect = 0.0
 Identities = 462/760 (60%), Positives = 547/760 (71%), Gaps = 33/760 (4%)
 Frame = -1

Query: 2512 VLPVGFLDPLPTKQTPSP---RNERLCLEFP----------SARVNAVAEASAKQFWKAG 2372
            VLP+GFL PLP    P+P    ++   +E P          S  +++ +    KQFWKAG
Sbjct: 82   VLPLGFLAPLPPDD-PAPVPLASDMAVVEVPETEGPPEPAASKSLSSSSSVLCKQFWKAG 140

Query: 2371 DYEGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVN 2192
            DY+G P   WD S+GGMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEVC+GATYVN
Sbjct: 141  DYDGTPPADWDLSSGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVCSGATYVN 200

Query: 2191 VDMVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRL 2012
            +DM+KS+KDG+ MLLIEDNGGGMDPDKMR CMSLGYSAKSK+ +TIGQYGNGFKTSTMRL
Sbjct: 201  IDMLKSKKDGNNMLLIEDNGGGMDPDKMRQCMSLGYSAKSKVANTIGQYGNGFKTSTMRL 260

Query: 2011 GADVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSA 1832
            GADVIVFSRC GKDG+  TQSIGLLSYTFL STGKEDIVVPMLDYE + ++W KI+RS+ 
Sbjct: 261  GADVIVFSRCCGKDGKHPTQSIGLLSYTFLTSTGKEDIVVPMLDYEWQQREWKKIIRSTV 320

Query: 1831 TDWNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDI 1652
            +DW+RNVET+ QWSPFSS  DLLRQFN M+D GTRIIIYNLWEDDQGL ELDF  DPHDI
Sbjct: 321  SDWDRNVETVVQWSPFSSATDLLRQFNLMKDHGTRIIIYNLWEDDQGLSELDFHADPHDI 380

Query: 1651 QIRGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNV 1472
            Q+RGVNRDEK+I MAK+ PNS+HFLTYRHSLRSYA+ILYLR+ PNFRIILRG+DVEHHN+
Sbjct: 381  QLRGVNRDEKNIQMAKECPNSRHFLTYRHSLRSYASILYLRLHPNFRIILRGKDVEHHNI 440

Query: 1471 VNDMMMSQEITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPF 1292
            VNDMM+++ +TYRP P  +G PKD N+ A+VT+GFVKDAK H+DVQGFNVYHKNRLIKPF
Sbjct: 441  VNDMMLTEMVTYRPNPSAEGAPKDLNLAAVVTIGFVKDAKHHVDVQGFNVYHKNRLIKPF 500

Query: 1291 WRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKI 1112
            WRVW+  GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RL+Q+QKTYWS+NCHKI
Sbjct: 501  WRVWNAAGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLVQMQKTYWSTNCHKI 560

Query: 1111 GYAPRRSKNVHEREISPDSFP-----RSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXG 947
            GYAPRR+K   ++ +  DS P     R     KKS  SS K   R +S            
Sbjct: 561  GYAPRRNKKNIDQSLGRDSSPDHDSQRPTRSNKKSTTSSSK---RLSS------DSDKLC 611

Query: 946  DVRTLRKRPIYVDQSSSSEEDVRDNGR--QNHTPRKQTNGSSSRKMFGKDG----SQNPS 785
                  KR     +      +  D G   +    RK+T   +S K   K G    S  PS
Sbjct: 612  SPSNWNKR----GKECQKFPETEDGGHVLRKGDKRKKTPIDNSTKDLTKSGKSLRSIEPS 667

Query: 784  GLRSREAAKNNSPAEMPAR---------TTRXXXXXXXXXXXXXXXXXXXXSYVLAQLKE 632
               S E   ++    +P R          TR                     + L  LK+
Sbjct: 668  S-PSTENVSDDVCEVLPERLANGSSQKFVTRTKSKQECGLNDTELPHSETNLHALELLKQ 726

Query: 631  ENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIFS 452
            EN +L+ RL++ E +   +LL+DLQ+E+   +SLE +L+ A  K E+L+ EQESLI+IFS
Sbjct: 727  ENCELKKRLEKYEGKRQCELLNDLQQERNCRESLEIELKGAQEKIEQLNFEQESLINIFS 786

Query: 451  EERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISSN 332
            EER RRD EEENLRKKLKDASNTIQ+L+DK++ LE + S+
Sbjct: 787  EERDRRDKEEENLRKKLKDASNTIQELVDKVKLLEMMKSS 826


>gb|EOY22563.1| MORC family CW-type zinc finger protein 4, putative isoform 1
            [Theobroma cacao]
          Length = 827

 Score =  874 bits (2257), Expect = 0.0
 Identities = 460/777 (59%), Positives = 552/777 (71%), Gaps = 50/777 (6%)
 Frame = -1

Query: 2512 VLPVGFLDPLPTKQTPSP---RNERLCLEFP----------SARVNAVAEASAKQFWKAG 2372
            VLP+GFL PLP    P+P    ++   +E P          S  +++ +    KQFWKAG
Sbjct: 82   VLPLGFLAPLPPDD-PAPVPLASDMAVVEVPETEGPPEPAASKSLSSSSSVLCKQFWKAG 140

Query: 2371 DYEGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVN 2192
            DY+G P   WD S+GGMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEVC+GATYVN
Sbjct: 141  DYDGTPPADWDLSSGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVCSGATYVN 200

Query: 2191 VDMVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRL 2012
            +DM+KS+KDG+ MLLIEDNGGGMDPDKMR CMSLGYSAKSK+ +TIGQYGNGFKTSTMRL
Sbjct: 201  IDMLKSKKDGNNMLLIEDNGGGMDPDKMRQCMSLGYSAKSKVANTIGQYGNGFKTSTMRL 260

Query: 2011 GADVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSA 1832
            GADVIVFSRC GKDG+  TQSIGLLSYTFL STGKEDIVVPMLDYE + ++W KI+RS+ 
Sbjct: 261  GADVIVFSRCCGKDGKHPTQSIGLLSYTFLTSTGKEDIVVPMLDYEWQQREWKKIIRSTV 320

Query: 1831 TDWNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDI 1652
            +DW+RNVET+ QWSPFSS  DLLRQFN M+D GTRIIIYNLWEDDQGL ELDF  DPHDI
Sbjct: 321  SDWDRNVETVVQWSPFSSATDLLRQFNLMKDHGTRIIIYNLWEDDQGLSELDFHADPHDI 380

Query: 1651 QIRGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNV 1472
            Q+RGVNRDEK+I MAK+ PNS+HFLTYRHSLRSYA+ILYLR+ PNFRIILRG+DVEHHN+
Sbjct: 381  QLRGVNRDEKNIQMAKECPNSRHFLTYRHSLRSYASILYLRLHPNFRIILRGKDVEHHNI 440

Query: 1471 VNDMMMSQEITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPF 1292
            VNDMM+++ +TYRP P  +G PKD N+ A+VT+GFVKDAK H+DVQGFNVYHKNRLIKPF
Sbjct: 441  VNDMMLTEMVTYRPNPSAEGAPKDLNLAAVVTIGFVKDAKHHVDVQGFNVYHKNRLIKPF 500

Query: 1291 WRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKI 1112
            WRVW+  GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RL+Q+QKTYWS+NCHKI
Sbjct: 501  WRVWNAAGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLVQMQKTYWSTNCHKI 560

Query: 1111 GYAPRRSKNVHEREISPDSFP-----RSPPLKKKSIASSDKTQT---------------- 995
            GYAPRR+K   ++ +  DS P     R     KKS  SS K  +                
Sbjct: 561  GYAPRRNKKNIDQSLGRDSSPDHDSQRPTRSNKKSTTSSSKRLSSDSDKLCSPSNWNKRG 620

Query: 994  ----------------RFASXXXXXXXXXXXGDVRTLRKRPIYVDQSSSSEEDVRDNGRQ 863
                            R               D+    K    ++ SS S E+V D+  +
Sbjct: 621  KECQKFPETEDGGHVLRKGDKRKKTPIDNSTKDLTKSGKSLRSIEPSSPSTENVSDDVCE 680

Query: 862  NHTPRKQTNGSSSRKMFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXX 683
               P +  NGSS + +         +  +S+E   N++  E+P   T             
Sbjct: 681  -VLPERLANGSSQKFV---------TRTKSKECGLNDT--ELPHSETN------------ 716

Query: 682  XXXXXXXXSYVLAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATR 503
                     + L  LK+EN +L+ RL++ E +   +LL+DLQ+E+   +SLE +L+ A  
Sbjct: 717  --------LHALELLKQENCELKKRLEKYEGKRQCELLNDLQQERNCRESLEIELKGAQE 768

Query: 502  KYEELSKEQESLIDIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEKISSN 332
            K E+L+ EQESLI+IFSEER RRD EEENLRKKLKDASNTIQ+L+DK++ LE + S+
Sbjct: 769  KIEQLNFEQESLINIFSEERDRRDKEEENLRKKLKDASNTIQELVDKVKLLEMMKSS 825


>ref|XP_002321726.2| hypothetical protein POPTR_0015s11300g [Populus trichocarpa]
            gi|550322482|gb|EEF05853.2| hypothetical protein
            POPTR_0015s11300g [Populus trichocarpa]
          Length = 826

 Score =  861 bits (2225), Expect = 0.0
 Identities = 461/751 (61%), Positives = 542/751 (72%), Gaps = 20/751 (2%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPLPTKQTPSPRNER---LCLEFPSARVNAVAEASAKQFWKAGDYEGA 2357
            + L  VLP+GFL PLP   +  P  E    +     S  VN + ++S KQFWKAGDYEGA
Sbjct: 84   DDLGVVLPLGFLAPLPQPPSSEPPTEAEMAVVESTESTMVNLIGQSS-KQFWKAGDYEGA 142

Query: 2356 PGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVK 2177
            P   WD S+GGMD VRVHPKFLHSNATSHKW LGAFAEL+DN+LDE  NGAT+VN+DMV+
Sbjct: 143  PHANWDLSSGGMDRVRVHPKFLHSNATSHKWALGAFAELMDNALDEFGNGATFVNIDMVE 202

Query: 2176 SQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVI 1997
            S+KD S+MLLIEDNGGGMDPDKMR CMSLGYSAKSK+ +TIGQYGNGFKTSTMRLGADVI
Sbjct: 203  SKKDRSRMLLIEDNGGGMDPDKMRQCMSLGYSAKSKVANTIGQYGNGFKTSTMRLGADVI 262

Query: 1996 VFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNR 1817
            VFSRC GKDG+  TQSIGLLSYTFL+STGKEDIVVPMLD++R+G++W++++R SA+DWNR
Sbjct: 263  VFSRCPGKDGKSPTQSIGLLSYTFLRSTGKEDIVVPMLDFQRKGREWSRMIRYSASDWNR 322

Query: 1816 NVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGV 1637
            N    A+ + F         FN M D GTRIIIYNLWEDDQGLLELDFD+DPHDIQ+RGV
Sbjct: 323  NFP--ARQTFF---------FNLMSDHGTRIIIYNLWEDDQGLLELDFDSDPHDIQLRGV 371

Query: 1636 NRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMM 1457
            NRDEK I MAK+FPNS+HFLTYRHSLR+YA+ILYLR+P +FRIILRG+DVEHHN+VNDMM
Sbjct: 372  NRDEKHIKMAKEFPNSRHFLTYRHSLRNYASILYLRLPSSFRIILRGKDVEHHNIVNDMM 431

Query: 1456 MSQEITYRPQPGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWH 1277
            +SQE+TYRPQPG DG+PKDTNM A+VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+W+
Sbjct: 432  LSQEVTYRPQPGADGVPKDTNMTAVVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWRLWN 491

Query: 1276 PPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPR 1097
              GSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLE RL+Q+QK YWS+ CHKIGYAPR
Sbjct: 492  AAGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLEARLVQMQKQYWSTYCHKIGYAPR 551

Query: 1096 RSKNV----HEREISPD-SFPRSPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRT- 935
            R+K +     + E SP    P S   +KK  +   K     +             +VRT 
Sbjct: 552  RNKKLINEFSDIETSPGYQPPTSSHSRKKYTSLGSKISPSHSDHGYGNGHASNKVNVRTN 611

Query: 934  ----LRKRPIYVDQSSSSEEDVRDNGRQNHTPRKQTNGSSSR-----KMFGKDGSQNPSG 782
                  K  I    S  +++   ++      P ++ NGS+ R     K F KDG      
Sbjct: 612  TPTKFGKSTISPGPSPPAQDVSSEDDDCVAIPVRKANGSTQRTTPTNKSFEKDGLHATQS 671

Query: 781  LRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLA--QLKEENLQLRDR 608
                E + +         T                      S VLA   LK+EN +L++R
Sbjct: 672  SSCMEDSGSQHDCMSGGGTVHVTRSQTKVGDVDKMDCSFSESDVLALVHLKQENRELKER 731

Query: 607  LKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIFSEERQRRDV 428
            L++ E E  G+  +  Q EK   KSLE QL+EA RK EEL+KEQESLIDIFSEER RRD 
Sbjct: 732  LEKLEGETRGEYRNGSQCEK--CKSLEIQLKEAQRKLEELNKEQESLIDIFSEERVRRDQ 789

Query: 427  EEENLRKKLKDASNTIQQLLDKLRSLEKISS 335
            EEENLRKKLKDASNTIQ+LLDK+R LEK+ S
Sbjct: 790  EEENLRKKLKDASNTIQELLDKVRLLEKMKS 820


>ref|XP_002318152.1| hypothetical protein POPTR_0012s10460g [Populus trichocarpa]
            gi|222858825|gb|EEE96372.1| hypothetical protein
            POPTR_0012s10460g [Populus trichocarpa]
          Length = 862

 Score =  850 bits (2197), Expect = 0.0
 Identities = 451/739 (61%), Positives = 534/739 (72%), Gaps = 29/739 (3%)
 Frame = -1

Query: 2527 EGLNAVLPVGFLDPL---PTKQTPSPRNERLCLEFPSARVNAVAEASAKQFWKAGDYEGA 2357
            E L  VLP+GFL P+   P  +TPS   E + +E   +R  ++   S+KQFWKAGDYEGA
Sbjct: 82   EDLGVVLPLGFLAPITPPPDSETPSEA-EMMAVESTESRRVSLTGQSSKQFWKAGDYEGA 140

Query: 2356 PGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVK 2177
            P   WDSS GGMDHVRVHPKFLHSNATSHKW LGAFAELLDN+LDE  NGA +VN+DMV+
Sbjct: 141  PRANWDSSFGGMDHVRVHPKFLHSNATSHKWALGAFAELLDNALDEFGNGARFVNIDMVE 200

Query: 2176 SQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVI 1997
            S+KD S+MLLIEDNGGGMDPDK+R CMSLGYSAKSK+ +TIGQYGNGFKTSTMRLGADVI
Sbjct: 201  SKKDQSRMLLIEDNGGGMDPDKLRQCMSLGYSAKSKVANTIGQYGNGFKTSTMRLGADVI 260

Query: 1996 VFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNR 1817
            VFSRC GKDG+  TQSIGLLSYTFL+STGKEDIVVPMLDYER+G++W+++ RSS  DWNR
Sbjct: 261  VFSRCQGKDGKFPTQSIGLLSYTFLRSTGKEDIVVPMLDYERKGREWSRMGRSSTGDWNR 320

Query: 1816 NVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGV 1637
            NVETI  WSPFSSEADLLRQF  M D GTRIIIYNLWEDDQG+LELDFD+DPHDIQ+RGV
Sbjct: 321  NVETIVHWSPFSSEADLLRQFKLMSDHGTRIIIYNLWEDDQGMLELDFDSDPHDIQLRGV 380

Query: 1636 NRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMM 1457
            NRDEK I MAK+FPNS+HFLTYRHSLR+Y +ILYLR+PP+FRIILRG+DVEHHN+VNDMM
Sbjct: 381  NRDEKHIQMAKEFPNSRHFLTYRHSLRNYTSILYLRLPPSFRIILRGKDVEHHNIVNDMM 440

Query: 1456 MSQEITYRPQPGVDGIPKDTN-MVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVW 1280
            +SQEITYRPQPG D +PKDTN M A+VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+W
Sbjct: 441  LSQEITYRPQPGADSVPKDTNQMTAVVTIGFVKDAKHHIDVQGFNVYHKNRLIKPFWRLW 500

Query: 1279 HPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAP 1100
            +  GSDGRGVIGVLEANF+EPAHDKQGFERTTVLARLE RL+Q+QK YW  +C  + Y  
Sbjct: 501  NAAGSDGRGVIGVLEANFIEPAHDKQGFERTTVLARLEARLVQMQKHYW--HCLLLLYTS 558

Query: 1099 RRSKNV-----------HEREISPDSF-PRSPPLKKKSIASSDKTQTRFASXXXXXXXXX 956
              S  +              E SPD   P S   KKK  + S K     ++         
Sbjct: 559  FESFLITPLLISHKVVFPNEENSPDDLPPTSSQSKKKYTSLSSKISPSHSNRGYVSGNAF 618

Query: 955  XXGDVRT-----LRKRPIYVDQSSSSEEDVRDNGRQNHTPRKQTNGSS-----SRKMFGK 806
              G++RT     L K  +    S  ++++  ++      P ++ NGS+     + K F K
Sbjct: 619  NKGNIRTKTPTNLGKNTVSSGPSPPAQDESSEDDEHVALPMREANGSAQETTPTNKSFDK 678

Query: 805  DG---SQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQLK 635
            +G   + + S L    + ++        +                        +VLA LK
Sbjct: 679  NGLPKTWSSSYLEDSGSQQDCMSGGATVQIGTRSQPKVGDVDKRDHALPESDMHVLAHLK 738

Query: 634  EENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLIDIF 455
            +EN +L++RL++ E E  G  +   Q EK   KSLE QLQEA +K EEL+KEQESLIDIF
Sbjct: 739  QENRELKERLQKLEGETQGKYMSGFQCEK--CKSLEIQLQEAQQKLEELNKEQESLIDIF 796

Query: 454  SEERQRRDVEEENLRKKLK 398
            SEER RRD EEE+LRKKLK
Sbjct: 797  SEERDRRDQEEESLRKKLK 815


>ref|XP_006584700.1| PREDICTED: uncharacterized protein LOC100816702 isoform X2 [Glycine
            max]
          Length = 758

 Score =  830 bits (2143), Expect = 0.0
 Identities = 433/699 (61%), Positives = 510/699 (72%), Gaps = 29/699 (4%)
 Frame = -1

Query: 2530 AEGLNAVLPVGFLDPLPTKQTPSPRNER-LCLEFP------SARVNAVAEAS---AKQFW 2381
            +E    VLPVGFL PLP    P P     L L  P      ++RVNA    S   +KQFW
Sbjct: 69   SEAGGVVLPVGFLTPLPPAPVPVPPPAAVLSLPAPEWASNSASRVNASKSFSLNSSKQFW 128

Query: 2380 KAGDYEGAPGGVWDSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGAT 2201
            KAGDY+GAP G   SS  GMDHVRVHPKFLHSNATSHKW LGAFAELLDNSLDEVCNGAT
Sbjct: 129  KAGDYDGAPLGGSGSSTVGMDHVRVHPKFLHSNATSHKWALGAFAELLDNSLDEVCNGAT 188

Query: 2200 YVNVDMVKSQKDGSKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTST 2021
            YVNVDM+ ++KDG++MLL+EDNGGGMDP+KMR CMSLGYS KSK+ +TIGQYGNGFKTST
Sbjct: 189  YVNVDMLINKKDGTRMLLVEDNGGGMDPEKMRQCMSLGYSMKSKMANTIGQYGNGFKTST 248

Query: 2020 MRLGADVIVFSRCSGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVR 1841
            MRLGADVIVFSR  GKDG+ STQSIGLLSYTFL+STGKEDIVVPMLDYERRGQ+WNKI+R
Sbjct: 249  MRLGADVIVFSRYPGKDGKSSTQSIGLLSYTFLRSTGKEDIVVPMLDYERRGQEWNKIIR 308

Query: 1840 SSATDWNRNVETIAQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDP 1661
            +S  DWN+NVETI QWSPFS+EADLL QFN ++D GTR+IIYNLWEDDQG LELDFD DP
Sbjct: 309  TSLDDWNKNVETIVQWSPFSNEADLLLQFNLVKDHGTRVIIYNLWEDDQGQLELDFDEDP 368

Query: 1660 HDIQIRGVNRDEKSIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEH 1481
            HDIQIRGVNRDEK+I M+K+FPNS+HFLTYRHSLRSY +ILYLR+P  FRIILRG+D+ H
Sbjct: 369  HDIQIRGVNRDEKNIQMSKEFPNSRHFLTYRHSLRSYTSILYLRLPSGFRIILRGKDILH 428

Query: 1480 HNVVNDMMMSQEITYRPQPGVDG-IPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRL 1304
            HN+VNDMMMSQE+TYRPQ GVDG +PKD+NMVA+VT+GFVKDA  H+DV GFNVYHKNRL
Sbjct: 429  HNIVNDMMMSQEVTYRPQAGVDGLLPKDSNMVAVVTIGFVKDAVHHVDVSGFNVYHKNRL 488

Query: 1303 IKPFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSN 1124
            IKPFWR+W+P GS GRGVIGVLEANFVEPAHDKQGFERT VL+RLE++LIQ+QK YWS+N
Sbjct: 489  IKPFWRIWNPAGSGGRGVIGVLEANFVEPAHDKQGFERTLVLSRLESKLIQMQKKYWSTN 548

Query: 1123 CHKIGYAPRRSK----NVHEREISPDSFPRSPPLKKKSIASSDKT-------------QT 995
            CHKIGYA  RSK    +  ++E SPD FP S   K+K     DK              Q 
Sbjct: 549  CHKIGYASNRSKIQIRDYADKEASPDYFPESSQSKRKYSTMDDKATPLTSDKLRSHSDQK 608

Query: 994  RFASXXXXXXXXXXXGDVRTLRKRPIYVDQSSSSEEDVRDNGRQNHTPRKQTNG-SSSRK 818
            R                  + R+R   + + SSS+++V +       P+K+T   S++ K
Sbjct: 609  RIQKQTDKYIAYKNGQSSVSPRRRMQSLSEQSSSDDEVSE-----VLPKKKTQKISTAEK 663

Query: 817  MFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLAQL 638
             F K+             +++ +     ++ TR                       L QL
Sbjct: 664  SFEKENG----------CSQDTTSRGKSSQYTRGSKLEGKSVNDGEQPPSDNDLLTLEQL 713

Query: 637  KEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQ 521
            K+EN +L++RL+RKEE+ILG +L DLQ EK+R KSLE Q
Sbjct: 714  KKENRELKERLQRKEEDILGQVLQDLQHEKDRCKSLETQ 752


>ref|XP_002864076.1| hypothetical protein ARALYDRAFT_495140 [Arabidopsis lyrata subsp.
            lyrata] gi|297309911|gb|EFH40335.1| hypothetical protein
            ARALYDRAFT_495140 [Arabidopsis lyrata subsp. lyrata]
          Length = 804

 Score =  826 bits (2134), Expect = 0.0
 Identities = 432/697 (61%), Positives = 511/697 (73%), Gaps = 16/697 (2%)
 Frame = -1

Query: 2440 LEFPSARVNAVAEASA-----KQFWKAGDYEGAPGGVWDSSNGGMDHVRVHPKFLHSNAT 2276
            L  P+   N VA  S+     KQFWKAGDYEG  GG W+ S GG DHVRVHPKFLHSNAT
Sbjct: 107  LALPATPCNVVAAPSSPWGSCKQFWKAGDYEGTSGGDWEVSAGGFDHVRVHPKFLHSNAT 166

Query: 2275 SHKWVLGAFAELLDNSLDEVCNGATYVNVDMVKSQKDGSKMLLIEDNGGGMDPDKMRHCM 2096
            SHKW LGAFAELLDN+LDEV  GAT+VNVDM++++KDGSKM++IED+GGGM+P+KMRHCM
Sbjct: 167  SHKWSLGAFAELLDNALDEVHTGATFVNVDMIENKKDGSKMVVIEDDGGGMNPEKMRHCM 226

Query: 2095 SLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVIVFSRCSGKDGRRSTQSIGLLSYTFLKS 1916
            SLGYSAKSKL DTIGQYGNGFKTSTMRLGADVIVFSRC GKDG+ STQSIGLLSYTFLKS
Sbjct: 227  SLGYSAKSKLADTIGQYGNGFKTSTMRLGADVIVFSRCLGKDGKSSTQSIGLLSYTFLKS 286

Query: 1915 TGKEDIVVPMLDYERRGQDWNKIVRSSATDWNRNVETIAQWSPFSSEADLLRQFNQMQDQ 1736
            TGKEDIVVPMLDYERR  +W  I RSS +DW +NVETI QWSPF +E DLLRQFN ++  
Sbjct: 287  TGKEDIVVPMLDYERRDSEWCPITRSSVSDWEKNVETIVQWSPFPTEEDLLRQFNLVKKH 346

Query: 1735 GTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGVNRDEKSIDMAKKFPNSKHFLTYRHSLR 1556
            GTRIIIYNLWEDDQG+LELDFDTDPHDIQ+RGVNRDEK+IDMA +FPNS+H+LTY+HSLR
Sbjct: 347  GTRIIIYNLWEDDQGMLELDFDTDPHDIQLRGVNRDEKNIDMASQFPNSRHYLTYKHSLR 406

Query: 1555 SYAAILYLRIPPNFRIILRGEDVEHHNVVNDMMMSQEITYRPQPGVDGIPKDTNMVAIVT 1376
            SYA+ILYL+IP  FRIILRG+DVEHHN+VNDMM +++ITYRP+ G DG  K +N+ A+VT
Sbjct: 407  SYASILYLKIPREFRIILRGKDVEHHNIVNDMMQTEKITYRPKEGADGCAKYSNLSAVVT 466

Query: 1375 VGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPGSDGRGVIGVLEANFVEPAHDKQGF 1196
            +GFVKDAK H+DVQGFNVYHKNRLIKPFWR+W+  GSDGRGVIGVLEANFVEPAHDKQGF
Sbjct: 467  IGFVKDAKHHVDVQGFNVYHKNRLIKPFWRIWNAAGSDGRGVIGVLEANFVEPAHDKQGF 526

Query: 1195 ERTTVLARLETRLIQVQKTYWSSNCHKIGYAPRRS----KNVHEREISPDSFP-RSPPLK 1031
            ERTTVL+RLE RL+Q+QK YW SNCHKIGYA R+     K+  +RE SP+  P RS   +
Sbjct: 527  ERTTVLSRLEARLLQMQKNYWRSNCHKIGYASRQGKKSVKDTEDRESSPEYDPKRSDSSR 586

Query: 1030 KKSIASSDKTQTRFASXXXXXXXXXXXGDVRTLRKRPIYVDQSSSSEEDVRDNGRQNHTP 851
            K++  SS KT T   +                    P    +  ++  +V   G+ +   
Sbjct: 587  KRNAPSSFKTPTAAPNY-----------------NTPTAASEKFNTRSNVIRGGKGSLKD 629

Query: 850  RKQTNGSSSRKMFGKDG-SQNPSGLR-----SREAAKNNSPAEMPARTTRXXXXXXXXXX 689
             K     SS K  GK G S + S  R     +R     NS  +  +   R          
Sbjct: 630  SKDIGYKSSGKGGGKLGNSFSKSDKRVKPQLARAVEVTNSDDDYDSSPERNVTELPEKIS 689

Query: 688  XXXXXXXXXXSYVLAQLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEA 509
                         L+QL++EN +LRDRL +KEE  L  L  DL++EKE  K+LEA++Q  
Sbjct: 690  ELPKPQSGSR--TLSQLEQENNELRDRLNKKEEVFL-LLQKDLRREKELRKTLEAEVQTL 746

Query: 508  TRKYEELSKEQESLIDIFSEERQRRDVEEENLRKKLK 398
              K EE+ KEQ SLID+F+E+R RRD EEENLR KL+
Sbjct: 747  KYKLEEMDKEQASLIDVFAEDRDRRDKEEENLRIKLE 783


>ref|XP_004298543.1| PREDICTED: uncharacterized protein LOC101292116 [Fragaria vesca
            subsp. vesca]
          Length = 822

 Score =  820 bits (2118), Expect = 0.0
 Identities = 432/760 (56%), Positives = 541/760 (71%), Gaps = 34/760 (4%)
 Frame = -1

Query: 2521 LNAVLPVGFLDPLPTKQTPSPRNERLCLEFPSARVNAVAEASAKQFWKAGDYEGAPGGVW 2342
            +  ++PV  LDP P++Q       R     P+A    V+    KQFWKAG++E  P G  
Sbjct: 64   MGVIMPV--LDPQPSEQDLP----RTLTAAPAAGKGMVSVNYCKQFWKAGEFEAVPCGDT 117

Query: 2341 DSSNGGMDHVRVHPKFLHSNATSHKWVLGAFAELLDNSLDEVCNGATYVNVDMVKSQKDG 2162
            +SS GGMDHVR+HPKFLHSNATSHKW LGAFAELLDN+LDEV  GATYV VDM+++QKDG
Sbjct: 118  ESSAGGMDHVRIHPKFLHSNATSHKWALGAFAELLDNALDEVSTGATYVIVDMLENQKDG 177

Query: 2161 SKMLLIEDNGGGMDPDKMRHCMSLGYSAKSKLVDTIGQYGNGFKTSTMRLGADVIVFSRC 1982
            S+MLLIEDNGGGM+PDK+R CMSLG+S KSK+ +TIGQYGNGFKTSTMRLGADVIVFSRC
Sbjct: 178  SRMLLIEDNGGGMNPDKIRDCMSLGFSTKSKIANTIGQYGNGFKTSTMRLGADVIVFSRC 237

Query: 1981 SGKDGRRSTQSIGLLSYTFLKSTGKEDIVVPMLDYERRGQDWNKIVRSSATDWNRNVETI 1802
             GKDG+ +TQSIGLLSYTFL+STGKED VVPMLDYE++G DW++I RSS +DWN+NVETI
Sbjct: 238  CGKDGKSATQSIGLLSYTFLRSTGKEDTVVPMLDYEKQGTDWHRIRRSSLSDWNKNVETI 297

Query: 1801 AQWSPFSSEADLLRQFNQMQDQGTRIIIYNLWEDDQGLLELDFDTDPHDIQIRGVNRDEK 1622
             QWSPFS+E DL  QF  +++QGTR+IIYNLWEDD+G LELDFD D +DIQIRGVNRDEK
Sbjct: 298  LQWSPFSTEEDLDHQFCMIKNQGTRVIIYNLWEDDEGQLELDFDVDRYDIQIRGVNRDEK 357

Query: 1621 SIDMAKKFPNSKHFLTYRHSLRSYAAILYLRIPPNFRIILRGEDVEHHNVVNDMMMSQEI 1442
            +I MAK++PNS+HFLTYRHSLRSYAAILYLR+P +FRIILRG+DVEHHN+VNDMMMSQ++
Sbjct: 358  NIQMAKQYPNSRHFLTYRHSLRSYAAILYLRLPIDFRIILRGKDVEHHNIVNDMMMSQKV 417

Query: 1441 TYRPQ-PGVDGIPKDTNMVAIVTVGFVKDAKAHIDVQGFNVYHKNRLIKPFWRVWHPPGS 1265
            TYRPQ    DGI KD NM A VT+GFVKDAK HIDVQGFNVYHKNRLIKPFWR+W+  GS
Sbjct: 418  TYRPQSTTTDGILKDANMAATVTIGFVKDAKYHIDVQGFNVYHKNRLIKPFWRLWNAAGS 477

Query: 1264 DGRGVIGVLEANFVEPAHDKQGFERTTVLARLETRLIQVQKTYWSSNCHKIGYAPRRSK- 1088
            DGRGVIG+LEANFVEPAHDKQGFERTT+L+RLE +L+Q+QKTYWS+NCHKIGYA RR+K 
Sbjct: 478  DGRGVIGLLEANFVEPAHDKQGFERTTILSRLEAKLVQMQKTYWSTNCHKIGYAARRNKK 537

Query: 1087 -NVHEREISPDSFPR-----SPPLKKKSIASSDKTQTRFASXXXXXXXXXXXGDVRTLRK 926
             N  ++EISPD         S   +  S++ + +++T                 V+T  K
Sbjct: 538  VNAEDKEISPDFLSERDGITSGGKRPHSLSDTFRSKTADNGHGNMHVSGKGKSTVQTQSK 597

Query: 925  RPIYVDQSSSSEEDVR---------------DNGRQNHTP-------RKQTNGS----SS 824
                 ++ SSS E+                 + GR++           KQ NGS    ++
Sbjct: 598  LTTRSEEESSSSEETLPLPPNVRGDNVHKQVNGGRESRDKDIHVAVCDKQANGSIHKPNT 657

Query: 823  RKMFGKDGSQNPSGLRSREAAKNNSPAEMPARTTRXXXXXXXXXXXXXXXXXXXXSYVLA 644
               +   G Q  + L S            P R T                       ++ 
Sbjct: 658  AMHYSGKGGQAVTELSSDMEETETRQECTPCRGTSVSATKSKSKENDFNLEDLS---IVQ 714

Query: 643  QLKEENLQLRDRLKRKEEEILGDLLHDLQKEKERSKSLEAQLQEATRKYEELSKEQESLI 464
            +L++ENL+L+++LK++       LL D   E+E++ +LE +LQEA ++ E L++EQ+SLI
Sbjct: 715  RLRKENLELKEKLKKQGTGAAAGLLKDFLLEREKNTTLETKLQEAEKQIEFLNREQDSLI 774

Query: 463  DIFSEERQRRDVEEENLRKKLKDASNTIQQLLDKLRSLEK 344
            +IF+EERQRRD  E  LR +L+DAS+TIQ+L++K+R LE+
Sbjct: 775  NIFAEERQRRDDVENKLRGELQDASDTIQELIEKIRELER 814