BLASTX nr result

ID: Akebia24_contig00006093 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00006093
         (2681 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   718   0.0  
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   711   0.0  
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   642   0.0  
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   634   e-179
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   602   e-169
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...   601   e-169
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   599   e-168
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   597   e-168
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   592   e-166
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     591   e-166
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   578   e-162
gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus...   578   e-162
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   575   e-161
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...   572   e-160
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...   555   e-155
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...   553   e-154
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...   523   e-145
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   523   e-145
ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subuni...   513   e-142
ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [A...   505   e-140

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  718 bits (1854), Expect = 0.0
 Identities = 379/649 (58%), Positives = 452/649 (69%), Gaps = 30/649 (4%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M   Q I+VKDAVHKLQ  LLEGI+NE+QLFAAGSLMSR DYED+VTER+I+NLCGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCSS CVV SR+FAGSLQ ERC + NSE+
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            IN +L+LFGE SL+  + LGK GDLG SELKI E V+ KAGEV +EDW+GPSNAIEGYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 959  QRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGPKQN 1138
            QRDR+ K   +K+ KEGSKS N++       V+DEMDF S+II   ++S+ K S G K  
Sbjct: 181  QRDRNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDT 240

Query: 1139 GSETKLKE-----SEGNQFSILETCSASTQNGFETKSKEPKGKGS--------------- 1258
             S  K KE     S G+Q S+LE  +   QN  E+K +E KG+ S               
Sbjct: 241  TSHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPS 300

Query: 1259 --------VN--ESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSN 1408
                    +N  + K                             RSVTWADE K+DS  +
Sbjct: 301  VPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE-KMDSADS 359

Query: 1409 GNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDA 1588
             + C+ +E+   ++        +V D D +LR                   SGE+D TDA
Sbjct: 360  RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDA 419

Query: 1589 VSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPE 1768
            VSEAGIIILP P D D+G+   D D++EPEPVPLKWP K G+ +S +FD++D+WYDTPPE
Sbjct: 420  VSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479

Query: 1769 GFNLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSE 1948
            GF+L+LS FATMWMALF WITSSS+AYIYGRDES HEE+  VNGREYP+KIVL+DGRSSE
Sbjct: 480  GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539

Query: 1949 IKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLL 2128
            IKQTLAGCL+RALPGLV DLRLP P+S+LEQG+  LL+TMSF+DALP FRMKQW VIVLL
Sbjct: 540  IKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599

Query: 2129 LIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
             IDALSVCRIP L PHMT+RRML  KV D+AQVS EEYEVMK+L+IPLG
Sbjct: 600  FIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLG 648


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  711 bits (1836), Expect = 0.0
 Identities = 376/649 (57%), Positives = 450/649 (69%), Gaps = 30/649 (4%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M   Q I+VKDAVHKLQ  LLEGI+NE+QLFAAGSLMSR DYED+VTER+I+NLCGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCSS CVV SR+FAGSLQ ERC + NSE+
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            IN +L+LFGE SL+  + LGK GDLG SELKI E V+ KAGEV +EDW+GPSNAIEGYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 959  QRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGPKQN 1138
            QRDR+ K   +K+RKEGSKS N++       V+DEMDF  +II   ++S+ K S G K  
Sbjct: 181  QRDRNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDT 240

Query: 1139 GSETKLKE-----SEGNQFSILETCSASTQNGFETKSKEPKGKGS--------------- 1258
             S  K KE     S G+Q S+LE  +   QN  E+K +E KG+ S               
Sbjct: 241  TSHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPS 300

Query: 1259 --------VN--ESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSN 1408
                    +N  + K                             RSVTWADE K+DS  +
Sbjct: 301  VPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADS 359

Query: 1409 GNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDA 1588
             + C+ +E+   ++        +V D D +LR                   SGE+D TDA
Sbjct: 360  RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDA 419

Query: 1589 VSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPE 1768
            VSEA IIILP P D D+G+   D D++EPEPVPLKWP K G+ +S +FD++D+WYDTPPE
Sbjct: 420  VSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479

Query: 1769 GFNLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSE 1948
            GF+L+LS FATMWMALF WITSSS+AYIYGRDES HEE+  VNGREYP+KIVL+DGRSSE
Sbjct: 480  GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539

Query: 1949 IKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLL 2128
            IKQTLAGCLARALPGLV DLRLP P+S+LEQG+  LL+TMSF+DALP FRMKQW VIVLL
Sbjct: 540  IKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599

Query: 2129 LIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
             IDALSVC+IP L PHM ++RML  KV D+AQVS EEYEVMK+L+IPLG
Sbjct: 600  FIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLG 648


>ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
            gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative
            isoform 1 [Theobroma cacao]
          Length = 739

 Score =  642 bits (1656), Expect = 0.0
 Identities = 355/691 (51%), Positives = 446/691 (64%), Gaps = 72/691 (10%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERC + N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N++L LFG+L LD+  DLGK GDLGFS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 959  QRDRSSKSSPLKH-----------------------------------------RKEGSK 1015
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 1016 SKNARPKKEAEK---VVDEMDFTSSIIVGGQFSVPKVSSGPKQNGSETKLKESE------ 1168
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1169 ---------GNQFSILETCSA-----STQN----GFETKS----KEPKGKGSVNESKRVV 1282
                     G+  ++ E  S+     ST+N    G +T S    KE     +V  S+ V+
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 410

Query: 1283 SVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGES 1462
                                      R VTWAD++K D+  NGNLCE +EM  ++   E 
Sbjct: 411  KSSLKSAGAKKLN-------------RFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI 457

Query: 1463 SSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQG 1642
            S  +     D  LR                   SG+SD TDAV E G+IILP   + D+ 
Sbjct: 458  SGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKE 517

Query: 1643 DFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFG 1822
            +   D DM+EPE  P+KWPKK G+ +S +F+ ED+W+D PPEGF+L+LS+FATMW ALF 
Sbjct: 518  EPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFE 577

Query: 1823 WITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVT 2002
            WITSSSLAYIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VT
Sbjct: 578  WITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVT 637

Query: 2003 DLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMT 2182
            DLRLP PIS+LEQGM  L++T+SFM+ALP FRMKQW VIVLL IDALSVCRIP L PHMT
Sbjct: 638  DLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMT 697

Query: 2183 NRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            N RMLLHKVLD AQ+S+EEYEVMK+L+IPLG
Sbjct: 698  NGRMLLHKVLDGAQISMEEYEVMKDLIIPLG 728


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  634 bits (1634), Expect = e-179
 Identities = 339/639 (53%), Positives = 416/639 (65%), Gaps = 20/639 (3%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K++S+SVKD V+KLQ SLLEGI+NEDQL AAGSLMSR DYED+V ERSISNLCGYPLC
Sbjct: 1    MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            NN LP + P KGRYR+SLKEH+VYDLQETYMYCSS C+V SR F+ SLQ +RC + N  K
Sbjct: 61   NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +NE+L+ F +L+LD  E LG+ GDLG S LKI+EK +   G+V LE+W+GPSNAIEGYVP
Sbjct: 121  LNEILRKFNDLTLDS-EGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179

Query: 959  QRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGPKQN 1138
            Q DR    S LK+ KEG K+   +P  + +    + DFTS+II   ++S+ K  SG    
Sbjct: 180  QGDRDPNPS-LKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTST 238

Query: 1139 GSETKLKESEGN-------QFSIL---ETCSASTQNGFETKSKE----------PKGKGS 1258
             S+ KL+   G        Q S L   ++  AS ++    K K           P     
Sbjct: 239  ASDIKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDLPSSSYY 298

Query: 1259 VNESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMC 1438
              E++ +                           RSVTWADER +D+  + NLCE QEM 
Sbjct: 299  TAEAEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADER-VDNAGSRNLCEVQEME 357

Query: 1439 EVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILP 1618
            +  +S E S  +N  D    LR                   SG++D   A+SEAGII+LP
Sbjct: 358  QTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAGIIVLP 417

Query: 1619 RPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFA 1798
               D  QG      DMIE E   LKWP K G+  S LFD ED+WYD PPEGF+L+LS FA
Sbjct: 418  PSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSLTLSPFA 477

Query: 1799 TMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLA 1978
            TMWMALF W+TSSSLAYIYGRDES+HE++  VNGREYPRKIVL DGRSSEI+ T   CLA
Sbjct: 478  TMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLTAESCLA 537

Query: 1979 RALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRI 2158
            R  PGLV +LRLP P+S+LEQG   LLETMSF+DALP FR KQW VI LL I+ALSVCRI
Sbjct: 538  RTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIALLFIEALSVCRI 597

Query: 2159 PGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            P L  +MT+RRM+LH+VLD A +S EEY++MK+ ++PLG
Sbjct: 598  PALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLG 636


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  602 bits (1551), Expect = e-169
 Identities = 332/690 (48%), Positives = 423/690 (61%), Gaps = 71/690 (10%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K QS  VKD ++KLQ SLL+GI+NEDQL AAGS+MS  DYED+VTER+I+NLCGYPLC
Sbjct: 1    MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             N LP + P+KGRYR+SLKEHKVYDL ETYMYCSS CV+ SRTF+GSLQ ERCL+ N  K
Sbjct: 61   GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +NEVL LF   SL  +  LGK GDLGFS LKIEEK +   GEV  E W+GPSNAIEGYVP
Sbjct: 121  LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180

Query: 959  QRDR-----------------------------------------SSKSSPLKHRKEGSK 1015
            QRDR                                           K+       +GSK
Sbjct: 181  QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240

Query: 1016 SKNARPKKEAEKVVDEMDFTSSIIVG-GQFSVPKVSSGPKQNGSETKL-KESEGNQFSIL 1189
            +K  +   + E  +++M+FTS+II+   ++S+ K  SG     S+TK+ K+ E       
Sbjct: 241  AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300

Query: 1190 ETCSASTQNGFETKS----KEPKGKGSVN------------------------ESKRVVS 1285
            E  S++T+    +K+    KE + K ++                         E+K    
Sbjct: 301  ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360

Query: 1286 VGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGESS 1465
                                     RSVTWADE K+ S+ + +LCE + M + +   E  
Sbjct: 361  SEKAAKPVESSLKPSLKTSGAKQLTRSVTWADE-KVGSSGSRDLCEVRGMEDTKAGPEIV 419

Query: 1466 SHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGD 1645
             + +  D     +                   SG++DA++A+SEAG++ILP+PHD DQGD
Sbjct: 420  DNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQGD 479

Query: 1646 FQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFGW 1825
               D D+++ E   +KWP K G+  S+ FD E++WYD PPEGF+L LSSFAT+WMALF W
Sbjct: 480  PMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFAW 539

Query: 1826 ITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTD 2005
            +TSSSLAY+YG+DESSHEE+ +VNGREYPRKIVL DGRS EI+QT+ GCL RA P +V D
Sbjct: 540  VTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVAD 599

Query: 2006 LRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTN 2185
            LRLP PIS+LEQG  +LL TMSF+DA+P FRMKQW VI LL I+ALSVCRIP LI +M N
Sbjct: 600  LRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISYMDN 659

Query: 2186 RRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            RRM    V+D  ++S EEYEVMK+L+IPLG
Sbjct: 660  RRM----VVDGVRMSAEEYEVMKDLMIPLG 685


>ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
            gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform
            3, partial [Theobroma cacao]
          Length = 703

 Score =  601 bits (1550), Expect = e-169
 Identities = 338/677 (49%), Positives = 425/677 (62%), Gaps = 72/677 (10%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERC + N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N++L LFG+L LD+  DLGK GDLGFS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 959  QRDRSSKSSPLKH-----------------------------------------RKEGSK 1015
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 1016 SKNARPKKEAEK---VVDEMDFTSSIIVGGQFSVPKVSSGPKQNGSETKLKESE------ 1168
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1169 ---------GNQFSILETCSA-----STQN----GFETKS----KEPKGKGSVNESKRVV 1282
                     G+  ++ E  S+     ST+N    G +T S    KE     +V  S+ V+
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 410

Query: 1283 SVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGES 1462
                                      R VTWAD++K D+  NGNLCE +EM  ++   E 
Sbjct: 411  KSSLKSAGAKKLN-------------RFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI 457

Query: 1463 SSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQG 1642
            S  +     D  LR                   SG+SD TDAV E            D+ 
Sbjct: 458  SGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCEV-----------DKE 506

Query: 1643 DFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFG 1822
            +   D DM+EPE  P+KWPKK G+ +S +F+ ED+W+D PPEGF+L+LS+FATMW ALF 
Sbjct: 507  EPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFE 566

Query: 1823 WITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVT 2002
            WITSSSLAYIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VT
Sbjct: 567  WITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVT 626

Query: 2003 DLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMT 2182
            DLRLP PIS+LEQGM  L++T+SFM+ALP FRMKQW VIVLL IDALSVCRIP L PHMT
Sbjct: 627  DLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMT 686

Query: 2183 NRRMLLHKVLDSAQVSV 2233
            N RMLLHKVLD AQ+S+
Sbjct: 687  NGRMLLHKVLDGAQISM 703


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  599 bits (1544), Expect = e-168
 Identities = 333/656 (50%), Positives = 431/656 (65%), Gaps = 37/656 (5%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K ++++VKDAVHKLQ  LLEGIK+E QL AAGSL+SR DY+D+VTERSI+N+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCS++CVV S  FAGSLQ ER    N  K
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N+VL LF  L L   +D+ + GD G S+LKI+EKVD+K GEV LE+W+GPSNAIEGYVP
Sbjct: 121  LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180

Query: 959  QRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGP--- 1129
            QRDRS   + LK+  +GSK+K+AR + E   +++E DF+S+II   ++SV K  +     
Sbjct: 181  QRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAD 240

Query: 1130 -----KQNGSETKLKESEGNQFSILETCSA-STQNGFETKSKE----------------- 1240
                 K+  ++T+ K  + + + + +   A   ++G ET+  +                 
Sbjct: 241  SNVKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVS 300

Query: 1241 --PKGKGSVNESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXX---RSVTWADERKID--- 1396
              P      N+S  ++S                            RSVTWADE  ID   
Sbjct: 301  SGPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRSVTWADE-SIDGGI 359

Query: 1397 ---STSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSG 1567
               + S+  + E+    E Q  G S+S +++E+ D S R                   SG
Sbjct: 360  GKKTESSSKISEY----ESQAYGGSAS-TDMEENDDSYRFESAEACAAALSQAAEAVASG 414

Query: 1568 ESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDT 1747
             SD  DAVS+AGI+ILP   + D+   Q  ++M++ E  PLKWP+K G+ N  +F++ED+
Sbjct: 415  -SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESEDS 473

Query: 1748 WYDTPPEGFNLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVL 1927
            WYD+PPEGFN++LS F TM+ +LF WI+SSSLA+IYG DES++EE+  +NGREYPRKIVL
Sbjct: 474  WYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRKIVL 533

Query: 1928 SDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQ 2107
            SDGRS+EIKQTLAGCLARALPGLV DLRLP PIS+LEQGM  LL TMSF+D LP FRMKQ
Sbjct: 534  SDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMKQ 593

Query: 2108 WHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            W +IVLL +DALSVCRIP L P+MT RR    KVLD AQ+S  EYE+MK+L+IPLG
Sbjct: 594  WQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLG 649


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  597 bits (1540), Expect = e-168
 Identities = 328/658 (49%), Positives = 424/658 (64%), Gaps = 39/658 (5%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            MEK Q ISVKDAV KLQ +LLEGI++EDQLFAAGSL+SR DYED+VTERSI+ +C YPLC
Sbjct: 1    MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             N LP E PRKGRYR+SLKEHKVYDL ETYM+CSS CVV S+ FAGSL+ +RCL  + +K
Sbjct: 61   CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N +L+LFG  +L+  E+ GK G+LG S L+I++K +    EV LE WVGPSNAIEGYVP
Sbjct: 121  LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVP 179

Query: 959  Q-RDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSG--- 1126
            + RD  SK S  K+ K+GSK+ + +       +  E DF S+II+  ++SV KVSSG   
Sbjct: 180  KKRDNGSKGSQ-KNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTD 238

Query: 1127 ---------------PKQNGSETKLKESE----GNQFSILETCSASTQNGFETKSKEPKG 1249
                           PK+   E   K+ +     + F+     SAS ++    KS +   
Sbjct: 239  ATVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVL 298

Query: 1250 KGSVN----------------ESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWAD 1381
            KG  N                + +  + +                        RSVTWAD
Sbjct: 299  KGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWAD 358

Query: 1382 ERKIDSTSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXX 1561
             +KID   + +LC F+E   ++K  + + + +V D +  LR                   
Sbjct: 359  -KKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVA 417

Query: 1562 SGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAE 1741
            SG+SDA DAVSEAGIIILP   +A +    +D D++E + V LKWP+K G+ +  LF ++
Sbjct: 418  SGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDFDLFASD 477

Query: 1742 DTWYDTPPEGFNLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKI 1921
            D+W+D PPEGF+L+LS FAT+W A F WITSSSLAYIYGRD S +EEF  V+GREYP KI
Sbjct: 478  DSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKI 537

Query: 1922 VLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRM 2101
            VLSDGRSSEIKQTLA CLARALP +V +L+LP P+S+LEQGM  LL+TMSF+D LP FR 
Sbjct: 538  VLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRF 597

Query: 2102 KQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            KQW V+ LL +DALSVCRIP LI +MT+RR L HKVL  +Q+ +EEY V+K+L++PLG
Sbjct: 598  KQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLG 655


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  592 bits (1526), Expect = e-166
 Identities = 338/661 (51%), Positives = 430/661 (65%), Gaps = 42/661 (6%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K ++++VKDAVHKLQ  LLEGIK+E+QL AAGSL+SR DY+D+VTERSI+N+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCS++CVV S  FAGSLQ ER    N  K
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAG-EVLLEDWVGPSNAIEGYV 955
            +N+VL LF  L L   ED+ + GDLG S+LKI+EKVDVK G EV LE+W+GPSNAIEGYV
Sbjct: 121  LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180

Query: 956  PQRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGPKQ 1135
            PQRDRS   + LK+  +G K+K+AR + E   +++E DF+S+II   ++SV K  + P  
Sbjct: 181  PQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPA-PVN 239

Query: 1136 NGSETKLKESEG--------NQFSIL--ETCSASTQNGFETKSKE--------------- 1240
              S  K KE++         +  SIL     +   ++G ET+  +               
Sbjct: 240  AVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGE 299

Query: 1241 ----PKGKGSVNESKRVVS-----VGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADE--- 1384
                P      N+S  ++S                              +SVTWADE   
Sbjct: 300  VSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWADEIID 359

Query: 1385 ----RKIDSTSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXX 1552
                +K +S+S   + E++     Q  G S+S +++E+ D S R                
Sbjct: 360  GGIGKKTESSSK--ISEYEN----QAYGGSAS-TDMEEDDDSYRFESAEACAAALSQAAE 412

Query: 1553 XXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLF 1732
               SG SD  DAVS+AGI+ILP   + D+   Q  E M++ EP PLKWP+K G+ N  +F
Sbjct: 413  AVASG-SDVPDAVSKAGIVILPTSQEVDEAILQETE-MLDIEPAPLKWPRKPGMPNYDVF 470

Query: 1733 DAEDTWYDTPPEGFNLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYP 1912
            ++ED WYD PPEGFN++LS FATM+ +LF WI+SSSLA+IYG DE+++EE+  +NGREYP
Sbjct: 471  ESEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYP 530

Query: 1913 RKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPP 2092
             KIVLSDG S+EIKQTLAGCLARALPGLV DLRLP PIS+LEQGM  LL TMSF+D LP 
Sbjct: 531  HKIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPA 590

Query: 2093 FRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPL 2272
            FRMKQW +IVLL +DALSVCRIP L P+MT RR  L KVLD AQ+S  EYE+MK+L+IPL
Sbjct: 591  FRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPL 650

Query: 2273 G 2275
            G
Sbjct: 651  G 651


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  591 bits (1524), Expect = e-166
 Identities = 331/677 (48%), Positives = 418/677 (61%), Gaps = 64/677 (9%)
 Frame = +2

Query: 437  ISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLPL 616
            ISVKD V++LQ SLL+G+  EDQLFAAGS+MSR DY D+VTERSI+NLCGYPLC NPLP 
Sbjct: 9    ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68

Query: 617  EHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEKINEVLK 796
            + PRKGRYR+SLKEHKVYDL ETYMYCSSDCV+ SRTFA SL+ ERC + +S +I+ VL+
Sbjct: 69   DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128

Query: 797  LFGELSLDEKE-DLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVPQRDRS 973
            +F + S  E+E   GK  DLGFS+LKIEEK +   G+V LE W GPSNAIEGYV QR+R 
Sbjct: 129  MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188

Query: 974  SKSSPLKHRKEGSKSKN-------------------------ARPKKEA--EKVVDEMDF 1072
             K    K  K GSK+ N                         +  KK     KV ++ + 
Sbjct: 189  PKELGSKSPKRGSKANNTVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSKVREQEEI 248

Query: 1073 TSSIIVGGQFSVPKVSSGPKQNGSETKL-------------------KESEGNQFSILET 1195
             +   +G +F+V + S  P  N S   L                    E E +     + 
Sbjct: 249  LAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRAGSCLSSARAEEESHDDKAEKC 308

Query: 1196 CSASTQNGFETKSK--------------EPKGKGSVNESKRVVSVGXXXXXXXXXXXXXX 1333
              AS ++  +   K              +  G   + E + +  +               
Sbjct: 309  TEASIKSSLKPSRKKKLSRTVTWADEKTDSSGGRKLCEIREIEDMKEDPSVVENKNGVSF 368

Query: 1334 XXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTL 1513
                     +SV WADE K DS+ + ++CE +E+ + +++ +   +++  + D + R   
Sbjct: 369  TSSGKMKAGQSVIWADE-KGDSSKSIDVCEVREIEDAKEAADMLCNADTGENDDTFRFAS 427

Query: 1514 XXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGD---FQNDEDMIEPEPV 1684
                            S E +  DA+SEAGIIILPRP + D+G+     +D++  EPE  
Sbjct: 428  AEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPMEEDDDDETSEPEQA 487

Query: 1685 PLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFGWITSSSLAYIYGRD 1864
            P+KWPKK G  +S LFD ED+W+D PPE F+L+LS FA MW ALF W TSS+LAYIYGRD
Sbjct: 488  PIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFTWTTSSTLAYIYGRD 547

Query: 1865 ESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQG 2044
            ES HEE+++VNGREYP KIV  DGRSSEIKQTLAG LARALPGLV DLRL TPISSLEQG
Sbjct: 548  ESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVADLRLSTPISSLEQG 607

Query: 2045 MRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQ 2224
            M  LL+TMSF+DALPPFRMKQW VI+LL ++ALSV R+P L PHM  RR+L HKVLDSAQ
Sbjct: 608  MGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPHMMYRRVLFHKVLDSAQ 667

Query: 2225 VSVEEYEVMKNLVIPLG 2275
            +S EEYEVMK+LVIPLG
Sbjct: 668  ISAEEYEVMKDLVIPLG 684


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  578 bits (1490), Expect = e-162
 Identities = 324/664 (48%), Positives = 418/664 (62%), Gaps = 40/664 (6%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K QS+ +KD V+KLQ +L EGIKNE+QLFAAGSLMSR DYED+VTERSI++LCGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            ++ LP ++ R+GRYR+SLKEHKVYDL+ETY YCSS C++ SR F+G LQ ERC + N +K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            + E+LKLF  +SLD KE++G   D G   L+I+EK++   GEV +E+W+GPSNAIEGYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCDSG---LEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 959  QRDR---SSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGP 1129
             RD    +  S   K  K+GSK+K  +P    +    +   TS+II   ++SV K+SSG 
Sbjct: 178  HRDHKVMTLHSKDGKESKDGSKAK-IKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGL 236

Query: 1130 KQNGSETKLKESEG--------NQFSILET--CSASTQNGFETKSKEPKGKGSVNESKRV 1279
            K+   +T  K   G        +QF+ILET    A  +N    K++  K +  V+ +K  
Sbjct: 237  KEMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKES 296

Query: 1280 VSV-----------------------GXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERK 1390
                                      G                       RSVTWADE K
Sbjct: 297  TDNLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADE-K 355

Query: 1391 IDSTSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGS----LRLTLXXXXXXXXXXXXXXX 1558
             D  S  NL E  EM + ++   ++S  N+ +FD      LR+                 
Sbjct: 356  TDDASIMNLPEVGEMGKTKECSRTTS--NLVNFDNDNEDILRVESAEACAMALSQAAEAI 413

Query: 1559 XSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDA 1738
             SG+S+ +DAVSEAGIIILP P DA++    +  +  EP     K   K GVL S LFD 
Sbjct: 414  TSGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDLFDP 472

Query: 1739 EDTWYDTPPEGFNLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRK 1918
             D+WYD PPEGF+L+LSSFATMWMA+F W+TSSSLAYIYG+D+  HEEF  ++G+EYP K
Sbjct: 473  SDSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSK 532

Query: 1919 IVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFR 2098
            IV +DGRSSEIKQTLAGCL RA+PGL ++L L TPIS LE GM  LL+TM+F+DALP FR
Sbjct: 533  IVSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFR 592

Query: 2099 MKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLGY 2278
            MKQW VIVLL I+ALSV RIP L  HM++ R L HKVLD AQ+  +EYE+M++ ++PLG 
Sbjct: 593  MKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGR 652

Query: 2279 LTSL 2290
               L
Sbjct: 653  TAQL 656


>gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus]
          Length = 597

 Score =  578 bits (1489), Expect = e-162
 Identities = 310/626 (49%), Positives = 420/626 (67%), Gaps = 7/626 (1%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M+  + + VKDAVHKLQ SLLEGIK+E QL AAGSL+S+ DY+D+VTER+I+++CGYPLC
Sbjct: 1    MKDGKILGVKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             N LP E PRKG YR+SLKEHKVYDL ET+MYCS++C+++SR F  SL+ ER    +  K
Sbjct: 61   VNSLPSEPPRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            IN VLK+F  LSLD    L K GDLG S LKI EK+   +GE+ LE+WVGPSNAI+GYVP
Sbjct: 121  INSVLKMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVP 180

Query: 959  QRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGPKQN 1138
            +RD++S+      +K  ++S +A+P   A+ +  +++FTS+II+  ++SV K +  P++ 
Sbjct: 181  RRDQNSERKQPSRKK--TESNHAKPNL-ADTLPFDVNFTSTIIMQDEYSVSKTAV-PREA 236

Query: 1139 GSETK----LKESEGNQFSILETCSASTQNGFETKSKEPKGKGSVNESKRVVSVGXXXXX 1306
              + K     K  +  + S+L+  +  +QN         K   S  E+            
Sbjct: 237  KGKVKGKMIRKSVKAEKISVLDDTAGPSQNDTTLLKSSLKTLDSKKET------------ 284

Query: 1307 XXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEV--QKSGESSSHSNV 1480
                              RSVTWADE+     S+G+     E  E+   K      H   
Sbjct: 285  ------------------RSVTWADEK-----SDGDGKSISECREIGDNKGAVVMPHLTD 321

Query: 1481 EDF-DGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQND 1657
            ED  D S R T                 SG++DA+DAVSEAG+IILP PH+ D+  ++  
Sbjct: 322  EDVGDESYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIILPPPHEVDEAKYEQI 381

Query: 1658 EDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFGWITSS 1837
             ++++ +P+ LKWP K G  +  LFD+ED+WYD+PPEGFNL+LS F+TM+M+LF WI+SS
Sbjct: 382  GEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFSTMFMSLFAWISSS 441

Query: 1838 SLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLP 2017
            SLAYIYG++E  HE++  +NGREYP KI++ DGRS+E+K TLAGCLARALPGLV+++R+P
Sbjct: 442  SLAYIYGKEERFHEDYLSINGREYPPKIII-DGRSAEVKHTLAGCLARALPGLVSEIRIP 500

Query: 2018 TPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRML 2197
            TP+S++EQGM  LL+TMSF DALP FRMKQW VI LL +DALSV RIP L P+MT RR+L
Sbjct: 501  TPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVSRIPALSPYMTGRRIL 560

Query: 2198 LHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            L KVL+ AQ++VEE+E+MK+L+IPLG
Sbjct: 561  LPKVLEGAQINVEEFEIMKDLIIPLG 586


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  575 bits (1482), Expect = e-161
 Identities = 317/645 (49%), Positives = 417/645 (64%), Gaps = 21/645 (3%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K QS+ +KD V+KLQ +L EGIKNE+QLFAAGSLMSR DYED+VTERSI++LCGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            ++ LP ++ R+GRYR+SLKEHKVYDL+ETY YCSS C++ SR F+G LQ ERC + N +K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            + E+LKLF  +SLD KE++G   D G   L+I+EK++   GEV +E+W+GPSNAIEGYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCDSG---LEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 959  QRDR---SSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGP 1129
             RD    +  S   K  K+GSK+K  +P    +    +  FTS+II   ++SV K+SSG 
Sbjct: 178  HRDHKVMTLHSKDGKESKDGSKAK-IKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGL 236

Query: 1130 KQNGSETKLKESEG--------NQFSILET--CSASTQNGFETKSKEPKGKGSVNESKRV 1279
            K+   +T  K   G        +QF+ILET    A  +N    K++  K +  V+ +K  
Sbjct: 237  KEMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKES 296

Query: 1280 VSVGXXXXXXXXXXXXXXXXXXXXXXXRSVTW---ADERKIDSTSNGNLCEFQEMCEVQK 1450
                                       RS  +    +E + + T + ++    E+ E+ K
Sbjct: 297  TD--------------NLSDAPSTSNNRSTNFNLMTEEPRDEKTDDASIMNLPEVGEMGK 342

Query: 1451 SGESS-SHSNVEDFDGS----LRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIIL 1615
            + E S + SN+ +FD      LR+                  SG+S+ +DAVSEAGIIIL
Sbjct: 343  TKECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSEVSDAVSEAGIIIL 402

Query: 1616 PRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSF 1795
            P P DA++    +  +  EP     K   K GVL S LFD  D+WYD PPEGF+L+LSSF
Sbjct: 403  PHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDLFDPSDSWYDAPPEGFSLTLSSF 461

Query: 1796 ATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCL 1975
            ATMWMA+F W+TSSSLAYIYG+D+  HEEF  ++G+EYP KIV +DGRSSEIKQTLAGCL
Sbjct: 462  ATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRSSEIKQTLAGCL 521

Query: 1976 ARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCR 2155
             RA+PGL ++L L TPIS LE GM  LL+TM+F+DALP FRMKQW VIVLL I+ALSV R
Sbjct: 522  TRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVSR 581

Query: 2156 IPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLGYLTSL 2290
            IP L  HM++ R L HKVLD AQ+  +EYE+M++ ++PLG    L
Sbjct: 582  IPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQL 626


>ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
            gi|462404075|gb|EMJ09632.1| hypothetical protein
            PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  572 bits (1474), Expect = e-160
 Identities = 334/703 (47%), Positives = 420/703 (59%), Gaps = 85/703 (12%)
 Frame = +2

Query: 422  EKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCN 601
            ++Q  ISVKD V+KLQ +LLEGIK +D L+ AGS++SR DY D+VTER+I+NLCGYPLC+
Sbjct: 8    QQQPRISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCS 67

Query: 602  NPLPLE--HPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSE 775
            N LP +   P KG YR+SLKEHKVYDL ETYMYCSS CV++S+ FA SL  ERC + +  
Sbjct: 68   NALPSDSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFG 127

Query: 776  KINEVLKLFGELSLDEKE-DLGKKGDLGFSELKIEEKVDVKAGEVLLEDW---------- 922
            K+  +L+ FG++  D+ E   G+ GDLG S+LKIEEKV+   G++ +             
Sbjct: 128  KVERILRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHI 187

Query: 923  -----VGPSNAIEGYVPQRDRSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSII 1087
                 VGPSNAIEGYVPQ++R SK    K  KEGSK K+A+     + + +EMDF S+II
Sbjct: 188  GDLGAVGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTII 247

Query: 1088 VGGQFSVPKVSSGPKQNGSETKLKESEGNQFSILETCSASTQNGFETKSKEPKG------ 1249
               ++SV K+     +   ETK K+S+G             +N    KS++ KG      
Sbjct: 248  TSDEYSVSKIPPSVGEPDFETKFKKSKGKV--------GLNKNDSVKKSRQSKGGKNKNV 299

Query: 1250 ---------------------KGSVNESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXXRS 1366
                                  GS  E K    V                        RS
Sbjct: 300  KKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNRS 359

Query: 1367 VTWADERKIDSTSNGNLCEFQEMCEVQKSGE--SSSHS---------------------- 1474
            VTWADE  IDST + NL E +EM ++ +  +  SS H                       
Sbjct: 360  VTWADEM-IDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTK 418

Query: 1475 --------NVEDFD--GSLRLT------LXXXXXXXXXXXXXXXXSGESDATDAVSEAGI 1606
                     V+D D  GSL L                        SGESD + AVS AGI
Sbjct: 419  SKNICEVREVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGI 478

Query: 1607 IILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSL 1786
            IILPRP   D+ +   D DM+E E  PL WP+K G+  S LFD ED+W+D PPEGF+++L
Sbjct: 479  IILPRPDGLDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVTL 537

Query: 1787 SSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLA 1966
            S FATMW +LF WITSS+LAYIYGRDES HEEF  VNGREYP KIVL+ GRSSEIK+TL 
Sbjct: 538  SPFATMWNSLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTLD 597

Query: 1967 GCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALS 2146
               ARALPG+V++LRLPTPISSLEQGM  +L TMSF+DA+P FRMKQW VIVLL ++ LS
Sbjct: 598  ESFARALPGVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGLS 657

Query: 2147 VCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            VCRIP L PHMTNRRML +KVL++ Q+S E+YE+MK+L+IPLG
Sbjct: 658  VCRIPALTPHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLG 700


>ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
            gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform
            5 [Theobroma cacao]
          Length = 708

 Score =  555 bits (1429), Expect = e-155
 Identities = 310/638 (48%), Positives = 398/638 (62%), Gaps = 72/638 (11%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERC + N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N++L LFG+L LD+  DLGK GDLGFS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 959  QRDRSSKSSPLKH-----------------------------------------RKEGSK 1015
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 1016 SKNARPKKEAEK---VVDEMDFTSSIIVGGQFSVPKVSSGPKQNGSETKLKESE------ 1168
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1169 ---------GNQFSILETCSA-----STQN----GFETKS----KEPKGKGSVNESKRVV 1282
                     G+  ++ E  S+     ST+N    G +T S    KE     +V  S+ V+
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 410

Query: 1283 SVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGES 1462
                                      R VTWAD++K D+  NGNLCE +EM  ++   E 
Sbjct: 411  KSSLKSAGAKKLN-------------RFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI 457

Query: 1463 SSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQG 1642
            S  +     D  LR                   SG+SD TDAV E G+IILP   + D+ 
Sbjct: 458  SGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKE 517

Query: 1643 DFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFG 1822
            +   D DM+EPE  P+KWPKK G+ +S +F+ ED+W+D PPEGF+L+LS+FATMW ALF 
Sbjct: 518  EPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFE 577

Query: 1823 WITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVT 2002
            WITSSSLAYIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VT
Sbjct: 578  WITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVT 637

Query: 2003 DLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHV 2116
            DLRLP PIS+LEQGM  L++T+SFM+ALP FRMKQW +
Sbjct: 638  DLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQWEI 675


>ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
            gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform
            2 [Theobroma cacao]
          Length = 679

 Score =  553 bits (1426), Expect = e-154
 Identities = 310/636 (48%), Positives = 397/636 (62%), Gaps = 72/636 (11%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERC + N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N++L LFG+L LD+  DLGK GDLGFS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 959  QRDRSSKSSPLKH-----------------------------------------RKEGSK 1015
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 1016 SKNARPKKEAEK---VVDEMDFTSSIIVGGQFSVPKVSSGPKQNGSETKLKESE------ 1168
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1169 ---------GNQFSILETCSA-----STQN----GFETKS----KEPKGKGSVNESKRVV 1282
                     G+  ++ E  S+     ST+N    G +T S    KE     +V  S+ V+
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 410

Query: 1283 SVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGES 1462
                                      R VTWAD++K D+  NGNLCE +EM  ++   E 
Sbjct: 411  KSSLKSAGAKKLN-------------RFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI 457

Query: 1463 SSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQG 1642
            S  +     D  LR                   SG+SD TDAV E G+IILP   + D+ 
Sbjct: 458  SGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKE 517

Query: 1643 DFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFG 1822
            +   D DM+EPE  P+KWPKK G+ +S +F+ ED+W+D PPEGF+L+LS+FATMW ALF 
Sbjct: 518  EPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFE 577

Query: 1823 WITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVT 2002
            WITSSSLAYIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VT
Sbjct: 578  WITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVT 637

Query: 2003 DLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQW 2110
            DLRLP PIS+LEQGM  L++T+SFM+ALP FRMKQW
Sbjct: 638  DLRLPIPISTLEQGMGHLIDTISFMEALPAFRMKQW 673


>ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao]
            gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative
            isoform 4 [Theobroma cacao]
          Length = 607

 Score =  523 bits (1348), Expect = e-145
 Identities = 296/617 (47%), Positives = 380/617 (61%), Gaps = 72/617 (11%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 1    MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERC + N  K
Sbjct: 61   ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 120

Query: 779  INEVLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 958
            +N++L LFG+L LD+  DLGK GDLGFS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 121  LNDILSLFGDLDLDDN-DLGKNGDLGFSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 176

Query: 959  QRDRSSKSSPLKH-----------------------------------------RKEGSK 1015
            QR+  SK +P K+                                         +K GS 
Sbjct: 177  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 236

Query: 1016 SKNARPKKEAEK---VVDEMDFTSSIIVGGQFSVPKVSSGPKQNGSETKLKESE------ 1168
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 237  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 296

Query: 1169 ---------GNQFSILETCSA-----STQN----GFETKS----KEPKGKGSVNESKRVV 1282
                     G+  ++ E  S+     ST+N    G +T S    KE     +V  S+ V+
Sbjct: 297  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAVTSSETVL 356

Query: 1283 SVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQEMCEVQKSGES 1462
                                      R VTWAD++K D+  NGNLCE +EM  ++   E 
Sbjct: 357  KSSLKSAGAKKLN-------------RFVTWADKKKADNAGNGNLCEVKEMETMKGDSEI 403

Query: 1463 SSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQG 1642
            S  +     D  LR                   SG+SD TDAV E G+IILP   + D+ 
Sbjct: 404  SGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKE 463

Query: 1643 DFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMALFG 1822
            +   D DM+EPE  P+KWPKK G+ +S +F+ ED+W+D PPEGF+L+LS+FATMW ALF 
Sbjct: 464  EPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFE 523

Query: 1823 WITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVT 2002
            WITSSSLAYIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VT
Sbjct: 524  WITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVT 583

Query: 2003 DLRLPTPISSLEQGMRS 2053
            DLRLP PIS+LEQGM +
Sbjct: 584  DLRLPIPISTLEQGMNT 600


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  523 bits (1347), Expect = e-145
 Identities = 311/702 (44%), Positives = 411/702 (58%), Gaps = 83/702 (11%)
 Frame = +2

Query: 419  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 598
            MEK + +SVKDAV KLQ SLLEGI+NEDQLFAAGSLMSR DYEDIVTERSI+N+CGYPLC
Sbjct: 1    MEKDKPVSVKDAVFKLQMSLLEGIQNEDQLFAAGSLMSRSDYEDIVTERSITNVCGYPLC 60

Query: 599  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEK 778
            +N LP + PRKGRYR+SLKEHKVYDL ETYM+C S+CVV S+ FAGSLQAERC   + EK
Sbjct: 61   SNALPSDRPRKGRYRISLKEHKVYDLHETYMFCCSNCVVSSKAFAGSLQAERCSGLDLEK 120

Query: 779  INEVLKLFGELSLD------EKEDLG-------KKGDLGFSELKIEE------------- 880
            +N +L LF  L+L+      + ED G       +K +    E+ +E+             
Sbjct: 121  LNNILSLFENLNLEPAENLQKNEDFGLSDLKIQEKTETSSGEVSLEQWAGPSNAIEGYVP 180

Query: 881  --------------KVDVKAGE-------------------VLLEDWVGPSNAIEGYVPQ 961
                          K   KAG                    ++++D    S  + G   Q
Sbjct: 181  KPRDHDSKGLRKNVKKGSKAGHGKPISDINLISSEMGFVSTIIMQDGYSVSKVLPG---Q 237

Query: 962  RDRSS--KSSPLKHRKEGSKSKNARPKKEAEKVVD-EMDFTSSIIVGGQFSVPKVSSGPK 1132
            RD ++  +  P    K+  K      +K+   + D    F SS+I+G      +++   +
Sbjct: 238  RDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSIQDLSSSFKSSLILGTSEKEEELAQSCE 297

Query: 1133 ---QNGSETKLKESEGNQFSILETCSASTQNGFETKSKEPKGKGS--------------- 1258
               ++  +  +K+ +    SI E      QN    KS + KGK S               
Sbjct: 298  AALKSSPDCAIKKKDVYSVSISERQCDVEQNDSAKKSVQVKGKMSRVTANDDASTSNLDP 357

Query: 1259 --VNESKRVVSVGXXXXXXXXXXXXXXXXXXXXXXXRSVTWADERKIDSTSNGNLCEFQE 1432
              V E  +V   G                       R+VTWAD +KI+ST + +LC F+ 
Sbjct: 358  ANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLS---RTVTWAD-KKINSTGSKDLCGFKN 413

Query: 1433 MCEVQKSGESSSHS-NVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGII 1609
              +++   +S+ +S +V + + +LR                   SG+SD +DAVSEAGII
Sbjct: 414  FGDIRNESDSAGNSIDVANDEDTLRRASAEACVIALSSASEAVASGDSDVSDAVSEAGII 473

Query: 1610 ILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLS 1789
            ILP PHDA +     D D+++ + V +KWP+K G+  +  F+++D+W+D  PEGF+L+LS
Sbjct: 474  ILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISEADFFESDDSWFDAAPEGFSLTLS 533

Query: 1790 SFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAG 1969
             FATMW  LF WITSSSLAYIYGRDES  EE+  VNGREYP K+VL+DGRSSEIKQTLA 
Sbjct: 534  PFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNGREYPCKVVLADGRSSEIKQTLAS 593

Query: 1970 CLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSV 2149
            CLARALP LV  LRLP P+S++EQGM  LLETMSF+DALP FR KQW V+ LL IDALSV
Sbjct: 594  CLARALPTLVAVLRLPIPVSTMEQGMACLLETMSFVDALPAFRTKQWQVVALLFIDALSV 653

Query: 2150 CRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            CR+P LI +MT+RR   H+VL  +Q+ +EEYEV+K+L +PLG
Sbjct: 654  CRLPALISYMTDRRASFHRVLSGSQIGMEEYEVLKDLAVPLG 695


>ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Fragaria vesca subsp. vesca]
          Length = 692

 Score =  513 bits (1321), Expect = e-142
 Identities = 308/693 (44%), Positives = 404/693 (58%), Gaps = 79/693 (11%)
 Frame = +2

Query: 434  SISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLP 613
            S SV DAV+KLQ +LL+ +K  D+L+ AGS++SR DY D+VTERSI++LCGYPLC+N LP
Sbjct: 10   SKSVNDAVYKLQLALLDSVKTLDRLYLAGSIISRSDYTDVVTERSIADLCGYPLCSNALP 69

Query: 614  LE--HPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEKINE 787
             E    RKG YR+SLKEHKVYDL+ET +YCSS CV+ S+ FA  L  ERC + +  K+  
Sbjct: 70   PEASRTRKGHYRISLKEHKVYDLRETKLYCSSKCVIDSKAFAQGLSEERCDVLDLGKVER 129

Query: 788  VLKLFGELSLDEKEDLGKKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVPQRD 967
            VL+ FGE    EK+++G   DLG S LKIEEK    +G+V   +  GPSNAIEGYVP+RD
Sbjct: 130  VLREFGE----EKKEIG---DLGLSSLKIEEKSGTYSGKV---EEFGPSNAIEGYVPRRD 179

Query: 968  RSSKSSPLKHRKEGSKSKNARPKKEAEKVV-DEMDFTSSIIVGGQFSVPKVSSGPKQNGS 1144
            R SK+S  K  K+GSK K+A+P    ++++ ++MDF S+++   ++SV K+      N  
Sbjct: 180  RVSKASGAKKNKQGSKGKDAKPSGGGKQLILNDMDFMSTLLACDEYSVSKMPPNVADNNV 239

Query: 1145 ETKLKESEGNQ----FSILETCSASTQNGFETKSKEPKGKGSVNESKRVVS------VGX 1294
            +T+LK+S+G      FS+LET  ++T N    KS+     G +  S+  +       VG 
Sbjct: 240  DTELKKSKGKDLESGFSVLET--SATPN----KSEGVMDVGDLGMSRLKIEAEEESQVGK 293

Query: 1295 XXXXXXXXXXXXXXXXXXXXXXRSVTWADERK---------------------------- 1390
                                  RSVTWADE+                             
Sbjct: 294  GEKSSEGTLRSSLKHSGTKKLSRSVTWADEKSDSTGRRNLCEVRDMEDGLENPGAFDSLY 353

Query: 1391 ------------------IDSTSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGSLRLTLX 1516
                              IDST   N+CE     + ++  E    S V+   G+      
Sbjct: 354  KPSSSSEAGSSFSWVDKTIDSTKCENICEVSGTHDAKEVPEVVGSSVVQ---GNEWFESA 410

Query: 1517 XXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDF---------------- 1648
                           +GE D +DAVS+AGIIILPR    D+ +F                
Sbjct: 411  EACAVALSEAAGAVETGEFDTSDAVSKAGIIILPRTDGVDEEEFIVDGADEEDSIEDSVD 470

Query: 1649 ----QNDEDMIEPEPVPLKWPKKSGVLNSKLFDAEDTWYDTPPEGFNLSLSSFATMWMAL 1816
                  D DM+EPE    KWPKK       LF+ ED+W+D PP+GFNL+LS FATMW AL
Sbjct: 471  EEESTEDIDMLEPEQALSKWPKKPESSQFDLFNPEDSWFDAPPDGFNLTLSPFATMWNAL 530

Query: 1817 FGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGL 1996
            F W TSS+LAYIYG+D+S HEEF  VNGR YP KIVL+DGRSSEIK T+   L+RALP +
Sbjct: 531  FTWTTSSTLAYIYGKDDSFHEEFLNVNGRSYPHKIVLADGRSSEIKLTVGASLSRALPEI 590

Query: 1997 VTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPH 2176
            V +L L  P  +LE+GM  +L TMSF++ALP FRMKQW VI LL I+ LSVCR+P L PH
Sbjct: 591  VAELGLAVP--NLEKGMGFMLNTMSFIEALPAFRMKQWQVIALLFIEGLSVCRMPALTPH 648

Query: 2177 MTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2275
            MTNRR+L+ +VLD A++SVEEYE+MK+ +IPLG
Sbjct: 649  MTNRRVLIQRVLDGARISVEEYEIMKDFLIPLG 681


>ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [Amborella trichopoda]
            gi|548843599|gb|ERN03253.1| hypothetical protein
            AMTR_s00003p00194360 [Amborella trichopoda]
          Length = 591

 Score =  505 bits (1301), Expect = e-140
 Identities = 286/617 (46%), Positives = 386/617 (62%), Gaps = 7/617 (1%)
 Frame = +2

Query: 440  SVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLPLE 619
            S+KDA++K+Q  LL+GI  E+QL AA +L+SR DY+D+VTER+I+NLCGYPLCN  LP +
Sbjct: 8    SLKDAIYKIQTYLLDGISKENQLLAAANLISRSDYDDVVTERTITNLCGYPLCNKYLPCD 67

Query: 620  HPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCLISNSEKINEVLKL 799
             P+KGRYR+SLKEH VYDL+ET++YCS +CV+ S+ F+  L+ ERC  S+  KI E+L L
Sbjct: 68   RPKKGRYRISLKEHSVYDLKETWLYCSPECVINSQAFSKLLKPERCEFSDPGKIAEILNL 127

Query: 800  FGELSLDEKEDLG----KKGDLGFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVPQRD 967
            F   S++E    G    +K  L FS L I EK DV  G++   D+VGP NAIEGYVP++D
Sbjct: 128  FSSPSIEESNAGGAEKNEKISLAFSSLTIHEKEDVSVGDIQSMDFVGPYNAIEGYVPRQD 187

Query: 968  RSSKSSPLKHRKEGSKSKNARPKKEAEKVVDEMDFTSSIIVGGQFSVPKVSSGPKQNGSE 1147
            +     P   RK GSKS  +  KK+   +  E +F S+II+G      + SSG  Q  S 
Sbjct: 188  QV----PPVQRK-GSKSGKSTTKKDP--IYPETNFASTIIIG------EPSSGNLQKNSS 234

Query: 1148 TKLKESEGNQFSILETCSASTQNGFETKSKEPKGKGSVNESKRVVSVGXXXXXXXXXXXX 1327
            +K      +            Q   ++  KE K + ++       S              
Sbjct: 235  SKFVNDHVHVNVEGSKREQHAQEKSQSHPKETKLRSALKNLGAKAST------------- 281

Query: 1328 XXXXXXXXXXXRSVTWADERK--IDSTSNGNLCEFQEMCEVQKSGESSSHSNVEDFDGSL 1501
                       R+V+WADE++  ++   N  L   Q +    K  ESS   +VED   S 
Sbjct: 282  -----------RTVSWADEQQTIVEGIQNMTLNNCQGIESGSKCKESSDSLSVEDTMISS 330

Query: 1502 RLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEP 1681
            R                   SG+S+  DA SEAGI+I P P+  ++ + Q   D ++PE 
Sbjct: 331  RRASAEACASALTEAAAAVASGQSNTLDAASEAGILIFPCPNSVEEENIQKVADELKPEE 390

Query: 1682 VPLKWPKKSGVLNSKLFDAE-DTWYDTPPEGFNLSLSSFATMWMALFGWITSSSLAYIYG 1858
               KW K+  +L++  FD E D+WYD PPEGF+L+LSSFATMWMALFGW+T+SS+AYIYG
Sbjct: 391  GE-KWVKRPSLLHTGAFDTEEDSWYDAPPEGFSLTLSSFATMWMALFGWVTASSMAYIYG 449

Query: 1859 RDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLE 2038
            R ES+ EEF +V+GREYP K VL DG SSEIK+TL+GCLARALPG+V +++LPTPIS+LE
Sbjct: 450  RAESAEEEFVVVDGREYPHKFVLGDGLSSEIKETLSGCLARALPGVVANIKLPTPISTLE 509

Query: 2039 QGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDS 2218
              +  LL+TM+F +ALPPFRMKQWHVIVLL +DALSV  +P L  H+ +RR L+HK+L+ 
Sbjct: 510  VALGRLLDTMTFTEALPPFRMKQWHVIVLLFLDALSVHIVPALEQHIASRRTLVHKMLED 569

Query: 2219 AQVSVEEYEVMKNLVIP 2269
            AQVS EEY +M++L +P
Sbjct: 570  AQVSNEEYNIMRDLFLP 586

