BLASTX nr result

ID: Akebia27_contig00004278 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00004278
         (2604 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   693   0.0  
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   686   0.0  
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   612   e-172
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   602   e-169
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     587   e-165
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   585   e-164
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   579   e-162
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   574   e-161
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   572   e-160
ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Th...   571   e-160
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   556   e-155
ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subuni...   555   e-155
gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus...   552   e-154
ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prun...   543   e-151
ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma c...   525   e-146
ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma c...   523   e-145
ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [A...   494   e-137
ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobr...   493   e-136
ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subuni...   493   e-136
gb|ABR17753.1| unknown [Picea sitchensis]                             406   e-110

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  693 bits (1789), Expect = 0.0
 Identities = 371/649 (57%), Positives = 439/649 (67%), Gaps = 30/649 (4%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M   Q I+VKDAVHKLQ  LLEGI+NE+QLFAAGSLMSR DYED+VTER+I+NLCGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCSS CVV SR+FAGSLQ ERCS+ NSE+
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            IN +L+LF                   SELKI E V+ KAGEV +EDW+GPSNAIEGYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 827  QRDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQN 1006
            QRDR+ K   +K+ KEGSKS N++       V+DEMDF STII   ++S+ K S G K  
Sbjct: 181  QRDRNLKPKNIKNHKEGSKSSNSKMDSGKNFVIDEMDFVSTIITKDEYSISKSSKGLKDT 240

Query: 1007 GSETKLKE-----SEGNQFSILETCSASTQNGFETKSKEPKGKGSV-----NESTREVS- 1153
             S  K KE     S G+Q S+LE  +   QN  E+K +E KG+ S        ST EV  
Sbjct: 241  TSHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPS 300

Query: 1154 -------------------VGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSN 1276
                                                          VTWADE K+DS  +
Sbjct: 301  VPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADE-KMDSADS 359

Query: 1277 GNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDA 1456
             + C+ +E+   ++        +V D D +LR                   SGE+D TDA
Sbjct: 360  RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDA 419

Query: 1457 VSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPE 1636
            VSEAGIIILP P D D+G+   D D++EPEPVPLKWP K G+ +S++FD++D+WYDTPPE
Sbjct: 420  VSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479

Query: 1637 GFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSE 1816
            GFSL+LS FATMWMALF WITSSS+AYIYGRDES HEE+  VNGREYP+KIVL+DGRSSE
Sbjct: 480  GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539

Query: 1817 IKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLL 1996
            IKQTLAGCL+RALPGLV DLRLP P+S+LEQG+  LL+TMSF+DALP FRMKQW VIVLL
Sbjct: 540  IKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599

Query: 1997 LIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
             IDALSVCRIP L PHMT+RRML  KV D+AQVS EEYEVMK+L+IPLG
Sbjct: 600  FIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLG 648


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  686 bits (1771), Expect = 0.0
 Identities = 368/649 (56%), Positives = 437/649 (67%), Gaps = 30/649 (4%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M   Q I+VKDAVHKLQ  LLEGI+NE+QLFAAGSLMSR DYED+VTER+I+NLCGYPLC
Sbjct: 1    MAGDQPIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCSS CVV SR+FAGSLQ ERCS+ NSE+
Sbjct: 61   SNSLPSERLRKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSER 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            IN +L+LF                   SELKI E V+ KAGEV +EDW+GPSNAIEGYVP
Sbjct: 121  INGILRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP 180

Query: 827  QRDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQN 1006
            QRDR+ K   +K+RKEGSKS N++       V+DEMDF  TII   ++S+ K S G K  
Sbjct: 181  QRDRNLKPKNIKNRKEGSKSSNSKMDSGKNFVIDEMDFVRTIITEDEYSISKSSKGLKDT 240

Query: 1007 GSETKLKE-----SEGNQFSILETCSASTQNGFETKSKEPKGKGSV-----NESTREVS- 1153
             S  K KE     S G+Q S+LE  +   QN  E+K +E KG+ S        ST EV  
Sbjct: 241  TSHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKLRESKGRRSRVIFKDEFSTAEVPS 300

Query: 1154 -------------------VGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSN 1276
                                                          VTWADE K+DS  +
Sbjct: 301  VPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADS 359

Query: 1277 GNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDA 1456
             + C+ +E+   ++        +V D D +LR                   SGE+D TDA
Sbjct: 360  RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDA 419

Query: 1457 VSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPE 1636
            VSEA IIILP P D D+G+   D D++EPEPVPLKWP K G+ +S++FD++D+WYDTPPE
Sbjct: 420  VSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479

Query: 1637 GFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSE 1816
            GFSL+LS FATMWMALF WITSSS+AYIYGRDES HEE+  VNGREYP+KIVL+DGRSSE
Sbjct: 480  GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539

Query: 1817 IKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLL 1996
            IKQTLAGCLARALPGLV DLRLP P+S+LEQG+  LL+TMSF+DALP FRMKQW VIVLL
Sbjct: 540  IKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599

Query: 1997 LIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
             IDALSVC+IP L PHM ++RML  KV D+AQVS EEYEVMK+L+IPLG
Sbjct: 600  FIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLG 648


>ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
            gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative
            isoform 1 [Theobroma cacao]
          Length = 739

 Score =  612 bits (1578), Expect = e-172
 Identities = 339/683 (49%), Positives = 429/683 (62%), Gaps = 64/683 (9%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERCS+ N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N++L LF                  FS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDNDLGKNGDLG-FSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 827  QRDRSSKSSPLKH-----------------------------------------RKEGSK 883
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 884  SKNARAKKEAEK---VVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETKLKESE------ 1036
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1037 ---------GNQFSILETCSA-----STQNGFETKSKEPKGKGSVNESTREVSVGXXXXX 1174
                     G+  ++ E  S+     ST+N +++         S  E+ +E         
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDT-----SSAEAEKETHADKAVTS 405

Query: 1175 XXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVED 1354
                                VTWAD++K D+  NGNLCE +EM  ++   E S  +    
Sbjct: 406  SETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGG 465

Query: 1355 FDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDM 1534
             D  LR                   SG+SD TDAV E G+IILP   + D+ +   D DM
Sbjct: 466  DDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDM 525

Query: 1535 IEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLA 1714
            +EPE  P+KWPKK G+ +S++F+ ED+W+D PPEGFSL+LS+FATMW ALF WITSSSLA
Sbjct: 526  LEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLA 585

Query: 1715 YIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPI 1894
            YIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VTDLRLP PI
Sbjct: 586  YIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPI 645

Query: 1895 SSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHK 2074
            S+LEQGM  L++T+SFM+ALP FRMKQW VIVLL IDALSVCRIP L PHMTN RMLLHK
Sbjct: 646  STLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHK 705

Query: 2075 VLDSAQVSVEEYEVMKNLVIPLG 2143
            VLD AQ+S+EEYEVMK+L+IPLG
Sbjct: 706  VLDGAQISMEEYEVMKDLIIPLG 728


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  602 bits (1551), Expect = e-169
 Identities = 333/654 (50%), Positives = 406/654 (62%), Gaps = 35/654 (5%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K++S+SVKD V+KLQ SLLEGI+NEDQL AAGSLMSR DYED+V ERSISNLCGYPLC
Sbjct: 1    MAKEESVSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            NN LP + P KGRYR+SLKEH+VYDLQETYMYCSS C+V SR F+ SLQ +RCS+ N  K
Sbjct: 61   NNSLPSDRPYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +NE+L+ F                   S LKI+EK +   G+V LE+W+GPSNAIEGYVP
Sbjct: 121  LNEILRKFNDLTLDSEGLGRSGDLG-LSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVP 179

Query: 827  QRDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQN 1006
            Q DR    S LK+ KEG K+   +   + +    + DFTSTII   ++S+ K  SG    
Sbjct: 180  QGDRDPNPS-LKNHKEGLKAICKKPVSKQDCFFSDTDFTSTIITNDEYSISKGPSGLTST 238

Query: 1007 GSETKLKESEGN-------QFSIL---ETCSASTQNGFETKSKEPK-------------- 1114
             S+ KL+   G        Q S L   ++  AS ++    K K  K              
Sbjct: 239  ASDIKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDLPSSSYY 298

Query: 1115 -----------GKGSVNESTREVSVGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERKI 1261
                       G  ++NES  + S+                          VTWADER +
Sbjct: 299  TAEAEDISQATGAANLNESVLKPSL---------------KSSGAKRSNRSVTWADER-V 342

Query: 1262 DSTSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGES 1441
            D+  + NLCE QEM    +S E S  +N  D    LR                   SG++
Sbjct: 343  DNAGSRNLCEVQEMEQTNESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDA 402

Query: 1442 DATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWY 1621
            D   A+SEAGII+LP   D  QG      DMIE E   LKWP K G+  S+LFD ED+WY
Sbjct: 403  DVNKAMSEAGIIVLPPSQDLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWY 462

Query: 1622 DTPPEGFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSD 1801
            D PPEGFSL+LS FATMWMALF W+TSSSLAYIYGRDES+HE++  VNGREYPRKIVL D
Sbjct: 463  DAPPEGFSLTLSPFATMWMALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRD 522

Query: 1802 GRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWH 1981
            GRSSEI+ T   CLAR  PGLV +LRLP P+S+LEQG   LLETMSF+DALP FR KQW 
Sbjct: 523  GRSSEIRLTAESCLARTFPGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQ 582

Query: 1982 VIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            VI LL I+ALSVCRIP L  +MT+RRM+LH+VLD A +S EEY++MK+ ++PLG
Sbjct: 583  VIALLFIEALSVCRIPALTSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLG 636


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  587 bits (1513), Expect = e-165
 Identities = 339/685 (49%), Positives = 415/685 (60%), Gaps = 72/685 (10%)
 Frame = +2

Query: 305  ISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLPL 484
            ISVKD V++LQ SLL+G+  EDQLFAAGS+MSR DY D+VTERSI+NLCGYPLC NPLP 
Sbjct: 9    ISVKDTVYRLQLSLLQGLHGEDQLFAAGSIMSRSDYNDVVTERSIANLCGYPLCPNPLPS 68

Query: 485  EHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEKINEVLK 664
            + PRKGRYR+SLKEHKVYDL ETYMYCSSDCV+ SRTFA SL+ ERC++ +S +I+ VL+
Sbjct: 69   DRPRKGRYRISLKEHKVYDLHETYMYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLR 128

Query: 665  LF-XXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVPQRDRS 841
            +F                   FS+LKIEEK +   G+V LE W GPSNAIEGYV QR+R 
Sbjct: 129  MFEDYSGLERELGFGKDRDLGFSKLKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRERK 188

Query: 842  SKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETK 1021
             K    K  K GSK+ N         ++++MDF STII   +++V K  S  K+ G ++K
Sbjct: 189  PKELGSKSPKRGSKANNT-------VLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSK 241

Query: 1022 LKESE--------GNQFSILETCSASTQN------GFETKSKEPKGKGSVNESTR---EV 1150
            ++E E        GN+F++LET  A   N       FE  +   +  GS   S R   E 
Sbjct: 242  VREQEEILAKKAMGNEFAVLETSYAPASNVSRVGLVFEDVTSSLRA-GSCLSSARAEEES 300

Query: 1151 SVGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSGES 1330
                                        VTWADE K DS+    LCE +E+ D+++    
Sbjct: 301  HDDKAEKCTEASIKSSLKPSRKKKLSRTVTWADE-KTDSSGGRKLCEIREIEDMKEDPSV 359

Query: 1331 SSHSNVEDFDGSLRL-----------------------TLXXXXXXXXXXXXXXXXSGES 1441
              + N   F  S ++                                         +GE+
Sbjct: 360  VENKNGVSFTSSGKMKAGQSVIWADEKGDSSKSIDVCEVREIEDAKEAADMLCNADTGEN 419

Query: 1442 D-------------ATDAVSEA---------------GIIILPRPHDADQG---DFQNDE 1528
            D             A D  SEA               GIIILPRP + D+G   +  +D+
Sbjct: 420  DDTFRFASAEACARALDEASEAVASEELEVNDAMSEAGIIILPRPENGDEGEPMEEDDDD 479

Query: 1529 DMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSS 1708
            +  EPE  P+KWPKK G  +S+LFD ED+W+D PPE FSL+LS FA MW ALF W TSS+
Sbjct: 480  ETSEPEQAPIKWPKKPGSQHSDLFDPEDSWFDAPPEDFSLTLSPFAKMWNALFTWTTSST 539

Query: 1709 LAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPT 1888
            LAYIYGRDES HEE+++VNGREYP KIV  DGRSSEIKQTLAG LARALPGLV DLRL T
Sbjct: 540  LAYIYGRDESLHEEYAVVNGREYPEKIVFGDGRSSEIKQTLAGSLARALPGLVADLRLST 599

Query: 1889 PISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLL 2068
            PISSLEQGM  LL+TMSF+DALPPFRMKQW VI+LL ++ALSV R+P L PHM  RR+L 
Sbjct: 600  PISSLEQGMGRLLDTMSFVDALPPFRMKQWQVIILLFLEALSVYRLPALTPHMMYRRVLF 659

Query: 2069 HKVLDSAQVSVEEYEVMKNLVIPLG 2143
            HKVLDSAQ+S EEYEVMK+LVIPLG
Sbjct: 660  HKVLDSAQISAEEYEVMKDLVIPLG 684


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  585 bits (1509), Expect = e-164
 Identities = 330/657 (50%), Positives = 416/657 (63%), Gaps = 38/657 (5%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K ++++VKDAVHKLQ  LLEGIK+E QL AAGSL+SR DY+D+VTERSI+N+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCS++CVV S  FAGSLQ ER S  N  K
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N+VL LF                   S+LKI+EKVD+K GEV LE+W+GPSNAIEGYVP
Sbjct: 121  LNQVLNLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVP 180

Query: 827  QRDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQN 1006
            QRDRS   + LK+  +GSK+K+AR + E   +++E DF+STII   ++SV K  + P   
Sbjct: 181  QRDRSVNPALLKNINKGSKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPA-PVNA 239

Query: 1007 GSETKLKESE---------------GNQFSIL------ETCSASTQNGFETKSKEPKGKG 1123
             S  K KE++               G Q   L      ET  +     F    K   G+ 
Sbjct: 240  DSNVKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEV 299

Query: 1124 SVNESTREVSVGXXXXXXXXXXXXXXXXXXXXXXXXX-----------VTWADERKID-- 1264
            S   S  +V                                       VTWADE  ID  
Sbjct: 300  SSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRSVTWADE-SIDGG 358

Query: 1265 ----STSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXS 1432
                + S+  + E++     Q  G S+S +++E+ D S R                   S
Sbjct: 359  IGKKTESSSKISEYES----QAYGGSAS-TDMEENDDSYRFESAEACAAALSQAAEAVAS 413

Query: 1433 GESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAED 1612
            G SD  DAVS+AGI+ILP   + D+   Q  ++M++ E  PLKWP+K G+ N ++F++ED
Sbjct: 414  G-SDVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESED 472

Query: 1613 TWYDTPPEGFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIV 1792
            +WYD+PPEGF+++LS F TM+ +LF WI+SSSLA+IYG DES++EE+  +NGREYPRKIV
Sbjct: 473  SWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRKIV 532

Query: 1793 LSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMK 1972
            LSDGRS+EIKQTLAGCLARALPGLV DLRLP PIS+LEQGM  LL TMSF+D LP FRMK
Sbjct: 533  LSDGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMK 592

Query: 1973 QWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            QW +IVLL +DALSVCRIP L P+MT RR    KVLD AQ+S  EYE+MK+L+IPLG
Sbjct: 593  QWQLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLG 649


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  579 bits (1492), Expect = e-162
 Identities = 325/690 (47%), Positives = 412/690 (59%), Gaps = 71/690 (10%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K QS  VKD ++KLQ SLL+GI+NEDQL AAGS+MS  DYED+VTER+I+NLCGYPLC
Sbjct: 1    MAKDQSTVVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             N LP + P+KGRYR+SLKEHKVYDL ETYMYCSS CV+ SRTF+GSLQ ERC + N  K
Sbjct: 61   GNSLPSDRPQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVLNPAK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +NEVL LF                  FS LKIEEK +   GEV  E W+GPSNAIEGYVP
Sbjct: 121  LNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIEGYVP 180

Query: 827  QRDR-----------------------------------------SSKSSPLKHRKEGSK 883
            QRDR                                           K+       +GSK
Sbjct: 181  QRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGSKGSK 240

Query: 884  SKNARAKKEAEKVVDEMDFTSTIIVG-GQFSVPKLSSGPKQNGSETKL-KESEGNQFSIL 1057
            +K  +   + E  +++M+FTSTII+   ++S+ K  SG     S+TK+ K+ E       
Sbjct: 241  AKGTKQSSKQESFINDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKVSQKSS 300

Query: 1058 ETCSASTQNGFETKS----KEPKGKGSVNES-----------------------TREVSV 1156
            E  S++T+    +K+    KE + K ++ +                         +E SV
Sbjct: 301  ENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEAKEKSV 360

Query: 1157 GXXXXXXXXXXXXXXXXXXXXXXXXX-VTWADERKIDSTSNGNLCEFQEMCDVQKSGESS 1333
                                       VTWADE K+ S+ + +LCE + M D +   E  
Sbjct: 361  SEKAAKPVESSLKPSLKTSGAKQLTRSVTWADE-KVGSSGSRDLCEVRGMEDTKAGPEIV 419

Query: 1334 SHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGD 1513
             + +  D     +                   SG++DA++A+SEAG++ILP+PHD DQGD
Sbjct: 420  DNIDKRDDGYVSKFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPHDLDQGD 479

Query: 1514 FQNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGW 1693
               D D+++ E   +KWP K G+  SE FD E++WYD PPEGFSL LSSFAT+WMALF W
Sbjct: 480  PMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIWMALFAW 539

Query: 1694 ITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTD 1873
            +TSSSLAY+YG+DESSHEE+ +VNGREYPRKIVL DGRS EI+QT+ GCL RA P +V D
Sbjct: 540  VTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVAD 599

Query: 1874 LRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTN 2053
            LRLP PIS+LEQG  +LL TMSF+DA+P FRMKQW VI LL I+ALSVCRIP LI +M N
Sbjct: 600  LRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPALISYMDN 659

Query: 2054 RRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            RRM    V+D  ++S EEYEVMK+L+IPLG
Sbjct: 660  RRM----VVDGVRMSAEEYEVMKDLMIPLG 685


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  574 bits (1479), Expect = e-161
 Identities = 331/661 (50%), Positives = 416/661 (62%), Gaps = 42/661 (6%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K ++++VKDAVHKLQ  LLEGIK+E+QL AAGSL+SR DY+D+VTERSI+N+CGYPLC
Sbjct: 1    MAKGEAVAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            +N LP E  RKG YR+SLKEHKVYDL ETYMYCS++CVV S  FAGSLQ ER S  N  K
Sbjct: 61   SNSLPSERSRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAG-EVLLEDWVGPSNAIEGYV 823
            +N+VL LF                   S+LKI+EKVDVK G EV LE+W+GPSNAIEGYV
Sbjct: 121  LNQVLNLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYV 180

Query: 824  PQRDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQ 1003
            PQRDRS   + LK+  +G K+K+AR + E   +++E DF+STII   ++SV K  + P  
Sbjct: 181  PQRDRSVNPALLKNINKGFKNKHARLQDEKNMILNEFDFSSTIITQDEYSVSKFPA-PVN 239

Query: 1004 NGSETKLKESEG--------NQFSIL-------------ETCSASTQNGFETKSKEPKGK 1120
              S  K KE++         +  SIL             ET  +     F    K   G+
Sbjct: 240  AVSSEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGE 299

Query: 1121 GSVNESTREVS-------------VGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADE--- 1252
             S   S  +V                                         VTWADE   
Sbjct: 300  VSSGPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWADEIID 359

Query: 1253 ----RKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXX 1420
                +K +S+S   + E++     Q  G S+S +++E+ D S R                
Sbjct: 360  GGIGKKTESSSK--ISEYEN----QAYGGSAS-TDMEEDDDSYRFESAEACAAALSQAAE 412

Query: 1421 XXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELF 1600
               SG SD  DAVS+AGI+ILP   + D+   Q  E M++ EP PLKWP+K G+ N ++F
Sbjct: 413  AVASG-SDVPDAVSKAGIVILPTSQEVDEAILQETE-MLDIEPAPLKWPRKPGMPNYDVF 470

Query: 1601 DAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYP 1780
            ++ED WYD PPEGF+++LS FATM+ +LF WI+SSSLA+IYG DE+++EE+  +NGREYP
Sbjct: 471  ESEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYP 530

Query: 1781 RKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPP 1960
             KIVLSDG S+EIKQTLAGCLARALPGLV DLRLP PIS+LEQGM  LL TMSF+D LP 
Sbjct: 531  HKIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPA 590

Query: 1961 FRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPL 2140
            FRMKQW +IVLL +DALSVCRIP L P+MT RR  L KVLD AQ+S  EYE+MK+L+IPL
Sbjct: 591  FRMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPL 650

Query: 2141 G 2143
            G
Sbjct: 651  G 651


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  572 bits (1475), Expect = e-160
 Identities = 318/658 (48%), Positives = 412/658 (62%), Gaps = 39/658 (5%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            MEK Q ISVKDAV KLQ +LLEGI++EDQLFAAGSL+SR DYED+VTERSI+ +C YPLC
Sbjct: 1    MEKDQPISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             N LP E PRKGRYR+SLKEHKVYDL ETYM+CSS CVV S+ FAGSL+ +RC   + +K
Sbjct: 61   CNALPSERPRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N +L+LF                   S L+I++K +    EV LE WVGPSNAIEGYVP
Sbjct: 121  LNNILRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVP 179

Query: 827  Q-RDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSG--- 994
            + RD  SK S  K+ K+GSK+ + ++      +  E DF STII+  ++SV K+SSG   
Sbjct: 180  KKRDNGSKGSQ-KNTKKGSKASHGKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTD 238

Query: 995  ---------------PKQNGSETKLKESE----GNQFSILETCSASTQNGFETKSKEPKG 1117
                           PK+   E   K+ +     + F+     SAS ++    KS +   
Sbjct: 239  ATVDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVL 298

Query: 1118 KGSVN----------------ESTREVSVGXXXXXXXXXXXXXXXXXXXXXXXXXVTWAD 1249
            KG  N                +   ++ +                          VTWAD
Sbjct: 299  KGKTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWAD 358

Query: 1250 ERKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLXXXXXXXXXXXXXXXX 1429
             +KID   + +LC F+E  +++K  + + + +V D +  LR                   
Sbjct: 359  -KKIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVA 417

Query: 1430 SGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAE 1609
            SG+SDA DAVSEAGIIILP   +A +    +D D++E + V LKWP+K G+ + +LF ++
Sbjct: 418  SGDSDAIDAVSEAGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDFDLFASD 477

Query: 1610 DTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKI 1789
            D+W+D PPEGFSL+LS FAT+W A F WITSSSLAYIYGRD S +EEF  V+GREYP KI
Sbjct: 478  DSWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKI 537

Query: 1790 VLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRM 1969
            VLSDGRSSEIKQTLA CLARALP +V +L+LP P+S+LEQGM  LL+TMSF+D LP FR 
Sbjct: 538  VLSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRF 597

Query: 1970 KQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            KQW V+ LL +DALSVCRIP LI +MT+RR L HKVL  +Q+ +EEY V+K+L++PLG
Sbjct: 598  KQWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLG 655


>ref|XP_007016928.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
            gi|508787291|gb|EOY34547.1| F2P16.20-like protein isoform
            3, partial [Theobroma cacao]
          Length = 703

 Score =  571 bits (1472), Expect = e-160
 Identities = 322/669 (48%), Positives = 408/669 (60%), Gaps = 64/669 (9%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERCS+ N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N++L LF                  FS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDNDLGKNGDLG-FSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 827  QRDRSSKSSPLKH-----------------------------------------RKEGSK 883
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 884  SKNARAKKEAEK---VVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETKLKESE------ 1036
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1037 ---------GNQFSILETCSA-----STQNGFETKSKEPKGKGSVNESTREVSVGXXXXX 1174
                     G+  ++ E  S+     ST+N +++         S  E+ +E         
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDT-----SSAEAEKETHADKAVTS 405

Query: 1175 XXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVED 1354
                                VTWAD++K D+  NGNLCE +EM  ++   E S  +    
Sbjct: 406  SETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGG 465

Query: 1355 FDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDM 1534
             D  LR                   SG+SD TDAV E            D+ +   D DM
Sbjct: 466  DDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVCEV-----------DKEEPMEDGDM 514

Query: 1535 IEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLA 1714
            +EPE  P+KWPKK G+ +S++F+ ED+W+D PPEGFSL+LS+FATMW ALF WITSSSLA
Sbjct: 515  LEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLA 574

Query: 1715 YIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPI 1894
            YIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VTDLRLP PI
Sbjct: 575  YIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPI 634

Query: 1895 SSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHK 2074
            S+LEQGM  L++T+SFM+ALP FRMKQW VIVLL IDALSVCRIP L PHMTN RMLLHK
Sbjct: 635  STLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLLHK 694

Query: 2075 VLDSAQVSV 2101
            VLD AQ+S+
Sbjct: 695  VLDGAQISM 703


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  556 bits (1433), Expect = e-155
 Identities = 318/663 (47%), Positives = 405/663 (61%), Gaps = 39/663 (5%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K QS+ +KD V+KLQ +L EGIKNE+QLFAAGSLMSR DYED+VTERSI++LCGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            ++ LP ++ R+GRYR+SLKEHKVYDL+ETY YCSS C++ SR F+G LQ ERCS+ N +K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            + E+LKLF                   S L+I+EK++   GEV +E+W+GPSNAIEGYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 827  QRDRSSKSSPLKHRKEGSKSKNARAKK--EAEKVVDEMDFTSTIIVGGQFSVPKLSSGPK 1000
             RD    +   K  KE      A+ K     +    +   TSTII   ++SV K+SSG K
Sbjct: 178  HRDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLK 237

Query: 1001 QNGSETKLKESEG--------NQFSILET--CSASTQNGFETKSKEPKGKGSVN---EST 1141
            +   +T  K   G        +QF+ILET    A  +N    K++  K +  V+   EST
Sbjct: 238  EMALDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKEST 297

Query: 1142 REVSV--------------------GXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERKI 1261
              +S                     G                         VTWADE K 
Sbjct: 298  DNLSDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADE-KT 356

Query: 1262 DSTSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGS----LRLTLXXXXXXXXXXXXXXXX 1429
            D  S  NL E  EM   ++   ++S  N+ +FD      LR+                  
Sbjct: 357  DDASIMNLPEVGEMGKTKECSRTTS--NLVNFDNDNEDILRVESAEACAMALSQAAEAIT 414

Query: 1430 SGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAE 1609
            SG+S+ +DAVSEAGIIILP P DA++    +  +  EP     K   K GVL S+LFD  
Sbjct: 415  SGQSEVSDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDLFDPS 473

Query: 1610 DTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKI 1789
            D+WYD PPEGFSL+LSSFATMWMA+F W+TSSSLAYIYG+D+  HEEF  ++G+EYP KI
Sbjct: 474  DSWYDAPPEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKI 533

Query: 1790 VLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRM 1969
            V +DGRSSEIKQTLAGCL RA+PGL ++L L TPIS LE GM  LL+TM+F+DALP FRM
Sbjct: 534  VSADGRSSEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRM 593

Query: 1970 KQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLGYL 2149
            KQW VIVLL I+ALSV RIP L  HM++ R L HKVLD AQ+  +EYE+M++ ++PLG  
Sbjct: 594  KQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRT 653

Query: 2150 TSL 2158
              L
Sbjct: 654  AQL 656


>ref|XP_004157008.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 632

 Score =  555 bits (1430), Expect = e-155
 Identities = 310/644 (48%), Positives = 404/644 (62%), Gaps = 20/644 (3%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K QS+ +KD V+KLQ +L EGIKNE+QLFAAGSLMSR DYED+VTERSI++LCGYPLC
Sbjct: 1    MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
            ++ LP ++ R+GRYR+SLKEHKVYDL+ETY YCSS C++ SR F+G LQ ERCS+ N +K
Sbjct: 61   HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            + E+LKLF                   S L+I+EK++   GEV +E+W+GPSNAIEGYVP
Sbjct: 121  LKEILKLFENMSLDSKENMGNNCD---SGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVP 177

Query: 827  QRDRSSKSSPLKHRKEGSKSKNARAKK--EAEKVVDEMDFTSTIIVGGQFSVPKLSSGPK 1000
             RD    +   K  KE      A+ K     +    +  FTSTII   ++SV K+SSG K
Sbjct: 178  HRDHKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLK 237

Query: 1001 QNGSETKLKESEG--------NQFSILET--CSASTQNGFETKSKEPKGKGSVN---EST 1141
            +   +T  K   G        +QF+ILET    A  +N    K++  K +  V+   EST
Sbjct: 238  EMALDTNSKNQTGEFCGKKSNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKEST 297

Query: 1142 REVSVGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKS 1321
              +S                               +E + + T + ++    E+ ++ K+
Sbjct: 298  DNLSDAPSTSNNRSTNFNLM--------------TEEPRDEKTDDASIMNLPEVGEMGKT 343

Query: 1322 GESS-SHSNVEDFDGS----LRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILP 1486
             E S + SN+ +FD      LR+                  SG+S+ +DAVSEAGIIILP
Sbjct: 344  KECSRTTSNLVNFDNDNEDLLRVESAEACAMALSQAAKAITSGQSEVSDAVSEAGIIILP 403

Query: 1487 RPHDADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFA 1666
             P DA++    +  +  EP     K   K GVL S+LFD  D+WYD PPEGFSL+LSSFA
Sbjct: 404  HPSDANEEASTDPVNASEPHSFSEK-SNKLGVLRSDLFDPSDSWYDAPPEGFSLTLSSFA 462

Query: 1667 TMWMALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLA 1846
            TMWMA+F W+TSSSLAYIYG+D+  HEEF  ++G+EYP KIV +DGRSSEIKQTLAGCL 
Sbjct: 463  TMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRSSEIKQTLAGCLT 522

Query: 1847 RALPGLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRI 2026
            RA+PGL ++L L TPIS LE GM  LL+TM+F+DALP FRMKQW VIVLL I+ALSV RI
Sbjct: 523  RAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVSRI 582

Query: 2027 PGLIPHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLGYLTSL 2158
            P L  HM++ R L HKVLD AQ+  +EYE+M++ ++PLG    L
Sbjct: 583  PSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQL 626


>gb|EYU19406.1| hypothetical protein MIMGU_mgv1a003240mg [Mimulus guttatus]
          Length = 597

 Score =  552 bits (1423), Expect = e-154
 Identities = 301/628 (47%), Positives = 412/628 (65%), Gaps = 9/628 (1%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M+  + + VKDAVHKLQ SLLEGIK+E QL AAGSL+S+ DY+D+VTER+I+++CGYPLC
Sbjct: 1    MKDGKILGVKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             N LP E PRKG YR+SLKEHKVYDL ET+MYCS++C+++SR F  SL+ ER S  +  K
Sbjct: 61   VNSLPSEPPRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            IN VLK+F                   S LKI EK+   +GE+ LE+WVGPSNAI+GYVP
Sbjct: 121  INSVLKMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVP 180

Query: 827  QRDRSSKSSPLKHRKEGSKSK---NARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGP 997
            +RD++S+      RK+ S+ K   N      A+ +  +++FTSTII+  ++SV K ++ P
Sbjct: 181  RRDQNSE------RKQPSRKKTESNHAKPNLADTLPFDVNFTSTIIMQDEYSVSK-TAVP 233

Query: 998  KQNGSETK----LKESEGNQFSILETCSASTQNGFETKSKEPKGKGSVNESTREVSVGXX 1165
            ++   + K     K  +  + S+L+  +  +QN         K   S  E+         
Sbjct: 234  REAKGKVKGKMIRKSVKAEKISVLDDTAGPSQNDTTLLKSSLKTLDSKKETRS------- 286

Query: 1166 XXXXXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNG-NLCEFQEMCDVQKSGESSSHS 1342
                                   VTWADE+   S  +G ++ E +E+ D  K      H 
Sbjct: 287  -----------------------VTWADEK---SDGDGKSISECREIGD-NKGAVVMPHL 319

Query: 1343 NVEDF-DGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQ 1519
              ED  D S R T                 SG++DA+DAVSEAG+IILP PH+ D+  ++
Sbjct: 320  TDEDVGDESYRFTSAEACARALSQASEAVASGKTDASDAVSEAGVIILPPPHEVDEAKYE 379

Query: 1520 NDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWIT 1699
               ++++ +P+ LKWP K G  + +LFD+ED+WYD+PPEGF+L+LS F+TM+M+LF WI+
Sbjct: 380  QIGEVVDVDPIELKWPPKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFSTMFMSLFAWIS 439

Query: 1700 SSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLR 1879
            SSSLAYIYG++E  HE++  +NGREYP KI++ DGRS+E+K TLAGCLARALPGLV+++R
Sbjct: 440  SSSLAYIYGKEERFHEDYLSINGREYPPKIII-DGRSAEVKHTLAGCLARALPGLVSEIR 498

Query: 1880 LPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRR 2059
            +PTP+S++EQGM  LL+TMSF DALP FRMKQW VI LL +DALSV RIP L P+MT RR
Sbjct: 499  IPTPVSTIEQGMGRLLDTMSFTDALPGFRMKQWQVIALLFLDALSVSRIPALSPYMTGRR 558

Query: 2060 MLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            +LL KVL+ AQ++VEE+E+MK+L+IPLG
Sbjct: 559  ILLPKVLEGAQINVEEFEIMKDLIIPLG 586


>ref|XP_007208433.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
            gi|462404075|gb|EMJ09632.1| hypothetical protein
            PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  543 bits (1399), Expect = e-151
 Identities = 322/695 (46%), Positives = 406/695 (58%), Gaps = 77/695 (11%)
 Frame = +2

Query: 290  EKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCN 469
            ++Q  ISVKD V+KLQ +LLEGIK +D L+ AGS++SR DY D+VTER+I+NLCGYPLC+
Sbjct: 8    QQQPRISVKDTVYKLQLALLEGIKTQDHLYLAGSIISRSDYNDVVTERTIANLCGYPLCS 67

Query: 470  NPLPLE--HPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSE 643
            N LP +   P KG YR+SLKEHKVYDL ETYMYCSS CV++S+ FA SL  ERC + +  
Sbjct: 68   NALPSDSSRPHKGHYRISLKEHKVYDLHETYMYCSSRCVIESKAFAQSLGEERCDVLDFG 127

Query: 644  KINEVLKLFXXXXXXXXXXXXXXXXXX-FSELKIEEKVDVKAGEVLLEDW---------- 790
            K+  +L+ F                    S+LKIEEKV+   G++ +             
Sbjct: 128  KVERILRAFGDVGFDKGEVGFGEIGDLGISKLKIEEKVETGIGDLGISRLKIEEKSETHI 187

Query: 791  -----VGPSNAIEGYVPQRDRSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTII 955
                 VGPSNAIEGYVPQ++R SK    K  KEGSK K+A+     + + +EMDF STII
Sbjct: 188  GDLGAVGPSNAIEGYVPQKERISKPLGSKKNKEGSKGKDAKMSSGMDIIFNEMDFMSTII 247

Query: 956  VGGQFSVPKLSS------------------GPKQNGSETKLKESEGNQFSILETCSASTQ 1081
               ++SV K+                    G  +N S  K ++S+G +   ++      +
Sbjct: 248  TSDEYSVSKIPPSVGEPDFETKFKKSKGKVGLNKNDSVKKSRQSKGGKNKNVKKDDVCIR 307

Query: 1082 NGFETK-SKEPKGKGSVNESTREVSVGXXXXXXXXXXXXXXXXXXXXXXXXXVTWADERK 1258
                T  + +    GS  E   E  V                          VTWADE  
Sbjct: 308  EVPSTSDASQTVLNGSTKEEKEEFIVEKAEQSGEALLRSSLKPSGTKKLNRSVTWADEM- 366

Query: 1259 IDSTSNGNLCEFQEMCDVQKSGE--SSSHS------------------------------ 1342
            IDST + NL E +EM  + +  +  SS H                               
Sbjct: 367  IDSTGSRNLYEVREMEQIMEYSDAFSSMHKPSVENKVGCSNTWFDEKIDSTKSKNICEVR 426

Query: 1343 NVEDFD--GSLRLT------LXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHD 1498
             V+D D  GSL L                        SGESD + AVS AGIIILPRP  
Sbjct: 427  EVQDADVLGSLDLQENEILESAEACAMALNQAAEAVASGESDVSGAVSGAGIIILPRPDG 486

Query: 1499 ADQGDFQNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWM 1678
             D+ +   D DM+E E  PL WP+K G+  S+LFD ED+W+D PPEGFS++LS FATMW 
Sbjct: 487  LDEEEPTEDVDMLESEQAPL-WPRKPGIPCSDLFDPEDSWFDAPPEGFSVTLSPFATMWN 545

Query: 1679 ALFGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALP 1858
            +LF WITSS+LAYIYGRDES HEEF  VNGREYP KIVL+ GRSSEIK+TL    ARALP
Sbjct: 546  SLFTWITSSTLAYIYGRDESFHEEFLSVNGREYPPKIVLAGGRSSEIKKTLDESFARALP 605

Query: 1859 GLVTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLI 2038
            G+V++LRLPTPISSLEQGM  +L TMSF+DA+P FRMKQW VIVLL ++ LSVCRIP L 
Sbjct: 606  GVVSELRLPTPISSLEQGMGRMLNTMSFIDAIPAFRMKQWQVIVLLFLEGLSVCRIPALT 665

Query: 2039 PHMTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            PHMTNRRML +KVL++ Q+S E+YE+MK+L+IPLG
Sbjct: 666  PHMTNRRMLFYKVLENTQISAEQYELMKDLIIPLG 700


>ref|XP_007016930.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
            gi|508787293|gb|EOY34549.1| F2P16.20-like protein isoform
            5 [Theobroma cacao]
          Length = 708

 Score =  525 bits (1351), Expect = e-146
 Identities = 294/630 (46%), Positives = 381/630 (60%), Gaps = 64/630 (10%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERCS+ N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N++L LF                  FS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDNDLGKNGDLG-FSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 827  QRDRSSKSSPLKH-----------------------------------------RKEGSK 883
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 884  SKNARAKKEAEK---VVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETKLKESE------ 1036
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1037 ---------GNQFSILETCSA-----STQNGFETKSKEPKGKGSVNESTREVSVGXXXXX 1174
                     G+  ++ E  S+     ST+N +++         S  E+ +E         
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDT-----SSAEAEKETHADKAVTS 405

Query: 1175 XXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVED 1354
                                VTWAD++K D+  NGNLCE +EM  ++   E S  +    
Sbjct: 406  SETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGG 465

Query: 1355 FDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDM 1534
             D  LR                   SG+SD TDAV E G+IILP   + D+ +   D DM
Sbjct: 466  DDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDM 525

Query: 1535 IEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLA 1714
            +EPE  P+KWPKK G+ +S++F+ ED+W+D PPEGFSL+LS+FATMW ALF WITSSSLA
Sbjct: 526  LEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLA 585

Query: 1715 YIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPI 1894
            YIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VTDLRLP PI
Sbjct: 586  YIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPI 645

Query: 1895 SSLEQGMRSLLETMSFMDALPPFRMKQWHV 1984
            S+LEQGM  L++T+SFM+ALP FRMKQW +
Sbjct: 646  STLEQGMGHLIDTISFMEALPAFRMKQWEI 675


>ref|XP_007016927.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
            gi|508787290|gb|EOY34546.1| F2P16.20-like protein isoform
            2 [Theobroma cacao]
          Length = 679

 Score =  523 bits (1348), Expect = e-145
 Identities = 294/628 (46%), Positives = 380/628 (60%), Gaps = 64/628 (10%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 55   MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 114

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERCS+ N  K
Sbjct: 115  ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 174

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N++L LF                  FS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 175  LNDILSLFGDLDLDDNDLGKNGDLG-FSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 230

Query: 827  QRDRSSKSSPLKH-----------------------------------------RKEGSK 883
            QR+  SK +P K+                                         +K GS 
Sbjct: 231  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 290

Query: 884  SKNARAKKEAEK---VVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETKLKESE------ 1036
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 291  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 350

Query: 1037 ---------GNQFSILETCSA-----STQNGFETKSKEPKGKGSVNESTREVSVGXXXXX 1174
                     G+  ++ E  S+     ST+N +++         S  E+ +E         
Sbjct: 351  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDT-----SSAEAEKETHADKAVTS 405

Query: 1175 XXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVED 1354
                                VTWAD++K D+  NGNLCE +EM  ++   E S  +    
Sbjct: 406  SETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGG 465

Query: 1355 FDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDM 1534
             D  LR                   SG+SD TDAV E G+IILP   + D+ +   D DM
Sbjct: 466  DDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDM 525

Query: 1535 IEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLA 1714
            +EPE  P+KWPKK G+ +S++F+ ED+W+D PPEGFSL+LS+FATMW ALF WITSSSLA
Sbjct: 526  LEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLA 585

Query: 1715 YIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPI 1894
            YIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VTDLRLP PI
Sbjct: 586  YIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPI 645

Query: 1895 SSLEQGMRSLLETMSFMDALPPFRMKQW 1978
            S+LEQGM  L++T+SFM+ALP FRMKQW
Sbjct: 646  STLEQGMGHLIDTISFMEALPAFRMKQW 673


>ref|XP_006841578.1| hypothetical protein AMTR_s00003p00194360 [Amborella trichopoda]
            gi|548843599|gb|ERN03253.1| hypothetical protein
            AMTR_s00003p00194360 [Amborella trichopoda]
          Length = 591

 Score =  494 bits (1272), Expect = e-137
 Identities = 282/617 (45%), Positives = 377/617 (61%), Gaps = 7/617 (1%)
 Frame = +2

Query: 308  SVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLPLE 487
            S+KDA++K+Q  LL+GI  E+QL AA +L+SR DY+D+VTER+I+NLCGYPLCN  LP +
Sbjct: 8    SLKDAIYKIQTYLLDGISKENQLLAAANLISRSDYDDVVTERTITNLCGYPLCNKYLPCD 67

Query: 488  HPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEKINEVLKL 667
             P+KGRYR+SLKEH VYDL+ET++YCS +CV+ S+ F+  L+ ERC  S+  KI E+L L
Sbjct: 68   RPKKGRYRISLKEHSVYDLKETWLYCSPECVINSQAFSKLLKPERCEFSDPGKIAEILNL 127

Query: 668  FXXXXXXXXXXXXXXXXXX----FSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVPQRD 835
            F                      FS L I EK DV  G++   D+VGP NAIEGYVP++D
Sbjct: 128  FSSPSIEESNAGGAEKNEKISLAFSSLTIHEKEDVSVGDIQSMDFVGPYNAIEGYVPRQD 187

Query: 836  RSSKSSPLKHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSE 1015
            +     P   RK GSKS  +  KK+   +  E +F STII+G      + SSG  Q  S 
Sbjct: 188  QV----PPVQRK-GSKSGKSTTKKDP--IYPETNFASTIIIG------EPSSGNLQKNSS 234

Query: 1016 TKLKESEGNQFSILETCSASTQNGFETKSKEPKGKGSVNESTREVSVGXXXXXXXXXXXX 1195
            +K      +            Q   ++  KE K + ++     + S              
Sbjct: 235  SKFVNDHVHVNVEGSKREQHAQEKSQSHPKETKLRSALKNLGAKAST------------- 281

Query: 1196 XXXXXXXXXXXXXVTWADERK--IDSTSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGSL 1369
                         V+WADE++  ++   N  L   Q +    K  ESS   +VED   S 
Sbjct: 282  -----------RTVSWADEQQTIVEGIQNMTLNNCQGIESGSKCKESSDSLSVEDTMISS 330

Query: 1370 RLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDMIEPEP 1549
            R                   SG+S+  DA SEAGI+I P P+  ++ + Q   D ++PE 
Sbjct: 331  RRASAEACASALTEAAAAVASGQSNTLDAASEAGILIFPCPNSVEEENIQKVADELKPEE 390

Query: 1550 VPLKWPKKSGVLNSELFDAE-DTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLAYIYG 1726
               KW K+  +L++  FD E D+WYD PPEGFSL+LSSFATMWMALFGW+T+SS+AYIYG
Sbjct: 391  GE-KWVKRPSLLHTGAFDTEEDSWYDAPPEGFSLTLSSFATMWMALFGWVTASSMAYIYG 449

Query: 1727 RDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPISSLE 1906
            R ES+ EEF +V+GREYP K VL DG SSEIK+TL+GCLARALPG+V +++LPTPIS+LE
Sbjct: 450  RAESAEEEFVVVDGREYPHKFVLGDGLSSEIKETLSGCLARALPGVVANIKLPTPISTLE 509

Query: 1907 QGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPHMTNRRMLLHKVLDS 2086
              +  LL+TM+F +ALPPFRMKQWHVIVLL +DALSV  +P L  H+ +RR L+HK+L+ 
Sbjct: 510  VALGRLLDTMTFTEALPPFRMKQWHVIVLLFLDALSVHIVPALEQHIASRRTLVHKMLED 569

Query: 2087 AQVSVEEYEVMKNLVIP 2137
            AQVS EEY +M++L +P
Sbjct: 570  AQVSNEEYNIMRDLFLP 586


>ref|XP_007016929.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao]
            gi|508787292|gb|EOY34548.1| F2P16.20 protein, putative
            isoform 4 [Theobroma cacao]
          Length = 607

 Score =  493 bits (1270), Expect = e-136
 Identities = 280/609 (45%), Positives = 363/609 (59%), Gaps = 64/609 (10%)
 Frame = +2

Query: 287  MEKQQSISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLC 466
            M K+QSISV +AVHK+Q  LL+GI++E QL A+GSL+SR DYED+VTER+ISN CGYPLC
Sbjct: 1    MAKEQSISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTCGYPLC 60

Query: 467  NNPLPLEHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEK 646
             NPLP E  RKGRYR+SLKEHKVYDLQETYM+CS++C++ SR FAGSLQ ERCS+ N  K
Sbjct: 61   ANPLPSEPRRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCSVLNHAK 120

Query: 647  INEVLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVP 826
            +N++L LF                  FS L+I+E  +VKA +V L    GPSNAIEGYVP
Sbjct: 121  LNDILSLFGDLDLDDNDLGKNGDLG-FSNLRIKENEEVKAEDVSL---AGPSNAIEGYVP 176

Query: 827  QRDRSSKSSPLKH-----------------------------------------RKEGSK 883
            QR+  SK +P K+                                         +K GS 
Sbjct: 177  QRELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYIISKKPGSF 236

Query: 884  SKNARAKKEAEK---VVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETKLKESE------ 1036
             +  R K  ++K   V++EMDFTS II+  ++++ K+ SG KQ+  ++ LKE E      
Sbjct: 237  KQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEVEEKGICK 296

Query: 1037 ---------GNQFSILETCSA-----STQNGFETKSKEPKGKGSVNESTREVSVGXXXXX 1174
                     G+  ++ E  S+     ST+N +++         S  E+ +E         
Sbjct: 297  DSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDT-----SSAEAEKETHADKAVTS 351

Query: 1175 XXXXXXXXXXXXXXXXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSGESSSHSNVED 1354
                                VTWAD++K D+  NGNLCE +EM  ++   E S  +    
Sbjct: 352  SETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAEDGG 411

Query: 1355 FDGSLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDFQNDEDM 1534
             D  LR                   SG+SD TDAV E G+IILP   + D+ +   D DM
Sbjct: 412  DDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDGDM 471

Query: 1535 IEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMALFGWITSSSLA 1714
            +EPE  P+KWPKK G+ +S++F+ ED+W+D PPEGFSL+LS+FATMW ALF WITSSSLA
Sbjct: 472  LEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSSLA 531

Query: 1715 YIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGLVTDLRLPTPI 1894
            YIYGRDES HEE+  +NGREYPRKI L DGRSSEIK+TLA C++RALP +VTDLRLP PI
Sbjct: 532  YIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPIPI 591

Query: 1895 SSLEQGMRS 1921
            S+LEQGM +
Sbjct: 592  STLEQGMNT 600


>ref|XP_004302308.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Fragaria vesca subsp. vesca]
          Length = 692

 Score =  493 bits (1270), Expect = e-136
 Identities = 299/693 (43%), Positives = 391/693 (56%), Gaps = 79/693 (11%)
 Frame = +2

Query: 302  SISVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLP 481
            S SV DAV+KLQ +LL+ +K  D+L+ AGS++SR DY D+VTERSI++LCGYPLC+N LP
Sbjct: 10   SKSVNDAVYKLQLALLDSVKTLDRLYLAGSIISRSDYTDVVTERSIADLCGYPLCSNALP 69

Query: 482  LE--HPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSLQAERCSISNSEKINE 655
             E    RKG YR+SLKEHKVYDL+ET +YCSS CV+ S+ FA  L  ERC + +  K+  
Sbjct: 70   PEASRTRKGHYRISLKEHKVYDLRETKLYCSSKCVIDSKAFAQGLSEERCDVLDLGKVER 129

Query: 656  VLKLFXXXXXXXXXXXXXXXXXXFSELKIEEKVDVKAGEVLLEDWVGPSNAIEGYVPQRD 835
            VL+ F                   S LKIEEK    +G+V   +  GPSNAIEGYVP+RD
Sbjct: 130  VLREFGEEKKEIGDLG-------LSSLKIEEKSGTYSGKV---EEFGPSNAIEGYVPRRD 179

Query: 836  RSSKSSPLKHRKEGSKSKNARAKKEAEKVV-DEMDFTSTIIVGGQFSVPKLSSGPKQNGS 1012
            R SK+S  K  K+GSK K+A+     ++++ ++MDF ST++   ++SV K+      N  
Sbjct: 180  RVSKASGAKKNKQGSKGKDAKPSGGGKQLILNDMDFMSTLLACDEYSVSKMPPNVADNNV 239

Query: 1013 ETKLKESEGNQ----FSILETCSASTQNGFETKSKEPKGKGSVN------ESTREVSVGX 1162
            +T+LK+S+G      FS+LET  ++T N    KS+     G +       E+  E  VG 
Sbjct: 240  DTELKKSKGKDLESGFSVLET--SATPN----KSEGVMDVGDLGMSRLKIEAEEESQVGK 293

Query: 1163 XXXXXXXXXXXXXXXXXXXXXXXXVTWADERK---------------------------- 1258
                                    VTWADE+                             
Sbjct: 294  GEKSSEGTLRSSLKHSGTKKLSRSVTWADEKSDSTGRRNLCEVRDMEDGLENPGAFDSLY 353

Query: 1259 ------------------IDSTSNGNLCEFQEMCDVQKSGESSSHSNVEDFDGSLRLTLX 1384
                              IDST   N+CE     D ++  E    S V+   G+      
Sbjct: 354  KPSSSSEAGSSFSWVDKTIDSTKCENICEVSGTHDAKEVPEVVGSSVVQ---GNEWFESA 410

Query: 1385 XXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQGDF---------------- 1516
                           +GE D +DAVS+AGIIILPR    D+ +F                
Sbjct: 411  EACAVALSEAAGAVETGEFDTSDAVSKAGIIILPRTDGVDEEEFIVDGADEEDSIEDSVD 470

Query: 1517 ----QNDEDMIEPEPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMAL 1684
                  D DM+EPE    KWPKK      +LF+ ED+W+D PP+GF+L+LS FATMW AL
Sbjct: 471  EEESTEDIDMLEPEQALSKWPKKPESSQFDLFNPEDSWFDAPPDGFNLTLSPFATMWNAL 530

Query: 1685 FGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGL 1864
            F W TSS+LAYIYG+D+S HEEF  VNGR YP KIVL+DGRSSEIK T+   L+RALP +
Sbjct: 531  FTWTTSSTLAYIYGKDDSFHEEFLNVNGRSYPHKIVLADGRSSEIKLTVGASLSRALPEI 590

Query: 1865 VTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPH 2044
            V +L L  P  +LE+GM  +L TMSF++ALP FRMKQW VI LL I+ LSVCR+P L PH
Sbjct: 591  VAELGLAVP--NLEKGMGFMLNTMSFIEALPAFRMKQWQVIALLFIEGLSVCRMPALTPH 648

Query: 2045 MTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            MTNRR+L+ +VLD A++SVEEYE+MK+ +IPLG
Sbjct: 649  MTNRRVLIQRVLDGARISVEEYEIMKDFLIPLG 681


>gb|ABR17753.1| unknown [Picea sitchensis]
          Length = 668

 Score =  406 bits (1044), Expect = e-110
 Identities = 266/693 (38%), Positives = 367/693 (52%), Gaps = 81/693 (11%)
 Frame = +2

Query: 308  SVKDAVHKLQFSLLEGIKNEDQLFAAGSLMSRGDYEDIVTERSISNLCGYPLCNNPLPL- 484
            SVKDAV+K+Q +LL+G+K E QL AA +L+S+ DYED+VTER+I NLCGYPLC+N LP  
Sbjct: 4    SVKDAVYKIQTTLLDGVKTEAQLHAAANLLSKSDYEDVVTERTIVNLCGYPLCSNKLPAS 63

Query: 485  -----EHPRKGRYRVSLKEHKVYDLQETYMYCSSDCVVKSRTFAGSL-----QAERCSIS 634
                 +  RKGRYR+SLK+HKVYDLQET++YCS+ C++ SRTF+  L      A+     
Sbjct: 64   EEQQQQRKRKGRYRISLKDHKVYDLQETWLYCSTPCLINSRTFSDCLLPPDRNADAALEW 123

Query: 635  NSEKINEVLK-------------------------------LFXXXXXXXXXXXXXXXXX 721
            NS++I  +L+                               L                  
Sbjct: 124  NSDRILHILEAVGSLSLDDAETENVSETPKNVPEPAPKKNVLEEFKEGRKNENNNNSEEK 183

Query: 722  XFSELKIEEKVDVKAGEVLL---EDWVGPSNAIEGYVPQ-----------RDRSSKSSPL 859
              SEL I E+ +    ++L+       GPS+AIEGYVPQ            D+S   SP 
Sbjct: 184  FSSELLIHEQENGSGEKILVAFDSSSAGPSDAIEGYVPQGEQRRLHLQPPADKSVSKSPK 243

Query: 860  KHRKEGSKSKNARAKKEAEKVVDEMDFTSTIIVGGQFSVPKLSSGPKQNGSETKLKESEG 1039
            K  K   KSKN+  +    K   E DF+STII+G   +   L+       S   + E   
Sbjct: 244  K--KGPKKSKNSLKRGAPRK---ESDFSSTIIIGQPCADVALNGAT----SSIVISEETL 294

Query: 1040 NQFSILETCSASTQNGFETKSKEPKGKGSVN-ESTREVSVGXXXXXXXXXXXXXXXXXXX 1216
            NQ           QN  E KS+  K + ++  +  ++++                     
Sbjct: 295  NQKDQKSERKLDLQN--ENKSEVMKLRSALKTQGVKQLN--------------------- 331

Query: 1217 XXXXXXVTWADERKIDSTSNGNLCEFQEMCDVQKSG-------ESSSHSNVEDFDG---- 1363
                  VTWADE+K + + +  + E + + +   S        ES+S S     D     
Sbjct: 332  ----RSVTWADEKKFEQSDHIEVLEKRTLDNSNTSSIVALHSLESTSQSATFGKDAESLE 387

Query: 1364 ------------SLRLTLXXXXXXXXXXXXXXXXSGESDATDAVSEAGIIILPRPHDADQ 1507
                        + RL                  SGE DA++A S+ GI I+P   D D 
Sbjct: 388  SIRAEFNEANVKASRLEAAEVFAKALTEAANAVASGEVDASEAASKVGICIIPGTDDEDP 447

Query: 1508 GDFQNDEDMIEP-EPVPLKWPKKSGVLNSELFDAEDTWYDTPPEGFSLSLSSFATMWMAL 1684
               QND + ++  +P    W      ++ E +DA + W+D PP+GFSL LS FATMWMAL
Sbjct: 448  QKTQNDVEKLDSTQPT---WTSLPSTIDEEAYDARECWFDDPPDGFSLELSPFATMWMAL 504

Query: 1685 FGWITSSSLAYIYGRDESSHEEFSLVNGREYPRKIVLSDGRSSEIKQTLAGCLARALPGL 1864
              WIT SS+A++YGRD+S  ++FS VNGREYPRKIV   G S+EI++T+A C++RALP +
Sbjct: 505  DRWITCSSVAHLYGRDDSDADDFSTVNGREYPRKIVSGGGLSTEIERTVASCISRALPAV 564

Query: 1865 VTDLRLPTPISSLEQGMRSLLETMSFMDALPPFRMKQWHVIVLLLIDALSVCRIPGLIPH 2044
            V  LRLPTPISSLEQ +   L TM+F+DA+PPFRM QW VIV+L +DALSV  IP L P 
Sbjct: 565  VQSLRLPTPISSLEQALGRFLNTMTFIDAIPPFRMNQWRVIVVLFLDALSVHHIPSLGPQ 624

Query: 2045 MTNRRMLLHKVLDSAQVSVEEYEVMKNLVIPLG 2143
            + N+R L+HKVL++A+++ EEY+ MK L+IPLG
Sbjct: 625  IMNKRPLIHKVLEAAEMTYEEYKTMKELLIPLG 657

