BLASTX nr result

ID: Cinnamomum24_contig00013487 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum24_contig00013487
         (2463 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   635   e-179
ref|XP_002280625.1| PREDICTED: putative RNA polymerase II subuni...   633   e-178
ref|XP_012072543.1| PREDICTED: putative RNA polymerase II subuni...   604   e-170
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   588   e-165
ref|XP_011087531.1| PREDICTED: putative RNA polymerase II subuni...   586   e-164
ref|XP_011087530.1| PREDICTED: putative RNA polymerase II subuni...   586   e-164
ref|XP_011087529.1| PREDICTED: putative RNA polymerase II subuni...   586   e-164
ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   574   e-160
ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobr...   570   e-159
ref|XP_012479689.1| PREDICTED: putative RNA polymerase II subuni...   560   e-156
ref|XP_010271590.1| PREDICTED: putative RNA polymerase II subuni...   558   e-156
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   557   e-155
ref|XP_012479683.1| PREDICTED: putative RNA polymerase II subuni...   555   e-155
ref|XP_010042212.1| PREDICTED: putative RNA polymerase II subuni...   552   e-154
gb|KHG00854.1| hypothetical protein F383_23706 [Gossypium arboreum]   551   e-154
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   551   e-153
gb|KHG00855.1| hypothetical protein F383_23706 [Gossypium arboreum]   546   e-152
ref|XP_012859052.1| PREDICTED: putative RNA polymerase II subuni...   546   e-152
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   545   e-152
ref|XP_009771014.1| PREDICTED: putative RNA polymerase II subuni...   543   e-151

>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  635 bits (1638), Expect = e-179
 Identities = 363/660 (55%), Positives = 450/660 (68%), Gaps = 43/660 (6%)
 Frame = -3

Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153
            PIAVKDAVHKLQL LL+GI +E QLF+AGSL+SRSDYEDVVTER+I+NLCGYPLC NSLP
Sbjct: 6    PIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNSLP 65

Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973
            S+RL RKG ++IS +E  + DL ET  YCSS C   S++F  S +EER +V +  +I  +
Sbjct: 66   SERL-RKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERINGI 124

Query: 1972 LGLFGELSLEDKG--KKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799
            L LFGE SLE      K+G +G+ ELKI +  + KAGEVS+EDWIGP NAIEGYVP+   
Sbjct: 125  LRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRDR 184

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVI------------------ 1673
            +L   P   K+ ++GSK   +     +  V+ +E  FVST+I                  
Sbjct: 185  NLK--PKNIKNHKEGSKSSNSKMDSGKNFVI-DEMDFVSTIITKDEYSISKSSKGLKDTT 241

Query: 1672 -------------IGDQLSASEASMGPQKNDSKPKANRKSKG-------KDIVEKAGKQS 1553
                         IGDQLS  E S  P +NDS+ K  R+SKG       KD    A   S
Sbjct: 242  SHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKL-RESKGRRSRVIFKDEFSTAEVPS 300

Query: 1552 ETQSRSALSKGPQGEDSVAAAVVKQNG-SQLKSALKSSGVKPLSRSVTWADEKKAENIDA 1376
                  +   G +G++        Q G ++ KS+LK SG K + RSVTWADEK  ++ D+
Sbjct: 301  VPSQSGSELNGVKGKEEYHTENAAQLGPTKPKSSLKPSGGKKVIRSVTWADEKM-DSADS 359

Query: 1375 GNLFNGQKTEEKSESIKNXXXXXXXXXXS-LRFALAEACAIALSQAAEAVASGECDAEDA 1199
             +    ++ E K E              + LRFA AEACA+ALSQAAEAVASGE D  DA
Sbjct: 360  RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAVALSQAAEAVASGETDMTDA 419

Query: 1198 ATEAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPE 1022
             +EAGI+ILP P ++DEG S +  D+ EP+   +KWP KP +  +D+FD +DSW+DTPPE
Sbjct: 420  VSEAGIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479

Query: 1021 GFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSE 842
            GFSLTLSPFATMWMALF W+++SS+AYIYGRDES  + +L VNGREYP+K+ L+DGRSSE
Sbjct: 480  GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539

Query: 841  IKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLL 662
            IKQT AGC+SR+LPG+VADLRLP P+S LEQ +GRLLDTMS+VD LP FR  QW+VIVLL
Sbjct: 540  IKQTLAGCLSRALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599

Query: 661  FIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482
            FIDALSVCRIPALT +M+S+RML  KV DAAQV  EEYEVMKDL+IPLGR P+FSAQSGG
Sbjct: 600  FIDALSVCRIPALTPHMTSRRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659


>ref|XP_002280625.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Vitis vinifera]
            gi|731415977|ref|XP_010659731.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            [Vitis vinifera] gi|731415979|ref|XP_010659732.1|
            PREDICTED: putative RNA polymerase II subunit B1 CTD
            phosphatase RPAP2 homolog [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  633 bits (1633), Expect = e-178
 Identities = 363/660 (55%), Positives = 449/660 (68%), Gaps = 43/660 (6%)
 Frame = -3

Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153
            PIAVKDAVHKLQL LL+GI +E QLF+AGSL+SRSDYEDVVTER+I+NLCGYPLC NSLP
Sbjct: 6    PIAVKDAVHKLQLFLLEGIQNENQLFAAGSLMSRSDYEDVVTERTIANLCGYPLCSNSLP 65

Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973
            S+RL RKG ++IS +E  + DL ET  YCSS C   S++F  S +EER +V +  +I  +
Sbjct: 66   SERL-RKGHYRISLKEHKVYDLHETYMYCSSGCVVNSRSFAGSLQEERCSVLNSERINGI 124

Query: 1972 LGLFGELSLEDKG--KKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799
            L LFGE SLE      K+G +G+ ELKI +  + KAGEVS+EDWIGP NAIEGYVP  + 
Sbjct: 125  LRLFGESSLESNKILGKHGDLGLSELKIRENVEKKAGEVSMEDWIGPSNAIEGYVP--QR 182

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVI------------------ 1673
            D +  P   K+R++GSK   +     +  V+ +E  FV T+I                  
Sbjct: 183  DRNLKPKNIKNRKEGSKSSNSKMDSGKNFVI-DEMDFVRTIITEDEYSISKSSKGLKDTT 241

Query: 1672 -------------IGDQLSASEASMGPQKNDSKPKANRKSKG-------KDIVEKAGKQS 1553
                         IGDQLS  E S  P +NDS+ K  R+SKG       KD    A   S
Sbjct: 242  SHAKSKEPKEKASIGDQLSMLEKSAPPIQNDSESKL-RESKGRRSRVIFKDEFSTAEVPS 300

Query: 1552 ETQSRSALSKGPQGEDSVAAAVVKQNG-SQLKSALKSSGVKPLSRSVTWADEKKAENIDA 1376
                  +   G +G++        Q G ++LKS LK SG K ++RSVTWADE K ++ D+
Sbjct: 301  VPSQSGSELNGVKGKEEYHTENAAQLGPTKLKSCLKPSGGKKVTRSVTWADE-KMDSADS 359

Query: 1375 GNLFNGQKTEEKSESIKN-XXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDA 1199
             +    ++ E K E               +LRFA AEACAIALSQAAEAVASGE D  DA
Sbjct: 360  RDFCKVRELEVKKEDPNGLGDIDVGDDDNALRFASAEACAIALSQAAEAVASGETDMTDA 419

Query: 1198 ATEAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPE 1022
             +EA I+ILP P ++DEG S +  D+ EP+   +KWP KP +  +D+FD +DSW+DTPPE
Sbjct: 420  VSEARIIILPHPRDMDEGESLKDADLLEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPE 479

Query: 1021 GFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSE 842
            GFSLTLSPFATMWMALF W+++SS+AYIYGRDES  + +L VNGREYP+K+ L+DGRSSE
Sbjct: 480  GFSLTLSPFATMWMALFAWITSSSIAYIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSE 539

Query: 841  IKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLL 662
            IKQT AGC++R+LPG+VADLRLP P+S LEQ +GRLLDTMS+VD LP FR  QW+VIVLL
Sbjct: 540  IKQTLAGCLARALPGLVADLRLPIPVSNLEQGVGRLLDTMSFVDALPSFRMKQWQVIVLL 599

Query: 661  FIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482
            FIDALSVC+IPALT +M SKRML  KV DAAQV  EEYEVMKDL+IPLGR P+FSAQSGG
Sbjct: 600  FIDALSVCQIPALTPHMISKRMLFPKVFDAAQVSAEEYEVMKDLIIPLGRVPQFSAQSGG 659


>ref|XP_012072543.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Jatropha curcas]
            gi|802599693|ref|XP_012072544.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            [Jatropha curcas] gi|802599695|ref|XP_012072546.1|
            PREDICTED: putative RNA polymerase II subunit B1 CTD
            phosphatase RPAP2 homolog [Jatropha curcas]
            gi|643730423|gb|KDP37902.1| hypothetical protein
            JCGZ_05341 [Jatropha curcas]
          Length = 654

 Score =  604 bits (1558), Expect = e-170
 Identities = 341/663 (51%), Positives = 438/663 (66%), Gaps = 39/663 (5%)
 Frame = -3

Query: 2356 MAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGY 2177
            MAKD     I+VKD VHKLQL+LL+GI +E QLF+AGSL+SRSDYEDVVTERSI+NLCGY
Sbjct: 1    MAKDQS---ISVKDTVHKLQLSLLEGIKNEDQLFTAGSLMSRSDYEDVVTERSIANLCGY 57

Query: 2176 PLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVS 1997
            PLC NSLP DR   KGR++IS +E  + DL ET  YCSS+C   S+ F  S +EER +V 
Sbjct: 58   PLCNNSLPLDR-PYKGRYRISLKEHKVYDLHETYMYCSSSCIVNSRAFAGSLQEERCSVL 116

Query: 1996 DPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEG 1820
            +P K+ E+L +F  LSL+ K   +NG +G+  LKI +K ++  GEVSLE+WIGP NAIEG
Sbjct: 117  NPMKLDEILRMFNNLSLDSKNLVENGDLGLSNLKIQEKIESNVGEVSLEEWIGPSNAIEG 176

Query: 1819 YVPRDRSDLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEAS 1640
            YVP+   D     +  K+ ++ SK      V ++E    N+  F+ST+I  D+ S S+A 
Sbjct: 177  YVPQRDRDFKG--SSFKNPKEASKAISTKPVNKQECFF-NDMDFMSTIITKDEYSISKAP 233

Query: 1639 MGP---------------------QKNDSKP------KANRKSKG---KDIVEKAGKQSE 1550
             G                      +   S P      K +RKSKG   K I+++     +
Sbjct: 234  SGSISTGSDMKLQEQRGKETHKGSEAQSSSPGKHAFVKTSRKSKGGRSKQIIKEELSDKD 293

Query: 1549 -------TQSRSALSKGPQGEDSVAAAVVKQNGSQLKSALKSSGVKPLSRSVTWADEKKA 1391
                   +Q+ S+++     E S A      + S LK +LK SG K    SVTWADEK  
Sbjct: 294  LLSASNYSQTGSSMNNAEPEEKSGAKQAANLSESMLKPSLKPSGAKKSVHSVTWADEK-F 352

Query: 1390 ENIDAGNLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECD 1211
            +N  + NL   ++ E+    ++            LRF  AEACAIALSQAAEAVASG+ D
Sbjct: 353  DNAKSRNLCEVREMEDTKSGLEILDSLENNNDNMLRFESAEACAIALSQAAEAVASGDAD 412

Query: 1210 AEDAATEAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHD 1034
              DA +EAG+++LP P  +  G+S+++ D+ E +  S+KWP KP +  +DLFD EDSW+D
Sbjct: 413  VNDAMSEAGVIVLPQPHHLAPGDSTDIADMLERESASLKWPAKPAVEQSDLFDSEDSWYD 472

Query: 1033 TPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDG 854
             PPEGFSL LSPFATMWMALF WV++SSLA+IYGRDE++ + +L VNGREYP+K+ L DG
Sbjct: 473  APPEGFSLMLSPFATMWMALFAWVTSSSLAFIYGRDETAHEDYLSVNGREYPQKIVLRDG 532

Query: 853  RSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKV 674
            RSSEIK T  GC+SR+ PGVVADLRLP PIS LEQ  GRLLDTMS+VD LPPFR  QW+V
Sbjct: 533  RSSEIKLTVEGCLSRAFPGVVADLRLPIPISTLEQGAGRLLDTMSFVDALPPFRMKQWQV 592

Query: 673  IVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSA 494
               LFI+ALSVCRIPALT YM+++RM+LH+VLD AQ+  EEYEVMKDL+IPLGR P   A
Sbjct: 593  TAFLFIEALSVCRIPALTSYMTNRRMVLHQVLDGAQISAEEYEVMKDLMIPLGRDPR--A 650

Query: 493  QSG 485
            +SG
Sbjct: 651  RSG 653


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  588 bits (1517), Expect = e-165
 Identities = 328/646 (50%), Positives = 436/646 (67%), Gaps = 31/646 (4%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            ++VKD V+KLQL+LL+GI +E QL +AGSL+SRSDYEDVV ERSISNLCGYPLC NSLPS
Sbjct: 7    VSVKDTVYKLQLSLLEGIENEDQLLAAGSLMSRSDYEDVVVERSISNLCGYPLCNNSLPS 66

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            DR   KGR++IS +E  + DL+ET  YCSS+C   S+ F+ S +E+R +V +P K+ E+L
Sbjct: 67   DR-PYKGRYRISLKEHRVYDLQETYMYCSSSCLVNSRAFSESLQEKRCSVLNPIKLNEIL 125

Query: 1969 GLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRSDL 1793
              F +L+L+ +G  ++G +G+  LKI +K++   G+VSLE+WIGP NAIEGYVP+   D 
Sbjct: 126  RKFNDLTLDSEGLGRSGDLGLSNLKIQEKSETNVGKVSLEEWIGPSNAIEGYVPQGDRDP 185

Query: 1792 SSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASE----------- 1646
            +  P+ + H+E G K      V +++    ++T F ST+I  D+ S S+           
Sbjct: 186  N--PSLKNHKE-GLKAICKKPVSKQDCFF-SDTDFTSTIITNDEYSISKGPSGLTSTASD 241

Query: 1645 ---------------ASMGPQKNDSKPKANRKSKG--KDIVEKAGKQSETQSRSALSKGP 1517
                           A +   +     KA+RKSKG  K+ V K     +    S+     
Sbjct: 242  IKLQAQTGKGHEGLNAQLSSLRKQDSIKASRKSKGRRKEKVIKEQLNFQDLPSSSYYTAE 301

Query: 1516 QGEDSVAAAVVKQNGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKS 1337
              + S A      N S LK +LKSSG K  +RSVTWADE+  +N  + NL   Q+ E+ +
Sbjct: 302  AEDISQATGAANLNESVLKPSLKSSGAKRSNRSVTWADER-VDNAGSRNLCEVQEMEQTN 360

Query: 1336 ESIK-NXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPVE 1160
            ES + +           LRF  AEACA+ALSQAAEAVASG+ D   A +EAGI++LPP +
Sbjct: 361  ESHEISESANKGDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAGIIVLPPSQ 420

Query: 1159 -VDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMW 983
             + +G + E  D+ E +  S+KWP KP +  +DLFD EDSW+D PPEGFSLTLSPFATMW
Sbjct: 421  DLGQGGNVEKNDMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSLTLSPFATMW 480

Query: 982  MALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSL 803
            MALF WV++SSLAYIYGRDES+ + +L VNGREYPRK+ L DGRSSEI+ T   C++R+ 
Sbjct: 481  MALFAWVTSSSLAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLTAESCLARTF 540

Query: 802  PGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPAL 623
            PG+VA+LRLP P+S LEQ  GRLL+TMS+VD LP FRT QW+VI LLFI+ALSVCRIPAL
Sbjct: 541  PGLVANLRLPIPVSTLEQGAGRLLETMSFVDALPAFRTKQWQVIALLFIEALSVCRIPAL 600

Query: 622  TQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485
            T YM+S+RM+LH+VLD A +  EEY++MKD ++PLGR P+  A+SG
Sbjct: 601  TSYMTSRRMVLHQVLDGAHISAEEYDIMKDFMVPLGRDPQ--ARSG 644


>ref|XP_011087531.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X3 [Sesamum indicum]
            gi|747080559|ref|XP_011087533.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            isoform X3 [Sesamum indicum]
          Length = 655

 Score =  586 bits (1510), Expect = e-164
 Identities = 328/656 (50%), Positives = 439/656 (66%), Gaps = 40/656 (6%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            + VKDAVHKLQL+LL+GI++E QL +AGSL+ RSDY+DVVTER+I N+CGYPLC NSLPS
Sbjct: 7    LTVKDAVHKLQLSLLEGINNENQLSAAGSLICRSDYQDVVTERTIINMCGYPLCSNSLPS 66

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +R  RKGR++IS +E  + DL+ET  YCSS+C   S+ F AS +EER +  +P  + EVL
Sbjct: 67   ER-PRKGRYRISLKEHKVYDLQETYLYCSSSCLINSRAFAASLQEERSSSLNPATLNEVL 125

Query: 1969 GLFGELSLE---DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799
             LF  LSL+   D G  NG +G+ ELKI +K D +AGEVSLE+WIGP NAI+GYVPR+  
Sbjct: 126  KLFDGLSLDSAVDMG--NGDLGLSELKIEEKTDTEAGEVSLEEWIGPSNAIDGYVPRNER 183

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEV-----VVGNETAFVSTVIIGDQLSASEASMG 1634
            +L   P +  + +KG++ ++  +  + +      ++ ++  F ST+I  D+ S S++   
Sbjct: 184  NLK--PKQSSNLKKGARQEQVESEYKHDPPDVADILSSDLNFTSTIITQDEYSISKSVPL 241

Query: 1633 PQKNDSKPKAN-----------------------RKSKGKDIVEKAGKQSETQSRSALSK 1523
             +  +SK K +                        KSK  D  +   K  +  S    + 
Sbjct: 242  VKDKESKGKVSINDVNSQGNQMEKPDAPLPNVQETKSKKSDKHKHVTKTDDKLSILEAAA 301

Query: 1522 GPQGEDSVAAAVVKQNGSQ-------LKSALKSSGVKPLSRSVTWADEKKAENIDAGNLF 1364
            GP   D        + G +       LKS+LK+S  K  +RSVTWAD K   + D  NL 
Sbjct: 302  GPSQNDLTKEENGHRLGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLC 359

Query: 1363 NGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEA 1187
              ++ ++ K   + +          S R A AEACA ALSQAAEAVA+G+ D  DA +EA
Sbjct: 360  EFREVKDGKGALVTSHSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEA 419

Query: 1186 GIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSL 1010
            G++ILPP  EVDE    E+ DV++ D   +KWP KP   +ADLFD EDSW+D+PPEGFSL
Sbjct: 420  GVIILPPPHEVDEAKHEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSL 479

Query: 1009 TLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQT 830
            TLSPF+TM+MALF W+++SSLAYIYG++ES  + ++ VNGREYP KV + DGRSSEIKQT
Sbjct: 480  TLSPFSTMFMALFAWITSSSLAYIYGKEESFHEEYISVNGREYPHKVVMPDGRSSEIKQT 539

Query: 829  FAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDA 650
             AGC++R+LPG+VA+LRLP P+S +EQ +GRLLDTMS++DPLP FR  QW+VIVLLF+DA
Sbjct: 540  LAGCLARALPGLVAELRLPIPMSTIEQGMGRLLDTMSFIDPLPAFRMKQWQVIVLLFLDA 599

Query: 649  LSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482
            LSV RIPALT Y+  +R+LL KVL+ AQ+  EE+E+MKDL+IPLGR P+FS QSGG
Sbjct: 600  LSVSRIPALTPYLMGRRILLPKVLEGAQISAEEFEIMKDLIIPLGRVPQFSTQSGG 655


>ref|XP_011087530.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Sesamum indicum]
          Length = 687

 Score =  586 bits (1510), Expect = e-164
 Identities = 328/651 (50%), Positives = 436/651 (66%), Gaps = 35/651 (5%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            + VKDAVHKLQL+LL+GI++E QL +AGSL+ RSDY+DVVTER+I N+CGYPLC NSLPS
Sbjct: 51   LTVKDAVHKLQLSLLEGINNENQLSAAGSLICRSDYQDVVTERTIINMCGYPLCSNSLPS 110

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +R  RKGR++IS +E  + DL+ET  YCSS+C   S+ F AS +EER +  +P  + EVL
Sbjct: 111  ER-PRKGRYRISLKEHKVYDLQETYLYCSSSCLINSRAFAASLQEERSSSLNPATLNEVL 169

Query: 1969 GLFGELSLE---DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799
             LF  LSL+   D G  NG +G+ ELKI +K D +AGEVSLE+WIGP NAI+GYVPR+  
Sbjct: 170  KLFDGLSLDSAVDMG--NGDLGLSELKIEEKTDTEAGEVSLEEWIGPSNAIDGYVPRNER 227

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQKND 1619
            +L   P +  + +KG++ ++         ++ ++  F ST+I  D+ S S++    +  +
Sbjct: 228  NLK--PKQSSNLKKGARQEQVD-------ILSSDLNFTSTIITQDEYSISKSVPLVKDKE 278

Query: 1618 SKPKAN-----------------------RKSKGKDIVEKAGKQSETQSRSALSKGPQGE 1508
            SK K +                        KSK  D  +   K  +  S    + GP   
Sbjct: 279  SKGKVSINDVNSQGNQMEKPDAPLPNVQETKSKKSDKHKHVTKTDDKLSILEAAAGPSQN 338

Query: 1507 DSVAAAVVKQNGSQ-------LKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKT 1349
            D        + G +       LKS+LK+S  K  +RSVTWAD K   + D  NL   ++ 
Sbjct: 339  DLTKEENGHRLGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLCEFREV 396

Query: 1348 EE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVIL 1172
            ++ K   + +          S R A AEACA ALSQAAEAVA+G+ D  DA +EAG++IL
Sbjct: 397  KDGKGALVTSHSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEAGVIIL 456

Query: 1171 PPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPF 995
            PP  EVDE    E+ DV++ D   +KWP KP   +ADLFD EDSW+D+PPEGFSLTLSPF
Sbjct: 457  PPPHEVDEAKHEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSLTLSPF 516

Query: 994  ATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCI 815
            +TM+MALF W+++SSLAYIYG++ES  + ++ VNGREYP KV + DGRSSEIKQT AGC+
Sbjct: 517  STMFMALFAWITSSSLAYIYGKEESFHEEYISVNGREYPHKVVMPDGRSSEIKQTLAGCL 576

Query: 814  SRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCR 635
            +R+LPG+VA+LRLP P+S +EQ +GRLLDTMS++DPLP FR  QW+VIVLLF+DALSV R
Sbjct: 577  ARALPGLVAELRLPIPMSTIEQGMGRLLDTMSFIDPLPAFRMKQWQVIVLLFLDALSVSR 636

Query: 634  IPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482
            IPALT Y+  +R+LL KVL+ AQ+  EE+E+MKDL+IPLGR P+FS QSGG
Sbjct: 637  IPALTPYLMGRRILLPKVLEGAQISAEEFEIMKDLIIPLGRVPQFSTQSGG 687


>ref|XP_011087529.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Sesamum indicum]
          Length = 699

 Score =  586 bits (1510), Expect = e-164
 Identities = 328/656 (50%), Positives = 439/656 (66%), Gaps = 40/656 (6%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            + VKDAVHKLQL+LL+GI++E QL +AGSL+ RSDY+DVVTER+I N+CGYPLC NSLPS
Sbjct: 51   LTVKDAVHKLQLSLLEGINNENQLSAAGSLICRSDYQDVVTERTIINMCGYPLCSNSLPS 110

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +R  RKGR++IS +E  + DL+ET  YCSS+C   S+ F AS +EER +  +P  + EVL
Sbjct: 111  ER-PRKGRYRISLKEHKVYDLQETYLYCSSSCLINSRAFAASLQEERSSSLNPATLNEVL 169

Query: 1969 GLFGELSLE---DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799
             LF  LSL+   D G  NG +G+ ELKI +K D +AGEVSLE+WIGP NAI+GYVPR+  
Sbjct: 170  KLFDGLSLDSAVDMG--NGDLGLSELKIEEKTDTEAGEVSLEEWIGPSNAIDGYVPRNER 227

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEV-----VVGNETAFVSTVIIGDQLSASEASMG 1634
            +L   P +  + +KG++ ++  +  + +      ++ ++  F ST+I  D+ S S++   
Sbjct: 228  NLK--PKQSSNLKKGARQEQVESEYKHDPPDVADILSSDLNFTSTIITQDEYSISKSVPL 285

Query: 1633 PQKNDSKPKAN-----------------------RKSKGKDIVEKAGKQSETQSRSALSK 1523
             +  +SK K +                        KSK  D  +   K  +  S    + 
Sbjct: 286  VKDKESKGKVSINDVNSQGNQMEKPDAPLPNVQETKSKKSDKHKHVTKTDDKLSILEAAA 345

Query: 1522 GPQGEDSVAAAVVKQNGSQ-------LKSALKSSGVKPLSRSVTWADEKKAENIDAGNLF 1364
            GP   D        + G +       LKS+LK+S  K  +RSVTWAD K   + D  NL 
Sbjct: 346  GPSQNDLTKEENGHRLGKECASGATILKSSLKTSDSKKATRSVTWADAKT--DGDGQNLC 403

Query: 1363 NGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEA 1187
              ++ ++ K   + +          S R A AEACA ALSQAAEAVA+G+ D  DA +EA
Sbjct: 404  EFREVKDGKGALVTSHSADQEVGDESYRIASAEACARALSQAAEAVATGQHDVSDAVSEA 463

Query: 1186 GIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSL 1010
            G++ILPP  EVDE    E+ DV++ D   +KWP KP   +ADLFD EDSW+D+PPEGFSL
Sbjct: 464  GVIILPPPHEVDEAKHEEIGDVTDTDPVLLKWPPKPGFSNADLFDSEDSWYDSPPEGFSL 523

Query: 1009 TLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQT 830
            TLSPF+TM+MALF W+++SSLAYIYG++ES  + ++ VNGREYP KV + DGRSSEIKQT
Sbjct: 524  TLSPFSTMFMALFAWITSSSLAYIYGKEESFHEEYISVNGREYPHKVVMPDGRSSEIKQT 583

Query: 829  FAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDA 650
             AGC++R+LPG+VA+LRLP P+S +EQ +GRLLDTMS++DPLP FR  QW+VIVLLF+DA
Sbjct: 584  LAGCLARALPGLVAELRLPIPMSTIEQGMGRLLDTMSFIDPLPAFRMKQWQVIVLLFLDA 643

Query: 649  LSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482
            LSV RIPALT Y+  +R+LL KVL+ AQ+  EE+E+MKDL+IPLGR P+FS QSGG
Sbjct: 644  LSVSRIPALTPYLMGRRILLPKVLEGAQISAEEFEIMKDLIIPLGRVPQFSTQSGG 699


>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  574 bits (1479), Expect = e-160
 Identities = 332/666 (49%), Positives = 435/666 (65%), Gaps = 50/666 (7%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            +AVKDAVHKLQL LL+GI  E QL +AGSLLSRSDY+DVVTERSI+N+CGYPLC NSLPS
Sbjct: 7    VAVKDAVHKLQLCLLEGIKDESQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSNSLPS 66

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +R +RKG ++IS +E  + DL ET  YCS+ C   S  F  S ++ER +  +P K+ +VL
Sbjct: 67   ER-SRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLNQVL 125

Query: 1969 GLFGELSLE--DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRSD 1796
             LF  L L   D  K+NG  G  +LKI +K D K GEVSLE+W+GP NAIEGYVP+   D
Sbjct: 126  NLFKGLHLHSLDDVKENGDRGSSKLKIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQ--RD 183

Query: 1795 LSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASE------ASMG 1634
             S  P   K+  KGSK K A   + + +++ NE  F ST+I  D+ S S+      A   
Sbjct: 184  RSVNPALLKNINKGSKNKHARLQDEKNMIL-NEFDFSSTIITQDEYSVSKFPAPVNADSN 242

Query: 1633 PQKNDSKPKANRKSKGKDIVE----------KAGKQSETQSRSA------------LSKG 1520
             +  +++ K   K +  D+            ++G+++E   ++             +S G
Sbjct: 243  VKFKETQAKTRYKVRDDDVYILGKQVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSSG 302

Query: 1519 PQGED--SVAAAVVKQNG---------SQLKSALKSSGVKPLSRSVTWADEKKAENIDAG 1373
            P   D  + +  ++  +G          +LKS+LKSS  K +SRSVTWADE    +ID G
Sbjct: 303  PSQHDVKNKSVLIMSDDGRKYASHGEHDKLKSSLKSSNSKKMSRSVTWADE----SIDGG 358

Query: 1372 NLFNGQKTEEKSESIK--------NXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGE 1217
                G+KTE  S+  +        +          S RF  AEACA ALSQAAEAVASG 
Sbjct: 359  I---GKKTESSSKISEYESQAYGGSASTDMEENDDSYRFESAEACAAALSQAAEAVASGS 415

Query: 1216 CDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSW 1040
             D  DA ++AGIVILPP  EVDE    E  ++ + +   +KWPRKP + + D+F+ EDSW
Sbjct: 416  -DVPDAVSKAGIVILPPSQEVDEAILQETDEMLDLETAPLKWPRKPGMPNYDVFESEDSW 474

Query: 1039 HDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALS 860
            +D+PPEGF++TLSPF TM+ +LF W+S+SSLA+IYG DES+ + +L +NGREYPRK+ LS
Sbjct: 475  YDSPPEGFNMTLSPFGTMFNSLFTWISSSSLAFIYGHDESNNEEYLSINGREYPRKIVLS 534

Query: 859  DGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQW 680
            DGRS+EIKQT AGC++R+LPG+VADLRLP PIS LEQ +  LL+TMS+VDPLP FR  QW
Sbjct: 535  DGRSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAFRMKQW 594

Query: 679  KVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEF 500
            ++IVLLF+DALSVCRIP LT YM+ +R    KVLD AQ+   EYE+MKDL+IPLGR P+F
Sbjct: 595  QLIVLLFLDALSVCRIPTLTPYMTGRRTSFPKVLDGAQISAAEYEIMKDLIIPLGRVPQF 654

Query: 499  SAQSGG 482
            S QSGG
Sbjct: 655  SMQSGG 660


>ref|XP_007016926.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
            gi|508787289|gb|EOY34545.1| F2P16.20 protein, putative
            isoform 1 [Theobroma cacao]
          Length = 739

 Score =  570 bits (1468), Expect = e-159
 Identities = 337/695 (48%), Positives = 438/695 (63%), Gaps = 69/695 (9%)
 Frame = -3

Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183
            S MAK+     I+V +AVHK+QL LLDGI  E QL ++GSL+SRSDYEDVVTER+ISN C
Sbjct: 53   SSMAKEQS---ISVSEAVHKIQLHLLDGIRDEKQLLASGSLISRSDYEDVVTERTISNTC 109

Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003
            GYPLC N LPS+   RKGR++IS +E  + DL+ET  +CS+ C   S+ F  S +EER +
Sbjct: 110  GYPLCANPLPSEP-RRKGRYRISLKEHKVYDLQETYMFCSTNCLINSRAFAGSLQEERCS 168

Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826
            V +  K+ ++L LFG+L L+D    KNG +G   L+I +  + KA +VSL    GP NAI
Sbjct: 169  VLNHAKLNDILSLFGDLDLDDNDLGKNGDLGFSNLRIKENEEVKAEDVSLA---GPSNAI 225

Query: 1825 EGYVPRDRSDLSSLPTREKHREK----------GSKPKE---------AGTV-------- 1727
            EGYVP+   +L S PT  K+ +           GSK +E         AGT+        
Sbjct: 226  EGYVPQ--RELISKPTPPKNNKNKVFDSSSSKLGSKKEEYFVNNELDFAGTIIMNDEYII 283

Query: 1726 ----------------EREEVVVGNETAFVSTVIIGDQLSASEASMGPQKN--------- 1622
                             ++E  V NE  F S +I+ D+ + S+   G +++         
Sbjct: 284  SKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMNDEYTISKMPSGSKQSCFDSNLKEV 343

Query: 1621 -------DSKPK-------ANRKSKGKDIVEKAGKQSETQSRSALSKGPQGEDSVAAAVV 1484
                   DS+ K       +  + K   IVE    ++  QS    S     +++ A   V
Sbjct: 344  EEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKNVYQSGLDTSSAEAEKETHADKAV 403

Query: 1483 KQNGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEE-KSESIKNXXXXX 1307
              + + LKS+LKS+G K L+R VTWAD+KKA+N   GNL   ++ E  K +S  +     
Sbjct: 404  TSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNGNLCEVKEMETMKGDSEISGSAED 463

Query: 1306 XXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELM 1130
                  LRF  AEACA+ALS+AAEAVASG+ D  DA  E G++ILP + EVD+    E  
Sbjct: 464  GGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDAVYENGLIILPSLCEVDKEEPMEDG 523

Query: 1129 DVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASS 950
            D+ EP+   VKWP+KP +  +D+F+ EDSW D PPEGFSLTLS FATMW ALF W+++SS
Sbjct: 524  DMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPEGFSLTLSTFATMWNALFEWITSSS 583

Query: 949  LAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPT 770
            LAYIYGRDES  + +L +NGREYPRK+AL DGRSSEIK+T A CISR+LP +V DLRLP 
Sbjct: 584  LAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSEIKETLASCISRALPAIVTDLRLPI 643

Query: 769  PISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLL 590
            PIS LEQ +G L+DT+S+++ LP FR  QW+VIVLLFIDALSVCRIPALT +M++ RMLL
Sbjct: 644  PISTLEQGMGHLIDTISFMEALPAFRMKQWQVIVLLFIDALSVCRIPALTPHMTNGRMLL 703

Query: 589  HKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485
            HKVLD AQ+  EEYEVMKDL+IPLGR P FSAQSG
Sbjct: 704  HKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 738


>ref|XP_012479689.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Gossypium raimondii]
          Length = 695

 Score =  560 bits (1442), Expect = e-156
 Identities = 336/702 (47%), Positives = 434/702 (61%), Gaps = 76/702 (10%)
 Frame = -3

Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183
            S MAKD     I+V +AVHK+QL LLDGI  E QL S+GSL+SRSDYEDVVTERSISN C
Sbjct: 6    SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVVTERSISNTC 62

Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003
            GYPLC N LPS+   R+GR++IS +E  + DL+ET ++CS+ C   S+ F  S +EER +
Sbjct: 63   GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCSADCLINSRAFAGSLQEERCS 121

Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826
            V +  K+  +L LF ++ L D+   KNG +G   LKI +  + KAGEVS    +GP NAI
Sbjct: 122  VLNHAKLNAILSLFDDVDLNDEDLGKNGDLGFSNLKIKENEEIKAGEVSS---VGPSNAI 178

Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709
            EGYVP+   +L S P+  K+ + G   S   + G ++ +  V                  
Sbjct: 179  EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYTT 236

Query: 1708 --------------------VGNETAFVSTVIIGDQLSASEASMGPQKNDSKPKANRKSK 1589
                                V NE  F S +I+ D+ + S+   G ++  S  K  +K++
Sbjct: 237  SKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKL-KKTE 295

Query: 1588 G----KDIVEKAGKQSETQSRSALSK--------------GPQGEDSVAAAVVKQ----- 1478
            G    KD  EK  +   ++S SAL+K                 G D++ A   K+     
Sbjct: 296  GQGVCKDFEEKCMR---SESSSALTKEDSGIVEMPSTKCVDQSGLDTINAEAEKETHSDK 352

Query: 1477 ----NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXX 1310
                +G  LKS+LKS+G K L+RSVTWAD+K  +    G+L   ++ + +    +N    
Sbjct: 353  AVASSGVVLKSSLKSAGAKKLNRSVTWADKKNVDGARKGSLCEVKEMDAQKGDSENLGRA 412

Query: 1309 XXXXXXS--LRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PVEVDEGNSS 1139
                     LRFA AEACA+ALS+AA AVASG+ D  DA +EAG++IL  P+E D+    
Sbjct: 413  EDGDDDDNMLRFASAEACAMALSEAAAAVASGDSDVNDAVSEAGLIILAHPLEADKEEKV 472

Query: 1138 ELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMALF 971
            E +D      EP+   VKWP KP +  +D FD EDSW D PPEGFSLTLS FATMW ALF
Sbjct: 473  ENIDTLEAEPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNALF 532

Query: 970  GWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGVV 791
             W+++SSLAYIYGRDE+  + +L VNGREYP+K+ L DGRSSEIK+T AGCISR+ P +V
Sbjct: 533  EWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKETLAGCISRAFPAIV 592

Query: 790  ADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQYM 611
              LRLP PIS LEQ +GRLLDTMS+V+ LP FR  QW+VIVLL IDALSVCRIPALT +M
Sbjct: 593  TALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVIVLLLIDALSVCRIPALTPHM 652

Query: 610  SSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485
            ++ RMLLHKVLD AQ+  EEYEVMKDL+IPLGR P FSAQSG
Sbjct: 653  TNGRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 694


>ref|XP_010271590.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Nelumbo nucifera]
            gi|720049898|ref|XP_010271591.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            [Nelumbo nucifera]
          Length = 650

 Score =  558 bits (1439), Expect = e-156
 Identities = 328/658 (49%), Positives = 430/658 (65%), Gaps = 41/658 (6%)
 Frame = -3

Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153
            P++VKDAVHKLQL+LL+GI +E QLF+AGSL+SRSDYEDVVTER I+ +CGYPLC N L 
Sbjct: 6    PLSVKDAVHKLQLSLLEGICNEDQLFAAGSLMSRSDYEDVVTERHITKVCGYPLCKNPLS 65

Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973
             +R  RKGR++IS +E  + DL+ET  YCSS C   S+ F  S   ER +VSD  KI EV
Sbjct: 66   LER-PRKGRYRISVKEHKVYDLQETYMYCSSGCLVNSRAFAGSLATERCSVSDSSKINEV 124

Query: 1972 LGLFGELSLEDKG--KKNGAMGIPELKILDKADAKA-GEVSLEDWIGPPNAIEGYVPRDR 1802
            L LF +LS +DK    + G +G  +LKI +K D    G VSLEDWIGP NAIEGYVP++ 
Sbjct: 125  LRLFEDLSSKDKEILGEEGNLGFSKLKIQEKEDVNVTGNVSLEDWIGPSNAIEGYVPKNC 184

Query: 1801 SDLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLS---ASEASMGP 1631
                      KH E+GSK K A + + ++ V   E  F ST+IIGDQ     A  AS G 
Sbjct: 185  GS--------KHLEEGSKQKIAKSKKGKDKVA-KEMDFKSTIIIGDQFKIPKAPAASNGY 235

Query: 1630 QKNDSKPKAN----------------------------RKSKGK---DIVEKAGKQSETQ 1544
            ++N  K K+                             ++S+G+   ++++  G   +T 
Sbjct: 236  EQNLGKSKSGESSCVPEEWLSILNPSPAPEKSGSGITVKESEGEISGNVLKDHGIPGKTL 295

Query: 1543 SRSALSKGPQGEDSVAAAVVK--QNG-SQLKSALKSSGVKPLSRSVTWADEKKAENIDAG 1373
            S   +S     E  +   V K  Q+G + LKS++K  G K L+R+VTWADE+++  +   
Sbjct: 296  SGQNVSDTSGQETKIKLDVGKTIQSGETALKSSIKPPGAKKLTRNVTWADERESGKVGND 355

Query: 1372 NLFNGQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAAT 1193
            NL    +T+E +  +++          +L FA AEACAIALSQAAEAVASGE D  DA +
Sbjct: 356  NLVKIAETQETA--VRSDGSNVEDEDCTLCFASAEACAIALSQAAEAVASGESDVFDAVS 413

Query: 1192 EAGIVILP-PVEVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGF 1016
            +AGIVI+P P + DEG++   +DV E +R   +WPR+ V LD   F  ED   + PP+GF
Sbjct: 414  DAGIVIMPHPPDADEGDTQGEVDVLESERIPFRWPRRRVDLDPQFFYFEDILSE-PPDGF 472

Query: 1015 SLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIK 836
            S++LSPF T+WMALFGW+++S+LAYIYGRDE+S   F  VNG+EYP KV   DGRS EIK
Sbjct: 473  SMSLSPFGTIWMALFGWITSSTLAYIYGRDENSHLEFQLVNGKEYPCKVVFRDGRSYEIK 532

Query: 835  QTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFI 656
            +T A C+SR+LPG+VAD+ LPTPIS LEQ +G LLDTM++V+ LP  R  QW VIV LF+
Sbjct: 533  ETLASCLSRALPGLVADVNLPTPISTLEQGMGCLLDTMTFVEALPSLRMKQWHVIVFLFV 592

Query: 655  DALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSGG 482
            DALSVCR+PAL   ++S+RMLL KVLD AQ+ GEEYE+MKD ++PLGR P+FS QSGG
Sbjct: 593  DALSVCRMPALNPLVTSRRMLLQKVLDGAQISGEEYELMKDHILPLGRLPQFSTQSGG 650


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  557 bits (1435), Expect = e-155
 Identities = 330/667 (49%), Positives = 427/667 (64%), Gaps = 51/667 (7%)
 Frame = -3

Query: 2332 PIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLP 2153
            PI+VKDAV KLQLALL+GI SE QLF+AGSL+SRSDYEDVVTERSI+ +C YPLC N+LP
Sbjct: 6    PISVKDAVFKLQLALLEGIQSEDQLFAAGSLISRSDYEDVVTERSITEVCSYPLCCNALP 65

Query: 2152 SDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEV 1973
            S+R  RKGR++IS +E  + DL ET  +CSS+C   SK F  S K++R    DP K+  +
Sbjct: 66   SER-PRKGRYRISLKEHKVYDLHETYMFCSSSCVVNSKAFAGSLKDKRCLALDPQKLNNI 124

Query: 1972 LGLFGELSLE--DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRS 1799
            L LFG  +LE  +   K+G +G+  L+I DK +    EVSLE W+GP NAIEGYVP+ R 
Sbjct: 125  LRLFGNSNLEPMENSGKDGELGLSSLRIQDKTETVT-EVSLEQWVGPSNAIEGYVPKKRD 183

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQK-- 1625
            + S     +K+ +KGSK    G     + ++ +E  F+ST+I+ D+ S S+ S G     
Sbjct: 184  NGSK--GSQKNTKKGSKASH-GKSNGVKNLINSEFDFMSTIIMQDEYSVSKVSSGQTDAT 240

Query: 1624 --NDSKPKANRKS----------KGKDIVEKAGKQSETQSRSALSKGPQ----------G 1511
              +  KP A  +           K  DI + +   + + + SA  K  +          G
Sbjct: 241  VDHQIKPTAILEQPKRVDHELVRKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLKG 300

Query: 1510 EDSVAAA-------------------VVKQNGS---QLKSALKSSGVKPLSRSVTWADEK 1397
            + +  AA                   + K+ GS   + KS+LKS+G K L RSVTWAD K
Sbjct: 301  KTNRVAANDDSSTSNFDPSDVEEKIQIEKEIGSCHTKPKSSLKSNGKKKLGRSVTWAD-K 359

Query: 1396 KAENIDAGNLFNGQKTEE-KSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASG 1220
            K +   + +L   ++    K ES              LR   AEACAIALSQAAEAVASG
Sbjct: 360  KIDGCGSTDLCAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASG 419

Query: 1219 ECDAEDAATEAGIVILPPVE--VDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCED 1046
            + DA DA +EAGI+ILP  E  V+E    ++ D+ E D  ++KWPRKP + D DLF  +D
Sbjct: 420  DSDAIDAVSEAGIIILPHTENAVEESTVDDV-DILETDSVTLKWPRKPGISDFDLFASDD 478

Query: 1045 SWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVA 866
            SW D PPEGFSLTLSPFAT+W A F W+++SSLAYIYGRD S  + FL V+GREYP K+ 
Sbjct: 479  SWFDAPPEGFSLTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKIV 538

Query: 865  LSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTN 686
            LSDGRSSEIKQT A C++R+LP VVA+L+LP P+S LEQ +  LLDTMS+VDPLP FR  
Sbjct: 539  LSDGRSSEIKQTLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTMSFVDPLPGFRFK 598

Query: 685  QWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQP 506
            QW+V+ LLF+DALSVCRIPAL  YM+ +R L HKVL  +Q+G EEY V+KDL++PLGR P
Sbjct: 599  QWQVVALLFVDALSVCRIPALISYMTDRRDLFHKVLSGSQIGMEEYNVLKDLIVPLGRAP 658

Query: 505  EFSAQSG 485
             FS+QSG
Sbjct: 659  HFSSQSG 665


>ref|XP_012479683.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Gossypium raimondii]
            gi|823159708|ref|XP_012479685.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            isoform X1 [Gossypium raimondii]
            gi|823159710|ref|XP_012479686.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            isoform X1 [Gossypium raimondii]
            gi|823159712|ref|XP_012479687.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            isoform X1 [Gossypium raimondii]
            gi|823159714|ref|XP_012479688.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            isoform X1 [Gossypium raimondii]
            gi|763764410|gb|KJB31664.1| hypothetical protein
            B456_005G200700 [Gossypium raimondii]
            gi|763764411|gb|KJB31665.1| hypothetical protein
            B456_005G200700 [Gossypium raimondii]
            gi|763764412|gb|KJB31666.1| hypothetical protein
            B456_005G200700 [Gossypium raimondii]
            gi|763764413|gb|KJB31667.1| hypothetical protein
            B456_005G200700 [Gossypium raimondii]
            gi|763764414|gb|KJB31668.1| hypothetical protein
            B456_005G200700 [Gossypium raimondii]
          Length = 708

 Score =  555 bits (1429), Expect = e-155
 Identities = 336/715 (46%), Positives = 434/715 (60%), Gaps = 89/715 (12%)
 Frame = -3

Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183
            S MAKD     I+V +AVHK+QL LLDGI  E QL S+GSL+SRSDYEDVVTERSISN C
Sbjct: 6    SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVVTERSISNTC 62

Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003
            GYPLC N LPS+   R+GR++IS +E  + DL+ET ++CS+ C   S+ F  S +EER +
Sbjct: 63   GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCSADCLINSRAFAGSLQEERCS 121

Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826
            V +  K+  +L LF ++ L D+   KNG +G   LKI +  + KAGEVS    +GP NAI
Sbjct: 122  VLNHAKLNAILSLFDDVDLNDEDLGKNGDLGFSNLKIKENEEIKAGEVSS---VGPSNAI 178

Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709
            EGYVP+   +L S P+  K+ + G   S   + G ++ +  V                  
Sbjct: 179  EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYLD 236

Query: 1708 ---------------------------------VGNETAFVSTVIIGDQLSASEASMGPQ 1628
                                             V NE  F S +I+ D+ + S+   G +
Sbjct: 237  FTSAVIMNNEYTTSKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSR 296

Query: 1627 KNDSKPKANRKSKG----KDIVEKAGKQSETQSRSALSK--------------GPQGEDS 1502
            +  S  K  +K++G    KD  EK  +   ++S SAL+K                 G D+
Sbjct: 297  QGSSGSKL-KKTEGQGVCKDFEEKCMR---SESSSALTKEDSGIVEMPSTKCVDQSGLDT 352

Query: 1501 VAAAVVKQ---------NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKT 1349
            + A   K+         +G  LKS+LKS+G K L+RSVTWAD+K  +    G+L   ++ 
Sbjct: 353  INAEAEKETHSDKAVASSGVVLKSSLKSAGAKKLNRSVTWADKKNVDGARKGSLCEVKEM 412

Query: 1348 EEKSESIKNXXXXXXXXXXS--LRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVI 1175
            + +    +N             LRFA AEACA+ALS+AA AVASG+ D  DA +EAG++I
Sbjct: 413  DAQKGDSENLGRAEDGDDDDNMLRFASAEACAMALSEAAAAVASGDSDVNDAVSEAGLII 472

Query: 1174 LP-PVEVDEGNSSELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSL 1010
            L  P+E D+    E +D      EP+   VKWP KP +  +D FD EDSW D PPEGFSL
Sbjct: 473  LAHPLEADKEEKVENIDTLEAEPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSL 532

Query: 1009 TLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQT 830
            TLS FATMW ALF W+++SSLAYIYGRDE+  + +L VNGREYP+K+ L DGRSSEIK+T
Sbjct: 533  TLSTFATMWNALFEWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKET 592

Query: 829  FAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDA 650
             AGCISR+ P +V  LRLP PIS LEQ +GRLLDTMS+V+ LP FR  QW+VIVLL IDA
Sbjct: 593  LAGCISRAFPAIVTALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVIVLLLIDA 652

Query: 649  LSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485
            LSVCRIPALT +M++ RMLLHKVLD AQ+  EEYEVMKDL+IPLGR P FSAQSG
Sbjct: 653  LSVCRIPALTPHMTNGRMLLHKVLDGAQISMEEYEVMKDLIIPLGRAPHFSAQSG 707


>ref|XP_010042212.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Eucalyptus grandis]
            gi|629120488|gb|KCW84978.1| hypothetical protein
            EUGRSUZ_B01798 [Eucalyptus grandis]
          Length = 672

 Score =  552 bits (1422), Expect = e-154
 Identities = 320/672 (47%), Positives = 423/672 (62%), Gaps = 56/672 (8%)
 Frame = -3

Query: 2332 PIAVKDAVHKLQLALLDGIHS-EYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSL 2156
            P++VKDAV++LQ  LLDG  + E QL +AG++LSR DYEDVV ERSI+ LCGYPLC   L
Sbjct: 8    PVSVKDAVYRLQHLLLDGAAAGEAQLLAAGAILSRRDYEDVVAERSIAGLCGYPLCATPL 67

Query: 2155 PSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWE 1976
            P+DR  RKGR++IS +E  + DL+ET  YCS  C   S+ F  S + ER AV D  K+ E
Sbjct: 68   PADR-PRKGRYRISLKEHRVYDLQETYMYCSPGCVVDSRAFAGSLQPERCAVLDLVKVEE 126

Query: 1975 VLGLFGELSLEDKGKKNGA---MGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRD 1805
            VL +FG+  L  + + +G    +G+  LKI +  + +AGEV LE+W+GP NAIEGYVPR 
Sbjct: 127  VLRVFGDKGLGSQERGDGGVGELGMSGLKIKENEEVRAGEVPLEEWVGPSNAIEGYVPRK 186

Query: 1804 RSDLSSLPTREKHREK-----GSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEAS 1640
            R D ++       R K     GSK + +   ++E  ++ N+  F S +I  D+ S S+  
Sbjct: 187  RDDKAAAAAAAASRAKKEPREGSKSRNSKPSKKE--LIFNDMDFTSIIITQDEYSISKLP 244

Query: 1639 MGPQKNDSKPKANRKSKGKDI--------------------------------------- 1577
            +   +  S  KA ++SKGK +                                       
Sbjct: 245  VNSVEEVSATKA-KESKGKKVNGKDKQSRRAVIETSSAKPGTPNINQRELKGKSHDITED 303

Query: 1576 ---VEKAGKQSETQSRSALSK--GPQGEDSVAAAVVKQNGSQLKSALKSSGVKPLSRSVT 1412
                +K    SE    ++LS   G +G D    A      ++LK +LKS+G K ++RSVT
Sbjct: 304  EYSAQKVPSPSEVCQSNSLSHFTGAEGADDDGKADGTSTETRLKPSLKSTGTKKVTRSVT 363

Query: 1411 WADEKKAENIDAGNLFN-GQKTEEKSESIKNXXXXXXXXXXSLRFALAEACAIALSQAAE 1235
            WADEK     D G+L    +  +EK   + +           +RF+ AEACA+ALSQAAE
Sbjct: 364  WADEK-VNVADGGHLCEIREMVDEKEPPLTSAIENEHDDENLMRFSSAEACAMALSQAAE 422

Query: 1234 AVASGECDAEDAATEAGIVILP-PVEVDEGNSSE-LMDVSEPDRQSVKWPRKPVLLDADL 1061
            A  SGE D  DAA   G++ILP P EVDE    E   D  E D  SVKWP+KP +  AD+
Sbjct: 423  AATSGESDVFDAA---GLIILPRPHEVDEKAPVEDNADPLEVDSASVKWPKKPGIPTADI 479

Query: 1060 FDCEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREY 881
            FD +DSW+D PP+GF++TLSPFATMW ALF W ++S+LAYIYG+DES  + ++ VNGREY
Sbjct: 480  FDADDSWYDAPPDGFNMTLSPFATMWGALFAWTTSSTLAYIYGKDESFHEEYMSVNGREY 539

Query: 880  PRKVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLP 701
            P+K+ L DGRS+EIKQT AGC+SR+LPG+++DLRLP P+S LEQ LGRLLDTM+++D LP
Sbjct: 540  PQKLVLPDGRSTEIKQTLAGCLSRALPGLISDLRLPLPVSTLEQGLGRLLDTMTFMDALP 599

Query: 700  PFRTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIP 521
              RT QW+VIVLLFIDALSVCR+P LT +MS++   L KVL AA++  EEYE+MKDLLIP
Sbjct: 600  ALRTKQWQVIVLLFIDALSVCRVPVLTAHMSNRHPSLQKVLQAARMSVEEYEIMKDLLIP 659

Query: 520  LGRQPEFSAQSG 485
            LGR P+FSAQSG
Sbjct: 660  LGRAPQFSAQSG 671


>gb|KHG00854.1| hypothetical protein F383_23706 [Gossypium arboreum]
          Length = 729

 Score =  551 bits (1420), Expect = e-154
 Identities = 332/701 (47%), Positives = 432/701 (61%), Gaps = 77/701 (10%)
 Frame = -3

Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183
            S MAKD     I+V +AVHK+QL LLDGI  E QL S+GSL+SRSDYEDV+TERSISN C
Sbjct: 6    SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVITERSISNTC 62

Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003
            GYPLC N LPS+   R+GR++IS +E  + DL+ET ++C + C   S+ F  S +EER +
Sbjct: 63   GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCLADCLINSRAFAGSLQEERCS 121

Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826
            V +  K+  +L LF ++ L DK   KNG +G   LKI +  + KAGE+S    +GP NAI
Sbjct: 122  VLNHAKLNAILSLFDDVDLNDKDLGKNGDLGFSNLKIKENEEIKAGEISS---VGPSNAI 178

Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709
            EGYVP+   +L S P+  K+ + G   S   + G ++ +  V                  
Sbjct: 179  EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYTT 236

Query: 1708 --------------------VGNETAFVSTVIIGDQLSASEASMGPQKNDSKPKANR-KS 1592
                                V NE  F S +I+ D+ + S+   G ++  S  K  + + 
Sbjct: 237  SKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKLEKTEG 296

Query: 1591 KG--KDIVEKAGKQSETQSRSALSK--------------GPQGEDSVAAAVVKQ------ 1478
            KG  KD  EK  +   ++S SAL+K                 G D++ A   K+      
Sbjct: 297  KGVCKDFEEKCMR---SESSSALTKEDSGIVQMPSTKCVDQSGLDTINAEAEKETHSDKA 353

Query: 1477 ---NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXX 1307
               +G  LKS+LK +G K L+RSVTWAD+K  ++   G+L   ++ + +    +N     
Sbjct: 354  MASSGVVLKSSLKPAGAKKLNRSVTWADKKNVDSARKGSLCEVKEMDAQKGDSENIGRAE 413

Query: 1306 XXXXXS--LRFALAEACAIALSQAAEA--VASGECDAEDAATEAGIVILP-PVEVDEGNS 1142
                    LRFA AEACA+ALS+AA A  VASG+ D  DA +EAG++ILP P+E D+   
Sbjct: 414  DGDADDKMLRFASAEACAMALSKAAAAAAVASGDSDVNDAVSEAGLIILPHPLEADKEEK 473

Query: 1141 SELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMAL 974
             E +D      EP+   VKWP KP +  +D FD EDSW D PPEGFSLTLS FATMW AL
Sbjct: 474  VENIDTLEADPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSLTLSTFATMWNAL 533

Query: 973  FGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGV 794
            F W+++SSLAYIYGRDE+  + +L VNGREYP+K+ L DGRSSEIK+T AGCISR+LP +
Sbjct: 534  FEWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKETLAGCISRALPAI 593

Query: 793  VADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQY 614
            V  LRLP PIS LEQ +GRLLDTMS+V+ LP FR  QW+V+VLL IDALSVCRIPALT +
Sbjct: 594  VTALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVLVLLLIDALSVCRIPALTPH 653

Query: 613  MSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQ 491
            M++ RMLLHKVLD AQ+  EEYEVMKDL+IPLGR P FSAQ
Sbjct: 654  MTNGRMLLHKVLDGAQISLEEYEVMKDLIIPLGRAPHFSAQ 694


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  551 bits (1419), Expect = e-153
 Identities = 330/671 (49%), Positives = 435/671 (64%), Gaps = 55/671 (8%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            +AVKDAVHKLQL LL+GI  E QL +AGSLLSRSDY+DVVTERSI+N+CGYPLC NSLPS
Sbjct: 7    VAVKDAVHKLQLCLLEGIKDENQLIAAGSLLSRSDYQDVVTERSIANMCGYPLCSNSLPS 66

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +R +RKG ++IS +E  + DL ET  YCS+ C   S  F  S ++ER +  +P K+ +VL
Sbjct: 67   ER-SRKGHYRISLKEHKVYDLHETYMYCSTNCVVNSGAFAGSLQDERSSTLNPAKLNQVL 125

Query: 1969 GLFGELSLE--DKGKKNGAMGIPELKILDKADAK-AGEVSLEDWIGPPNAIEGYVPRDRS 1799
             LF  L L   +  K+NG +G  +LKI +K D K  GEVSLE+W+GP NAIEGYVP  + 
Sbjct: 126  NLFKGLHLHSPEDVKENGDLGSSKLKIQEKVDVKGGGEVSLEEWMGPSNAIEGYVP--QR 183

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASE------ASM 1637
            D S  P   K+  KG K K A  ++ E+ ++ NE  F ST+I  D+ S S+      A  
Sbjct: 184  DRSVNPALLKNINKGFKNKHA-RLQDEKNMILNEFDFSSTIITQDEYSVSKFPAPVNAVS 242

Query: 1636 GPQKNDSKPKANRKSKGKDI----------VEKAGKQSETQSRSA------------LSK 1523
              +  +++ K   K +  D+            ++G+++E   ++             +S 
Sbjct: 243  SEKFKEAQAKTRYKVRDDDVSILGKRVDALQLRSGEETEKSDKNTRFLKVDKFNSGEVSS 302

Query: 1522 GPQGED--SVAAAVVKQNGSQ-----------LKSALKSSGVKPLSRSVTWADEKKAENI 1382
            GP   D  + +  ++  +G +           LKS+LKSS  K +S+SVTWAD    E I
Sbjct: 303  GPSQHDVKNKSVLIMSDDGRKYASHGEHDKQLLKSSLKSSNSKKMSQSVTWAD----EII 358

Query: 1381 DAGNLFNGQKTEEKSESIK--------NXXXXXXXXXXSLRFALAEACAIALSQAAEAVA 1226
            D G    G+KTE  S+  +        +          S RF  AEACA ALSQAAEAVA
Sbjct: 359  DGG---IGKKTESSSKISEYENQAYGGSASTDMEEDDDSYRFESAEACAAALSQAAEAVA 415

Query: 1225 SGECDAEDAATEAGIVILP-PVEVDEG--NSSELMDVSEPDRQSVKWPRKPVLLDADLFD 1055
            SG  D  DA ++AGIVILP   EVDE     +E++D+ EP    +KWPRKP + + D+F+
Sbjct: 416  SGS-DVPDAVSKAGIVILPTSQEVDEAILQETEMLDI-EP--APLKWPRKPGMPNYDVFE 471

Query: 1054 CEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPR 875
             ED W+D PPEGF++TLSPFATM+ +LF W+S+SSLA+IYG DE++ + +L +NGREYP 
Sbjct: 472  SEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSLAFIYGHDENNNEEYLSINGREYPH 531

Query: 874  KVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPF 695
            K+ LSDG S+EIKQT AGC++R+LPG+VADLRLP PIS LEQ +  LL+TMS+VDPLP F
Sbjct: 532  KIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVPISTLEQGMVLLLNTMSFVDPLPAF 591

Query: 694  RTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLG 515
            R  QW++IVLLF+DALSVCRIP LT YM+ +R  L KVLD AQ+   EYE+MKDL+IPLG
Sbjct: 592  RMKQWQLIVLLFLDALSVCRIPTLTPYMTGRRTSLPKVLDGAQISTAEYEIMKDLIIPLG 651

Query: 514  RQPEFSAQSGG 482
            R P+FS QSGG
Sbjct: 652  RVPQFSMQSGG 662


>gb|KHG00855.1| hypothetical protein F383_23706 [Gossypium arboreum]
          Length = 708

 Score =  546 bits (1408), Expect = e-152
 Identities = 334/714 (46%), Positives = 434/714 (60%), Gaps = 88/714 (12%)
 Frame = -3

Query: 2362 SPMAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLC 2183
            S MAKD     I+V +AVHK+QL LLDGI  E QL S+GSL+SRSDYEDV+TERSISN C
Sbjct: 6    SSMAKDQS---ISVSEAVHKIQLHLLDGIRDEKQLISSGSLISRSDYEDVITERSISNTC 62

Query: 2182 GYPLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRA 2003
            GYPLC N LPS+   R+GR++IS +E  + DL+ET ++C + C   S+ F  S +EER +
Sbjct: 63   GYPLCQNPLPSEP-RRRGRYRISLKEHRVYDLQETSRFCLADCLINSRAFAGSLQEERCS 121

Query: 2002 VSDPGKIWEVLGLFGELSLEDKGK-KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAI 1826
            V +  K+  +L LF ++ L DK   KNG +G   LKI +  + KAGE+S    +GP NAI
Sbjct: 122  VLNHAKLNAILSLFDDVDLNDKDLGKNGDLGFSNLKIKENEEIKAGEISS---VGPSNAI 178

Query: 1825 EGYVPRDRSDLSSLPTREKHREKG---SKPKEAGTVEREEVV------------------ 1709
            EGYVP+   +L S P+  K+ + G   S   + G ++ +  V                  
Sbjct: 179  EGYVPQ--RELVSKPSSSKNSKNGVFDSSSSKLGDIKGDYFVNNEIDFTSAVIMNNEYTT 236

Query: 1708 --------------------VGNETAFVSTVIIGDQLSASEASMGPQKNDSKPKANR-KS 1592
                                V NE  F S +I+ D+ + S+   G ++  S  K  + + 
Sbjct: 237  SKNPGSLRQSQRTKPSSMKDVINEMDFTSEIIMNDEYTVSKTPPGSRQGSSGSKLEKTEG 296

Query: 1591 KG--KDIVEKAGKQSETQSRSALSK--------------GPQGEDSVAAAVVKQ------ 1478
            KG  KD  EK  +   ++S SAL+K                 G D++ A   K+      
Sbjct: 297  KGVCKDFEEKCMR---SESSSALTKEDSGIVQMPSTKCVDQSGLDTINAEAEKETHSDKA 353

Query: 1477 ---NGSQLKSALKSSGVKPLSRSVTWADEKKAENIDAGNLFNGQKTEEKSESIKNXXXXX 1307
               +G  LKS+LK +G K L+RSVTWAD+K  ++   G+L   ++ + +    +N     
Sbjct: 354  MASSGVVLKSSLKPAGAKKLNRSVTWADKKNVDSARKGSLCEVKEMDAQKGDSENIGRAE 413

Query: 1306 XXXXXS--LRFALAEACAIALSQAAEA--VASGECDAEDAATEAGIVILP-PVEVDEGNS 1142
                    LRFA AEACA+ALS+AA A  VASG+ D  DA +EAG++ILP P+E D+   
Sbjct: 414  DGDADDKMLRFASAEACAMALSKAAAAAAVASGDSDVNDAVSEAGLIILPHPLEADKEEK 473

Query: 1141 SELMDV----SEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLT----------- 1007
             E +D      EP+   VKWP KP +  +D FD EDSW D PPEGFSLT           
Sbjct: 474  VENIDTLEADPEPEEGPVKWPTKPGIPRSDFFDPEDSWFDAPPEGFSLTVSLIDGQECHK 533

Query: 1006 LSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTF 827
            LS FATMW ALF W+++SSLAYIYGRDE+  + +L VNGREYP+K+ L DGRSSEIK+T 
Sbjct: 534  LSTFATMWNALFEWITSSSLAYIYGRDETFHEEYLSVNGREYPQKIVLRDGRSSEIKETL 593

Query: 826  AGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDAL 647
            AGCISR+LP +V  LRLP PIS LEQ +GRLLDTMS+V+ LP FR  QW+V+VLL IDAL
Sbjct: 594  AGCISRALPAIVTALRLPIPISTLEQGMGRLLDTMSFVEALPAFRMKQWQVLVLLLIDAL 653

Query: 646  SVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485
            SVCRIPALT +M++ RMLLHKVLD AQ+  EEYEVMKDL+IPLGR P FSAQSG
Sbjct: 654  SVCRIPALTPHMTNGRMLLHKVLDGAQISLEEYEVMKDLIIPLGRAPHFSAQSG 707


>ref|XP_012859052.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Erythranthe guttatus]
            gi|604299511|gb|EYU19406.1| hypothetical protein
            MIMGU_mgv1a003240mg [Erythranthe guttata]
          Length = 597

 Score =  546 bits (1406), Expect = e-152
 Identities = 305/623 (48%), Positives = 417/623 (66%), Gaps = 7/623 (1%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            + VKDAVHKLQL+LL+GI  E QL +AGSL+S+SDY+DVVTER+I+++CGYPLC NSLPS
Sbjct: 7    LGVKDAVHKLQLSLLEGIKHESQLIAAGSLISQSDYQDVVTERTIAHVCGYPLCVNSLPS 66

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +   RKG ++IS +E  + DL ET  YCS+ C   S+ F AS +EER +  DP KI  VL
Sbjct: 67   EP-PRKGHYRISLKEHKVYDLHETHMYCSTECLIRSRAFGASLEEERSSSLDPAKINSVL 125

Query: 1969 GLFGELSLEDKG--KKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPR-DRS 1799
             +F  LSL+      K+G +G+  LKI +K    +GE+SLE+W+GP NAI+GYVPR D++
Sbjct: 126  KMFDGLSLDSVMGLDKSGDLGLSGLKIREKMVTGSGEMSLEEWVGPSNAIDGYVPRRDQN 185

Query: 1798 DLSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQKND 1619
                 P+R+K     +KP  A T+  +         F ST+I+ D+ S S+ ++      
Sbjct: 186  SERKQPSRKKTESNHAKPNLADTLPFD-------VNFTSTIIMQDEYSVSKTAVP----- 233

Query: 1618 SKPKANRKSKGKDIVEKAGKQSETQSRSAL--SKGPQGEDSVAAAVVKQNGSQLKSALKS 1445
                  R++KGK   +   K  + +  S L  + GP   D+            LKS+LK+
Sbjct: 234  ------REAKGKVKGKMIRKSVKAEKISVLDDTAGPSQNDTTL----------LKSSLKT 277

Query: 1444 SGVKPLSRSVTWADEKKAENIDAGNLFNGQKT-EEKSESIKNXXXXXXXXXXSLRFALAE 1268
               K  +RSVTWADEK   + D  ++   ++  + K   +            S RF  AE
Sbjct: 278  LDSKKETRSVTWADEKS--DGDGKSISECREIGDNKGAVVMPHLTDEDVGDESYRFTSAE 335

Query: 1267 ACAIALSQAAEAVASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDRQSVKWP 1091
            ACA ALSQA+EAVASG+ DA DA +EAG++ILPP  EVDE    ++ +V + D   +KWP
Sbjct: 336  ACARALSQASEAVASGKTDASDAVSEAGVIILPPPHEVDEAKYEQIGEVVDVDPIELKWP 395

Query: 1090 RKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQD 911
             KP     DLFD EDSW+D+PPEGF+LTLSPF+TM+M+LF W+S+SSLAYIYG++E   +
Sbjct: 396  PKPGFSSEDLFDSEDSWYDSPPEGFNLTLSPFSTMFMSLFAWISSSSLAYIYGKEERFHE 455

Query: 910  YFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLL 731
             +L +NGREYP K+ + DGRS+E+K T AGC++R+LPG+V+++R+PTP+S +EQ +GRLL
Sbjct: 456  DYLSINGREYPPKIII-DGRSAEVKHTLAGCLARALPGLVSEIRIPTPVSTIEQGMGRLL 514

Query: 730  DTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEE 551
            DTMS+ D LP FR  QW+VI LLF+DALSV RIPAL+ YM+ +R+LL KVL+ AQ+  EE
Sbjct: 515  DTMSFTDALPGFRMKQWQVIALLFLDALSVSRIPALSPYMTGRRILLPKVLEGAQINVEE 574

Query: 550  YEVMKDLLIPLGRQPEFSAQSGG 482
            +E+MKDL+IPLGR P+FS QSGG
Sbjct: 575  FEIMKDLIIPLGRVPQFSTQSGG 597


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  545 bits (1405), Expect = e-152
 Identities = 330/706 (46%), Positives = 426/706 (60%), Gaps = 82/706 (11%)
 Frame = -3

Query: 2356 MAKDTPPPPIAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGY 2177
            MAKD       VKD ++KLQL+LLDGI +E QL +AGS++S SDYEDVVTER+I+NLCGY
Sbjct: 1    MAKDQST---VVKDTIYKLQLSLLDGIQNEDQLLAAGSIMSHSDYEDVVTERTIANLCGY 57

Query: 2176 PLCPNSLPSDRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVS 1997
            PLC NSLPSDR  +KGR++IS +E  + DL ET  YCSS+C   S+TF+ S +EER  V 
Sbjct: 58   PLCGNSLPSDR-PQKGRYRISLKEHKVYDLHETYMYCSSSCVINSRTFSGSLQEERCLVL 116

Query: 1996 DPGKIWEVLGLFGELSLEDKGK--KNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIE 1823
            +P K+ EVL LF   SL  +G   KNG +G   LKI +K +   GEVS E WIGP NAIE
Sbjct: 117  NPAKLNEVLMLFDNFSLGSEGSLGKNGDLGFSNLKIEEKTEKVEGEVSFEQWIGPSNAIE 176

Query: 1822 GYVP-RDRSD----------LSSLPTREKHR----------------------------E 1760
            GYVP RDR +           SS+ T++++                              
Sbjct: 177  GYVPQRDRLEEDFIIDDMDFTSSIITQDEYSISKTPSGLTDTNTDKKTQKPKAKGSHKGS 236

Query: 1759 KGSKPKEAGTVEREEVVVGNETAFVSTVIIG-DQLSASEASMGPQKNDSKPKANRKSKG- 1586
            KGSK K      ++E  + N+  F ST+II  D+ S S++  G     SK K  ++ +  
Sbjct: 237  KGSKAKGTKQSSKQESFI-NDMNFTSTIIITQDEYSISKSPSGLAGTTSKTKIQKQKEKV 295

Query: 1585 --------KDIVEKAG------KQSETQSRSALSKGPQGED------------------- 1505
                         K G      K  E +S+ A+      +D                   
Sbjct: 296  SQKSSENQSSATRKVGSSKTSRKVKEDRSKVAIKDELSSQDLSSPFDSCQTSSITITAEA 355

Query: 1504 ---SVAAAVVKQNGSQLKSALKSSGVKPLSRSVTWADEK--KAENIDAGNLFNGQKTEEK 1340
               SV+    K   S LK +LK+SG K L+RSVTWADEK   + + D   +   + T+  
Sbjct: 356  KEKSVSEKAAKPVESSLKPSLKTSGAKQLTRSVTWADEKVGSSGSRDLCEVRGMEDTKAG 415

Query: 1339 SESIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAVASGECDAEDAATEAGIVILP-PV 1163
             E + N            +F  AEACA ALSQAAEAVASG+ DA +A +EAG+VILP P 
Sbjct: 416  PEIVDNIDKRDDGYVS--KFESAEACAKALSQAAEAVASGDADASNALSEAGLVILPQPH 473

Query: 1162 EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDCEDSWHDTPPEGFSLTLSPFATMW 983
            ++D+G+  E +DV + +  ++KWP KP +  ++ FD E+SW+D PPEGFSL LS FAT+W
Sbjct: 474  DLDQGDPMEDVDVLDEESSTIKWPGKPGIPQSECFDPENSWYDAPPEGFSLELSSFATIW 533

Query: 982  MALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRKVALSDGRSSEIKQTFAGCISRSL 803
            MALF WV++SSLAY+YG+DESS + +L VNGREYPRK+ L DGRS EI+QT  GC+ R+ 
Sbjct: 534  MALFAWVTSSSLAYVYGKDESSHEEYLMVNGREYPRKIVLGDGRSFEIQQTIEGCLGRAF 593

Query: 802  PGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFRTNQWKVIVLLFIDALSVCRIPAL 623
            P VVADLRLP PIS LEQ    LL TMS+VD +P FR  QW+VI LLFI+ALSVCRIPAL
Sbjct: 594  PVVVADLRLPIPISTLEQGAANLLGTMSFVDAVPAFRMKQWQVIALLFIEALSVCRIPAL 653

Query: 622  TQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGRQPEFSAQSG 485
              YM ++RM    V+D  ++  EEYEVMKDL+IPLGR P+FS QSG
Sbjct: 654  ISYMDNRRM----VVDGVRMSAEEYEVMKDLMIPLGRAPQFSPQSG 695


>ref|XP_009771014.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Nicotiana sylvestris]
            gi|698557405|ref|XP_009771015.1| PREDICTED: putative RNA
            polymerase II subunit B1 CTD phosphatase RPAP2 homolog
            isoform X1 [Nicotiana sylvestris]
          Length = 664

 Score =  543 bits (1400), Expect = e-151
 Identities = 324/670 (48%), Positives = 427/670 (63%), Gaps = 54/670 (8%)
 Frame = -3

Query: 2329 IAVKDAVHKLQLALLDGIHSEYQLFSAGSLLSRSDYEDVVTERSISNLCGYPLCPNSLPS 2150
            IAVKDA+HKLQL LL+GI  E QLF+AGSLLSR DY+DVVTERSI+N+CGYPLC NSLPS
Sbjct: 7    IAVKDAIHKLQLYLLEGIKDENQLFAAGSLLSRRDYQDVVTERSIANMCGYPLCSNSLPS 66

Query: 2149 DRLARKGRFKISREEKTLLDLRETGKYCSSACHAISKTFTASFKEERRAVSDPGKIWEVL 1970
            +R  R G ++IS +E  + DL ET  YCS+ C   S  F  S ++ER +  +  K+ EVL
Sbjct: 67   ER-PRNGHYRISLKEHKVYDLHETYMYCSTNCAVNSGAFARSLQDERSSTLNTAKLNEVL 125

Query: 1969 GLFGELSLE--DKGKKNGAMGIPELKILDKADAKAGEVSLEDWIGPPNAIEGYVPRDRSD 1796
             LF  L L   +  K++G +G+ +LKI +K D K GEVS+E+W+GP +AIEGYVP+   +
Sbjct: 126  KLFVGLHLHSTEDVKESGDLGLSKLKIQEKVDVKGGEVSMEEWMGPSDAIEGYVPQRERN 185

Query: 1795 LSSLPTREKHREKGSKPKEAGTVEREEVVVGNETAFVSTVIIGDQLSASEASMGPQKNDS 1616
            L   P    + +K SK K+A  ++ E+ ++ +E  F ST+I  D  S S+         S
Sbjct: 186  LK--PALLNNIKKSSKNKQA-KLQNEKNMILHEMDFSSTIITQDGYSISKLPAPVNAVSS 242

Query: 1615 KPKANRKSKG----KDI-VEKAGKQ--------------SETQSRS---------ALSKG 1520
            K     +++     +D+ V   GKQ              +++ +RS          +S G
Sbjct: 243  KKVKEAQTRTSYEVRDVDVSILGKQVDALQLHSGEETEKTDSNNRSYKVDKFNTGEVSSG 302

Query: 1519 PQGEDSVAAAVVKQNGSQ---------------LKSALKSSGVKPLSRSVTWADEKKAEN 1385
            P   D    ++   N S                L+S+LKSS  K ++RSVTWADE    N
Sbjct: 303  PCQHDVKNKSLEVLNMSDAGREYASDDAREKQSLRSSLKSSKYKKMARSVTWADE----N 358

Query: 1384 IDAGNLFNGQKTEEKSE--------SIKNXXXXXXXXXXSLRFALAEACAIALSQAAEAV 1229
            +D G    G+ TE  SE        + ++          S RF  AEACA AL QAAEAV
Sbjct: 359  VDNGT---GKLTESSSEISEKGDQANRESGPTNMEEDDDSYRFESAEACAAALKQAAEAV 415

Query: 1228 ASGECDAEDAATEAGIVILPPV-EVDEGNSSELMDVSEPDRQSVKWPRKPVLLDADLFDC 1052
            ASG  D  DA + AGI+ILPP  EVDE    E  +V + +   +KWPRKP + + D+F+ 
Sbjct: 416  ASGS-DVPDAVSTAGIIILPPPKEVDEAVLKENDEVLDIEPAPLKWPRKPGVPNYDVFES 474

Query: 1051 EDSWHDTPPEGFSLTLSPFATMWMALFGWVSASSLAYIYGRDESSQDYFLFVNGREYPRK 872
            EDSW+D+PPEGF+L LSPF+TM+ +LF W+S+SSL++IYG DES  + +L VNG EYPRK
Sbjct: 475  EDSWYDSPPEGFNLNLSPFSTMFNSLFTWISSSSLSFIYGNDESFNEEYLSVNGSEYPRK 534

Query: 871  VALSDGRSSEIKQTFAGCISRSLPGVVADLRLPTPISFLEQFLGRLLDTMSYVDPLPPFR 692
            + LSDGRS+EIKQT A C++R+LPG+VADLRLP PIS LEQ L  L+DTMS+VDPLP FR
Sbjct: 535  IVLSDGRSTEIKQTLARCLARALPGLVADLRLPVPISVLEQGLVLLIDTMSFVDPLPAFR 594

Query: 691  TNQWKVIVLLFIDALSVCRIPALTQYMSSKRMLLHKVLDAAQVGGEEYEVMKDLLIPLGR 512
              QW++IVLLF+DALS+CRIP LT YM+ +R LL KVLD AQ+   EYE++KDL+IPLGR
Sbjct: 595  MKQWQLIVLLFLDALSICRIPVLTPYMTGRRTLLPKVLDGAQISAAEYEILKDLIIPLGR 654

Query: 511  QPEFSAQSGG 482
             P+FS QSGG
Sbjct: 655  VPQFSMQSGG 664

