BLASTX nr result

ID: Zingiber23_contig00009314 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00009314
         (3781 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006662962.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymera...   912   0.0  
gb|ABA93957.1| NLI interacting factor-like phosphatase family pr...   888   0.0  
ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal doma...   849   0.0  
gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indi...   835   0.0  
gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japo...   830   0.0  
ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [S...   823   0.0  
gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-l...   804   0.0  
gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-l...   801   0.0  
ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal doma...   779   0.0  
gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Ze...   778   0.0  
ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Popu...   759   0.0  
ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citr...   751   0.0  
ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citr...   751   0.0  
emb|CBI35661.3| unnamed protein product [Vitis vinifera]              751   0.0  
ref|XP_004980548.1| PREDICTED: RNA polymerase II C-terminal doma...   747   0.0  
ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal doma...   745   0.0  
ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus tric...   734   0.0  
ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal doma...   733   0.0  
gb|EMS65645.1| RNA polymerase II C-terminal domain phosphatase-l...   732   0.0  
ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative ...   726   0.0  

>ref|XP_006662962.1| PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II C-terminal domain
            phosphatase-like 3-like [Oryza brachyantha]
          Length = 1267

 Score =  912 bits (2356), Expect = 0.0
 Identities = 585/1242 (47%), Positives = 728/1242 (58%), Gaps = 94/1242 (7%)
 Frame = +3

Query: 336  EEKPLDLMARERPRG---GRALDGETSDVDSSESLEEISAEDFKQEGRAG---------- 476
            EE+ + LMARERPR    G    GE SD DSS SLEEISA+DFK+E   G          
Sbjct: 10   EERLVVLMARERPRSAVXGGGGGGEGSDGDSSGSLEEISADDFKKESSGGGGGAGAGAGT 69

Query: 477  --------RSRVWMGYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKG 632
                    RSRVWMGY M ++Y P+ ++FAWAQAVQNKPL     P +A D         
Sbjct: 70   GGGVAAAQRSRVWMGYSMPRSYAPAFHSFAWAQAVQNKPL----VPRAAAD--------- 116

Query: 633  KPQEEAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEK 812
               E+  + +V+ S++E +                         ET  L SD+PE +PE 
Sbjct: 117  ---EDEVEHVVDTSDEEKE----EGEIEEGEAVQSSESPPRAQPETIVLDSDVPE-KPES 168

Query: 813  KESDGEEL-----QDISEFDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMF 977
               DG  +     ++  +FD+R+  ILEEL+TI++EEAE SFE  C RLR SFE+LKP+F
Sbjct: 169  AAMDGVTIPAGAEEEDMDFDQRVGSILEELETISIEEAEKSFEGACTRLRTSFENLKPLF 228

Query: 978  TGIESSDTXXXXXXQQAVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTL 1157
                S         QQA +AI T  +  NS  M K++Q K +LL+LL HIKN+YS + T 
Sbjct: 229  PETGSPMPMLDTLVQQAFIAIDTITTVANSYDMPKREQTKNMLLKLLFHIKNRYSYMLTP 288

Query: 1158 EQVKEIDTLVNSLVFEDNKKEKADLTNGPVVSLHGSNVLEKACLVPEALD--LP------ 1313
            +Q  E+D+ V  LVFED K    D  N P  +  G+N    A    + L   LP      
Sbjct: 289  DQRNELDSRVRQLVFEDGK----DTANCPNATC-GTNTSNVAATSGQVLSERLPFESGAG 343

Query: 1314 ---------KLVLPTRSKNRVDFSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMV 1466
                     K+ +P  SKNR+  SPLLDLHADY+E+SLPSPTR++ P F +PKPIG G  
Sbjct: 344  NTFSGTSMLKVEIP--SKNRM-ISPLLDLHADYDENSLPSPTRDSTPPFPVPKPIGFGTF 400

Query: 1467 LPVSSQP-ITAKNGGEDVMLHPYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQ 1643
            L    +P I  +        +P + DALKAVSSYQ K+G  S  ++D LPSPTPS DG +
Sbjct: 401  LMAPDRPSIMERVEPVKNSSYPSLNDALKAVSSYQQKYGQKSTFASDDLPSPTPSGDGDK 460

Query: 1644 -GXXXXXXXXXXXXXCNPRAVHAATQFQMVKSRSSAPCTNSSISNQSHP--VEQL-NPMS 1811
             G                  +      QM  SR S   ++S       P   +Q+ NP+S
Sbjct: 461  SGDKGGDIFGEVSSFPASNKIVLPVVNQMPPSRPSTVSSSSDSFAGGPPGYAKQIENPVS 520

Query: 1812 --KPALKPALKRRDPRLKFVNNEVKSISDEGR---VTEPDLLKSDP--VGVATNSKKHKT 1970
                 LK   K RDPRL+F+N +   ++D  R     EP+  K     VGV  NS+KHKT
Sbjct: 521  GSNHMLKATAKSRDPRLRFLNRDAGVVADANRRLNFAEPNPSKDRTMGVGVPINSRKHKT 580

Query: 1971 -DESVARDHMMKKQRHKLTTSREMETASG-SGWS-EAKNVIPQPSFKIQVNENFQVDVRK 2141
             DE +  ++M+K+ R      R++ T +G  GW+ +  NV    S   Q N+N ++    
Sbjct: 581  VDEPLVDENMLKRSRGGNGNPRDVLTPAGRGGWAKDGVNVSSYSSDGFQPNQNTRLGNST 640

Query: 2142 SGTAAAVPDKKPIFD-NNFDELSGLR----------SIPKSNPTPTISLPSLLK--AVNP 2282
            +G+     D   + + NN    SG+           S P+++  P++SLP++LK  AVNP
Sbjct: 641  TGSHNVRTDSTLVSNTNNMTNSSGINTGVVQAPQTNSSPQTSSAPSVSLPAMLKDIAVNP 700

Query: 2283 TILMQLHQMEQQRIAA-ENQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQ 2459
            T+LMQ  QMEQQ+++A E  +K   S   ++N  +   LP + AS      K+ E+    
Sbjct: 701  TMLMQWIQMEQQKMSATEPLQKVTASVGMTSNETAGMVLPLSCAS------KTTEAAPVP 754

Query: 2460 PGISLIPSQMASSSTQPDVARIRMKPRDPRRILHDNMVQKNDGVV---YQQSKIDAAA-S 2627
               S +P Q A+  +Q D   IRMKPRDPRRILH N+ QKND V     +Q+KI+  A  
Sbjct: 755  SVRSQVPMQTAAVHSQNDAGVIRMKPRDPRRILHSNIAQKNDTVPPVGVEQAKINGTALP 814

Query: 2628 DPQSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXXCNNQPVAS--------------Q 2765
            D Q S   L    Q  + L                   N  PV++              Q
Sbjct: 815  DSQGSKDHLLNHEQQAEQLQTSALPSQPVTPSARQVTMNANPVSNSQLAATALMPHGSTQ 874

Query: 2766 ADQVIVEAASGELN--DSETTSLLVSEAG--SEKGTRQSANPWGDVDHLFDGYNDEQKAT 2933
                 V  A   L    +ET    V+  G  +       A+PWGDVDHL DGY+D+QKA 
Sbjct: 875  QTSSSVNKADPRLTAGQNETNDDAVTSTGPLTAPDAVLPASPWGDVDHLLDGYDDQQKAL 934

Query: 2934 IQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQ 3113
            IQKER+RRI EQ KMFAA+K            NSAKF EV+ +HE+ILR+KEEQD+E + 
Sbjct: 935  IQKERARRIMEQQKMFAAQKLCLVLDLDHTLLNSAKFAEVEPIHEEILRKKEEQDRERAD 994

Query: 3114 RHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGR 3293
            RHLF F HMGMWTKLRPGIWNFLEKAS LYELHLYTMGNK+YATEMA+VLDPTGTLFAGR
Sbjct: 995  RHLFCFHHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKIYATEMARVLDPTGTLFAGR 1054

Query: 3294 VISKGDDADTFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 3473
            VIS+GDD DT D DERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFP
Sbjct: 1055 VISRGDDGDTLDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFP 1114

Query: 3474 SSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQ 3653
             SRRQFGL GPSLLEID DERPEDGTLASSLAVIER+HQNFF+H +L D DVR+ILA+EQ
Sbjct: 1115 CSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHQNFFTHPNLNDADVRSILASEQ 1174

Query: 3654 RKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            ++IL GCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI
Sbjct: 1175 QRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 1216


>gb|ABA93957.1| NLI interacting factor-like phosphatase family protein, expressed
            [Oryza sativa Japonica Group]
          Length = 1272

 Score =  888 bits (2294), Expect = 0.0
 Identities = 571/1262 (45%), Positives = 718/1262 (56%), Gaps = 114/1262 (9%)
 Frame = +3

Query: 336  EEKPLDLMARERPR----------------GGRALDGETSDVDSSESLEEISAEDFKQEG 467
            EE+ + LMARERPR                GG    GE SD DSS SLEEISA+DFK+E 
Sbjct: 10   EERLVVLMARERPRSAVVAPGGDLVTAGGGGGGGGGGEGSDGDSSGSLEEISADDFKKES 69

Query: 468  RAG-----------RSRVWMGYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTD 614
             A            RSRVWMGY + ++Y P+ ++FAWAQAVQNKPL        A D  D
Sbjct: 70   SAAGGAAAAAAAQQRSRVWMGYNIPRSYAPAFHSFAWAQAVQNKPL-----VPRAADAAD 124

Query: 615  NVDAKGKPQEEAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSS--- 785
                     E+  + +V+ S++E +                       V  TT  SS   
Sbjct: 125  ---------EDEVEHVVDTSDEEKE--------------EGEIEEGEAVQTTTTSSSSPP 161

Query: 786  --------DIPENEPEKKES-----------DGEELQDISEFDKRISLILEELDTITVEE 908
                    D+  + PEK ES            G E +++ +FD+R+  ILEEL+ +++EE
Sbjct: 162  CAQPPETIDLDSDAPEKSESMVAMYGGGAAPAGAEEEEV-DFDQRVGSILEELEMVSIEE 220

Query: 909  AETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVMAIQTTYSALNSVTMQKKD 1088
            AE SFE  C RLR  FE+LKP+F    S         QQA + I T  +  NS  M K++
Sbjct: 221  AEKSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGIDTITTVANSYDMPKRE 280

Query: 1089 QNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSLVFEDNKKEKADLTNGP------VV 1250
            Q K +LL+LL HIKN+YS + T +Q  E+D+ V  LVFED K    D  NGP        
Sbjct: 281  QTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVFEDGK----DNANGPNATSTNAA 336

Query: 1251 SLHGSNVLEKACLVPEALD-LPKLVLPTRSKNRVDFSPLLDLHADYNEDSLPSPTRENLP 1427
            +  G  + E+      A +   K+ +P  +KNR+  SPLLDLHADY+E+SLPSPTR++ P
Sbjct: 337  APSGQVLSERLPFESGAGNSFSKVEIP--AKNRM-VSPLLDLHADYDENSLPSPTRDSKP 393

Query: 1428 KFSIPKPIGLGMVLPVSSQPIT------AKNGGEDVMLHPYVTDALKAVSSYQLKFGSNS 1589
             F +PKPIG G +     +P        AKN       +    DALKAV  YQ K G  S
Sbjct: 394  PFDVPKPIGYGALPMAPDRPSVLERVEPAKNSS-----YQSFNDALKAVCYYQQKHGQKS 448

Query: 1590 ILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNPRAVHAATQFQMVKSRSSAPCTNSSI 1769
              ++D LPSPTPS DG +               +     A      + SR S   +NS  
Sbjct: 449  NFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNKIALPIVNQMPSRPSTVSSNSDS 508

Query: 1770 SNQSHP-----VEQLNPMSKPALKPALKRRDPRLKFVNNEVKSISDEGR---VTEPDLLK 1925
                 P     +E     S   LK   K RDPRLKF+N +   ++D  R     EP+  K
Sbjct: 509  FAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFLNRDTGGVADANRRVNFAEPNPSK 568

Query: 1926 SDPVG--VATNSKKHKT-DESVARDHMMKKQRHKLTTSREMETASGSGWS-EAKNVIPQP 2093
               +G  V+ NS+K+K  DE +  ++ +K+ R  +   R+M+     GW+ +  N+    
Sbjct: 569  DRTMGGGVSINSRKNKAVDEPMVDENALKRSRGVIGNLRDMQPTGRGGWAKDGGNISSYS 628

Query: 2094 SFKIQVNENFQVDVRKSG-----TAAAVPDKKPIFDNNFDELSGL------RSIPKSNPT 2240
            S   Q N+N ++    +G     T + +        NN     G+       S P+++  
Sbjct: 629  SDGFQPNQNTRLGNNTTGNHNIRTDSTLASNLNNTTNNSGTSPGIVQAPQTNSAPQTSSA 688

Query: 2241 PTISLPSLLK--AVNPTILMQLHQMEQQRIAA-ENQKKNAVSTSDSANVLSVNELPGTVA 2411
            P +SLP++LK  AVNPT+LMQ  QMEQQ+++A E Q+K   S   ++NV      PG V 
Sbjct: 689  PAVSLPAMLKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTASVGMTSNVT-----PGMVL 743

Query: 2412 SINSTPLKSKESGLNQPGIS-LIPSQMASSSTQPDVARIRMKPRDPRRILHDNMVQKNDG 2588
             + + P  ++ + +  P +   +P Q A   +Q D   IRMKPRDPRRILH N+VQKND 
Sbjct: 744  PLGNAPKTTEVAAV--PSVRPQVPMQSAPMHSQNDTGVIRMKPRDPRRILHSNIVQKNDT 801

Query: 2589 V----VYQQSKIDAAASDPQSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXXCNNQPV 2756
            V    V Q      A  D QSS   L    Q  + L  +                N  PV
Sbjct: 802  VPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQKAEQLQAIALPSLPVTSSARPVTMNANPV 861

Query: 2757 ASQADQVIVEAASGELNDSETTSLLVSEAGSEKGTRQS---------------------A 2873
            ++   Q+   A      +++ TS  V++A       Q+                     A
Sbjct: 862  SNS--QLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDAATSTGPVTAPDAVPPA 919

Query: 2874 NPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAKFIEV 3053
            +P+GDVDHL DGY+D+QKA IQKER+RRI EQ+KMFAARK            NSAKFIEV
Sbjct: 920  SPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLVLDLDHTLLNSAKFIEV 979

Query: 3054 DLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYTMGNK 3233
            D +H +ILR+KEEQD+E ++RHLF F HMGMWTKLRPGIWNFLEKAS LYELHLYTMGNK
Sbjct: 980  DHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNK 1039

Query: 3234 LYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMESAVVIIDDSV 3413
            +YATEMAKVLDPTGTLFAGRVIS+GDD D FD DERVPKSKDLDGVLGMESAVVIIDDSV
Sbjct: 1040 VYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSV 1099

Query: 3414 RVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQN 3593
            RVWPHNK NLIVVERYTYFP SRRQFGL GPSLLEID DERPEDGTLASSLAVIER+H+N
Sbjct: 1100 RVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHKN 1159

Query: 3594 FFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTN 3773
            FFSH +L D DVR+ILA+EQ++IL GCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTN
Sbjct: 1160 FFSHPNLNDADVRSILASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTN 1219

Query: 3774 QI 3779
            QI
Sbjct: 1220 QI 1221


>ref|XP_003577532.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Brachypodium distachyon]
          Length = 1259

 Score =  849 bits (2193), Expect = 0.0
 Identities = 538/1241 (43%), Positives = 707/1241 (56%), Gaps = 93/1241 (7%)
 Frame = +3

Query: 336  EEKPLDLMARERPR------GGRALD----GETSDVDSSESLEEISAEDFKQEGRAG--- 476
            EE+ + LM RERPR      GG  +     GETSD DSSESLEEI+A DF++E   G   
Sbjct: 10   EERLVVLMTRERPRSAVLAPGGDLVSANGGGETSDEDSSESLEEITAADFQKESSGGAAA 69

Query: 477  ---------RSRVWMGYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAK 629
                     RSRVWMGY MS++Y P+ ++FAWAQAVQNKPL     P  A D        
Sbjct: 70   GTAASAAAQRSRVWMGYTMSRSYAPAFHSFAWAQAVQNKPL----VPPPAAD-------- 117

Query: 630  GKPQEEAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEP- 806
                E+  + IV+ S++E +                         ET  L SD+PE    
Sbjct: 118  ----EDEVEHIVDTSDEEKE----EGEIEEGEAVDTSFPSPHAQPETIDLDSDVPEKSES 169

Query: 807  ---EKKESDGEELQDISEFDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMF 977
               E   +    +++  +FD+R+  ILEEL+ +++EEAE SFE  C RLR  FE+LKP+F
Sbjct: 170  MAVEGSNTAAVAVEEEVDFDQRVGSILEELEMVSIEEAEKSFEGACERLRTCFENLKPLF 229

Query: 978  TGIESSDTXXXXXXQQAVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTL 1157
                S         QQ  + I T  +  NS  M K+ QNK++LL+LL H++N+YS + T 
Sbjct: 230  LESGSPMPMLDALVQQGFVGIDTITTVANSYAMPKRVQNKEMLLKLLFHLRNRYSDMLTP 289

Query: 1158 EQVKEIDTLVNSLVFEDNKKEKADLTNGPVVSL--HGSNVLEKACLVPE----------- 1298
            +Q  E+D+ V  L F D ++     T+GP  S   + +NV+     VP            
Sbjct: 290  DQRVELDSRVRQLAFVDGEEN----TDGPNASCSTNSTNVVVPTGQVPSERLPFESGATN 345

Query: 1299 ---ALDLPKLVLPTRSKNRVDFSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVL 1469
                  LP L   T++KNR+  SPLLDLHAD++E+SLPSPTR+N P+FS+PKPIG G   
Sbjct: 346  PFSGSSLPWL--ETQTKNRM-VSPLLDLHADHDENSLPSPTRDNAPQFSVPKPIGFGAFP 402

Query: 1470 PVSSQPITAKNGGEDVMLHPYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGX 1649
                + +T +       L+P V D+L  VSSY+ K+   S  + D LPSPTPS DG +  
Sbjct: 403  MGPDRSLTERAEPSKKNLYPSVNDSLD-VSSYKQKYSQKSNFANDDLPSPTPSGDGDKSE 461

Query: 1650 XXXXXXXXXXXX---CNPRAVHAATQFQMVKSRSSAPCTNSSISNQ---SHPVEQLNPMS 1811
                            N  A+ + +Q    +  S+   +N S S     +  +EQ     
Sbjct: 462  DKDGDMFGEISSFSSSNKTALPSVSQIPASRP-STVSSSNGSFSGPPGYAKKIEQSVSGP 520

Query: 1812 KPALKPALKRRDPRLKFVNNEVKSISDEGRVTEPDLLKSDPVGVATNSKKHKT-DESVAR 1988
              ALKP+ K RDPRL+++N +    +      EP+      +G      KHK   + +  
Sbjct: 521  NLALKPSAKSRDPRLRYLNRDPGDANRCMNFAEPNASLGGTLG------KHKAVGQPLMD 574

Query: 1989 DHMMKKQRHKLTTSREMETASGSGWSEAKNVIPQPSFKIQVNENFQVDVRKSGTAAAVPD 2168
            ++M+K+ R  +   R+++   G    +  N+   PS ++Q N+N ++D + +G      D
Sbjct: 575  ENMVKRARGSIGNPRDLQVPPGR---DGSNISFYPSDRVQSNQNTRLDTKTTGNPNLRAD 631

Query: 2169 KK------PIFDNNFDELSGLR-----SIPKSNPTPTISLPSLLK--AVNPTILMQLHQM 2309
             +       I +++      L      S+P+++  P++SLP++LK  AVNPT+LM   QM
Sbjct: 632  SQLLSNVSSITNSSVTSTKTLNAGQPDSVPQTSAAPSVSLPAVLKDIAVNPTVLMHWIQM 691

Query: 2310 EQQRIAAENQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLIPSQM 2489
            EQQ+ +A   ++   +    ++ +  N+  G V    S  LK+ ++          P+Q 
Sbjct: 692  EQQKRSASEPQQTVNTLGGISSGMINNDTAGMVIPPGSA-LKTADAAQIPSIRPQCPTQT 750

Query: 2490 ASSSTQPDVARIRMKPRDPRRILHDNMVQKND----------GVVY---QQSKIDAAASD 2630
            A   +Q D   IRMKPRDPRRILH+N   KND          G+V    Q SK +    +
Sbjct: 751  APVISQTDAGVIRMKPRDPRRILHNNTSPKNDTTNSEQARSNGIVLPVSQDSKDNMINRE 810

Query: 2631 PQSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGELN- 2807
             Q+  ++  A      +L+N+                N+Q  AS    +  +  SG +N 
Sbjct: 811  QQAEQLQTGALPSQPVSLSNIARPSTMSASMVDPVS-NSQLAASSL--MAPQQTSGSINR 867

Query: 2808 ----------DSETTSLLVSEAGSEKGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRR 2957
                      D    +   +   +  G    AN WGD+D L  GY+D+QKA IQKER+RR
Sbjct: 868  ADPRLAPGQNDPNADAATNASPATTLGAAPPANQWGDLDDLLSGYDDQQKALIQKERARR 927

Query: 2958 IAEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQH 3137
            I EQ KMF+ARK            NSAKF+EVD +HE+ILR+KEEQD+E  +RHLFR  H
Sbjct: 928  IMEQQKMFSARKLCLVLDLDHTLLNSAKFLEVDPIHEEILRKKEEQDRERPERHLFRLHH 987

Query: 3138 MGMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDA 3317
            M MWTKLRPGIWNFLEKAS LYELHLYTMGNKLYATEMAKVLDPTG LF GRVIS+G D 
Sbjct: 988  MSMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPTGALFEGRVISRGGDG 1047

Query: 3318 -------DTFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPS 3476
                   D+FD D+RVPKSKDLDGVLGMESAVVIIDDSVRVWPHNK N+IVVERYTYFP 
Sbjct: 1048 TSRGGDGDSFDSDDRVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKNNMIVVERYTYFPC 1107

Query: 3477 SRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQR 3656
            SRRQFGL GPSLLEID DERPEDGTLASSLAVI R+HQNFFSH +L D DVR+ILA+EQR
Sbjct: 1108 SRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIGRIHQNFFSHPNLNDADVRSILASEQR 1167

Query: 3657 KILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            +ILAGCRIVFSRIFPVGEANPH+HPLWQ+AEQFGAVCTNQI
Sbjct: 1168 RILAGCRIVFSRIFPVGEANPHLHPLWQSAEQFGAVCTNQI 1208


>gb|EEC68257.1| hypothetical protein OsI_36281 [Oryza sativa Indica Group]
          Length = 1255

 Score =  835 bits (2156), Expect = 0.0
 Identities = 548/1268 (43%), Positives = 681/1268 (53%), Gaps = 120/1268 (9%)
 Frame = +3

Query: 336  EEKPLDLMARERPR----------------GGRALDGETSDVDSSESLEEISAEDFKQEG 467
            EE+ + LMARERPR                GG    GE SD DSS SLEEISA+DFK+E 
Sbjct: 10   EERLVVLMARERPRSAVVAPGGDLVTAGGGGGGGGGGEGSDGDSSGSLEEISADDFKKES 69

Query: 468  RAG--------------RSRVWMGYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGD 605
             A               RSRVWMGY + ++Y P+ ++FAWAQAVQNKPL        A D
Sbjct: 70   SAAGGVAAAAAAAAAQQRSRVWMGYNIPRSYAPAFHSFAWAQAVQNKPL-----VPRAAD 124

Query: 606  LTDNVDAKGKPQEEAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSS 785
              D        ++E   V+     ++++                         ET  L S
Sbjct: 125  AAD--------EDEVEHVVDTSDEEKEEGEIEEGEAVQTTTTSSSSPPCAQPPETIDLDS 176

Query: 786  DIPENEPEKKESDG-------EELQDISEFDKRISLILEELDTITVEEAET--------- 917
            D PE        DG        E +++ +FD+R+  ILEEL+ +++EEAE          
Sbjct: 177  DAPEKSESMVAMDGGGAAPAGAEEEEV-DFDQRVGSILEELEMVSIEEAEKYGLMILLYG 235

Query: 918  ------------------------SFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQ 1025
                                    SFE  C RLR  FE+LKP+F    S         QQ
Sbjct: 236  KVHVLDVFWCMIQLLRDPILIFCRSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQ 295

Query: 1026 AVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSLVFE 1205
            A + I T  +  NS  M K++Q K +LL+LL HIKN+YS + T +Q  E+D+ V  LVFE
Sbjct: 296  AFVGIDTITTVANSYDMPKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVFE 355

Query: 1206 DNKKEKADLTNGP------VVSLHGSNVLEKACLVPEALD-LPKLVLPTRSKNRVDFSPL 1364
            D K    D  NGP        +  G  + E+      A +   K+ +P  +KNR+  SPL
Sbjct: 356  DGK----DNANGPNATSTNAAAPSGQVLSERLPFESGAGNSFSKVEIP--AKNRM-VSPL 408

Query: 1365 LDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPIT------AKNGGEDVMLH 1526
            LDLHADY+E+SLPSPTR++ P F +PKPIG G +     +P        AKN       +
Sbjct: 409  LDLHADYDENSLPSPTRDSAPPFDVPKPIGYGALPMAPDRPSVLERVEPAKNSS-----Y 463

Query: 1527 PYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNPRAVH 1706
                DALKAV  YQ K G  S  ++D LPSPTPS DG +               +     
Sbjct: 464  QSFNDALKAVCYYQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNKI 523

Query: 1707 AATQFQMVKSRSSAPCTNSSISNQSHP-----VEQLNPMSKPALKPALKRRDPRLKFVNN 1871
                   + SR S   +NS       P     +E     S   LK   K RDPRLKF+N 
Sbjct: 524  VLPIVNQMPSRPSTVSSNSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFLNR 583

Query: 1872 EVKSISDEGR---VTEPDLLKSDPVGVATNSKKHKTDESVARDHMMKKQRHKLTTSREME 2042
            +   ++D  R     EP+ LK  P  V   S                        +    
Sbjct: 584  DTGGVADANRRVNFAEPNPLKIGPWVVEYQS-----------------------IAPNQN 620

Query: 2043 TASGSGWSEAKNVIPQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSGLRSI 2222
            T  G+  +   N+    +    +N         SGT+  +            +     S 
Sbjct: 621  TRLGNNTTGNHNIRTDSTLASNLNNT----TNNSGTSPGIV-----------QAPQTNSA 665

Query: 2223 PKSNPTPTISLPSLLK--AVNPTILMQLHQMEQQRIAA-ENQKKNAVSTSDSANVLSVNE 2393
            P+++  P +SLP++LK  AVNPT+LMQ  +ME  +++A E Q+K   S   ++NV     
Sbjct: 666  PQTSSAPAVSLPAMLKDIAVNPTMLMQWIRMEHHKMSASEPQQKVTASVGMTSNVT---- 721

Query: 2394 LPGTVASINSTPLKSKESGLNQPGIS-LIPSQMASSSTQPDVARIRMKPRDPRRILHDNM 2570
             PG V  + + P  ++ + +  P +   +P Q A   +Q D   IRMKPRDPRRILH N+
Sbjct: 722  -PGMVLPLGNAPKTTEVAAV--PSVRPQVPMQSAPMHSQNDTGVIRMKPRDPRRILHSNI 778

Query: 2571 VQKNDGV----VYQQSKIDAAASDPQSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXX 2738
            VQKND V    V Q      A  D QSS   L    Q  + L  +               
Sbjct: 779  VQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQKAEQLQAIALPSLPVTSSARPVT 838

Query: 2739 CNNQPVASQADQVIVEAASGELNDSETTSLLVSEAGSEKGTRQS---------------- 2870
             N  PV++   Q+   A      +++ TS  V++A       Q+                
Sbjct: 839  MNANPVSNS--QLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDAATSTGPVTAP 896

Query: 2871 -----ANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNS 3035
                 A+P+GDVDHL DGY+D+QKA IQKER+RRI EQ+KMFAARK            NS
Sbjct: 897  DAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLVLDLDHTLLNS 956

Query: 3036 AKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHL 3215
            AKFIEVD +H +ILR+KEEQD+E ++RHLF F HMGMWTKLRPGIWNFLEKAS LYELHL
Sbjct: 957  AKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLEKASKLYELHL 1016

Query: 3216 YTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMESAVV 3395
            YTMGNK+YATEMAKVLDPTGTLFAGRVIS+GDD D FD DERVPKSKDLDGVLGMESAVV
Sbjct: 1017 YTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVV 1076

Query: 3396 IIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVI 3575
            IIDDSVRVWPHNK NLIVVERYTYFP SRRQFGL GPSLLEID DERPEDGTLASSL VI
Sbjct: 1077 IIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLTVI 1136

Query: 3576 ERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQF 3755
            ER+H+NFFSH +L D DVR+ILA+EQ++IL GCRIVFSRIFPVGEANPHMHPLWQTAEQF
Sbjct: 1137 ERIHKNFFSHPNLNDADVRSILASEQQRILGGCRIVFSRIFPVGEANPHMHPLWQTAEQF 1196

Query: 3756 GAVCTNQI 3779
            GAVCTNQI
Sbjct: 1197 GAVCTNQI 1204


>gb|EEE52187.1| hypothetical protein OsJ_34058 [Oryza sativa Japonica Group]
          Length = 1267

 Score =  830 bits (2143), Expect = 0.0
 Identities = 554/1277 (43%), Positives = 697/1277 (54%), Gaps = 136/1277 (10%)
 Frame = +3

Query: 357  MARERPR----------------GGRALDGETSDVDSSESLEEISAEDFKQEGRAGRSRV 488
            MARERPR                GG    GE SD DSS SLEEISA+DFK+E  A  S  
Sbjct: 1    MARERPRSAVVAPGGDLVTAGGGGGGGGGGEGSDGDSSGSLEEISADDFKKESSAAGSAA 60

Query: 489  WMGYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVE 668
                   ++           +AVQNKP+       SA D  D         E+  + +V+
Sbjct: 61   AAAAAQQRS-----------RAVQNKPV-----VPSAADAAD---------EDEVEHVVD 95

Query: 669  DSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSS-----------DIPENEPEKK 815
             S++E +                       V  TT  SS           D+  + PEK 
Sbjct: 96   TSDEEKE--------------EGEIEEGEAVQTTTTSSSSPPCAQPPETIDLDSDAPEKS 141

Query: 816  ES-----------DGEELQDISEFDKRISLILEELDTITVEEAET--------------- 917
            ES            G E +++ +FD+R+  ILEEL+ +++EEAE                
Sbjct: 142  ESMVAMYGGGAAPAGAEEEEV-DFDQRVGSILEELEMVSIEEAEKYGLMILLYGKVHVLD 200

Query: 918  ------------------SFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVMAIQ 1043
                              SFE  C RLR  FE+LKP+F    S         QQA + I 
Sbjct: 201  VFWCMIQLLRDPILMFCRSFEGACTRLRTCFENLKPLFPESGSPMPMLDALVQQAFVGID 260

Query: 1044 TTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSLVFEDNKKEK 1223
            T  +  NS  M K++Q K +LL+LL HIKN+YS + T +Q  E+D+ V  LVFED K   
Sbjct: 261  TITTVANSYDMPKREQTKNMLLKLLFHIKNRYSDMLTPDQRDELDSRVRQLVFEDGK--- 317

Query: 1224 ADLTNGP------VVSLHGSNVLEKACLVPEALD-LPKLVLPTRSKNRVDFSPLLDLHAD 1382
             D  NGP        +  G  + E+      A +   K+ +P  +KNR+  SPLLDLHAD
Sbjct: 318  -DNANGPNATSTNAAAPSGQVLSERLPFESGAGNSFSKVEIP--AKNRM-VSPLLDLHAD 373

Query: 1383 YNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPIT------AKNGGEDVMLHPYVTDA 1544
            Y+E+SLPSPTR++ P F +PKPIG G +     +P        AKN       +    DA
Sbjct: 374  YDENSLPSPTRDSKPPFDVPKPIGYGALPMAPDRPSVLERVEPAKNSS-----YQSFNDA 428

Query: 1545 LKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNPRAVHAATQFQ 1724
            LKAV  YQ K G  S  ++D LPSPTPS DG +               +     A     
Sbjct: 429  LKAVCYYQQKHGQKSNFASDDLPSPTPSGDGDKSGDKGGDVFGEVSSFSASNKIALPIVN 488

Query: 1725 MVKSRSSAPCTNSSISNQSHP-----VEQLNPMSKPALKPALKRRDPRLKFVNNEVKSIS 1889
             + SR S   +NS       P     +E     S   LK   K RDPRLKF+N +   ++
Sbjct: 489  QMPSRPSTVSSNSDSFAGGPPGYAKQIENSVSGSNHLLKATAKSRDPRLKFLNRDTGGVA 548

Query: 1890 DEGR---VTEPDLLKSDPVG--VATNSKKHKT-DESVARDHMMKKQRHKLTTSREMETAS 2051
            D  R     EP+  K   +G  V+ NS+K+K  DE +  ++ +K+ R  +   R+M+   
Sbjct: 549  DANRRVNFAEPNPSKDRTMGGGVSINSRKNKAVDEPMVDENALKRSRGVIGNLRDMQPTG 608

Query: 2052 GSGWS-EAKNVIPQPSFKIQVNENFQVDVRKSG-----TAAAVPDKKPIFDNNFDELSGL 2213
              GW+ +  N+    S   Q N+N ++    +G     T + +        NN     G+
Sbjct: 609  RGGWAKDGGNISSYSSDGFQPNQNTRLGNNTTGNHNIRTDSTLASNLNNTTNNSGTSPGI 668

Query: 2214 ------RSIPKSNPTPTISLPSLLK--AVNPTILMQLHQMEQQRIAA-ENQKKNAVSTSD 2366
                   S P+++  P +SLP++LK  AVNPT+LMQ  QMEQQ+++A E Q+K   S   
Sbjct: 669  VQAPQTNSAPQTSSAPAVSLPAMLKDIAVNPTMLMQWIQMEQQKMSASEPQQKVTASVGM 728

Query: 2367 SANVLSVNELPGTVASINSTPLKSKESGLNQPGIS-LIPSQMASSSTQPDVARIRMKPRD 2543
            ++NV      PG V  + + P  ++ + +  P +   +P Q A   +Q D   IRMKPRD
Sbjct: 729  TSNVT-----PGMVLPLGNAPKTTEVAAV--PSVRPQVPMQSAPMHSQNDTGVIRMKPRD 781

Query: 2544 PRRILHDNMVQKNDGV----VYQQSKIDAAASDPQSSLVRLAAPLQLTKNLANVLXXXXX 2711
            PRRILH N+VQKND V    V Q      A  D QSS   L    Q  + L  +      
Sbjct: 782  PRRILHSNIVQKNDTVPPVGVEQAKSNGTAPPDSQSSKDHLLNQDQKAEQLQAIALPSLP 841

Query: 2712 XXXXXXXXXCNNQPVASQADQVIVEAASGELNDSETTSLLVSEAGSEKGTRQS------- 2870
                      N  PV++   Q+   A      +++ TS  V++A       Q+       
Sbjct: 842  VTSSARPVTMNANPVSNS--QLAATALMPPHGNTKQTSSSVNKADPRLAAGQNESNDDAA 899

Query: 2871 --------------ANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXX 3008
                          A+P+GDVDHL DGY+D+QKA IQKER+RRI EQ+KMFAARK     
Sbjct: 900  TSTGPVTAPDAVPPASPYGDVDHLLDGYDDQQKALIQKERARRIKEQHKMFAARKLCLVL 959

Query: 3009 XXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEK 3188
                   NSAKFIEVD +H +ILR+KEEQD+E ++RHLF F HMGMWTKLRPGIWNFLEK
Sbjct: 960  DLDHTLLNSAKFIEVDHIHGEILRKKEEQDRERAERHLFCFNHMGMWTKLRPGIWNFLEK 1019

Query: 3189 ASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDG 3368
            AS LYELHLYTMGNK+YATEMAKVLDPTGTLFAGRVIS+GDD D FD DERVPKSKDLDG
Sbjct: 1020 ASKLYELHLYTMGNKVYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDG 1079

Query: 3369 VLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDG 3548
            VLGMESAVVIIDDSVRVWPHNK NLIVVERYTYFP SRRQFGL GPSLLEID DERPEDG
Sbjct: 1080 VLGMESAVVIIDDSVRVWPHNKHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDG 1139

Query: 3549 TLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMH 3728
            TLASSLAVIER+H+NFFSH +L D DVR+ILA+EQ++IL GCRIVFSRIFPVGEANPHMH
Sbjct: 1140 TLASSLAVIERIHKNFFSHPNLNDADVRSILASEQQRILGGCRIVFSRIFPVGEANPHMH 1199

Query: 3729 PLWQTAEQFGAVCTNQI 3779
            PLWQTAEQFGAVCTNQI
Sbjct: 1200 PLWQTAEQFGAVCTNQI 1216


>ref|XP_002449554.1| hypothetical protein SORBIDRAFT_05g019010 [Sorghum bicolor]
            gi|241935397|gb|EES08542.1| hypothetical protein
            SORBIDRAFT_05g019010 [Sorghum bicolor]
          Length = 1197

 Score =  823 bits (2126), Expect = 0.0
 Identities = 524/1224 (42%), Positives = 703/1224 (57%), Gaps = 83/1224 (6%)
 Frame = +3

Query: 357  MARERPR------GGRALD----GETSDVDSSESLEEISAEDFKQEGRAG---------- 476
            MARERPR      GG  +     GE SD DS+ S+EEISA+DF+++  +           
Sbjct: 1    MARERPRSTVVAAGGDLVTAPGGGEGSDGDSAGSIEEISADDFRKDSSSALGGPAAAAAA 60

Query: 477  --RSRVWMGYP----MSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDN-VDAKGK 635
              RSR W+G P    M++N+G +  +FAW+QAV+NKPLGL   P S  D  ++ VDA   
Sbjct: 61   GQRSRSWVGPPAVGYMARNFGHAFNSFAWSQAVRNKPLGLQPPPASDEDEVEHAVDASDG 120

Query: 636  PQEEAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKK 815
             +EE       +  + + V                      ++++  L+  +P +  E++
Sbjct: 121  EKEEG------EIEEGEAVEAEASPARAQPETIDLDADADALEKSESLAGAVPASAAEEE 174

Query: 816  ESDGEELQDISEFDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESS 995
            E +          D+R+  ILEEL+ +++EEAE SFE  C RL   FE+LKP+F  +E+ 
Sbjct: 175  EVN---------LDQRVGSILEELEMVSIEEAEKSFEGACGRLHTCFENLKPLFQELENG 225

Query: 996  D--TXXXXXXQQAVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVK 1169
                      QQA + I T  +   S  + + +QNK  LL+ L HIKN+YS + T EQ  
Sbjct: 226  SPMAILEPLMQQAFIGIDTLTTVAISYNLPRSEQNKTTLLKSLFHIKNRYSDMLTPEQRD 285

Query: 1170 EIDTLVNSLVF--EDNKKEKAD---------LTNGPVVSLHGSNVLEKACLVPEALDLPK 1316
            E+D+ V  LVF  +DN  + +          L     VS  G    E     P +  LP+
Sbjct: 286  ELDSRVRKLVFGEKDNVSDPSTSSGTNAINVLAPSGQVSSSGGLPFESGAANPFS-SLPR 344

Query: 1317 LVLPTRSKNRVDFSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPITA 1496
            L +P +       SPLLDLHADY+E+SLPSPTR+N P F +PKPIG G   P+  + ++ 
Sbjct: 345  LEVPAKR-----ISPLLDLHADYDENSLPSPTRDNAPPFPVPKPIGFG-AFPMVPEKLSF 398

Query: 1497 KNGGEDVM--LHPYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXX 1670
                E     L+P + D LKAVSSYQ K+G  S+  +D LPSPTPS D G+         
Sbjct: 399  PERVEPAKNSLYPSLNDPLKAVSSYQQKYGQKSVFPSDDLPSPTPSGDEGKSADKGGDIF 458

Query: 1671 XXXXXCN-PRAVHAATQFQMVKSRSSAPCTNSSISNQSHP------VEQLNPMSKP--AL 1823
                    P+++   +  QM  S+ S   ++S IS  S P      +EQ  P++ P  A+
Sbjct: 459  SEVSSFPVPKSIALPSTSQMPASQPST-VSSSGISYASGPPGFAKQIEQ--PVAGPNHAI 515

Query: 1824 KPALKRRDPRLKFVNNEVKSISDEGRVTEPDLLKSDPVGVAT-NSKKHKT-DESVARDHM 1997
            K A K RDPRL+F+N +    +D  R      LK   +G A+  ++KHK  D+    +++
Sbjct: 516  KAASKSRDPRLRFLNRDSAGATDVNRRANFSELKDGNLGGASVGNRKHKAIDDPQVDENV 575

Query: 1998 MKKQRHKLTTSREMETASGSGWSEAKNVIPQPSFKIQVNENFQVDVRKSGTAAAVPDKKP 2177
            +K+ R      R+++                       N N  +++R    ++ +     
Sbjct: 576  LKRFRGGTANPRDLQPTG--------------------NPNQLMNIRAPTNSSGI----- 610

Query: 2178 IFDNNFDELSGLRSI-PKSNPTPTISLPSLLK---AVNPTILMQLHQMEQQRIAAENQKK 2345
                N   L   ++  P  +  P + +PS+L    AVNPT+LM L QME       +QKK
Sbjct: 611  ----NMKTLQPPQTTAPHVSAAPAVPVPSMLLKDIAVNPTLLMHLIQME-------HQKK 659

Query: 2346 NAVSTSDS-ANVLSVNELPGTVASINSTPLKSKESGLNQPGISLIPSQMASSSTQPDVAR 2522
            +A  T    ++ +S N + G V +  + P K+ E+         +P+Q  S ++Q D   
Sbjct: 660  SASETQGGMSSGMSNNGIAGMVFTPGNAP-KTTEAAQVPSVRPQVPAQTPSLNSQNDGGI 718

Query: 2523 IRMKPRDPRRILHDNMVQKNDGVVYQQSKI------DAAASDPQSSLV--------RLAA 2660
            +RMKPRDPRRILH+N+ QK+D +V +Q K       D+  +  Q+S +         +A 
Sbjct: 719  LRMKPRDPRRILHNNVAQKSDAMVLEQVKTNGITQPDSQGTKDQTSSMPSQPTLPSSVAR 778

Query: 2661 PLQLTKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASG-----------ELN 2807
            P   TK++  V                +N  +A+ A     + A G           E N
Sbjct: 779  PFTNTKHVDPV----------------SNSQLAATAIMAPTQQALGSINKVDPRLAVEQN 822

Query: 2808 DSETTSLLVSEAGSEKGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAA 2987
                 +     + +E    Q  +PWG++DHL DGY+D+QKA IQKER+RRI EQ+KMF+A
Sbjct: 823  GQNADATTTDASATELEATQPVSPWGNLDHLLDGYDDKQKALIQKERARRITEQHKMFSA 882

Query: 2988 RKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPG 3167
            RK            NSAKFIEV+ +HE++LR+KEEQD+ + +RHL+RF HM MWTKLRPG
Sbjct: 883  RKLCLVLDLDHTLLNSAKFIEVEPIHEEMLRKKEEQDRTLPERHLYRFHHMNMWTKLRPG 942

Query: 3168 IWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVP 3347
            IWNFLEKASNL+ELHLYTMGNKLYATEMAKVLDPTGTLFAGRVIS+GDD D FD DERVP
Sbjct: 943  IWNFLEKASNLFELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERVP 1002

Query: 3348 KSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDH 3527
            KSKDLDGVLGMESAVVIIDDSVRVWPHN+ NLIVVERYTYFP SRRQFGL GPSLLEID 
Sbjct: 1003 KSKDLDGVLGMESAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDR 1062

Query: 3528 DERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVG 3707
            DERPEDGTLASSLAVIER+H NFFSH +L + DVR+ILA+EQR+ILAGCRIVFSR+FPVG
Sbjct: 1063 DERPEDGTLASSLAVIERIHHNFFSHPNLNEADVRSILASEQRRILAGCRIVFSRVFPVG 1122

Query: 3708 EANPHMHPLWQTAEQFGAVCTNQI 3779
            +A+PH+HPLWQTAEQFGAVCTN +
Sbjct: 1123 DASPHLHPLWQTAEQFGAVCTNLV 1146


>gb|EXB81217.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Morus
            notabilis]
          Length = 1301

 Score =  804 bits (2077), Expect = 0.0
 Identities = 537/1272 (42%), Positives = 711/1272 (55%), Gaps = 135/1272 (10%)
 Frame = +3

Query: 369  RPRGGRAL-DGETSDVDSSESLEEISAEDF-KQEGRA-------------------GRSR 485
            R   GR + D E  ++  S S+EEIS EDF KQEG                     G SR
Sbjct: 3    RIESGRVVEDVEEGEISDSASVEEISEEDFNKQEGNGTGSGKVMSVSDSNSKESKFGDSR 62

Query: 486  VWM------GYPMSKNYGPSLYNFAWAQAVQNKPLG-LDLKPMSAGDLTDNVDAKGKPQ- 641
            VW        YP  + Y   LYN AWAQAVQNKPL  + +  + A D +  V +   P  
Sbjct: 63   VWTMRDLYANYPGFRGYTTGLYNLAWAQAVQNKPLNEIFVMDVDADDSSRVVLSSASPAV 122

Query: 642  --------------EEAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQL 779
                          E+   V+++DS  E +                         E  + 
Sbjct: 123  NSGRREGKNGVKEVEKVEKVVIDDSADEME-----------------------EGELEEG 159

Query: 780  SSDIPENEPEKKESDGEELQD------------------ISEFDKRISLILEELDTITVE 905
              D+ E+EP +K + GEE +D                    E +KR+ LI E L ++ V 
Sbjct: 160  EIDL-ESEPTQKPA-GEEAKDGDLNCEAENVGGLEVDSRRDELEKRVDLIWETLGSVNVV 217

Query: 906  EAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVMAIQTTYSALNSVTMQKK 1085
             AE SFE VC RL+++ E L+ + +  E S        Q ++ AIQ   S   S+++ +K
Sbjct: 218  NAEKSFEEVCSRLQRTLESLRGVLSEKEFSFPTKDVVIQMSITAIQVVNSVFCSMSVNQK 277

Query: 1086 DQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSL-------------------VFED 1208
            +Q K+ L RL   +KN  + LF+ EQ KEI+ +++SL                   + E 
Sbjct: 278  EQKKETLSRLFCSVKNCGTPLFSPEQTKEIELMISSLNPLNVLPSSGASDKEKETQIIER 337

Query: 1209 NKKEKADLTNGPV--VSLHGSNV-LEKACLV----PEALDLPKLVLP--TRSKNRVDFSP 1361
              +  ++LTN      S+  ++V L + C+        + LP+L+ P     K R    P
Sbjct: 338  LHEMDSNLTNANAENASIERTSVKLPQDCVASVVHSNPITLPELLRPGTLAFKGRGLLLP 397

Query: 1362 LLDLHADYNEDSLPSPTRENLPKFSIPKPIGL--GMVLPVSSQPITAKNGGEDVMLHPYV 1535
            LLDLH D++ DSLPSPTRE    F + KP+G+  G++ PVS+    A  G E+  LH Y 
Sbjct: 398  LLDLHKDHDADSLPSPTREAPSCFPVYKPLGVADGIIKPVSTTAKVAP-GAEESRLHRYE 456

Query: 1536 TDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNPRAVHAAT 1715
            TDALKAVS+YQ KFG  S L +DRLPSPTPSE+  +               N R      
Sbjct: 457  TDALKAVSTYQQKFGRGSFLMSDRLPSPTPSEECDEEDDINQEVSSSLTSGNLRTPAIPI 516

Query: 1716 QFQMVKSRS---SAPCTNSSISNQSHPVEQLNPMSKPALKPALKRRDPRLKFVNNEVKSI 1886
                V + S   S+P     I+ ++     +   S   +K + + RDPRL+F N++  ++
Sbjct: 517  LRPSVVTSSVPVSSPTMQGPIAAKN--AAPVGSGSNSTMKASARSRDPRLRFANSDAGAL 574

Query: 1887 SDEGRV-----TEPDLLKSDPVGVATNSKKHKTDESVARD-HMMKKQRHKLTTSR-EMET 2045
                R        P +   DP    T+S+K +  E    D   +K+QRH   +++ +++T
Sbjct: 575  DLNQRPLTAVHNGPKVEPGDP----TSSRKQRIVEEPNLDGPALKRQRHAFVSAKIDVKT 630

Query: 2046 ASG-SGWSEAKNVI-PQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSGLRS 2219
            ASG  GW E      PQ   K Q+ EN + D RKS          PI +N  +   G   
Sbjct: 631  ASGVGGWLEDNGTTGPQIMNKNQLVENAEADPRKSIHLV----NGPIMNNGPN--IGKEQ 684

Query: 2220 IPKSNPTPTISLPSLLK--AVNPTILMQ-LHQMEQQRIAAENQKKNAVSTSDSANVLSVN 2390
            +P +  +   +LP++LK  AVNPTI M  L+++ QQ++ A + ++ + S+ ++ +    N
Sbjct: 685  VPVTGTSTPDALPAILKDIAVNPTIFMDILNKLGQQQLLAADAQQKSDSSKNTTHPPGTN 744

Query: 2391 ELPGTVASINSTPLKSKESGLNQ-PGISL-IPSQMASSSTQPDVARIRMKPRDPRRILHD 2564
             + G    +N  P  SK SG+ Q P +SL   SQ+A++S Q ++ +IRMKPRDPRR+LH 
Sbjct: 745  SILGAAPLVNVAP--SKASGILQTPAVSLPTTSQVATASMQDELGKIRMKPRDPRRVLHG 802

Query: 2565 NMVQKNDGVVYQQSK-------------------IDAAASD----PQSSLVRLAAPLQLT 2675
            NM+QK+  + ++Q K                   +    +D    P   +V+     Q T
Sbjct: 803  NMLQKSWSLGHEQFKPIVSSVSCTPGNKDNLNGPVQEGQADKKQVPSQLVVQPDIARQFT 862

Query: 2676 KNLANVL----XXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGELNDSETTSLLVSEA 2843
            KNL N+                    ++QP+  + D+  V+A      D  + +    E 
Sbjct: 863  KNLRNIADLMSVSQASTSPATVSQNLSSQPLPVKPDRGDVKAVVPNSEDQHSGTNSTPET 922

Query: 2844 GSEKGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXX 3023
                 +R + N WGDV+HLF+GY+DEQKA IQ+ER+RR+ EQ KMF A K          
Sbjct: 923  TLAVPSR-TPNAWGDVEHLFEGYDDEQKAAIQRERARRLEEQKKMFDAHKLCLVLDLDHT 981

Query: 3024 XXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLY 3203
              NSAKF+EVD VH++ILR+KEEQD+E  QRHLFRF HMGMWTKLRPG+WNFLEKAS LY
Sbjct: 982  LLNSAKFVEVDSVHDEILRKKEEQDREKPQRHLFRFPHMGMWTKLRPGVWNFLEKASKLY 1041

Query: 3204 ELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGME 3383
            ELHLYTMGNKLYATEMAKVLDP GTLF+GRVIS+GDD D FDGDERVPKSKDL+GVLGME
Sbjct: 1042 ELHLYTMGNKLYATEMAKVLDPMGTLFSGRVISRGDDGDPFDGDERVPKSKDLEGVLGME 1101

Query: 3384 SAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASS 3563
            S+VVIIDDSVRVWPHNKLNLIVVERYTYFP SRRQFGL GPSLLEIDHDERPE GTLASS
Sbjct: 1102 SSVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEQGTLASS 1161

Query: 3564 LAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQT 3743
            LAVIE++HQNFFSH+SL +VDVRNILA+EQRKILAGCRIVFSR+FPV E NPH+HPLWQT
Sbjct: 1162 LAVIEKIHQNFFSHHSLDEVDVRNILASEQRKILAGCRIVFSRVFPVSEVNPHLHPLWQT 1221

Query: 3744 AEQFGAVCTNQI 3779
            AEQFGAVCT QI
Sbjct: 1222 AEQFGAVCTTQI 1233


>gb|EOX99661.1| RNA polymerase II C-terminal domain phosphatase-like 3, putative
            [Theobroma cacao]
          Length = 1290

 Score =  801 bits (2069), Expect = 0.0
 Identities = 546/1266 (43%), Positives = 703/1266 (55%), Gaps = 104/1266 (8%)
 Frame = +3

Query: 294  MLFAGSNLLFLNH--SEEKPLDLMARERPRGGRALDGETSDVDSSESLEEISAEDFKQE- 464
            M   GSN +  +     E+ L  M ++  +     +GE SD   S S+EEIS EDF ++ 
Sbjct: 1    MYVVGSNWIEWSKLKKNEETLGEMGKDETKVEDVEEGEISD---SASIEEISEEDFNKQD 57

Query: 465  ------------GRAG-RSRVWM-----GYP-MSKNYGPSLYNFAWAQAVQNKPL----- 572
                        G A   SRVW       YP + + Y   LYNFAWAQAVQNKPL     
Sbjct: 58   VKILKESKSSKGGEANSNSRVWTMQDLCKYPSVIRGYASGLYNFAWAQAVQNKPLNEIFV 117

Query: 573  -------------GLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVEDSNQEDDVXXXXXXX 713
                              P S+    ++ + KG     A  V+++D ++++         
Sbjct: 118  KDFEQPQQDENKNSKRSSPSSSVASVNSKEEKGSSGNLAVKVVIDDDSEDE--------- 168

Query: 714  XXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKK--ESDGEELQDISEFDKRISLILEEL 887
                             E  +   D+ ++EP++K   S+   + +  E +KR +LI   L
Sbjct: 169  ---MEEDKVVNLDKEEGELEEGEIDL-DSEPKEKVLSSEDGNVGNSDELEKRANLIRGVL 224

Query: 888  DTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVMAIQTTYSALNS 1067
            + +TV EAE SFE VC RL  + E L+ +   +E S        Q A  AI + + ALN 
Sbjct: 225  EGVTVIEAEKSFEGVCSRLHNALESLRALI--LECSVPAKDALIQLAFGAINSAFVALN- 281

Query: 1068 VTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSL---------------VF 1202
                 K+QN  +L RLL  +K     LF  +++KEID ++ SL               V 
Sbjct: 282  --CNSKEQNVAILSRLLSIVKGHDPSLFPPDKMKEIDVMLISLNSPARAIDTEKDMKVVD 339

Query: 1203 EDNKKEKADLTNGPVVSLHGSNVLEKACLV-----PEALDLPKLVLPTRSKNRVDFSPLL 1367
              NKK+   L       L  +N L  +        P AL           +NR    PLL
Sbjct: 340  GVNKKDPDALPENICHDLTVTNKLPSSAKFVINNKPNALTETLKPGVPNFRNRGISLPLL 399

Query: 1368 DLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPITAK--NGGEDVMLHPYVTD 1541
            DLH D++ DSLPSPTRE  P   + KP+  G V+ V S  +T K  +  E   LHPY TD
Sbjct: 400  DLHKDHDADSLPSPTRETTPCLPVNKPLTSGDVM-VKSGFMTGKGSHDAEGDKLHPYETD 458

Query: 1542 ALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGG-QGXXXXXXXXXXXXXCNPRAVHAATQ 1718
            ALKA S+YQ KFG  S  S+DRLPSPTPSE+ G +G              N +       
Sbjct: 459  ALKAFSTYQQKFGQGSFFSSDRLPSPTPSEESGDEGGDNGGEVSSSSSIGNFKPNLPILG 518

Query: 1719 FQMVKSRSSAPCTNSSISNQ--SHPVEQLNPMSKPALKPALKRRDPRLKFVNNEVKSISD 1892
              +V S       +SS+  Q  +     ++ +S    K   K RDPRL F N+   ++  
Sbjct: 519  HPIVSSAPLVDSASSSLQGQITTRNATPMSSVSNIVSKSLAKSRDPRLWFANSNASALDL 578

Query: 1893 EGRVTEPDLLKSDPVGVATNSKKHKT-DESVARDHMMKKQRHKLT---TSREMETASG-S 2057
              R+   +  K  PVG   +S+K K+ +E +     +K+QR++L     +R+++T SG  
Sbjct: 579  NERLLH-NASKVAPVGGIMDSRKKKSVEEPILDSPALKRQRNELENLGVARDVQTVSGIG 637

Query: 2058 GWSEAKNVI-PQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSGLRSIPKSN 2234
            GW E  + I  Q + + Q  EN + + RK      V     +       +     +P ++
Sbjct: 638  GWLEDTDAIGSQITNRNQTAENLESNSRKMDN--GVTSSSTLSGKTNITVGTNEQVPVTS 695

Query: 2235 PTPTISLPSLLK--AVNPTILMQLHQM-EQQRIAAENQKKNAVSTSDSANVLSVNELPGT 2405
             T T SLP+LLK  AVNPT+L+ + +M +QQR+ AE Q+K+      + +  S N L G 
Sbjct: 696  -TSTPSLPALLKDIAVNPTMLINILKMGQQQRLGAEAQQKSPDPVKSTFHQPSSNSLLGV 754

Query: 2406 VASINSTPLKSKESGLN-QPGISLIPSQMASSSTQPDVARIRMKPRDPRRILHDNMVQKN 2582
            V+S N  P  S  +  +   GIS  P+      +  +  +IRMKPRDPRR+LH N +Q++
Sbjct: 755  VSSTNVIPSPSVNNVPSISSGISSKPAGNLQVPSPDESGKIRMKPRDPRRVLHGNSLQRS 814

Query: 2583 DGVVYQQSKIDAA-ASDPQSSLVRLAA----------PL------------QLTKNLANV 2693
              +   Q K + A  S  Q S   L A          P+            Q T NL N+
Sbjct: 815  GSMGLDQLKTNGALTSSTQGSKDNLNAQKLDSQTESKPMQSQLVPPPDITQQFTNNLKNI 874

Query: 2694 LXXXXXXXXXXXXXXCNN----QPVASQADQVIVEAASGELNDSETTSLLVSEAGSEKGT 2861
                            ++    QPV  ++D + ++A      D +T + L  EAG+  G 
Sbjct: 875  ADIMSVSQALTSLPPVSHNLVPQPVLIKSDSMDMKALVSNSEDQQTGAGLAPEAGA-TGP 933

Query: 2862 RQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAK 3041
            R S N WGDV+HLF+ Y+D+QKA IQ+ER+RRI EQ KMF+ARK            NSAK
Sbjct: 934  R-SQNAWGDVEHLFERYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAK 992

Query: 3042 FIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYT 3221
            FIEVD VHE+ILR+KEEQD+E  +RHLFRF HMGMWTKLRPGIWNFLEKAS LYELHLYT
Sbjct: 993  FIEVDPVHEEILRKKEEQDREKPERHLFRFHHMGMWTKLRPGIWNFLEKASKLYELHLYT 1052

Query: 3222 MGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMESAVVII 3401
            MGNKLYATEMAKVLDP G LFAGRVIS+GDD D FDGDERVP+SKDL+GVLGMESAVVII
Sbjct: 1053 MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPRSKDLEGVLGMESAVVII 1112

Query: 3402 DDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIER 3581
            DDSVRVWPHNKLNLIVVERYTYFP SRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIER
Sbjct: 1113 DDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIER 1172

Query: 3582 LHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGA 3761
            +HQ+FFSH +L DVDVRNILA+EQRKILAGCRIVFSR+FPVGEANPH+HPLWQTAEQFGA
Sbjct: 1173 IHQDFFSHQNLDDVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGA 1232

Query: 3762 VCTNQI 3779
            VCTNQI
Sbjct: 1233 VCTNQI 1238


>ref|XP_002266931.2| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Vitis vinifera]
          Length = 1238

 Score =  779 bits (2011), Expect = 0.0
 Identities = 519/1211 (42%), Positives = 676/1211 (55%), Gaps = 82/1211 (6%)
 Frame = +3

Query: 393  DGETSDVDSSESLEEISAEDF-KQEGRAGR-------SRVWMGYPMS---------KNYG 521
            D E  ++  S S+EEIS EDF KQE R  R       +RVW    +            Y 
Sbjct: 14   DVEEGEISDSASVEEISEEDFNKQEVRVLREAKPKADTRVWTMRDLQDLYKYHQACSGYT 73

Query: 522  PSLYNFAWAQAVQNKPLG-------LDLKPMSAGDLTDNVDAKGKPQEEAYDVIVEDSNQ 680
            P LYN AWAQAVQNKPL         + K  S+   T   D+     +E   VI++DS  
Sbjct: 74   PRLYNLAWAQAVQNKPLNDIFVMDDEESKRSSSSSNTSRDDSSSA--KEVAKVIIDDSGD 131

Query: 681  EDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISEFDK 860
            E DV                        +       +  NEPE    + E ++       
Sbjct: 132  EMDVKMDDVSEKEEGELEEGEIDLDSEPDVKDEGGVLDVNEPEIDLKERELVE------- 184

Query: 861  RISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGI---ESSDTXXXXXXQQAV 1031
            R+  I E+L+++TV EAE SF  VC RL+ +   L+ +F      ESS        QQ +
Sbjct: 185  RVKSIQEDLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLI 244

Query: 1032 MAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSL----- 1196
             AI+       S+   +K+ NK +  RLL  ++   S +F+++ +KE++ +++ L     
Sbjct: 245  NAIRALNHVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAA 304

Query: 1197 ---VFEDNKKEKADLTNGPVVSLHGSNVLE--KACLVPEALDLPKLVLPTRSKNRVD--- 1352
                   +K     +T+G   ++  S+V    +A    + L L  + + + ++N  D   
Sbjct: 305  QSSAEASDKVNDVQVTDGMNRNILDSSVESSGRAFASAKKLSLDSISVESYNQNNPDALK 364

Query: 1353 -----------FSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPITAK 1499
                       F PLLDLH D++EDSLPSPT +    F + K           S+ +TAK
Sbjct: 365  PGLSSSRGRFIFGPLLDLHKDHDEDSLPSPTGKAPQCFPVNK-----------SELVTAK 413

Query: 1500 NGGE--DVMLHPYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXX 1673
               E  D ++HPY TDALKAVS+YQ KFG  S L  D+LPSPTPSE+ G           
Sbjct: 414  VAHETQDSIMHPYETDALKAVSTYQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVS 473

Query: 1674 XXXXCN-PRAVHAATQFQMVKSRSSAPCTNSSISN-----QSHPVEQLNPMSKPALKPAL 1835
                 + P   +A      + S  SAP  +SSI       ++  +    P    ++  + 
Sbjct: 474  SSSTISAPITANAPALGHPIVS--SAPQMDSSIVQGPTVGRNTSLVSSGPHLDSSVVASA 531

Query: 1836 KRRDPRLKFVNNEVKSISDEGRVTEP--DLLKSDPVGVATNSKKHKTDESVARDH-MMKK 2006
            K RDPRL+  +++  S+    R      +  K DP+G   +S+K K+ E    D  + K+
Sbjct: 532  KSRDPRLRLASSDAGSLDLNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKR 591

Query: 2007 QRHKLT---TSREMETASGSG-WSEAKN-VIPQPSFKIQVNENFQVDVRKSGTAAAVPDK 2171
            QR+ LT   T R+ +T   SG W E  N VIPQ   + Q+ EN   D +K  +   V   
Sbjct: 592  QRNGLTSPATVRDAQTVVASGGWLEDSNTVIPQMMNRNQLIENTGTDPKKLESKVTVTGI 651

Query: 2172 KPIFDNNFDELSGLRSIPKSNPTPTISLPSLLK--AVNPTILMQL-HQMEQQRIAAENQK 2342
                D  +  ++G   +P    + T SL SLLK  AVNP + M + +++EQQ+  + +  
Sbjct: 652  G--CDKPYVTVNGNEHLPVVATSTTASLQSLLKDIAVNPAVWMNIFNKVEQQK--SGDPA 707

Query: 2343 KNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLIPSQMASSSTQPDVAR 2522
            KN V    S ++L V   P +VA +  + L  K +G           Q+  +    +  +
Sbjct: 708  KNTVLPPTSNSILGVVP-PASVAPLKPSALGQKPAGA---------LQVPQTGPMDESGK 757

Query: 2523 IRMKPRDPRRILHDNMVQKNDGVVYQQSKIDAAASDPQSSLVRLAA--------PLQLTK 2678
            +RMKPRDPRRILH N  Q++     +Q K +A   + Q+    + +          Q TK
Sbjct: 758  VRMKPRDPRRILHANSFQRSGSSGSEQFKTNAQKQEDQTETKSVPSHSVNPPDISQQFTK 817

Query: 2679 NLANVLXXXXXXXXXXXXXX----CNNQPVASQADQVIVEAASGELNDSETTSLLVSEAG 2846
            NL N+                    ++Q V    D++ V+A   +  D  T +   S+  
Sbjct: 818  NLKNIADLMSASQASSMTPTFPQILSSQSVQVNTDRMDVKATVSDSGDQLTAN--GSKPE 875

Query: 2847 SEKGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXX 3026
            S  G  QS N WGDV+HLFDGY+D+QKA IQ+ER+RRI EQ KMF+ARK           
Sbjct: 876  SAAGPPQSKNTWGDVEHLFDGYDDQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTL 935

Query: 3027 XNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYE 3206
             NSAKF+EVD VH++ILR+KEEQD+E SQRHLFRF HMGMWTKLRPGIWNFLEKAS LYE
Sbjct: 936  LNSAKFVEVDPVHDEILRKKEEQDREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYE 995

Query: 3207 LHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMES 3386
            LHLYTMGNKLYATEMAKVLDP G LFAGRVISKGDD D  DGDERVPKSKDL+GVLGMES
Sbjct: 996  LHLYTMGNKLYATEMAKVLDPKGVLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMES 1055

Query: 3387 AVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSL 3566
            AVVIIDDSVRVWPHNKLNLIVVERYTYFP SRRQFGL GPSLLEIDHDERPEDGTLASSL
Sbjct: 1056 AVVIIDDSVRVWPHNKLNLIVVERYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSL 1115

Query: 3567 AVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTA 3746
            AVIER+HQ+FFS+ +L +VDVRNILA+EQRKILAGCRIVFSR+FPVGEANPH+HPLWQTA
Sbjct: 1116 AVIERIHQSFFSNRALDEVDVRNILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTA 1175

Query: 3747 EQFGAVCTNQI 3779
            E FGAVCTNQI
Sbjct: 1176 ESFGAVCTNQI 1186


>gb|AFW60862.1| hypothetical protein ZEAMMB73_799152, partial [Zea mays]
          Length = 1234

 Score =  778 bits (2009), Expect = 0.0
 Identities = 515/1257 (40%), Positives = 687/1257 (54%), Gaps = 116/1257 (9%)
 Frame = +3

Query: 357  MARERPR------GGRALD-----GETSDVDSSESLEEISAEDFKQEGRAG--------- 476
            MARE+PR      GG  +      GE SD DSS S+EEI+A+DFK++  +          
Sbjct: 1    MAREQPRSAVVAAGGDLVTAAGGGGEGSDRDSSGSIEEITADDFKKDSSSALGGAAAAAG 60

Query: 477  -RSRVWMGYP----MSKNYGPSLYNFAWAQAVQNKPLGL------DLKPMSAGDLTDNVD 623
             RSR W+  P    M++N+  +  +FAW+QAV+NKPLGL      D +   A D++D   
Sbjct: 61   PRSRSWVAPPAVGYMARNFRYAFNSFAWSQAVRNKPLGLQPPAPDDDEVEHAVDVSDGEK 120

Query: 624  AKGKPQE-EAYDVIVEDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPEN 800
             +G+ +E EA + +   +  +                           ET  L SD PE 
Sbjct: 121  EEGEIEEGEAVEALASPAPAQP--------------------------ETIDLDSDAPEK 154

Query: 801  EPEKKESDGE---------ELQDISEFDKRISLILEELDTITVEEAET------------ 917
              E    DG          E ++++  D+R+  ILEEL+ +++EEAE             
Sbjct: 155  S-ESVAIDGSASVVPVPAAEEEEVN-LDQRVGSILEELEMVSIEEAEKYMGICFMFFLEQ 212

Query: 918  -----SFEAVCLRLRKSFEDLKPMFTGIESSD--TXXXXXXQQAVMAIQTTYSALNSVTM 1076
                 SFE  C RL   FE+LKP+F  +E+           QQA + I T  +  N   +
Sbjct: 213  RLCFRSFEGACARLHTCFENLKPLFQELENGSPMAILEPLMQQAFIGIDTLTTVANLYNL 272

Query: 1077 QKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSLVF--EDNKKEKADL--TNGP 1244
             +++QNK  LL+LL HIKN+YS + T EQ +E+D+ V  LVF  +DN  + +    T+  
Sbjct: 273  PRREQNKTTLLKLLFHIKNRYSDMLTPEQREEMDSRVRKLVFGEKDNVSDPSTSCGTSAI 332

Query: 1245 VVSLHGSNVLEKACLVPEA------LDLPKLVLPTRSKNRVDFSPLLDLHADYNEDSLPS 1406
             VS     V     L  E+        LP+L +P +       SPLL+LHADY+E+SLPS
Sbjct: 333  NVSAPSGQVSNTGGLPFESGAANLFSSLPRLEVPAKRN-----SPLLNLHADYDENSLPS 387

Query: 1407 PTRENLPKFSIPKPIGLGMVLPVSSQPITAKNGGEDVM--LHPYVTDALKAVSSYQLKFG 1580
            PTR+N P F   KPIG G   P+  + ++  +  E     L+P + D LKAVSSYQ K+G
Sbjct: 388  PTRDNAPPFPALKPIGFG-AFPMVPEKLSFLDRVEPTKNSLYPPLNDPLKAVSSYQQKYG 446

Query: 1581 SNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCN-PRAVHAATQFQMVKSR------ 1739
              S+  +D LPSPTPS D G+                 P+++   +  QM  S+      
Sbjct: 447  QKSVYPSDDLPSPTPSGDEGKPADKGGDIFSDVSSFPVPKSIVLPSTSQMPASQPSTVSS 506

Query: 1740 -------------SSAPCTNSS--ISNQSHP------VEQLNPMSKPALKPALKRRDPRL 1856
                         +S P T SS  IS  S P      +EQ       A+K A K RDPRL
Sbjct: 507  SSISYASSTSQMAASQPITVSSSGISYASGPPGFAKQIEQSTAGPNHAIKAASKSRDPRL 566

Query: 1857 KFVNNEVKSISDEG-RVTEPDLLKSDPVGVATNSKKHKT-DESVARDHMMKKQRHKLTTS 2030
            +F+N +    +D   R    +L   +  GV+  ++K K  D+    D+ +K+ R  +   
Sbjct: 567  RFLNRDSAGATDVNWRANFSELKDGNLGGVSVGNRKQKAVDDPQVDDNALKRFRGGIANQ 626

Query: 2031 REMETASGSGWSEAKNVIPQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSG 2210
            R+M+                       N N  +++R    ++++         N   L  
Sbjct: 627  RDMQPTG--------------------NPNQLMNIRAPTHSSSI---------NMKTLQP 657

Query: 2211 LRSI-PKSNPTPTISLPSLLK---AVNPTILMQLHQMEQQRIAAENQKKNAVSTSDSANV 2378
             ++  P  +  P + LP +L    AVNP +LM L QME Q+ +A        S    ++ 
Sbjct: 658  PQTTAPHVSAAPAVPLPPMLLKDIAVNPALLMHLIQMEHQKKSASE------SQGGMSSG 711

Query: 2379 LSVNELPGTVASINSTPLKSKESGLNQPGISLIPSQMASSSTQPDVARIRMKPRDPRRIL 2558
            ++ N + G V +  + P K  E+         +P Q    ++Q D   +RMKPRDPRRIL
Sbjct: 712  MTNNGIAGMVFTPGNAP-KITEAAQVPSVRPQVPVQTPPLNSQNDGGIVRMKPRDPRRIL 770

Query: 2559 HDNMVQKNDGVVYQQSKIDAAASDPQSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXX 2738
            H+N+ QK+D +  +Q K +               P+     L + +              
Sbjct: 771  HNNIAQKSDAMSLEQVKNNGTTQPDSQGTKDQTTPVPSQPALPSSIARPFSSAKHVDPV- 829

Query: 2739 CNNQPVASQADQVIVEAASG----------ELNDSETTSLLVSEAGSEKGTRQSANPWGD 2888
             +N  +A+ A     +A S           E N     +     + +     Q  +PWGD
Sbjct: 830  -SNSQLAATAIMAPTQALSSVNKVDPRLAVEQNGQNADATTNGASATTLEATQPVSPWGD 888

Query: 2889 VDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHE 3068
            VDHL DGY+D+QKA IQKER+RRI EQ+KMF+ARK            NSAKFIEV+ +HE
Sbjct: 889  VDHLLDGYDDQQKALIQKERARRITEQHKMFSARKLCLVLDLDHTLLNSAKFIEVEPIHE 948

Query: 3069 DILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATE 3248
            ++LR+KEEQD+ + +RHL+RF HM MWTKLRPGIWNFL+KASNL+ELHLYTMGNKLYATE
Sbjct: 949  EMLRKKEEQDRTLPERHLYRFHHMNMWTKLRPGIWNFLQKASNLFELHLYTMGNKLYATE 1008

Query: 3249 MAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPH 3428
            MAKVLDPTGTLFAGRVIS+GDD D FD DERVPKSKDLDGVLGMESAVVIIDDSVRVWPH
Sbjct: 1009 MAKVLDPTGTLFAGRVISRGDDGDPFDSDERVPKSKDLDGVLGMESAVVIIDDSVRVWPH 1068

Query: 3429 NKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHN 3608
            N+ NLIVVERYTYFP SRRQFGL GPSLLEID DERPEDGTLASSLAVIER+H NFFSH 
Sbjct: 1069 NRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDRDERPEDGTLASSLAVIERIHHNFFSHP 1128

Query: 3609 SLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            +L + DVR+ILA+EQR+IL GCRIVFSR+FPVG+A+PH+HPLWQTAEQFGAVCTN +
Sbjct: 1129 NLNEADVRSILASEQRRILTGCRIVFSRVFPVGDASPHLHPLWQTAEQFGAVCTNLV 1185


>ref|XP_002304648.2| hypothetical protein POPTR_0003s16280g [Populus trichocarpa]
            gi|550343308|gb|EEE79627.2| hypothetical protein
            POPTR_0003s16280g [Populus trichocarpa]
          Length = 1247

 Score =  759 bits (1959), Expect = 0.0
 Identities = 508/1223 (41%), Positives = 671/1223 (54%), Gaps = 90/1223 (7%)
 Frame = +3

Query: 381  GRALDGETSDVDSSESLEEISAEDF-KQE--------------GRAGRSRVWM-----GY 500
            G+  D E  ++  + S+EEIS +DF KQE                + + +VW       Y
Sbjct: 15   GKMEDVEEGEISDTASVEEISEDDFNKQEVVVVKETPSSTTNNNSSSKQKVWTVRDLYKY 74

Query: 501  PMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTD--NVDAKGKPQEEAYDVIVEDS 674
             +   Y   LYN AWAQAVQNKPL      +   D +   +V +    +E+   V+++DS
Sbjct: 75   QVGGGYMSGLYNLAWAQAVQNKPLNELFVEVEVDDSSQKSSVSSVNSSKEDKRTVVIDDS 134

Query: 675  NQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISEF 854
              E DV                     +  E  +L     + + E K   G    D    
Sbjct: 135  GDEMDVVKVID----------------IEKEEGELEEGEIDLDSEGKSEGGMVSVDT--- 175

Query: 855  DKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVM 1034
            +KR+  I E+L++++V + + SFEAVCL+L  + E LK +    E+         +    
Sbjct: 176  EKRVKSIREDLESVSVIKDDKSFEAVCLKLHNALESLKELVRVNENGFPSKDSLVRLLFT 235

Query: 1035 AIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEI------DTLVNSL 1196
            AI    S  +S+  + K+QNK + +R L  + +     F+ E  KE+      D  + SL
Sbjct: 236  AIGAVNSFFSSMNQKLKEQNKGVFMRFLSLVNSHDPSFFSPEHTKEVCDFCNFDFRIVSL 295

Query: 1197 VFEDNKKEKADLTN-GPVVSLHGSNVLEKACLVPEALDLPKLVLPTRSKNRVDFSPLLDL 1373
             +        DLT    + S   S V  K      +++ PK  +P+  K+R    PLLDL
Sbjct: 296  CY--------DLTTMNRLPSAAESFVHNKPNF---SIEPPKPGVPS-FKSRGVLLPLLDL 343

Query: 1374 HADYNEDSLPSPTRENLPKFSIPK--PIGLGMVLPVSSQPITAKNGGEDVMLHPYVTDAL 1547
               ++EDSLPSPTRE  P F + +  PIG GM+      P  A    E+  +HPY TDAL
Sbjct: 344  KKFHDEDSLPSPTRETAPSFPVQRLLPIGDGMISSGLPVPKVASIT-EEPRVHPYETDAL 402

Query: 1548 KAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNPRAVHAATQFQM 1727
            KAVSSYQ KF  NS   T+ LPSPTPSE+ G G              N R V+     + 
Sbjct: 403  KAVSSYQKKFNLNSFF-TNELPSPTPSEESGNGDGDTAGEVSSSSTVNYRTVNPPVSDRK 461

Query: 1728 VKSRSSAP-------------CTNSSISNQSHPVEQLNPMSK---PALKPALKRRDPRLK 1859
              S S +P               NSSI     P     P+S      +K + K RDPRL+
Sbjct: 462  SASPSPSPPPPPPPPPPPPPHLNNSSI-RVVIPTRNSAPVSSGTSSTVKASAKSRDPRLR 520

Query: 1860 FVNNEVKSISDEGR----VTEPDLLKSDPVGVATNSKKHKTDESVARDHMMKKQRHKLTT 2027
            +VN +  ++    R    V  P   +++P G    S+K K +E V     +K+QR+    
Sbjct: 521  YVNTDASALDQNQRTLLMVNNPP--RAEPSGAIAGSRKQKIEEDVLDGTSLKRQRNSFDN 578

Query: 2028 S---REMETASGSG-WSEAKNVI-PQPSFKIQVNENFQVDVRKSG-----TAAAVPDKKP 2177
                R++ + +G+G W E  ++  PQ   K Q  EN +   R +      +  +V     
Sbjct: 579  FGVVRDIRSMTGTGGWLEDTDMAEPQTVNKNQWAENAEPGQRINNGVVCPSTGSVMSSVS 638

Query: 2178 IFDNNFDELSGLRSIPKSNPTP-----TISLPSLLK--AVNPTILMQLHQM-EQQRIAAE 2333
               N    + G+ +I  S   P     T SLP LLK   VNPT+L+ + +M +QQR+A +
Sbjct: 639  CSGNVQVPVMGINTIAGSEQAPVTSTTTASLPDLLKDITVNPTMLINILKMGQQQRLALD 698

Query: 2334 NQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGL--NQPGISLIPSQMASSSTQ 2507
             Q+K A     +++  S N + G +  +N+  + S  SG+     G +  PSQ+A   T 
Sbjct: 699  GQQKLADPAKSTSHPPSSNTVLGAIPEVNA--VSSLPSGILPRSAGKAQGPSQIA---TT 753

Query: 2508 PDVARIRMKPRDPRRILHDNMVQKNDGVVYQQSKIDAAASDPQSSL-------------V 2648
             +  +IRMKPRDPRR+LH+N +Q+   +  +Q K     S  Q +              +
Sbjct: 754  DESGKIRMKPRDPRRVLHNNALQRAGSLGSEQFKTTTLTSTTQGTKDNQNLQKQEGLAEL 813

Query: 2649 RLAAPLQL----TKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGELNDSE 2816
            +   P  +    TK+L N+                + Q VASQ  Q+  +   G+   S 
Sbjct: 814  KPVVPPDISSPFTKSLKNIADIVSVSQTCTTPPFVS-QNVASQPVQIKSDRVDGKTGISN 872

Query: 2817 TTSLLVSEAGSE--KGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAAR 2990
            +   +   +  E    +  S N W DV+HLF+GY+D+QKA IQ+ER+RRI EQ K+FAAR
Sbjct: 873  SDQKMGPASSPEVVAASSLSQNTWEDVEHLFEGYDDQQKAAIQRERARRIEEQKKLFAAR 932

Query: 2991 KXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGI 3170
            K            NSAKF+EVD VH++ILR+KEEQD+E   RHLFRF HMGMWTKLRPGI
Sbjct: 933  KLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKPYRHLFRFPHMGMWTKLRPGI 992

Query: 3171 WNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPK 3350
            WNFLEKAS LYELHLYTMGNKLYATEMAKVLDP G LFAGRV+S+GDD D  DGDERVPK
Sbjct: 993  WNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKGVLFAGRVVSRGDDGDLLDGDERVPK 1052

Query: 3351 SKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHD 3530
            SKDL+GVLGMES VVIIDDS+RVWPHNKLNLIVVERY YFP SRRQFGL GPSLLEIDHD
Sbjct: 1053 SKDLEGVLGMESGVVIIDDSLRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHD 1112

Query: 3531 ERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGE 3710
            ERPEDGTLA SLAVIER+HQNFF+H+SL + DVRNILA+EQRKILAGCRIVFSR+FPVGE
Sbjct: 1113 ERPEDGTLACSLAVIERIHQNFFTHHSLDEADVRNILASEQRKILAGCRIVFSRVFPVGE 1172

Query: 3711 ANPHMHPLWQTAEQFGAVCTNQI 3779
             NPH+HPLWQ+AEQFGAVCTNQI
Sbjct: 1173 VNPHLHPLWQSAEQFGAVCTNQI 1195


>ref|XP_006438860.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|568858958|ref|XP_006483010.1| PREDICTED: RNA
            polymerase II C-terminal domain phosphatase-like 3-like
            [Citrus sinensis] gi|557541056|gb|ESR52100.1|
            hypothetical protein CICLE_v10030535mg [Citrus
            clementina]
          Length = 1234

 Score =  751 bits (1938), Expect = 0.0
 Identities = 519/1242 (41%), Positives = 682/1242 (54%), Gaps = 113/1242 (9%)
 Frame = +3

Query: 393  DGETSDVDSSESLEEISAEDFK--QE---------------GRAGRSRVWM------GYP 503
            D E  ++  + S+EEIS EDFK  QE               G    +RVW        YP
Sbjct: 4    DVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYP 63

Query: 504  -MSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVEDSNQ 680
             + + YGP L+N AWAQAVQNKPL  ++  M A    D+V  +  P      V    +  
Sbjct: 64   AICRGYGPGLHNLAWAQAVQNKPLN-EIFVMEAEQ--DDVSKRSSPASSVASVNSGAAAG 120

Query: 681  EDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISEFDK 860
            +DD                      V+D+    S D  E E  + E    EL   SE ++
Sbjct: 121  KDD---------------KKVVEKVVIDD----SGDEIEKEEGELEEGEIELDLESESNE 161

Query: 861  RIS-LILEELDTITVEE----------AETSFEAVCLRLRKSFEDLKPMFTGIESSDTXX 1007
            ++S  + EE+  I VE            + SFE VC +L  + E L+ +    E++    
Sbjct: 162  KVSEQVKEEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVN--ENNVPTK 219

Query: 1008 XXXXQQAVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLV 1187
                Q A  A+Q+ +S   S+    K+QNK++L RLL  IK+    LF+  Q+KE++ ++
Sbjct: 220  DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAML 279

Query: 1188 NSLVFEDNKKEKADLTNGPVVSLHGSNVLEKACLVPEALD----LPKLVLPTRS------ 1337
            +SLV   N KEK       ++++HG N  +   +   A++      K+ LP  S      
Sbjct: 280  SSLVTRANDKEK------DMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKP 333

Query: 1338 -----------KNRVDFSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQ 1484
                       ++R    PLLD H  ++ DSLPSPTRE  P   + + + +G  +  S  
Sbjct: 334  LEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWA 393

Query: 1485 PITAKNGGEDVMLHP-YVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXX 1661
                 +   +V   P Y TDAL+A SSYQ KFG NS      LPSPTPSE+ G G     
Sbjct: 394  AAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTG 453

Query: 1662 XXXXXXXXCN-PRAVHAATQFQMVKSRS----SAPCTNSSI-----SNQSHPVEQ-LNPM 1808
                     + P+ V+  T  Q   S      S P   SS+     +N S P     NP+
Sbjct: 454  GEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPV 513

Query: 1809 SKP--ALKPALKRRDPRLKFVNNEVKSISDEGRVTEPDLLKSDPVGVATNSKKHKT-DES 1979
             KP   +K  +K RDPRL+F ++   +++ +      +  K +PVG   +S+K KT +E 
Sbjct: 514  VKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEP 573

Query: 1980 VARDHMMKKQRHKLTTS---REMETASGSG-WSEAKNVIPQPSFKIQVNENFQVDVRKS- 2144
            V     +K+QR+    S   R+ +   GSG W E  ++  +P     +N N  VD  +S 
Sbjct: 574  VLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMF-EPQI---MNRNLLVDSAESN 629

Query: 2145 ------GTAAAVPDKKPIFDNNFDELSGLRSIPKSNPTPTISLPSLLK--AVNPTILMQL 2300
                  G  + +    P        +SG    P + P+ T+SLP+LLK  AVNPT+L+ +
Sbjct: 630  SRKLDNGATSPITSGTPNV-----VVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNI 684

Query: 2301 HQM-EQQRIAAENQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLI 2477
             +M +QQ++AA+ Q+K+    +DS+       +P ++  ++ T                I
Sbjct: 685  LKMGQQQKLAADAQQKS----NDSSMNTMHPPIPSSIPPVSVT--------------CSI 726

Query: 2478 PSQMASSSTQPDVARIRMKPRDPRRILHDNMVQ---------KNDGV-----------VY 2597
            PS + S     ++ ++RMKPRDPRR+LH N +Q         K DG            + 
Sbjct: 727  PSGILSKPMD-ELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLN 785

Query: 2598 QQSKIDAAASDP--QSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXXCN-NQPVA--- 2759
             Q ++ A  + P    S+++     Q TKNL ++                + N P+    
Sbjct: 786  FQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQ 845

Query: 2760 --SQADQVIVEAASGELNDSETTSLLVSEAGSEKGTRQSANPWGDVDHLFDGYNDEQKAT 2933
              S AD   ++A     +D +T +    EAG      QSA  WGDV+HLF+GY+D+QKA 
Sbjct: 846  IKSGAD---MKAVVTNHDDKQTGTGSGPEAGPVGAHPQSA--WGDVEHLFEGYDDQQKAA 900

Query: 2934 IQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQ 3113
            IQKER+RR+ EQ KMF+ARK            NSAKF EVD VH++ILR+KEEQD+E   
Sbjct: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960

Query: 3114 RHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGR 3293
            RHLFRF HMGMWTKLRPGIW FLE+AS L+E+HLYTMGNKLYATEMAKVLDP G LFAGR
Sbjct: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020

Query: 3294 VISKGDDADTFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 3473
            VIS+GDD D FDGDERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP
Sbjct: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080

Query: 3474 SSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQ 3653
             SRRQFGLLGPSLLEIDHDER EDGTLASSL VIERLH+ FFSH SL DVDVRNILAAEQ
Sbjct: 1081 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140

Query: 3654 RKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            RKILAGCRIVFSR+FPVGEANPH+HPLWQTAEQFGAVCT  I
Sbjct: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI 1182


>ref|XP_006438858.1| hypothetical protein CICLE_v10030535mg [Citrus clementina]
            gi|557541054|gb|ESR52098.1| hypothetical protein
            CICLE_v10030535mg [Citrus clementina]
          Length = 1208

 Score =  751 bits (1938), Expect = 0.0
 Identities = 519/1242 (41%), Positives = 682/1242 (54%), Gaps = 113/1242 (9%)
 Frame = +3

Query: 393  DGETSDVDSSESLEEISAEDFK--QE---------------GRAGRSRVWM------GYP 503
            D E  ++  + S+EEIS EDFK  QE               G    +RVW        YP
Sbjct: 4    DVEEGEISDTASVEEISEEDFKIKQEEVVKVVKETKPIKVGGGEAAARVWTMRDLYNKYP 63

Query: 504  -MSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVEDSNQ 680
             + + YGP L+N AWAQAVQNKPL  ++  M A    D+V  +  P      V    +  
Sbjct: 64   AICRGYGPGLHNLAWAQAVQNKPLN-EIFVMEAEQ--DDVSKRSSPASSVASVNSGAAAG 120

Query: 681  EDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISEFDK 860
            +DD                      V+D+    S D  E E  + E    EL   SE ++
Sbjct: 121  KDD---------------KKVVEKVVIDD----SGDEIEKEEGELEEGEIELDLESESNE 161

Query: 861  RIS-LILEELDTITVEE----------AETSFEAVCLRLRKSFEDLKPMFTGIESSDTXX 1007
            ++S  + EE+  I VE            + SFE VC +L  + E L+ +    E++    
Sbjct: 162  KVSEQVKEEMKLINVESIREALESVLRGDISFEGVCSKLEFTLESLRELVN--ENNVPTK 219

Query: 1008 XXXXQQAVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLV 1187
                Q A  A+Q+ +S   S+    K+QNK++L RLL  IK+    LF+  Q+KE++ ++
Sbjct: 220  DALIQLAFSAVQSVHSVFCSMNHVLKEQNKEILSRLLSVIKSHEPPLFSSNQIKEMEAML 279

Query: 1188 NSLVFEDNKKEKADLTNGPVVSLHGSNVLEKACLVPEALD----LPKLVLPTRS------ 1337
            +SLV   N KEK       ++++HG N  +   +   A++      K+ LP  S      
Sbjct: 280  SSLVTRANDKEK------DMLAMHGVNGKDSNIVTENAVNDLNFKEKVPLPVDSLMQNKP 333

Query: 1338 -----------KNRVDFSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQ 1484
                       ++R    PLLD H  ++ DSLPSPTRE  P   + + + +G  +  S  
Sbjct: 334  LEASKPGPPGYRSRGVLLPLLDPHKVHDVDSLPSPTRETTPSVPVQRALVVGDGVVKSWA 393

Query: 1485 PITAKNGGEDVMLHP-YVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXX 1661
                 +   +V   P Y TDAL+A SSYQ KFG NS      LPSPTPSE+ G G     
Sbjct: 394  AAAKLSHNAEVHKTPHYETDALRAFSSYQQKFGRNSFFMNSELPSPTPSEESGDGDGDTG 453

Query: 1662 XXXXXXXXCN-PRAVHAATQFQMVKSRS----SAPCTNSSI-----SNQSHPVEQ-LNPM 1808
                     + P+ V+  T  Q   S      S P   SS+     +N S P     NP+
Sbjct: 454  GEISSATAVDQPKPVNMPTLGQQPVSSQPMDISQPMDISSVQALTTANNSAPASSGYNPV 513

Query: 1809 SKP--ALKPALKRRDPRLKFVNNEVKSISDEGRVTEPDLLKSDPVGVATNSKKHKT-DES 1979
             KP   +K  +K RDPRL+F ++   +++ +      +  K +PVG   +S+K KT +E 
Sbjct: 514  VKPNPVVKAPIKSRDPRLRFASSNALNLNHQPAPILHNAPKVEPVGRVMSSRKQKTVEEP 573

Query: 1980 VARDHMMKKQRHKLTTS---REMETASGSG-WSEAKNVIPQPSFKIQVNENFQVDVRKS- 2144
            V     +K+QR+    S   R+ +   GSG W E  ++  +P     +N N  VD  +S 
Sbjct: 574  VLDGPALKRQRNGFENSGVVRDEKNIYGSGGWLEDTDMF-EPQI---MNRNLLVDSAESN 629

Query: 2145 ------GTAAAVPDKKPIFDNNFDELSGLRSIPKSNPTPTISLPSLLK--AVNPTILMQL 2300
                  G  + +    P        +SG    P + P+ T+SLP+LLK  AVNPT+L+ +
Sbjct: 630  SRKLDNGATSPITSGTPNV-----VVSGNEPAPATTPSTTVSLPALLKDIAVNPTMLLNI 684

Query: 2301 HQM-EQQRIAAENQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLI 2477
             +M +QQ++AA+ Q+K+    +DS+       +P ++  ++ T                I
Sbjct: 685  LKMGQQQKLAADAQQKS----NDSSMNTMHPPIPSSIPPVSVT--------------CSI 726

Query: 2478 PSQMASSSTQPDVARIRMKPRDPRRILHDNMVQ---------KNDGV-----------VY 2597
            PS + S     ++ ++RMKPRDPRR+LH N +Q         K DG            + 
Sbjct: 727  PSGILSKPMD-ELGKVRMKPRDPRRVLHGNALQRSGSLGPEFKTDGPSAPCTQGSKENLN 785

Query: 2598 QQSKIDAAASDP--QSSLVRLAAPLQLTKNLANVLXXXXXXXXXXXXXXCN-NQPVA--- 2759
             Q ++ A  + P    S+++     Q TKNL ++                + N P+    
Sbjct: 786  FQKQLGAPEAKPVLSQSVLQPDITQQFTKNLKHIADFMSVSQPLTSEPMVSQNSPIQPGQ 845

Query: 2760 --SQADQVIVEAASGELNDSETTSLLVSEAGSEKGTRQSANPWGDVDHLFDGYNDEQKAT 2933
              S AD   ++A     +D +T +    EAG      QSA  WGDV+HLF+GY+D+QKA 
Sbjct: 846  IKSGAD---MKAVVTNHDDKQTGTGSGPEAGPVGAHPQSA--WGDVEHLFEGYDDQQKAA 900

Query: 2934 IQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQ 3113
            IQKER+RR+ EQ KMF+ARK            NSAKF EVD VH++ILR+KEEQD+E   
Sbjct: 901  IQKERTRRLEEQKKMFSARKLCLVLDLDHTLLNSAKFHEVDPVHDEILRKKEEQDREKPH 960

Query: 3114 RHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGR 3293
            RHLFRF HMGMWTKLRPGIW FLE+AS L+E+HLYTMGNKLYATEMAKVLDP G LFAGR
Sbjct: 961  RHLFRFPHMGMWTKLRPGIWTFLERASKLFEMHLYTMGNKLYATEMAKVLDPKGVLFAGR 1020

Query: 3294 VISKGDDADTFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 3473
            VIS+GDD D FDGDERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP
Sbjct: 1021 VISRGDDGDPFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFP 1080

Query: 3474 SSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQ 3653
             SRRQFGLLGPSLLEIDHDER EDGTLASSL VIERLH+ FFSH SL DVDVRNILAAEQ
Sbjct: 1081 CSRRQFGLLGPSLLEIDHDERSEDGTLASSLGVIERLHKIFFSHQSLDDVDVRNILAAEQ 1140

Query: 3654 RKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            RKILAGCRIVFSR+FPVGEANPH+HPLWQTAEQFGAVCT  I
Sbjct: 1141 RKILAGCRIVFSRVFPVGEANPHLHPLWQTAEQFGAVCTKHI 1182


>emb|CBI35661.3| unnamed protein product [Vitis vinifera]
          Length = 1184

 Score =  751 bits (1938), Expect = 0.0
 Identities = 502/1188 (42%), Positives = 654/1188 (55%), Gaps = 59/1188 (4%)
 Frame = +3

Query: 393  DGETSDVDSSESLEEISAEDF-KQEGRAGR-------SRVWMGYPMS---------KNYG 521
            D E  ++  S S+EEIS EDF KQE R  R       +RVW    +            Y 
Sbjct: 54   DVEEGEISDSASVEEISEEDFNKQEVRVLREAKPKADTRVWTMRDLQDLYKYHQACSGYT 113

Query: 522  PSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVEDSNQEDDVXXX 701
            P LYN AWAQAVQNKPL          D+   +D  G    +  DV ++D +++++    
Sbjct: 114  PRLYNLAWAQAVQNKPLN---------DIFVIIDDSG----DEMDVKMDDVSEKEE---- 156

Query: 702  XXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISEFDKRISLILE 881
                              V DE   L  D+ E E + KE          E  +R+  I E
Sbjct: 157  ---GELEEGEIDLDSEPDVKDEGGVL--DVNEPEIDLKER---------ELVERVKSIQE 202

Query: 882  ELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGI---ESSDTXXXXXXQQAVMAIQTTY 1052
            +L+++TV EAE SF  VC RL+ +   L+ +F      ESS        QQ + AI+   
Sbjct: 203  DLESVTVIEAEKSFSGVCSRLQNTLGSLQKVFGEKVVGESSVPTKDALAQQLINAIRALN 262

Query: 1053 SALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSL--------VFED 1208
                S+   +K+ NK +  RLL  ++   S +F+++ +KE++ +++ L            
Sbjct: 263  HVFCSMNSNQKELNKDVFSRLLSCVECGDSPIFSIQHIKEVEVMMSFLDTPAAQSSAEAS 322

Query: 1209 NKKEKADLTNGPVVSLHGSNVLEKACLVPEALDLPKLVLPTRSKNRVDFSPLLDLHADYN 1388
            +K     +T+G   ++  S+V         A          + + R  F PLLDLH D++
Sbjct: 323  DKVNDVQVTDGMNRNILDSSVESSGRAFASA---------KKFRGRFIFGPLLDLHKDHD 373

Query: 1389 EDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPITAKNGGE--DVMLHPYVTDALKAVSS 1562
            EDSLPSPT +    F + K           S+ +TAK   E  D ++HPY TDALKAVS+
Sbjct: 374  EDSLPSPTGKAPQCFPVNK-----------SELVTAKVAHETQDSIMHPYETDALKAVST 422

Query: 1563 YQLKFGSNSILSTDRLPSPTPSEDGGQ------GXXXXXXXXXXXXXCNPRA-----VHA 1709
            YQ KFG  S L  D+LPSPTPSE+ G       G              N  A     V +
Sbjct: 423  YQQKFGLTSFLPIDKLPSPTPSEESGDTYGDISGEVSSSSTISAPITANAPALGHPIVSS 482

Query: 1710 ATQFQMVKSRSSAPCTNSSISNQSHPVEQLNPMSKPALKPALKRRDPRLKFVNNEVKSIS 1889
            A Q  +V+     P    +++++ + +          L+ + K RDPRL+  +++  S+ 
Sbjct: 483  APQMDIVQGLV-VPRNTGAVNSRFNSI----------LRASAKSRDPRLRLASSDAGSLD 531

Query: 1890 DEGRVTEP--DLLKSDPVGVATNSKKHKTDESVARDH-MMKKQRHKLTTSREMETASGSG 2060
               R      +  K DP+G   +S+K K+ E    D  + K+QR+ LT+           
Sbjct: 532  LNERPLPAVSNSPKVDPLGEIVSSRKQKSAEEPLLDGPVTKRQRNGLTS----------- 580

Query: 2061 WSEAKNVIPQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSGLRSIPKSNPT 2240
                      P+ K++             T   +   KP     +  ++G   +P    +
Sbjct: 581  ----------PATKLE----------SKVTVTGIGCDKP-----YVTVNGNEHLPVVATS 615

Query: 2241 PTISLPSLLK--AVNPTILMQL-HQMEQQRIAAENQKKNAVSTSDSANVLSVNELPGTVA 2411
             T SL SLLK  AVNP + M + +++EQQ+  + +  KN V    S ++L      G V 
Sbjct: 616  TTASLQSLLKDIAVNPAVWMNIFNKVEQQK--SGDPAKNTVLPPTSNSIL------GVVP 667

Query: 2412 SINSTPLKSKESGLNQPGISLIPSQMASSSTQPDVARIRMKPRDPRRILHDNMVQKNDGV 2591
              +  PLK    G    G   +P Q    + Q +  ++RMKPRDPRRILH N  Q++   
Sbjct: 668  PASVAPLKPSALGQKPAGALQVP-QTGPMNPQDESGKVRMKPRDPRRILHANSFQRSGSS 726

Query: 2592 VYQQSKIDAAASDPQSSLVRLAA--------PLQLTKNLANVLXXXXXXXXXXXXXX--- 2738
              +Q K +A   + Q+    + +          Q TKNL N+                  
Sbjct: 727  GSEQFKTNAQKQEDQTETKSVPSHSVNPPDISQQFTKNLKNIADLMSASQASSMTPTFPQ 786

Query: 2739 -CNNQPVASQADQVIVEAASGELNDSETTSLLVSEAGSEKGTRQSANPWGDVDHLFDGYN 2915
              ++Q V    D++ V+A   +  D  T +   S+  S  G  QS N WGDV+HLFDGY+
Sbjct: 787  ILSSQSVQVNTDRMDVKATVSDSGDQLTAN--GSKPESAAGPPQSKNTWGDVEHLFDGYD 844

Query: 2916 DEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQ 3095
            D+QKA IQ+ER+RRI EQ KMF+ARK            NSAKF+EVD VH++ILR+KEEQ
Sbjct: 845  DQQKAAIQRERARRIEEQKKMFSARKLCLVLDLDHTLLNSAKFVEVDPVHDEILRKKEEQ 904

Query: 3096 DKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTG 3275
            D+E SQRHLFRF HMGMWTKLRPGIWNFLEKAS LYELHLYTMGNKLYATEMAKVLDP G
Sbjct: 905  DREKSQRHLFRFPHMGMWTKLRPGIWNFLEKASKLYELHLYTMGNKLYATEMAKVLDPKG 964

Query: 3276 TLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 3455
             LFAGRVISKGDD D  DGDERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNKLNLIVVE
Sbjct: 965  VLFAGRVISKGDDGDVLDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVE 1024

Query: 3456 RYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRN 3635
            RYTYFP SRRQFGL GPSLLEIDHDERPEDGTLASSLAVIER+HQ+FFS+ +L +VDVRN
Sbjct: 1025 RYTYFPCSRRQFGLPGPSLLEIDHDERPEDGTLASSLAVIERIHQSFFSNRALDEVDVRN 1084

Query: 3636 ILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            ILA+EQRKILAGCRIVFSR+FPVGEANPH+HPLWQTAE FGAVCTNQI
Sbjct: 1085 ILASEQRKILAGCRIVFSRVFPVGEANPHLHPLWQTAESFGAVCTNQI 1132


>ref|XP_004980548.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Setaria italica]
          Length = 1074

 Score =  747 bits (1928), Expect = 0.0
 Identities = 455/984 (46%), Positives = 574/984 (58%), Gaps = 27/984 (2%)
 Frame = +3

Query: 909  AETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVMAIQTTYSALNSVTMQKKD 1088
            A  SFE  C RL   FE+LKP++    S          QA + I T  +  NS  + +K+
Sbjct: 81   AARSFEGTCARLHTCFENLKPLYPENGSPMPILDPLVHQAFIGIDTLTTVANSYNLPRKE 140

Query: 1089 QNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSLVFEDNKKEKADLTNGPVVSLHGSN 1268
            QNK +LL+LL HIKN+YS + TL+Q  E+D+ V  LVFED        T+G   S   S+
Sbjct: 141  QNKTMLLKLLFHIKNRYSSILTLDQRDELDSRVRQLVFEDKDNVNDPSTSGSAPSGQVSS 200

Query: 1269 ----VLEKACLVPEALDLPKLVLPTRSKNRVDFSPLLDLHADYNEDSLPSPTRENLPKFS 1436
                    A        LP+L +P +       SPLLDLHADY+E+SLPSPTR+N P F 
Sbjct: 201  GRLPYESGAANSFSGSSLPRLEIPAKR-----ISPLLDLHADYDENSLPSPTRDNAPPFP 255

Query: 1437 IPKPIGLGMVLPVSSQPITAKNGGEDVMLHPYVTDALKAVSSYQLKFGSNSILSTDRLPS 1616
            +PKPIG G    V  +P   +       ++P + D LKAVSSYQ K+G  S+  +D LPS
Sbjct: 256  VPKPIGFGAFPMVPERPSFPERESGRNSIYPSLNDPLKAVSSYQQKYGQKSVFPSDDLPS 315

Query: 1617 PTPS-EDGGQGXXXXXXXXXXXXXCNPRAVHAATQFQMVKSRSSAPCTNSSISNQSHP-- 1787
            PTPS +DG                  P+     +  QM  S+ +   + S+IS    P  
Sbjct: 316  PTPSGDDGKSADKGGDIFGEISSFPAPKKTALPSTSQMPASQPNT-TSGSNISYAGGPPG 374

Query: 1788 ----VEQLNPMSKPALKPALKRRDPRLKFVNNEVKSISDEG-RVTEPDLLKSD-PVGVAT 1949
                 EQL      ALK   K RDPRL+F+N +    +D   RV   DL   +   GV T
Sbjct: 375  YGKQAEQLAAGPNHALKATSKSRDPRLRFLNRDSAGATDANQRVNFSDLKDGNLGAGVPT 434

Query: 1950 NSKKHK-TDESVARDHMMKKQRHKLTTSREMETASGSGWSEAKNVIPQPSFKIQVNENFQ 2126
             ++KHK  DE    ++++K+ R               G  + +N++       Q+  N +
Sbjct: 435  INRKHKAVDEPQVDENVLKRFR--------------LGAGDPRNMLVPTGNPNQLMTNMR 480

Query: 2127 VDVRKSGTAAAVPDKKPIFDNNFDELSGLRSIPKSNPTPTISLP-SLLK--AVNPTILMQ 2297
                 SG  A  P   P             S P+ +  P +SLP SLLK  A NPT+LM 
Sbjct: 481  APPNSSG--ATTPFLHPT----------QSSAPQISAPPAVSLPSSLLKDIAGNPTVLMN 528

Query: 2298 LHQMEQQRIAAENQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLI 2477
              +MEQQ+++A   ++ A+    S+  +S+  + GTV    S P K+ E+         +
Sbjct: 529  WIKMEQQKMSASEPQQVAM----SSGTISIG-IAGTVLPPGSAP-KTTEAAQVPSVRPQV 582

Query: 2478 PSQMASSSTQPDVARIRMKPRDPRRILHDNMVQKNDGVVYQQSKIDAAASDPQSSLVRLA 2657
              Q    ++Q D   +RMKPRDPRRILH+++ QK D V  +Q K +    D Q +  +  
Sbjct: 583  LMQTPPLNSQNDTGILRMKPRDPRRILHNSIAQKTDAVGLEQHKSNGTKPDSQGTKDQTT 642

Query: 2658 APLQLTKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGELNDSETTSLLVS 2837
            + +     ++ +                +N  +A+ A     + AS  L   +   L V 
Sbjct: 643  SMVSQPALVSGI--ARPFTMSTKHVDPVSNSQLAATALMAPTQQASSSLTRVD-PRLAVE 699

Query: 2838 EAGSEKGTR----------QSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAA 2987
            + G                Q  NPWGDVDHL DGY+D+QKA IQKER+RRI EQ+KMF+A
Sbjct: 700  QNGHNAYAANAPATPLEAVQPVNPWGDVDHLLDGYDDQQKALIQKERARRITEQHKMFSA 759

Query: 2988 RKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPG 3167
            RK            NSAKFIEVD VHE+ILR+KEEQD+ M +RHL+RF HM MWTKLRPG
Sbjct: 760  RKLCLVLDLDHTLLNSAKFIEVDSVHEEILRKKEEQDRSMPERHLYRFHHMNMWTKLRPG 819

Query: 3168 IWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVP 3347
            IWNFL+KAS L+ELHLYTMGNKLYATEMAKVLDPTGTLFAGRVIS+GDD D FD DER+P
Sbjct: 820  IWNFLDKASKLFELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISRGDDGDPFDSDERLP 879

Query: 3348 KSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDH 3527
            KSKDLDGVLGMESAVVIIDDSVRVWPHN+ NLIVVERYTYFP SRRQFGL GPSLLEID 
Sbjct: 880  KSKDLDGVLGMESAVVIIDDSVRVWPHNRHNLIVVERYTYFPCSRRQFGLPGPSLLEIDR 939

Query: 3528 DERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVG 3707
            DERPEDGTLASSLAVIER+H NFFSH  L + DVR IL+ EQ++ILAGC IVFSR+FPVG
Sbjct: 940  DERPEDGTLASSLAVIERIHHNFFSHPKLNEADVRTILSDEQKRILAGCHIVFSRVFPVG 999

Query: 3708 EANPHMHPLWQTAEQFGAVCTNQI 3779
            +  PH+HPLWQTAEQFGAVCTN I
Sbjct: 1000 DTKPHLHPLWQTAEQFGAVCTNLI 1023


>ref|XP_006341905.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum tuberosum]
          Length = 1218

 Score =  745 bits (1924), Expect = 0.0
 Identities = 505/1233 (40%), Positives = 659/1233 (53%), Gaps = 101/1233 (8%)
 Frame = +3

Query: 384  RALDGETSDVDSSESLEEISAEDFKQE------------------GRAGRSRVWM----- 494
            R  D E  ++  S S+EEIS + F ++                       +RVW      
Sbjct: 6    RVEDVEEGEISDSASVEEISEDAFNRQDPPTTTKIKIASNENQNQNSTTTTRVWTMRDAY 65

Query: 495  GYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAY-DVIVED 671
             YP+S++Y   LYN AWAQAVQNKPL  +L  M++ +     +A    + +   DV V+D
Sbjct: 66   KYPISRDYARGLYNLAWAQAVQNKPLD-ELFVMTSDNSNQCANANANVESKVIIDVDVDD 124

Query: 672  SNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISE 851
              +E                                     E E E+ E D +    +  
Sbjct: 125  DAKE-------------------------------------EGELEEGEIDLDAADLVLN 147

Query: 852  FDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAV 1031
            F K  + + E+L ++T++E   SF  VC +L+ S   L  +    + +D       Q  +
Sbjct: 148  FGKEANFVREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKNDILI----QLFM 203

Query: 1032 MAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSL----V 1199
             A++T  S   S+   +K QN  +L RLL H K Q   L + EQ+KE+D ++ S+    V
Sbjct: 204  TALRTINSVFYSMNQDQKQQNTDILSRLLFHAKTQLPALLSSEQLKEVDAVILSINQSAV 263

Query: 1200 FEDNKKEKADLTNG-PVVSLHGSNVLEKAC-------LVPEALDLPKLVLPTRSKNRVDF 1355
            F + +    D  NG  VV L    V  K+              DL  + + +        
Sbjct: 264  FSNTQDN--DKVNGIKVVELLDKKVSHKSSENANQDFTAVNKYDLGAVSIKSSGLKEQSV 321

Query: 1356 S------------------PLLDLHADYNEDSLPSPTRENLPKFSIPKPIGL-GMV---L 1469
            S                  PLLDLH D++ED+LPSPTRE  P+F + K     GMV   L
Sbjct: 322  SFESVKPGLANSKAKGLSIPLLDLHKDHDEDTLPSPTREIGPQFPVAKATQAHGMVKLDL 381

Query: 1470 PVSSQPITAKNGGEDVMLHPYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGX 1649
            P+ +  +   N     +LHPY TDALKAVSSYQ KFG +S+  ++ LPSPTPSE+G  G 
Sbjct: 382  PIFAGSLEKGNS----LLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEGDSGK 437

Query: 1650 XXXXXXXXXXXXCNPRAVHAATQFQMVKSRSSAPCTNSSISNQSHPVEQLNPMS---KPA 1820
                         +  A H           SS P TN             +P+S    P+
Sbjct: 438  GDIGGEVTSLDVVH-NASHLNESSMGQPILSSVPQTNILDGQGLGTARTADPLSFLPNPS 496

Query: 1821 LKPAL-KRRDPRLKFVNNE-VKSISDEGRVTEPDL-LKSDPVGVATNSKKHKT-DESVAR 1988
            L+ +  K RDPRL+   ++ V   +++  +  PD+ LK +       SKK KT D  V  
Sbjct: 497  LRSSTAKSRDPRLRLATSDAVAQNTNKNILPIPDIDLKLEASLEMIGSKKQKTVDLPVFG 556

Query: 1989 DHMMKKQRHKLTTS---REMETASGSG-WSEAKNVIPQPSFKIQ-VNENFQVDVRK---- 2141
              + K+QR + T S    ++  ++G+G W E +     P        ++   D+RK    
Sbjct: 557  APLPKRQRSEQTDSIIVSDVRPSTGNGGWLEDRGTAGLPITSSNCATDSSDNDIRKLEQV 616

Query: 2142 SGTAAAVPDKKPIFDNNFDELSGLRSIPKSNPTPTISLPSLLK--AVNPTILMQLHQMEQ 2315
            + T A +P        NF         P +  + + +L SLLK  A+NP+I M + +MEQ
Sbjct: 617  TATIATIPSVIVNAAENF---------PVTGISTSTTLHSLLKDIAINPSIWMNIIKMEQ 667

Query: 2316 QRIAAENQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLIPSQMAS 2495
            Q+ +A+  +      S S ++L      G V S ++   +S   G    GI   P+  AS
Sbjct: 668  QK-SADASRTTTAQASSSKSIL------GAVPSTDAIAPRSSAIGQRSVGILQTPTHTAS 720

Query: 2496 SSTQPDVARIRMKPRDPRRILHDNMVQKNDGVVYQQSKIDAAAS---------------- 2627
            +    +VA +RMKPRDPRR+LH+  V K   V   Q K   A +                
Sbjct: 721  AD---EVAIVRMKPRDPRRVLHNTAVLKGGNVGSDQCKTGVAGTHATISNLGFQSQEDQL 777

Query: 2628 DPQSSLVRLAAP----LQLTKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAAS 2795
            D +S++     P     Q TKNL N+                  Q    Q+ Q   E   
Sbjct: 778  DRKSAVTLSTTPPDIARQFTKNLKNIADMISVSPSTSLSAASQTQTQCLQSHQSRSEGKE 837

Query: 2796 GELNDSETTSLLVSEAG--SEKGTRQSANP---WGDVDHLFDGYNDEQKATIQKERSRRI 2960
                 SE     V++AG  SEKG+  S  P   WGDV+HLF+GY+D+Q+A IQ+ER+RR+
Sbjct: 838  AVSEPSER----VNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERARRL 893

Query: 2961 AEQNKMFAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHM 3140
             EQ KMF+ RK            NSAKF+E+D VHE+ILR+KEEQD+E   RHLFRF HM
Sbjct: 894  EEQKKMFSVRKLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPCRHLFRFPHM 953

Query: 3141 GMWTKLRPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDAD 3320
            GMWTKLRPGIWNFLEKASNL+ELHLYTMGNKLYATEMAK+LDP G LFAGRVIS+GDD D
Sbjct: 954  GMWTKLRPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGD 1013

Query: 3321 TFDGDERVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLL 3500
             FDGDERVPKSKDL+GVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFP SRRQFGL 
Sbjct: 1014 PFDGDERVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLP 1073

Query: 3501 GPSLLEIDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRI 3680
            GPSLLEIDHDERPEDGTLAS L VI+R+HQNFF+H S+ + DVRNILA EQ+KILAGCRI
Sbjct: 1074 GPSLLEIDHDERPEDGTLASCLGVIQRIHQNFFAHRSIDEADVRNILATEQKKILAGCRI 1133

Query: 3681 VFSRIFPVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            VFSR+FPVGEANPH+HPLWQTAEQFGAVCT+QI
Sbjct: 1134 VFSRVFPVGEANPHLHPLWQTAEQFGAVCTSQI 1166


>ref|XP_002297869.2| CTD phosphatase-like protein 3 [Populus trichocarpa]
            gi|550347145|gb|EEE82674.2| CTD phosphatase-like protein
            3 [Populus trichocarpa]
          Length = 1190

 Score =  734 bits (1896), Expect = 0.0
 Identities = 500/1206 (41%), Positives = 655/1206 (54%), Gaps = 68/1206 (5%)
 Frame = +3

Query: 366  ERPRGGRAL---DGETSDVDSSESLEEISAEDF-KQE---------GRAGRSRVWM---- 494
            E   GGR+    D E  ++  + S+EEIS EDF KQE               +VW     
Sbjct: 5    ETAGGGRSSGIEDVEEGEISDTASVEEISEEDFNKQEVVIVKETPSSNNSSQKVWTVRDL 64

Query: 495  -GYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVED 671
              Y +   Y   LYN AWA+AVQNKPL          +LT               V+++D
Sbjct: 65   YKYQVGGGYMSGLYNLAWARAVQNKPLN---------ELT---------------VVIDD 100

Query: 672  SNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDISE 851
            S  E DV                        E  +   D+ ++EP   +S+G    D+  
Sbjct: 101  SGDEMDVVKVIDIEKEEG-------------ELEEGEIDL-DSEPVVVQSEGMVSVDV-- 144

Query: 852  FDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAV 1031
             + R+  I ++L++++V E E SFEAVCL+L K  E LK +  G ++S        Q   
Sbjct: 145  -ENRVKSIRKDLESVSVIETEKSFEAVCLKLHKVLESLKELVGGNDNSFPSKDGLVQLLF 203

Query: 1032 MAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEI------DTLVNS 1193
            MAI+   S   S+  + K+QNK +  R    + + Y   F+  Q KE+      D+L  +
Sbjct: 204  MAIRVVNSVFCSMNKKLKEQNKGVFSRFFSLLNSHYPPFFSPGQNKEVLNENHNDSLAKT 263

Query: 1194 LVFE-DNKKEKADLTNGPVVSLHGSNVLEKACLVPEALDLPKLVLPTRSKNRVDFSPLLD 1370
              ++     EK      P       N   K+   P+   +P        K+R    PLLD
Sbjct: 264  AGYDLTTMSEKL-----PAAETFVQNKPNKSIEAPKPPGVPSF------KSRGVLLPLLD 312

Query: 1371 LHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPITAKNG-GEDVMLHPYVTDAL 1547
            L   ++EDSLPSPT+E  P F + + + +G  +  S  P+       E+  +HPY TDAL
Sbjct: 313  LKKYHDEDSLPSPTQETTP-FPVQRLLAIGDGMVSSGLPVPKVTPVAEEPRMHPYETDAL 371

Query: 1548 KAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXC-NPRAVHAATQFQ 1724
            KAVSSYQ KF  NS   T+ LPSPTPSE+ G G               N R V+     Q
Sbjct: 372  KAVSSYQQKFNRNSFF-TNELPSPTPSEESGNGDGDTAGEVSSSSTVVNYRTVNPPVSDQ 430

Query: 1725 MVKSRSSAPCT------NSSISNQSHPVEQLNPMSK---PALKPALKRRDPRLKFVNNEV 1877
                 S  P        +SS      P     P+S      +K + K RDPRL++VN + 
Sbjct: 431  KNAPPSPPPLPPPPPHPDSSNIRGVVPTRNSAPVSSGPSSTIKASAKSRDPRLRYVNIDA 490

Query: 1878 KSISDEGRVTEP--DLLKSDPVGVATNSKKHKTDESVARDHMMKKQRHKLTTS---REME 2042
             ++    R      +L + +P G    SKKHK +E V  D  +K+QR+        R++E
Sbjct: 491  CALDHNQRALPMVNNLPRVEPAGAIVGSKKHKIEEDVLDDPSLKRQRNSFDNYGAVRDIE 550

Query: 2043 TASGSG-WSEAKNVI-PQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSGLR 2216
            + +G+G W E  ++  PQ   K Q  EN  V+   SG A     + P        ++G  
Sbjct: 551  SMTGTGGWLEDTDMAEPQTVNKNQWAENSNVN--GSGNA-----QSPFM--GISNITGSE 601

Query: 2217 SIPKSNPTPTISLPSLLK--AVNPTILMQLHQM-EQQRIAAENQKKNAVSTSDSANVLSV 2387
                ++ T T SLP LLK  AVNPT+L+ + +M +QQR+A + Q+  +     +++    
Sbjct: 602  QAQVTS-TATTSLPDLLKDIAVNPTMLINILKMGQQQRLALDGQQTLSDPAKSTSHPPIS 660

Query: 2388 NELPGTVASINSTPLKSKESGL-NQPGISLIPSQMASSSTQPDVARIRMKPRDPRRILHD 2564
            N + G + ++N     S+ SG+  +P  + +PSQ+A+S    +  +IRMKPRDPRR LH+
Sbjct: 661  NTVLGAIPTVNVA--SSQPSGIFPRPAGTPVPSQIATSD---ESGKIRMKPRDPRRFLHN 715

Query: 2565 NMVQKNDGVVYQQSKI-----------DAAASDPQSSLVRLAAPLQ------LTKNLANV 2693
            N +Q+   +  +Q K            D      Q  L  L   +        TK+L N+
Sbjct: 716  NSLQRAGSMGSEQFKTTTLTPTTQGTKDDQNVQKQEGLAELKPTVPPDISFPFTKSLENI 775

Query: 2694 LXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGE----LNDSETTSLLVSEAGSEKGT 2861
                            + Q VASQ  Q   E   G+    ++D +T      E  +   +
Sbjct: 776  ADILSVSQASTTPPFIS-QNVASQPMQTKSERVDGKTGISISDQKTGPASSPEVVA--AS 832

Query: 2862 RQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXXXXNSAK 3041
              S N W DV+HLF+GY+D+QKA IQ+ER+RR+ EQ KMFAARK            NSAK
Sbjct: 833  SHSQNTWKDVEHLFEGYDDQQKAAIQRERARRLEEQKKMFAARKLCLVLDLDHTLLNSAK 892

Query: 3042 FIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLYELHLYT 3221
             I    +H++ILR+KEEQD+E   RH+FR  HMGMWTKLRPGIWNFLEKAS L+ELHLYT
Sbjct: 893  AILSSSLHDEILRKKEEQDREKPYRHIFRIPHMGMWTKLRPGIWNFLEKASKLFELHLYT 952

Query: 3222 MGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDLDGVLGMESAVVII 3401
            MGNKLYATEMAKVLDP G LFAGRVIS+GDD D FDGDERVPKSKDL+GVLGMES VVII
Sbjct: 953  MGNKLYATEMAKVLDPKGVLFAGRVISRGDDGDPFDGDERVPKSKDLEGVLGMESGVVII 1012

Query: 3402 DDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPEDGTLASSLAVIER 3581
            DDSVRVWPHNKLNLIVVERY YFP SRRQFGL GPSLLEIDHDERPEDGTLA S AVIE+
Sbjct: 1013 DDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPEDGTLACSFAVIEK 1072

Query: 3582 LHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPHMHPLWQTAEQFGA 3761
            +HQNFF+H SL + DVRNILA+EQRKIL GCRI+FSR+FPVGE NPH+HPLWQ AEQFGA
Sbjct: 1073 IHQNFFTHRSLDEADVRNILASEQRKILGGCRILFSRVFPVGEVNPHLHPLWQMAEQFGA 1132

Query: 3762 VCTNQI 3779
            VCTNQI
Sbjct: 1133 VCTNQI 1138


>ref|XP_004252660.1| PREDICTED: RNA polymerase II C-terminal domain phosphatase-like
            3-like [Solanum lycopersicum]
          Length = 1211

 Score =  733 bits (1892), Expect = 0.0
 Identities = 497/1227 (40%), Positives = 651/1227 (53%), Gaps = 95/1227 (7%)
 Frame = +3

Query: 384  RALDGETSDVDSSESLEEISAEDFKQEGRAGRS---------------------RVWM-- 494
            R  D E  ++  S S+EEIS + F ++     S                     RVW   
Sbjct: 6    RVEDAEEGEISDSASVEEISEDAFNRQDPPTTSTTSKIKIASNENQNQNSTTATRVWTMR 65

Query: 495  ---GYPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIV 665
                YP+S++Y   LYN AWAQAVQNKPL  +L  M++ +     + + K      DV V
Sbjct: 66   DVYKYPISRDYARGLYNLAWAQAVQNKPLD-ELFVMTSDNSNQCANGESKV---IIDVDV 121

Query: 666  EDSNQEDDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEELQDI 845
            +D  +E                                     E E E+ E D +    +
Sbjct: 122  DDDAKE-------------------------------------EGELEEGEIDLDSADLV 144

Query: 846  SEFDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQ 1025
              F K  + I E+L ++T++E   SF  VC +L+ S   L  +    + +D       Q 
Sbjct: 145  VNFGKEANFIREQLQSVTLDETHKSFSMVCSKLQTSLLALGELALSQDKNDILI----QL 200

Query: 1026 AVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSL--- 1196
             + A++T  S   S+   +K QN  +L RLL + K Q   L + EQ+KE+D L+ S+   
Sbjct: 201  FMTALRTINSVFYSMNDHQKQQNTDILSRLLFNAKTQLPALLSSEQLKELDALILSINHS 260

Query: 1197 VFEDNKKEKADLTNGPVVSL------HGSN-----------------VLEKACLVPEALD 1307
            +   N ++   +    VV L      H S+                 V  K+  + E   
Sbjct: 261  LVSSNTQDNDTVNGINVVQLLDMKDSHKSSENANQDFTSVNKYDLGDVSIKSSGLKEQSV 320

Query: 1308 LPKLVLP--TRSKNRVDFSPLLDLHADYNEDSLPSPTRENLPKFSIPKPIGLGMV-LPVS 1478
              + V P    SK +    PLLDLH D++ED+LPSPTR+  P+F   +  G+  + LP+ 
Sbjct: 321  SSESVKPGLDNSKAKGLSFPLLDLHKDHDEDTLPSPTRQIGPQFPATQTHGMVKLDLPIF 380

Query: 1479 SQPITAKNGGEDVMLHPYVTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXX 1658
               +   N     +LHPY TDALKAVSSYQ KFG +S+  ++ LPSPTPSE+   G    
Sbjct: 381  PASLDKGNS----LLHPYETDALKAVSSYQQKFGRSSLFVSENLPSPTPSEEDDSGKGDT 436

Query: 1659 XXXXXXXXXCNPRAVHAATQFQMVKSRSSAPCTNSSISNQSHPVEQLNPMS---KPALKP 1829
                      +  A H           SS P TN             +P+S    P+L+ 
Sbjct: 437  GGEVTSFDVVH-NASHLNESSMGQPILSSVPQTNILDGQGLGTTRTADPLSFLPNPSLRS 495

Query: 1830 AL-KRRDPRLKFVNNEVKSISDEGRVTEPDLLKSDPVGVATNSKKHKTDESVARDHMMKK 2006
            +  K RDPRL+   ++  + +    + + DL     + +  + K+   D S     + K+
Sbjct: 496  STAKSRDPRLRLATSDTVAQNTILPIPDIDLKLEASLEMIVSKKQKTVDLSAFDAPLPKR 555

Query: 2007 QRHKLTTS---REMETASGSG-WSEAKNVIPQPSFKIQ-VNENFQVDVRK----SGTAAA 2159
            QR + T S    ++  + G+G W E +     P         N   D+RK    + T A 
Sbjct: 556  QRSEQTDSIIVSDVRPSIGNGGWLEDRGTAELPITSSNCATYNSDNDIRKLEQVTATIAT 615

Query: 2160 VPDKKPIFDNNFDELSGLRSIPKSNPTPTISLPSLLK--AVNPTILMQLHQMEQQRIAAE 2333
            +P        NF         P +  + + +L SLLK  A+NP+I M + + EQQ+ +A+
Sbjct: 616  IPSVIVNAAENF---------PVTGISTSTTLHSLLKDIAINPSIWMNIIKTEQQK-SAD 665

Query: 2334 NQKKNAVSTSDSANVLSVNELPGTVASINSTPLKSKESGLNQPGISLIPSQMASSSTQPD 2513
              + N    S S ++L    +P TVA       +S   G    GI   P+  AS+    +
Sbjct: 666  ASRTNTAQASSSKSILGA--VPSTVA----VAPRSSAIGQRSVGILQTPTHTASAD---E 716

Query: 2514 VARIRMKPRDPRRILHDNMVQKNDGVVYQQSKIDAAAS----------------DPQSSL 2645
            VA +RMKPRDPRR+LH   V K   V   Q K   A +                D +S++
Sbjct: 717  VAIVRMKPRDPRRVLHSTAVLKGGSVGLDQCKTGVAGTHATISNLSFQSQEDQLDRKSAV 776

Query: 2646 VRLAAP----LQLTKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGELNDS 2813
                 P     Q TKNL N+                  Q +  QA Q   E        S
Sbjct: 777  TLSTTPPDIACQFTKNLKNIADMISVSPSTSPSVASQTQTLCIQAYQSRSEVKGAVSEPS 836

Query: 2814 ETTSLLVSEAG--SEKGTRQSANP---WGDVDHLFDGYNDEQKATIQKERSRRIAEQNKM 2978
            E     V++AG  SEKG+  S  P   WGDV+HLF+GY+D+Q+A IQ+ER+RR+ EQ KM
Sbjct: 837  EW----VNDAGLASEKGSPGSLQPQISWGDVEHLFEGYSDQQRADIQRERTRRLEEQKKM 892

Query: 2979 FAARKXXXXXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKL 3158
            F+ RK            NSAKF+E+D VHE+ILR+KEEQD+E   RHLFRF HMGMWTKL
Sbjct: 893  FSVRKLCLVLDLDHTLLNSAKFVEIDPVHEEILRKKEEQDREKPYRHLFRFPHMGMWTKL 952

Query: 3159 RPGIWNFLEKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDE 3338
            RPGIWNFLEKASNL+ELHLYTMGNKLYATEMAK+LDP G LFAGRVIS+GDD D FDGDE
Sbjct: 953  RPGIWNFLEKASNLFELHLYTMGNKLYATEMAKLLDPKGDLFAGRVISRGDDGDPFDGDE 1012

Query: 3339 RVPKSKDLDGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLE 3518
            RVPKSKDL+GVLGMESAVVIIDDSVRVWPHNKLNLIVVERY YFP SRRQFGL GPSLLE
Sbjct: 1013 RVPKSKDLEGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLE 1072

Query: 3519 IDHDERPEDGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIF 3698
            IDHDERPEDGTLAS L VI+R+HQNFF+H S+ + DVRNILA EQ+KILAGCRIVFSR+F
Sbjct: 1073 IDHDERPEDGTLASCLGVIQRIHQNFFTHRSIDEADVRNILATEQKKILAGCRIVFSRVF 1132

Query: 3699 PVGEANPHMHPLWQTAEQFGAVCTNQI 3779
            PVGEA+PH+HPLWQTAEQFGAVCT+QI
Sbjct: 1133 PVGEASPHLHPLWQTAEQFGAVCTSQI 1159


>gb|EMS65645.1| RNA polymerase II C-terminal domain phosphatase-like 3 [Triticum
            urartu]
          Length = 1119

 Score =  732 bits (1889), Expect = 0.0
 Identities = 483/1159 (41%), Positives = 640/1159 (55%), Gaps = 67/1159 (5%)
 Frame = +3

Query: 504  MSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSAGDLTDNVDAKGKPQEEAYDVIVEDSNQE 683
            MS++Y P+ ++FAWAQAVQNKPL     P  A D            E+  + +V+ S++E
Sbjct: 1    MSRSYAPAFHSFAWAQAVQNKPL----VPRPAAD------------EDEVEHLVDTSDEE 44

Query: 684  DDVXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPE---KKESDGEELQDISEF 854
             +                         ET  L SD  +       ++     E  D  +F
Sbjct: 45   KEEGEIEEGEAVQSTSPPIKQP-----ETIDLDSDAQDKSESVDMEQTRLAVEAADELDF 99

Query: 855  DKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXXXXQQAVM 1034
            D+R+  ILEEL+ +++EEAE SFEA C RLR  FE LKP+F    S         QQA +
Sbjct: 100  DQRVGSILEELERLSIEEAEKSFEASCARLRSCFESLKPLFPESGSPMPMLDALVQQAFV 159

Query: 1035 AIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNSLVF---E 1205
             I T  +  NS  M K++QNK +LL+LL HIKN+YS +  L Q  E+D+ V  LVF   E
Sbjct: 160  GIDTITTVANSYAMPKREQNKNMLLKLLFHIKNRYSDMLALNQRDELDSRVRQLVFVDGE 219

Query: 1206 DNKKEKADLTNGPVVSLHGSNVLEK------ACLVPEALDLPKLVLPTRSKNRVDFSPLL 1367
            DN           VV   G    ++      A   P     P   +P  + NR+  SPLL
Sbjct: 220  DNAGSNCSTKTVNVVVPSGQVPSDRLPVESGAANPPRGSSFPSWEIP--ANNRI-VSPLL 276

Query: 1368 DLHADYNEDSLPSPTRENLPKFSIPKPIGLGMVLPVSSQPITAKN-GGEDVMLHPYVTDA 1544
            DLHADY+E+SLPSPTR + P FS+PKPIG G+      +  +A+      + L+P V DA
Sbjct: 277  DLHADYDENSLPSPTRVSAPPFSVPKPIGFGVFPMAPDRYFSAERIDPSKIFLYPCVNDA 336

Query: 1545 LKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNPRAVHAATQFQ 1724
            LK VSSY+ K+G  S  ++D LPSPTPS+DG +               +     A     
Sbjct: 337  LKDVSSYRQKYGPTSTFASDDLPSPTPSDDGDKSGDKEGDIFGEVSSFSASNKSAPPSGN 396

Query: 1725 MVKSRSSAPCTNSSISNQSHP------VEQLNPMSKPALKPALKRRDPRLKFVNNEVKSI 1886
            ++ +   +   +S+ S    P      +EQ       ALKP+ K RDPRL+F+N +    
Sbjct: 397  LMPASRPSAVISSNDSFAGGPPGYAKQIEQSVSGPSHALKPSAKSRDPRLRFLNRDSGGT 456

Query: 1887 SDEG---RVTEPDLLKSDPVG--VATNSKKHK-TDESVARDHMMKKQRHKLTTSREMETA 2048
            +D      + EP+  K   +G  V+ NS+KHK TD+ +  + ++K+ R    + R++   
Sbjct: 457  ADANIHVNLAEPNASKDGTLGGVVSDNSRKHKATDQPLMDETVLKRARESTGSPRDILVP 516

Query: 2049 SGSGWSEAKNVIPQPSFKIQVNENFQVDVRKS------------GTAAAVPDKKPIFDNN 2192
             G    +  N+      ++Q N++  ++ + +               +++PD        
Sbjct: 517  PGR---DGSNISSYSGDRVQSNKHTGLETKTARNPSIRTSSQLISNVSSIPDSTGTL--- 570

Query: 2193 FDELSGLRSIPKSNPTPTISLPSLLK--AVNPTILMQLHQME-QQRIAAENQKKNAVSTS 2363
              + S   S+P+++  P +SLP++LK  AVNPT+LM   QME Q+R A+E Q  + + +S
Sbjct: 571  --QASQPNSVPQTSAAPIVSLPAVLKDIAVNPTVLMHWIQMEHQKRSASEPQPASGIISS 628

Query: 2364 DSANVLSVNEL--PGTVASINSTPLKSKESGLNQPGISLIPSQMASSSTQPDVARIRMKP 2537
               N ++   +  PG         LK+ E            SQ AS ++Q D   IRMK 
Sbjct: 629  GMINNVTAGMVIPPGNA-------LKTAEVAHIPSYRPQATSQTASVNSQNDPGVIRMKA 681

Query: 2538 RDPRRILHDNMVQKNDGVVYQQSKIDAAA----SDPQSSLV---RLAAPLQLTK------ 2678
            RDPRR+LH+N  QKND +   Q+K +  A     D + +L+   +LA  LQ T       
Sbjct: 682  RDPRRVLHNNTSQKNDTLNSDQAKSNGIALPAFQDSKDNLINRQQLAEQLQTTVLPSQPV 741

Query: 2679 NLANVLXXXXXXXXXXXXXXCNNQPVAS-----QADQVIVEAASGELNDSETTSLLVSEA 2843
            +L+++                N+Q  AS     Q   V V  A   +   +  S   + A
Sbjct: 742  SLSSIARQSTMSASKVDPVS-NSQLAASSLIAPQESLVSVNRADPRVAAGQNDSNNAAPA 800

Query: 2844 GSEKGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXXXXXXXXX 3023
             +  GTR  AN WGD+D L +GY+D+QKA IQKER+RRI EQ+ MF++RK          
Sbjct: 801  -TTLGTRPPANQWGDLDDLLNGYDDQQKALIQKERARRIMEQHTMFSSRKLCLVLDLDHT 859

Query: 3024 XXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFLEKASNLY 3203
              NSAKFIEVD +HE+ILR+KEEQD+E S+RHLFRF HM MWTKLRPGIWNFLEKAS LY
Sbjct: 860  LLNSAKFIEVDPIHEEILRKKEEQDRERSERHLFRFHHMQMWTKLRPGIWNFLEKASKLY 919

Query: 3204 ELHLYTMGNKLYATEMAKVLDPTGTLFAGRV-------ISKGDDADTFDGDERVPKSKDL 3362
            ELHLYTMGNKLYATEMAKVLDP+GTLFAGRV       IS+G D DTFD D+RVPKSKDL
Sbjct: 920  ELHLYTMGNKLYATEMAKVLDPSGTLFAGRVISRGGDGISRGGDGDTFDSDDRVPKSKDL 979

Query: 3363 DGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPE 3542
            DGVLGMESAVVIIDDSVRVWPHNK N+IVVER+               SL  I  D    
Sbjct: 980  DGVLGMESAVVIIDDSVRVWPHNKNNMIVVERHV--------------SLSNIYSD---- 1021

Query: 3543 DGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPH 3722
                     VI R+HQNFFSH +L D DVR+IL++EQR+ILAGCRIVFSRIFPVGEANPH
Sbjct: 1022 ---------VIGRIHQNFFSHPNLNDADVRSILSSEQRRILAGCRIVFSRIFPVGEANPH 1072

Query: 3723 MHPLWQTAEQFGAVCTNQI 3779
            +HPLWQTAEQFGAVCTNQI
Sbjct: 1073 LHPLWQTAEQFGAVCTNQI 1091


>ref|XP_002512650.1| RNA polymerase II ctd phosphatase, putative [Ricinus communis]
            gi|223548611|gb|EEF50102.1| RNA polymerase II ctd
            phosphatase, putative [Ricinus communis]
          Length = 1195

 Score =  726 bits (1873), Expect = 0.0
 Identities = 489/1219 (40%), Positives = 638/1219 (52%), Gaps = 90/1219 (7%)
 Frame = +3

Query: 393  DGETSDVDSSESLEEISAEDF--------------------KQEGRAGRSRVWM-----G 497
            D E  ++  + S+EEIS EDF                    K++G  G  RVW       
Sbjct: 14   DVEEGEISDTASIEEISEEDFNKQDVVVVKPPSSNNETTKQKEQGN-GNGRVWTISDLYR 72

Query: 498  YPMSKNYGPSLYNFAWAQAVQNKPLGLDLKPMSA--GDLTDNVDAKGKPQEEAYDVIVED 671
            Y M   +   LYN AWAQAVQ+KP G   KP++    D+ + +D   K    +      +
Sbjct: 73   YQMVGGHVSGLYNLAWAQAVQSKP-GKSNKPLNELFADVVEELDESSKRSSPSSSAASVN 131

Query: 672  SNQEDD------VXXXXXXXXXXXXXXXXXXXXXVVDETTQLSSDIPENEPEKKESDGEE 833
            SN +D       V                     +VD   +   ++ E E +     GE+
Sbjct: 132  SNNKDGDEEKKKVVEKVVIDDNGDEMMDDNNRNKIVDVVEKEEGELEEGEIDLDMEPGEK 191

Query: 834  LQDISEFDKRISLILEELDTITVEEAETSFEAVCLRLRKSFEDLKPMFTGIESSDTXXXX 1013
              +    +  I       D + VE  E  FE     +R                      
Sbjct: 192  ANNGDVLNMNI-------DGLEVESGEKGFEKKMNSIR---------------------- 222

Query: 1014 XXQQAVMAIQTTYSALNSVTMQKKDQNKQLLLRLLIHIKNQYSVLFTLEQVKEIDTLVNS 1193
                          AL SVT++            ++   +   V F+    KE + L+++
Sbjct: 223  -------------DALESVTIE-----------FVLACTDSSGVSFSSFSEKEKEPLIST 258

Query: 1194 LVFEDNKKEKADLTNGPVVSLHGSNVLEKACLVPEALDL----PKLVLPTRSKNRVDFSP 1361
            +V   NKK+           +   N L     V    +L    PK  + +  K+R    P
Sbjct: 259  VV---NKKDNDVNGKSSGHDMSAVNKLPTDSFVNNKANLSIEGPKTGVSS-FKSRAALLP 314

Query: 1362 LLDLHADYNEDSLPSPTREN---LPKFSIPKPIGLGMVLPVSSQPITAKNGGEDVMLHPY 1532
            LLDLH D++ DSLPSPTRE+   LP + +  P    MVL   +             +HPY
Sbjct: 315  LLDLHKDHDADSLPSPTRESALPLPAYRVLTP---KMVLDTGNS-----------RMHPY 360

Query: 1533 VTDALKAVSSYQLKFGSNSILSTDRLPSPTPSEDGGQGXXXXXXXXXXXXXCNP-RAVHA 1709
             TDALKAVSSYQ KF  +S   TDRLPSPTPSE+ G G              +  R  + 
Sbjct: 361  ETDALKAVSSYQQKFSKSSFALTDRLPSPTPSEESGNGDGDTGGEVSSSLSVSSFRPANP 420

Query: 1710 ATQFQMVKSRSSAPCTNSSISNQSHPVEQLNPMSKPAL--KPALKRRDPRLKFVNNEVKS 1883
             T  Q   S S      SS+         +   S P+L  K + K RDPRL+FVN++  +
Sbjct: 421  LTSGQSNASISLPRMDGSSLPGVISIKSAVRASSAPSLTVKASAKSRDPRLRFVNSDSNA 480

Query: 1884 ISDEGR-VTEPDLLKSDPVGVATNSKKHK-TDESVARDHMMKKQRHKLTTS---REMETA 2048
            +    R V   + LK +P+G   N K+ K  D+ +   H +K+Q++ L  S   R+++T 
Sbjct: 481  LDQNHRAVPVVNTLKVEPIGGTMNKKRQKIVDDPIPDGHSLKRQKNALENSGVVRDVKTM 540

Query: 2049 SGSG-WSEAKNVI-PQPSFKIQVNENFQVDVRKSGTAAAVPDKKPIFDNNFDELSGLRSI 2222
             GSG W E  +++ PQ   K Q+ +N + D R+            I   N   +SG   I
Sbjct: 541  VGSGGWLEDTDMVGPQTMNKNQLVDNAESDPRRKDGGGVCTSSSCISSVN---ISGTEQI 597

Query: 2223 PKSNPT------------PTISLPSLLK--AVNPTILMQLHQM-EQQRIAAENQKKNAVS 2357
            P +  +             T ++P LLK  AVNPT+L+ + +M +QQR+A E Q+K    
Sbjct: 598  PVTGTSVPIGGELVPVKGSTAAIPDLLKNIAVNPTMLINILKMGQQQRLALEAQQKPVDP 657

Query: 2358 TSDSANVLSVNELPGTVASINSTPLKSKESGL--NQPGISLIPSQMASSSTQPDVARIRM 2531
               +   L+ N + GTV  + +       SG+     G   +  Q+    T  D+ +IRM
Sbjct: 658  AKSTTYPLNSNSMLGTVPVVGAA-----HSGILPRPAGTVQVSPQLG---TADDLGKIRM 709

Query: 2532 KPRDPRRILHDNMVQKNDGV-------------VYQQSKIDAAASDPQSSLVRLAAPLQ- 2669
            KPRDPRR+LH+N +Q+N  +             + Q++K +      +  + +   PLQ 
Sbjct: 710  KPRDPRRVLHNNALQRNGSMGSEHLKTNLTSIPINQETKDNQNLQKQEGQVEKKPVPLQS 769

Query: 2670 ---------LTKNLANVLXXXXXXXXXXXXXXCNNQPVASQADQVIVEAASGELNDSETT 2822
                      TKNL N+                   P ASQ  +  + ++   L      
Sbjct: 770  LALPDISMPFTKNLKNIADIVSVSHASTSQPLVPQNP-ASQPMRTTISSSDQFLGIGSAP 828

Query: 2823 SLLVSEAGSEKGTRQSANPWGDVDHLFDGYNDEQKATIQKERSRRIAEQNKMFAARKXXX 3002
                + A   +    + N WGDV+HLF+GYND+QKA IQ+ER+RRI EQ K+F+ARK   
Sbjct: 829  GAAAAAAAGPR----TQNAWGDVEHLFEGYNDQQKAAIQRERARRIEEQKKLFSARKLCL 884

Query: 3003 XXXXXXXXXNSAKFIEVDLVHEDILRRKEEQDKEMSQRHLFRFQHMGMWTKLRPGIWNFL 3182
                     NSAKF+EVD VH++ILR+KEEQD+E + RHLFRF HMGMWTKLRPGIWNFL
Sbjct: 885  VLDLDHTLLNSAKFVEVDPVHDEILRKKEEQDREKAHRHLFRFPHMGMWTKLRPGIWNFL 944

Query: 3183 EKASNLYELHLYTMGNKLYATEMAKVLDPTGTLFAGRVISKGDDADTFDGDERVPKSKDL 3362
            EKAS LYELHLYTMGNKLYATEMAKVLDPTG LF GRVIS+GDD + FDGDER+PKSKDL
Sbjct: 945  EKASKLYELHLYTMGNKLYATEMAKVLDPTGVLFNGRVISRGDDGEPFDGDERIPKSKDL 1004

Query: 3363 DGVLGMESAVVIIDDSVRVWPHNKLNLIVVERYTYFPSSRRQFGLLGPSLLEIDHDERPE 3542
            +GVLGMES VVI+DDSVRVWPHNKLNLIVVERY YFP SRRQFGL GPSLLEIDHDERPE
Sbjct: 1005 EGVLGMESGVVIMDDSVRVWPHNKLNLIVVERYIYFPCSRRQFGLPGPSLLEIDHDERPE 1064

Query: 3543 DGTLASSLAVIERLHQNFFSHNSLKDVDVRNILAAEQRKILAGCRIVFSRIFPVGEANPH 3722
            DGTLA SLAVIER+HQNFF+H SL + DVRNILA+EQRKILAGCRIVFSR+FPVGEANPH
Sbjct: 1065 DGTLACSLAVIERIHQNFFTHPSLDEADVRNILASEQRKILAGCRIVFSRVFPVGEANPH 1124

Query: 3723 MHPLWQTAEQFGAVCTNQI 3779
            +HPLWQTAEQFGAVCTNQI
Sbjct: 1125 LHPLWQTAEQFGAVCTNQI 1143

