BLASTX nr result

ID: Atropa21_contig00028681 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00028681
         (1469 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subuni...   725   0.0  
ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subuni...   723   0.0  
emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]   425   e-116
ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258...   422   e-115
ref|XP_002521936.1| conserved hypothetical protein [Ricinus comm...   389   e-105
gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus...   384   e-104
ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subuni...   374   e-101
ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subuni...   366   2e-98
ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subuni...   362   3e-97
ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Popu...   355   3e-95
ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subuni...   353   1e-94
gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]      352   2e-94
gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]      352   2e-94
gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma c...   352   2e-94
gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma c...   345   2e-92
gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]     340   1e-90
gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobro...   338   3e-90
gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus pe...   324   7e-86
gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlise...   297   9e-78
ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subuni...   295   4e-77

>ref|XP_004230345.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Solanum lycopersicum]
          Length = 660

 Score =  725 bits (1872), Expect = 0.0
 Identities = 383/494 (77%), Positives = 410/494 (82%), Gaps = 5/494 (1%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            MYCSTNCVVNS  FAGSLQDERSSTLNPAKLN+VL LF+GLHLHS +DVKENGD GSSKL
Sbjct: 91   MYCSTNCVVNSGAFAGSLQDERSSTLNPAKLNQVLNLFKGLHLHSLDDVKENGDRGSSKL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KIQEK+D+KGGEVSLEEWMGPSNAIEGYVPQRDR VNP LLKN+N+GSKNKHA +Q+EKN
Sbjct: 151  KIQEKVDLKGGEVSLEEWMGPSNAIEGYVPQRDRSVNPALLKNINKGSKNKHARLQDEKN 210

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSS---KEAQTKTRNEVRDD-VSILEKHVDA 940
            MILNE DFSS IITQDEYS+SKFPAPVNA S+   KE Q KTR +VRDD V IL K VDA
Sbjct: 211  MILNEFDFSSTIITQDEYSVSKFPAPVNADSNVKFKETQAKTRYKVRDDDVYILGKQVDA 270

Query: 939  LQLRSGEETEKSDKNNRCFKVDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGRKYASDGA 760
            LQLRSGEETEKSDKN R  KVDK NSGEVSSG SQHDVKNKS  VL MS  GRKYAS G 
Sbjct: 271  LQLRSGEETEKSDKNTRFLKVDKFNSGEVSSGPSQHDVKNKS--VLIMSDDGRKYASHGE 328

Query: 759  QDXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTESSS-ISEDENQAYERSGSTDME 583
             D              +M+RSVTWADE+ID G G KTESSS ISE E+QAY  S STDME
Sbjct: 329  HDKLKSSLKSSNSK--KMSRSVTWADESIDGGIGKKTESSSKISEYESQAYGGSASTDME 386

Query: 582  EDDDSYRFXXXXXXXXXXXXXXXXXXSGSDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEM 403
            E+DDSYRF                  SGSDVPDAVSKAGIVI PP QEVDEAI QE DEM
Sbjct: 387  ENDDSYRFESAEACAAALSQAAEAVASGSDVPDAVSKAGIVILPPSQEVDEAILQETDEM 446

Query: 402  LDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLA 223
            LD E APLKWPRKPG+P++DVFESE++WYDSPPEGF+MTLSPF TMFNSLFTWISS SLA
Sbjct: 447  LDLETAPLKWPRKPGMPNYDVFESEDSWYDSPPEGFNMTLSPFGTMFNSLFTWISSSSLA 506

Query: 222  FIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPV 43
            FIYGHDESNNEEYLS+NGREYPRKIVLSDGRSTEIK+TLAGCLARALPGLVADLRLPVP+
Sbjct: 507  FIYGHDESNNEEYLSINGREYPRKIVLSDGRSTEIKQTLAGCLARALPGLVADLRLPVPI 566

Query: 42   STLEQGMGLFLDTM 1
            STLEQGM L L+TM
Sbjct: 567  STLEQGMVLLLNTM 580


>ref|XP_006358558.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Solanum tuberosum]
          Length = 662

 Score =  723 bits (1865), Expect = 0.0
 Identities = 387/495 (78%), Positives = 409/495 (82%), Gaps = 6/495 (1%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            MYCSTNCVVNS  FAGSLQDERSSTLNPAKLN+VL LF+GLHLHS EDVKENGDLGSSKL
Sbjct: 91   MYCSTNCVVNSGAFAGSLQDERSSTLNPAKLNQVLNLFKGLHLHSPEDVKENGDLGSSKL 150

Query: 1287 KIQEKMDVKGG-EVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEK 1111
            KIQEK+DVKGG EVSLEEWMGPSNAIEGYVPQRDR VNP LLKN+N+G KNKHA +Q+EK
Sbjct: 151  KIQEKVDVKGGGEVSLEEWMGPSNAIEGYVPQRDRSVNPALLKNINKGFKNKHARLQDEK 210

Query: 1110 NMILNEIDFSSIIITQDEYSISKFPAPVNAVSS---KEAQTKTRNEVRDD-VSILEKHVD 943
            NMILNE DFSS IITQDEYS+SKFPAPVNAVSS   KEAQ KTR +VRDD VSIL K VD
Sbjct: 211  NMILNEFDFSSTIITQDEYSVSKFPAPVNAVSSEKFKEAQAKTRYKVRDDDVSILGKRVD 270

Query: 942  ALQLRSGEETEKSDKNNRCFKVDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGRKYASDG 763
            ALQLRSGEETEKSDKN R  KVDK NSGEVSSG SQHDVKNKS  VL MS  GRKYAS G
Sbjct: 271  ALQLRSGEETEKSDKNTRFLKVDKFNSGEVSSGPSQHDVKNKS--VLIMSDDGRKYASHG 328

Query: 762  AQDXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTESSS-ISEDENQAYERSGSTDM 586
              D             K+M++SVTWADE ID G G KTESSS ISE ENQAY  S STDM
Sbjct: 329  EHDKQLLKSSLKSSNSKKMSQSVTWADEIIDGGIGKKTESSSKISEYENQAYGGSASTDM 388

Query: 585  EEDDDSYRFXXXXXXXXXXXXXXXXXXSGSDVPDAVSKAGIVIFPPPQEVDEAIHQEKDE 406
            EEDDDSYRF                  SGSDVPDAVSKAGIVI P  QEVDEAI QE  E
Sbjct: 389  EEDDDSYRFESAEACAAALSQAAEAVASGSDVPDAVSKAGIVILPTSQEVDEAILQET-E 447

Query: 405  MLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSL 226
            MLD EPAPLKWPRKPG+P++DVFESE+ WYD PPEGF+MTLSPFATMFNSLFTWISS SL
Sbjct: 448  MLDIEPAPLKWPRKPGMPNYDVFESEDCWYDGPPEGFNMTLSPFATMFNSLFTWISSSSL 507

Query: 225  AFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVP 46
            AFIYGHDE+NNEEYLS+NGREYP KIVLSDG STEIK+TLAGCLARALPGLVADLRLPVP
Sbjct: 508  AFIYGHDENNNEEYLSINGREYPHKIVLSDGLSTEIKQTLAGCLARALPGLVADLRLPVP 567

Query: 45   VSTLEQGMGLFLDTM 1
            +STLEQGM L L+TM
Sbjct: 568  ISTLEQGMVLLLNTM 582


>emb|CAN62034.1| hypothetical protein VITISV_014731 [Vitis vinifera]
          Length = 659

 Score =  425 bits (1092), Expect = e-116
 Identities = 231/494 (46%), Positives = 324/494 (65%), Gaps = 5/494 (1%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            MYCS+ CVVNSR+FAGSLQ+ER S LN  ++N +L+LF    L S++ + ++GDLG S+L
Sbjct: 91   MYCSSGCVVNSRSFAGSLQEERCSVLNSERINGILRLFGESSLESNKILGKHGDLGLSEL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KI+E ++ K GEVS+E+W+GPSNAIEGYVPQRDR + P  +KN   GSK+ ++ + + KN
Sbjct: 151  KIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRDRNLKPKNIKNHKEGSKSSNSKMDSGKN 210

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSS--KEAQTKTRNEVRDDVSILEKHVDALQ 934
             +++E+DF S IIT+DEYSISK    +   +S  K  + K +  + D +S+LEK    +Q
Sbjct: 211  FVIDEMDFVSTIITKDEYSISKSSKGLKDTTSHAKSKEPKEKASIGDQLSMLEKSAPPIQ 270

Query: 933  LRSGEETEKSD-KNNRCFKVDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGRKYASDGAQ 757
              S  +  +S  + +R    D+ ++ EV S  SQ       +E+  + G    +  + AQ
Sbjct: 271  NDSESKLRESKGRRSRVIFKDEFSTAEVPSVPSQ-----SGSELNGVKGKEEYHTENAAQ 325

Query: 756  -DXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTESSSISEDENQAYERSGSTDMEE 580
                           K++ RSVTWADE +D+            E + +     G  D+ +
Sbjct: 326  LGPTKPKSSLKPSGGKKVIRSVTWADEKMDSADSRDFCKVRELEVKKEDPNGLGDIDVGD 385

Query: 579  DDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEM 403
            DD++ RF                  SG +D+ DAVS+AGI+I P P+++DE    +  ++
Sbjct: 386  DDNALRFASAEACAVALSQAAEAVASGETDMTDAVSEAGIIILPHPRDMDEGESLKDADL 445

Query: 402  LDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLA 223
            L+ EP PLKWP KPG+   D+F+S+++WYD+PPEGF +TLSPFATM+ +LF WI+S S+A
Sbjct: 446  LEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIA 505

Query: 222  FIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPV 43
            +IYG DES +EEYLSVNGREYP+KIVL+DGRS+EIK+TLAGCL+RALPGLVADLRLP+PV
Sbjct: 506  YIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIKQTLAGCLSRALPGLVADLRLPIPV 565

Query: 42   STLEQGMGLFLDTM 1
            S LEQG+G  LDTM
Sbjct: 566  SNLEQGVGRLLDTM 579


>ref|XP_002280625.1| PREDICTED: uncharacterized protein LOC100258021 [Vitis vinifera]
            gi|296089830|emb|CBI39649.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  422 bits (1084), Expect = e-115
 Identities = 230/494 (46%), Positives = 322/494 (65%), Gaps = 5/494 (1%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            MYCS+ CVVNSR+FAGSLQ+ER S LN  ++N +L+LF    L S++ + ++GDLG S+L
Sbjct: 91   MYCSSGCVVNSRSFAGSLQEERCSVLNSERINGILRLFGESSLESNKILGKHGDLGLSEL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KI+E ++ K GEVS+E+W+GPSNAIEGYVPQRDR + P  +KN   GSK+ ++ + + KN
Sbjct: 151  KIRENVEKKAGEVSMEDWIGPSNAIEGYVPQRDRNLKPKNIKNRKEGSKSSNSKMDSGKN 210

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSS--KEAQTKTRNEVRDDVSILEKHVDALQ 934
             +++E+DF   IIT+DEYSISK    +   +S  K  + K +  + D +S+LEK    +Q
Sbjct: 211  FVIDEMDFVRTIITEDEYSISKSSKGLKDTTSHAKSKEPKEKASIGDQLSMLEKSAPPIQ 270

Query: 933  LRSGEETEKSD-KNNRCFKVDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGRKYASDGAQ 757
              S  +  +S  + +R    D+ ++ EV S  SQ       +E+  + G    +  + AQ
Sbjct: 271  NDSESKLRESKGRRSRVIFKDEFSTAEVPSVPSQ-----SGSELNGVKGKEEYHTENAAQ 325

Query: 756  -DXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTESSSISEDENQAYERSGSTDMEE 580
                           K++ RSVTWADE +D+            E + +     G  D+ +
Sbjct: 326  LGPTKLKSCLKPSGGKKVTRSVTWADEKMDSADSRDFCKVRELEVKKEDPNGLGDIDVGD 385

Query: 579  DDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEM 403
            DD++ RF                  SG +D+ DAVS+A I+I P P+++DE    +  ++
Sbjct: 386  DDNALRFASAEACAIALSQAAEAVASGETDMTDAVSEARIIILPHPRDMDEGESLKDADL 445

Query: 402  LDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLA 223
            L+ EP PLKWP KPG+   D+F+S+++WYD+PPEGF +TLSPFATM+ +LF WI+S S+A
Sbjct: 446  LEPEPVPLKWPIKPGISHSDIFDSDDSWYDTPPEGFSLTLSPFATMWMALFAWITSSSIA 505

Query: 222  FIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPV 43
            +IYG DES +EEYLSVNGREYP+KIVL+DGRS+EIK+TLAGCLARALPGLVADLRLP+PV
Sbjct: 506  YIYGRDESFHEEYLSVNGREYPKKIVLTDGRSSEIKQTLAGCLARALPGLVADLRLPIPV 565

Query: 42   STLEQGMGLFLDTM 1
            S LEQG+G  LDTM
Sbjct: 566  SNLEQGVGRLLDTM 579


>ref|XP_002521936.1| conserved hypothetical protein [Ricinus communis]
            gi|223538861|gb|EEF40460.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 645

 Score =  389 bits (998), Expect = e-105
 Identities = 228/496 (45%), Positives = 302/496 (60%), Gaps = 7/496 (1%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            MYCS++C+VNSR F+ SLQ++R S LNP KLNE+L+ F  L L  SE +  +GDLG S L
Sbjct: 91   MYCSSSCLVNSRAFSESLQEKRCSVLNPIKLNEILRKFNDLTL-DSEGLGRSGDLGLSNL 149

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KIQEK +   G+VSLEEW+GPSNAIEGYVPQ DR  NP L KN   G K       ++++
Sbjct: 150  KIQEKSETNVGKVSLEEWIGPSNAIEGYVPQGDRDPNPSL-KNHKEGLKAICKKPVSKQD 208

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSSK---EAQTKTRNE-VRDDVSILEKH--V 946
               ++ DF+S IIT DEYSISK P+ + + +S    +AQT   +E +   +S L K   +
Sbjct: 209  CFFSDTDFTSTIITNDEYSISKGPSGLTSTASDIKLQAQTGKGHEGLNAQLSSLRKQDSI 268

Query: 945  DALQLRSGEETEKSDKNNRCFKVDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGRKYASD 766
             A +   G   EK  K    F+   L S    +  ++   +   A  LN S       S 
Sbjct: 269  KASRKSKGRRKEKVIKEQLNFQ--DLPSSSYYTAEAEDISQATGAANLNESVLKPSLKSS 326

Query: 765  GAQDXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTESSSISEDENQAYERSGSTDM 586
            GA+               R  RSVTWADE +DN            E  N+++E S S + 
Sbjct: 327  GAK---------------RSNRSVTWADERVDNAGSRNLCEVQEMEQTNESHEISESANK 371

Query: 585  EEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKD 409
             +D    RF                  SG +DV  A+S+AGI++ PP Q++ +  + EK+
Sbjct: 372  GDDGHMLRFESAEACAVALSQAAEAVASGDADVNKAMSEAGIIVLPPSQDLGQGGNVEKN 431

Query: 408  EMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLS 229
            +M++ E A LKWP KPG+P  D+F+ E++WYD+PPEGF +TLSPFATM+ +LF W++S S
Sbjct: 432  DMIEQESASLKWPTKPGIPQSDLFDPEDSWYDAPPEGFSLTLSPFATMWMALFAWVTSSS 491

Query: 228  LAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPV 49
            LA+IYG DES +E+YLSVNGREYPRKIVL DGRS+EI+ T   CLAR  PGLVA+LRLP+
Sbjct: 492  LAYIYGRDESAHEDYLSVNGREYPRKIVLRDGRSSEIRLTAESCLARTFPGLVANLRLPI 551

Query: 48   PVSTLEQGMGLFLDTM 1
            PVSTLEQG G  L+TM
Sbjct: 552  PVSTLEQGAGRLLETM 567


>gb|ESW17761.1| hypothetical protein PHAVU_007G265900g [Phaseolus vulgaris]
          Length = 706

 Score =  384 bits (987), Expect = e-104
 Identities = 231/538 (42%), Positives = 311/538 (57%), Gaps = 49/538 (9%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CS+NCVV+S+ F+G LQ ER S L+P KLN VL LFE L+L  +E+V ++GDLG S L
Sbjct: 91   MFCSSNCVVSSKAFSGILQAERCSALDPEKLNNVLGLFENLNLEQTENVPKDGDLGLSNL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KIQEK     GEV LE+W+GPSNAIEGYVP+     + GL KNV +GSK  H    N+K+
Sbjct: 151  KIQEKTVTTSGEVPLEQWVGPSNAIEGYVPKPRERESKGLRKNVKKGSKAGHGKSNNDKD 210

Query: 1107 MILNEIDFSSIIITQDEYSISKF-PAPVNAVSS---KEAQTKTRNEVRDDVSILEKHVDA 940
            +I +E++F S II QDEYS+SK  P   +  +    K      + E +  + ++ K  D+
Sbjct: 211  LINSEMNFVSTIIMQDEYSVSKASPGQTDTTAHHQIKPTAVDRQQEEKVGLKVVRKDEDS 270

Query: 939  LQ-----LRSGEETEKSDKNNRCFK-------------VDKLNSGEVSSGHSQHDV-KNK 817
            +Q       SG     S+K     K             + K ++  VS     +DV KN 
Sbjct: 271  IQDLSSSFESGLHLSASEKGKEVSKSCEVVVKSTPNLAIKKKDAHSVSISERHYDVEKNN 330

Query: 816  SAE------------VLNMSGAGRKYASDGAQDXXXXXXXXXXXXXK-----------RM 706
            SA              +N   +   +  D  ++             K           ++
Sbjct: 331  SARKSVQLKGETSRVTVNGDASTSNFDPDNVKEKFQVEKVGGLCETKLKSSLKSAGEKKL 390

Query: 705  ARSVTWADENIDNGAGNKTESSSISE--DENQAYERSGSTDMEEDDDSYRFXXXXXXXXX 532
            +R+VTWADE I NGAGNK +   + E  D  +  E  G+ D+  ++D  R          
Sbjct: 391  SRTVTWADEKI-NGAGNK-DLCEVKEFGDIIKESESVGNEDVANNEDMLRQASAEACAIA 448

Query: 531  XXXXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGV 355
                     SG SD  DAVS+AGI+I P P +  E    E  ++L  +   LKWPRKPG+
Sbjct: 449  LSQASEAVASGDSDATDAVSEAGIIILPQPHDAVEEGTMEDADILQNDSVTLKWPRKPGI 508

Query: 354  PSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSV 175
               D FES+++W+D+PPEGF +TLSPFA M+N++F+W++S SLA+IYG DES +EEYLSV
Sbjct: 509  SDIDFFESDDSWFDAPPEGFSLTLSPFANMWNAIFSWMTSYSLAYIYGRDESFHEEYLSV 568

Query: 174  NGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            NGREYP K+VLSDGRS+EIK+T AGCLARA P LVA LRLP+P+STLEQGM   L+TM
Sbjct: 569  NGREYPCKVVLSDGRSSEIKQTFAGCLARAFPALVAGLRLPIPISTLEQGMACLLETM 626


>ref|XP_003519102.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X1 [Glycine max]
          Length = 706

 Score =  374 bits (960), Expect = e-101
 Identities = 232/539 (43%), Positives = 311/539 (57%), Gaps = 50/539 (9%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CS+NC+V+S+TFAGSLQ ER S L+  KLN VL LFE L+L   E +++NGDLG S L
Sbjct: 91   MFCSSNCLVSSKTFAGSLQAERCSGLDLEKLNNVLSLFENLNLEPVETLQKNGDLGLSDL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KIQEK +   GEVSLE+W GPSNAIEGYVP+     + GL KNV +GSK  H    ++ N
Sbjct: 151  KIQEKTERSSGEVSLEQWAGPSNAIEGYVPKPRNRDSKGLRKNVKKGSKTGHGKSISDIN 210

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRD----DVSILEKHVDA 940
            +I +E+ F S II QDEYS+SK P P    ++   Q K    V+     D  ++ K  D+
Sbjct: 211  LINSEMGFVSTIIMQDEYSVSKVP-PGQMDATANHQIKPTATVKQPEKVDAEVVRKDDDS 269

Query: 939  LQ-----------LRSGEETEKSDKNNRCFKVDKLNSG---EVSSGHS-----------Q 835
            +Q           L + E+ E+  K+  C  V K + G   +    HS           Q
Sbjct: 270  IQDLSSSFKSSLILSTSEKEEEVTKS--CEAVLKFSPGCAIQKKDVHSISISERQCDVEQ 327

Query: 834  HDVKNKSAEV-----------------LNMSGAGRKYASD--GAQDXXXXXXXXXXXXXK 712
            +D   KS +V                 L+ +    K+  +  G                K
Sbjct: 328  NDSARKSVQVKGKTSRVIANDDASTSNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEK 387

Query: 711  RMARSVTWADENIDN-GAGNKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXX 535
            + +R+VTWADE I++ G+ +  E     + + ++     + D+  D+D  R         
Sbjct: 388  KFSRTVTWADEKINSTGSKDLCEFKEFGDIKKESDSVGNNIDVANDEDILRRASAEACAI 447

Query: 534  XXXXXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPG 358
                      SG SDV DAVS+AGI I PPP +  E    E  ++L  +   LKWPRK G
Sbjct: 448  ALSSASEAVASGDSDVSDAVSEAGITILPPPHDAAEEGTVEDADILQNDSVTLKWPRKTG 507

Query: 357  VPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLS 178
            +   D FES+++W+D+PPEGF +TLSPFATM+N+LF+W +S SLA+IYG DES +EEYLS
Sbjct: 508  ISEADFFESDDSWFDAPPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGRDESFHEEYLS 567

Query: 177  VNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            VNGREYP K+VL+DGRS+EIK+TLA CLARALP LVA LRLP+PVS +EQGM   L+TM
Sbjct: 568  VNGREYPCKVVLADGRSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQGMACLLETM 626


>ref|XP_006575272.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog isoform X2 [Glycine max]
          Length = 716

 Score =  366 bits (939), Expect = 2e-98
 Identities = 232/549 (42%), Positives = 311/549 (56%), Gaps = 60/549 (10%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CS+NC+V+S+TFAGSLQ ER S L+  KLN VL LFE L+L   E +++NGDLG S L
Sbjct: 91   MFCSSNCLVSSKTFAGSLQAERCSGLDLEKLNNVLSLFENLNLEPVETLQKNGDLGLSDL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KIQEK +   GEVSLE+W GPSNAIEGYVP+     + GL KNV +GSK  H    ++ N
Sbjct: 151  KIQEKTERSSGEVSLEQWAGPSNAIEGYVPKPRNRDSKGLRKNVKKGSKTGHGKSISDIN 210

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRD----DVSILEKHVDA 940
            +I +E+ F S II QDEYS+SK P P    ++   Q K    V+     D  ++ K  D+
Sbjct: 211  LINSEMGFVSTIIMQDEYSVSKVP-PGQMDATANHQIKPTATVKQPEKVDAEVVRKDDDS 269

Query: 939  LQ-----------LRSGEETEKSDKNNRCFKVDKLNSG---EVSSGHS-----------Q 835
            +Q           L + E+ E+  K+  C  V K + G   +    HS           Q
Sbjct: 270  IQDLSSSFKSSLILSTSEKEEEVTKS--CEAVLKFSPGCAIQKKDVHSISISERQCDVEQ 327

Query: 834  HDVKNKSAEV-----------------LNMSGAGRKYASD--GAQDXXXXXXXXXXXXXK 712
            +D   KS +V                 L+ +    K+  +  G                K
Sbjct: 328  NDSARKSVQVKGKTSRVIANDDASTSNLDPANVEEKFQVEKAGGSLKTKPRSSLKSAGEK 387

Query: 711  RMARSVTWADENIDN-GAGNKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXX 535
            + +R+VTWADE I++ G+ +  E     + + ++     + D+  D+D  R         
Sbjct: 388  KFSRTVTWADEKINSTGSKDLCEFKEFGDIKKESDSVGNNIDVANDEDILRRASAEACAI 447

Query: 534  XXXXXXXXXXSG-SDVPDAV----------SKAGIVIFPPPQEVDEAIHQEKDEMLDTEP 388
                      SG SDV DAV          S+AGI I PPP +  E    E  ++L  + 
Sbjct: 448  ALSSASEAVASGDSDVSDAVFSPMNETCAVSEAGITILPPPHDAAEEGTVEDADILQNDS 507

Query: 387  APLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGH 208
              LKWPRK G+   D FES+++W+D+PPEGF +TLSPFATM+N+LF+W +S SLA+IYG 
Sbjct: 508  VTLKWPRKTGISEADFFESDDSWFDAPPEGFSLTLSPFATMWNTLFSWTTSSSLAYIYGR 567

Query: 207  DESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQ 28
            DES +EEYLSVNGREYP K+VL+DGRS+EIK+TLA CLARALP LVA LRLP+PVS +EQ
Sbjct: 568  DESFHEEYLSVNGREYPCKVVLADGRSSEIKQTLASCLARALPALVAVLRLPIPVSIMEQ 627

Query: 27   GMGLFLDTM 1
            GM   L+TM
Sbjct: 628  GMACLLETM 636


>ref|XP_003537129.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Glycine max]
          Length = 706

 Score =  362 bits (928), Expect = 3e-97
 Identities = 227/536 (42%), Positives = 313/536 (58%), Gaps = 47/536 (8%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+C +NCVV+S+ FAGSLQ ER S L+  KLN +L LFE L+L  +E++++N D G S L
Sbjct: 91   MFCCSNCVVSSKAFAGSLQAERCSGLDLEKLNNILSLFENLNLEPAENLQKNEDFGLSDL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            KIQEK +   GEVSLE+W GPSNAIEGYVP+     + GL KNV +GSK  H    ++ N
Sbjct: 151  KIQEKTETSSGEVSLEQWAGPSNAIEGYVPKPRDHDSKGLRKNVKKGSKAGHGKPISDIN 210

Query: 1107 MILNEIDFSSIIITQDEYSISK-FPAPVNAVSSKE----AQTKTRNEV------RDDVSI 961
            +I +E+ F S II QD YS+SK  P   +A +  +    A  K   +V      +DD SI
Sbjct: 211  LISSEMGFVSTIIMQDGYSVSKVLPGQRDATAHHQIKPTAIVKQLGKVDAKVVRKDDGSI 270

Query: 960  LE---KHVDALQLRSGEETEK-------SDKNNRCFKVDKLNSGEVSSGHSQHDVKN--- 820
             +       +L L + E+ E+       + K++    + K +   VS    Q DV+    
Sbjct: 271  QDLSSSFKSSLILGTSEKEEELAQSCEAALKSSPDCAIKKKDVYSVSISERQCDVEQNDS 330

Query: 819  --KSAEV-----------------LNMSGAGRKYASD--GAQDXXXXXXXXXXXXXKRMA 703
              KS +V                 L+ +    K+  +  G                K+++
Sbjct: 331  AKKSVQVKGKMSRVTANDDASTSNLDPANVEEKFQVEKAGGSLNTKPKSSLKSAGEKKLS 390

Query: 702  RSVTWADENIDN-GAGNKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXX 526
            R+VTWAD+ I++ G+ +     +  +  N++     S D+  D+D+ R            
Sbjct: 391  RTVTWADKKINSTGSKDLCGFKNFGDIRNESDSAGNSIDVANDEDTLRRASAEACVIALS 450

Query: 525  XXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPS 349
                   SG SDV DAVS+AGI+I PPP +  E    E  ++L  +   +KWPRKPG+  
Sbjct: 451  SASEAVASGDSDVSDAVSEAGIIILPPPHDAGEEGTLEDVDILQNDSVTVKWPRKPGISE 510

Query: 348  FDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNG 169
             D FES+++W+D+ PEGF +TLSPFATM+N+LF+WI+S SLA+IYG DES  EEYLSVNG
Sbjct: 511  ADFFESDDSWFDAAPEGFSLTLSPFATMWNTLFSWITSSSLAYIYGRDESFQEEYLSVNG 570

Query: 168  REYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            REYP K+VL+DGRS+EIK+TLA CLARALP LVA LRLP+PVST+EQGM   L+TM
Sbjct: 571  REYPCKVVLADGRSSEIKQTLASCLARALPTLVAVLRLPIPVSTMEQGMACLLETM 626


>ref|XP_002321396.2| hypothetical protein POPTR_0015s01330g [Populus trichocarpa]
            gi|550321730|gb|EEF05523.2| hypothetical protein
            POPTR_0015s01330g [Populus trichocarpa]
          Length = 696

 Score =  355 bits (911), Expect = 3e-95
 Identities = 213/536 (39%), Positives = 298/536 (55%), Gaps = 47/536 (8%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            MYCS++CV+NSRTF+GSLQ+ER   LNPAKLNEVL LF+   L S   + +NGDLG S L
Sbjct: 91   MYCSSSCVINSRTFSGSLQEERCLVLNPAKLNEVLMLFDNFSLGSEGSLGKNGDLGFSNL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVN--------------- 1153
            KI+EK +   GEVS E+W+GPSNAIEGYVPQRDR+    ++ +++               
Sbjct: 151  KIEEKTEKVEGEVSFEQWIGPSNAIEGYVPQRDRLEEDFIIDDMDFTSSIITQDEYSISK 210

Query: 1152 --------------------------RGSKNKHAGIQNEKNMILNEIDFSS-IIITQDEY 1054
                                      +GSK K     +++   +N+++F+S IIITQDEY
Sbjct: 211  TPSGLTDTNTDKKTQKPKAKGSHKGSKGSKAKGTKQSSKQESFINDMNFTSTIIITQDEY 270

Query: 1053 SISKFPAPVNAVSSKEAQTKTRNEVRDDVSILEKHVDALQLRSGEETEKSDKNNRCFKV- 877
            SISK P+ +   +SK    K + +V    S  E    A +     +T +  K +R  KV 
Sbjct: 271  SISKSPSGLAGTTSKTKIQKQKEKVSQKSS--ENQSSATRKVGSSKTSRKVKEDRS-KVA 327

Query: 876  --DKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGRKYASDGAQDXXXXXXXXXXXXXKR-M 706
              D+L+S ++SS     D    S+  +      +  +   A+               + +
Sbjct: 328  IKDELSSQDLSS---PFDSCQTSSITITAEAKEKSVSEKAAKPVESSLKPSLKTSGAKQL 384

Query: 705  ARSVTWADENIDNGAGNKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXX 526
             RSVTWADE + +            ED     E   + D  +D    +F           
Sbjct: 385  TRSVTWADEKVGSSGSRDLCEVRGMEDTKAGPEIVDNIDKRDDGYVSKFESAEACAKALS 444

Query: 525  XXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPS 349
                   SG +D  +A+S+AG+VI P P ++D+    E  ++LD E + +KWP KPG+P 
Sbjct: 445  QAAEAVASGDADASNALSEAGLVILPQPHDLDQGDPMEDVDVLDEESSTIKWPGKPGIPQ 504

Query: 348  FDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNG 169
             + F+ E +WYD+PPEGF + LS FAT++ +LF W++S SLA++YG DES++EEYL VNG
Sbjct: 505  SECFDPENSWYDAPPEGFSLELSSFATIWMALFAWVTSSSLAYVYGKDESSHEEYLMVNG 564

Query: 168  REYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            REYPRKIVL DGRS EI++T+ GCL RA P +VADLRLP+P+STLEQG    L TM
Sbjct: 565  REYPRKIVLGDGRSFEIQQTIEGCLGRAFPVVVADLRLPIPISTLEQGAANLLGTM 620


>ref|XP_004497627.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cicer arietinum]
          Length = 666

 Score =  353 bits (906), Expect = 1e-94
 Identities = 219/517 (42%), Positives = 300/517 (58%), Gaps = 28/517 (5%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CS++CVVNS+ FAGSL+D+R   L+P KLN +L+LF   +L   E+  ++G+LG S L
Sbjct: 91   MFCSSSCVVNSKAFAGSLKDKRCLALDPQKLNNILRLFGNSNLEPMENSGKDGELGLSSL 150

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKN 1108
            +IQ+K +    EVSLE+W+GPSNAIEGYVP++    + G  KN  +GSK  H      KN
Sbjct: 151  RIQDKTETVT-EVSLEQWVGPSNAIEGYVPKKRDNGSKGSQKNTKKGSKASHGKSNGVKN 209

Query: 1107 MILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEV----------RDDVSIL 958
            +I +E DF S II QDEYS+SK       VSS +      +++          R D  ++
Sbjct: 210  LINSEFDFMSTIIMQDEYSVSK-------VSSGQTDATVDHQIKPTAILEQPKRVDHELV 262

Query: 957  EKHVDALQLRSG-------------EETEKSDKNNRCFKVDKL--NSGEVSSGHSQHDVK 823
             K  D   L S              +E  KS KN    K +++  N    +S     DV+
Sbjct: 263  RKDDDIQDLSSSFASSLNLSASKKDKEIAKSCKNVLKGKTNRVAANDDSSTSNFDPSDVE 322

Query: 822  NKSAEVLNMSGAGRKYASDGAQDXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTES 643
             K      +     K  S    +              ++ RSVTWAD+ ID G G+ T+ 
Sbjct: 323  EKIQIEKEIGSCHTKPKSSLKSNGKK-----------KLGRSVTWADKKID-GCGS-TDL 369

Query: 642  SSISEDENQAYER--SGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDAVSK 472
             +  E  N   E   + + D+ +D+D  R                   SG SD  DAVS+
Sbjct: 370  CAFKEFGNIKKESDVADNVDVVDDEDILRSVSAEACAIALSQAAEAVASGDSDAIDAVSE 429

Query: 471  AGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFH 292
            AGI+I P  +   E    +  ++L+T+   LKWPRKPG+  FD+F S+++W+D+PPEGF 
Sbjct: 430  AGIIILPHTENAVEESTVDDVDILETDSVTLKWPRKPGISDFDLFASDDSWFDAPPEGFS 489

Query: 291  MTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKK 112
            +TLSPFAT++N+ F+WI+S SLA+IYG D S  EE+LSV+GREYP KIVLSDGRS+EIK+
Sbjct: 490  LTLSPFATLWNAFFSWITSSSLAYIYGRDVSFYEEFLSVDGREYPCKIVLSDGRSSEIKQ 549

Query: 111  TLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            TLA CLARALP +VA+L+LP+PVSTLEQGM   LDTM
Sbjct: 550  TLASCLARALPAVVAELKLPMPVSTLEQGMVCLLDTM 586


>gb|EOY34549.1| F2P16.20-like protein isoform 5 [Theobroma cacao]
          Length = 708

 Score =  352 bits (903), Expect = 2e-94
 Identities = 218/520 (41%), Positives = 297/520 (57%), Gaps = 31/520 (5%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CSTNC++NSR FAGSLQ+ER S LN AKLN++L LF  L L  + D+ +NGDLG S L
Sbjct: 145  MFCSTNCLINSRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNL 203

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNR---GSKNKHAGIQN 1117
            +I+E  +VK  +VSL    GPSNAIEGYVPQR+ I  P   KN       S +   G + 
Sbjct: 204  RIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKK 260

Query: 1116 EKNMILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRDDVS-----ILEK 952
            E+  + NE+DF+  II  DEY ISK P         +  +K  + V +++      I+  
Sbjct: 261  EEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMND 320

Query: 951  HVDALQLRSGEETEKSDKNNRCFK---VDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGR 781
                 ++ SG +    D N +  +   + K +  +     S   ++ K + ++ +     
Sbjct: 321  EYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 780  KYAS---------------DGA--QDXXXXXXXXXXXXXKRMARSVTWADEN-IDN-GAG 658
             Y S               D A                 K++ R VTWAD+   DN G G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 657  NKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDA 481
            N  E   +   +  + E SGS +   DD+  RF                  SG SDV DA
Sbjct: 441  NLCEVKEMETMKGDS-EISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA 499

Query: 480  VSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPE 301
            V + G++I P   EVD+    E  +ML+ E AP+KWP+KPG+P  D+F  E++W+D+PPE
Sbjct: 500  VYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPE 559

Query: 300  GFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTE 121
            GF +TLS FATM+N+LF WI+S SLA+IYG DES +EEYLS+NGREYPRKI L DGRS+E
Sbjct: 560  GFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSE 619

Query: 120  IKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            IK+TLA C++RALP +V DLRLP+P+STLEQGMG  +DT+
Sbjct: 620  IKETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTI 659


>gb|EOY34546.1| F2P16.20-like protein isoform 2 [Theobroma cacao]
          Length = 679

 Score =  352 bits (903), Expect = 2e-94
 Identities = 218/520 (41%), Positives = 297/520 (57%), Gaps = 31/520 (5%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CSTNC++NSR FAGSLQ+ER S LN AKLN++L LF  L L  + D+ +NGDLG S L
Sbjct: 145  MFCSTNCLINSRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNL 203

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNR---GSKNKHAGIQN 1117
            +I+E  +VK  +VSL    GPSNAIEGYVPQR+ I  P   KN       S +   G + 
Sbjct: 204  RIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKK 260

Query: 1116 EKNMILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRDDVS-----ILEK 952
            E+  + NE+DF+  II  DEY ISK P         +  +K  + V +++      I+  
Sbjct: 261  EEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMND 320

Query: 951  HVDALQLRSGEETEKSDKNNRCFK---VDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGR 781
                 ++ SG +    D N +  +   + K +  +     S   ++ K + ++ +     
Sbjct: 321  EYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 780  KYAS---------------DGA--QDXXXXXXXXXXXXXKRMARSVTWADEN-IDN-GAG 658
             Y S               D A                 K++ R VTWAD+   DN G G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 657  NKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDA 481
            N  E   +   +  + E SGS +   DD+  RF                  SG SDV DA
Sbjct: 441  NLCEVKEMETMKGDS-EISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA 499

Query: 480  VSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPE 301
            V + G++I P   EVD+    E  +ML+ E AP+KWP+KPG+P  D+F  E++W+D+PPE
Sbjct: 500  VYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPE 559

Query: 300  GFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTE 121
            GF +TLS FATM+N+LF WI+S SLA+IYG DES +EEYLS+NGREYPRKI L DGRS+E
Sbjct: 560  GFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSE 619

Query: 120  IKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            IK+TLA C++RALP +V DLRLP+P+STLEQGMG  +DT+
Sbjct: 620  IKETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTI 659


>gb|EOY34545.1| F2P16.20 protein, putative isoform 1 [Theobroma cacao]
          Length = 739

 Score =  352 bits (903), Expect = 2e-94
 Identities = 218/520 (41%), Positives = 297/520 (57%), Gaps = 31/520 (5%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CSTNC++NSR FAGSLQ+ER S LN AKLN++L LF  L L  + D+ +NGDLG S L
Sbjct: 145  MFCSTNCLINSRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNL 203

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNR---GSKNKHAGIQN 1117
            +I+E  +VK  +VSL    GPSNAIEGYVPQR+ I  P   KN       S +   G + 
Sbjct: 204  RIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKK 260

Query: 1116 EKNMILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRDDVS-----ILEK 952
            E+  + NE+DF+  II  DEY ISK P         +  +K  + V +++      I+  
Sbjct: 261  EEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMND 320

Query: 951  HVDALQLRSGEETEKSDKNNRCFK---VDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGR 781
                 ++ SG +    D N +  +   + K +  +     S   ++ K + ++ +     
Sbjct: 321  EYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 780  KYAS---------------DGA--QDXXXXXXXXXXXXXKRMARSVTWADEN-IDN-GAG 658
             Y S               D A                 K++ R VTWAD+   DN G G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 657  NKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDA 481
            N  E   +   +  + E SGS +   DD+  RF                  SG SDV DA
Sbjct: 441  NLCEVKEMETMKGDS-EISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA 499

Query: 480  VSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPE 301
            V + G++I P   EVD+    E  +ML+ E AP+KWP+KPG+P  D+F  E++W+D+PPE
Sbjct: 500  VYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPE 559

Query: 300  GFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTE 121
            GF +TLS FATM+N+LF WI+S SLA+IYG DES +EEYLS+NGREYPRKI L DGRS+E
Sbjct: 560  GFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSE 619

Query: 120  IKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            IK+TLA C++RALP +V DLRLP+P+STLEQGMG  +DT+
Sbjct: 620  IKETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTI 659


>gb|EOY34548.1| F2P16.20 protein, putative isoform 4 [Theobroma cacao]
          Length = 607

 Score =  345 bits (886), Expect = 2e-92
 Identities = 215/513 (41%), Positives = 292/513 (56%), Gaps = 31/513 (6%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CSTNC++NSR FAGSLQ+ER S LN AKLN++L LF  L L  + D+ +NGDLG S L
Sbjct: 91   MFCSTNCLINSRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNL 149

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNR---GSKNKHAGIQN 1117
            +I+E  +VK  +VSL    GPSNAIEGYVPQR+ I  P   KN       S +   G + 
Sbjct: 150  RIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKK 206

Query: 1116 EKNMILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRDDVS-----ILEK 952
            E+  + NE+DF+  II  DEY ISK P         +  +K  + V +++      I+  
Sbjct: 207  EEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMND 266

Query: 951  HVDALQLRSGEETEKSDKNNRCFK---VDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGR 781
                 ++ SG +    D N +  +   + K +  +     S   ++ K + ++ +     
Sbjct: 267  EYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 326

Query: 780  KYAS---------------DGA--QDXXXXXXXXXXXXXKRMARSVTWADEN-IDN-GAG 658
             Y S               D A                 K++ R VTWAD+   DN G G
Sbjct: 327  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 386

Query: 657  NKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDA 481
            N  E   +   +  + E SGS +   DD+  RF                  SG SDV DA
Sbjct: 387  NLCEVKEMETMKGDS-EISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA 445

Query: 480  VSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPE 301
            V + G++I P   EVD+    E  +ML+ E AP+KWP+KPG+P  D+F  E++W+D+PPE
Sbjct: 446  VYENGLIILPSLCEVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPE 505

Query: 300  GFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTE 121
            GF +TLS FATM+N+LF WI+S SLA+IYG DES +EEYLS+NGREYPRKI L DGRS+E
Sbjct: 506  GFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSE 565

Query: 120  IKKTLAGCLARALPGLVADLRLPVPVSTLEQGM 22
            IK+TLA C++RALP +V DLRLP+P+STLEQGM
Sbjct: 566  IKETLASCISRALPAIVTDLRLPIPISTLEQGM 598


>gb|EXB67559.1| hypothetical protein L484_006008 [Morus notabilis]
          Length = 695

 Score =  340 bits (871), Expect = 1e-90
 Identities = 211/530 (39%), Positives = 294/530 (55%), Gaps = 41/530 (7%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLH-LHSSEDVKENGDLGSSK 1291
            MYCS++CV+NSRTFA SL+DER + L+ A+++ VL++FE    L       ++ DLG SK
Sbjct: 93   MYCSSDCVINSRTFAASLKDERCAVLDSARIDAVLRMFEDYSGLERELGFGKDRDLGFSK 152

Query: 1290 LKIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEK 1111
            LKI+EK +   G+VSLE+W GPSNAIEGYV QR+R       K    GSK+   G +   
Sbjct: 153  LKIEEKTENCVGDVSLEQWAGPSNAIEGYVLQRER-------KPKELGSKSPKRGSKANN 205

Query: 1110 NMILNEIDFSSIIITQDEYSISKFPAPVN------------------AVSSKEAQTKTRN 985
             +++N++DF S IIT+DEY++SK P+ +                   A+ ++ A  +T  
Sbjct: 206  TVLINDMDFVSTIITEDEYTVSKTPSSLKKTGLDSKVREQEEILAKKAMGNEFAVLETSY 265

Query: 984  EVRDDVS----ILEKHVDALQLRSGEETEKSDKNNRCFKVDKLNSGEVSSGHSQHDVKNK 817
                +VS    + E    +L+  S   + ++++ +   K +K     + S       K  
Sbjct: 266  APASNVSRVGLVFEDVTSSLRAGSCLSSARAEEESHDDKAEKCTEASIKSSLKPSRKKKL 325

Query: 816  S-----AEVLNMSGAGRKYAS---------DGAQDXXXXXXXXXXXXXKRMARSVTWADE 679
            S     A+    S  GRK            D +                +  +SV WADE
Sbjct: 326  SRTVTWADEKTDSSGGRKLCEIREIEDMKEDPSVVENKNGVSFTSSGKMKAGQSVIWADE 385

Query: 678  NIDNGAGNKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG 499
              D+            ED  +A +   + D  E+DD++RF                  S 
Sbjct: 386  KGDSSKSIDVCEVREIEDAKEAADMLCNADTGENDDTFRFASAEACARALDEASEAVASE 445

Query: 498  S-DVPDAVSKAGIVIFPPPQEVDEAIHQEKD---EMLDTEPAPLKWPRKPGVPSFDVFES 331
              +V DA+S+AGI+I P P+  DE    E+D   E  + E AP+KWP+KPG    D+F+ 
Sbjct: 446  ELEVNDAMSEAGIIILPRPENGDEGEPMEEDDDDETSEPEQAPIKWPKKPGSQHSDLFDP 505

Query: 330  EETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRK 151
            E++W+D+PPE F +TLSPFA M+N+LFTW +S +LA+IYG DES +EEY  VNGREYP K
Sbjct: 506  EDSWFDAPPEDFSLTLSPFAKMWNALFTWTTSSTLAYIYGRDESLHEEYAVVNGREYPEK 565

Query: 150  IVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            IV  DGRS+EIK+TLAG LARALPGLVADLRL  P+S+LEQGMG  LDTM
Sbjct: 566  IVFGDGRSSEIKQTLAGSLARALPGLVADLRLSTPISSLEQGMGRLLDTM 615


>gb|EOY34547.1| F2P16.20-like protein isoform 3, partial [Theobroma cacao]
          Length = 703

 Score =  338 bits (868), Expect = 3e-90
 Identities = 215/520 (41%), Positives = 291/520 (55%), Gaps = 31/520 (5%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKL 1288
            M+CSTNC++NSR FAGSLQ+ER S LN AKLN++L LF  L L  + D+ +NGDLG S L
Sbjct: 145  MFCSTNCLINSRAFAGSLQEERCSVLNHAKLNDILSLFGDLDLDDN-DLGKNGDLGFSNL 203

Query: 1287 KIQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNR---GSKNKHAGIQN 1117
            +I+E  +VK  +VSL    GPSNAIEGYVPQR+ I  P   KN       S +   G + 
Sbjct: 204  RIKENEEVKAEDVSLA---GPSNAIEGYVPQRELISKPTPPKNNKNKVFDSSSSKLGSKK 260

Query: 1116 EKNMILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRDDVS-----ILEK 952
            E+  + NE+DF+  II  DEY ISK P         +  +K  + V +++      I+  
Sbjct: 261  EEYFVNNELDFAGTIIMNDEYIISKKPGSFKQGDRTKLSSKKEDFVINEMDFTSEIIMND 320

Query: 951  HVDALQLRSGEETEKSDKNNRCFK---VDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGR 781
                 ++ SG +    D N +  +   + K +  +     S   ++ K + ++ +     
Sbjct: 321  EYTISKMPSGSKQSCFDSNLKEVEEKGICKDSEDKCVISGSSSALREKDSSIVELPSTKN 380

Query: 780  KYAS---------------DGA--QDXXXXXXXXXXXXXKRMARSVTWADEN-IDN-GAG 658
             Y S               D A                 K++ R VTWAD+   DN G G
Sbjct: 381  VYQSGLDTSSAEAEKETHADKAVTSSETVLKSSLKSAGAKKLNRFVTWADKKKADNAGNG 440

Query: 657  NKTESSSISEDENQAYERSGSTDMEEDDDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDA 481
            N  E   +   +  + E SGS +   DD+  RF                  SG SDV DA
Sbjct: 441  NLCEVKEMETMKGDS-EISGSAEDGGDDNMLRFVSAEACAMALSKAAEAVASGDSDVTDA 499

Query: 480  VSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPE 301
            V            EVD+    E  +ML+ E AP+KWP+KPG+P  D+F  E++W+D+PPE
Sbjct: 500  VC-----------EVDKEEPMEDGDMLEPETAPVKWPKKPGIPHSDMFNPEDSWFDAPPE 548

Query: 300  GFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTE 121
            GF +TLS FATM+N+LF WI+S SLA+IYG DES +EEYLS+NGREYPRKI L DGRS+E
Sbjct: 549  GFSLTLSTFATMWNALFEWITSSSLAYIYGRDESFHEEYLSINGREYPRKIALRDGRSSE 608

Query: 120  IKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDTM 1
            IK+TLA C++RALP +V DLRLP+P+STLEQGMG  +DT+
Sbjct: 609  IKETLASCISRALPAIVTDLRLPIPISTLEQGMGHLIDTI 648


>gb|EMJ09632.1| hypothetical protein PRUPE_ppa002134mg [Prunus persica]
          Length = 711

 Score =  324 bits (830), Expect = 7e-86
 Identities = 216/541 (39%), Positives = 292/541 (53%), Gaps = 52/541 (9%)
 Frame = -3

Query: 1467 MYCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSE-DVKENGDLGSSK 1291
            MYCS+ CV+ S+ FA SL +ER   L+  K+  +L+ F  +     E    E GDLG SK
Sbjct: 99   MYCSSRCVIESKAFAQSLGEERCDVLDFGKVERILRAFGDVGFDKGEVGFGEIGDLGISK 158

Query: 1290 LKIQEKMDVKGGEVSLEEW---------------MGPSNAIEGYVPQRDRIVNPGLLKNV 1156
            LKI+EK++   G++ +                  +GPSNAIEGYVPQ++RI  P   K  
Sbjct: 159  LKIEEKVETGIGDLGISRLKIEEKSETHIGDLGAVGPSNAIEGYVPQKERISKPLGSKKN 218

Query: 1155 NRGSKNKHAGIQNEKNMILNEIDFSSIIITQDEYSISKFPAPV----------------- 1027
              GSK K A + +  ++I NE+DF S IIT DEYS+SK P  V                 
Sbjct: 219  KEGSKGKDAKMSSGMDIIFNEMDFMSTIITSDEYSVSKIPPSVGEPDFETKFKKSKGKVG 278

Query: 1026 ---NAVSSKEAQTK---TRNEVRDDVSILE--KHVDALQLRSGEETEKSDKNNRCFKVDK 871
               N    K  Q+K    +N  +DDV I E     DA Q      T++  +     K ++
Sbjct: 279  LNKNDSVKKSRQSKGGKNKNVKKDDVCIREVPSTSDASQTVLNGSTKEEKEEFIVEKAEQ 338

Query: 870  LNSGEVSSGHSQHDVK--NKSA----EVLNMSGAGRKYASDGAQDXXXXXXXXXXXXXKR 709
                 + S       K  N+S     E+++ +G+   Y     +                
Sbjct: 339  SGEALLRSSLKPSGTKKLNRSVTWADEMIDSTGSRNLYEVREMEQIMEYSDAFSSMHKPS 398

Query: 708  MARSV----TWADENIDNGAGNKTESSSISE-DENQAYERSGSTDMEEDDDSYRFXXXXX 544
            +   V    TW DE ID+     T+S +I E  E Q  +  GS D++E++          
Sbjct: 399  VENKVGCSNTWFDEKIDS-----TKSKNICEVREVQDADVLGSLDLQENE--ILESAEAC 451

Query: 543  XXXXXXXXXXXXXSGSDVPDAVSKAGIVIFPPPQEVDEAIHQEKDEMLDTEPAPLKWPRK 364
                           SDV  AVS AGI+I P P  +DE    E  +ML++E APL WPRK
Sbjct: 452  AMALNQAAEAVASGESDVSGAVSGAGIIILPRPDGLDEEEPTEDVDMLESEQAPL-WPRK 510

Query: 363  PGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSLAFIYGHDESNNEEY 184
            PG+P  D+F+ E++W+D+PPEGF +TLSPFATM+NSLFTWI+S +LA+IYG DES +EE+
Sbjct: 511  PGIPCSDLFDPEDSWFDAPPEGFSVTLSPFATMWNSLFTWITSSTLAYIYGRDESFHEEF 570

Query: 183  LSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVPVSTLEQGMGLFLDT 4
            LSVNGREYP KIVL+ GRS+EIKKTL    ARALPG+V++LRLP P+S+LEQGMG  L+T
Sbjct: 571  LSVNGREYPPKIVLAGGRSSEIKKTLDESFARALPGVVSELRLPTPISSLEQGMGRMLNT 630

Query: 3    M 1
            M
Sbjct: 631  M 631


>gb|EPS64466.1| hypothetical protein M569_10314, partial [Genlisea aurea]
          Length = 597

 Score =  297 bits (760), Expect = 9e-78
 Identities = 187/495 (37%), Positives = 271/495 (54%), Gaps = 7/495 (1%)
 Frame = -3

Query: 1464 YCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKLK 1285
            +CS+ C++NSR F+  L DER+S L+P KLNEVLK F+G   +S+ ++  N DLG S+L+
Sbjct: 92   FCSSGCLINSRAFSIGLPDERTSDLDPIKLNEVLKRFDGFGANSTPNMGRNEDLGLSQLR 151

Query: 1284 IQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNRGSKNKHAGIQNEKNM 1105
            I EK +++ GEVS  EW+GPS+AI+GYVP+RDR  N  L     +G    H  +Q   ++
Sbjct: 152  IMEKENIEAGEVSSNEWIGPSDAIDGYVPRRDRNSNT-LSSKQKKGESRYHLSLQVLTSI 210

Query: 1104 ILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRNEVRDDVSILEKHVDALQLRS 925
              +++ F+S+II Q+EYSI+K   P ++  S E+  K   E  +DV   +    ++    
Sbjct: 211  FPSDMSFTSVIIDQNEYSIAKTTTPSSSKQSGESNEKVIPE--EDVRPKQSPDSSVANIK 268

Query: 924  GEETEKSDKNNRCFKVD-KLNSGEVSSGHSQHDVK----NKSAE--VLNMSGAGRKYASD 766
            G       K N   K+D KL++ E  +  +  + K    +KSA+   +  S     Y+ +
Sbjct: 269  GSGFRNPSKRNGRAKIDAKLSASEDKASENGGEPKLADGDKSAQGAAVLKSSLKTSYSKE 328

Query: 765  GAQDXXXXXXXXXXXXXKRMARSVTWADENIDNGAGNKTESSSISEDENQAYERSGSTDM 586
                                 R+V+WAD   ++G   +T    +++       R  S+  
Sbjct: 329  TT------------------TRTVSWADVKAEDGQNLETVCE-MNDPHGGGISRETSSVE 369

Query: 585  EEDDDSYRFXXXXXXXXXXXXXXXXXXSGSDVPDAVSKAGIVIFPPPQEVDEAIHQEKDE 406
                 S +                         DA  K  +  F   +   EAI      
Sbjct: 370  SHKTASTKASK----------------------DAPGKFLLTDFNEGEIFTEAI------ 401

Query: 405  MLDTEPAPLKWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSLFTWISSLSL 226
                    LKWP KPG    D+ ES++T YD PP+GF+++LSPF T+FNSLF+WISS SL
Sbjct: 402  --------LKWPPKPGFSEADLVESDDTLYDRPPDGFNLSLSPFCTLFNSLFSWISSSSL 453

Query: 225  AFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGLVADLRLPVP 46
            A+IYG D+S +EEY++ NGREYP K+V  DGRS+EIK+TL+  LARALPG+V++LRLP P
Sbjct: 454  AYIYGKDDSFHEEYVNANGREYPCKVVAEDGRSSEIKQTLSAALARALPGVVSELRLPTP 513

Query: 45   VSTLEQGMGLFLDTM 1
            +S LEQGMG  LDTM
Sbjct: 514  ISILEQGMGRLLDTM 528


>ref|XP_004152151.1| PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase
            RPAP2 homolog [Cucumis sativus]
          Length = 662

 Score =  295 bits (754), Expect = 4e-77
 Identities = 196/504 (38%), Positives = 274/504 (54%), Gaps = 16/504 (3%)
 Frame = -3

Query: 1464 YCSTNCVVNSRTFAGSLQDERSSTLNPAKLNEVLKLFEGLHLHSSEDVKENGDLGSSKLK 1285
            YCS+ C++NSR F+G LQDER S +NP KL E+LKLFE + L S E++  N D G   L+
Sbjct: 92   YCSSACLINSRAFSGRLQDERCSVMNPDKLKEILKLFENMSLDSKENMGNNCDSG---LE 148

Query: 1284 IQEKMDVKGGEVSLEEWMGPSNAIEGYVPQRDRIVNPGLLKNVNR---GSKNKHAGIQNE 1114
            IQEK++   GEV +EEWMGPSNAIEGYVP RD  V     K+      GSK K   +   
Sbjct: 149  IQEKIESNIGEVPIEEWMGPSNAIEGYVPHRDHKVMTLHSKDGKESKDGSKAKIKPLGGG 208

Query: 1113 KNMILNEIDFSSIIITQDEYSISKFPAPVNAVSSKEAQTKTRN--------EVRDDVSIL 958
            K+   ++   +S IIT +EYS+SK  + +  ++     T ++N        E  D  +IL
Sbjct: 209  KDFF-SDFSITSTIITDEEYSVSKISSGLKEMA---LDTNSKNQTGEFCGKESNDQFAIL 264

Query: 957  EK-HVDALQLRSGEETEKSDKNNRCFKVDKLNSGEVSSGHSQHDVKNKSAEVLNMSGAGR 781
            E  H  A    S     +  K        K ++  +S   S    KN+S     M+   R
Sbjct: 265  ETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNLSDAPSTS--KNRSTNFNLMTEEPR 322

Query: 780  KYASDGAQDXXXXXXXXXXXXXKRMARSVTWADENIDNGA-GNKTESSSISEDENQAYER 604
               +D                 K + RSVTWADE  D+ +  N  E   + + +  +   
Sbjct: 323  GGFND--LSGTELKSSLKKPGKKNLCRSVTWADEKTDDASIMNLPEVGEMGKTKECSRTT 380

Query: 603  SGSTDMEED-DDSYRFXXXXXXXXXXXXXXXXXXSG-SDVPDAVSKAGIVIFPPPQEVDE 430
            S   + + D +D  R                   SG S+V DAVS+AGI+I P P + +E
Sbjct: 381  SNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQSEVSDAVSEAGIIILPHPSDANE 440

Query: 429  AIHQEKDEMLDTEPAPL-KWPRKPGVPSFDVFESEETWYDSPPEGFHMTLSPFATMFNSL 253
                  D +  +EP    +   K GV   D+F+  ++WYD+PPEGF +TLS FATM+ ++
Sbjct: 441  --EASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAI 498

Query: 252  FTWISSLSLAFIYGHDESNNEEYLSVNGREYPRKIVLSDGRSTEIKKTLAGCLARALPGL 73
            F W++S SLA+IYG D+  +EE+L ++G+EYP KIV +DGRS+EIK+TLAGCL RA+PGL
Sbjct: 499  FAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRSSEIKQTLAGCLTRAIPGL 558

Query: 72   VADLRLPVPVSTLEQGMGLFLDTM 1
             ++L L  P+S LE GM   LDTM
Sbjct: 559  ASELNLSTPISRLENGMAHLLDTM 582

