BLASTX nr result

ID: Mentha28_contig00032162 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00032162
         (1458 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21515.1| hypothetical protein MIMGU_mgv1a000493mg [Mimulus...   647   0.0  
ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair ...   537   e-150
ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair ...   534   e-149
emb|CBI36057.3| unnamed protein product [Vitis vinifera]              501   e-139
gb|EPS68498.1| hypothetical protein M569_06270, partial [Genlise...   501   e-139
ref|XP_007024314.1| MMS19 nucleotide excision repair protein, pu...   492   e-136
ref|XP_007024313.1| MMS19 nucleotide excision repair protein, pu...   492   e-136
ref|XP_007024312.1| MMS19 nucleotide excision repair protein, pu...   492   e-136
ref|XP_007024310.1| MMS19 nucleotide excision repair protein, pu...   492   e-136
ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair ...   490   e-136
ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair ...   490   e-136
ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citr...   487   e-135
ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prun...   479   e-132
ref|XP_002515963.1| DNA repair/transcription protein met18/mms19...   472   e-130
ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair ...   470   e-130
ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304...   444   e-122
gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis]     433   e-119
ref|XP_004486785.1| PREDICTED: uncharacterized protein LOC101495...   432   e-118
ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phas...   432   e-118
ref|XP_006597169.1| PREDICTED: MMS19 nucleotide excision repair ...   430   e-118

>gb|EYU21515.1| hypothetical protein MIMGU_mgv1a000493mg [Mimulus guttatus]
          Length = 1120

 Score =  647 bits (1669), Expect = 0.0
 Identities = 340/484 (70%), Positives = 379/484 (78%)
 Frame = +3

Query: 3    SVQWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSR 182
            SVQ IKH+ELYV+S+A PSQQ A VDAVAAL+K  + TL+ LVREMEMYLTTTDSI+RSR
Sbjct: 4    SVQLIKHVELYVNSSATPSQQVASVDAVAALLKNDLLTLDALVREMEMYLTTTDSIVRSR 63

Query: 183  GIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVT 362
            G                         GFFTERLADWKALRGAIVGCLALLRRK DVG VT
Sbjct: 64   GTLLLAEILEQLTSKPLNSTSIHSLIGFFTERLADWKALRGAIVGCLALLRRKVDVGIVT 123

Query: 363  SSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDG 542
             SEAKAVA+SYLQNLQVQSLGQHDR LSFQLM+CLLDRYPGAI DLGD LVYGICEAIDG
Sbjct: 124  DSEAKAVAQSYLQNLQVQSLGQHDRKLSFQLMDCLLDRYPGAIRDLGDNLVYGICEAIDG 183

Query: 543  EKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANK 722
            EKDPQCLL VF IVE LA+LY   +GPLAN+AEDLFEILGSYFPI FTHPKGEDDD   +
Sbjct: 184  EKDPQCLLLVFHIVESLARLY---TGPLANFAEDLFEILGSYFPIHFTHPKGEDDD-VKR 239

Query: 723  EKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHA 902
            E+LSRALMLAFAST LFEPFSIPLLLEKLSS LPSAKVESF+YLSYC+ KYGPERM  HA
Sbjct: 240  EELSRALMLAFASTHLFEPFSIPLLLEKLSSSLPSAKVESFKYLSYCSTKYGPERMVKHA 299

Query: 903  EALWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQCGDFISLILG 1082
            EALWSSVKD TYISP  T + ESE +GGM FQDS++M  AF+LLQEV +Q  DF+SL++ 
Sbjct: 300  EALWSSVKDVTYISPSSTPSTESESMGGMSFQDSEIMRHAFVLLQEVTRQHADFVSLVIA 359

Query: 1083 DNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMEC 1262
            DNDI+VF+NSLNQY+E D IP L KQ+LH +GH+L TCAK S  LCNKVF  FFPLLM+ 
Sbjct: 360  DNDIHVFINSLNQYKEFDDIPVLVKQKLHALGHILSTCAKPSVELCNKVFEGFFPLLMDG 419

Query: 1263 LGLSVAKSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTWST 1442
             GLS AK S+         V+  F AIYLC EL+AA R + +SLD+     DFS+QTW  
Sbjct: 420  FGLSAAKPSDN--------VECKFGAIYLCTELLAASRYLTLSLDNCTLDPDFSRQTWHV 471

Query: 1443 MLSN 1454
            MLSN
Sbjct: 472  MLSN 475


>ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum
            tuberosum]
          Length = 1170

 Score =  537 bits (1383), Expect = e-150
 Identities = 275/483 (56%), Positives = 350/483 (72%), Gaps = 1/483 (0%)
 Frame = +3

Query: 3    SVQWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSR 182
            ++Q++ HIE YVSS+++ +QQAA VDA+A L+K  + +LETLVREMEMYLTTTD+IIRSR
Sbjct: 6    AIQYVIHIESYVSSSSSEAQQAASVDAIAVLLKNDLLSLETLVREMEMYLTTTDNIIRSR 65

Query: 183  GIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVT 362
            GI                         FFTERLADWKAL GA+VGCLALLRRK   G + 
Sbjct: 66   GILLLGELLMRLMSKPLGDTAISSLIEFFTERLADWKALHGALVGCLALLRRKTGTGMIN 125

Query: 363  SSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDG 542
             S+AKAVAESYL+ LQVQSLGQ DR L  Q++ECLLDRY  A+  LGD LVYGICEAIDG
Sbjct: 126  RSQAKAVAESYLKTLQVQSLGQQDRKLCLQILECLLDRYRDALFSLGDDLVYGICEAIDG 185

Query: 543  EKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANK 722
            EKDPQCL+ +F IVE LAQL+P+ SGPL N+A DLFEIL  YFPI FTHPK  DD D  +
Sbjct: 186  EKDPQCLMLIFHIVELLAQLFPEASGPLENFAGDLFEILECYFPIHFTHPK-SDDVDMKR 244

Query: 723  EKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHA 902
             +LSRALMLAFASTPL+EP  IPLLL+KLSS LPSAKVES +YLSYCT+KYG +RM  + 
Sbjct: 245  GELSRALMLAFASTPLYEPSVIPLLLDKLSSSLPSAKVESLKYLSYCTLKYGGDRMEKYT 304

Query: 903  EALWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQCGD-FISLIL 1079
            ++LWS++KDA +  PQ TL+++S+ + G+GF +S++M QA  LLQ +++Q  D F+SLIL
Sbjct: 305  KSLWSALKDALFTCPQSTLSEDSDPIDGLGFHESEIMTQALELLQVLVRQHNDSFLSLIL 364

Query: 1080 GDNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLME 1259
            GD DI+ F+NS +Q+++ + + +  KQRLH VGH+L  C K+S + CNKVF SFFP L++
Sbjct: 365  GDGDISTFLNSFSQFDDFNSLSTQYKQRLHAVGHVLSVCIKASGSSCNKVFESFFPRLVD 424

Query: 1260 CLGLSVAKSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTWS 1439
             L LSV  S    +        +NF A+YLC+EL+AACR + VS D      D ++ +W 
Sbjct: 425  ALRLSVENSHG--IVHSALDANFNFGALYLCVELLAACRQLVVSSDEVASAHDLARDSWC 482

Query: 1440 TML 1448
             +L
Sbjct: 483  QIL 485


>ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum
            lycopersicum]
          Length = 1153

 Score =  534 bits (1375), Expect = e-149
 Identities = 273/479 (56%), Positives = 348/479 (72%), Gaps = 1/479 (0%)
 Frame = +3

Query: 15   IKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGIXX 194
            ++ IE YVSS+++ +QQAA +DA+A L+K  + +LETLVREMEMYLTTTD+IIRSRGI  
Sbjct: 23   VRIIESYVSSSSSEAQQAASIDAIALLLKNDLLSLETLVREMEMYLTTTDNIIRSRGILL 82

Query: 195  XXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSSEA 374
                                   FFTERLADWKAL GA+VGCLALLRRK  VG ++ S+A
Sbjct: 83   LGELLMRLMSKPLGDTAISSLMEFFTERLADWKALHGALVGCLALLRRKTGVGMISRSQA 142

Query: 375  KAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEKDP 554
            KAVAESYL+ LQVQSLGQHDR L  Q++ECLLDRY  A+  LGD LVYGICEAIDGEKDP
Sbjct: 143  KAVAESYLKTLQVQSLGQHDRKLCLQILECLLDRYRDALFSLGDDLVYGICEAIDGEKDP 202

Query: 555  QCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEKLS 734
            QCL+ +F IVE LAQL+P+ SGPL N+A DLFEIL  YFPI FTHPK  DD D  +E+LS
Sbjct: 203  QCLMLIFHIVELLAQLFPEASGPLENFAGDLFEILECYFPIHFTHPK-SDDVDIKREELS 261

Query: 735  RALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEALW 914
            RALMLAFASTPLFEP  IPLLL+KLSS LPSAKVES +YLS+CT+KYG +RM  + ++LW
Sbjct: 262  RALMLAFASTPLFEPSVIPLLLDKLSSSLPSAKVESLKYLSFCTLKYGGDRMEKYTKSLW 321

Query: 915  SSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQ-CGDFISLILGDND 1091
            S++KDA + SPQ TL+++S+ + G+GF +S++M QA   LQ +++Q    F+SLI+GD D
Sbjct: 322  SALKDALFTSPQSTLSEDSDPIDGLGFHESEIMTQALEFLQVLVRQHNASFLSLIMGDGD 381

Query: 1092 INVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECLGL 1271
            I+ F+NS +Q++  + + +  KQRLH VGH+L  C K+S + CNKVF SFFP L++ L L
Sbjct: 382  ISTFLNSFSQFDNFNSLSTQYKQRLHAVGHVLSVCIKASASSCNKVFESFFPRLVDALRL 441

Query: 1272 SVAKSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTWSTML 1448
            SV  S    +        +NF A+YLC+EL+AACR + VS D      D ++ +W  +L
Sbjct: 442  SVDNSHG--IVHSAVDANFNFGALYLCVELLAACRQLVVSSDEVASAHDLARDSWCQIL 498


>emb|CBI36057.3| unnamed protein product [Vitis vinifera]
          Length = 1146

 Score =  501 bits (1291), Expect = e-139
 Identities = 263/489 (53%), Positives = 343/489 (70%), Gaps = 9/489 (1%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q  ++IE YV S+ + +QQAA VDA+A L+K  + TLETLV EM MYLTTTD+IIR+RGI
Sbjct: 6    QLTQYIESYVDSSRSSTQQAASVDAIAYLLKNDILTLETLVTEMGMYLTTTDNIIRTRGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADW+ALRGA++GCLAL++RK ++G VT +
Sbjct: 66   LLLAELLTRLASKPLDNVTIHSLISFFTDRLADWRALRGALIGCLALMKRKSNMGRVTDN 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +A+AVA++YL+N+QVQSLGQHDR L F+++ECLLD YP ++  LGD LVYGIC AIDGEK
Sbjct: 126  DARAVAQAYLENVQVQSLGQHDRKLCFEILECLLDHYPESVASLGDDLVYGICGAIDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP+CL+  F IVE LA+L+PDPSGPLA++A DLF+ILG YFPI FTHP+GE D D  ++ 
Sbjct: 186  DPRCLMLTFHIVEILARLFPDPSGPLASFAGDLFDILGCYFPIHFTHPQGE-DVDVKRDD 244

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            LSRALMLAF+ST LFEPF+IPLLLEKLSS LP AKV+S +YLS C +KYG +RM  H EA
Sbjct: 245  LSRALMLAFSSTTLFEPFAIPLLLEKLSSSLPLAKVDSLKYLSNCLLKYGDDRMTKHVEA 304

Query: 909  LWSSVKDATYISPQ-CTLTKESELLGGMGFQDSDVMMQAFILLQEVI-QQCGDFISLILG 1082
            +W SVKDA + S Q   L+  SELL  +GFQ+++++ +A ILLQ+VI +  G  +SLI+G
Sbjct: 305  IWFSVKDAIFCSEQEPMLSLASELLDHVGFQENEIVTEAIILLQKVILENSGLSLSLIVG 364

Query: 1083 DNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMEC 1262
            D DIN  +N++  +   + IP   K +L  +G +L+  AK+S   CN+VF SFF  LM+ 
Sbjct: 365  DKDINTIVNTVTSFRSYNDIPLQSKHKLCAIGRILYVSAKASITCCNRVFESFFFRLMDT 424

Query: 1263 LGLSVAKSSNGHLDEDCPP-------VKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDF 1421
            LGLSV  SS      DC P        + NF A+YLCIEL+AACRD+ V  +        
Sbjct: 425  LGLSVRNSSG-----DCLPNFDYVFSERLNFGALYLCIELLAACRDLVVGSEELTSKSVS 479

Query: 1422 SQQTWSTML 1448
            +Q++W  ML
Sbjct: 480  AQESWCCML 488


>gb|EPS68498.1| hypothetical protein M569_06270, partial [Genlisea aurea]
          Length = 970

 Score =  501 bits (1290), Expect = e-139
 Identities = 251/397 (63%), Positives = 307/397 (77%), Gaps = 2/397 (0%)
 Frame = +3

Query: 264  FFTERLADWKALRGAIVGCLALLRRKDDVGSVTSSEAKAVAESYLQNLQVQSLGQHDRVL 443
            FF ERLADWKALRGA+VGCLALLRRK DVG ++ SEAKA+A+SY+Q+LQVQ+LGQHDR L
Sbjct: 28   FFAERLADWKALRGALVGCLALLRRKADVGGISGSEAKAIAQSYIQHLQVQALGQHDRKL 87

Query: 444  SFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEKDPQCLLPVFRIVECLAQLYPDPSGP 623
            S QL+ECLLD Y  A+ DLGD LVYGIC AIDGEKDPQCLL VF IVE L +LY   SGP
Sbjct: 88   SLQLLECLLDCYFSAVADLGDNLVYGICGAIDGEKDPQCLLIVFSIVEILGRLYSGSSGP 147

Query: 624  LANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEKLSRALMLAFASTPLFEPFSIPLLLE 803
            L NYAE+LFE++GSYFPI FTHPKG D+DD  +++LSRALM+AFASTPLFEPFSIPLLLE
Sbjct: 148  LVNYAEELFEVIGSYFPIHFTHPKG-DEDDRKRQELSRALMMAFASTPLFEPFSIPLLLE 206

Query: 804  KLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEALWSSVKDATYISPQCTLTKESELLG 983
            K SS LPSAK+ES RYL YC++KYG +RMA H+EALWSSVKD  Y SP  TL+ ES+   
Sbjct: 207  KFSSTLPSAKLESIRYLCYCSVKYGQDRMAKHSEALWSSVKDTVYFSPDSTLSMESQ-SD 265

Query: 984  GMGFQDSDVMMQAFILLQEVIQQCGDFISLILGDNDINVFMNSLNQYEEMDQIPSLEKQR 1163
             + F++SD+M+QAF LL+E+  Q GDFI+L++ D D+NVF+NSLNQY E D IP   KQR
Sbjct: 266  ALNFRESDIMIQAFALLREINLQNGDFINLVIQDGDMNVFLNSLNQYREFDDIPLKVKQR 325

Query: 1164 LHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECLGLSVAK-SSNGHLDEDC-PPVKYNFA 1337
            LH VG +   CA++S A C+KVF  FFPLLM+ LG S  K   + H DE C   +K NF 
Sbjct: 326  LHSVGRIFSACAETSAASCSKVFERFFPLLMDGLGFSAGKLLQDNHPDEACASSIKLNFG 385

Query: 1338 AIYLCIELIAACRDVAVSLDSSKGVLDFSQQTWSTML 1448
            A+YLC++L+ A R + +S D++  V + +   W +ML
Sbjct: 386  ALYLCVKLLTASRYLILSTDNTPAVSNLAHHVWFSML 422


>ref|XP_007024314.1| MMS19 nucleotide excision repair protein, putative isoform 5
            [Theobroma cacao] gi|508779680|gb|EOY26936.1| MMS19
            nucleotide excision repair protein, putative isoform 5
            [Theobroma cacao]
          Length = 1157

 Score =  492 bits (1267), Expect = e-136
 Identities = 258/484 (53%), Positives = 336/484 (69%), Gaps = 4/484 (0%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+ IE +V S  +P+QQAA +D +A+L+K    T+ETLVREME YLTT D+IIR+RGI
Sbjct: 6    QLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNIIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADW+ALRGA+VGCLALLRRK   G V+ +
Sbjct: 66   LLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGGIVSET 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVAESYLQNLQVQSLG++DR L F+L+ CLL+RYP AI  LGD L+YGICEA+DGEK
Sbjct: 126  DAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEAVDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+ +F I+E L QL+PDP GP  ++A DLFE L  YFP+ FTHPKGE D +  ++ 
Sbjct: 186  DPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGE-DVNIKRDD 244

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            L+RALMLAF+STPLFEPF+IPLL+EKLSS LPSAKV+S RYLS CT+KYG +RMA H EA
Sbjct: 245  LARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEA 304

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFISLILGD 1085
            LWSS+KDA + S    L+   E L G+   ++++  +A  LLQ+ ++Q    F+ LI+ D
Sbjct: 305  LWSSLKDAVFTSLDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVD 364

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+  N ++ Y+    IP+  KQRLH VG +L    K+S A CN+VF  FF  LM+ L
Sbjct: 365  EDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFSRLMDIL 424

Query: 1266 GLSVAKSSNGHLDED---CPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTW 1436
            GL V ++S+G+L  D     P +YN  A+YL IEL++ACRDV  S ++       +++TW
Sbjct: 425  GLCV-RNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETW 483

Query: 1437 STML 1448
            S +L
Sbjct: 484  SYLL 487


>ref|XP_007024313.1| MMS19 nucleotide excision repair protein, putative isoform 4
            [Theobroma cacao] gi|508779679|gb|EOY26935.1| MMS19
            nucleotide excision repair protein, putative isoform 4
            [Theobroma cacao]
          Length = 1136

 Score =  492 bits (1267), Expect = e-136
 Identities = 258/484 (53%), Positives = 336/484 (69%), Gaps = 4/484 (0%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+ IE +V S  +P+QQAA +D +A+L+K    T+ETLVREME YLTT D+IIR+RGI
Sbjct: 6    QLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNIIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADW+ALRGA+VGCLALLRRK   G V+ +
Sbjct: 66   LLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGGIVSET 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVAESYLQNLQVQSLG++DR L F+L+ CLL+RYP AI  LGD L+YGICEA+DGEK
Sbjct: 126  DAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEAVDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+ +F I+E L QL+PDP GP  ++A DLFE L  YFP+ FTHPKGE D +  ++ 
Sbjct: 186  DPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGE-DVNIKRDD 244

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            L+RALMLAF+STPLFEPF+IPLL+EKLSS LPSAKV+S RYLS CT+KYG +RMA H EA
Sbjct: 245  LARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEA 304

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFISLILGD 1085
            LWSS+KDA + S    L+   E L G+   ++++  +A  LLQ+ ++Q    F+ LI+ D
Sbjct: 305  LWSSLKDAVFTSLDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVD 364

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+  N ++ Y+    IP+  KQRLH VG +L    K+S A CN+VF  FF  LM+ L
Sbjct: 365  EDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFSRLMDIL 424

Query: 1266 GLSVAKSSNGHLDED---CPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTW 1436
            GL V ++S+G+L  D     P +YN  A+YL IEL++ACRDV  S ++       +++TW
Sbjct: 425  GLCV-RNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETW 483

Query: 1437 STML 1448
            S +L
Sbjct: 484  SYLL 487


>ref|XP_007024312.1| MMS19 nucleotide excision repair protein, putative isoform 3
            [Theobroma cacao] gi|508779678|gb|EOY26934.1| MMS19
            nucleotide excision repair protein, putative isoform 3
            [Theobroma cacao]
          Length = 1062

 Score =  492 bits (1267), Expect = e-136
 Identities = 258/484 (53%), Positives = 336/484 (69%), Gaps = 4/484 (0%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+ IE +V S  +P+QQAA +D +A+L+K    T+ETLVREME YLTT D+IIR+RGI
Sbjct: 6    QLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNIIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADW+ALRGA+VGCLALLRRK   G V+ +
Sbjct: 66   LLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGGIVSET 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVAESYLQNLQVQSLG++DR L F+L+ CLL+RYP AI  LGD L+YGICEA+DGEK
Sbjct: 126  DAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEAVDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+ +F I+E L QL+PDP GP  ++A DLFE L  YFP+ FTHPKGE D +  ++ 
Sbjct: 186  DPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGE-DVNIKRDD 244

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            L+RALMLAF+STPLFEPF+IPLL+EKLSS LPSAKV+S RYLS CT+KYG +RMA H EA
Sbjct: 245  LARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEA 304

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFISLILGD 1085
            LWSS+KDA + S    L+   E L G+   ++++  +A  LLQ+ ++Q    F+ LI+ D
Sbjct: 305  LWSSLKDAVFTSLDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVD 364

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+  N ++ Y+    IP+  KQRLH VG +L    K+S A CN+VF  FF  LM+ L
Sbjct: 365  EDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFSRLMDIL 424

Query: 1266 GLSVAKSSNGHLDED---CPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTW 1436
            GL V ++S+G+L  D     P +YN  A+YL IEL++ACRDV  S ++       +++TW
Sbjct: 425  GLCV-RNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETW 483

Query: 1437 STML 1448
            S +L
Sbjct: 484  SYLL 487


>ref|XP_007024310.1| MMS19 nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao] gi|590619491|ref|XP_007024311.1| MMS19
            nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao] gi|508779676|gb|EOY26932.1| MMS19
            nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao] gi|508779677|gb|EOY26933.1| MMS19
            nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao]
          Length = 1149

 Score =  492 bits (1267), Expect = e-136
 Identities = 258/484 (53%), Positives = 336/484 (69%), Gaps = 4/484 (0%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+ IE +V S  +P+QQAA +D +A+L+K    T+ETLVREME YLTT D+IIR+RGI
Sbjct: 6    QLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNIIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADW+ALRGA+VGCLALLRRK   G V+ +
Sbjct: 66   LLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGGIVSET 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVAESYLQNLQVQSLG++DR L F+L+ CLL+RYP AI  LGD L+YGICEA+DGEK
Sbjct: 126  DAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEAVDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+ +F I+E L QL+PDP GP  ++A DLFE L  YFP+ FTHPKGE D +  ++ 
Sbjct: 186  DPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGE-DVNIKRDD 244

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            L+RALMLAF+STPLFEPF+IPLL+EKLSS LPSAKV+S RYLS CT+KYG +RMA H EA
Sbjct: 245  LARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEA 304

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFISLILGD 1085
            LWSS+KDA + S    L+   E L G+   ++++  +A  LLQ+ ++Q    F+ LI+ D
Sbjct: 305  LWSSLKDAVFTSLDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVD 364

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+  N ++ Y+    IP+  KQRLH VG +L    K+S A CN+VF  FF  LM+ L
Sbjct: 365  EDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFSRLMDIL 424

Query: 1266 GLSVAKSSNGHLDED---CPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTW 1436
            GL V ++S+G+L  D     P +YN  A+YL IEL++ACRDV  S ++       +++TW
Sbjct: 425  GLCV-RNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETW 483

Query: 1437 STML 1448
            S +L
Sbjct: 484  SYLL 487


>ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X2 [Citrus sinensis]
          Length = 1151

 Score =  490 bits (1261), Expect = e-136
 Identities = 260/490 (53%), Positives = 336/490 (68%), Gaps = 8/490 (1%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+HIE +V+ +++P+ QAA +D +A+L+KK V T+ETLVREM MYLTTTD +IR+RGI
Sbjct: 6    QLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADWKALRGA+VGCLALLRRK   G +T++
Sbjct: 66   LLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGGVITTN 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVA+SY+QNLQVQSL QHDR L F+L+ECLL RYP A+  LG+ L+Y ICEAIDGEK
Sbjct: 126  DAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEAIDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+  F IVE  A+L+ D    LAN+A DLFEILG YFPI FTH K E D D  ++ 
Sbjct: 186  DPHCLMLTFHIVEVAAELFSDDL--LANFASDLFEILGCYFPIHFTHSKAE-DFDVKRDD 242

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            LSRALM AF+ST LFEPF+IPLLLEKLSS L SAKV+S +YLS+CT+KYG +R+  HA+A
Sbjct: 243  LSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEKHAKA 302

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQ-CGDFISLILGD 1085
            +WSS+KDA Y S + TL+  SE L G+GF+++ ++ ++  LL  V +Q  G F+S I+GD
Sbjct: 303  MWSSIKDAVYSSHEPTLSFASESLDGVGFRENVILTESLNLLDTVFKQNSGLFLSWIIGD 362

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+   S++ Y+   +I    KQ+LH VG +L   AK+SPA CN V  SFFP LM  L
Sbjct: 363  EDINLIFKSISSYKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFPCLMHAL 422

Query: 1266 GLSVAKSSNGHLDEDCPP-------VKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFS 1424
            GLSV  S+     +DC P        K N  A+YLCIEL+ ACR++  S +  K V   +
Sbjct: 423  GLSVGNST-----QDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAPA 477

Query: 1425 QQTWSTMLSN 1454
             + W  +L +
Sbjct: 478  NERWYCLLQS 487


>ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X1 [Citrus sinensis]
          Length = 1155

 Score =  490 bits (1261), Expect = e-136
 Identities = 260/490 (53%), Positives = 336/490 (68%), Gaps = 8/490 (1%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+HIE +V+ +++P+ QAA +D +A+L+KK V T+ETLVREM MYLTTTD +IR+RGI
Sbjct: 6    QLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADWKALRGA+VGCLALLRRK   G +T++
Sbjct: 66   LLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGGVITTN 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVA+SY+QNLQVQSL QHDR L F+L+ECLL RYP A+  LG+ L+Y ICEAIDGEK
Sbjct: 126  DAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEAIDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+  F IVE  A+L+ D    LAN+A DLFEILG YFPI FTH K E D D  ++ 
Sbjct: 186  DPHCLMLTFHIVEVAAELFSDDL--LANFASDLFEILGCYFPIHFTHSKAE-DFDVKRDD 242

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            LSRALM AF+ST LFEPF+IPLLLEKLSS L SAKV+S +YLS+CT+KYG +R+  HA+A
Sbjct: 243  LSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEKHAKA 302

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQ-CGDFISLILGD 1085
            +WSS+KDA Y S + TL+  SE L G+GF+++ ++ ++  LL  V +Q  G F+S I+GD
Sbjct: 303  MWSSIKDAVYSSHEPTLSFASESLDGVGFRENVILTESLNLLDTVFKQNSGLFLSWIIGD 362

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+   S++ Y+   +I    KQ+LH VG +L   AK+SPA CN V  SFFP LM  L
Sbjct: 363  EDINLIFKSISSYKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFPCLMHAL 422

Query: 1266 GLSVAKSSNGHLDEDCPP-------VKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFS 1424
            GLSV  S+     +DC P        K N  A+YLCIEL+ ACR++  S +  K V   +
Sbjct: 423  GLSVGNST-----QDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAPA 477

Query: 1425 QQTWSTMLSN 1454
             + W  +L +
Sbjct: 478  NERWYCLLQS 487


>ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citrus clementina]
            gi|557528866|gb|ESR40116.1| hypothetical protein
            CICLE_v10024743mg [Citrus clementina]
          Length = 1155

 Score =  487 bits (1254), Expect = e-135
 Identities = 259/490 (52%), Positives = 336/490 (68%), Gaps = 8/490 (1%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q I+HIE +V+ +++P+ QAA +D +A+L+KK V T+ETLVREM MYLTTTD +IR+RGI
Sbjct: 6    QLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFT+RLADWKALRGA+VGCLALLRRK   G +T++
Sbjct: 66   LLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGGVITTN 125

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AKAVA+SY+QNLQVQSL QHDR L F+L+ECLL RYP A+  LG+ L+Y ICEA+DGEK
Sbjct: 126  DAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEAVDGEK 185

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DP CL+  F IVE  A+L+ D    LAN+A DLFEILG YFPI FTH K E D D  ++ 
Sbjct: 186  DPHCLMLTFHIVEVAAELFSDDL--LANFAGDLFEILGCYFPIHFTHSKAE-DFDVKRDD 242

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            LSRALM AF+ST LFEPF+IPLLLEKLSS L SAKV+S +YLS+CT+KYG +R+  HA+A
Sbjct: 243  LSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEKHAKA 302

Query: 909  LWSSVKDATYISPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQ-CGDFISLILGD 1085
            +WSS+KDA Y S + TL+  SE L G+GF+D+ ++ ++  LL  V +Q  G F+S I+GD
Sbjct: 303  MWSSIKDAIYSSHEPTLSFASESLDGVGFRDNVILTESLNLLDTVFKQNSGLFLSWIIGD 362

Query: 1086 NDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECL 1265
             DIN+   S++ ++   +I    KQ+LH VG +L   AK+SPA CN V  SFFP LM  L
Sbjct: 363  EDINLIFKSISSFKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFPCLMHPL 422

Query: 1266 GLSVAKSSNGHLDEDCPP-------VKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFS 1424
            GLSV  S+     +DC P        K N  A+YLCIEL+ ACR++  S +  K V   +
Sbjct: 423  GLSVGNST-----QDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAPA 477

Query: 1425 QQTWSTMLSN 1454
             + W  +L +
Sbjct: 478  NERWYCLLQS 487


>ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica]
            gi|462413691|gb|EMJ18740.1| hypothetical protein
            PRUPE_ppa023072mg [Prunus persica]
          Length = 1158

 Score =  479 bits (1234), Expect = e-132
 Identities = 253/490 (51%), Positives = 341/490 (69%), Gaps = 6/490 (1%)
 Frame = +3

Query: 3    SVQWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSR 182
            + + I+HIELYV ++ +P++QAA ++++ +L+K    T+E LV+EM MYLTTTD++IR+R
Sbjct: 4    TTELIQHIELYVDTSRSPTEQAASLNSIISLVKSDFLTIEVLVKEMRMYLTTTDNVIRAR 63

Query: 183  GIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVT 362
            GI                        GFFT+RLADW+ALRGA+VGCLALLRRK + G V+
Sbjct: 64   GILLLAEVLTGLASKPLDNATIHSLIGFFTDRLADWRALRGALVGCLALLRRKVNAGMVS 123

Query: 363  SSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDG 542
            +S+ K VA+SY+++LQVQSLGQHDR L F+L+ECLL+R+P  I  LG+T  YGIC+A+DG
Sbjct: 124  ASDGKLVAQSYIESLQVQSLGQHDRKLCFELLECLLERHPNEIASLGETFFYGICQAMDG 183

Query: 543  EKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANK 722
            EKDP CL+  F IVE L ++YPDPSG LA++  DLFE+LGSYFPI FTH K ++D +  +
Sbjct: 184  EKDPHCLMLTFPIVETLVRIYPDPSGSLASFCGDLFELLGSYFPIHFTHLK-DEDAEVKR 242

Query: 723  EKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHA 902
            + LS+ALM AF+STPLFEPF IPLLLEKLSS LP AKV+S +YL++CT KYG +RMA HA
Sbjct: 243  DDLSKALMSAFSSTPLFEPFVIPLLLEKLSSSLPLAKVDSLKYLNHCTAKYGADRMAKHA 302

Query: 903  EALWSSVKDATYIS---PQCTLTKESELLGGMGFQDSDVMMQAFILLQEV-IQQCGDFIS 1070
             A+W S+KDA   S   P  + T  SE L G+GFQ++++  +A +LLQ+V +Q    F+S
Sbjct: 303  GAIWISLKDAISNSLEKPDMSFT--SEPLYGLGFQENEIATEALMLLQKVTLQNEALFLS 360

Query: 1071 LILGDNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPL 1250
            LI+ D  IN+  NS+  +E  + IP   KQ LH VG +L+  +K+S A CN VF SFFP 
Sbjct: 361  LIIQDEGINIVFNSIASHEHYNNIPLQGKQWLHAVGRILYIISKTSMASCNSVFESFFPR 420

Query: 1251 LMECLGLSVAKSSNG-HLDEDC-PPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFS 1424
            LM  L +SV  S+    L+E+  P  K+NF A+YLC+ELIAACRD+ +         D  
Sbjct: 421  LMNTLEISVTNSAGDCTLNENTFPSKKFNFGALYLCVELIAACRDLIMRSKDLAPKPDTP 480

Query: 1425 QQTWSTMLSN 1454
            Q+T   ML +
Sbjct: 481  QETCRYMLQS 490


>ref|XP_002515963.1| DNA repair/transcription protein met18/mms19, putative [Ricinus
            communis] gi|223544868|gb|EEF46383.1| DNA
            repair/transcription protein met18/mms19, putative
            [Ricinus communis]
          Length = 1174

 Score =  472 bits (1215), Expect = e-130
 Identities = 242/484 (50%), Positives = 336/484 (69%), Gaps = 4/484 (0%)
 Frame = +3

Query: 9    QWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGI 188
            Q  ++IE YV ++ + SQQAA +DA+  L+K    T+ +LV+EMEMYLTTTD IIR+RGI
Sbjct: 6    QLTQYIESYVDASRSLSQQAASLDAIVLLLKNDAVTIGSLVKEMEMYLTTTDDIIRARGI 65

Query: 189  XXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSS 368
                                     FFTERLADW+ALRGA+VGCLAL+RR+ + G +T  
Sbjct: 66   LLLGEALSHLSSKPLDNTTIHSLIAFFTERLADWRALRGALVGCLALIRRRSN-GIITGI 124

Query: 369  EAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEK 548
            +AK VAESYLQNLQVQSL Q+DR L F+L+ECLL+  P A+  LG+ L+YGICEAIDGEK
Sbjct: 125  DAKVVAESYLQNLQVQSLAQYDRKLCFELLECLLENCPAAVASLGEDLIYGICEAIDGEK 184

Query: 549  DPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEK 728
            DPQCL+  F IVE L +L+PDPSGP +++A D+F ILG YFPI FTHPK E D D  ++ 
Sbjct: 185  DPQCLMLTFHIVEVLGKLFPDPSGPFSSFAGDIFSILGCYFPIHFTHPKAE-DVDVKRDD 243

Query: 729  LSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEA 908
            LSRALMLAF+STPLFEPF++PLLLEKLSS LP+AKV+S +YLSYCT+K+  +R+A HA A
Sbjct: 244  LSRALMLAFSSTPLFEPFAMPLLLEKLSSSLPTAKVDSLKYLSYCTLKFRADRIAEHAGA 303

Query: 909  LWSSVKDATYIS-PQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQCGD-FISLILG 1082
            +WSS+KDA Y S  +  L+ + E +   G + +++  +A +LL+ +I Q  + F+S+I+ 
Sbjct: 304  IWSSLKDAIYSSGEEPMLSSDLESVDSPGSEKNEIATEALLLLENLIVQNNNFFLSMIIS 363

Query: 1083 DNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMEC 1262
            D ++ +  N++  Y+  ++I    KQ+LH+VG +L+ CAK S + CN++F S+FP LME 
Sbjct: 364  DEEVKMIFNTITSYKSYNEISLQSKQKLHMVGRILYVCAKVSVSSCNRIFESYFPRLMEA 423

Query: 1263 LGLSVAKSSNG-HLDEDCPPVKY-NFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQTW 1436
            LG+ V  +S   H +E+C   K  N+ + YL I+L+ ACRD++ S D+       + +T+
Sbjct: 424  LGILVENTSGACHSNENCVKAKQPNYGSFYLSIKLLGACRDLSTSSDNLASQCISTNETY 483

Query: 1437 STML 1448
              +L
Sbjct: 484  CCLL 487


>ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Cucumis
            sativus]
          Length = 1147

 Score =  470 bits (1209), Expect = e-130
 Identities = 247/463 (53%), Positives = 320/463 (69%), Gaps = 2/463 (0%)
 Frame = +3

Query: 21   HIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGIXXXX 200
            ++E +V  +  PSQQA  ++ + +L+K  V T+ETLVREM MYLT TD+IIR RGI    
Sbjct: 10   YVESFVDVSRTPSQQATSLETITSLVKNNVLTIETLVREMGMYLTITDNIIRGRGILLLG 69

Query: 201  XXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSSEAKA 380
                                 FFTERLADWKALRGA+VGCLAL+RRK +VGS++ ++AK+
Sbjct: 70   ELLACLASKPLDSATIHSLIAFFTERLADWKALRGALVGCLALMRRKTNVGSISQNDAKS 129

Query: 381  VAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEKDPQC 560
            VA+SY QNLQVQSLGQHDR LSF+L+ CLL+ YP A+  LGD LVYGICEAIDGEKDP C
Sbjct: 130  VAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEAIDGEKDPHC 189

Query: 561  LLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEKLSRA 740
            LL  FRIVE +A+L+PDP+G LA+ + DLFE LG YFPI FTH K E+D D  +  LS A
Sbjct: 190  LLLTFRIVELVAKLFPDPTGALASSSSDLFEFLGCYFPIHFTHGK-EEDIDVRRNDLSHA 248

Query: 741  LMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEALWSS 920
            LM AF+STPLFEPF+IPLLLEKLSS LP AK++S +YLS CT+KYG +RM  H+EA+WSS
Sbjct: 249  LMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMKKHSEAIWSS 308

Query: 921  VKDATYIS-PQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFISLILGDNDI 1094
            VK+  + S  Q  L+  +E L    FQ++++  +A  LLQ+ V+   G F++LI+ D D+
Sbjct: 309  VKEIIFTSIGQPNLSINTESLNSPSFQENEMTTEALRLLQKMVVASNGLFLTLIINDEDV 368

Query: 1095 NVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECLGLS 1274
                N LN Y      P   +QRL+ VGH+L+T A +S A C+ VF S+F  L++ +G+S
Sbjct: 369  KDIFNILNIYTCYKDFPLQSRQRLNAVGHILYTSASASVASCDHVFESYFHRLLDFMGIS 428

Query: 1275 VAKSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSS 1403
            V      H D+  P    NF A+YLCIE+IAACR++ VS D +
Sbjct: 429  V---DQYHNDKISPIRNLNFGALYLCIEVIAACRNLIVSSDEN 468


>ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304108 [Fragaria vesca
            subsp. vesca]
          Length = 1149

 Score =  444 bits (1143), Expect = e-122
 Identities = 234/489 (47%), Positives = 317/489 (64%), Gaps = 4/489 (0%)
 Frame = +3

Query: 3    SVQWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSR 182
            + Q   H+E YV +A  P++QAA ++ + +L+KK + T+E LV+EM MYLT TD++IR+R
Sbjct: 4    TTQLTHHLECYVDTARPPAEQAASLNFITSLVKKDLLTIEVLVKEMRMYLTITDNVIRAR 63

Query: 183  GIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVT 362
            GI                        GFFT+RL+DW+ALRGA++GCLALLRR+ + G V+
Sbjct: 64   GILLLAEVLTGLSSKPLDNATIHSLIGFFTDRLSDWRALRGALIGCLALLRRQVNAGMVS 123

Query: 363  SSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDG 542
            +S+AK VA+SY +N+ VQSL Q DR L F+L+ECLL RYP  +  LG+ L Y I EAID 
Sbjct: 124  ASDAKVVAQSYRENIPVQSLAQQDRKLCFELLECLLQRYPNEVASLGEDLFYAISEAIDE 183

Query: 543  EKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANK 722
            EKDP CL+  F IVE L +L+PDPSGPLA +  DLFE LG YFPI FTH K ++D +  +
Sbjct: 184  EKDPHCLILTFHIVEALVKLFPDPSGPLATFCGDLFEFLGCYFPIHFTHLK-DEDANVKR 242

Query: 723  EKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHA 902
            E LS+ALM AF+ST LFEPF IPLLLEKLSS LP AKV+S +YL+YC  +YG ERMA HA
Sbjct: 243  EDLSKALMSAFSSTALFEPFVIPLLLEKLSSSLPLAKVDSLKYLNYCASRYGAERMAKHA 302

Query: 903  EALWSSVKDATYISPQCTLTK-ESELLGGMGFQDSDVMMQAFILLQEV-IQQCGDFISLI 1076
            E +W S+K A   S +       +E L G+GF++++++ +A ILLQ V +Q     +SLI
Sbjct: 303  ETIWISIKHAISNSLEVPAKSFTAEPLVGLGFEENEIVTEALILLQNVTMQNDALLLSLI 362

Query: 1077 LGDNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLM 1256
            + D DIN  +NS+  +E    IPS  +Q LH VG + F   K+S A CN+VF SFFP LM
Sbjct: 363  VRDEDINNVINSIASHESYTNIPSQGRQSLHAVGRIFFIITKTSMASCNRVFESFFPSLM 422

Query: 1257 ECLGLSVAKSSNGHL--DEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQ 1430
            + L +S+  SS      +      ++ F A+Y C+E IAACRD+ +  +        + +
Sbjct: 423  KTLEISMGNSSKDCTLKENSFSSKRFKFGALYFCVEFIAACRDLIMRTNDHDEKFGTADE 482

Query: 1431 TWSTMLSNS 1457
            T   ML +S
Sbjct: 483  TCCCMLQSS 491


>gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis]
          Length = 1210

 Score =  433 bits (1114), Expect = e-119
 Identities = 231/499 (46%), Positives = 315/499 (63%), Gaps = 42/499 (8%)
 Frame = +3

Query: 18   KHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSRGIXXX 197
            +HIE YV +  + ++QAA +D++ +L+K G+ T+E LVREM+MYLTTTD +IR+RGI   
Sbjct: 9    RHIESYVDTTRSLNEQAASLDSIISLVKNGLVTIEKLVREMDMYLTTTDHVIRARGILLL 68

Query: 198  XXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVTSSEAK 377
                                  FF +RL DW+ LRGA+VGCLALLRRK D G V +++AK
Sbjct: 69   AELLTNLSLKPLDNVTIHSLIDFFADRLVDWRTLRGALVGCLALLRRKSDAGMVPATDAK 128

Query: 378  AVAESYLQNLQVQSLGQHDR---------------------------------------- 437
            AVA SY++NLQVQSLGQHDR                                        
Sbjct: 129  AVALSYVKNLQVQSLGQHDRKLCFELLECLLVTYPNEVASLLCFELLECLLVTYPNEVAS 188

Query: 438  VLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDGEKDPQCLLPVFRIVECLAQLYPDPS 617
            +L F+L+ECLL  YP  +  LG+ ++Y +CE++DGEKDP CL+ VF I+  L  L+P+PS
Sbjct: 189  LLCFELLECLLVTYPNEVASLGEDIIYSVCESVDGEKDPHCLMLVFHIIPALVGLFPNPS 248

Query: 618  GPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANKEKLSRALMLAFASTPLFEPFSIPLL 797
            G LA++  DLFE+LG YFPI FTH K E D D  ++ LSRALM+AF+STPL EPF IPLL
Sbjct: 249  GSLASFPRDLFEVLGCYFPIHFTHHKVE-DVDVKRDDLSRALMIAFSSTPLLEPFVIPLL 307

Query: 798  LEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHAEALWSSVKDATYIS-PQCTLTKESE 974
            LEKLSS L SAK++S +YLSYC++KYG +RMA HA  LWSS+K+A   S  + T +  SE
Sbjct: 308  LEKLSSSLSSAKIDSLKYLSYCSIKYGADRMARHAGILWSSIKNAISTSLKEPTESFYSE 367

Query: 975  LLGGMGFQDSDVMMQAFILLQEVIQQCGD-FISLILGDNDINVFMNSLNQYEEMDQIPSL 1151
             + G+GFQ+++V+ +A +LL+ V+ Q  +  +S+I+ D DI+   N++  Y     IP  
Sbjct: 368  SIDGLGFQENEVVSEALVLLETVVMQNNNLLLSMIVDDEDISTVFNTMTSYGRYKDIPLQ 427

Query: 1152 EKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPLLMECLGLSVAKSSNGHLDEDCPPVKYN 1331
             KQRLHVVG +L+   K+S A CN+V  +FF  L++ L LS+  SS             N
Sbjct: 428  GKQRLHVVGRILYITTKTSIASCNRVLETFFRPLVDILQLSIRSSSRDWF--------LN 479

Query: 1332 FAAIYLCIELIAACRDVAV 1388
            F A+YLC+EL+AACRD+ +
Sbjct: 480  FGALYLCMELLAACRDLVI 498


>ref|XP_004486785.1| PREDICTED: uncharacterized protein LOC101495813 [Cicer arietinum]
          Length = 1138

 Score =  432 bits (1112), Expect = e-118
 Identities = 228/489 (46%), Positives = 314/489 (64%), Gaps = 4/489 (0%)
 Frame = +3

Query: 3    SVQWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSR 182
            + Q  +HIE YV S++ P+ QA  +DA+  LIK    TLE LVRE++MYLT+TD++IR+R
Sbjct: 4    TTQLTRHIESYVDSSSTPTHQATSLDAIGLLIKTNALTLEALVRELDMYLTSTDTVIRAR 63

Query: 183  GIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVT 362
            GI                        GFF ERLADWKA+RGA+VGCLAL+RRK   G VT
Sbjct: 64   GILLLAEVLTRICSKPLDSETIHSLVGFFKERLADWKAVRGALVGCLALIRRKSVAGMVT 123

Query: 363  SSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDG 542
             S+AKA+A+S+LQ LQVQSLG +DR L F+L++ LL+ +  A+  L + L+YGICEAID 
Sbjct: 124  GSDAKAIAQSFLQYLQVQSLGHYDRKLCFELLDFLLEHHADAVASLEEDLIYGICEAIDA 183

Query: 543  EKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANK 722
            EKDP+CL+  F IVE LA+LYPDPSG LA++A D+F+IL  YFPI FTHP    D    +
Sbjct: 184  EKDPECLMLAFHIVESLARLYPDPSGLLASFASDVFDILAPYFPIHFTHP-SSGDTHVQR 242

Query: 723  EKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHA 902
            + LS+ LM AF+STPLFEPF IPLLLEKLSS L SAK++S +YL  C+ KYG ER+A +A
Sbjct: 243  DDLSKILMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLQYLRVCSSKYGAERIAKYA 302

Query: 903  EALWSSVKDATY---ISPQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFIS 1070
             A+WSS+KD  Y     P  + T     + G+GF +++V+++A  LLQ+ ++Q     +S
Sbjct: 303  GAIWSSLKDTLYTYLAEPDLSFTLP---INGIGFPENEVVIEALSLLQQLIVQNNSQLVS 359

Query: 1071 LILGDNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPL 1250
            LI+ D D+N  +NS+  YE  D I   EK++LH +G +L+   K+S + CN VF S F  
Sbjct: 360  LIIDDEDVNFIINSIASYETYDTISVQEKKKLHAIGRILYITVKASISSCNAVFQSLFLR 419

Query: 1251 LMECLGLSVAKSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQ 1430
            +M+ LG+ V+                 F  +YLCIEL+A  R++ V  +  +        
Sbjct: 420  MMDNLGIPVSNIDGLQNSAIFTSQNVKFGFLYLCIELLAGSRELVVLSEEKRETYCTLLH 479

Query: 1431 TWSTMLSNS 1457
            ++ST+L N+
Sbjct: 480  SYSTVLFNA 488


>ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris]
            gi|561023869|gb|ESW22599.1| hypothetical protein
            PHAVU_005G166100g [Phaseolus vulgaris]
          Length = 1145

 Score =  432 bits (1110), Expect = e-118
 Identities = 237/490 (48%), Positives = 323/490 (65%), Gaps = 8/490 (1%)
 Frame = +3

Query: 3    SVQWIKHIELYV-SSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRS 179
            S Q  +HIE YV +S+++PS Q A ++AVA+L+K  V  LE LV+E+ MYLTTTD +IR+
Sbjct: 4    STQLTRHIESYVDASSSSPSLQVASLNAVASLVKTDVLPLEALVKELGMYLTTTDDVIRA 63

Query: 180  RGIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSV 359
            RGI                        GFF ERLADW+A+RGA++GCLAL+RRK  +G V
Sbjct: 64   RGILLLAEVITRTESKPLDSATIHSLVGFFKERLADWRAVRGALLGCLALIRRKSVLGIV 123

Query: 360  TSSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAID 539
            TS++AKA+A+S+ Q +QVQSLGQ DR L F+L++CLL+ YP AI  LGD L+YGICEAID
Sbjct: 124  TSTDAKAIAQSFFQYMQVQSLGQSDRKLCFELLDCLLEHYPDAITPLGDGLIYGICEAID 183

Query: 540  GEKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDAN 719
             EKDP+CL+  F IV+  AQLYP+ SG LA YA+D+F+IL  YFPI FTHP    D    
Sbjct: 184  AEKDPECLMLAFHIVQSWAQLYPESSGLLATYAKDVFDILEPYFPIHFTHPTNA-DTPVQ 242

Query: 720  KEKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASH 899
            ++ LSR+LM AF+STPLFEPF IPLLLEKLSS L SAK++S +YL  C+ KYG ER+A +
Sbjct: 243  RDDLSRSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAKY 302

Query: 900  AEALWSSVKD--ATYI-SPQCTLTKESELLGGMGFQDSDVMMQAFILLQE-VIQQCGDFI 1067
            A ++WSS+KD  +TY+  P  +L        G+GF +++ +++A  LLQ+ ++Q     +
Sbjct: 303  ANSIWSSIKDILSTYLGEPDFSLNIAP--ADGIGFPENEFVVEALSLLQQLIVQNSSLLV 360

Query: 1068 SLILGDNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFP 1247
             LI+ D D+N+F N++  YE  D IP  EK++LH +G +L+  AKS+   CN VF S F 
Sbjct: 361  CLIVDDEDVNIFFNTIASYEIYDAIPVQEKKKLHAIGRILYIAAKSTVTSCNAVFESLFS 420

Query: 1248 LLMECLGLSVA---KSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLD 1418
             +M+ LG+SV+    S+NG +      VK  F  +YLCIEL+   R++ V          
Sbjct: 421  KIMDNLGVSVSNIDSSANGDISSS-QRVKIGF--LYLCIELLVGFRELIVGSKEPALQYV 477

Query: 1419 FSQQTWSTML 1448
               +T  TML
Sbjct: 478  IEHETCCTML 487


>ref|XP_006597169.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X4 [Glycine max]
          Length = 1095

 Score =  430 bits (1106), Expect = e-118
 Identities = 230/486 (47%), Positives = 320/486 (65%), Gaps = 4/486 (0%)
 Frame = +3

Query: 3    SVQWIKHIELYVSSAANPSQQAACVDAVAALIKKGVFTLETLVREMEMYLTTTDSIIRSR 182
            + Q  +HIE YV S++ P+QQA+ ++AVA+L+      LE LVRE+EMYLTTTD+++R+R
Sbjct: 4    TTQLTRHIESYVDSSSTPAQQASSLNAVASLVNTDALPLEALVRELEMYLTTTDNVVRAR 63

Query: 183  GIXXXXXXXXXXXXXXXXXXXXXXXXGFFTERLADWKALRGAIVGCLALLRRKDDVGSVT 362
            GI                        GFF +RLADW+A++GA+VGCLAL+RRK  VG VT
Sbjct: 64   GILLLAEVMTRIESKPLNSATIHSLVGFFKDRLADWRAVQGALVGCLALIRRKSVVGMVT 123

Query: 363  SSEAKAVAESYLQNLQVQSLGQHDRVLSFQLMECLLDRYPGAIGDLGDTLVYGICEAIDG 542
             S+A  +A+S+LQ +QVQSLGQ+DR L F+L++CLL+RY  A+  LG+ L+YGICEAID 
Sbjct: 124  DSDATTIAQSFLQYMQVQSLGQYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICEAIDA 183

Query: 543  EKDPQCLLPVFRIVECLAQLYPDPSGPLANYAEDLFEILGSYFPIRFTHPKGEDDDDANK 722
            EKDP CL   F IV  LAQL PD S  LA+YA+D+F+IL  YFPI FTHP    D    +
Sbjct: 184  EKDPDCLKLAFHIVASLAQLNPDSSSLLASYAKDVFDILEPYFPIHFTHP-SSGDTHVQR 242

Query: 723  EKLSRALMLAFASTPLFEPFSIPLLLEKLSSYLPSAKVESFRYLSYCTMKYGPERMASHA 902
            + LS +LM AF+STPLFEPF IPLLLEKLSS L SAK++S +YL  C+ KYG ER+A +A
Sbjct: 243  DDLSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAKYA 302

Query: 903  EALWSSVKD--ATYI-SPQCTLTKESELLGGMGFQDSDVMMQAFILLQEVIQQCGD-FIS 1070
             A+WSS+KD  +TY+  P  + T     + G+GF +++ +++A  LLQ++I Q     +S
Sbjct: 303  GAIWSSLKDTLSTYLGEPDFSFTIAP--VDGIGFPENEFVIEALSLLQQLIAQNSSLLVS 360

Query: 1071 LILGDNDINVFMNSLNQYEEMDQIPSLEKQRLHVVGHMLFTCAKSSPALCNKVFHSFFPL 1250
            LI+ D D+N   +++  YE  D IP  EK++LH +G +L+  +K++ + CN +F S F  
Sbjct: 361  LIIDDEDVNTIFSTITSYETYDAIPVQEKKKLHAIGRILYITSKTTISSCNAMFESLFTR 420

Query: 1251 LMECLGLSVAKSSNGHLDEDCPPVKYNFAAIYLCIELIAACRDVAVSLDSSKGVLDFSQQ 1430
            +M+ LG SV +  NG +    P  +  F  +YLCIEL+A CR++ V  +       F  +
Sbjct: 421  MMDNLGFSV-RFPNGDIS---PSQRLKFGFLYLCIELLAGCRELIVGSEEPALQYVFEHE 476

Query: 1431 TWSTML 1448
            T  TML
Sbjct: 477  TCCTML 482

