BLASTX nr result

ID: Paeonia22_contig00027966 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia22_contig00027966
         (1755 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI36057.3| unnamed protein product [Vitis vinifera]              722   0.0  
ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prun...   655   0.0  
ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair ...   652   0.0  
ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair ...   652   0.0  
ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citr...   651   0.0  
ref|XP_007024314.1| MMS19 nucleotide excision repair protein, pu...   625   e-176
ref|XP_007024313.1| MMS19 nucleotide excision repair protein, pu...   625   e-176
ref|XP_007024312.1| MMS19 nucleotide excision repair protein, pu...   625   e-176
ref|XP_007024310.1| MMS19 nucleotide excision repair protein, pu...   625   e-176
ref|XP_002515963.1| DNA repair/transcription protein met18/mms19...   619   e-174
gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis]     594   e-167
ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304...   588   e-165
ref|XP_006385450.1| hypothetical protein POPTR_0003s04720g [Popu...   582   e-163
ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair ...   578   e-162
ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair ...   558   e-156
ref|XP_003546956.1| PREDICTED: MMS19 nucleotide excision repair ...   551   e-154
ref|XP_006597167.1| PREDICTED: MMS19 nucleotide excision repair ...   542   e-151
ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair ...   535   e-149
ref|XP_006595125.1| PREDICTED: DNA repair/transcription protein ...   531   e-148
ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein ...   531   e-148

>emb|CBI36057.3| unnamed protein product [Vitis vinifera]
          Length = 1146

 Score =  722 bits (1864), Expect = 0.0
 Identities = 378/584 (64%), Positives = 451/584 (77%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GPLAS+A DLF+IL CYFPIHFTHP+ EDV+VKRDDLSRALMLAFSST LFEPFAIPLLL
Sbjct: 209  GPLASFAGDLFDILGCYFPIHFTHPQGEDVDVKRDDLSRALMLAFSSTTLFEPFAIPLLL 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LP AK+DSLKYLSNC LKYG DRM KH EAIW S+KD I+ S QEP+ SL SE 
Sbjct: 269  EKLSSSLPLAKVDSLKYLSNCLLKYGDDRMTKHVEAIWFSVKDAIFCSEQEPMLSLASEL 328

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            LD + FQ+NEI  EA+ILLQKVIL+N GL +SLI+GD+ IN I++++  + + NDI LQS
Sbjct: 329  LDHVGFQENEIVTEAIILLQKVILENSGLSLSLIVGDKDINTIVNTVTSFRSYNDIPLQS 388

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            K KL A+G IL+VSAKAS+  CNRVFESFF RLM  L LSV+NSSG C+P  + V SE+L
Sbjct: 389  KHKLCAIGRILYVSAKASITCCNRVFESFFFRLMDTLGLSVRNSSGDCLPNFDYVFSERL 448

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            NFGALYLCIEL+ ACR ++ G EELT+ +VSA E+ C ML +FSS L KAFSS L  S  
Sbjct: 449  NFGALYLCIELLAACRDLVVGSEELTSKSVSAQESWCCMLHSFSSLLMKAFSSVLDASTD 508

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              A++ADIY  VKGLQ LATF G F  ISKS FEN+L+TF S+I  DFN TLLW+  LKA
Sbjct: 509  KDAYEADIYSGVKGLQILATFPGEFLPISKSIFENVLLTFISIIVEDFNKTLLWKLALKA 568

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+IG F+D+ +ESEK  SY  IVV+KIVS M  +D  +PF L LEAI DIG TGLN ML
Sbjct: 569  LVQIGSFIDRFHESEKALSYNYIVVEKIVSLMFLDDFGLPFQLRLEAISDIGTTGLNVML 628

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            ++VQGLE+AI ANLS++YVHGNLKSA+I VQL EC  NK+LP  H  G  + V  RFA+ 
Sbjct: 629  KIVQGLEDAIFANLSEVYVHGNLKSAKIAVQLLECYSNKLLPGIHGAGDFEDVLSRFAVN 688

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLK 135
            IWNQ+ENS  FS      ELLNATM AMK+AVG+CSE +Q  I++KAYSVLSS  SF L 
Sbjct: 689  IWNQIENSMAFSVGAQENELLNATMTAMKLAVGSCSEGSQGKIIKKAYSVLSSCPSFTLM 748

Query: 134  ESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
            ESM +T  VQL+GLQ T+ L+ FS  D W++SLFAS  +A+RP+
Sbjct: 749  ESMPITGTVQLEGLQHTQDLECFSCRDKWVISLFASAIIAVRPQ 792


>ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica]
            gi|462413691|gb|EMJ18740.1| hypothetical protein
            PRUPE_ppa023072mg [Prunus persica]
          Length = 1158

 Score =  655 bits (1689), Expect = 0.0
 Identities = 352/594 (59%), Positives = 428/594 (72%), Gaps = 10/594 (1%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            G LAS+  DLFE+L  YFPIHFTH K ED EVKRDDLS+ALM AFSSTPLFEPF IPLLL
Sbjct: 209  GSLASFCGDLFELLGSYFPIHFTHLKDEDAEVKRDDLSKALMSAFSSTPLFEPFVIPLLL 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LP AK+DSLKYL++CT KYGADRMAKHA AIW SLKD I  S ++P  S TSEP
Sbjct: 269  EKLSSSLPLAKVDSLKYLNHCTAKYGADRMAKHAGAIWISLKDAISNSLEKPDMSFTSEP 328

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L G+ FQ+NEI  EAL+LLQKV LQN  LF+SLII DEGIN++ +SI  +   N+I LQ 
Sbjct: 329  LYGLGFQENEIATEALMLLQKVTLQNEALFLSLIIQDEGINIVFNSIASHEHYNNIPLQG 388

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ LHAVG IL++ +K S+ASCN VFESFF RLM  LE+SV NS+G C   +N   S+K 
Sbjct: 389  KQWLHAVGRILYIISKTSMASCNSVFESFFPRLMNTLEISVTNSAGDCTLNENTFPSKKF 448

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            NFGALYLC+ELI ACR +I   ++L     +  ETC  ML++F+ SL  AFSS+L T+ +
Sbjct: 449  NFGALYLCVELIAACRDLIMRSKDLAPKPDTPQETCRYMLQSFADSLVNAFSSSLATNAN 508

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              AH ADIY +VKGLQ LATF G F  ISK  F NIL    S+I  DFN  LLW+  LKA
Sbjct: 509  EVAHGADIYFKVKGLQILATFPGDFLPISKFLFANILTILMSIILVDFNKILLWKLVLKA 568

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV IG FVD  +ESEK   YM  VVDK VS +S +D  MPF L LEA  +IG +G N ML
Sbjct: 569  LVHIGSFVDVYHESEKALGYMGAVVDKTVSLVSRDDVKMPFSLKLEAASEIGASGRNHML 628

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            ++VQG+EEAI A LSD YVHGNLKSAE  +QL EC CNK+L W ++TG  ++V LRF I 
Sbjct: 629  KIVQGMEEAIVAKLSD-YVHGNLKSAEKTIQLLECYCNKILSWINETGGLEEVLLRFVIN 687

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLK 135
            IWN VE+   FS +V  +ELL+ATMMAMK+A+G+CSEE+Q+ I+ KAYSV+SS  S P K
Sbjct: 688  IWNCVESCKDFSIQVQEEELLDATMMAMKLAIGSCSEESQNIIIHKAYSVISSSISIPFK 747

Query: 134  ESMSLTIPVQLDGLQLTRILDG----------FSSVDDWLLSLFASVTVALRPR 3
            ES+  T  +QL+ L ++  +D           FS  D+W+LS FASV +A+RP+
Sbjct: 748  ESLDATSSIQLEELSVSEQIDNSSHRDDQIDKFSLRDEWILSHFASVIIAVRPK 801


>ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X2 [Citrus sinensis]
          Length = 1151

 Score =  652 bits (1682), Expect = 0.0
 Identities = 347/582 (59%), Positives = 435/582 (74%)
 Frame = -2

Query: 1748 LASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLLEK 1569
            LA++A DLFEIL CYFPIHFTH K+ED +VKRDDLSRALM AFSST LFEPFAIPLLLEK
Sbjct: 209  LANFASDLFEILGCYFPIHFTHSKAEDFDVKRDDLSRALMAAFSSTSLFEPFAIPLLLEK 268

Query: 1568 LSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEPLD 1389
            LSS+L SAK+DSLKYLS+CT+KYGADR+ KHA+A+WSS+KD +Y+S  EP  S  SE LD
Sbjct: 269  LSSSLQSAKVDSLKYLSHCTVKYGADRIEKHAKAMWSSIKDAVYSS-HEPTLSFASESLD 327

Query: 1388 GIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQSKQ 1209
            G+ F++N I  E+L LL  V  QN GLF+S IIGDE IN+I  SI  Y T  +I+LQSKQ
Sbjct: 328  GVGFRENVILTESLNLLDTVFKQNSGLFLSWIIGDEDINLIFKSISSYKTYKEISLQSKQ 387

Query: 1208 KLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKLNF 1029
            KLHAVGSIL VSAKAS A+CN V ESFF  LM AL LSV NS+  C P D +V+  KLN 
Sbjct: 388  KLHAVGSILSVSAKASPAACNSVMESFFPCLMHALGLSVGNSTQDCFPNDGNVLRGKLNH 447

Query: 1028 GALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSISSG 849
            GALYLCIEL+ ACR ++A  EE  +    A+E    +L+++S+SL KA  STL TS +  
Sbjct: 448  GALYLCIELMTACRELMASSEEFKSVAAPANERWYCLLQSYSASLAKALRSTLETSANED 507

Query: 848  AHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKALV 669
            +++ ++Y  VKGL  L TF GG   IS S FENIL+TFTS+I ++F +TLLW+  LKALV
Sbjct: 508  SYETNVYFGVKGLLILGTFRGGSLIISNSIFENILLTFTSIIISEFENTLLWKLALKALV 567

Query: 668  KIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFMLRV 489
             IG F+D+ NESEK  SYM +V++KIVS  S +D SMPFPL LEAI +IG TG N++L++
Sbjct: 568  HIGSFIDRFNESEKALSYMDVVIEKIVSLASSHDFSMPFPLKLEAISEIGATGRNYLLKI 627

Query: 488  VQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAIIIW 309
            VQGLEEA+ ANL ++ VHGN KSAE+VVQL EC  NKVLP  H+ G  ++V LRFA+ IW
Sbjct: 628  VQGLEEAVCANLYEVLVHGNPKSAEVVVQLLECYSNKVLPRIHEIGGFEEVLLRFAVNIW 687

Query: 308  NQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLKES 129
            N +E S TFS +V  K LL+ATM AMK+AVG+CS E+Q+ + +KA++VLS  T FPL+++
Sbjct: 688  NLIEKSVTFSSQVHEKGLLDATMKAMKLAVGSCSVESQNIVFQKAFTVLSLGTYFPLEDA 747

Query: 128  MSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
             S  IP+ L+  QLT+     SS + W+ SLFASV +A RP+
Sbjct: 748  AS-NIPILLNEFQLTQETSISSSREAWICSLFASVIIAARPQ 788


>ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X1 [Citrus sinensis]
          Length = 1155

 Score =  652 bits (1682), Expect = 0.0
 Identities = 347/582 (59%), Positives = 435/582 (74%)
 Frame = -2

Query: 1748 LASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLLEK 1569
            LA++A DLFEIL CYFPIHFTH K+ED +VKRDDLSRALM AFSST LFEPFAIPLLLEK
Sbjct: 209  LANFASDLFEILGCYFPIHFTHSKAEDFDVKRDDLSRALMAAFSSTSLFEPFAIPLLLEK 268

Query: 1568 LSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEPLD 1389
            LSS+L SAK+DSLKYLS+CT+KYGADR+ KHA+A+WSS+KD +Y+S  EP  S  SE LD
Sbjct: 269  LSSSLQSAKVDSLKYLSHCTVKYGADRIEKHAKAMWSSIKDAVYSS-HEPTLSFASESLD 327

Query: 1388 GIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQSKQ 1209
            G+ F++N I  E+L LL  V  QN GLF+S IIGDE IN+I  SI  Y T  +I+LQSKQ
Sbjct: 328  GVGFRENVILTESLNLLDTVFKQNSGLFLSWIIGDEDINLIFKSISSYKTYKEISLQSKQ 387

Query: 1208 KLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKLNF 1029
            KLHAVGSIL VSAKAS A+CN V ESFF  LM AL LSV NS+  C P D +V+  KLN 
Sbjct: 388  KLHAVGSILSVSAKASPAACNSVMESFFPCLMHALGLSVGNSTQDCFPNDGNVLRGKLNH 447

Query: 1028 GALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSISSG 849
            GALYLCIEL+ ACR ++A  EE  +    A+E    +L+++S+SL KA  STL TS +  
Sbjct: 448  GALYLCIELMTACRELMASSEEFKSVAAPANERWYCLLQSYSASLAKALRSTLETSANED 507

Query: 848  AHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKALV 669
            +++ ++Y  VKGL  L TF GG   IS S FENIL+TFTS+I ++F +TLLW+  LKALV
Sbjct: 508  SYETNVYFGVKGLLILGTFRGGSLIISNSIFENILLTFTSIIISEFENTLLWKLALKALV 567

Query: 668  KIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFMLRV 489
             IG F+D+ NESEK  SYM +V++KIVS  S +D SMPFPL LEAI +IG TG N++L++
Sbjct: 568  HIGSFIDRFNESEKALSYMDVVIEKIVSLASSHDFSMPFPLKLEAISEIGATGRNYLLKI 627

Query: 488  VQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAIIIW 309
            VQGLEEA+ ANL ++ VHGN KSAE+VVQL EC  NKVLP  H+ G  ++V LRFA+ IW
Sbjct: 628  VQGLEEAVCANLYEVLVHGNPKSAEVVVQLLECYSNKVLPRIHEIGGFEEVLLRFAVNIW 687

Query: 308  NQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLKES 129
            N +E S TFS +V  K LL+ATM AMK+AVG+CS E+Q+ + +KA++VLS  T FPL+++
Sbjct: 688  NLIEKSVTFSSQVHEKGLLDATMKAMKLAVGSCSVESQNIVFQKAFTVLSLGTYFPLEDA 747

Query: 128  MSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
             S  IP+ L+  QLT+     SS + W+ SLFASV +A RP+
Sbjct: 748  AS-NIPILLNEFQLTQETSISSSREAWICSLFASVIIAARPQ 788


>ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citrus clementina]
            gi|557528866|gb|ESR40116.1| hypothetical protein
            CICLE_v10024743mg [Citrus clementina]
          Length = 1155

 Score =  651 bits (1680), Expect = 0.0
 Identities = 348/582 (59%), Positives = 435/582 (74%)
 Frame = -2

Query: 1748 LASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLLEK 1569
            LA++A DLFEIL CYFPIHFTH K+ED +VKRDDLSRALM AFSST LFEPFAIPLLLEK
Sbjct: 209  LANFAGDLFEILGCYFPIHFTHSKAEDFDVKRDDLSRALMAAFSSTSLFEPFAIPLLLEK 268

Query: 1568 LSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEPLD 1389
            LSS+L SAK+DSLKYLS+CT+KYGADR+ KHA+A+WSS+KD IY+S  EP  S  SE LD
Sbjct: 269  LSSSLQSAKVDSLKYLSHCTVKYGADRIEKHAKAMWSSIKDAIYSS-HEPTLSFASESLD 327

Query: 1388 GIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQSKQ 1209
            G+ F+DN I  E+L LL  V  QN GLF+S IIGDE IN+I  SI  + T  +I+LQSKQ
Sbjct: 328  GVGFRDNVILTESLNLLDTVFKQNSGLFLSWIIGDEDINLIFKSISSFKTYKEISLQSKQ 387

Query: 1208 KLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKLNF 1029
            KLHAVGSIL VSAKAS A+CN V ESFF  LM  L LSV NS+  C P D +V+  KLN 
Sbjct: 388  KLHAVGSILSVSAKASPAACNSVMESFFPCLMHPLGLSVGNSTQDCFPNDGNVLRGKLNH 447

Query: 1028 GALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSISSG 849
            GALYLCIEL+ ACR ++A  EE  +    A+E    +L+++S+SL KA  STL TS +  
Sbjct: 448  GALYLCIELMTACRELMASSEEFKSVAAPANERWYCLLQSYSASLAKALRSTLETSANED 507

Query: 848  AHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKALV 669
            +++ ++Y  VKGL  L TFSGG   IS S FENIL+TFTS+I ++F +TLLW+  LKALV
Sbjct: 508  SYETNVYFGVKGLLILGTFSGGSLIISNSIFENILLTFTSIIISEFENTLLWKLALKALV 567

Query: 668  KIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFMLRV 489
             IG F+D+ NESEK  SYM +V++KIVS  S +D SMPFPL LEAI +IG TG N++L++
Sbjct: 568  HIGSFIDRFNESEKALSYMDVVIEKIVSLASSHDFSMPFPLKLEAISEIGATGRNYLLKI 627

Query: 488  VQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAIIIW 309
            VQGLEEA+ ANL ++ VHGN KSAE+VVQL EC  NKVLP  H+ G  ++V LRFA+ IW
Sbjct: 628  VQGLEEAVCANLYEVLVHGNPKSAEVVVQLLECYSNKVLPRIHEIGGFEEVLLRFAVNIW 687

Query: 308  NQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLKES 129
            N +E S TFS +V  K LL+ATM AMK+AVG+CS E+Q+ + +KA++VLS  T FPL+++
Sbjct: 688  NLIEKSVTFSSQVHEKGLLDATMKAMKLAVGSCSVESQNIVFQKAFTVLSLGTYFPLEDA 747

Query: 128  MSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
             S  IP+QL+  QLT+     SS + W+ SLFASV +A  P+
Sbjct: 748  AS-NIPIQLNEFQLTQETSISSSREAWICSLFASVIIAACPQ 788


>ref|XP_007024314.1| MMS19 nucleotide excision repair protein, putative isoform 5
            [Theobroma cacao] gi|508779680|gb|EOY26936.1| MMS19
            nucleotide excision repair protein, putative isoform 5
            [Theobroma cacao]
          Length = 1157

 Score =  625 bits (1613), Expect = e-176
 Identities = 329/584 (56%), Positives = 425/584 (72%), Gaps = 1/584 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GP  S+A DLFE L+ YFP+HFTHPK EDV +KRDDL+RALMLAFSSTPLFEPFAIPLL+
Sbjct: 209  GPFTSFAHDLFENLSYYFPVHFTHPKGEDVNIKRDDLARALMLAFSSTPLFEPFAIPLLI 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LPSAK+DSL+YLS+CT+KYG DRMAKH EA+WSSLKD ++ S  + + S T E 
Sbjct: 269  EKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEALWSSLKDAVFTS-LDGVLSFTPES 327

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L+G+   +NEI  EAL LLQK+I+QN   F+ LI+ DE INMI + I  Y + + I  QS
Sbjct: 328  LEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVDEDINMIFNMISSYKSYHGIPAQS 387

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LHAVG IL  S KAS ASCNRVFE FF RLM  L L V+NSSG     D+ ++ ++ 
Sbjct: 388  KQRLHAVGCILSASVKASTASCNRVFECFFSRLMDILGLCVRNSSGNLSSDDSIMIPKRY 447

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            N GALYL IEL+ ACR +IA  E + A +   +ET   +LR+FSSSL KAF S  + + S
Sbjct: 448  NHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRSFSSSLTKAFCSASICT-S 506

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              +HDAD+Y  VKGL  LATF  G+  ISK  FE ILMTF S++T D+++TLLW+  LKA
Sbjct: 507  EDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVSIVTVDYSNTLLWKLALKA 566

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+IG F+++ +ESEK  SY+ +VV+KIVS  S  D S+PFPL LEA+ +IG +G ++ML
Sbjct: 567  LVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFPLRLEALSEIGTSGKSYML 626

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            +VV+GLEEAI ANLS++YVHG+  SAEIV QL +C  +KV+PW       D+VPL+FAI 
Sbjct: 627  KVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIPWIQCAKGFDEVPLQFAIH 686

Query: 314  IWNQVENSSTFSDKVLGK-ELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPL 138
            IWNQ+E S  F+     K E+L+  M AMK+AV +CSEENQ+ I++K+Y +LSS TSFPL
Sbjct: 687  IWNQIELSMVFNATQTNKIEVLDVMMKAMKLAVASCSEENQNIIVQKSYHILSSSTSFPL 746

Query: 137  KESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRP 6
            KE        + +  Q+ ++ D  SS D+W+LSLFA+V +A+ P
Sbjct: 747  KELF------RQESFQIVQV-DNSSSRDEWILSLFAAVVIAVHP 783


>ref|XP_007024313.1| MMS19 nucleotide excision repair protein, putative isoform 4
            [Theobroma cacao] gi|508779679|gb|EOY26935.1| MMS19
            nucleotide excision repair protein, putative isoform 4
            [Theobroma cacao]
          Length = 1136

 Score =  625 bits (1613), Expect = e-176
 Identities = 329/584 (56%), Positives = 425/584 (72%), Gaps = 1/584 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GP  S+A DLFE L+ YFP+HFTHPK EDV +KRDDL+RALMLAFSSTPLFEPFAIPLL+
Sbjct: 209  GPFTSFAHDLFENLSYYFPVHFTHPKGEDVNIKRDDLARALMLAFSSTPLFEPFAIPLLI 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LPSAK+DSL+YLS+CT+KYG DRMAKH EA+WSSLKD ++ S  + + S T E 
Sbjct: 269  EKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEALWSSLKDAVFTS-LDGVLSFTPES 327

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L+G+   +NEI  EAL LLQK+I+QN   F+ LI+ DE INMI + I  Y + + I  QS
Sbjct: 328  LEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVDEDINMIFNMISSYKSYHGIPAQS 387

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LHAVG IL  S KAS ASCNRVFE FF RLM  L L V+NSSG     D+ ++ ++ 
Sbjct: 388  KQRLHAVGCILSASVKASTASCNRVFECFFSRLMDILGLCVRNSSGNLSSDDSIMIPKRY 447

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            N GALYL IEL+ ACR +IA  E + A +   +ET   +LR+FSSSL KAF S  + + S
Sbjct: 448  NHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRSFSSSLTKAFCSASICT-S 506

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              +HDAD+Y  VKGL  LATF  G+  ISK  FE ILMTF S++T D+++TLLW+  LKA
Sbjct: 507  EDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVSIVTVDYSNTLLWKLALKA 566

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+IG F+++ +ESEK  SY+ +VV+KIVS  S  D S+PFPL LEA+ +IG +G ++ML
Sbjct: 567  LVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFPLRLEALSEIGTSGKSYML 626

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            +VV+GLEEAI ANLS++YVHG+  SAEIV QL +C  +KV+PW       D+VPL+FAI 
Sbjct: 627  KVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIPWIQCAKGFDEVPLQFAIH 686

Query: 314  IWNQVENSSTFSDKVLGK-ELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPL 138
            IWNQ+E S  F+     K E+L+  M AMK+AV +CSEENQ+ I++K+Y +LSS TSFPL
Sbjct: 687  IWNQIELSMVFNATQTNKIEVLDVMMKAMKLAVASCSEENQNIIVQKSYHILSSSTSFPL 746

Query: 137  KESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRP 6
            KE        + +  Q+ ++ D  SS D+W+LSLFA+V +A+ P
Sbjct: 747  KELF------RQESFQIVQV-DNSSSRDEWILSLFAAVVIAVHP 783


>ref|XP_007024312.1| MMS19 nucleotide excision repair protein, putative isoform 3
            [Theobroma cacao] gi|508779678|gb|EOY26934.1| MMS19
            nucleotide excision repair protein, putative isoform 3
            [Theobroma cacao]
          Length = 1062

 Score =  625 bits (1613), Expect = e-176
 Identities = 329/584 (56%), Positives = 425/584 (72%), Gaps = 1/584 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GP  S+A DLFE L+ YFP+HFTHPK EDV +KRDDL+RALMLAFSSTPLFEPFAIPLL+
Sbjct: 209  GPFTSFAHDLFENLSYYFPVHFTHPKGEDVNIKRDDLARALMLAFSSTPLFEPFAIPLLI 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LPSAK+DSL+YLS+CT+KYG DRMAKH EA+WSSLKD ++ S  + + S T E 
Sbjct: 269  EKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEALWSSLKDAVFTS-LDGVLSFTPES 327

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L+G+   +NEI  EAL LLQK+I+QN   F+ LI+ DE INMI + I  Y + + I  QS
Sbjct: 328  LEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVDEDINMIFNMISSYKSYHGIPAQS 387

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LHAVG IL  S KAS ASCNRVFE FF RLM  L L V+NSSG     D+ ++ ++ 
Sbjct: 388  KQRLHAVGCILSASVKASTASCNRVFECFFSRLMDILGLCVRNSSGNLSSDDSIMIPKRY 447

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            N GALYL IEL+ ACR +IA  E + A +   +ET   +LR+FSSSL KAF S  + + S
Sbjct: 448  NHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRSFSSSLTKAFCSASICT-S 506

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              +HDAD+Y  VKGL  LATF  G+  ISK  FE ILMTF S++T D+++TLLW+  LKA
Sbjct: 507  EDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVSIVTVDYSNTLLWKLALKA 566

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+IG F+++ +ESEK  SY+ +VV+KIVS  S  D S+PFPL LEA+ +IG +G ++ML
Sbjct: 567  LVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFPLRLEALSEIGTSGKSYML 626

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            +VV+GLEEAI ANLS++YVHG+  SAEIV QL +C  +KV+PW       D+VPL+FAI 
Sbjct: 627  KVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIPWIQCAKGFDEVPLQFAIH 686

Query: 314  IWNQVENSSTFSDKVLGK-ELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPL 138
            IWNQ+E S  F+     K E+L+  M AMK+AV +CSEENQ+ I++K+Y +LSS TSFPL
Sbjct: 687  IWNQIELSMVFNATQTNKIEVLDVMMKAMKLAVASCSEENQNIIVQKSYHILSSSTSFPL 746

Query: 137  KESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRP 6
            KE        + +  Q+ ++ D  SS D+W+LSLFA+V +A+ P
Sbjct: 747  KELF------RQESFQIVQV-DNSSSRDEWILSLFAAVVIAVHP 783


>ref|XP_007024310.1| MMS19 nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao] gi|590619491|ref|XP_007024311.1| MMS19
            nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao] gi|508779676|gb|EOY26932.1| MMS19
            nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao] gi|508779677|gb|EOY26933.1| MMS19
            nucleotide excision repair protein, putative isoform 1
            [Theobroma cacao]
          Length = 1149

 Score =  625 bits (1613), Expect = e-176
 Identities = 329/584 (56%), Positives = 425/584 (72%), Gaps = 1/584 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GP  S+A DLFE L+ YFP+HFTHPK EDV +KRDDL+RALMLAFSSTPLFEPFAIPLL+
Sbjct: 209  GPFTSFAHDLFENLSYYFPVHFTHPKGEDVNIKRDDLARALMLAFSSTPLFEPFAIPLLI 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LPSAK+DSL+YLS+CT+KYG DRMAKH EA+WSSLKD ++ S  + + S T E 
Sbjct: 269  EKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAKHGEALWSSLKDAVFTS-LDGVLSFTPES 327

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L+G+   +NEI  EAL LLQK+I+QN   F+ LI+ DE INMI + I  Y + + I  QS
Sbjct: 328  LEGLCLPENEIAAEALSLLQKLIVQNTNFFLDLIVVDEDINMIFNMISSYKSYHGIPAQS 387

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LHAVG IL  S KAS ASCNRVFE FF RLM  L L V+NSSG     D+ ++ ++ 
Sbjct: 388  KQRLHAVGCILSASVKASTASCNRVFECFFSRLMDILGLCVRNSSGNLSSDDSIMIPKRY 447

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            N GALYL IEL+ ACR +IA  E + A +   +ET   +LR+FSSSL KAF S  + + S
Sbjct: 448  NHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRSFSSSLTKAFCSASICT-S 506

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              +HDAD+Y  VKGL  LATF  G+  ISK  FE ILMTF S++T D+++TLLW+  LKA
Sbjct: 507  EDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVSIVTVDYSNTLLWKLALKA 566

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+IG F+++ +ESEK  SY+ +VV+KIVS  S  D S+PFPL LEA+ +IG +G ++ML
Sbjct: 567  LVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFPLRLEALSEIGTSGKSYML 626

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            +VV+GLEEAI ANLS++YVHG+  SAEIV QL +C  +KV+PW       D+VPL+FAI 
Sbjct: 627  KVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIPWIQCAKGFDEVPLQFAIH 686

Query: 314  IWNQVENSSTFSDKVLGK-ELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPL 138
            IWNQ+E S  F+     K E+L+  M AMK+AV +CSEENQ+ I++K+Y +LSS TSFPL
Sbjct: 687  IWNQIELSMVFNATQTNKIEVLDVMMKAMKLAVASCSEENQNIIVQKSYHILSSSTSFPL 746

Query: 137  KESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRP 6
            KE        + +  Q+ ++ D  SS D+W+LSLFA+V +A+ P
Sbjct: 747  KELF------RQESFQIVQV-DNSSSRDEWILSLFAAVVIAVHP 783


>ref|XP_002515963.1| DNA repair/transcription protein met18/mms19, putative [Ricinus
            communis] gi|223544868|gb|EEF46383.1| DNA
            repair/transcription protein met18/mms19, putative
            [Ricinus communis]
          Length = 1174

 Score =  619 bits (1597), Expect = e-174
 Identities = 332/600 (55%), Positives = 430/600 (71%), Gaps = 16/600 (2%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GP +S+A D+F IL CYFPIHFTHPK+EDV+VKRDDLSRALMLAFSSTPLFEPFA+PLLL
Sbjct: 208  GPFSSFAGDIFSILGCYFPIHFTHPKAEDVDVKRDDLSRALMLAFSSTPLFEPFAMPLLL 267

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LP+AK+DSLKYLS CTLK+ ADR+A+HA AIWSSLKD IY+S +EP+ S   E 
Sbjct: 268  EKLSSSLPTAKVDSLKYLSYCTLKFRADRIAEHAGAIWSSLKDAIYSSGEEPMLSSDLES 327

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            +D    + NEI  EAL+LL+ +I+QN   F+S+II DE + MI ++I  Y + N+I+LQS
Sbjct: 328  VDSPGSEKNEIATEALLLLENLIVQNNNFFLSMIISDEEVKMIFNTITSYKSYNEISLQS 387

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQKLH VG IL+V AK SV+SCNR+FES+F RLM AL + V+N+SG C   +N V +++ 
Sbjct: 388  KQKLHMVGRILYVCAKVSVSSCNRIFESYFPRLMEALGILVENTSGACHSNENCVKAKQP 447

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            N+G+ YL I+L+ ACR +    + L +  +S +ET C +L+ FS+SL + FS+ L TS S
Sbjct: 448  NYGSFYLSIKLLGACRDLSTSSDNLASQCISTNETYCCLLQRFSTSLTETFSAALATSTS 507

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
              A D D+YL VKGLQ LATF GG+  +SK TF+NILMTF S+IT DFN TLLW Q LKA
Sbjct: 508  GPAQDVDMYLGVKGLQILATFPGGYLFLSKLTFDNILMTFLSIITVDFNKTLLWNQALKA 567

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+IG FV   NES+K  SY+ IVV K++   S  D SMP+ L L AI  IG +G  +ML
Sbjct: 568  LVQIGSFVHGCNESDKEMSYVDIVVGKMILLASSPDFSMPWSLKLTAISSIGMSGQKYML 627

Query: 494  RVVQGLEEAISANLSDIY---------------VHGNLKSAEIVVQLFECLCNKVLPWFH 360
            +V  GLEEAI ANL++IY               V GNLKSA+I++QL EC  +++LPW  
Sbjct: 628  KVFLGLEEAIRANLAEIYVCMIKKKIYVLYSCLVQGNLKSAKILLQLLECYSDELLPWIQ 687

Query: 359  KTGYSDKVPLRFAIIIWNQVENSSTFSDKVLGKE-LLNATMMAMKIAVGNCSEENQSTIL 183
            KT   ++V ++F + +WNQ+EN + F+    GKE LL+A M  MK AV  CS E+Q+ I+
Sbjct: 688  KTEGFEEVLMQFVVNLWNQIENFNAFTVAFHGKESLLDAIMKVMKDAVAFCSVESQNVII 747

Query: 182  EKAYSVLSSRTSFPLKESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
             KAY VLSS T  PLKES+S    VQL+  +  + +D  SS D+W+ SLFASV +ALRP+
Sbjct: 748  YKAYGVLSSSTFLPLKESLSEN-SVQLECFRAIQQMDRLSSRDEWIHSLFASVIIALRPQ 806


>gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis]
          Length = 1210

 Score =  594 bits (1532), Expect = e-167
 Identities = 315/584 (53%), Positives = 414/584 (70%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            G LAS+  DLFE+L CYFPIHFTH K EDV+VKRDDLSRALM+AFSSTPL EPF IPLLL
Sbjct: 249  GSLASFPRDLFEVLGCYFPIHFTHHKVEDVDVKRDDLSRALMIAFSSTPLLEPFVIPLLL 308

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+L SAK+DSLKYLS C++KYGADRMA+HA  +WSS+K+ I  S +EP  S  SE 
Sbjct: 309  EKLSSSLSSAKIDSLKYLSYCSIKYGADRMARHAGILWSSIKNAISTSLKEPTESFYSES 368

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            +DG+ FQ+NE+  EAL+LL+ V++QN  L +S+I+ DE I+ + +++  Y    DI LQ 
Sbjct: 369  IDGLGFQENEVVSEALVLLETVVMQNNNLLLSMIVDDEDISTVFNTMTSYGRYKDIPLQG 428

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LH VG IL+++ K S+ASCNRV E+FF  L+  L+LS+++SS              L
Sbjct: 429  KQRLHVVGRILYITTKTSIASCNRVLETFFRPLVDILQLSIRSSSR----------DWFL 478

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            NFGALYLC+EL+ ACR ++    EL +N++ A ET C +L++F  SL  A  S L T+ +
Sbjct: 479  NFGALYLCMELLAACRDLVIYSRELASNSIPAHETFCCILQSFCVSLIDALCSILETTAN 538

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
             GA D DIYLRV+ LQ LATF      IS + F+NIL T  S+I  DFN   LW+  LKA
Sbjct: 539  EGADDVDIYLRVRSLQILATFPEDLLAISDNVFKNILTTLMSIIFKDFNQKFLWKLALKA 598

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV IG FV +  ESEK  SY SIVV+K+VS +S ++ ++PFPL LEA+ +IG +G N ML
Sbjct: 599  LVHIGSFVSR-YESEKAQSYNSIVVEKMVSWVSVDNCTLPFPLKLEAVSEIGASGRNHML 657

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
             +VQGLE AI + +SD YVHGN+ SAE+ +QL +    KV+PW H+T   +++ LRFA  
Sbjct: 658  NIVQGLEGAIFSYVSDFYVHGNVSSAEVAIQLLQFYSEKVIPWIHETEGLEEILLRFATN 717

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLK 135
            IW+ VE+  + + +V  K LL+A MMAMK+ VG+CSEE Q  IL+KAY+VLSS TS  LK
Sbjct: 718  IWDHVESWISCNVEVQEKGLLDAIMMAMKLTVGSCSEEIQYIILQKAYTVLSSNTSLLLK 777

Query: 134  ESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
            +S   +IPVQL+  QL + +D  S  D+ +LSLFASV +A+RPR
Sbjct: 778  KSSLTSIPVQLEESQLIQHVDNISHRDELVLSLFASVIIAVRPR 821


>ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304108 [Fragaria vesca
            subsp. vesca]
          Length = 1149

 Score =  588 bits (1516), Expect = e-165
 Identities = 312/584 (53%), Positives = 398/584 (68%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GPLA++  DLFE L CYFPIHFTH K ED  VKR+DLS+ALM AFSST LFEPF IPLLL
Sbjct: 209  GPLATFCGDLFEFLGCYFPIHFTHLKDEDANVKREDLSKALMSAFSSTALFEPFVIPLLL 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LP AK+DSLKYL+ C  +YGA+RMAKHAE IW S+K  I  S + P  S T+EP
Sbjct: 269  EKLSSSLPLAKVDSLKYLNYCASRYGAERMAKHAETIWISIKHAISNSLEVPAKSFTAEP 328

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L G+ F++NEI  EALILLQ V +QN  L +SLI+ DE IN +I+SI  + +  +I  Q 
Sbjct: 329  LVGLGFEENEIVTEALILLQNVTMQNDALLLSLIVRDEDINNVINSIASHESYTNIPSQG 388

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            +Q LHAVG I  +  K S+ASCNRVFESFF  LM  LE+S+ NSS  C   +N   S++ 
Sbjct: 389  RQSLHAVGRIFFIITKTSMASCNRVFESFFPSLMKTLEISMGNSSKDCTLKENSFSSKRF 448

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
             FGALY C+E I ACR +I    +      +ADETCC ML++ + +L  AF +TL     
Sbjct: 449  KFGALYFCVEFIAACRDLIMRTNDHDEKFGTADETCCCMLQSSAPTLITAFCTTLAQISC 508

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
            + A DADIY +VKGLQ LATF G F  I K+ FEN+L T  S+I  DF+  LLW+  LKA
Sbjct: 509  NVADDADIYFKVKGLQMLATFPGYFLQIPKAMFENVLKTLMSIILVDFDKPLLWKLALKA 568

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            L  IG FVD   ESEK  SY S VV+K +S +  +D  +PFPL LEA+ +IG +  N ML
Sbjct: 569  LAHIGSFVDVHLESEKAQSYTSFVVEKTIS-LPQDDFDVPFPLKLEAVFEIGASRPNHML 627

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
            R++QGLE+AI ANLS  ++HG+LK+AE  +QL EC  NK++ W  + G  ++V  RF I 
Sbjct: 628  RIIQGLEDAIVANLSKTFIHGDLKAAEKTIQLLECYSNKIISWIDENGGLEEVLCRFVIS 687

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLK 135
            IWN +E     S++V  K LL+ATM AMK+AVG+CSEE+Q+ I++KAY  LSS  S P K
Sbjct: 688  IWNCLERCKDSSNQVQDKGLLDATMTAMKLAVGSCSEESQNIIIQKAYGALSSGISIPFK 747

Query: 134  ESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
            +S   +   +L+ L L   LD  S  D+W+ SLFASV +A+RPR
Sbjct: 748  DSTDDSSLAKLETLHLFEQLDKLSPRDEWIFSLFASVIIAMRPR 791


>ref|XP_006385450.1| hypothetical protein POPTR_0003s04720g [Populus trichocarpa]
            gi|550342418|gb|ERP63247.1| hypothetical protein
            POPTR_0003s04720g [Populus trichocarpa]
          Length = 913

 Score =  582 bits (1499), Expect = e-163
 Identities = 306/550 (55%), Positives = 406/550 (73%)
 Frame = -2

Query: 1652 DDLSRALMLAFSSTPLFEPFAIPLLLEKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHA 1473
            +D+S    LAFSS+PLFEP  IPLLLEKLSS+L SAK+DSLKYLS CT KYGA+R+AKHA
Sbjct: 11   EDISFPDCLAFSSSPLFEPSVIPLLLEKLSSSLSSAKVDSLKYLSYCTSKYGAERIAKHA 70

Query: 1472 EAIWSSLKDVIYASPQEPISSLTSEPLDGIRFQDNEITKEALILLQKVILQNCGLFISLI 1293
             AIWSSLKDVI+ S Q  + S T E L G+  Q+NEI  EAL LL+KV++QN  LF S+I
Sbjct: 71   GAIWSSLKDVIFTSGQSFVLSFTPESLGGLGCQENEIAAEALALLEKVVIQNNDLFSSMI 130

Query: 1292 IGDEGINMIIDSIEIYTTSNDITLQSKQKLHAVGSILHVSAKASVASCNRVFESFFLRLM 1113
            +GDE INM+++SI    + N+I LQS QKL++VG IL+VS KASVASC+R+F+ FF  LM
Sbjct: 131  VGDEEINMVLNSITGCQSYNEIPLQSTQKLYSVGRILYVSVKASVASCSRIFQYFFSCLM 190

Query: 1112 GALELSVKNSSGFCVPADNDVVSEKLNFGALYLCIELIDACRYMIAGFEELTANTVSADE 933
             ++ L V N SG C   D+ ++S++ N G+LYLC+EL+ ACR ++    +L +  VSA+E
Sbjct: 191  ESMGLPVVNGSGTCSFNDDCIISKRPNHGSLYLCVELLGACRDLVISSGDLASQCVSANE 250

Query: 932  TCCSMLRNFSSSLFKAFSSTLVTSISSGAHDADIYLRVKGLQTLATFSGGFSTISKSTFE 753
            T C +L+ FS+SL K FSSTL TS    AHDAD+YL VKGLQ LATF GG+  +SKST E
Sbjct: 251  TWCCLLQRFSTSLSKIFSSTLATSTDKPAHDADVYLGVKGLQILATFPGGYLLVSKSTCE 310

Query: 752  NILMTFTSVITTDFNSTLLWRQTLKALVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSF 573
            +ILMTF S+IT DFN TLLW+ ++KALV+IGLF+  SNESEK+ SYM IVV KIVS +S 
Sbjct: 311  SILMTFVSIITVDFNKTLLWKLSVKALVQIGLFIHGSNESEKSMSYMDIVVQKIVSMISS 370

Query: 572  NDDSMPFPLNLEAICDIGETGLNFMLRVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFE 393
            ++  +PF L LEAI DIG +GL +ML++V GL+E I ANL++  V GN+KSA++++ L E
Sbjct: 371  DNHDIPFQLQLEAISDIGTSGLQYMLKIVTGLQEVIRANLAE--VQGNVKSAKVIIHLLE 428

Query: 392  CLCNKVLPWFHKTGYSDKVPLRFAIIIWNQVENSSTFSDKVLGKELLNATMMAMKIAVGN 213
            C  N++LPW  K    ++V L+F + IWNQ+EN   F D +  KELL+ATM  MK+AV +
Sbjct: 429  CYSNELLPWIQKYEVFEEVLLQFVVSIWNQIENCMAFPDGIFEKELLDATMKVMKLAVAS 488

Query: 212  CSEENQSTILEKAYSVLSSRTSFPLKESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLF 33
            CS E+Q+ I++KAY+VLSS T    K+S+S ++  QL+ L+ T+  + FSS D+W+ SLF
Sbjct: 489  CSVESQNIIIDKAYTVLSSSTFLSTKDSLS-SLQAQLEELEDTQETNKFSSRDEWIHSLF 547

Query: 32   ASVTVALRPR 3
             SV +AL P+
Sbjct: 548  ISVIIALHPQ 557


>ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Cucumis
            sativus]
          Length = 1147

 Score =  578 bits (1489), Expect = e-162
 Identities = 313/589 (53%), Positives = 416/589 (70%), Gaps = 5/589 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            G LAS + DLFE L CYFPIHFTH K ED++V+R+DLS ALM AFSSTPLFEPFAIPLLL
Sbjct: 209  GALASSSSDLFEFLGCYFPIHFTHGKEEDIDVRRNDLSHALMRAFSSTPLFEPFAIPLLL 268

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+LP AK+DSLKYLS+CT+KYGADRM KH+EAIWSS+K++I+ S  +P  S+ +E 
Sbjct: 269  EKLSSSLPLAKIDSLKYLSDCTVKYGADRMKKHSEAIWSSVKEIIFTSIGQPNLSINTES 328

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            L+   FQ+NE+T EAL LLQK+++ + GLF++LII DE +  I + + IYT   D  LQS
Sbjct: 329  LNSPSFQENEMTTEALRLLQKMVVASNGLFLTLIINDEDVKDIFNILNIYTCYKDFPLQS 388

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVS--E 1041
            +Q+L+AVG IL+ SA ASVASC+ VFES+F RL+  + +SV           ND +S   
Sbjct: 389  RQRLNAVGHILYTSASASVASCDHVFESYFHRLLDFMGISVDQ-------YHNDKISPIR 441

Query: 1040 KLNFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTS 861
             LNFGALYLCIE+I ACR +I   +E   NT S  E   SML+ FS S+ +  SST    
Sbjct: 442  NLNFGALYLCIEVIAACRNLIVSSDE---NTCSVKEKSYSMLQIFSCSVVQLLSSTFSGI 498

Query: 860  ISSGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTL 681
            +    HDA+ Y  VKGL  L+TF  G S +S+  FE+IL+ F S IT +F    LW   L
Sbjct: 499  VKRDLHDAEFYCAVKGLLNLSTFPVGSSPVSRVIFEDILLEFMSFITVNFKFGSLWNHAL 558

Query: 680  KALVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNF 501
            KAL  IG FVD+   S ++ SYM IVV+KI    S +D+ +P  L LE   DIG TG ++
Sbjct: 559  KALQHIGSFVDKYPGSVESQSYMHIVVEKIALMFSPHDEVLPLMLKLEMAVDIGRTGRSY 618

Query: 500  MLRVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFA 321
            ML++V G+EE I  NLS++YV+GN KS EIV+ L +C   K+LPWF + G  ++V LRFA
Sbjct: 619  MLKIVGGIEETIFYNLSEVYVYGNSKSVEIVLSLLDCYSTKILPWFDEAGDFEEVILRFA 678

Query: 320  IIIWNQVENSSTFS---DKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRT 150
            + IW+Q+E  STFS   DK + + LL+ATMMA+K++V +CS+E+Q+ I++KA++VL + +
Sbjct: 679  LNIWDQIEKCSTFSTSMDKCI-QVLLDATMMALKLSVRSCSKESQNIIVQKAFNVLLTSS 737

Query: 149  SFPLKESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
              PLK ++S TIPVQ++GLQ  +  D  +S D+W+LSLFASVT+ALRP+
Sbjct: 738  FSPLKVTLSNTIPVQMEGLQFLQQKDNPTSRDEWILSLFASVTIALRPQ 786


>ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum
            lycopersicum]
          Length = 1153

 Score =  558 bits (1439), Expect = e-156
 Identities = 295/584 (50%), Positives = 403/584 (69%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GPL ++A DLFEIL CYFPIHFTHPKS+DV++KR++LSRALMLAF+STPLFEP  IPLLL
Sbjct: 224  GPLENFAGDLFEILECYFPIHFTHPKSDDVDIKREELSRALMLAFASTPLFEPSVIPLLL 283

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            +KLSS+LPSAK++SLKYLS CTLKYG DRM K+ +++WS+LKD ++ SPQ  +S   S+P
Sbjct: 284  DKLSSSLPSAKVESLKYLSFCTLKYGGDRMEKYTKSLWSALKDALFTSPQSTLSE-DSDP 342

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            +DG+ F ++EI  +AL  LQ ++ Q+   F+SLI+GD  I+  ++S   +   N ++ Q 
Sbjct: 343  IDGLGFHESEIMTQALEFLQVLVRQHNASFLSLIMGDGDISTFLNSFSQFDNFNSLSTQY 402

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LHAVG +L V  KAS +SCN+VFESFF RL+ AL LSV NS G      +  V    
Sbjct: 403  KQRLHAVGHVLSVCIKASASSCNKVFESFFPRLVDALRLSVDNSHGIV----HSAVDANF 458

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            NFGALYLC+EL+ ACR ++   +E+ +    A ++ C +L +FS+SL   F   +  S  
Sbjct: 459  NFGALYLCVELLAACRQLVVSSDEVASAHDLARDSWCQILHSFSTSLCNVFFCLIRASCV 518

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
                +A +Y  VKGL+ LATF G F ++SK  +ENIL+T TS+I ++FN   LW+  LKA
Sbjct: 519  ESTRNAYVYAAVKGLEILATFPGSFISVSKLMYENILLTLTSIIESEFNKKFLWKAALKA 578

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+I LFV++ +E EK  S+ SIV  KIVS +S +D +MP  L LEA+ DIG TG NFML
Sbjct: 579  LVEISLFVNKYHEDEKAASFNSIVKQKIVSLISSDDLNMPQSLKLEAVFDIGLTGKNFML 638

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
             VV  LE+ ISANLS+I VHG+ + A +   L EC  NKVLPWFH  G +D+V L FA+ 
Sbjct: 639  SVVSELEKTISANLSEILVHGDRRLAGLTAGLLECYSNKVLPWFHVNGGADEVSLSFAVN 698

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLK 135
            I+ ++E++++ S +  GKELL ATM AMK A+  CS E+Q  +L+KA  V+ +  SF   
Sbjct: 699  IFTKMEHNTSLSLEAEGKELLGATMAAMKQAMTCCSVESQEKVLQKAIDVMET-NSFFFS 757

Query: 134  ESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
             ++ L   +     QL +  +G S  D+W++SLFASV +ALRP+
Sbjct: 758  NNLILGTDLFNKKTQLGQTSEGLSCQDEWIISLFASVVIALRPQ 801


>ref|XP_003546956.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X1 [Glycine max]
          Length = 1135

 Score =  551 bits (1419), Expect = e-154
 Identities = 299/582 (51%), Positives = 388/582 (66%)
 Frame = -2

Query: 1748 LASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLLEK 1569
            LASYA+D+F+IL  YFPIHFTHP S D  V+RDDLS +LM AFSSTPLFEPF IPLLLEK
Sbjct: 211  LASYAKDVFDILEPYFPIHFTHPSSGDTHVQRDDLSTSLMSAFSSTPLFEPFVIPLLLEK 270

Query: 1568 LSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEPLD 1389
            LSS+L SAK+DSLKYL  C+ KYGA+R+AK+A AIWSSLKD +     EP  S T  P+D
Sbjct: 271  LSSSLHSAKIDSLKYLRVCSSKYGAERIAKYAGAIWSSLKDTLSTYLGEPDFSFTIAPVD 330

Query: 1388 GIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQSKQ 1209
            GI F +NE   EAL LLQ++I QN  L +SLII DE +N I  +I  Y T + I +Q K+
Sbjct: 331  GIGFPENEFVIEALSLLQQLIAQNSSLLVSLIIDDEDVNTIFSTITSYETYDAIPVQEKK 390

Query: 1208 KLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKLNF 1029
            KLHA+G IL++++K +++SCN +FES F R+M  L  SV+  +G   P      S++L F
Sbjct: 391  KLHAIGRILYITSKTTISSCNAMFESLFTRMMDNLGFSVRFPNGDISP------SQRLKF 444

Query: 1028 GALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSISSG 849
            G LYLCIEL+  CR +I G EE     V   ETCC+ML +FS+ LF AF S L  S   G
Sbjct: 445  GFLYLCIELLAGCRELIVGSEEPALQYVFEHETCCTMLHSFSTPLFNAFGSVLAVSADRG 504

Query: 848  AHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKALV 669
              D D Y+ VKGLQ LA F      I KS FENIL  F S+I  DFN T+LW   LKAL 
Sbjct: 505  PLDPDTYVGVKGLQILAMFHSDVFPIQKSIFENILKKFMSIIIEDFNKTILWEAALKALH 564

Query: 668  KIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFMLRV 489
             +G F  +  ESEK  SY ++VV+KIV  +S +D ++ F L +EA+ +IG+TG+  ML +
Sbjct: 565  HVGSFFQKFCESEKAMSYRNLVVEKIVEILSLDDITLSFSLKVEALLNIGKTGMKNMLTI 624

Query: 488  VQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAIIIW 309
            +QGL  A+ ANLS +YVH NL+S+EI VQL EC   ++LPW H+ G S+   ++FA+ IW
Sbjct: 625  LQGLGRAVFANLSKVYVHRNLRSSEIAVQLLECYSCQLLPWIHENGGSEDFVMQFAVDIW 684

Query: 308  NQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLKES 129
            +Q  N    S    GK LL+A M AM+++VG+CS E+Q+ I+ KAYSVLSS T+F LKE 
Sbjct: 685  SQAGNCMDLSTPFEGKGLLDAMMKAMRLSVGSCSVESQNLIIRKAYSVLSSHTNFQLKE- 743

Query: 128  MSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
                    ++ L LT      S  D+ ++SLFASV +A+ P+
Sbjct: 744  --------VERLPLTPGKYDISLRDEGIISLFASVVIAVCPK 777


>ref|XP_006597167.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform
            X2 [Glycine max]
          Length = 1133

 Score =  542 bits (1396), Expect = e-151
 Identities = 298/582 (51%), Positives = 386/582 (66%)
 Frame = -2

Query: 1748 LASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLLEK 1569
            LASYA+D+F+IL  YFPIHFTHP S D  V+RDDLS +LM AFSSTPLFEPF IPLLLEK
Sbjct: 211  LASYAKDVFDILEPYFPIHFTHPSSGDTHVQRDDLSTSLMSAFSSTPLFEPFVIPLLLEK 270

Query: 1568 LSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEPLD 1389
            LSS+L SAK+DSLKYL  C+ KYGA+R+AK+A AIWSSLKD +     EP  S T  P+D
Sbjct: 271  LSSSLHSAKIDSLKYLRVCSSKYGAERIAKYAGAIWSSLKDTLSTYLGEPDFSFTIAPVD 330

Query: 1388 GIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQSKQ 1209
            GI F +NE   EAL LLQ++I QN  L +SLII DE +N I  +I  Y T + I +Q K+
Sbjct: 331  GIGFPENEFVIEALSLLQQLIAQNSSLLVSLIIDDEDVNTIFSTITSYETYDAIPVQEKK 390

Query: 1208 KLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKLNF 1029
            KLHA+G IL++++K +++SCN +FES F R+M  L  SV+  +G   P      S++L F
Sbjct: 391  KLHAIGRILYITSKTTISSCNAMFESLFTRMMDNLGFSVRFPNGDISP------SQRLKF 444

Query: 1028 GALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSISSG 849
            G LYLCIEL+  CR +I G EE     V   ETCC+ML +FS+ LF AF S L  S   G
Sbjct: 445  GFLYLCIELLAGCRELIVGSEEPALQYVFEHETCCTMLHSFSTPLFNAFGSVLAVSADRG 504

Query: 848  AHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKALV 669
              D D Y+ VKGLQ LA F      I KS FENIL  F S+I  DFN T+LW   LKAL 
Sbjct: 505  PLDPDTYVGVKGLQILAMFHSDVFPIQKSIFENILKKFMSIIIEDFNKTILWEAALKALH 564

Query: 668  KIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFMLRV 489
             +G F  +  ESEK  SY ++VV+KIV  +S +D ++ F L +EA+ +IG+TG+  ML +
Sbjct: 565  HVGSFFQKFCESEKAMSYRNLVVEKIVEILSLDDITLSFSLKVEALLNIGKTGMKNMLTI 624

Query: 488  VQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAIIIW 309
            +QGL  A+ ANLS   VH NL+S+EI VQL EC   ++LPW H+ G S+   ++FA+ IW
Sbjct: 625  LQGLGRAVFANLSK--VHRNLRSSEIAVQLLECYSCQLLPWIHENGGSEDFVMQFAVDIW 682

Query: 308  NQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSFPLKES 129
            +Q  N    S    GK LL+A M AM+++VG+CS E+Q+ I+ KAYSVLSS T+F LKE 
Sbjct: 683  SQAGNCMDLSTPFEGKGLLDAMMKAMRLSVGSCSVESQNLIIRKAYSVLSSHTNFQLKE- 741

Query: 128  MSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
                    ++ L LT      S  D+ ++SLFASV +A+ P+
Sbjct: 742  --------VERLPLTPGKYDISLRDEGIISLFASVVIAVCPK 775


>ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum
            tuberosum]
          Length = 1170

 Score =  535 bits (1377), Expect = e-149
 Identities = 295/614 (48%), Positives = 400/614 (65%), Gaps = 30/614 (4%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            GPL ++A DLFEIL CYFPIHFTHPKS+DV++KR +LSRALMLAF+STPL+EP  IPLLL
Sbjct: 211  GPLENFAGDLFEILECYFPIHFTHPKSDDVDMKRGELSRALMLAFASTPLYEPSVIPLLL 270

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            +KLSS+LPSAK++SLKYLS CTLKYG DRM K+ +++WS+LKD ++  PQ  +S   S+P
Sbjct: 271  DKLSSSLPSAKVESLKYLSYCTLKYGGDRMEKYTKSLWSALKDALFTCPQSTLSE-DSDP 329

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            +DG+ F ++EI  +AL LLQ ++ Q+   F+SLI+GD  I+  ++S   +   N ++ Q 
Sbjct: 330  IDGLGFHESEIMTQALELLQVLVRQHNDSFLSLILGDGDISTFLNSFSQFDDFNSLSTQY 389

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            KQ+LHAVG +L V  KAS +SCN+VFESFF RL+ AL LSV+NS G      +  +    
Sbjct: 390  KQRLHAVGHVLSVCIKASGSSCNKVFESFFPRLVDALRLSVENSHGIV----HSALDANF 445

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
            NFGALYLC+EL+ ACR ++   +E+ +    A ++ C +LR+F +SL   F   +  S  
Sbjct: 446  NFGALYLCVELLAACRQLVVSSDEVASAHDLARDSWCQILRSFCTSLCNVFFCLIRASCV 505

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
                +A +Y  VKGL+ L TF G F ++SK  +ENIL+T TS+I +DFN   LW+  LKA
Sbjct: 506  ESTWNAYVYAAVKGLEILGTFPGSFISVSKLMYENILLTLTSIIESDFNKKFLWKAALKA 565

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            LV+I LFV++ +E EK   + SIV  KIVS +S +D +MP  L LEAI DIG TG +FM 
Sbjct: 566  LVEISLFVNKYHEDEKAAIFNSIVKQKIVSLISSDDLNMPQSLKLEAIFDIGLTGKSFMH 625

Query: 494  RVVQGLEEAISANLSDI------------------------------YVHGNLKSAEIVV 405
             VV  LE+ ISANLS+I                               VHG+ + A +  
Sbjct: 626  SVVSELEKTISANLSEILVRVLIETSRLLLTYHMHRLFNFGALFLLLQVHGDRRLAGLTP 685

Query: 404  QLFECLCNKVLPWFHKTGYSDKVPLRFAIIIWNQVENSSTFSDKVLGKELLNATMMAMKI 225
             L EC  NKVLPWFH  G +D+V L FAI I+ ++EN+S+ S +  GKELL ATM AMK 
Sbjct: 686  GLLECYSNKVLPWFHGNGGADEVSLSFAINIFTKMENNSSLSLEAKGKELLGATMAAMKQ 745

Query: 224  AVGNCSEENQSTILEKAYSVLSSRTSFPLKESMSLTIPVQLDGLQLTRILDGFSSVDDWL 45
            A+  CS E+Q  +L+KA  V+ + +SF L   + L   +     QL +  +G S  D+W+
Sbjct: 746  AMTGCSVESQEKVLQKAIDVMET-SSFFLSNDLILGTDLFNKKTQLGQTSEGLSCRDEWI 804

Query: 44   LSLFASVTVALRPR 3
             SLFASV +ALRP+
Sbjct: 805  TSLFASVVIALRPQ 818


>ref|XP_006595125.1| PREDICTED: DNA repair/transcription protein mms19-like isoform X2
            [Glycine max]
          Length = 1013

 Score =  531 bits (1367), Expect = e-148
 Identities = 294/585 (50%), Positives = 386/585 (65%), Gaps = 1/585 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            G LAS+A+D+F+IL  YFPIHFT P S D  V+RD LS +LM AFSSTPLFEPF IPLLL
Sbjct: 90   GLLASFAKDVFDILEPYFPIHFTRPSSGDTHVQRD-LSTSLMSAFSSTPLFEPFVIPLLL 148

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+L SAK+DSLKYL  C+ KYGA R+AK+A AIWSSLKD +     EP  S T  P
Sbjct: 149  EKLSSSLHSAKIDSLKYLRVCSSKYGAGRIAKYAGAIWSSLKDTLSTYLGEPDFSFTIAP 208

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            +DGI F +NE   EAL LLQ++I+QN  L +SLII DE +N I  +I  Y T + I +Q 
Sbjct: 209  VDGIGFPENEFVLEALSLLQQLIVQNSSLLVSLIIDDEDVNSIFSTIASYETYDAIPVQE 268

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            K+KLHA+G IL+++AK +++SCN VFES F RLM  L  SV+  +    P      S+++
Sbjct: 269  KKKLHAIGRILNITAKTTISSCNAVFESLFSRLMDNLGFSVRFPNSDIPP------SQRV 322

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
             FG LY+CIEL+  CR +I G +E     V   ETCC+ML  FS+ LF AF S L  S  
Sbjct: 323  KFGFLYVCIELLAGCRELIVGSDEPALQYVFEHETCCTMLHRFSTPLFNAFGSVLAVSAD 382

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
                D D Y+ VKGLQ LA F      I KS FENIL  F S+I  DFN T+LW   LKA
Sbjct: 383  RCPLDPDTYIGVKGLQILAMFGSDVFPIQKSVFENILKKFMSIIVEDFNKTILWEAALKA 442

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            L ++G FV + +ESEK  SY ++VV+KIV  +S +D ++PF L LEA+ +IG TG+  ML
Sbjct: 443  LYQVGSFVQKFHESEKAMSYRNLVVEKIVEILSLDDITLPFSLELEALSNIGMTGMKNML 502

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
             ++QGL  A+ +NLS ++VH NL+S++I VQL EC   ++LPW H+ G S+   ++F + 
Sbjct: 503  TILQGLGRAVFSNLSKVHVHRNLRSSDIAVQLLECYSCQLLPWIHENGGSEDFVMQFVVD 562

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSF-PL 138
            IW+Q  N   FS     K LL+A M AMK++VG+C+ E+Q+ I++KAY VLSS T+F  L
Sbjct: 563  IWSQAGNCMDFSTLFEEKGLLDAIMKAMKLSVGSCAVESQNLIIQKAYCVLSSHTNFQQL 622

Query: 137  KESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
            KE         ++ L LT      S  D+ L+SLFASV +A+ P+
Sbjct: 623  KE---------VERLPLTPGNYNISLRDEGLISLFASVVIAVFPK 658


>ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein mms19-like isoform X1
            [Glycine max]
          Length = 1132

 Score =  531 bits (1367), Expect = e-148
 Identities = 294/585 (50%), Positives = 386/585 (65%), Gaps = 1/585 (0%)
 Frame = -2

Query: 1754 GPLASYAEDLFEILACYFPIHFTHPKSEDVEVKRDDLSRALMLAFSSTPLFEPFAIPLLL 1575
            G LAS+A+D+F+IL  YFPIHFT P S D  V+RD LS +LM AFSSTPLFEPF IPLLL
Sbjct: 209  GLLASFAKDVFDILEPYFPIHFTRPSSGDTHVQRD-LSTSLMSAFSSTPLFEPFVIPLLL 267

Query: 1574 EKLSSALPSAKLDSLKYLSNCTLKYGADRMAKHAEAIWSSLKDVIYASPQEPISSLTSEP 1395
            EKLSS+L SAK+DSLKYL  C+ KYGA R+AK+A AIWSSLKD +     EP  S T  P
Sbjct: 268  EKLSSSLHSAKIDSLKYLRVCSSKYGAGRIAKYAGAIWSSLKDTLSTYLGEPDFSFTIAP 327

Query: 1394 LDGIRFQDNEITKEALILLQKVILQNCGLFISLIIGDEGINMIIDSIEIYTTSNDITLQS 1215
            +DGI F +NE   EAL LLQ++I+QN  L +SLII DE +N I  +I  Y T + I +Q 
Sbjct: 328  VDGIGFPENEFVLEALSLLQQLIVQNSSLLVSLIIDDEDVNSIFSTIASYETYDAIPVQE 387

Query: 1214 KQKLHAVGSILHVSAKASVASCNRVFESFFLRLMGALELSVKNSSGFCVPADNDVVSEKL 1035
            K+KLHA+G IL+++AK +++SCN VFES F RLM  L  SV+  +    P      S+++
Sbjct: 388  KKKLHAIGRILNITAKTTISSCNAVFESLFSRLMDNLGFSVRFPNSDIPP------SQRV 441

Query: 1034 NFGALYLCIELIDACRYMIAGFEELTANTVSADETCCSMLRNFSSSLFKAFSSTLVTSIS 855
             FG LY+CIEL+  CR +I G +E     V   ETCC+ML  FS+ LF AF S L  S  
Sbjct: 442  KFGFLYVCIELLAGCRELIVGSDEPALQYVFEHETCCTMLHRFSTPLFNAFGSVLAVSAD 501

Query: 854  SGAHDADIYLRVKGLQTLATFSGGFSTISKSTFENILMTFTSVITTDFNSTLLWRQTLKA 675
                D D Y+ VKGLQ LA F      I KS FENIL  F S+I  DFN T+LW   LKA
Sbjct: 502  RCPLDPDTYIGVKGLQILAMFGSDVFPIQKSVFENILKKFMSIIVEDFNKTILWEAALKA 561

Query: 674  LVKIGLFVDQSNESEKTTSYMSIVVDKIVSKMSFNDDSMPFPLNLEAICDIGETGLNFML 495
            L ++G FV + +ESEK  SY ++VV+KIV  +S +D ++PF L LEA+ +IG TG+  ML
Sbjct: 562  LYQVGSFVQKFHESEKAMSYRNLVVEKIVEILSLDDITLPFSLELEALSNIGMTGMKNML 621

Query: 494  RVVQGLEEAISANLSDIYVHGNLKSAEIVVQLFECLCNKVLPWFHKTGYSDKVPLRFAII 315
             ++QGL  A+ +NLS ++VH NL+S++I VQL EC   ++LPW H+ G S+   ++F + 
Sbjct: 622  TILQGLGRAVFSNLSKVHVHRNLRSSDIAVQLLECYSCQLLPWIHENGGSEDFVMQFVVD 681

Query: 314  IWNQVENSSTFSDKVLGKELLNATMMAMKIAVGNCSEENQSTILEKAYSVLSSRTSF-PL 138
            IW+Q  N   FS     K LL+A M AMK++VG+C+ E+Q+ I++KAY VLSS T+F  L
Sbjct: 682  IWSQAGNCMDFSTLFEEKGLLDAIMKAMKLSVGSCAVESQNLIIQKAYCVLSSHTNFQQL 741

Query: 137  KESMSLTIPVQLDGLQLTRILDGFSSVDDWLLSLFASVTVALRPR 3
            KE         ++ L LT      S  D+ L+SLFASV +A+ P+
Sbjct: 742  KE---------VERLPLTPGNYNISLRDEGLISLFASVVIAVFPK 777

