BLASTX nr result
ID: Akebia24_contig00005148
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00005148 (3338 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI36057.3| unnamed protein product [Vitis vinifera] 1235 0.0 ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair ... 1107 0.0 ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair ... 1103 0.0 ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citr... 1101 0.0 ref|XP_002515963.1| DNA repair/transcription protein met18/mms19... 1083 0.0 ref|XP_007024313.1| MMS19 nucleotide excision repair protein, pu... 1072 0.0 ref|XP_007024310.1| MMS19 nucleotide excision repair protein, pu... 1072 0.0 ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prun... 1070 0.0 ref|XP_007024314.1| MMS19 nucleotide excision repair protein, pu... 1067 0.0 gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis] 1051 0.0 ref|XP_007024312.1| MMS19 nucleotide excision repair protein, pu... 1032 0.0 ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304... 1008 0.0 ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair ... 996 0.0 ref|XP_003546956.1| PREDICTED: MMS19 nucleotide excision repair ... 987 0.0 ref|XP_006597167.1| PREDICTED: MMS19 nucleotide excision repair ... 981 0.0 ref|XP_006853692.1| hypothetical protein AMTR_s00056p00136660 [A... 978 0.0 ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phas... 969 0.0 ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair ... 966 0.0 ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein ... 963 0.0 ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair ... 954 0.0 >emb|CBI36057.3| unnamed protein product [Vitis vinifera] Length = 1146 Score = 1235 bits (3195), Expect = 0.0 Identities = 645/1101 (58%), Positives = 800/1101 (72%), Gaps = 5/1101 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S ++E++VD SR S QQ++++DAIA L+KND+LT+E LV EMG+YLTTTDN+I Sbjct: 1 MAQLSQLTQYIESYVDSSRSSTQQAASVDAIAYLLKNDILTLETLVTEMGMYLTTTDNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLDN T+HSLI FFTDRL DW+AL GALIGCLAL++RKSN+G Sbjct: 61 RTRGILLLAELLTRLASKPLDNVTIHSLISFFTDRLADWRALRGALIGCLALMKRKSNMG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 VT +DAR + YL+N+QVQSLG HDRKLCFEIL CLLD Y E+V +LGDDL+YGIC A Sbjct: 121 RVTDNDARAVAQAYLENVQVQSLGQHDRKLCFEILECLLDHYPESVASLGDDLVYGICGA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP+CLMLTFHIVEIL LFPDP GPLA F+ DLFD+LG YFPIHFTHP+ + DV Sbjct: 181 IDGEKDPRCLMLTFHIVEILARLFPDPSGPLASFAGDLFDILGCYFPIHFTHPQGEDVDV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALMLAF+ST LFEPFA+ VDSLKYLS+C +KYG DRM K Sbjct: 241 KRDDLSRALMLAFSSTTLFEPFAIPLLLEKLSSSLPLAKVDSLKYLSNCLLKYGDDRMTK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 H +IW S+KDA F S QEP+ SL SE + + FQ+NEI EA++ LQK +L++ GL L Sbjct: 301 HVEAIWFSVKDAIF-CSEQEPMLSLASELLDHVGFQENEIVTEAIILLQKVILENSGLSL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI+ D++I +VTS + YNDIP++ K KL A+G IL VS+ S CCNRVF++FFF Sbjct: 360 SLIVGDKDINTIVNTVTSFRSYNDIPLQSKHKLCAIGRILYVSAKASITCCNRVFESFFF 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMD LGLS +S C+ + +V SE+LN GALYLCIELLAACR L +GSEEL+ + +S Sbjct: 420 RLMDTLGLSVRNSSGDCLPNFDYVFSERLNFGALYLCIELLAACRDLVVGSEELTSKSVS 479 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 +++WCC+L FS L A +L ST++ A EADIY GVKGLQILATFP FLP SKS Sbjct: 480 AQESWCCMLHSFSSLLMKAFSSVLDASTDKDAYEADIYSGVKGLQILATFPGEFLPISKS 539 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFEN+L F S + + + LLWKL+LKAL+QIG+FI + H+SEK + Y IVVEK+ SL Sbjct: 540 IFENVLLTFISIIVEDFNKTLLWKLALKALVQIGSFIDRFHESEKALSYNYIVVEKIVSL 599 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 + L+D +P L+LEAI +IGT GL+ ML+++QG+E+AI AN E +V GNLKS +I V Sbjct: 600 MFLDDFGLPFQLRLEAISDIGTTGLNVMLKIVQGLEDAIFANLSEVYVHGNLKSAKIAVQ 659 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+K+LP H + FEDV RF+VNIWNQ+EN+M F+ G + LL+ MT M+LA Sbjct: 660 LLECYSNKLLPGIHGAGDFEDVLSRFAVNIWNQIENSMAFSVGAQENELLNATMTAMKLA 719 Query: 2210 FAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 CSE QG I++KAY VLSS F L ES+ + ++L+ LQ TQDLE FS RD+ +I Sbjct: 720 VGSCSEGSQGKIIKKAYSVLSSCPSFTLMESMPITGTVQLEGLQHTQDLECFSCRDKWVI 779 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFAS IIA+RPQT + N+R++L LFMT LLKGHV AAQALGSM+NKL N E SS Sbjct: 780 SLFASAIIAVRPQTHIPNIRVVLHLFMTNLLKGHVPAAQALGSMVNKLCPKSNGVEISST 839 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNE-GHVNLYRNVA----LQSNAMVGLA 2734 CTLE+A+DIIF LW PL +CS I NE G NL + + LQ A+ GLA Sbjct: 840 CTLEDALDIIFNTSLWDSHNHGPLKRCSGIGVDNEMGLANLCLSASNCQLLQVCAIEGLA 899 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG+++RGHEKVKDI MI LRCLLS++N E D+ P V +SAADA Sbjct: 900 WIGKGLLLRGHEKVKDITMIFLRCLLSKNN--------------QEQDVLPSVAKSAADA 945 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 FHVL+SDS++CLNKRFHA IRPLYKQ FFSS++P+L+SS+ E S TRSMLYRA+ H+I Sbjct: 946 FHVLMSDSEICLNKRFHANIRPLYKQRFFSSVLPILVSSMAESRLSNTRSMLYRALAHII 1005 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVTENA 3274 S TPL AV++EAKK Y+LLLVLSGILMD+NG+E V ENA Sbjct: 1006 SDTPLIAVLSEAKKIIPILLDSLSILSTYNLDKDILYNLLLVLSGILMDKNGQETVVENA 1065 Query: 3275 HIIINHIIGLVSYPHMMLVRE 3337 H+IIN +IGLV YPHMM+VRE Sbjct: 1066 HVIINCLIGLVGYPHMMVVRE 1086 >ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Citrus sinensis] Length = 1151 Score = 1107 bits (2864), Expect = 0.0 Identities = 592/1101 (53%), Positives = 765/1101 (69%), Gaps = 5/1101 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+AS I H+E+FV+ S Q+++LD IA+L+K ++LTIE LV+EMG+YLTTTD+VI Sbjct: 1 MAEASQLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HS++ FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++T +DA+ + +Y+QN+QVQSL HDRKLCFE+L CLL RY +AV++LG+DL+Y ICEA Sbjct: 121 VITTNDAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP CLMLTFHIVE+ LF D L LA F+ DLF++LG YFPIHFTH K + FDV Sbjct: 181 IDGEKDPHCLMLTFHIVEVAAELFSDDL--LANFASDLFEILGCYFPIHFTHSKAEDFDV 238 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALM AF+ST LFEPFA+ VDSLKYLSHC+VKYG DR+ K Sbjct: 239 KRDDLSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEK 298 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 HA ++WSS+KDA + S EP S SES +G+ F++N I E+L L Q+ GLFL Sbjct: 299 HAKAMWSSIKDAVYSS--HEPTLSFASESLDGVGFRENVILTESLNLLDTVFKQNSGLFL 356 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S I+ DE+I + F+S++S K Y +I ++ KQKLHAVG ILSVS+ S A CN V ++FF Sbjct: 357 SWIIGDEDINLIFKSISSYKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFP 416 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 LM LGLS G+S C +DG VL KLN GALYLCIEL+ ACR L SEE Sbjct: 417 CLMHALGLSVGNSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAP 476 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + W CLL+ +S LA ALR L TS N+ + E ++Y GVKGL IL TF L S S Sbjct: 477 ANERWYCLLQSYSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFRGGSLIISNS 536 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFENIL FTS + +E LLWKL+LKAL+ IG+FI + ++SEK + YM +V+EK+ SL Sbjct: 537 IFENILLTFTSIIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSL 596 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D SMP L+LEAI EIG G +++L+++QG+EEA+ AN +E V GN KS E++V Sbjct: 597 ASSHDFSMPFPLKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQ 656 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+KVLP H+ GFE+V LRF+VNIWN +E ++TF++ V K LLD M M+LA Sbjct: 657 LLECYSNKVLPRIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEKGLLDATMKAMKLA 716 Query: 2210 FAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 CS Q ++ QKA+ VLS ++FPL+++ S ++PI L++ QLTQ+ S R+ I Sbjct: 717 VGSCSVESQNIVFQKAFTVLSLGTYFPLEDAAS-NIPILLNEFQLTQETSISSSREAWIC 775 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA RPQT + NVR++++LFMT LLKG+V AAQALGSM+NKL L N TE Sbjct: 776 SLFASVIIAARPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGN 835 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDD-SNEGHVNLYRNV----ALQSNAMVGLA 2734 CTLEEA+DIIF LWS S L +++ S+ G ++ R +LQ +A+ GLA Sbjct: 836 CTLEEAMDIIFDSKLWSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLA 895 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG++MRGHEKVKDI M + CLLS S + + L+QD SE V++ AADA Sbjct: 896 WIGKGLLMRGHEKVKDITMTFIECLLSNSKLGSFSLEQDYSENSSES-----VVKYAADA 950 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 F +L+ DS+ CL+++ HATIRPLYKQ F+S++MP+L S + + +SS +RS+L RA H+I Sbjct: 951 FKILMGDSEDCLSRKLHATIRPLYKQRFYSTIMPILQSLIIKSNSSFSRSILCRACAHII 1010 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVTENA 3274 S TPL V+ +AK YSLLLVLSGIL D+NG+EAV E A Sbjct: 1011 SDTPLIVVLNDAKTVIPILMDGLSILSNDVSDKDIVYSLLLVLSGILTDKNGQEAVIECA 1070 Query: 3275 HIIINHIIGLVSYPHMMLVRE 3337 HIII+H I L+SYPHMMLVRE Sbjct: 1071 HIIIDHSIILISYPHMMLVRE 1091 >ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X1 [Citrus sinensis] Length = 1155 Score = 1103 bits (2853), Expect = 0.0 Identities = 593/1105 (53%), Positives = 765/1105 (69%), Gaps = 9/1105 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+AS I H+E+FV+ S Q+++LD IA+L+K ++LTIE LV+EMG+YLTTTD+VI Sbjct: 1 MAEASQLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HS++ FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++T +DA+ + +Y+QN+QVQSL HDRKLCFE+L CLL RY +AV++LG+DL+Y ICEA Sbjct: 121 VITTNDAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP CLMLTFHIVE+ LF D L LA F+ DLF++LG YFPIHFTH K + FDV Sbjct: 181 IDGEKDPHCLMLTFHIVEVAAELFSDDL--LANFASDLFEILGCYFPIHFTHSKAEDFDV 238 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALM AF+ST LFEPFA+ VDSLKYLSHC+VKYG DR+ K Sbjct: 239 KRDDLSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEK 298 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 HA ++WSS+KDA + S EP S SES +G+ F++N I E+L L Q+ GLFL Sbjct: 299 HAKAMWSSIKDAVYSS--HEPTLSFASESLDGVGFRENVILTESLNLLDTVFKQNSGLFL 356 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S I+ DE+I + F+S++S K Y +I ++ KQKLHAVG ILSVS+ S A CN V ++FF Sbjct: 357 SWIIGDEDINLIFKSISSYKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFP 416 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 LM LGLS G+S C +DG VL KLN GALYLCIEL+ ACR L SEE Sbjct: 417 CLMHALGLSVGNSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAP 476 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + W CLL+ +S LA ALR L TS N+ + E ++Y GVKGL IL TF L S S Sbjct: 477 ANERWYCLLQSYSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFRGGSLIISNS 536 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFENIL FTS + +E LLWKL+LKAL+ IG+FI + ++SEK + YM +V+EK+ SL Sbjct: 537 IFENILLTFTSIIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSL 596 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D SMP L+LEAI EIG G +++L+++QG+EEA+ AN +E V GN KS E++V Sbjct: 597 ASSHDFSMPFPLKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQ 656 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+KVLP H+ GFE+V LRF+VNIWN +E ++TF++ V K LLD M M+LA Sbjct: 657 LLECYSNKVLPRIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEKGLLDATMKAMKLA 716 Query: 2210 FAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 CS Q ++ QKA+ VLS ++FPL+++ S ++PI L++ QLTQ+ S R+ I Sbjct: 717 VGSCSVESQNIVFQKAFTVLSLGTYFPLEDAAS-NIPILLNEFQLTQETSISSSREAWIC 775 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA RPQT + NVR++++LFMT LLKG+V AAQALGSM+NKL L N TE Sbjct: 776 SLFASVIIAARPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGN 835 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDD-SNEGHVNLYRNV----ALQSNAMVGLA 2734 CTLEEA+DIIF LWS S L +++ S+ G ++ R +LQ +A+ GLA Sbjct: 836 CTLEEAMDIIFDSKLWSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLA 895 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG++MRGHEKVKDI M + CLLS S + + L+QD SE V++ AADA Sbjct: 896 WIGKGLLMRGHEKVKDITMTFIECLLSNSKLGSFSLEQDYSENSSES-----VVKYAADA 950 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 F +L+ DS+ CL+++ HATIRPLYKQ F+S++MP+L S + + +SS +RS+L RA H+I Sbjct: 951 FKILMGDSEDCLSRKLHATIRPLYKQRFYSTIMPILQSLIIKSNSSFSRSILCRACAHII 1010 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGK----EAV 3262 S TPL V+ +AK YSLLLVLSGIL D+NGK EAV Sbjct: 1011 SDTPLIVVLNDAKTVIPILMDGLSILSNDVSDKDIVYSLLLVLSGILTDKNGKTIGQEAV 1070 Query: 3263 TENAHIIINHIIGLVSYPHMMLVRE 3337 E AHIII+H I L+SYPHMMLVRE Sbjct: 1071 IECAHIIIDHSIILISYPHMMLVRE 1095 >ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citrus clementina] gi|557528866|gb|ESR40116.1| hypothetical protein CICLE_v10024743mg [Citrus clementina] Length = 1155 Score = 1101 bits (2847), Expect = 0.0 Identities = 591/1105 (53%), Positives = 765/1105 (69%), Gaps = 9/1105 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+AS I H+E+FV+ S Q+++LD IA+L+K ++LTIE LV+EMG+YLTTTD+VI Sbjct: 1 MAEASQLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HS++ FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++T +DA+ + +Y+QN+QVQSL HDRKLCFE+L CLL RY +AV++LG+DL+Y ICEA Sbjct: 121 VITTNDAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLMLTFHIVE+ LF D L LA F+ DLF++LG YFPIHFTH K + FDV Sbjct: 181 VDGEKDPHCLMLTFHIVEVAAELFSDDL--LANFAGDLFEILGCYFPIHFTHSKAEDFDV 238 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALM AF+ST LFEPFA+ VDSLKYLSHC+VKYG DR+ K Sbjct: 239 KRDDLSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEK 298 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 HA ++WSS+KDA + S EP S SES +G+ F+ N I E+L L Q+ GLFL Sbjct: 299 HAKAMWSSIKDAIYSS--HEPTLSFASESLDGVGFRDNVILTESLNLLDTVFKQNSGLFL 356 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S I+ DE+I + F+S++S K Y +I ++ KQKLHAVG ILSVS+ S A CN V ++FF Sbjct: 357 SWIIGDEDINLIFKSISSFKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFP 416 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 LM LGLS G+S C +DG VL KLN GALYLCIEL+ ACR L SEE Sbjct: 417 CLMHPLGLSVGNSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAP 476 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + W CLL+ +S LA ALR L TS N+ + E ++Y GVKGL IL TF L S S Sbjct: 477 ANERWYCLLQSYSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFSGGSLIISNS 536 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFENIL FTS + +E LLWKL+LKAL+ IG+FI + ++SEK + YM +V+EK+ SL Sbjct: 537 IFENILLTFTSIIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSL 596 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D SMP L+LEAI EIG G +++L+++QG+EEA+ AN +E V GN KS E++V Sbjct: 597 ASSHDFSMPFPLKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQ 656 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+KVLP H+ GFE+V LRF+VNIWN +E ++TF++ V K LLD M M+LA Sbjct: 657 LLECYSNKVLPRIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEKGLLDATMKAMKLA 716 Query: 2210 FAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 CS Q ++ QKA+ VLS ++FPL+++ S ++PI+L++ QLTQ+ S R+ I Sbjct: 717 VGSCSVESQNIVFQKAFTVLSLGTYFPLEDAAS-NIPIQLNEFQLTQETSISSSREAWIC 775 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA PQT + NVR++++LFMT LLKG+V AAQALGSM+NKL L N TE Sbjct: 776 SLFASVIIAACPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGN 835 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDD-SNEGHVNLYRNV----ALQSNAMVGLA 2734 CTLEEA+DIIF LWS S L +++ S+ G ++ R +LQ +A+ GLA Sbjct: 836 CTLEEAMDIIFDSKLWSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLA 895 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG++MRGHEKVKDI M + CLLS S + + L+QD SE V++ AADA Sbjct: 896 WIGKGLLMRGHEKVKDITMTFIECLLSNSKLGSFSLEQDYSENSSES-----VVKYAADA 950 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 F +L+ DS+ CL+++ HATIRPLYKQ F+S++MP+L S + + +SS +RS+L RA H+I Sbjct: 951 FKILMGDSEDCLSRKLHATIRPLYKQRFYSTIMPILQSLIIKSNSSFSRSILCRACAHII 1010 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN----GKEAV 3262 S TPL V+ +AK YSLLLVLSGIL D+N G+EAV Sbjct: 1011 SDTPLIVVLNDAKTVIPILMDGLSILSNDVSDKDIVYSLLLVLSGILTDKNVKTIGQEAV 1070 Query: 3263 TENAHIIINHIIGLVSYPHMMLVRE 3337 E AHIII+HII L+SYPHMMLVRE Sbjct: 1071 IECAHIIIDHIIKLISYPHMMLVRE 1095 >ref|XP_002515963.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] gi|223544868|gb|EEF46383.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] Length = 1174 Score = 1083 bits (2802), Expect = 0.0 Identities = 579/1119 (51%), Positives = 769/1119 (68%), Gaps = 23/1119 (2%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + ++E++VD SR +QQ+++LDAI L+KND +TI +LVKEM +YLTTTD++I Sbjct: 1 MAEPTQLTQYIESYVDASRSLSQQAASLDAIVLLLKNDAVTIGSLVKEMEMYLTTTDDII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SSKPLDN T+HSLI FFT+RL DW+AL GAL+GCLAL+RR+SN G Sbjct: 61 RARGILLLGEALSHLSSKPLDNTTIHSLIAFFTERLADWRALRGALVGCLALIRRRSN-G 119 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 I+T DA+ + +YLQN+QVQSL +DRKLCFE+L CLL+ AV +LG+DLIYGICEA Sbjct: 120 IITGIDAKVVAESYLQNLQVQSLAQYDRKLCFELLECLLENCPAAVASLGEDLIYGICEA 179 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDPQCLMLTFHIVE+LG LFPDP GP + F+ D+F +LG YFPIHFTHPK + DV Sbjct: 180 IDGEKDPQCLMLTFHIVEVLGKLFPDPSGPFSSFAGDIFSILGCYFPIHFTHPKAEDVDV 239 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALMLAF+STPLFEPFAM VDSLKYLS+C++K+ DR+ + Sbjct: 240 KRDDLSRALMLAFSSTPLFEPFAMPLLLEKLSSSLPTAKVDSLKYLSYCTLKFRADRIAE 299 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 HA +IWSSLKDA + SS +EP+ S ES + +KNEI EAL+ L+ ++Q+ FL Sbjct: 300 HAGAIWSSLKDAIY-SSGEEPMLSSDLESVDSPGSEKNEIATEALLLLENLIVQNNNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S+I+ DEE++M F ++TS K YN+I ++ KQKLH VG IL V + VS + CNR+F+++F Sbjct: 359 SMIISDEEVKMIFNTITSYKSYNEISLQSKQKLHMVGRILYVCAKVSVSSCNRIFESYFP 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLM+ LG+ ++ C ++ V +++ N G+ YL I+LL ACR L+ S+ L+ Q IS Sbjct: 419 RLMEALGILVENTSGACHSNENCVKAKQPNYGSFYLSIKLLGACRDLSTSSDNLASQCIS 478 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 +T+CCLL++FS L L TST+ A + D+Y GVKGLQILATFP +L SK Sbjct: 479 TNETYCCLLQRFSTSLTETFSAALATSTSGPAQDVDMYLGVKGLQILATFPGGYLFLSKL 538 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 F+NIL F S ++V + + LLW +LKAL+QIG+F+ C++S+K + Y+ IVV K+ L Sbjct: 539 TFDNILMTFLSIITVDFNKTLLWNQALKALVQIGSFVHGCNESDKEMSYVDIVVGKMILL 598 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAF------------- 1990 D SMP SL+L AI IG G +ML+V G+EEAI AN E + Sbjct: 599 ASSPDFSMPWSLKLTAISSIGMSGQKYMLKVFLGLEEAIRANLAEIYVCMIKKKIYVLYS 658 Query: 1991 --VKGNLKSVEILVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVE 2164 V+GNLKS +IL+ LLECYS ++LPW K+EGFE+V ++F VN+WNQ+EN F Sbjct: 659 CLVQGNLKSAKILLQLLECYSDELLPWIQKTEGFEEVLMQFVVNLWNQIENFNAFTVAFH 718 Query: 2165 GK-VLLDKMMTTMRLAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQ 2341 GK LLD +M M+ A A CS Q +I+ KAYGVLSS++F PLKES+S + ++L+ + Sbjct: 719 GKESLLDAIMKVMKDAVAFCSVESQNVIIYKAYGVLSSSTFLPLKESLSEN-SVQLECFR 777 Query: 2342 LTQDLESFSYRDECIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSM 2521 Q ++ S RDE I SLFASVIIALRPQT + N RI+L LF+T LLKGHV+ A+ALGS+ Sbjct: 778 AIQQMDRLSSRDEWIHSLFASVIIALRPQTHIPNTRIVLHLFITALLKGHVTTAEALGSL 837 Query: 2522 INKLPLMINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNLYR-- 2695 +NKL N S +CT+EEA+DIIF + L G+ + + +E ++L + Sbjct: 838 VNKLDQKSNDACISGDCTIEEAMDIIFSINLLCSFGNGSSGRFDRTRNGDE--MDLIKLC 895 Query: 2696 ----NVA-LQSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLG 2860 N+A ++ A+VGLAWIGKG++MRGHEKVKDI M+ L CLLS + SPL+ L Sbjct: 896 LDAPNLAWIKIPAIVGLAWIGKGLLMRGHEKVKDITMVFLNCLLSDGEIGASPLKHGSLE 955 Query: 2861 ECSEHDIHPIVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKE 3040 E D+ VM+SA+DAF +L+SDS++CLN+++HA +RPLYKQ FFSS+MP+L + + Sbjct: 956 NNGEQDMQQSVMKSASDAFQILMSDSELCLNRKYHAIVRPLYKQRFFSSIMPILYPLITK 1015 Query: 3041 CDSSTTRSMLYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLV 3220 DSS ++S+LYRA HVIS TPL+ + +AKK Y LLLV Sbjct: 1016 SDSSFSKSLLYRAFAHVISDTPLSVISNDAKKLVPVLLDGLTLLGKDVLDKDIMYGLLLV 1075 Query: 3221 LSGILMDENGKEAVTENAHIIINHIIGLVSYPHMMLVRE 3337 LSGIL D NGKEAV ENAHIII +I LV+YPHMML+RE Sbjct: 1076 LSGILTDTNGKEAVIENAHIIIKCLIELVAYPHMMLIRE 1114 >ref|XP_007024313.1| MMS19 nucleotide excision repair protein, putative isoform 4 [Theobroma cacao] gi|508779679|gb|EOY26935.1| MMS19 nucleotide excision repair protein, putative isoform 4 [Theobroma cacao] Length = 1136 Score = 1072 bits (2773), Expect = 0.0 Identities = 574/1105 (51%), Positives = 760/1105 (68%), Gaps = 9/1105 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+ VDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL LQK ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N GALYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LA A CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAV 3262 H+I TPL V+++AKK Y LLLVLSGILMD+NG+EAV Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKNGQEAV 1064 Query: 3263 TENAHIIINHIIGLVSYPHMMLVRE 3337 +++AH I N +I L+ YPHMMLVRE Sbjct: 1065 SDSAHTITNRLIELIQYPHMMLVRE 1089 >ref|XP_007024310.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|590619491|ref|XP_007024311.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|508779676|gb|EOY26932.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|508779677|gb|EOY26933.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] Length = 1149 Score = 1072 bits (2773), Expect = 0.0 Identities = 574/1105 (51%), Positives = 760/1105 (68%), Gaps = 9/1105 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+ VDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL LQK ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N GALYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LA A CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAV 3262 H+I TPL V+++AKK Y LLLVLSGILMD+NG+EAV Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKNGQEAV 1064 Query: 3263 TENAHIIINHIIGLVSYPHMMLVRE 3337 +++AH I N +I L+ YPHMMLVRE Sbjct: 1065 SDSAHTITNRLIELIQYPHMMLVRE 1089 >ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica] gi|462413691|gb|EMJ18740.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica] Length = 1158 Score = 1070 bits (2768), Expect = 0.0 Identities = 571/1110 (51%), Positives = 757/1110 (68%), Gaps = 14/1110 (1%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + I H+E +VD SR +Q+++L++I +LVK+D LTIE LVKEM +YLTTTDNVI Sbjct: 1 MAETTELIQHIELYVDTSRSPTEQAASLNSIISLVKSDFLTIEVLVKEMRMYLTTTDNVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLDNAT+HSLIGFFTDRL DW+AL GAL+GCLALLRRK N G Sbjct: 61 RARGILLLAEVLTGLASKPLDNATIHSLIGFFTDRLADWRALRGALVGCLALLRRKVNAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +V+ SD + + +Y++++QVQSLG HDRKLCFE+L CLL+R+ + +LG+ YGIC+A Sbjct: 121 MVSASDGKLVAQSYIESLQVQSLGQHDRKLCFELLECLLERHPNEIASLGETFFYGICQA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLMLTF IVE L ++PDP G LA F DLF++LG YFPIHFTH K++ +V Sbjct: 181 MDGEKDPHCLMLTFPIVETLVRIYPDPSGSLASFCGDLFELLGSYFPIHFTHLKDEDAEV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDLS+ALM AF+STPLFEPF + VDSLKYL+HC+ KYG DRM K Sbjct: 241 KRDDLSKALMSAFSSTPLFEPFVIPLLLEKLSSSLPLAKVDSLKYLNHCTAKYGADRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 HA +IW SLKDA +S ++P S SE G+ FQ+NEI EAL+ LQK LQ+ LFL Sbjct: 301 HAGAIWISLKDA-ISNSLEKPDMSFTSEPLYGLGFQENEIATEALMLLQKVTLQNEALFL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE I + F S+ S + YN+IP++ KQ LHAVG IL + S S A CN VF++FF Sbjct: 360 SLIIQDEGINIVFNSIASHEHYNNIPLQGKQWLHAVGRILYIISKTSMASCNSVFESFFP 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLM+ L +S +S C L++ S+K N GALYLC+EL+AACR L + S++L+P+ + Sbjct: 420 RLMNTLEISVTNSAGDCTLNENTFPSKKFNFGALYLCVELIAACRDLIMRSKDLAPKPDT 479 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 ++T +L+ F+ L A L T+ N+ A ADIY VKGLQILATFP FLP SK Sbjct: 480 PQETCRYMLQSFADSLVNAFSSSLATNANEVAHGADIYFKVKGLQILATFPGDFLPISKF 539 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 +F NIL + S + V + + LLWKL LKAL+ IG+F+ H+SEK + YM VV+K SL Sbjct: 540 LFANILTILMSIILVDFNKILLWKLVLKALVHIGSFVDVYHESEKALGYMGAVVDKTVSL 599 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 + +D MP SL+LEA EIG G + ML+++QG+EEAI A + +V GNLKS E + Sbjct: 600 VSRDDVKMPFSLKLEAASEIGASGRNHMLKIVQGMEEAIVAKLSD-YVHGNLKSAEKTIQ 658 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECY +K+L W +++ G E+V LRF +NIWN +E+ F+ V+ + LLD M M+LA Sbjct: 659 LLECYCNKILSWINETGGLEEVLLRFVINIWNCVESCKDFSIQVQEEELLDATMMAMKLA 718 Query: 2210 FAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLES--------- 2362 CSE Q +I+ KAY V+SS+ P KES+ + I+L++L +++ +++ Sbjct: 719 IGSCSEESQNIIIHKAYSVISSSISIPFKESLDATSSIQLEELSVSEQIDNSSHRDDQID 778 Query: 2363 -FSYRDECIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPL 2539 FS RDE I+S FASVIIA+RP+ ++NV+ IL LFMT +LKG V AAQALGS+INKL Sbjct: 779 KFSLRDEWILSHFASVIIAVRPKAQIVNVKGILHLFMTTVLKGCVPAAQALGSVINKLGT 838 Query: 2540 MINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNL----YRNVAL 2707 N T S +CTLEEA+D+IF+ LW++ + L C + + S G +L N L Sbjct: 839 KSNETANSIDCTLEEAVDMIFRTKLWNLNENGVLRTCGSGNGSKVGLTDLCLGFSSNKLL 898 Query: 2708 QSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHP 2887 + +A+VGLAWIGKG+++ GHEKVKD+ ILL CLLS +R L+Q +L E Sbjct: 899 RVHAVVGLAWIGKGLLLLGHEKVKDVTKILLECLLSEGRIRAMELKQGLLENSYEQH--- 955 Query: 2888 IVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSM 3067 VMRSAADAFH+L+SDS+VCLN++FHA RPLYKQ FFS++MP+L S + + DSS RSM Sbjct: 956 SVMRSAADAFHILMSDSEVCLNRKFHAIARPLYKQRFFSTVMPILQSCIIKSDSSVCRSM 1015 Query: 3068 LYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN 3247 L+RA H+IS PL +++EAKK YSLLLVLSGIL D+N Sbjct: 1016 LFRASAHLISNAPLIVILSEAKKLMPVLLDGLSLLSEDILDKDKLYSLLLVLSGILTDKN 1075 Query: 3248 GKEAVTENAHIIINHIIGLVSYPHMMLVRE 3337 G+ AV ENAHI++N + L+ YPHMM VRE Sbjct: 1076 GQVAVIENAHILVNCLTRLIDYPHMMFVRE 1105 >ref|XP_007024314.1| MMS19 nucleotide excision repair protein, putative isoform 5 [Theobroma cacao] gi|508779680|gb|EOY26936.1| MMS19 nucleotide excision repair protein, putative isoform 5 [Theobroma cacao] Length = 1157 Score = 1067 bits (2759), Expect = 0.0 Identities = 570/1103 (51%), Positives = 758/1103 (68%), Gaps = 9/1103 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+ VDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL LQK ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N GALYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LA A CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAV 3262 H+I TPL V+++AKK Y LLLVLSGILMD+NG+EAV Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKNGQEAV 1064 Query: 3263 TENAHIIINHIIGLVSYPHMMLV 3331 +++AH I N +I L+ YPHMM++ Sbjct: 1065 SDSAHTITNRLIELIQYPHMMVI 1087 >gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis] Length = 1210 Score = 1051 bits (2718), Expect = 0.0 Identities = 564/1136 (49%), Positives = 755/1136 (66%), Gaps = 40/1136 (3%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S H+E++VD +R N+Q+++LD+I +LVKN L+TIE LV+EM +YLTTTD+VI Sbjct: 1 MAEPSVLTRHIESYVDTTRSLNEQAASLDSIISLVKNGLVTIEKLVREMDMYLTTTDHVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI S KPLDN T+HSLI FF DRLVDW+ L GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLAELLTNLSLKPLDNVTIHSLIDFFADRLVDWRTLRGALVGCLALLRRKSDAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITL----------- 556 +V +DA+ + L+Y++N+QVQSLG HDRKLCFE+L CLL Y V +L Sbjct: 121 MVPATDAKAVALSYVKNLQVQSLGQHDRKLCFELLECLLVTYPNEVASLLCFELLECLLV 180 Query: 557 -----------------------------GDDLIYGICEAIDGEKDPQCLMLTFHIVEIL 649 G+D+IY +CE++DGEKDP CLML FHI+ L Sbjct: 181 TYPNEVASLLCFELLECLLVTYPNEVASLGEDIIYSVCESVDGEKDPHCLMLVFHIIPAL 240 Query: 650 GHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDVKRDDLSRALMLAFASTPLFE 829 LFP+P G LA F DLF+VLG YFPIHFTH K + DVKRDDLSRALM+AF+STPL E Sbjct: 241 VGLFPNPSGSLASFPRDLFEVLGCYFPIHFTHHKVEDVDVKRDDLSRALMIAFSSTPLLE 300 Query: 830 PFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGKHATSIWSSLKDAFFRSSPQE 1009 PF + +DSLKYLS+CS+KYG DRM +HA +WSS+K+A +S +E Sbjct: 301 PFVIPLLLEKLSSSLSSAKIDSLKYLSYCSIKYGADRMARHAGILWSSIKNAI-STSLKE 359 Query: 1010 PIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFLSLILEDEEIEMTFRSVTSAK 1189 P S SES +G+ FQ+NE+ +EALV L+ ++Q+ L LS+I++DE+I F ++TS Sbjct: 360 PTESFYSESIDGLGFQENEVVSEALVLLETVVMQNNNLLLSMIVDDEDISTVFNTMTSYG 419 Query: 1190 KYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFFRLMDILGLSPGSSDSGCILD 1369 +Y DIP++ KQ+LH VG IL +++ S A CNRV + FF L+DIL LS SS Sbjct: 420 RYKDIPLQGKQRLHVVGRILYITTKTSIASCNRVLETFFRPLVDILQLSIRSSSR----- 474 Query: 1370 DGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLISEEDTWCCLLKKFSGPLAGAL 1549 D F LN GALYLC+ELLAACR L I S EL+ I +T+CC+L+ F L AL Sbjct: 475 DWF-----LNFGALYLCMELLAACRDLVIYSRELASNSIPAHETFCCILQSFCVSLIDAL 529 Query: 1550 RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKSIFENILNVFTSNLSVGWEEQ 1729 IL T+ N+ A + DIY V+ LQILATFP L S ++F+NIL S + + ++ Sbjct: 530 CSILETTANEGADDVDIYLRVRSLQILATFPEDLLAISDNVFKNILTTLMSIIFKDFNQK 589 Query: 1730 LLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASLIPLEDSSMPLSLQLEAIFEI 1909 LWKL+LKAL+ IG+F+ + ++SEK Y +IVVEK+ S + +++ ++P L+LEA+ EI Sbjct: 590 FLWKLALKALVHIGSFVSR-YESEKAQSYNSIVVEKMVSWVSVDNCTLPFPLKLEAVSEI 648 Query: 1910 GTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVPLLECYSSKVLPWSHKSEGFE 2089 G G + ML ++QG+E AI + + +V GN+ S E+ + LL+ YS KV+PW H++EG E Sbjct: 649 GASGRNHMLNIVQGLEGAIFSYVSDFYVHGNVSSAEVAIQLLQFYSEKVIPWIHETEGLE 708 Query: 2090 DVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLAFAGCSEYDQGLIVQKAYGVL 2269 ++ LRF+ NIW+ +E+ ++ N V+ K LLD +M M+L CSE Q +I+QKAY VL Sbjct: 709 EILLRFATNIWDHVESWISCNVEVQEKGLLDAIMMAMKLTVGSCSEEIQYIILQKAYTVL 768 Query: 2270 SSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECIISLFASVIIALRPQTSLLNVR 2449 SS + LK+S S+P++L++ QL Q +++ S+RDE ++SLFASVIIA+RP+T + N++ Sbjct: 769 SSNTSLLLKKSSLTSIPVQLEESQLIQHVDNISHRDELVLSLFASVIIAVRPRTEIPNMK 828 Query: 2450 IILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSECTLEEAIDIIFKMGLWSVCG 2629 IL LF+T LL+GHV +AQALGSMINK +TE S E TLE+A+DIIFK W Sbjct: 829 EILYLFLTTLLRGHVPSAQALGSMINKFDTKAKSTEISRESTLEDAMDIIFKTKSWFFRD 888 Query: 2630 SDPLWKCSAIDDSNEGHVNLYRNVALQSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCL 2809 ++ L + + + L N+ LQ +A+VGLAWIGKG+++RGHEKVKD++M LL CL Sbjct: 889 NEVLQRNGNGMGLKDLCLGLMNNIQLQVHAIVGLAWIGKGLLLRGHEKVKDVIMTLLECL 948 Query: 2810 LSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYK 2989 + S+ R + L+QD E D HP V RSAADAFH+L+SDS VCLNK FHA IRPLYK Sbjct: 949 MPDSSTRAAKLKQDSFENILEQDFHPSVRRSAADAFHILMSDSGVCLNKIFHAIIRPLYK 1008 Query: 2990 QHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXX 3169 QH FS +MP+L S LK D S +RSMLYRA H+I+ PL VV+EAKK Sbjct: 1009 QHLFSVVMPLLQSLLKNFDPSFSRSMLYRASVHIIADAPLIVVVSEAKKLISLLLEGLSI 1068 Query: 3170 XXXXXXXXXXTYSLLLVLSGILMDENGKEAVTENAHIIINHIIGLVSYPHMMLVRE 3337 YSLLLVLS IL D+ G+EAV ENAH +IN +IGL++YPHMMLVRE Sbjct: 1069 LSEDILDKDQLYSLLLVLSAILTDKKGEEAVIENAHSVINCLIGLIAYPHMMLVRE 1124 >ref|XP_007024312.1| MMS19 nucleotide excision repair protein, putative isoform 3 [Theobroma cacao] gi|508779678|gb|EOY26934.1| MMS19 nucleotide excision repair protein, putative isoform 3 [Theobroma cacao] Length = 1062 Score = 1032 bits (2668), Expect = 0.0 Identities = 555/1075 (51%), Positives = 735/1075 (68%), Gaps = 9/1075 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+ VDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL LQK ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N GALYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LA A CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN 3247 H+I TPL V+++AKK Y LLLVLSGILMD+N Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKN 1059 >ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304108 [Fragaria vesca subsp. vesca] Length = 1149 Score = 1008 bits (2605), Expect = 0.0 Identities = 550/1100 (50%), Positives = 721/1100 (65%), Gaps = 4/1100 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA + H+E +VD +RP +Q+++L+ I +LVK DLLTIE LVKEM +YLT TDNVI Sbjct: 1 MAATTQLTHHLECYVDTARPPAEQAASLNFITSLVKKDLLTIEVLVKEMRMYLTITDNVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SSKPLDNAT+HSLIGFFTDRL DW+AL GALIGCLALLRR+ N G Sbjct: 61 RARGILLLAEVLTGLSSKPLDNATIHSLIGFFTDRLSDWRALRGALIGCLALLRRQVNAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +V+ SDA+ + +Y +NI VQSL DRKLCFE+L CLL RY V +LG+DL Y I EA Sbjct: 121 MVSASDAKVVAQSYRENIPVQSLAQQDRKLCFELLECLLQRYPNEVASLGEDLFYAISEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP CL+LTFHIVE L LFPDP GPLA F DLF+ LG YFPIHFTH K++ +V Sbjct: 181 IDEEKDPHCLILTFHIVEALVKLFPDPSGPLATFCGDLFEFLGCYFPIHFTHLKDEDANV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KR+DLS+ALM AF+ST LFEPF + VDSLKYL++C+ +YG +RM K Sbjct: 241 KREDLSKALMSAFSSTALFEPFVIPLLLEKLSSSLPLAKVDSLKYLNYCASRYGAERMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 HA +IW S+K A +S + P S +E G+ F++NEI EAL+ LQ +Q+ L L Sbjct: 301 HAETIWISIKHA-ISNSLEVPAKSFTAEPLVGLGFEENEIVTEALILLQNVTMQNDALLL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI+ DE+I S+ S + Y +IP + +Q LHAVG I + + S A CNRVF++FF Sbjct: 360 SLIVRDEDINNVINSIASHESYTNIPSQGRQSLHAVGRIFFIITKTSMASCNRVFESFFP 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 LM L +S G+S C L + S++ GALY C+E +AACR L + + + + + Sbjct: 420 SLMKTLEISMGNSSKDCTLKENSFSSKRFKFGALYFCVEFIAACRDLIMRTNDHDEKFGT 479 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 ++T CC+L+ + L A L + A +ADIY VKGLQ+LATFP FL K+ Sbjct: 480 ADETCCCMLQSSAPTLITAFCTTLAQISCNVADDADIYFKVKGLQMLATFPGYFLQIPKA 539 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 +FEN+L S + V +++ LLWKL+LKAL IG+F+ +SEK Y + VVEK SL Sbjct: 540 MFENVLKTLMSIILVDFDKPLLWKLALKALAHIGSFVDVHLESEKAQSYTSFVVEKTISL 599 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 P +D +P L+LEA+FEIG + MLR+IQG+E+AI AN + F+ G+LK+ E + Sbjct: 600 -PQDDFDVPFPLKLEAVFEIGASRPNHMLRIIQGLEDAIVANLSKTFIHGDLKAAEKTIQ 658 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+K++ W ++ G E+V RF ++IWN +E + V+ K LLD MT M+LA Sbjct: 659 LLECYSNKIISWIDENGGLEEVLCRFVISIWNCLERCKDSSNQVQDKGLLDATMTAMKLA 718 Query: 2210 FAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 CSE Q +I+QKAYG LSS P K+S S KL+ L L + L+ S RDE I Sbjct: 719 VGSCSEESQNIIIQKAYGALSSGISIPFKDSTDDSSLAKLETLHLFEQLDKLSPRDEWIF 778 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA+RP+T + N + IL LFMT L+KG AAQALGS+INKL + N S+ Sbjct: 779 SLFASVIIAMRPRTPIANAKGILHLFMTALVKGCTPAAQALGSVINKLGIQSNEITISTA 838 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNLYRNVA----LQSNAMVGLAW 2737 CTLEEA+ IIF+ LW++ + L N G L V+ LQ + + GLAW Sbjct: 839 CTLEEAMGIIFRSKLWNIGENGVLRGSGTSHSRNVGLTELCLGVSSNKLLQVHVITGLAW 898 Query: 2738 IGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADAF 2917 IGKG+++ G+E+VKD+ I+L CLL+ + TS L+Q +L SE P VMR+AADAF Sbjct: 899 IGKGLLLIGNEQVKDVTKIILDCLLADDKVDTSELRQGLLETSSE---QPSVMRTAADAF 955 Query: 2918 HVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVIS 3097 H+L+SDSDVCLN++FHA IRPLYKQ FFS++MP+L S + + DSS +RSML+RA H+IS Sbjct: 956 HILMSDSDVCLNRKFHANIRPLYKQRFFSTVMPILHSLIVKSDSSLSRSMLFRASAHLIS 1015 Query: 3098 GTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVTENAH 3277 PL +++EAKK YSLLLVLSGIL D+ G+EAV ENAH Sbjct: 1016 NAPLIVILSEAKKLMKVLLDGLSILSDDILDKDKLYSLLLVLSGILTDKCGEEAVLENAH 1075 Query: 3278 IIINHIIGLVSYPHMMLVRE 3337 III+ + LV+YPHMMLVRE Sbjct: 1076 IIIDCLSRLVAYPHMMLVRE 1095 >ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Cucumis sativus] Length = 1147 Score = 996 bits (2574), Expect = 0.0 Identities = 533/1106 (48%), Positives = 739/1106 (66%), Gaps = 10/1106 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ +VE+FVD SR +QQ+++L+ I +LVKN++LTIE LV+EMG+YLT TDN+I Sbjct: 1 MAELCKLTHYVESFVDVSRTPSQQATSLETITSLVKNNVLTIETLVREMGMYLTITDNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R RGI +SKPLD+AT+HSLI FFT+RL DW+AL GAL+GCLAL+RRK+NVG Sbjct: 61 RGRGILLLGELLACLASKPLDSATIHSLIAFFTERLADWKALRGALVGCLALMRRKTNVG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++ +DA+++ +Y QN+QVQSLG HDRKL FE+L CLL+ Y +AV++LGDDL+YGICEA Sbjct: 121 SISQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP CL+LTF IVE++ LFPDP G LA S DLF+ LG YFPIHFTH K + DV Sbjct: 181 IDGEKDPHCLLLTFRIVELVAKLFPDPTGALASSSSDLFEFLGCYFPIHFTHGKEEDIDV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 +R+DLS ALM AF+STPLFEPFA+ +DSLKYLS C+VKYG DRM K Sbjct: 241 RRNDLSHALMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMKK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 H+ +IWSS+K+ F S Q P S+ +ES N FQ+NE+T EAL LQK ++ GLFL Sbjct: 301 HSEAIWSSVKEIIFTSIGQ-PNLSINTESLNSPSFQENEMTTEALRLLQKMVVASNGLFL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 +LI+ DE+++ F + Y D P++ +Q+L+AVG IL S++ S A C+ VF+++F Sbjct: 360 TLIINDEDVKDIFNILNIYTCYKDFPLQSRQRLNAVGHILYTSASASVASCDHVFESYFH 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLIS 1489 RL+D +G+S +D LN GALYLCIE++AACR L + S+E S Sbjct: 420 RLLDFMGISVDQYH-----NDKISPIRNLNFGALYLCIEVIAACRNLIVSSDE---NTCS 471 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 ++ +L+ FS + L + +A+ YC VKGL L+TFP P S+ Sbjct: 472 VKEKSYSMLQIFSCSVVQLLSSTFSGIVKRDLHDAEFYCAVKGLLNLSTFPVGSSPVSRV 531 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFE+IL F S ++V ++ LW +LKAL IG+F+ K S + YM IVVEK+A + Sbjct: 532 IFEDILLEFMSFITVNFKFGSLWNHALKALQHIGSFVDKYPGSVESQSYMHIVVEKIALM 591 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D +PL L+LE +IG G +ML+++ GIEE I N E +V GN KSVEI++ Sbjct: 592 FSPHDEVLPLMLKLEMAVDIGRTGRSYMLKIVGGIEETIFYNLSEVYVYGNSKSVEIVLS 651 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEG--KVLLDKMMTTMR 2203 LL+CYS+K+LPW ++ FE+V LRF++NIW+Q+E TF+ ++ +VLLD M ++ Sbjct: 652 LLDCYSTKILPWFDEAGDFEEVILRFALNIWDQIEKCSTFSTSMDKCIQVLLDATMMALK 711 Query: 2204 LAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 L+ CS+ Q +IVQKA+ VL ++SF PLK ++S ++P++++ LQ Q ++ + RDE Sbjct: 712 LSVRSCSKESQNIIVQKAFNVLLTSSFSPLKVTLSNTIPVQMEGLQFLQQKDNPTSRDEW 771 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFASV IALRPQ + +VR+I++L M +G V AAQALGSMINKL + + E S Sbjct: 772 ILSLFASVTIALRPQVHVPDVRLIIRLLMLSTTRGCVPAAQALGSMINKLSVKSDKVEVS 831 Query: 2564 SECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGH--------VNLYRNVALQSNA 2719 S +LEEAIDIIFK ++C + + +G ++ ++ LQ +A Sbjct: 832 SYVSLEEAIDIIFKTE----------FRCLHNESTGDGSEMFLTDLCSSIEKSSLLQVHA 881 Query: 2720 MVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMR 2899 +VGL+WIGKG+++ GH+KV+DI M+ L+ L+S+S SPLQQ L + +E + VM+ Sbjct: 882 VVGLSWIGKGLLLCGHDKVRDITMVFLQLLVSKSRTDASPLQQFKLEKDNETSLDFAVMK 941 Query: 2900 SAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRA 3079 AA+AFH+L+SDS+ CLN++FHA +RPLYKQ FFS+MMP+ + + + D+S +R MLY+A Sbjct: 942 GAAEAFHILMSDSEACLNRKFHAIVRPLYKQRFFSTMMPIFQTLVSKSDTSLSRYMLYQA 1001 Query: 3080 IGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEA 3259 HVIS TPL A++++AKK YSLLLVLSGILMD+NG+EA Sbjct: 1002 YAHVISDTPLTAILSDAKKFIPMLLDGLLTLSVNGINKDVVYSLLLVLSGILMDKNGQEA 1061 Query: 3260 VTENAHIIINHIIGLVSYPHMMLVRE 3337 VTENAH I++ + GL + HMMLVRE Sbjct: 1062 VTENAHKIVDCLAGLTDFSHMMLVRE 1087 >ref|XP_003546956.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X1 [Glycine max] Length = 1135 Score = 987 bits (2551), Expect = 0.0 Identities = 537/1104 (48%), Positives = 723/1104 (65%), Gaps = 8/1104 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S QQ+S+L+A+A+LV D L +EALV+E+ +YLTTTDNV+ Sbjct: 1 MAETTQLTRHIESYVDSSSTPAQQASSLNAVASLVNTDALPLEALVRELEMYLTTTDNVV 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPL++AT+HSL+GFF DRL DW+A+ GAL+GCLAL+RRKS VG Sbjct: 61 RARGILLLAEVMTRIESKPLNSATIHSLVGFFKDRLADWRAVQGALVGCLALIRRKSVVG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +VT SDA T+ ++LQ +QVQSLG +DRKLCFE+L CLL+RY +AV TLG+DLIYGICEA Sbjct: 121 MVTDSDATTIAQSFLQYMQVQSLGQYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP CL L FHIV L L PD LA +++D+FD+L YFPIHFTHP + V Sbjct: 181 IDAEKDPDCLKLAFHIVASLAQLNPDSSSLLASYAKDVFDILEPYFPIHFTHPSSGDTHV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 +RDDLS +LM AF+STPLFEPF + +DSLKYL CS KYG +R+ K Sbjct: 241 QRDDLSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 +A +IWSSLKD + EP FS +G+ F +NE EAL LQ+ + Q+ L + Sbjct: 301 YAGAIWSSLKDT-LSTYLGEPDFSFTIAPVDGIGFPENEFVIEALSLLQQLIAQNSSLLV 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE++ F ++TS + Y+ IPV+ K+KLHA+G IL ++S + + CN +F++ F Sbjct: 360 SLIIDDEDVNTIFSTITSYETYDAIPVQEKKKLHAIGRILYITSKTTISSCNAMFESLFT 419 Query: 1310 RLMDILGLS---PGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQ 1480 R+MD LG S P S S++L G LYLCIELLA CR L +GSEE + Q Sbjct: 420 RMMDNLGFSVRFPNGDISP---------SQRLKFGFLYLCIELLAGCRELIVGSEEPALQ 470 Query: 1481 LISEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPT 1660 + E +T C +L FS PL A +L S ++ + D Y GVKGLQILA F S P Sbjct: 471 YVFEHETCCTMLHSFSTPLFNAFGSVLAVSADRGPLDPDTYVGVKGLQILAMFHSDVFPI 530 Query: 1661 SKSIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKL 1840 KSIFENIL F S + + + +LW+ +LKAL +G+F +K +SEK + Y +VVEK+ Sbjct: 531 QKSIFENILKKFMSIIIEDFNKTILWEAALKALHHVGSFFQKFCESEKAMSYRNLVVEKI 590 Query: 1841 ASLIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEI 2020 ++ L+D ++ SL++EA+ IG G+ ML ++QG+ A+ AN + +V NL+S EI Sbjct: 591 VEILSLDDITLSFSLKVEALLNIGKTGMKNMLTILQGLGRAVFANLSKVYVHRNLRSSEI 650 Query: 2021 LVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTM 2200 V LLECYS ++LPW H++ G ED ++F+V+IW+Q N M + EGK LLD MM M Sbjct: 651 AVQLLECYSCQLLPWIHENGGSEDFVMQFAVDIWSQAGNCMDLSTPFEGKGLLDAMMKAM 710 Query: 2201 RLAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDE 2380 RL+ CS Q LI++KAY VLSS + F LKE ++ L LT S RDE Sbjct: 711 RLSVGSCSVESQNLIIRKAYSVLSSHTNFQLKE---------VERLPLTPGKYDISLRDE 761 Query: 2381 CIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEA 2560 IISLFASV+IA+ P+T + N+R+++ LF+ LL+G V AQALGS++NKL +T E Sbjct: 762 GIISLFASVVIAVCPKTYIPNIRVLVHLFIITLLRGVVPVAQALGSILNKLVSTSSTAEN 821 Query: 2561 SSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEG-----HVNLYRNVALQSNAMV 2725 SS+ TLEEA+D IF + S +D L +C+ + NE + + + LQ NA+ Sbjct: 822 SSDLTLEEALDAIFNTKI-SFSSTDMLQRCNGTSNGNEMVFTDICLGIANDRMLQINAIC 880 Query: 2726 GLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSA 2905 GL+W+GKG+++RGHEK+KDI MI + CL+S SPL +D L E +V++ A Sbjct: 881 GLSWMGKGLLLRGHEKIKDITMIFMECLIS-GTKSASPLIKDSLENTEEQIQDLLVIKCA 939 Query: 2906 ADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIG 3085 DAFHVL+SDS+VCLN++FHATIRPLYKQ FFSS+MP+L + + SS +RS LYRA Sbjct: 940 TDAFHVLMSDSEVCLNRKFHATIRPLYKQRFFSSVMPILQQIITKSHSSLSRSFLYRAFA 999 Query: 3086 HVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVT 3265 H++S TP+ A+V+EAKK Y LLLVLSGILM++NG+EAV Sbjct: 1000 HIMSDTPMVAIVSEAKK-LIPVLLDCLSMLTEIQDKDMLYGLLLVLSGILMEKNGQEAVV 1058 Query: 3266 ENAHIIINHIIGLVSYPHMMLVRE 3337 ENAHIIIN +I LV YPH MLVRE Sbjct: 1059 ENAHIIINCLIKLVGYPHKMLVRE 1082 >ref|XP_006597167.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Glycine max] Length = 1133 Score = 981 bits (2535), Expect = 0.0 Identities = 537/1104 (48%), Positives = 722/1104 (65%), Gaps = 8/1104 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S QQ+S+L+A+A+LV D L +EALV+E+ +YLTTTDNV+ Sbjct: 1 MAETTQLTRHIESYVDSSSTPAQQASSLNAVASLVNTDALPLEALVRELEMYLTTTDNVV 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPL++AT+HSL+GFF DRL DW+A+ GAL+GCLAL+RRKS VG Sbjct: 61 RARGILLLAEVMTRIESKPLNSATIHSLVGFFKDRLADWRAVQGALVGCLALIRRKSVVG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +VT SDA T+ ++LQ +QVQSLG +DRKLCFE+L CLL+RY +AV TLG+DLIYGICEA Sbjct: 121 MVTDSDATTIAQSFLQYMQVQSLGQYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP CL L FHIV L L PD LA +++D+FD+L YFPIHFTHP + V Sbjct: 181 IDAEKDPDCLKLAFHIVASLAQLNPDSSSLLASYAKDVFDILEPYFPIHFTHPSSGDTHV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 +RDDLS +LM AF+STPLFEPF + +DSLKYL CS KYG +R+ K Sbjct: 241 QRDDLSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFL 1129 +A +IWSSLKD + EP FS +G+ F +NE EAL LQ+ + Q+ L + Sbjct: 301 YAGAIWSSLKDT-LSTYLGEPDFSFTIAPVDGIGFPENEFVIEALSLLQQLIAQNSSLLV 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE++ F ++TS + Y+ IPV+ K+KLHA+G IL ++S + + CN +F++ F Sbjct: 360 SLIIDDEDVNTIFSTITSYETYDAIPVQEKKKLHAIGRILYITSKTTISSCNAMFESLFT 419 Query: 1310 RLMDILGLS---PGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQ 1480 R+MD LG S P S S++L G LYLCIELLA CR L +GSEE + Q Sbjct: 420 RMMDNLGFSVRFPNGDISP---------SQRLKFGFLYLCIELLAGCRELIVGSEEPALQ 470 Query: 1481 LISEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPT 1660 + E +T C +L FS PL A +L S ++ + D Y GVKGLQILA F S P Sbjct: 471 YVFEHETCCTMLHSFSTPLFNAFGSVLAVSADRGPLDPDTYVGVKGLQILAMFHSDVFPI 530 Query: 1661 SKSIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKL 1840 KSIFENIL F S + + + +LW+ +LKAL +G+F +K +SEK + Y +VVEK+ Sbjct: 531 QKSIFENILKKFMSIIIEDFNKTILWEAALKALHHVGSFFQKFCESEKAMSYRNLVVEKI 590 Query: 1841 ASLIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEI 2020 ++ L+D ++ SL++EA+ IG G+ ML ++QG+ A+ AN + V NL+S EI Sbjct: 591 VEILSLDDITLSFSLKVEALLNIGKTGMKNMLTILQGLGRAVFANLSK--VHRNLRSSEI 648 Query: 2021 LVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTM 2200 V LLECYS ++LPW H++ G ED ++F+V+IW+Q N M + EGK LLD MM M Sbjct: 649 AVQLLECYSCQLLPWIHENGGSEDFVMQFAVDIWSQAGNCMDLSTPFEGKGLLDAMMKAM 708 Query: 2201 RLAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDE 2380 RL+ CS Q LI++KAY VLSS + F LKE ++ L LT S RDE Sbjct: 709 RLSVGSCSVESQNLIIRKAYSVLSSHTNFQLKE---------VERLPLTPGKYDISLRDE 759 Query: 2381 CIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEA 2560 IISLFASV+IA+ P+T + N+R+++ LF+ LL+G V AQALGS++NKL +T E Sbjct: 760 GIISLFASVVIAVCPKTYIPNIRVLVHLFIITLLRGVVPVAQALGSILNKLVSTSSTAEN 819 Query: 2561 SSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEG-----HVNLYRNVALQSNAMV 2725 SS+ TLEEA+D IF + S +D L +C+ + NE + + + LQ NA+ Sbjct: 820 SSDLTLEEALDAIFNTKI-SFSSTDMLQRCNGTSNGNEMVFTDICLGIANDRMLQINAIC 878 Query: 2726 GLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSA 2905 GL+W+GKG+++RGHEK+KDI MI + CL+S SPL +D L E +V++ A Sbjct: 879 GLSWMGKGLLLRGHEKIKDITMIFMECLIS-GTKSASPLIKDSLENTEEQIQDLLVIKCA 937 Query: 2906 ADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIG 3085 DAFHVL+SDS+VCLN++FHATIRPLYKQ FFSS+MP+L + + SS +RS LYRA Sbjct: 938 TDAFHVLMSDSEVCLNRKFHATIRPLYKQRFFSSVMPILQQIITKSHSSLSRSFLYRAFA 997 Query: 3086 HVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVT 3265 H++S TP+ A+V+EAKK Y LLLVLSGILM++NG+EAV Sbjct: 998 HIMSDTPMVAIVSEAKK-LIPVLLDCLSMLTEIQDKDMLYGLLLVLSGILMEKNGQEAVV 1056 Query: 3266 ENAHIIINHIIGLVSYPHMMLVRE 3337 ENAHIIIN +I LV YPH MLVRE Sbjct: 1057 ENAHIIINCLIKLVGYPHKMLVRE 1080 >ref|XP_006853692.1| hypothetical protein AMTR_s00056p00136660 [Amborella trichopoda] gi|548857353|gb|ERN15159.1| hypothetical protein AMTR_s00056p00136660 [Amborella trichopoda] Length = 1160 Score = 978 bits (2527), Expect = 0.0 Identities = 534/1110 (48%), Positives = 726/1110 (65%), Gaps = 14/1110 (1%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA +SWIPHVE FVDPSR +QQ ++L+ IATL+K D LT+EALV+EM +YLTTTD + Sbjct: 1 MASTASWIPHVEVFVDPSRSKDQQDASLNVIATLMKKDTLTLEALVQEMEVYLTTTDASV 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 RSRGI +SKP+D A +HSL FFT RL DWQAL GA IGCLALLRRKS+ G Sbjct: 61 RSRGILLVAELLSYLASKPIDGAIIHSLTEFFTSRLADWQALRGAFIGCLALLRRKSHTG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++ +D L ++L NIQVQ+L HDR LC E+L CLLD++ + L D+L+YG+ A Sbjct: 121 TISDNDLINLTKSFLLNIQVQALALHDRLLCLELLECLLDQFPSVISVLDDELVYGVLAA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP+CLML FH+VE+L +FPDP ++D FD+LGRYFPI+FT P D D+ Sbjct: 181 IDEEKDPRCLMLVFHVVELLVQVFPDP-----SVAKDAFDILGRYFPIYFTRPNVDAIDI 235 Query: 770 KRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGK 949 KR+DLSR +M AF+S+PLFEPF + +D+LKYLSHC+ +YG RM Sbjct: 236 KREDLSRRMMNAFSSSPLFEPFCIPLLLEKLSSSLEMAKLDALKYLSHCAPRYGPSRMAS 295 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQK----FMLQDG 1117 HA +IWS+LKD F S P S+I E P+ + Q+NE+ EALVCL+ F + Sbjct: 296 HAYAIWSALKDVIFNLSSHGPSISIICELPDNLGSQENEVVKEALVCLENCVLVFDIPKD 355 Query: 1118 GLFLSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQ 1297 FL LI+EDE++EMTFRS+TS K D+P E KQ+ A+ IL S+ VSSACCNRVF Sbjct: 356 ETFLRLIVEDEDLEMTFRSITSEKCNKDLPHERKQRFCALRNILFTSAKVSSACCNRVFG 415 Query: 1298 NFFFRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGL--AIGSEEL 1471 +FF RLM+ L +S S + + +L+ AL++C+EL+AA L A+ S+E+ Sbjct: 416 SFFQRLMNFLRISSLDSPFDWASNRNSYVCVELDFEALHICLELIAASNHLANALSSQEV 475 Query: 1472 SPQLISEEDTWCCLLKKFSGPLAGALRYILVTSTNQ-------HACEADIYCGVKGLQIL 1630 P +D W LL+ FSG L AL +V + + E D+ V GLQIL Sbjct: 476 CP--TPTQDPWLLLLQSFSGCLVFALGSSVVANKSSSIREMSPSIGEEDLPLKVTGLQIL 533 Query: 1631 ATFPSCFLPTSKSIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVI 1810 ATFP + P S+ FENIL VF S ++ +EE LW +LKAL+Q+G I++ HDS++ + Sbjct: 534 ATFPDSYSPLSRDAFENILAVFMSVITERYEETSLWTSTLKALVQVGMSIERYHDSQRGV 593 Query: 1811 CYMTIVVEKLASLIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAF 1990 C+MTIV+EKL S + + PLSL L+AI EI +GL FM RV +G EA+S NF EA Sbjct: 594 CFMTIVIEKLLSYLFNRSTFPPLSLNLKAISEIAMMGLCFMKRVTKGFGEALSTNFLEAV 653 Query: 1991 VKGNLKSVEILVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGK 2170 +GN KS E+ + +L+CYS +LPW EGFE+ A+ + +IW+ ME +++F G GK Sbjct: 654 AEGNTKSAEMAIEILKCYSLYLLPWLQNKEGFEEDAMHLATDIWSYME-SISFCIGSHGK 712 Query: 2171 VLLDKMMTTMRLAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQ 2350 LL+ M M+LA C+ Q IV KA+ +L+S++ + +K+S+S S ++L+ L++T Sbjct: 713 SLLEATMMAMKLAVGCCTMNQQSSIVSKAHNILASSTLYLVKDSMSLSTSVQLEKLKITP 772 Query: 2351 DLESFSYRDECIISLFASVIIALRPQTSLLNVRIILKLFM-TGLLKGHVSAAQALGSMIN 2527 + S + +D +ISLFASV+IAL+PQT + ++RIIL+LFM LLKG ++AQALGS++N Sbjct: 773 ESVSSACKDGWLISLFASVVIALQPQTVIPDLRIILELFMIVVLLKGDEASAQALGSIVN 832 Query: 2528 KLPLMINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNLYRNVAL 2707 K P + + E S CTL EA+DI+ + G + + K +D++ E +L + Sbjct: 833 KWP--VKSNEVSGACTLGEAMDIMVERGFRPIIFNVNQKKHEDVDNNKEIVSHLPISNDS 890 Query: 2708 QSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHP 2887 + +A+ GLAWIGKG++MRGHEKVKDI ++LL C+L MR+ P Q D+LG I+ Sbjct: 891 RVHALFGLAWIGKGLVMRGHEKVKDITLLLLSCVLPTGGMRSMPSQHDVLGNDGGESINI 950 Query: 2888 IVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSM 3067 V RSAADAFH+++SDS+ +N++FHATIRPLYKQ F S++MP+LLSS+KE SS T+SM Sbjct: 951 AVARSAADAFHIIMSDSETSVNQKFHATIRPLYKQRFCSTVMPILLSSIKESHSSITKSM 1010 Query: 3068 LYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN 3247 L+R GH+I GTPLAA++ EA K Y LL+VLSGILMDE Sbjct: 1011 LFRTFGHIIIGTPLAAILIEAPKIVPPLLDGLSMLTLDVQNKDQIYDLLVVLSGILMDET 1070 Query: 3248 GKEAVTENAHIIINHIIGLVSYPHMMLVRE 3337 GKEAV ENAH II + LV+YPH+M+VRE Sbjct: 1071 GKEAVVENAHTIIGCLSKLVTYPHLMIVRE 1100 >ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris] gi|561023869|gb|ESW22599.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris] Length = 1145 Score = 969 bits (2504), Expect = 0.0 Identities = 528/1105 (47%), Positives = 725/1105 (65%), Gaps = 9/1105 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPS-NQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNV 226 MA+++ H+E++VD S S + Q ++L+A+A+LVK D+L +EALVKE+G+YLTTTD+V Sbjct: 1 MAESTQLTRHIESYVDASSSSPSLQVASLNAVASLVKTDVLPLEALVKELGMYLTTTDDV 60 Query: 227 IRSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNV 406 IR+RGI SKPLD+AT+HSL+GFF +RL DW+A+ GAL+GCLAL+RRKS + Sbjct: 61 IRARGILLLAEVITRTESKPLDSATIHSLVGFFKERLADWRAVRGALLGCLALIRRKSVL 120 Query: 407 GIVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICE 586 GIVT +DA+ + ++ Q +QVQSLG DRKLCFE+L CLL+ Y +A+ LGD LIYGICE Sbjct: 121 GIVTSTDAKAIAQSFFQYMQVQSLGQSDRKLCFELLDCLLEHYPDAITPLGDGLIYGICE 180 Query: 587 AIDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFD 766 AID EKDP+CLML FHIV+ L+P+ G LA +++D+FD+L YFPIHFTHP N Sbjct: 181 AIDAEKDPECLMLAFHIVQSWAQLYPESSGLLATYAKDVFDILEPYFPIHFTHPTNADTP 240 Query: 767 VKRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMG 946 V+RDDLSR+LM AF+STPLFEPF + +DSLKYL CS KYG +R+ Sbjct: 241 VQRDDLSRSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIA 300 Query: 947 KHATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLF 1126 K+A SIWSS+KD + EP FSL +G+ F +NE EAL LQ+ ++Q+ L Sbjct: 301 KYANSIWSSIKD-ILSTYLGEPDFSLNIAPADGIGFPENEFVVEALSLLQQLIVQNSSLL 359 Query: 1127 LSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFF 1306 + LI++DE++ + F ++ S + Y+ IPV+ K+KLHA+G IL +++ + CN VF++ F Sbjct: 360 VCLIVDDEDVNIFFNTIASYEIYDAIPVQEKKKLHAIGRILYIAAKSTVTSCNAVFESLF 419 Query: 1307 FRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLI 1486 ++MD LG+S + DS + S+++ G LYLCIELL R L +GS+E + Q + Sbjct: 420 SKIMDNLGVSVSNIDSSA--NGDISSSQRVKIGFLYLCIELLVGFRELIVGSKEPALQYV 477 Query: 1487 SEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E +T C +L FS L A +L S ++ + D Y GVKGLQILA F S K Sbjct: 478 IEHETCCTMLHSFSSSLFNAFGLVLAESADRCPLDPDTYIGVKGLQILAMFHSDVFSMQK 537 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 SIFENIL F S + + +++LW+ +LKAL +G+F+++ H+SEK + Y ++VVEK+ Sbjct: 538 SIFENILKKFMSIIIEDFNKKILWEAALKALCHVGSFVQEFHESEKAMSYGSLVVEKIVE 597 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 + L+D +P SL++EA+ IG G+ ML +QG+ +A+ AN + V +L+S EI V Sbjct: 598 FLFLDDIIVPFSLKVEALSNIGMTGMKNMLTSLQGMRKAVFANLSK--VHTDLRSSEIAV 655 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRL 2206 LLECYS K+LPW+H++ G ED AL+F+V+IW+Q N M + E K LL +M M+L Sbjct: 656 QLLECYSCKLLPWTHENGGSEDFALQFAVDIWSQAGNCMVSSTSFEEKGLLYALMKAMKL 715 Query: 2207 AFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECI 2386 + CS Q LI+QKAY +LSS + F LKE L+ L L+ + S DE I Sbjct: 716 SVGICSVESQNLIIQKAYSILSSRTNFQLKE---------LERLPLSPGKYNISLTDEWI 766 Query: 2387 ISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASS 2566 ISLFASV+IA+ P+T + N+R+++ LF+ LL+G V AQALGS++NKL N+ E SS Sbjct: 767 ISLFASVVIAVCPKTLIPNIRVLVNLFIVTLLRGIVPVAQALGSLLNKLVSTSNSAENSS 826 Query: 2567 ECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNE--------GHVNLYRNVALQSNAM 2722 + TLEEA+D IF +W D L +C+ + E G N + LQ NA+ Sbjct: 827 DITLEEALDAIFNTKIW-FSSIDILQRCNGTSNGKEIVLTDICLGFAN---DKLLQINAI 882 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 GL+WIGKG+++RGHE +KDI M + CL+ + P +D LG E P+VM+S Sbjct: 883 CGLSWIGKGLLLRGHEGIKDITMTFIECLIP-GTKSSLPFFKDSLGNTEEQIQDPLVMKS 941 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAFHVL+SDS+VCLNK+FHATIRPLYKQ FFSSMMP+ L + + SS +RS LYRA+ Sbjct: 942 AADAFHVLMSDSEVCLNKKFHATIRPLYKQRFFSSMMPIFLQLITKAYSSLSRSFLYRAL 1001 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAV 3262 H+IS TP+ AV+ +AKK Y LLLVLS IL ++NG+EAV Sbjct: 1002 AHIISDTPMVAVLNDAKKLIPVLLDCFSMLTEDIQDKDMLYGLLLVLSCILTEKNGQEAV 1061 Query: 3263 TENAHIIINHIIGLVSYPHMMLVRE 3337 TENAHIIIN +I LV YPH MLVRE Sbjct: 1062 TENAHIIINCLIKLVDYPHKMLVRE 1086 >ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum lycopersicum] Length = 1153 Score = 966 bits (2498), Expect = 0.0 Identities = 515/1100 (46%), Positives = 719/1100 (65%), Gaps = 3/1100 (0%) Frame = +2 Query: 47 LMAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNV 226 ++++ + + +E++V S QQ++++DAIA L+KNDLL++E LV+EM +YLTTTDN+ Sbjct: 15 VLSQPTELVRIIESYVSSSSSEAQQAASIDAIALLLKNDLLSLETLVREMEMYLTTTDNI 74 Query: 227 IRSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNV 406 IRSRGI SKPL + + SL+ FFT+RL DW+ALHGAL+GCLALLRRK+ V Sbjct: 75 IRSRGILLLGELLMRLMSKPLGDTAISSLMEFFTERLADWKALHGALVGCLALLRRKTGV 134 Query: 407 GIVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICE 586 G+++ S A+ + +YL+ +QVQSLG HDRKLC +IL CLLDRY +A+ +LGDDL+YGICE Sbjct: 135 GMISRSQAKAVAESYLKTLQVQSLGQHDRKLCLQILECLLDRYRDALFSLGDDLVYGICE 194 Query: 587 AIDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFD 766 AIDGEKDPQCLML FHIVE+L LFP+ GPL F+ DLF++L YFPIHFTHPK+D D Sbjct: 195 AIDGEKDPQCLMLIFHIVELLAQLFPEASGPLENFAGDLFEILECYFPIHFTHPKSDDVD 254 Query: 767 VKRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMG 946 +KR++LSRALMLAFASTPLFEP + V+SLKYLS C++KYG DRM Sbjct: 255 IKREELSRALMLAFASTPLFEPSVIPLLLDKLSSSLPSAKVESLKYLSFCTLKYGGDRME 314 Query: 947 KHATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLF 1126 K+ S+WS+LKDA F +SPQ + S S+ +G+ F ++EI +AL LQ + Q F Sbjct: 315 KYTKSLWSALKDALF-TSPQSTL-SEDSDPIDGLGFHESEIMTQALEFLQVLVRQHNASF 372 Query: 1127 LSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFF 1306 LSLI+ D +I S + +N + + KQ+LHAVG +LSV S++ CN+VF++FF Sbjct: 373 LSLIMGDGDISTFLNSFSQFDNFNSLSTQYKQRLHAVGHVLSVCIKASASSCNKVFESFF 432 Query: 1307 FRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLI 1486 RL+D L LS +S G + + N GALYLC+ELLAACR L + S+E++ Sbjct: 433 PRLVDALRLSVDNS-HGIVHS---AVDANFNFGALYLCVELLAACRQLVVSSDEVASAHD 488 Query: 1487 SEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 D+WC +L FS L ++ S + A +Y VKGL+ILATFP F+ SK Sbjct: 489 LARDSWCQILHSFSTSLCNVFFCLIRASCVESTRNAYVYAAVKGLEILATFPGSFISVSK 548 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 ++ENIL TS + + ++ LWK +LKAL++I F+ K H+ EK + +IV +K+ S Sbjct: 549 LMYENILLTLTSIIESEFNKKFLWKAALKALVEISLFVNKYHEDEKAASFNSIVKQKIVS 608 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 LI +D +MP SL+LEA+F+IG G +FML V+ +E+ ISAN E V G+ + + Sbjct: 609 LISSDDLNMPQSLKLEAVFDIGLTGKNFMLSVVSELEKTISANLSEILVHGDRRLAGLTA 668 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRL 2206 LLECYS+KVLPW H + G ++V+L F+VNI+ +ME+N + + EGK LL M M+ Sbjct: 669 GLLECYSNKVLPWFHVNGGADEVSLSFAVNIFTKMEHNTSLSLEAEGKELLGATMAAMKQ 728 Query: 2207 AFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECI 2386 A CS Q ++QKA V+ + SFF ++ + QL Q E S +DE I Sbjct: 729 AMTCCSVESQEKVLQKAIDVMETNSFF-FSNNLILGTDLFNKKTQLGQTSEGLSCQDEWI 787 Query: 2387 ISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASS 2566 ISLFASV+IALRPQT + N+R++L+L LL+GH+ +AQALGS++NKLPL I S Sbjct: 788 ISLFASVVIALRPQTQIPNIRLLLQLLAMTLLEGHIPSAQALGSLVNKLPLNI-----SE 842 Query: 2567 ECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGH---VNLYRNVALQSNAMVGLAW 2737 +C+L+E ID++ K LW S + N G ++ R+ +L S+A++GLAW Sbjct: 843 DCSLKELIDMLLKNVLWR--------NISIGKEGNHGDAVAMSNLRSSSLNSHAVIGLAW 894 Query: 2738 IGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADAF 2917 IGKG++MRGHEK+KD+ M L CL+S + D + + +E + + +SAADAF Sbjct: 895 IGKGLLMRGHEKLKDVTMTFLSCLVSNEDQGNLLPFNDQMKDPAELKVFSL-RKSAADAF 953 Query: 2918 HVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVIS 3097 H+++SDSD CLN+ +HA +RPLYKQ FF+ MMP+ LS++ +CDSST+R LY+A H++S Sbjct: 954 HIVMSDSDACLNRNYHAIVRPLYKQRFFNIMMPMFLSAIAKCDSSTSRCFLYQAFAHLVS 1013 Query: 3098 GTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVTENAH 3277 TPL AVV +AKK YS+L+VLSGIL D+NG+E + ENA Sbjct: 1014 ETPLVAVVGDAKKVLPVLMDCFLILSKDISHKEIIYSVLIVLSGILTDKNGQETIIENAP 1073 Query: 3278 IIINHIIGLVSYPHMMLVRE 3337 ++I +I L SYP+MM++RE Sbjct: 1074 MVIRRLIELTSYPYMMVIRE 1093 >ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein mms19-like isoform X1 [Glycine max] Length = 1132 Score = 963 bits (2489), Expect = 0.0 Identities = 533/1101 (48%), Positives = 722/1101 (65%), Gaps = 5/1101 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S P++Q +S L+AIA+LV D LT+EAL++E+ +YLTTTDNV+ Sbjct: 1 MAETTQLTRHIESYVDSSSPTHQAAS-LNAIASLVNTDALTLEALIRELEMYLTTTDNVV 59 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPL++AT+HSL+GFF DRL DW+A+ GAL+GCLAL+RRKS VG Sbjct: 60 RARGILLLAEVMTHIESKPLNSATIHSLVGFFKDRLADWRAVRGALVGCLALIRRKSVVG 119 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNH-DRKLCFEILICLLDRYSEAVITLGDDLIYGICE 586 +VT SDA T+ ++LQ +QVQSLG H DRKLCFE+L CLL+RY +AV TLG+DLIYGICE Sbjct: 120 MVTDSDATTIAQSFLQYMQVQSLGQHYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICE 179 Query: 587 AIDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFD 766 AID EKDP CL L FHIVE L L PD G LA F++D+FD+L YFPIHFT P + Sbjct: 180 AIDAEKDPDCLKLAFHIVESLAQLNPDSSGLLASFAKDVFDILEPYFPIHFTRPSSGDTH 239 Query: 767 VKRDDLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMG 946 V+RD LS +LM AF+STPLFEPF + +DSLKYL CS KYG R+ Sbjct: 240 VQRD-LSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAGRIA 298 Query: 947 KHATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLF 1126 K+A +IWSSLKD + EP FS +G+ F +NE EAL LQ+ ++Q+ L Sbjct: 299 KYAGAIWSSLKDTL-STYLGEPDFSFTIAPVDGIGFPENEFVLEALSLLQQLIVQNSSLL 357 Query: 1127 LSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFF 1306 +SLI++DE++ F ++ S + Y+ IPV+ K+KLHA+G IL++++ + + CN VF++ F Sbjct: 358 VSLIIDDEDVNSIFSTIASYETYDAIPVQEKKKLHAIGRILNITAKTTISSCNAVFESLF 417 Query: 1307 FRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLI 1486 RLMD LG S +S S+++ G LY+CIELLA CR L +GS+E + Q + Sbjct: 418 SRLMDNLGFSVRFPNSD------IPPSQRVKFGFLYVCIELLAGCRELIVGSDEPALQYV 471 Query: 1487 SEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E +T C +L +FS PL A +L S ++ + D Y GVKGLQILA F S P K Sbjct: 472 FEHETCCTMLHRFSTPLFNAFGSVLAVSADRCPLDPDTYIGVKGLQILAMFGSDVFPIQK 531 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 S+FENIL F S + + + +LW+ +LKAL Q+G+F++K H+SEK + Y +VVEK+ Sbjct: 532 SVFENILKKFMSIIVEDFNKTILWEAALKALYQVGSFVQKFHESEKAMSYRNLVVEKIVE 591 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 ++ L+D ++P SL+LEA+ IG G+ ML ++QG+ A+ +N + V NL+S +I V Sbjct: 592 ILSLDDITLPFSLELEALSNIGMTGMKNMLTILQGLGRAVFSNLSKVHVHRNLRSSDIAV 651 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRL 2206 LLECYS ++LPW H++ G ED ++F V+IW+Q N M F+ E K LLD +M M+L Sbjct: 652 QLLECYSCQLLPWIHENGGSEDFVMQFVVDIWSQAGNCMDFSTLFEEKGLLDAIMKAMKL 711 Query: 2207 AFAGCSEYDQGLIVQKAYGVLSS-TSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 + C+ Q LI+QKAY VLSS T+F LKE ++ L LT + S RDE Sbjct: 712 SVGSCAVESQNLIIQKAYCVLSSHTNFQQLKE---------VERLPLTPGNYNISLRDEG 762 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGH-VSAAQALGSMINKLPLMINTTEA 2560 +ISLFASV+IA+ P+T + N R+++ LF+ LL+G V AQALGS++NKL N+ E Sbjct: 763 LISLFASVVIAVFPKTYIPNKRVLMHLFIITLLRGGVVPVAQALGSILNKLVSTSNSAEN 822 Query: 2561 SSECTLEEAIDIIF--KMGLWSVCGSDPLWKCSAIDDSNEGHVNLYRNVALQSNAMVGLA 2734 SS+ TLEEA+D+IF K+ S + D G N + LQ NA+ GL+ Sbjct: 823 SSDLTLEEALDVIFNTKISFSSTDNGRSNGNEMVLTDICLGIAN---DRMLQINAICGLS 879 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG+++ GHEK+KDI+MI L CL+S SPL +D L EH +VM+ AADA Sbjct: 880 WIGKGLLLSGHEKIKDIIMIFLECLIS-GTKSASPLIKDSLENTEEHIQDLLVMKCAADA 938 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 FHVL+SDS+VCLN++FHA IRPLYKQ F SS+MP+L + + SS +RS LYRA H++ Sbjct: 939 FHVLMSDSEVCLNRKFHAMIRPLYKQRFSSSVMPILQQIITKSHSSLSRSFLYRAFAHIL 998 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKEAVTENA 3274 S TP+ A+++EAKK Y LLLVLSGIL ++NG+EA ENA Sbjct: 999 SDTPMVAILSEAKKLIPVLLDCLSMLTEDIQDKDMLYGLLLVLSGILTEKNGQEAAIENA 1058 Query: 3275 HIIINHIIGLVSYPHMMLVRE 3337 HIIIN +I LV YPH MLVRE Sbjct: 1059 HIIINCLIKLVDYPHKMLVRE 1079 >ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum tuberosum] Length = 1170 Score = 954 bits (2466), Expect = 0.0 Identities = 517/1126 (45%), Positives = 713/1126 (63%), Gaps = 33/1126 (2%) Frame = +2 Query: 59 ASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVIRSR 238 A ++ H+E++V S QQ++++DAIA L+KNDLL++E LV+EM +YLTTTDN+IRSR Sbjct: 6 AIQYVIHIESYVSSSSSEAQQAASVDAIAVLLKNDLLSLETLVREMEMYLTTTDNIIRSR 65 Query: 239 GIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVGIVT 418 GI SKPL + + SLI FFT+RL DW+ALHGAL+GCLALLRRK+ G++ Sbjct: 66 GILLLGELLMRLMSKPLGDTAISSLIEFFTERLADWKALHGALVGCLALLRRKTGTGMIN 125 Query: 419 VSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEAIDG 598 S A+ + +YL+ +QVQSLG DRKLC +IL CLLDRY +A+ +LGDDL+YGICEAIDG Sbjct: 126 RSQAKAVAESYLKTLQVQSLGQQDRKLCLQILECLLDRYRDALFSLGDDLVYGICEAIDG 185 Query: 599 EKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDVKRD 778 EKDPQCLML FHIVE+L LFP+ GPL F+ DLF++L YFPIHFTHPK+D D+KR Sbjct: 186 EKDPQCLMLIFHIVELLAQLFPEASGPLENFAGDLFEILECYFPIHFTHPKSDDVDMKRG 245 Query: 779 DLSRALMLAFASTPLFEPFAMXXXXXXXXXXXXXXXVDSLKYLSHCSVKYGVDRMGKHAT 958 +LSRALMLAFASTPL+EP + V+SLKYLS+C++KYG DRM K+ Sbjct: 246 ELSRALMLAFASTPLYEPSVIPLLLDKLSSSLPSAKVESLKYLSYCTLKYGGDRMEKYTK 305 Query: 959 SIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLQKFMLQDGGLFLSLI 1138 S+WS+LKDA F + PQ + S S+ +G+ F ++EI +AL LQ + Q FLSLI Sbjct: 306 SLWSALKDALF-TCPQSTL-SEDSDPIDGLGFHESEIMTQALELLQVLVRQHNDSFLSLI 363 Query: 1139 LEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFFRLM 1318 L D +I S + +N + + KQ+LHAVG +LSV S + CN+VF++FF RL+ Sbjct: 364 LGDGDISTFLNSFSQFDDFNSLSTQYKQRLHAVGHVLSVCIKASGSSCNKVFESFFPRLV 423 Query: 1319 DILGLSPGSSDSGCILDDGFVLSEKLNSGALYLCIELLAACRGLAIGSEELSPQLISEED 1498 D L LS +S G + L N GALYLC+ELLAACR L + S+E++ D Sbjct: 424 DALRLSVENS-HGIVHS---ALDANFNFGALYLCVELLAACRQLVVSSDEVASAHDLARD 479 Query: 1499 TWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKSIFE 1678 +WC +L+ F L ++ S + A +Y VKGL+IL TFP F+ SK ++E Sbjct: 480 SWCQILRSFCTSLCNVFFCLIRASCVESTWNAYVYAAVKGLEILGTFPGSFISVSKLMYE 539 Query: 1679 NILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASLIPL 1858 NIL TS + + ++ LWK +LKAL++I F+ K H+ EK + +IV +K+ SLI Sbjct: 540 NILLTLTSIIESDFNKKFLWKAALKALVEISLFVNKYHEDEKAAIFNSIVKQKIVSLISS 599 Query: 1859 EDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP--- 2029 +D +MP SL+LEAIF+IG G FM V+ +E+ ISAN E V+ +++ +L+ Sbjct: 600 DDLNMPQSLKLEAIFDIGLTGKSFMHSVVSELEKTISANLSEILVRVLIETSRLLLTYHM 659 Query: 2030 ---------------------------LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQ 2128 LLECYS+KVLPW H + G ++V+L F++NI+ + Sbjct: 660 HRLFNFGALFLLLQVHGDRRLAGLTPGLLECYSNKVLPWFHGNGGADEVSLSFAINIFTK 719 Query: 2129 MENNMTFNAGVEGKVLLDKMMTTMRLAFAGCSEYDQGLIVQKAYGVLSSTSFFPLKESIS 2308 MENN + + +GK LL M M+ A GCS Q ++QKA V+ ++SFF L + Sbjct: 720 MENNSSLSLEAKGKELLGATMAAMKQAMTGCSVESQEKVLQKAIDVMETSSFF-LSNDLI 778 Query: 2309 FSVPIKLDDLQLTQDLESFSYRDECIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKG 2488 + QL Q E S RDE I SLFASV+IALRPQT + N+R++L+L LL+G Sbjct: 779 LGTDLFNKKTQLGQTSEGLSCRDEWITSLFASVVIALRPQTQIPNIRLLLQLLAMTLLEG 838 Query: 2489 HVSAAQALGSMINKLPLMINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDS 2668 H+ +AQALGS++NKLPL I S +C+LEE ID +FK +W S + Sbjct: 839 HIPSAQALGSLVNKLPLNI-----SEDCSLEELIDTLFKNVMWR--------NISIGKEG 885 Query: 2669 NEGHVNLYRNV---ALQSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSP 2839 N+G N+ +L S+A++G AWIGKG++MRGHEK+KD+ M L CL+S + Sbjct: 886 NDGGAVAMSNLRLNSLNSHAVIGFAWIGKGLLMRGHEKLKDVTMTFLSCLVSNEDQGNLL 945 Query: 2840 LQQDMLGECSEHDIHPIVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPV 3019 D + + +EH + + +SAADAFH+L+SDSD CLN+ +HA +RPLYKQ FF+ MMP+ Sbjct: 946 PFNDQMKDPAEHKV-LCLRKSAADAFHILMSDSDACLNRNYHAIVRPLYKQRFFNIMMPM 1004 Query: 3020 LLSSLKECDSSTTRSMLYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXX 3199 LS++ +CDSST+R LY+A H++S TPL AVV +AKK Sbjct: 1005 FLSAIVKCDSSTSRCFLYQAFAHLVSETPLVAVVGDAKKVLPVLMDCFLVLSKDISHKEI 1064 Query: 3200 TYSLLLVLSGILMDENGKEAVTENAHIIINHIIGLVSYPHMMLVRE 3337 YS+L+VLSGIL D+NG+EA+ ENA ++I +I L SYP++M++RE Sbjct: 1065 IYSVLIVLSGILTDKNGQEAIIENAPMVIRRLIELTSYPYVMVIRE 1110