BLASTX nr result
ID: Akebia23_contig00001991
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia23_contig00001991 (3999 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI36057.3| unnamed protein product [Vitis vinifera] 1337 0.0 ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair ... 1209 0.0 ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair ... 1204 0.0 ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citr... 1198 0.0 ref|XP_002515963.1| DNA repair/transcription protein met18/mms19... 1194 0.0 ref|XP_007024310.1| MMS19 nucleotide excision repair protein, pu... 1177 0.0 ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prun... 1154 0.0 gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis] 1141 0.0 ref|XP_007024313.1| MMS19 nucleotide excision repair protein, pu... 1116 0.0 ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair ... 1095 0.0 ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304... 1089 0.0 ref|XP_007024314.1| MMS19 nucleotide excision repair protein, pu... 1074 0.0 ref|XP_006853692.1| hypothetical protein AMTR_s00056p00136660 [A... 1068 0.0 ref|XP_003546956.1| PREDICTED: MMS19 nucleotide excision repair ... 1068 0.0 ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair ... 1063 0.0 ref|XP_006597167.1| PREDICTED: MMS19 nucleotide excision repair ... 1062 0.0 ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phas... 1052 0.0 ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein ... 1044 0.0 ref|XP_007024312.1| MMS19 nucleotide excision repair protein, pu... 1042 0.0 ref|XP_004486785.1| PREDICTED: uncharacterized protein LOC101495... 1019 0.0 >emb|CBI36057.3| unnamed protein product [Vitis vinifera] Length = 1146 Score = 1337 bits (3459), Expect = 0.0 Identities = 692/1161 (59%), Positives = 857/1161 (73%), Gaps = 5/1161 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S ++E++VD SR S QQ++++DAIA L+KND+LT+E LV EMG+YLTTTDN+I Sbjct: 1 MAQLSQLTQYIESYVDSSRSSTQQAASVDAIAYLLKNDILTLETLVTEMGMYLTTTDNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLDN T+HSLI FFTDRL DW+AL GALIGCLAL++RKSN+G Sbjct: 61 RTRGILLLAELLTRLASKPLDNVTIHSLISFFTDRLADWRALRGALIGCLALMKRKSNMG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 VT +DAR + YL+N+QVQSLG HDRKLCFEIL CLLD Y E+V +LGDDL+YGIC A Sbjct: 121 RVTDNDARAVAQAYLENVQVQSLGQHDRKLCFEILECLLDHYPESVASLGDDLVYGICGA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP+CLMLTFHIVEIL LFPDP GPLA F+ DLFD+LG YFPIHFTHP+ + DV Sbjct: 181 IDGEKDPRCLMLTFHIVEILARLFPDPSGPLASFAGDLFDILGCYFPIHFTHPQGEDVDV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALMLAF+ST LFEPFA+P P AKVDSLKYLS+C +KYG DRM K Sbjct: 241 KRDDLSRALMLAFSSTTLFEPFAIPLLLEKLSSSLPLAKVDSLKYLSNCLLKYGDDRMTK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 H +IW S+KDA F S QEP+ SL SE + + FQ+NEI EA++ L+K +L++ GL L Sbjct: 301 HVEAIWFSVKDAIF-CSEQEPMLSLASELLDHVGFQENEIVTEAIILLQKVILENSGLSL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI+ D++I +VTS + YNDIP++ K KL A+G IL VS+ S CCNRVF++FFF Sbjct: 360 SLIVGDKDINTIVNTVTSFRSYNDIPLQSKHKLCAIGRILYVSAKASITCCNRVFESFFF 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMD LGLS +S C+ + +V SE+LN G LYLCIELLAACR L +GSEEL+ + +S Sbjct: 420 RLMDTLGLSVRNSSGDCLPNFDYVFSERLNFGALYLCIELLAACRDLVVGSEELTSKSVS 479 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 +++WCC+L FS L A +L ST++ A EADIY GVKGLQILATFP FLP SKS Sbjct: 480 AQESWCCMLHSFSSLLMKAFSSVLDASTDKDAYEADIYSGVKGLQILATFPGEFLPISKS 539 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFEN+L F S + + + LLWKL+LKAL+QIG+FI + H+SEK + Y IVVEK+ SL Sbjct: 540 IFENVLLTFISIIVEDFNKTLLWKLALKALVQIGSFIDRFHESEKALSYNYIVVEKIVSL 599 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 + L+D +P L+LEAI +IGT GL+ ML+++QG+E+AI AN E +V GNLKS +I V Sbjct: 600 MFLDDFGLPFQLRLEAISDIGTTGLNVMLKIVQGLEDAIFANLSEVYVHGNLKSAKIAVQ 659 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+K+LP H + FEDV RF+VNIWNQ+EN+M F+ G + LL+ MT M+LA Sbjct: 660 LLECYSNKLLPGIHGAGDFEDVLSRFAVNIWNQIENSMAFSVGAQENELLNATMTAMKLA 719 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 V CSE QG I++KAY VLSS F L ES+ + ++L+ LQ TQDLE FS RD+ +I Sbjct: 720 VGSCSEGSQGKIIKKAYSVLSSCPSFTLMESMPITGTVQLEGLQHTQDLECFSCRDKWVI 779 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFAS IIA+RPQT + N+R++L LFMT LLKGHV AAQALGSM+NKL N E SS Sbjct: 780 SLFASAIIAVRPQTHIPNIRVVLHLFMTNLLKGHVPAAQALGSMVNKLCPKSNGVEISST 839 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNE-GHVNLYRNVA----LQSNAMVGLA 2734 CTLE+A+DIIF LW PL +CS I NE G NL + + LQ A+ GLA Sbjct: 840 CTLEDALDIIFNTSLWDSHNHGPLKRCSGIGVDNEMGLANLCLSASNCQLLQVCAIEGLA 899 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG+++RGHEKVKDI MI LRCLLS++N E D+ P V +SAADA Sbjct: 900 WIGKGLLLRGHEKVKDITMIFLRCLLSKNN--------------QEQDVLPSVAKSAADA 945 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 FHVL+SDS++CLNKRFHA IRPLYKQ FFSS++P+L+SS+ E S TRSMLYRA+ H+I Sbjct: 946 FHVLMSDSEICLNKRFHANIRPLYKQRFFSSVLPILVSSMAESRLSNTRSMLYRALAHII 1005 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETITENV 3274 S TPL AV++EAKK Y+LLLVLSGILMD+NG+ET+ EN Sbjct: 1006 SDTPLIAVLSEAKKIIPILLDSLSILSTYNLDKDILYNLLLVLSGILMDKNGQETVVENA 1065 Query: 3275 HIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTV 3454 H+IIN +IGLV YPHMM+VRETAIQCLVAM+ LP+ RIYPMR Q+L+++ KALDDPKR V Sbjct: 1066 HVIINCLIGLVGYPHMMVVRETAIQCLVAMSRLPHARIYPMRTQVLRSVQKALDDPKRAV 1125 Query: 3455 RQEAVRCRQAWASIASTNVHF 3517 R EAVRCRQAWASIAS ++HF Sbjct: 1126 RHEAVRCRQAWASIASRSLHF 1146 >ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Citrus sinensis] Length = 1151 Score = 1209 bits (3127), Expect = 0.0 Identities = 639/1161 (55%), Positives = 823/1161 (70%), Gaps = 5/1161 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+AS I H+E+FV+ S Q+++LD IA+L+K ++LTIE LV+EMG+YLTTTD+VI Sbjct: 1 MAEASQLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HS++ FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++T +DA+ + +Y+QN+QVQSL HDRKLCFE+L CLL RY +AV++LG+DL+Y ICEA Sbjct: 121 VITTNDAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP CLMLTFHIVE+ LF D L LA F+ DLF++LG YFPIHFTH K + FDV Sbjct: 181 IDGEKDPHCLMLTFHIVEVAAELFSDDL--LANFASDLFEILGCYFPIHFTHSKAEDFDV 238 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALM AF+ST LFEPFA+P +AKVDSLKYLSHC+VKYG DR+ K Sbjct: 239 KRDDLSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEK 298 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 HA ++WSS+KDA + S EP S SES +G+ F++N I E+L L+ Q+ GLFL Sbjct: 299 HAKAMWSSIKDAVYSS--HEPTLSFASESLDGVGFRENVILTESLNLLDTVFKQNSGLFL 356 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S I+ DE+I + F+S++S K Y +I ++ KQKLHAVG ILSVS+ S A CN V ++FF Sbjct: 357 SWIIGDEDINLIFKSISSYKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFP 416 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 LM LGLS G+S C +DG VL KLN G LYLCIEL+ ACR L SEE Sbjct: 417 CLMHALGLSVGNSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAP 476 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + W CLL+ +S LA ALR L TS N+ + E ++Y GVKGL IL TF L S S Sbjct: 477 ANERWYCLLQSYSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFRGGSLIISNS 536 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFENIL FTS + +E LLWKL+LKAL+ IG+FI + ++SEK + YM +V+EK+ SL Sbjct: 537 IFENILLTFTSIIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSL 596 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D SMP L+LEAI EIG G +++L+++QG+EEA+ AN +E V GN KS E++V Sbjct: 597 ASSHDFSMPFPLKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQ 656 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+KVLP H+ GFE+V LRF+VNIWN +E ++TF++ V K LLD M M+LA Sbjct: 657 LLECYSNKVLPRIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEKGLLDATMKAMKLA 716 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 V CS Q ++ QKA+ VLS ++FPL+++ S ++PI L++ QLTQ+ S R+ I Sbjct: 717 VGSCSVESQNIVFQKAFTVLSLGTYFPLEDAAS-NIPILLNEFQLTQETSISSSREAWIC 775 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA RPQT + NVR++++LFMT LLKG+V AAQALGSM+NKL L N TE Sbjct: 776 SLFASVIIAARPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGN 835 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDD-SNEGHVNLYRNV----ALQSNAMVGLA 2734 CTLEEA+DIIF LWS S L +++ S+ G ++ R +LQ +A+ GLA Sbjct: 836 CTLEEAMDIIFDSKLWSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLA 895 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG++MRGHEKVKDI M + CLLS S + + L+QD SE V++ AADA Sbjct: 896 WIGKGLLMRGHEKVKDITMTFIECLLSNSKLGSFSLEQDYSENSSES-----VVKYAADA 950 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 F +L+ DS+ CL+++ HATIRPLYKQ F+S++MP+L S + + +SS +RS+L RA H+I Sbjct: 951 FKILMGDSEDCLSRKLHATIRPLYKQRFYSTIMPILQSLIIKSNSSFSRSILCRACAHII 1010 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETITENV 3274 S TPL V+ +AK YSLLLVLSGIL D+NG+E + E Sbjct: 1011 SDTPLIVVLNDAKTVIPILMDGLSILSNDVSDKDIVYSLLLVLSGILTDKNGQEAVIECA 1070 Query: 3275 HIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTV 3454 HIII+H I L+SYPHMMLVRETAIQCLVAM+GLP+ RIYPMR Q+LQA+S+ALDDPKR V Sbjct: 1071 HIIIDHSIILISYPHMMLVRETAIQCLVAMSGLPHARIYPMRRQVLQAVSRALDDPKRAV 1130 Query: 3455 RQEAVRCRQAWASIASTNVHF 3517 RQEAVRCRQAWAS AS +++F Sbjct: 1131 RQEAVRCRQAWASTASRSLYF 1151 >ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X1 [Citrus sinensis] Length = 1155 Score = 1204 bits (3116), Expect = 0.0 Identities = 640/1165 (54%), Positives = 823/1165 (70%), Gaps = 9/1165 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+AS I H+E+FV+ S Q+++LD IA+L+K ++LTIE LV+EMG+YLTTTD+VI Sbjct: 1 MAEASQLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HS++ FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++T +DA+ + +Y+QN+QVQSL HDRKLCFE+L CLL RY +AV++LG+DL+Y ICEA Sbjct: 121 VITTNDAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP CLMLTFHIVE+ LF D L LA F+ DLF++LG YFPIHFTH K + FDV Sbjct: 181 IDGEKDPHCLMLTFHIVEVAAELFSDDL--LANFASDLFEILGCYFPIHFTHSKAEDFDV 238 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALM AF+ST LFEPFA+P +AKVDSLKYLSHC+VKYG DR+ K Sbjct: 239 KRDDLSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEK 298 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 HA ++WSS+KDA + S EP S SES +G+ F++N I E+L L+ Q+ GLFL Sbjct: 299 HAKAMWSSIKDAVYSS--HEPTLSFASESLDGVGFRENVILTESLNLLDTVFKQNSGLFL 356 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S I+ DE+I + F+S++S K Y +I ++ KQKLHAVG ILSVS+ S A CN V ++FF Sbjct: 357 SWIIGDEDINLIFKSISSYKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFP 416 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 LM LGLS G+S C +DG VL KLN G LYLCIEL+ ACR L SEE Sbjct: 417 CLMHALGLSVGNSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAP 476 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + W CLL+ +S LA ALR L TS N+ + E ++Y GVKGL IL TF L S S Sbjct: 477 ANERWYCLLQSYSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFRGGSLIISNS 536 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFENIL FTS + +E LLWKL+LKAL+ IG+FI + ++SEK + YM +V+EK+ SL Sbjct: 537 IFENILLTFTSIIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSL 596 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D SMP L+LEAI EIG G +++L+++QG+EEA+ AN +E V GN KS E++V Sbjct: 597 ASSHDFSMPFPLKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQ 656 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+KVLP H+ GFE+V LRF+VNIWN +E ++TF++ V K LLD M M+LA Sbjct: 657 LLECYSNKVLPRIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEKGLLDATMKAMKLA 716 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 V CS Q ++ QKA+ VLS ++FPL+++ S ++PI L++ QLTQ+ S R+ I Sbjct: 717 VGSCSVESQNIVFQKAFTVLSLGTYFPLEDAAS-NIPILLNEFQLTQETSISSSREAWIC 775 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA RPQT + NVR++++LFMT LLKG+V AAQALGSM+NKL L N TE Sbjct: 776 SLFASVIIAARPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGN 835 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDD-SNEGHVNLYRNV----ALQSNAMVGLA 2734 CTLEEA+DIIF LWS S L +++ S+ G ++ R +LQ +A+ GLA Sbjct: 836 CTLEEAMDIIFDSKLWSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLA 895 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG++MRGHEKVKDI M + CLLS S + + L+QD SE V++ AADA Sbjct: 896 WIGKGLLMRGHEKVKDITMTFIECLLSNSKLGSFSLEQDYSENSSES-----VVKYAADA 950 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 F +L+ DS+ CL+++ HATIRPLYKQ F+S++MP+L S + + +SS +RS+L RA H+I Sbjct: 951 FKILMGDSEDCLSRKLHATIRPLYKQRFYSTIMPILQSLIIKSNSSFSRSILCRACAHII 1010 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGK----ETI 3262 S TPL V+ +AK YSLLLVLSGIL D+NGK E + Sbjct: 1011 SDTPLIVVLNDAKTVIPILMDGLSILSNDVSDKDIVYSLLLVLSGILTDKNGKTIGQEAV 1070 Query: 3263 TENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDP 3442 E HIII+H I L+SYPHMMLVRETAIQCLVAM+GLP+ RIYPMR Q+LQA+S+ALDDP Sbjct: 1071 IECAHIIIDHSIILISYPHMMLVRETAIQCLVAMSGLPHARIYPMRRQVLQAVSRALDDP 1130 Query: 3443 KRTVRQEAVRCRQAWASIASTNVHF 3517 KR VRQEAVRCRQAWAS AS +++F Sbjct: 1131 KRAVRQEAVRCRQAWASTASRSLYF 1155 >ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citrus clementina] gi|557528866|gb|ESR40116.1| hypothetical protein CICLE_v10024743mg [Citrus clementina] Length = 1155 Score = 1198 bits (3100), Expect = 0.0 Identities = 636/1165 (54%), Positives = 822/1165 (70%), Gaps = 9/1165 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+AS I H+E+FV+ S Q+++LD IA+L+K ++LTIE LV+EMG+YLTTTD+VI Sbjct: 1 MAEASQLIQHIESFVNLSSSPTHQAASLDVIASLLKKNVLTIETLVREMGMYLTTTDDVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HS++ FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGELLTHLASKPLDDATIHSMLAFFTDRLADWKALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++T +DA+ + +Y+QN+QVQSL HDRKLCFE+L CLL RY +AV++LG+DL+Y ICEA Sbjct: 121 VITTNDAKAVAQSYIQNLQVQSLAQHDRKLCFELLECLLQRYPDAVVSLGEDLLYAICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLMLTFHIVE+ LF D L LA F+ DLF++LG YFPIHFTH K + FDV Sbjct: 181 VDGEKDPHCLMLTFHIVEVAAELFSDDL--LANFAGDLFEILGCYFPIHFTHSKAEDFDV 238 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALM AF+ST LFEPFA+P +AKVDSLKYLSHC+VKYG DR+ K Sbjct: 239 KRDDLSRALMAAFSSTSLFEPFAIPLLLEKLSSSLQSAKVDSLKYLSHCTVKYGADRIEK 298 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 HA ++WSS+KDA + S EP S SES +G+ F+ N I E+L L+ Q+ GLFL Sbjct: 299 HAKAMWSSIKDAIYSS--HEPTLSFASESLDGVGFRDNVILTESLNLLDTVFKQNSGLFL 356 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S I+ DE+I + F+S++S K Y +I ++ KQKLHAVG ILSVS+ S A CN V ++FF Sbjct: 357 SWIIGDEDINLIFKSISSFKTYKEISLQSKQKLHAVGSILSVSAKASPAACNSVMESFFP 416 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 LM LGLS G+S C +DG VL KLN G LYLCIEL+ ACR L SEE Sbjct: 417 CLMHPLGLSVGNSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAP 476 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + W CLL+ +S LA ALR L TS N+ + E ++Y GVKGL IL TF L S S Sbjct: 477 ANERWYCLLQSYSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFSGGSLIISNS 536 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFENIL FTS + +E LLWKL+LKAL+ IG+FI + ++SEK + YM +V+EK+ SL Sbjct: 537 IFENILLTFTSIIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSL 596 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D SMP L+LEAI EIG G +++L+++QG+EEA+ AN +E V GN KS E++V Sbjct: 597 ASSHDFSMPFPLKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQ 656 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+KVLP H+ GFE+V LRF+VNIWN +E ++TF++ V K LLD M M+LA Sbjct: 657 LLECYSNKVLPRIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEKGLLDATMKAMKLA 716 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 V CS Q ++ QKA+ VLS ++FPL+++ S ++PI+L++ QLTQ+ S R+ I Sbjct: 717 VGSCSVESQNIVFQKAFTVLSLGTYFPLEDAAS-NIPIQLNEFQLTQETSISSSREAWIC 775 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA PQT + NVR++++LFMT LLKG+V AAQALGSM+NKL L N TE Sbjct: 776 SLFASVIIAACPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGN 835 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDD-SNEGHVNLYRNV----ALQSNAMVGLA 2734 CTLEEA+DIIF LWS S L +++ S+ G ++ R +LQ +A+ GLA Sbjct: 836 CTLEEAMDIIFDSKLWSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLA 895 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG++MRGHEKVKDI M + CLLS S + + L+QD SE V++ AADA Sbjct: 896 WIGKGLLMRGHEKVKDITMTFIECLLSNSKLGSFSLEQDYSENSSES-----VVKYAADA 950 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 F +L+ DS+ CL+++ HATIRPLYKQ F+S++MP+L S + + +SS +RS+L RA H+I Sbjct: 951 FKILMGDSEDCLSRKLHATIRPLYKQRFYSTIMPILQSLIIKSNSSFSRSILCRACAHII 1010 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN----GKETI 3262 S TPL V+ +AK YSLLLVLSGIL D+N G+E + Sbjct: 1011 SDTPLIVVLNDAKTVIPILMDGLSILSNDVSDKDIVYSLLLVLSGILTDKNVKTIGQEAV 1070 Query: 3263 TENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDP 3442 E HIII+HII L+SYPHMMLVRETAIQCLVAM+ LP+ RIYPMR ++LQA+S+ALDDP Sbjct: 1071 IECAHIIIDHIIKLISYPHMMLVRETAIQCLVAMSKLPHARIYPMRREVLQAISRALDDP 1130 Query: 3443 KRTVRQEAVRCRQAWASIASTNVHF 3517 KR VRQEAVRCRQAWAS AS +++F Sbjct: 1131 KRAVRQEAVRCRQAWASTASRSLYF 1155 >ref|XP_002515963.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] gi|223544868|gb|EEF46383.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] Length = 1174 Score = 1194 bits (3089), Expect = 0.0 Identities = 630/1179 (53%), Positives = 829/1179 (70%), Gaps = 23/1179 (1%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + ++E++VD SR +QQ+++LDAI L+KND +TI +LVKEM +YLTTTD++I Sbjct: 1 MAEPTQLTQYIESYVDASRSLSQQAASLDAIVLLLKNDAVTIGSLVKEMEMYLTTTDDII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SSKPLDN T+HSLI FFT+RL DW+AL GAL+GCLAL+RR+SN G Sbjct: 61 RARGILLLGEALSHLSSKPLDNTTIHSLIAFFTERLADWRALRGALVGCLALIRRRSN-G 119 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 I+T DA+ + +YLQN+QVQSL +DRKLCFE+L CLL+ AV +LG+DLIYGICEA Sbjct: 120 IITGIDAKVVAESYLQNLQVQSLAQYDRKLCFELLECLLENCPAAVASLGEDLIYGICEA 179 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDPQCLMLTFHIVE+LG LFPDP GP + F+ D+F +LG YFPIHFTHPK + DV Sbjct: 180 IDGEKDPQCLMLTFHIVEVLGKLFPDPSGPFSSFAGDIFSILGCYFPIHFTHPKAEDVDV 239 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDLSRALMLAF+STPLFEPFAMP PTAKVDSLKYLS+C++K+ DR+ + Sbjct: 240 KRDDLSRALMLAFSSTPLFEPFAMPLLLEKLSSSLPTAKVDSLKYLSYCTLKFRADRIAE 299 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 HA +IWSSLKDA + SS +EP+ S ES + +KNEI EAL+ LE ++Q+ FL Sbjct: 300 HAGAIWSSLKDAIY-SSGEEPMLSSDLESVDSPGSEKNEIATEALLLLENLIVQNNNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 S+I+ DEE++M F ++TS K YN+I ++ KQKLH VG IL V + VS + CNR+F+++F Sbjct: 359 SMIISDEEVKMIFNTITSYKSYNEISLQSKQKLHMVGRILYVCAKVSVSSCNRIFESYFP 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLM+ LG+ ++ C ++ V +++ N G YL I+LL ACR L+ S+ L+ Q IS Sbjct: 419 RLMEALGILVENTSGACHSNENCVKAKQPNYGSFYLSIKLLGACRDLSTSSDNLASQCIS 478 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 +T+CCLL++FS L L TST+ A + D+Y GVKGLQILATFP +L SK Sbjct: 479 TNETYCCLLQRFSTSLTETFSAALATSTSGPAQDVDMYLGVKGLQILATFPGGYLFLSKL 538 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 F+NIL F S ++V + + LLW +LKAL+QIG+F+ C++S+K + Y+ IVV K+ L Sbjct: 539 TFDNILMTFLSIITVDFNKTLLWNQALKALVQIGSFVHGCNESDKEMSYVDIVVGKMILL 598 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAF------------- 1990 D SMP SL+L AI IG G +ML+V G+EEAI AN E + Sbjct: 599 ASSPDFSMPWSLKLTAISSIGMSGQKYMLKVFLGLEEAIRANLAEIYVCMIKKKIYVLYS 658 Query: 1991 --VKGNLKSVEILVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVE 2164 V+GNLKS +IL+ LLECYS ++LPW K+EGFE+V ++F VN+WNQ+EN F Sbjct: 659 CLVQGNLKSAKILLQLLECYSDELLPWIQKTEGFEEVLMQFVVNLWNQIENFNAFTVAFH 718 Query: 2165 GK-VLLDKMMTTMRLAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQ 2341 GK LLD +M M+ AVA CS Q +I+ KAYGVLSS++F PLKES+S + ++L+ + Sbjct: 719 GKESLLDAIMKVMKDAVAFCSVESQNVIIYKAYGVLSSSTFLPLKESLSEN-SVQLECFR 777 Query: 2342 LTQDLESFSYRDECIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSM 2521 Q ++ S RDE I SLFASVIIALRPQT + N RI+L LF+T LLKGHV+ A+ALGS+ Sbjct: 778 AIQQMDRLSSRDEWIHSLFASVIIALRPQTHIPNTRIVLHLFITALLKGHVTTAEALGSL 837 Query: 2522 INKLPLMINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNLYR-- 2695 +NKL N S +CT+EEA+DIIF + L G+ + + +E ++L + Sbjct: 838 VNKLDQKSNDACISGDCTIEEAMDIIFSINLLCSFGNGSSGRFDRTRNGDE--MDLIKLC 895 Query: 2696 ----NVA-LQSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLG 2860 N+A ++ A+VGLAWIGKG++MRGHEKVKDI M+ L CLLS + SPL+ L Sbjct: 896 LDAPNLAWIKIPAIVGLAWIGKGLLMRGHEKVKDITMVFLNCLLSDGEIGASPLKHGSLE 955 Query: 2861 ECSEHDIHPIVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKE 3040 E D+ VM+SA+DAF +L+SDS++CLN+++HA +RPLYKQ FFSS+MP+L + + Sbjct: 956 NNGEQDMQQSVMKSASDAFQILMSDSELCLNRKYHAIVRPLYKQRFFSSIMPILYPLITK 1015 Query: 3041 CDSSTTRSMLYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLV 3220 DSS ++S+LYRA HVIS TPL+ + +AKK Y LLLV Sbjct: 1016 SDSSFSKSLLYRAFAHVISDTPLSVISNDAKKLVPVLLDGLTLLGKDVLDKDIMYGLLLV 1075 Query: 3221 LSGILMDENGKETITENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMR 3400 LSGIL D NGKE + EN HIII +I LV+YPHMML+RETA+QCLVAM+ LP+TRIYP+R Sbjct: 1076 LSGILTDTNGKEAVIENAHIIIKCLIELVAYPHMMLIRETAVQCLVAMSELPHTRIYPVR 1135 Query: 3401 LQLLQALSKALDDPKRTVRQEAVRCRQAWASIASTNVHF 3517 +Q+LQA+SKALDDPKR VRQEAVRCRQAWASIAS ++H+ Sbjct: 1136 IQVLQAISKALDDPKRAVRQEAVRCRQAWASIASRSLHY 1174 >ref|XP_007024310.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|590619491|ref|XP_007024311.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|508779676|gb|EOY26932.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|508779677|gb|EOY26933.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] Length = 1149 Score = 1177 bits (3044), Expect = 0.0 Identities = 621/1165 (53%), Positives = 818/1165 (70%), Gaps = 9/1165 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+P P+AKVDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL L+K ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N G LYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LAVA CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETI 3262 H+I TPL V+++AKK Y LLLVLSGILMD+NG+E + Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKNGQEAV 1064 Query: 3263 TENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDP 3442 +++ H I N +I L+ YPHMMLVRETAIQCLVA++GL Y R+YPMR Q+LQA++KALDDP Sbjct: 1065 SDSAHTITNRLIELIQYPHMMLVRETAIQCLVAISGLSYARVYPMRTQVLQAIAKALDDP 1124 Query: 3443 KRTVRQEAVRCRQAWASIASTNVHF 3517 KR VRQEAVRCRQAWASIAS ++HF Sbjct: 1125 KRAVRQEAVRCRQAWASIASRSLHF 1149 >ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica] gi|462413691|gb|EMJ18740.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica] Length = 1158 Score = 1154 bits (2986), Expect = 0.0 Identities = 607/1160 (52%), Positives = 803/1160 (69%), Gaps = 14/1160 (1%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + I H+E +VD SR +Q+++L++I +LVK+D LTIE LVKEM +YLTTTDNVI Sbjct: 1 MAETTELIQHIELYVDTSRSPTEQAASLNSIISLVKSDFLTIEVLVKEMRMYLTTTDNVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLDNAT+HSLIGFFTDRL DW+AL GAL+GCLALLRRK N G Sbjct: 61 RARGILLLAEVLTGLASKPLDNATIHSLIGFFTDRLADWRALRGALVGCLALLRRKVNAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +V+ SD + + +Y++++QVQSLG HDRKLCFE+L CLL+R+ + +LG+ YGIC+A Sbjct: 121 MVSASDGKLVAQSYIESLQVQSLGQHDRKLCFELLECLLERHPNEIASLGETFFYGICQA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLMLTF IVE L ++PDP G LA F DLF++LG YFPIHFTH K++ +V Sbjct: 181 MDGEKDPHCLMLTFPIVETLVRIYPDPSGSLASFCGDLFELLGSYFPIHFTHLKDEDAEV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDLS+ALM AF+STPLFEPF +P P AKVDSLKYL+HC+ KYG DRM K Sbjct: 241 KRDDLSKALMSAFSSTPLFEPFVIPLLLEKLSSSLPLAKVDSLKYLNHCTAKYGADRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 HA +IW SLKDA +S ++P S SE G+ FQ+NEI EAL+ L+K LQ+ LFL Sbjct: 301 HAGAIWISLKDA-ISNSLEKPDMSFTSEPLYGLGFQENEIATEALMLLQKVTLQNEALFL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE I + F S+ S + YN+IP++ KQ LHAVG IL + S S A CN VF++FF Sbjct: 360 SLIIQDEGINIVFNSIASHEHYNNIPLQGKQWLHAVGRILYIISKTSMASCNSVFESFFP 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLM+ L +S +S C L++ S+K N G LYLC+EL+AACR L + S++L+P+ + Sbjct: 420 RLMNTLEISVTNSAGDCTLNENTFPSKKFNFGALYLCVELIAACRDLIMRSKDLAPKPDT 479 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 ++T +L+ F+ L A L T+ N+ A ADIY VKGLQILATFP FLP SK Sbjct: 480 PQETCRYMLQSFADSLVNAFSSSLATNANEVAHGADIYFKVKGLQILATFPGDFLPISKF 539 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 +F NIL + S + V + + LLWKL LKAL+ IG+F+ H+SEK + YM VV+K SL Sbjct: 540 LFANILTILMSIILVDFNKILLWKLVLKALVHIGSFVDVYHESEKALGYMGAVVDKTVSL 599 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 + +D MP SL+LEA EIG G + ML+++QG+EEAI A + +V GNLKS E + Sbjct: 600 VSRDDVKMPFSLKLEAASEIGASGRNHMLKIVQGMEEAIVAKLSD-YVHGNLKSAEKTIQ 658 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECY +K+L W +++ G E+V LRF +NIWN +E+ F+ V+ + LLD M M+LA Sbjct: 659 LLECYCNKILSWINETGGLEEVLLRFVINIWNCVESCKDFSIQVQEEELLDATMMAMKLA 718 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLES--------- 2362 + CSE Q +I+ KAY V+SS+ P KES+ + I+L++L +++ +++ Sbjct: 719 IGSCSEESQNIIIHKAYSVISSSISIPFKESLDATSSIQLEELSVSEQIDNSSHRDDQID 778 Query: 2363 -FSYRDECIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPL 2539 FS RDE I+S FASVIIA+RP+ ++NV+ IL LFMT +LKG V AAQALGS+INKL Sbjct: 779 KFSLRDEWILSHFASVIIAVRPKAQIVNVKGILHLFMTTVLKGCVPAAQALGSVINKLGT 838 Query: 2540 MINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNL----YRNVAL 2707 N T S +CTLEEA+D+IF+ LW++ + L C + + S G +L N L Sbjct: 839 KSNETANSIDCTLEEAVDMIFRTKLWNLNENGVLRTCGSGNGSKVGLTDLCLGFSSNKLL 898 Query: 2708 QSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHP 2887 + +A+VGLAWIGKG+++ GHEKVKD+ ILL CLLS +R L+Q +L E Sbjct: 899 RVHAVVGLAWIGKGLLLLGHEKVKDVTKILLECLLSEGRIRAMELKQGLLENSYEQH--- 955 Query: 2888 IVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSM 3067 VMRSAADAFH+L+SDS+VCLN++FHA RPLYKQ FFS++MP+L S + + DSS RSM Sbjct: 956 SVMRSAADAFHILMSDSEVCLNRKFHAIARPLYKQRFFSTVMPILQSCIIKSDSSVCRSM 1015 Query: 3068 LYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN 3247 L+RA H+IS PL +++EAKK YSLLLVLSGIL D+N Sbjct: 1016 LFRASAHLISNAPLIVILSEAKKLMPVLLDGLSLLSEDILDKDKLYSLLLVLSGILTDKN 1075 Query: 3248 GKETITENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSK 3427 G+ + EN HI++N + L+ YPHMM VRETA+QCL+A + LPY RI+PMR Q+LQA+ K Sbjct: 1076 GQVAVIENAHILVNCLTRLIDYPHMMFVRETALQCLIATSELPYARIFPMRTQVLQAICK 1135 Query: 3428 ALDDPKRTVRQEAVRCRQAW 3487 ALDDPKR VRQEAVRCR+AW Sbjct: 1136 ALDDPKRAVRQEAVRCRRAW 1155 >gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis] Length = 1210 Score = 1141 bits (2952), Expect = 0.0 Identities = 606/1193 (50%), Positives = 806/1193 (67%), Gaps = 40/1193 (3%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S H+E++VD +R N+Q+++LD+I +LVKN L+TIE LV+EM +YLTTTD+VI Sbjct: 1 MAEPSVLTRHIESYVDTTRSLNEQAASLDSIISLVKNGLVTIEKLVREMDMYLTTTDHVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI S KPLDN T+HSLI FF DRLVDW+ L GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLAELLTNLSLKPLDNVTIHSLIDFFADRLVDWRTLRGALVGCLALLRRKSDAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITL----------- 556 +V +DA+ + L+Y++N+QVQSLG HDRKLCFE+L CLL Y V +L Sbjct: 121 MVPATDAKAVALSYVKNLQVQSLGQHDRKLCFELLECLLVTYPNEVASLLCFELLECLLV 180 Query: 557 -----------------------------GDDLIYGICEAIDGEKDPQCLMLTFHIVEIL 649 G+D+IY +CE++DGEKDP CLML FHI+ L Sbjct: 181 TYPNEVASLLCFELLECLLVTYPNEVASLGEDIIYSVCESVDGEKDPHCLMLVFHIIPAL 240 Query: 650 GHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDVKRDDLSRALMLAFASTPLFE 829 LFP+P G LA F DLF+VLG YFPIHFTH K + DVKRDDLSRALM+AF+STPL E Sbjct: 241 VGLFPNPSGSLASFPRDLFEVLGCYFPIHFTHHKVEDVDVKRDDLSRALMIAFSSTPLLE 300 Query: 830 PFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGKHATSIWSSLKDAFFRSSPQE 1009 PF +P +AK+DSLKYLS+CS+KYG DRM +HA +WSS+K+A +S +E Sbjct: 301 PFVIPLLLEKLSSSLSSAKIDSLKYLSYCSIKYGADRMARHAGILWSSIKNAI-STSLKE 359 Query: 1010 PIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFLSLILEDEEIEMTFRSVTSAK 1189 P S SES +G+ FQ+NE+ +EALV LE ++Q+ L LS+I++DE+I F ++TS Sbjct: 360 PTESFYSESIDGLGFQENEVVSEALVLLETVVMQNNNLLLSMIVDDEDISTVFNTMTSYG 419 Query: 1190 KYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFFRLMDILGLSPGSSDSGCILD 1369 +Y DIP++ KQ+LH VG IL +++ S A CNRV + FF L+DIL LS SS Sbjct: 420 RYKDIPLQGKQRLHVVGRILYITTKTSIASCNRVLETFFRPLVDILQLSIRSSSR----- 474 Query: 1370 DGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLISEEDTWCCLLKKFSGPLAGAL 1549 D F LN G LYLC+ELLAACR L I S EL+ I +T+CC+L+ F L AL Sbjct: 475 DWF-----LNFGALYLCMELLAACRDLVIYSRELASNSIPAHETFCCILQSFCVSLIDAL 529 Query: 1550 RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKSIFENILNVFTSNLSVGWEEQ 1729 IL T+ N+ A + DIY V+ LQILATFP L S ++F+NIL S + + ++ Sbjct: 530 CSILETTANEGADDVDIYLRVRSLQILATFPEDLLAISDNVFKNILTTLMSIIFKDFNQK 589 Query: 1730 LLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASLIPLEDSSMPLSLQLEAIFEI 1909 LWKL+LKAL+ IG+F+ + ++SEK Y +IVVEK+ S + +++ ++P L+LEA+ EI Sbjct: 590 FLWKLALKALVHIGSFVSR-YESEKAQSYNSIVVEKMVSWVSVDNCTLPFPLKLEAVSEI 648 Query: 1910 GTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVPLLECYSSKVLPWSHKSEGFE 2089 G G + ML ++QG+E AI + + +V GN+ S E+ + LL+ YS KV+PW H++EG E Sbjct: 649 GASGRNHMLNIVQGLEGAIFSYVSDFYVHGNVSSAEVAIQLLQFYSEKVIPWIHETEGLE 708 Query: 2090 DVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLAVAGCSEYDQGLIVQKAYGVL 2269 ++ LRF+ NIW+ +E+ ++ N V+ K LLD +M M+L V CSE Q +I+QKAY VL Sbjct: 709 EILLRFATNIWDHVESWISCNVEVQEKGLLDAIMMAMKLTVGSCSEEIQYIILQKAYTVL 768 Query: 2270 SSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECIISLFASVIIALRPQTSLLNVR 2449 SS + LK+S S+P++L++ QL Q +++ S+RDE ++SLFASVIIA+RP+T + N++ Sbjct: 769 SSNTSLLLKKSSLTSIPVQLEESQLIQHVDNISHRDELVLSLFASVIIAVRPRTEIPNMK 828 Query: 2450 IILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSECTLEEAIDIIFKMGLWSVCG 2629 IL LF+T LL+GHV +AQALGSMINK +TE S E TLE+A+DIIFK W Sbjct: 829 EILYLFLTTLLRGHVPSAQALGSMINKFDTKAKSTEISRESTLEDAMDIIFKTKSWFFRD 888 Query: 2630 SDPLWKCSAIDDSNEGHVNLYRNVALQSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCL 2809 ++ L + + + L N+ LQ +A+VGLAWIGKG+++RGHEKVKD++M LL CL Sbjct: 889 NEVLQRNGNGMGLKDLCLGLMNNIQLQVHAIVGLAWIGKGLLLRGHEKVKDVIMTLLECL 948 Query: 2810 LSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYK 2989 + S+ R + L+QD E D HP V RSAADAFH+L+SDS VCLNK FHA IRPLYK Sbjct: 949 MPDSSTRAAKLKQDSFENILEQDFHPSVRRSAADAFHILMSDSGVCLNKIFHAIIRPLYK 1008 Query: 2990 QHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXX 3169 QH FS +MP+L S LK D S +RSMLYRA H+I+ PL VV+EAKK Sbjct: 1009 QHLFSVVMPLLQSLLKNFDPSFSRSMLYRASVHIIADAPLIVVVSEAKKLISLLLEGLSI 1068 Query: 3170 XXXXXXXXXXTYSLLLVLSGILMDENGKETITENVHIIINHIIGLVSYPHMMLVRETAIQ 3349 YSLLLVLS IL D+ G+E + EN H +IN +IGL++YPHMMLVRET IQ Sbjct: 1069 LSEDILDKDQLYSLLLVLSAILTDKKGEEAVIENAHSVINCLIGLIAYPHMMLVRETTIQ 1128 Query: 3350 CLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTVRQEAVRCRQAWASIASTN 3508 CLVAM+ LP+TRIYPMR ++LQA+SKALDDPKR VRQEAVRC+QAW+ + N Sbjct: 1129 CLVAMSKLPHTRIYPMRTKVLQAMSKALDDPKRAVRQEAVRCQQAWSVLWGVN 1181 >ref|XP_007024313.1| MMS19 nucleotide excision repair protein, putative isoform 4 [Theobroma cacao] gi|508779679|gb|EOY26935.1| MMS19 nucleotide excision repair protein, putative isoform 4 [Theobroma cacao] Length = 1136 Score = 1116 bits (2886), Expect = 0.0 Identities = 591/1130 (52%), Positives = 785/1130 (69%), Gaps = 9/1130 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+P P+AKVDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL L+K ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N G LYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LAVA CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETI 3262 H+I TPL V+++AKK Y LLLVLSGILMD+NG+E + Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKNGQEAV 1064 Query: 3263 TENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLL 3412 +++ H I N +I L+ YPHMMLVRETAIQCLVA++GL Y R+YPMR Q++ Sbjct: 1065 SDSAHTITNRLIELIQYPHMMLVRETAIQCLVAISGLSYARVYPMRTQVV 1114 >ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Cucumis sativus] Length = 1147 Score = 1095 bits (2833), Expect = 0.0 Identities = 579/1166 (49%), Positives = 795/1166 (68%), Gaps = 10/1166 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ +VE+FVD SR +QQ+++L+ I +LVKN++LTIE LV+EMG+YLT TDN+I Sbjct: 1 MAELCKLTHYVESFVDVSRTPSQQATSLETITSLVKNNVLTIETLVREMGMYLTITDNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R RGI +SKPLD+AT+HSLI FFT+RL DW+AL GAL+GCLAL+RRK+NVG Sbjct: 61 RGRGILLLGELLACLASKPLDSATIHSLIAFFTERLADWKALRGALVGCLALMRRKTNVG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++ +DA+++ +Y QN+QVQSLG HDRKL FE+L CLL+ Y +AV++LGDDL+YGICEA Sbjct: 121 SISQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 IDGEKDP CL+LTF IVE++ LFPDP G LA S DLF+ LG YFPIHFTH K + DV Sbjct: 181 IDGEKDPHCLLLTFRIVELVAKLFPDPTGALASSSSDLFEFLGCYFPIHFTHGKEEDIDV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 +R+DLS ALM AF+STPLFEPFA+P P AK+DSLKYLS C+VKYG DRM K Sbjct: 241 RRNDLSHALMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMKK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 H+ +IWSS+K+ F S Q P S+ +ES N FQ+NE+T EAL L+K ++ GLFL Sbjct: 301 HSEAIWSSVKEIIFTSIGQ-PNLSINTESLNSPSFQENEMTTEALRLLQKMVVASNGLFL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 +LI+ DE+++ F + Y D P++ +Q+L+AVG IL S++ S A C+ VF+++F Sbjct: 360 TLIINDEDVKDIFNILNIYTCYKDFPLQSRQRLNAVGHILYTSASASVASCDHVFESYFH 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RL+D +G+S +D LN G LYLCIE++AACR L + S+E S Sbjct: 420 RLLDFMGISVDQYH-----NDKISPIRNLNFGALYLCIEVIAACRNLIVSSDE---NTCS 471 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 ++ +L+ FS + L + +A+ YC VKGL L+TFP P S+ Sbjct: 472 VKEKSYSMLQIFSCSVVQLLSSTFSGIVKRDLHDAEFYCAVKGLLNLSTFPVGSSPVSRV 531 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 IFE+IL F S ++V ++ LW +LKAL IG+F+ K S + YM IVVEK+A + Sbjct: 532 IFEDILLEFMSFITVNFKFGSLWNHALKALQHIGSFVDKYPGSVESQSYMHIVVEKIALM 591 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 D +PL L+LE +IG G +ML+++ GIEE I N E +V GN KSVEI++ Sbjct: 592 FSPHDEVLPLMLKLEMAVDIGRTGRSYMLKIVGGIEETIFYNLSEVYVYGNSKSVEIVLS 651 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEG--KVLLDKMMTTMR 2203 LL+CYS+K+LPW ++ FE+V LRF++NIW+Q+E TF+ ++ +VLLD M ++ Sbjct: 652 LLDCYSTKILPWFDEAGDFEEVILRFALNIWDQIEKCSTFSTSMDKCIQVLLDATMMALK 711 Query: 2204 LAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 L+V CS+ Q +IVQKA+ VL ++SF PLK ++S ++P++++ LQ Q ++ + RDE Sbjct: 712 LSVRSCSKESQNIIVQKAFNVLLTSSFSPLKVTLSNTIPVQMEGLQFLQQKDNPTSRDEW 771 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFASV IALRPQ + +VR+I++L M +G V AAQALGSMINKL + + E S Sbjct: 772 ILSLFASVTIALRPQVHVPDVRLIIRLLMLSTTRGCVPAAQALGSMINKLSVKSDKVEVS 831 Query: 2564 SECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGH--------VNLYRNVALQSNA 2719 S +LEEAIDIIFK ++C + + +G ++ ++ LQ +A Sbjct: 832 SYVSLEEAIDIIFKTE----------FRCLHNESTGDGSEMFLTDLCSSIEKSSLLQVHA 881 Query: 2720 MVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMR 2899 +VGL+WIGKG+++ GH+KV+DI M+ L+ L+S+S SPLQQ L + +E + VM+ Sbjct: 882 VVGLSWIGKGLLLCGHDKVRDITMVFLQLLVSKSRTDASPLQQFKLEKDNETSLDFAVMK 941 Query: 2900 SAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRA 3079 AA+AFH+L+SDS+ CLN++FHA +RPLYKQ FFS+MMP+ + + + D+S +R MLY+A Sbjct: 942 GAAEAFHILMSDSEACLNRKFHAIVRPLYKQRFFSTMMPIFQTLVSKSDTSLSRYMLYQA 1001 Query: 3080 IGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKET 3259 HVIS TPL A++++AKK YSLLLVLSGILMD+NG+E Sbjct: 1002 YAHVISDTPLTAILSDAKKFIPMLLDGLLTLSVNGINKDVVYSLLLVLSGILMDKNGQEA 1061 Query: 3260 ITENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDD 3439 +TEN H I++ + GL + HMMLVRETAIQCLVA++ LP+ RIYPMR Q+L +SKALDD Sbjct: 1062 VTENAHKIVDCLAGLTDFSHMMLVRETAIQCLVAVSELPHARIYPMRRQVLHVISKALDD 1121 Query: 3440 PKRTVRQEAVRCRQAWASIASTNVHF 3517 PKR+VRQEAVRCRQAWASIAS ++HF Sbjct: 1122 PKRSVRQEAVRCRQAWASIASRSLHF 1147 >ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304108 [Fragaria vesca subsp. vesca] Length = 1149 Score = 1089 bits (2816), Expect = 0.0 Identities = 587/1154 (50%), Positives = 768/1154 (66%), Gaps = 4/1154 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA + H+E +VD +RP +Q+++L+ I +LVK DLLTIE LVKEM +YLT TDNVI Sbjct: 1 MAATTQLTHHLECYVDTARPPAEQAASLNFITSLVKKDLLTIEVLVKEMRMYLTITDNVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SSKPLDNAT+HSLIGFFTDRL DW+AL GALIGCLALLRR+ N G Sbjct: 61 RARGILLLAEVLTGLSSKPLDNATIHSLIGFFTDRLSDWRALRGALIGCLALLRRQVNAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +V+ SDA+ + +Y +NI VQSL DRKLCFE+L CLL RY V +LG+DL Y I EA Sbjct: 121 MVSASDAKVVAQSYRENIPVQSLAQQDRKLCFELLECLLQRYPNEVASLGEDLFYAISEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP CL+LTFHIVE L LFPDP GPLA F DLF+ LG YFPIHFTH K++ +V Sbjct: 181 IDEEKDPHCLILTFHIVEALVKLFPDPSGPLATFCGDLFEFLGCYFPIHFTHLKDEDANV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KR+DLS+ALM AF+ST LFEPF +P P AKVDSLKYL++C+ +YG +RM K Sbjct: 241 KREDLSKALMSAFSSTALFEPFVIPLLLEKLSSSLPLAKVDSLKYLNYCASRYGAERMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 HA +IW S+K A +S + P S +E G+ F++NEI EAL+ L+ +Q+ L L Sbjct: 301 HAETIWISIKHA-ISNSLEVPAKSFTAEPLVGLGFEENEIVTEALILLQNVTMQNDALLL 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI+ DE+I S+ S + Y +IP + +Q LHAVG I + + S A CNRVF++FF Sbjct: 360 SLIVRDEDINNVINSIASHESYTNIPSQGRQSLHAVGRIFFIITKTSMASCNRVFESFFP 419 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 LM L +S G+S C L + S++ G LY C+E +AACR L + + + + + Sbjct: 420 SLMKTLEISMGNSSKDCTLKENSFSSKRFKFGALYFCVEFIAACRDLIMRTNDHDEKFGT 479 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 ++T CC+L+ + L A L + A +ADIY VKGLQ+LATFP FL K+ Sbjct: 480 ADETCCCMLQSSAPTLITAFCTTLAQISCNVADDADIYFKVKGLQMLATFPGYFLQIPKA 539 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 +FEN+L S + V +++ LLWKL+LKAL IG+F+ +SEK Y + VVEK SL Sbjct: 540 MFENVLKTLMSIILVDFDKPLLWKLALKALAHIGSFVDVHLESEKAQSYTSFVVEKTISL 599 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 P +D +P L+LEA+FEIG + MLR+IQG+E+AI AN + F+ G+LK+ E + Sbjct: 600 -PQDDFDVPFPLKLEAVFEIGASRPNHMLRIIQGLEDAIVANLSKTFIHGDLKAAEKTIQ 658 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS+K++ W ++ G E+V RF ++IWN +E + V+ K LLD MT M+LA Sbjct: 659 LLECYSNKIISWIDENGGLEEVLCRFVISIWNCLERCKDSSNQVQDKGLLDATMTAMKLA 718 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 V CSE Q +I+QKAYG LSS P K+S S KL+ L L + L+ S RDE I Sbjct: 719 VGSCSEESQNIIIQKAYGALSSGISIPFKDSTDDSSLAKLETLHLFEQLDKLSPRDEWIF 778 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 SLFASVIIA+RP+T + N + IL LFMT L+KG AAQALGS+INKL + N S+ Sbjct: 779 SLFASVIIAMRPRTPIANAKGILHLFMTALVKGCTPAAQALGSVINKLGIQSNEITISTA 838 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNLYRNVA----LQSNAMVGLAW 2737 CTLEEA+ IIF+ LW++ + L N G L V+ LQ + + GLAW Sbjct: 839 CTLEEAMGIIFRSKLWNIGENGVLRGSGTSHSRNVGLTELCLGVSSNKLLQVHVITGLAW 898 Query: 2738 IGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADAF 2917 IGKG+++ G+E+VKD+ I+L CLL+ + TS L+Q +L SE P VMR+AADAF Sbjct: 899 IGKGLLLIGNEQVKDVTKIILDCLLADDKVDTSELRQGLLETSSE---QPSVMRTAADAF 955 Query: 2918 HVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVIS 3097 H+L+SDSDVCLN++FHA IRPLYKQ FFS++MP+L S + + DSS +RSML+RA H+IS Sbjct: 956 HILMSDSDVCLNRKFHANIRPLYKQRFFSTVMPILHSLIVKSDSSLSRSMLFRASAHLIS 1015 Query: 3098 GTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETITENVH 3277 PL +++EAKK YSLLLVLSGIL D+ G+E + EN H Sbjct: 1016 NAPLIVILSEAKKLMKVLLDGLSILSDDILDKDKLYSLLLVLSGILTDKCGEEAVLENAH 1075 Query: 3278 IIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTVR 3457 III+ + LV+YPHMMLVRETAIQCL+AM+ LP RI+P++ Q+LQA+ KALDDPKR VR Sbjct: 1076 IIIDCLSRLVAYPHMMLVRETAIQCLLAMSELPRPRIFPLKSQVLQAIFKALDDPKRAVR 1135 Query: 3458 QEAVRCRQAWASIA 3499 +EAVRCR AW S A Sbjct: 1136 EEAVRCRHAWTSTA 1149 >ref|XP_007024314.1| MMS19 nucleotide excision repair protein, putative isoform 5 [Theobroma cacao] gi|508779680|gb|EOY26936.1| MMS19 nucleotide excision repair protein, putative isoform 5 [Theobroma cacao] Length = 1157 Score = 1074 bits (2777), Expect = 0.0 Identities = 570/1103 (51%), Positives = 761/1103 (68%), Gaps = 9/1103 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+P P+AKVDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL L+K ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N G LYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LAVA CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETI 3262 H+I TPL V+++AKK Y LLLVLSGILMD+NG+E + Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKNGQEAV 1064 Query: 3263 TENVHIIINHIIGLVSYPHMMLV 3331 +++ H I N +I L+ YPHMM++ Sbjct: 1065 SDSAHTITNRLIELIQYPHMMVI 1087 >ref|XP_006853692.1| hypothetical protein AMTR_s00056p00136660 [Amborella trichopoda] gi|548857353|gb|ERN15159.1| hypothetical protein AMTR_s00056p00136660 [Amborella trichopoda] Length = 1160 Score = 1068 bits (2763), Expect = 0.0 Identities = 577/1170 (49%), Positives = 777/1170 (66%), Gaps = 14/1170 (1%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA +SWIPHVE FVDPSR +QQ ++L+ IATL+K D LT+EALV+EM +YLTTTD + Sbjct: 1 MASTASWIPHVEVFVDPSRSKDQQDASLNVIATLMKKDTLTLEALVQEMEVYLTTTDASV 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 RSRGI +SKP+D A +HSL FFT RL DWQAL GA IGCLALLRRKS+ G Sbjct: 61 RSRGILLVAELLSYLASKPIDGAIIHSLTEFFTSRLADWQALRGAFIGCLALLRRKSHTG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 ++ +D L ++L NIQVQ+L HDR LC E+L CLLD++ + L D+L+YG+ A Sbjct: 121 TISDNDLINLTKSFLLNIQVQALALHDRLLCLELLECLLDQFPSVISVLDDELVYGVLAA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP+CLML FH+VE+L +FPDP ++D FD+LGRYFPI+FT P D D+ Sbjct: 181 IDEEKDPRCLMLVFHVVELLVQVFPDP-----SVAKDAFDILGRYFPIYFTRPNVDAIDI 235 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KR+DLSR +M AF+S+PLFEPF +P AK+D+LKYLSHC+ +YG RM Sbjct: 236 KREDLSRRMMNAFSSSPLFEPFCIPLLLEKLSSSLEMAKLDALKYLSHCAPRYGPSRMAS 295 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEK----FMLQDG 1117 HA +IWS+LKD F S P S+I E P+ + Q+NE+ EALVCLE F + Sbjct: 296 HAYAIWSALKDVIFNLSSHGPSISIICELPDNLGSQENEVVKEALVCLENCVLVFDIPKD 355 Query: 1118 GLFLSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQ 1297 FL LI+EDE++EMTFRS+TS K D+P E KQ+ A+ IL S+ VSSACCNRVF Sbjct: 356 ETFLRLIVEDEDLEMTFRSITSEKCNKDLPHERKQRFCALRNILFTSAKVSSACCNRVFG 415 Query: 1298 NFFFRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGL--AIGSEEL 1471 +FF RLM+ L +S S + + +L+ L++C+EL+AA L A+ S+E+ Sbjct: 416 SFFQRLMNFLRISSLDSPFDWASNRNSYVCVELDFEALHICLELIAASNHLANALSSQEV 475 Query: 1472 SPQLISEEDTWCCLLKKFSGPLAGALRYILVTSTNQ-------HACEADIYCGVKGLQIL 1630 P +D W LL+ FSG L AL +V + + E D+ V GLQIL Sbjct: 476 CP--TPTQDPWLLLLQSFSGCLVFALGSSVVANKSSSIREMSPSIGEEDLPLKVTGLQIL 533 Query: 1631 ATFPSCFLPTSKSIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVI 1810 ATFP + P S+ FENIL VF S ++ +EE LW +LKAL+Q+G I++ HDS++ + Sbjct: 534 ATFPDSYSPLSRDAFENILAVFMSVITERYEETSLWTSTLKALVQVGMSIERYHDSQRGV 593 Query: 1811 CYMTIVVEKLASLIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAF 1990 C+MTIV+EKL S + + PLSL L+AI EI +GL FM RV +G EA+S NF EA Sbjct: 594 CFMTIVIEKLLSYLFNRSTFPPLSLNLKAISEIAMMGLCFMKRVTKGFGEALSTNFLEAV 653 Query: 1991 VKGNLKSVEILVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGK 2170 +GN KS E+ + +L+CYS +LPW EGFE+ A+ + +IW+ ME +++F G GK Sbjct: 654 AEGNTKSAEMAIEILKCYSLYLLPWLQNKEGFEEDAMHLATDIWSYME-SISFCIGSHGK 712 Query: 2171 VLLDKMMTTMRLAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQ 2350 LL+ M M+LAV C+ Q IV KA+ +L+S++ + +K+S+S S ++L+ L++T Sbjct: 713 SLLEATMMAMKLAVGCCTMNQQSSIVSKAHNILASSTLYLVKDSMSLSTSVQLEKLKITP 772 Query: 2351 DLESFSYRDECIISLFASVIIALRPQTSLLNVRIILKLFM-TGLLKGHVSAAQALGSMIN 2527 + S + +D +ISLFASV+IAL+PQT + ++RIIL+LFM LLKG ++AQALGS++N Sbjct: 773 ESVSSACKDGWLISLFASVVIALQPQTVIPDLRIILELFMIVVLLKGDEASAQALGSIVN 832 Query: 2528 KLPLMINTTEASSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGHVNLYRNVAL 2707 K P + + E S CTL EA+DI+ + G + + K +D++ E +L + Sbjct: 833 KWP--VKSNEVSGACTLGEAMDIMVERGFRPIIFNVNQKKHEDVDNNKEIVSHLPISNDS 890 Query: 2708 QSNAMVGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHP 2887 + +A+ GLAWIGKG++MRGHEKVKDI ++LL C+L MR+ P Q D+LG I+ Sbjct: 891 RVHALFGLAWIGKGLVMRGHEKVKDITLLLLSCVLPTGGMRSMPSQHDVLGNDGGESINI 950 Query: 2888 IVMRSAADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSM 3067 V RSAADAFH+++SDS+ +N++FHATIRPLYKQ F S++MP+LLSS+KE SS T+SM Sbjct: 951 AVARSAADAFHIIMSDSETSVNQKFHATIRPLYKQRFCSTVMPILLSSIKESHSSITKSM 1010 Query: 3068 LYRAIGHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN 3247 L+R GH+I GTPLAA++ EA K Y LL+VLSGILMDE Sbjct: 1011 LFRTFGHIIIGTPLAAILIEAPKIVPPLLDGLSMLTLDVQNKDQIYDLLVVLSGILMDET 1070 Query: 3248 GKETITENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSK 3427 GKE + EN H II + LV+YPH+M+VRETAIQCLVAM LP+ RIYPMRLQ+L+ +SK Sbjct: 1071 GKEAVVENAHTIIGCLSKLVTYPHLMIVRETAIQCLVAMAALPHARIYPMRLQVLETVSK 1130 Query: 3428 ALDDPKRTVRQEAVRCRQAWASIASTNVHF 3517 ALDD KR+VRQEAVRC WAS+AS ++ F Sbjct: 1131 ALDDQKRSVRQEAVRCHHVWASMASRSLRF 1160 >ref|XP_003546956.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X1 [Glycine max] Length = 1135 Score = 1068 bits (2762), Expect = 0.0 Identities = 574/1157 (49%), Positives = 771/1157 (66%), Gaps = 8/1157 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S QQ+S+L+A+A+LV D L +EALV+E+ +YLTTTDNV+ Sbjct: 1 MAETTQLTRHIESYVDSSSTPAQQASSLNAVASLVNTDALPLEALVRELEMYLTTTDNVV 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPL++AT+HSL+GFF DRL DW+A+ GAL+GCLAL+RRKS VG Sbjct: 61 RARGILLLAEVMTRIESKPLNSATIHSLVGFFKDRLADWRAVQGALVGCLALIRRKSVVG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +VT SDA T+ ++LQ +QVQSLG +DRKLCFE+L CLL+RY +AV TLG+DLIYGICEA Sbjct: 121 MVTDSDATTIAQSFLQYMQVQSLGQYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP CL L FHIV L L PD LA +++D+FD+L YFPIHFTHP + V Sbjct: 181 IDAEKDPDCLKLAFHIVASLAQLNPDSSSLLASYAKDVFDILEPYFPIHFTHPSSGDTHV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 +RDDLS +LM AF+STPLFEPF +P +AK+DSLKYL CS KYG +R+ K Sbjct: 241 QRDDLSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 +A +IWSSLKD + EP FS +G+ F +NE EAL L++ + Q+ L + Sbjct: 301 YAGAIWSSLKDT-LSTYLGEPDFSFTIAPVDGIGFPENEFVIEALSLLQQLIAQNSSLLV 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE++ F ++TS + Y+ IPV+ K+KLHA+G IL ++S + + CN +F++ F Sbjct: 360 SLIIDDEDVNTIFSTITSYETYDAIPVQEKKKLHAIGRILYITSKTTISSCNAMFESLFT 419 Query: 1310 RLMDILGLS---PGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQ 1480 R+MD LG S P S S++L G LYLCIELLA CR L +GSEE + Q Sbjct: 420 RMMDNLGFSVRFPNGDISP---------SQRLKFGFLYLCIELLAGCRELIVGSEEPALQ 470 Query: 1481 LISEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPT 1660 + E +T C +L FS PL A +L S ++ + D Y GVKGLQILA F S P Sbjct: 471 YVFEHETCCTMLHSFSTPLFNAFGSVLAVSADRGPLDPDTYVGVKGLQILAMFHSDVFPI 530 Query: 1661 SKSIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKL 1840 KSIFENIL F S + + + +LW+ +LKAL +G+F +K +SEK + Y +VVEK+ Sbjct: 531 QKSIFENILKKFMSIIIEDFNKTILWEAALKALHHVGSFFQKFCESEKAMSYRNLVVEKI 590 Query: 1841 ASLIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEI 2020 ++ L+D ++ SL++EA+ IG G+ ML ++QG+ A+ AN + +V NL+S EI Sbjct: 591 VEILSLDDITLSFSLKVEALLNIGKTGMKNMLTILQGLGRAVFANLSKVYVHRNLRSSEI 650 Query: 2021 LVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTM 2200 V LLECYS ++LPW H++ G ED ++F+V+IW+Q N M + EGK LLD MM M Sbjct: 651 AVQLLECYSCQLLPWIHENGGSEDFVMQFAVDIWSQAGNCMDLSTPFEGKGLLDAMMKAM 710 Query: 2201 RLAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDE 2380 RL+V CS Q LI++KAY VLSS + F LKE ++ L LT S RDE Sbjct: 711 RLSVGSCSVESQNLIIRKAYSVLSSHTNFQLKE---------VERLPLTPGKYDISLRDE 761 Query: 2381 CIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEA 2560 IISLFASV+IA+ P+T + N+R+++ LF+ LL+G V AQALGS++NKL +T E Sbjct: 762 GIISLFASVVIAVCPKTYIPNIRVLVHLFIITLLRGVVPVAQALGSILNKLVSTSSTAEN 821 Query: 2561 SSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEG-----HVNLYRNVALQSNAMV 2725 SS+ TLEEA+D IF + S +D L +C+ + NE + + + LQ NA+ Sbjct: 822 SSDLTLEEALDAIFNTKI-SFSSTDMLQRCNGTSNGNEMVFTDICLGIANDRMLQINAIC 880 Query: 2726 GLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSA 2905 GL+W+GKG+++RGHEK+KDI MI + CL+S SPL +D L E +V++ A Sbjct: 881 GLSWMGKGLLLRGHEKIKDITMIFMECLIS-GTKSASPLIKDSLENTEEQIQDLLVIKCA 939 Query: 2906 ADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIG 3085 DAFHVL+SDS+VCLN++FHATIRPLYKQ FFSS+MP+L + + SS +RS LYRA Sbjct: 940 TDAFHVLMSDSEVCLNRKFHATIRPLYKQRFFSSVMPILQQIITKSHSSLSRSFLYRAFA 999 Query: 3086 HVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETIT 3265 H++S TP+ A+V+EAKK Y LLLVLSGILM++NG+E + Sbjct: 1000 HIMSDTPMVAIVSEAKK-LIPVLLDCLSMLTEIQDKDMLYGLLLVLSGILMEKNGQEAVV 1058 Query: 3266 ENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPK 3445 EN HIIIN +I LV YPH MLVRETAIQCLVA++ LP+ RIYPMR Q+L+A+SK LDD K Sbjct: 1059 ENAHIIINCLIKLVGYPHKMLVRETAIQCLVALSELPHARIYPMRTQVLRAISKCLDDSK 1118 Query: 3446 RTVRQEAVRCRQAWASI 3496 R VR EAV+CRQ WAS+ Sbjct: 1119 RAVRHEAVKCRQTWASM 1135 >ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum lycopersicum] Length = 1153 Score = 1063 bits (2750), Expect = 0.0 Identities = 561/1160 (48%), Positives = 775/1160 (66%), Gaps = 3/1160 (0%) Frame = +2 Query: 47 LMAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNV 226 ++++ + + +E++V S QQ++++DAIA L+KNDLL++E LV+EM +YLTTTDN+ Sbjct: 15 VLSQPTELVRIIESYVSSSSSEAQQAASIDAIALLLKNDLLSLETLVREMEMYLTTTDNI 74 Query: 227 IRSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNV 406 IRSRGI SKPL + + SL+ FFT+RL DW+ALHGAL+GCLALLRRK+ V Sbjct: 75 IRSRGILLLGELLMRLMSKPLGDTAISSLMEFFTERLADWKALHGALVGCLALLRRKTGV 134 Query: 407 GIVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICE 586 G+++ S A+ + +YL+ +QVQSLG HDRKLC +IL CLLDRY +A+ +LGDDL+YGICE Sbjct: 135 GMISRSQAKAVAESYLKTLQVQSLGQHDRKLCLQILECLLDRYRDALFSLGDDLVYGICE 194 Query: 587 AIDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFD 766 AIDGEKDPQCLML FHIVE+L LFP+ GPL F+ DLF++L YFPIHFTHPK+D D Sbjct: 195 AIDGEKDPQCLMLIFHIVELLAQLFPEASGPLENFAGDLFEILECYFPIHFTHPKSDDVD 254 Query: 767 VKRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMG 946 +KR++LSRALMLAFASTPLFEP +P P+AKV+SLKYLS C++KYG DRM Sbjct: 255 IKREELSRALMLAFASTPLFEPSVIPLLLDKLSSSLPSAKVESLKYLSFCTLKYGGDRME 314 Query: 947 KHATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLF 1126 K+ S+WS+LKDA F +SPQ + S S+ +G+ F ++EI +AL L+ + Q F Sbjct: 315 KYTKSLWSALKDALF-TSPQSTL-SEDSDPIDGLGFHESEIMTQALEFLQVLVRQHNASF 372 Query: 1127 LSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFF 1306 LSLI+ D +I S + +N + + KQ+LHAVG +LSV S++ CN+VF++FF Sbjct: 373 LSLIMGDGDISTFLNSFSQFDNFNSLSTQYKQRLHAVGHVLSVCIKASASSCNKVFESFF 432 Query: 1307 FRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLI 1486 RL+D L LS +S G + + N G LYLC+ELLAACR L + S+E++ Sbjct: 433 PRLVDALRLSVDNS-HGIVHS---AVDANFNFGALYLCVELLAACRQLVVSSDEVASAHD 488 Query: 1487 SEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 D+WC +L FS L ++ S + A +Y VKGL+ILATFP F+ SK Sbjct: 489 LARDSWCQILHSFSTSLCNVFFCLIRASCVESTRNAYVYAAVKGLEILATFPGSFISVSK 548 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 ++ENIL TS + + ++ LWK +LKAL++I F+ K H+ EK + +IV +K+ S Sbjct: 549 LMYENILLTLTSIIESEFNKKFLWKAALKALVEISLFVNKYHEDEKAASFNSIVKQKIVS 608 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 LI +D +MP SL+LEA+F+IG G +FML V+ +E+ ISAN E V G+ + + Sbjct: 609 LISSDDLNMPQSLKLEAVFDIGLTGKNFMLSVVSELEKTISANLSEILVHGDRRLAGLTA 668 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRL 2206 LLECYS+KVLPW H + G ++V+L F+VNI+ +ME+N + + EGK LL M M+ Sbjct: 669 GLLECYSNKVLPWFHVNGGADEVSLSFAVNIFTKMEHNTSLSLEAEGKELLGATMAAMKQ 728 Query: 2207 AVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECI 2386 A+ CS Q ++QKA V+ + SFF ++ + QL Q E S +DE I Sbjct: 729 AMTCCSVESQEKVLQKAIDVMETNSFF-FSNNLILGTDLFNKKTQLGQTSEGLSCQDEWI 787 Query: 2387 ISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASS 2566 ISLFASV+IALRPQT + N+R++L+L LL+GH+ +AQALGS++NKLPL I S Sbjct: 788 ISLFASVVIALRPQTQIPNIRLLLQLLAMTLLEGHIPSAQALGSLVNKLPLNI-----SE 842 Query: 2567 ECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEGH---VNLYRNVALQSNAMVGLAW 2737 +C+L+E ID++ K LW S + N G ++ R+ +L S+A++GLAW Sbjct: 843 DCSLKELIDMLLKNVLWR--------NISIGKEGNHGDAVAMSNLRSSSLNSHAVIGLAW 894 Query: 2738 IGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADAF 2917 IGKG++MRGHEK+KD+ M L CL+S + D + + +E + + +SAADAF Sbjct: 895 IGKGLLMRGHEKLKDVTMTFLSCLVSNEDQGNLLPFNDQMKDPAELKVFSL-RKSAADAF 953 Query: 2918 HVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVIS 3097 H+++SDSD CLN+ +HA +RPLYKQ FF+ MMP+ LS++ +CDSST+R LY+A H++S Sbjct: 954 HIVMSDSDACLNRNYHAIVRPLYKQRFFNIMMPMFLSAIAKCDSSTSRCFLYQAFAHLVS 1013 Query: 3098 GTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETITENVH 3277 TPL AVV +AKK YS+L+VLSGIL D+NG+ETI EN Sbjct: 1014 ETPLVAVVGDAKKVLPVLMDCFLILSKDISHKEIIYSVLIVLSGILTDKNGQETIIENAP 1073 Query: 3278 IIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTVR 3457 ++I +I L SYP+MM++RETAIQC AM+ LP+ RIYPMR Q+LQA++KALDDPKR VR Sbjct: 1074 MVIRRLIELTSYPYMMVIRETAIQCFGAMSELPHARIYPMRTQVLQAITKALDDPKRVVR 1133 Query: 3458 QEAVRCRQAWASIASTNVHF 3517 EAV+CR AWASIAS ++HF Sbjct: 1134 LEAVKCRLAWASIASRSIHF 1153 >ref|XP_006597167.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Glycine max] Length = 1133 Score = 1062 bits (2746), Expect = 0.0 Identities = 574/1157 (49%), Positives = 770/1157 (66%), Gaps = 8/1157 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S QQ+S+L+A+A+LV D L +EALV+E+ +YLTTTDNV+ Sbjct: 1 MAETTQLTRHIESYVDSSSTPAQQASSLNAVASLVNTDALPLEALVRELEMYLTTTDNVV 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPL++AT+HSL+GFF DRL DW+A+ GAL+GCLAL+RRKS VG Sbjct: 61 RARGILLLAEVMTRIESKPLNSATIHSLVGFFKDRLADWRAVQGALVGCLALIRRKSVVG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +VT SDA T+ ++LQ +QVQSLG +DRKLCFE+L CLL+RY +AV TLG+DLIYGICEA Sbjct: 121 MVTDSDATTIAQSFLQYMQVQSLGQYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP CL L FHIV L L PD LA +++D+FD+L YFPIHFTHP + V Sbjct: 181 IDAEKDPDCLKLAFHIVASLAQLNPDSSSLLASYAKDVFDILEPYFPIHFTHPSSGDTHV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 +RDDLS +LM AF+STPLFEPF +P +AK+DSLKYL CS KYG +R+ K Sbjct: 241 QRDDLSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 +A +IWSSLKD + EP FS +G+ F +NE EAL L++ + Q+ L + Sbjct: 301 YAGAIWSSLKDT-LSTYLGEPDFSFTIAPVDGIGFPENEFVIEALSLLQQLIAQNSSLLV 359 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE++ F ++TS + Y+ IPV+ K+KLHA+G IL ++S + + CN +F++ F Sbjct: 360 SLIIDDEDVNTIFSTITSYETYDAIPVQEKKKLHAIGRILYITSKTTISSCNAMFESLFT 419 Query: 1310 RLMDILGLS---PGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQ 1480 R+MD LG S P S S++L G LYLCIELLA CR L +GSEE + Q Sbjct: 420 RMMDNLGFSVRFPNGDISP---------SQRLKFGFLYLCIELLAGCRELIVGSEEPALQ 470 Query: 1481 LISEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPT 1660 + E +T C +L FS PL A +L S ++ + D Y GVKGLQILA F S P Sbjct: 471 YVFEHETCCTMLHSFSTPLFNAFGSVLAVSADRGPLDPDTYVGVKGLQILAMFHSDVFPI 530 Query: 1661 SKSIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKL 1840 KSIFENIL F S + + + +LW+ +LKAL +G+F +K +SEK + Y +VVEK+ Sbjct: 531 QKSIFENILKKFMSIIIEDFNKTILWEAALKALHHVGSFFQKFCESEKAMSYRNLVVEKI 590 Query: 1841 ASLIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEI 2020 ++ L+D ++ SL++EA+ IG G+ ML ++QG+ A+ AN + V NL+S EI Sbjct: 591 VEILSLDDITLSFSLKVEALLNIGKTGMKNMLTILQGLGRAVFANLSK--VHRNLRSSEI 648 Query: 2021 LVPLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTM 2200 V LLECYS ++LPW H++ G ED ++F+V+IW+Q N M + EGK LLD MM M Sbjct: 649 AVQLLECYSCQLLPWIHENGGSEDFVMQFAVDIWSQAGNCMDLSTPFEGKGLLDAMMKAM 708 Query: 2201 RLAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDE 2380 RL+V CS Q LI++KAY VLSS + F LKE ++ L LT S RDE Sbjct: 709 RLSVGSCSVESQNLIIRKAYSVLSSHTNFQLKE---------VERLPLTPGKYDISLRDE 759 Query: 2381 CIISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEA 2560 IISLFASV+IA+ P+T + N+R+++ LF+ LL+G V AQALGS++NKL +T E Sbjct: 760 GIISLFASVVIAVCPKTYIPNIRVLVHLFIITLLRGVVPVAQALGSILNKLVSTSSTAEN 819 Query: 2561 SSECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEG-----HVNLYRNVALQSNAMV 2725 SS+ TLEEA+D IF + S +D L +C+ + NE + + + LQ NA+ Sbjct: 820 SSDLTLEEALDAIFNTKI-SFSSTDMLQRCNGTSNGNEMVFTDICLGIANDRMLQINAIC 878 Query: 2726 GLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSA 2905 GL+W+GKG+++RGHEK+KDI MI + CL+S SPL +D L E +V++ A Sbjct: 879 GLSWMGKGLLLRGHEKIKDITMIFMECLIS-GTKSASPLIKDSLENTEEQIQDLLVIKCA 937 Query: 2906 ADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIG 3085 DAFHVL+SDS+VCLN++FHATIRPLYKQ FFSS+MP+L + + SS +RS LYRA Sbjct: 938 TDAFHVLMSDSEVCLNRKFHATIRPLYKQRFFSSVMPILQQIITKSHSSLSRSFLYRAFA 997 Query: 3086 HVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETIT 3265 H++S TP+ A+V+EAKK Y LLLVLSGILM++NG+E + Sbjct: 998 HIMSDTPMVAIVSEAKK-LIPVLLDCLSMLTEIQDKDMLYGLLLVLSGILMEKNGQEAVV 1056 Query: 3266 ENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPK 3445 EN HIIIN +I LV YPH MLVRETAIQCLVA++ LP+ RIYPMR Q+L+A+SK LDD K Sbjct: 1057 ENAHIIINCLIKLVGYPHKMLVRETAIQCLVALSELPHARIYPMRTQVLRAISKCLDDSK 1116 Query: 3446 RTVRQEAVRCRQAWASI 3496 R VR EAV+CRQ WAS+ Sbjct: 1117 RAVRHEAVKCRQTWASM 1133 >ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris] gi|561023869|gb|ESW22599.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris] Length = 1145 Score = 1052 bits (2721), Expect = 0.0 Identities = 566/1160 (48%), Positives = 776/1160 (66%), Gaps = 9/1160 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPS-NQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNV 226 MA+++ H+E++VD S S + Q ++L+A+A+LVK D+L +EALVKE+G+YLTTTD+V Sbjct: 1 MAESTQLTRHIESYVDASSSSPSLQVASLNAVASLVKTDVLPLEALVKELGMYLTTTDDV 60 Query: 227 IRSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNV 406 IR+RGI SKPLD+AT+HSL+GFF +RL DW+A+ GAL+GCLAL+RRKS + Sbjct: 61 IRARGILLLAEVITRTESKPLDSATIHSLVGFFKERLADWRAVRGALLGCLALIRRKSVL 120 Query: 407 GIVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICE 586 GIVT +DA+ + ++ Q +QVQSLG DRKLCFE+L CLL+ Y +A+ LGD LIYGICE Sbjct: 121 GIVTSTDAKAIAQSFFQYMQVQSLGQSDRKLCFELLDCLLEHYPDAITPLGDGLIYGICE 180 Query: 587 AIDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFD 766 AID EKDP+CLML FHIV+ L+P+ G LA +++D+FD+L YFPIHFTHP N Sbjct: 181 AIDAEKDPECLMLAFHIVQSWAQLYPESSGLLATYAKDVFDILEPYFPIHFTHPTNADTP 240 Query: 767 VKRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMG 946 V+RDDLSR+LM AF+STPLFEPF +P +AK+DSLKYL CS KYG +R+ Sbjct: 241 VQRDDLSRSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIA 300 Query: 947 KHATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLF 1126 K+A SIWSS+KD + EP FSL +G+ F +NE EAL L++ ++Q+ L Sbjct: 301 KYANSIWSSIKD-ILSTYLGEPDFSLNIAPADGIGFPENEFVVEALSLLQQLIVQNSSLL 359 Query: 1127 LSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFF 1306 + LI++DE++ + F ++ S + Y+ IPV+ K+KLHA+G IL +++ + CN VF++ F Sbjct: 360 VCLIVDDEDVNIFFNTIASYEIYDAIPVQEKKKLHAIGRILYIAAKSTVTSCNAVFESLF 419 Query: 1307 FRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLI 1486 ++MD LG+S + DS + S+++ G LYLCIELL R L +GS+E + Q + Sbjct: 420 SKIMDNLGVSVSNIDSSA--NGDISSSQRVKIGFLYLCIELLVGFRELIVGSKEPALQYV 477 Query: 1487 SEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E +T C +L FS L A +L S ++ + D Y GVKGLQILA F S K Sbjct: 478 IEHETCCTMLHSFSSSLFNAFGLVLAESADRCPLDPDTYIGVKGLQILAMFHSDVFSMQK 537 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 SIFENIL F S + + +++LW+ +LKAL +G+F+++ H+SEK + Y ++VVEK+ Sbjct: 538 SIFENILKKFMSIIIEDFNKKILWEAALKALCHVGSFVQEFHESEKAMSYGSLVVEKIVE 597 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 + L+D +P SL++EA+ IG G+ ML +QG+ +A+ AN + V +L+S EI V Sbjct: 598 FLFLDDIIVPFSLKVEALSNIGMTGMKNMLTSLQGMRKAVFANLSK--VHTDLRSSEIAV 655 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRL 2206 LLECYS K+LPW+H++ G ED AL+F+V+IW+Q N M + E K LL +M M+L Sbjct: 656 QLLECYSCKLLPWTHENGGSEDFALQFAVDIWSQAGNCMVSSTSFEEKGLLYALMKAMKL 715 Query: 2207 AVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECI 2386 +V CS Q LI+QKAY +LSS + F LKE L+ L L+ + S DE I Sbjct: 716 SVGICSVESQNLIIQKAYSILSSRTNFQLKE---------LERLPLSPGKYNISLTDEWI 766 Query: 2387 ISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASS 2566 ISLFASV+IA+ P+T + N+R+++ LF+ LL+G V AQALGS++NKL N+ E SS Sbjct: 767 ISLFASVVIAVCPKTLIPNIRVLVNLFIVTLLRGIVPVAQALGSLLNKLVSTSNSAENSS 826 Query: 2567 ECTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNE--------GHVNLYRNVALQSNAM 2722 + TLEEA+D IF +W D L +C+ + E G N + LQ NA+ Sbjct: 827 DITLEEALDAIFNTKIW-FSSIDILQRCNGTSNGKEIVLTDICLGFAN---DKLLQINAI 882 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 GL+WIGKG+++RGHE +KDI M + CL+ + P +D LG E P+VM+S Sbjct: 883 CGLSWIGKGLLLRGHEGIKDITMTFIECLIP-GTKSSLPFFKDSLGNTEEQIQDPLVMKS 941 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAFHVL+SDS+VCLNK+FHATIRPLYKQ FFSSMMP+ L + + SS +RS LYRA+ Sbjct: 942 AADAFHVLMSDSEVCLNKKFHATIRPLYKQRFFSSMMPIFLQLITKAYSSLSRSFLYRAL 1001 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETI 3262 H+IS TP+ AV+ +AKK Y LLLVLS IL ++NG+E + Sbjct: 1002 AHIISDTPMVAVLNDAKKLIPVLLDCFSMLTEDIQDKDMLYGLLLVLSCILTEKNGQEAV 1061 Query: 3263 TENVHIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDP 3442 TEN HIIIN +I LV YPH MLVRETAIQCLVA++ LP++RIYPMR Q+LQA+SK LDD Sbjct: 1062 TENAHIIINCLIKLVDYPHKMLVRETAIQCLVALSELPHSRIYPMRTQVLQAISKCLDDT 1121 Query: 3443 KRTVRQEAVRCRQAWASIAS 3502 +R VR EAV+CRQ WAS++S Sbjct: 1122 RRVVRYEAVKCRQTWASMSS 1141 >ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein mms19-like isoform X1 [Glycine max] Length = 1132 Score = 1044 bits (2699), Expect = 0.0 Identities = 571/1153 (49%), Positives = 769/1153 (66%), Gaps = 5/1153 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S P++Q +S L+AIA+LV D LT+EAL++E+ +YLTTTDNV+ Sbjct: 1 MAETTQLTRHIESYVDSSSPTHQAAS-LNAIASLVNTDALTLEALIRELEMYLTTTDNVV 59 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPL++AT+HSL+GFF DRL DW+A+ GAL+GCLAL+RRKS VG Sbjct: 60 RARGILLLAEVMTHIESKPLNSATIHSLVGFFKDRLADWRAVRGALVGCLALIRRKSVVG 119 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNH-DRKLCFEILICLLDRYSEAVITLGDDLIYGICE 586 +VT SDA T+ ++LQ +QVQSLG H DRKLCFE+L CLL+RY +AV TLG+DLIYGICE Sbjct: 120 MVTDSDATTIAQSFLQYMQVQSLGQHYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICE 179 Query: 587 AIDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFD 766 AID EKDP CL L FHIVE L L PD G LA F++D+FD+L YFPIHFT P + Sbjct: 180 AIDAEKDPDCLKLAFHIVESLAQLNPDSSGLLASFAKDVFDILEPYFPIHFTRPSSGDTH 239 Query: 767 VKRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMG 946 V+RD LS +LM AF+STPLFEPF +P +AK+DSLKYL CS KYG R+ Sbjct: 240 VQRD-LSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAGRIA 298 Query: 947 KHATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLF 1126 K+A +IWSSLKD + EP FS +G+ F +NE EAL L++ ++Q+ L Sbjct: 299 KYAGAIWSSLKDTL-STYLGEPDFSFTIAPVDGIGFPENEFVLEALSLLQQLIVQNSSLL 357 Query: 1127 LSLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFF 1306 +SLI++DE++ F ++ S + Y+ IPV+ K+KLHA+G IL++++ + + CN VF++ F Sbjct: 358 VSLIIDDEDVNSIFSTIASYETYDAIPVQEKKKLHAIGRILNITAKTTISSCNAVFESLF 417 Query: 1307 FRLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLI 1486 RLMD LG S +S S+++ G LY+CIELLA CR L +GS+E + Q + Sbjct: 418 SRLMDNLGFSVRFPNSD------IPPSQRVKFGFLYVCIELLAGCRELIVGSDEPALQYV 471 Query: 1487 SEEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E +T C +L +FS PL A +L S ++ + D Y GVKGLQILA F S P K Sbjct: 472 FEHETCCTMLHRFSTPLFNAFGSVLAVSADRCPLDPDTYIGVKGLQILAMFGSDVFPIQK 531 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 S+FENIL F S + + + +LW+ +LKAL Q+G+F++K H+SEK + Y +VVEK+ Sbjct: 532 SVFENILKKFMSIIVEDFNKTILWEAALKALYQVGSFVQKFHESEKAMSYRNLVVEKIVE 591 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 ++ L+D ++P SL+LEA+ IG G+ ML ++QG+ A+ +N + V NL+S +I V Sbjct: 592 ILSLDDITLPFSLELEALSNIGMTGMKNMLTILQGLGRAVFSNLSKVHVHRNLRSSDIAV 651 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRL 2206 LLECYS ++LPW H++ G ED ++F V+IW+Q N M F+ E K LLD +M M+L Sbjct: 652 QLLECYSCQLLPWIHENGGSEDFVMQFVVDIWSQAGNCMDFSTLFEEKGLLDAIMKAMKL 711 Query: 2207 AVAGCSEYDQGLIVQKAYGVLSS-TSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 +V C+ Q LI+QKAY VLSS T+F LKE ++ L LT + S RDE Sbjct: 712 SVGSCAVESQNLIIQKAYCVLSSHTNFQQLKE---------VERLPLTPGNYNISLRDEG 762 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGH-VSAAQALGSMINKLPLMINTTEA 2560 +ISLFASV+IA+ P+T + N R+++ LF+ LL+G V AQALGS++NKL N+ E Sbjct: 763 LISLFASVVIAVFPKTYIPNKRVLMHLFIITLLRGGVVPVAQALGSILNKLVSTSNSAEN 822 Query: 2561 SSECTLEEAIDIIF--KMGLWSVCGSDPLWKCSAIDDSNEGHVNLYRNVALQSNAMVGLA 2734 SS+ TLEEA+D+IF K+ S + D G N + LQ NA+ GL+ Sbjct: 823 SSDLTLEEALDVIFNTKISFSSTDNGRSNGNEMVLTDICLGIAN---DRMLQINAICGLS 879 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG+++ GHEK+KDI+MI L CL+S SPL +D L EH +VM+ AADA Sbjct: 880 WIGKGLLLSGHEKIKDIIMIFLECLIS-GTKSASPLIKDSLENTEEHIQDLLVMKCAADA 938 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 FHVL+SDS+VCLN++FHA IRPLYKQ F SS+MP+L + + SS +RS LYRA H++ Sbjct: 939 FHVLMSDSEVCLNRKFHAMIRPLYKQRFSSSVMPILQQIITKSHSSLSRSFLYRAFAHIL 998 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETITENV 3274 S TP+ A+++EAKK Y LLLVLSGIL ++NG+E EN Sbjct: 999 SDTPMVAILSEAKKLIPVLLDCLSMLTEDIQDKDMLYGLLLVLSGILTEKNGQEAAIENA 1058 Query: 3275 HIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTV 3454 HIIIN +I LV YPH MLVRETAIQCLVA++ LP+ RIYPMR Q+L+A+SK LDD KR V Sbjct: 1059 HIIINCLIKLVDYPHKMLVRETAIQCLVALSELPHARIYPMRTQVLRAISKCLDDSKRAV 1118 Query: 3455 RQEAVRCRQAWAS 3493 R EAV+CRQ WAS Sbjct: 1119 RHEAVKCRQTWAS 1131 >ref|XP_007024312.1| MMS19 nucleotide excision repair protein, putative isoform 3 [Theobroma cacao] gi|508779678|gb|EOY26934.1| MMS19 nucleotide excision repair protein, putative isoform 3 [Theobroma cacao] Length = 1062 Score = 1042 bits (2695), Expect = 0.0 Identities = 558/1075 (51%), Positives = 740/1075 (68%), Gaps = 9/1075 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ S I +E+FVD +R QQ+++LD IA+L+KN+ LTIE LV+EM YLTT DN+I Sbjct: 1 MAETSQLIQGIESFVDSTRSPTQQAASLDVIASLLKNNQLTIETLVREMEGYLTTADNII 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI +SKPLD+AT+HSLI FFTDRL DW+AL GAL+GCLALLRRKS+ G Sbjct: 61 RARGILLLGEVLMHLASKPLDDATIHSLIQFFTDRLADWRALRGALVGCLALLRRKSSGG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 IV+ +DA+ + +YLQN+QVQSLG +DRKLCFE+L+CLL+RY +A+ +LGD+LIYGICEA Sbjct: 121 IVSETDAKAVAESYLQNLQVQSLGKYDRKLCFELLLCLLERYPKAIASLGDNLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 +DGEKDP CLML FHI+EIL LFPDPLGP F+ DLF+ L YFP+HFTHPK + ++ Sbjct: 181 VDGEKDPHCLMLIFHIIEILPQLFPDPLGPFTSFAHDLFENLSYYFPVHFTHPKGEDVNI 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 KRDDL+RALMLAF+STPLFEPFA+P P+AKVDSL+YLS C+VKYGVDRM K Sbjct: 241 KRDDLARALMLAFSSTPLFEPFAIPLLIEKLSSSLPSAKVDSLRYLSDCTVKYGVDRMAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 H ++WSSLKDA F S + + S ES G+ +NEI EAL L+K ++Q+ FL Sbjct: 301 HGEALWSSLKDAVFTS--LDGVLSFTPESLEGLCLPENEIAAEALSLLQKLIVQNTNFFL 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 LI+ DE+I M F ++S K Y+ IP + KQ+LHAVGCILS S S+A CNRVF+ FF Sbjct: 359 DLIVVDEDINMIFNMISSYKSYHGIPAQSKQRLHAVGCILSASVKASTASCNRVFECFFS 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 RLMDILGL +S DD ++ ++ N G LYL IELL+ACR + SE + Sbjct: 419 RLMDILGLCVRNSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAH 478 Query: 1490 EEDTWCCLLKKFSGPLAGAL-RYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSK 1666 E+TW LL+ FS L A + TS + H +AD+Y GVKGL ILATFP +L SK Sbjct: 479 TEETWSYLLRSFSSSLTKAFCSASICTSEDSH--DADVYFGVKGLLILATFPEGYLLISK 536 Query: 1667 SIFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLAS 1846 +FE IL F S ++V + LLWKL+LKAL+QIG+FI+KCH+SEK Y+ +VVEK+ S Sbjct: 537 PVFEKILMTFVSIVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVS 596 Query: 1847 LIPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILV 2026 L D S+P L+LEA+ EIGT G +ML+V++G+EEAI AN E +V G+ S EI+ Sbjct: 597 FSSLGDFSIPFPLRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVT 656 Query: 2027 PLLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKV-LLDKMMTTMR 2203 LL+CYS KV+PW ++GF++V L+F+++IWNQ+E +M FNA K+ +LD MM M+ Sbjct: 657 QLLKCYSDKVIPWIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKIEVLDVMMKAMK 716 Query: 2204 LAVAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDEC 2383 LAVA CSE +Q +IVQK+Y +LSS++ FPLKE + + Q+ Q +++ S RDE Sbjct: 717 LAVASCSEENQNIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQ-VDNSSSRDEW 769 Query: 2384 IISLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEAS 2563 I+SLFA+V+IA+ P+T + N++ +L LFMT LLKG+V AQALGS++NKL L + Sbjct: 770 ILSLFAAVVIAVHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGL--ESAGVQ 827 Query: 2564 SECTLEEAIDIIFKMGLW---SVCGSDPLWKCSAIDDSNEGHVNLYRNV----ALQSNAM 2722 ++CTLEE +DII + LW S +D K ++ D + +NL ++ +LQ +A+ Sbjct: 828 TDCTLEEVMDIILNLSLWIFHSNSSADIQAKMTSAHDIS--LINLCSSIGSCTSLQIHAI 885 Query: 2723 VGLAWIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRS 2902 VGLAWIGKG++MRGHEKVKDI MI LRCL ++ + +E D+H VM+S Sbjct: 886 VGLAWIGKGLLMRGHEKVKDITMIFLRCLQPNGRAEILHQEEGISESNNELDLHHSVMKS 945 Query: 2903 AADAFHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAI 3082 AADAF +L+ DS+VCLN+ FHA IRPLYKQ FFS+MMP+L S + + + +R +L RA Sbjct: 946 AADAFQILMGDSEVCLNRGFHAVIRPLYKQRFFSTMMPILQSLIMKSE-PLSRPLLLRAS 1004 Query: 3083 GHVISGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDEN 3247 H+I TPL V+++AKK Y LLLVLSGILMD+N Sbjct: 1005 AHIIVDTPLIVVLSDAKKIIPMLLDGLSALSNDILDKDVIYGLLLVLSGILMDKN 1059 >ref|XP_004486785.1| PREDICTED: uncharacterized protein LOC101495813 [Cicer arietinum] Length = 1138 Score = 1019 bits (2635), Expect = 0.0 Identities = 553/1157 (47%), Positives = 753/1157 (65%), Gaps = 5/1157 (0%) Frame = +2 Query: 50 MAKASSWIPHVEAFVDPSRPSNQQSSNLDAIATLVKNDLLTIEALVKEMGLYLTTTDNVI 229 MA+ + H+E++VD S Q+++LDAI L+K + LT+EALV+E+ +YLT+TD VI Sbjct: 1 MAETTQLTRHIESYVDSSSTPTHQATSLDAIGLLIKTNALTLEALVRELDMYLTSTDTVI 60 Query: 230 RSRGIXXXXXXXXXXSSKPLDNATVHSLIGFFTDRLVDWQALHGALIGCLALLRRKSNVG 409 R+RGI SKPLD+ T+HSL+GFF +RL DW+A+ GAL+GCLAL+RRKS G Sbjct: 61 RARGILLLAEVLTRICSKPLDSETIHSLVGFFKERLADWKAVRGALVGCLALIRRKSVAG 120 Query: 410 IVTVSDARTLGLTYLQNIQVQSLGNHDRKLCFEILICLLDRYSEAVITLGDDLIYGICEA 589 +VT SDA+ + ++LQ +QVQSLG++DRKLCFE+L LL+ +++AV +L +DLIYGICEA Sbjct: 121 MVTGSDAKAIAQSFLQYLQVQSLGHYDRKLCFELLDFLLEHHADAVASLEEDLIYGICEA 180 Query: 590 IDGEKDPQCLMLTFHIVEILGHLFPDPLGPLAGFSEDLFDVLGRYFPIHFTHPKNDGFDV 769 ID EKDP+CLML FHIVE L L+PDP G LA F+ D+FD+L YFPIHFTHP + V Sbjct: 181 IDAEKDPECLMLAFHIVESLARLYPDPSGLLASFASDVFDILAPYFPIHFTHPSSGDTHV 240 Query: 770 KRDDLSRALMLAFASTPLFEPFAMPXXXXXXXXXXPTAKVDSLKYLSHCSVKYGVDRMGK 949 +RDDLS+ LM AF+STPLFEPF +P +AK+DSL+YL CS KYG +R+ K Sbjct: 241 QRDDLSKILMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLQYLRVCSSKYGAERIAK 300 Query: 950 HATSIWSSLKDAFFRSSPQEPIFSLISESPNGMEFQKNEITNEALVCLEKFMLQDGGLFL 1129 +A +IWSSLKD + + EP S + NG+ F +NE+ EAL L++ ++Q+ + Sbjct: 301 YAGAIWSSLKDTLY-TYLAEPDLS-FTLPINGIGFPENEVVIEALSLLQQLIVQNNSQLV 358 Query: 1130 SLILEDEEIEMTFRSVTSAKKYNDIPVEIKQKLHAVGCILSVSSTVSSACCNRVFQNFFF 1309 SLI++DE++ S+ S + Y+ I V+ K+KLHA+G IL ++ S + CN VFQ+ F Sbjct: 359 SLIIDDEDVNFIINSIASYETYDTISVQEKKKLHAIGRILYITVKASISSCNAVFQSLFL 418 Query: 1310 RLMDILGLSPGSSDSGCILDDGFVLSEKLNSGGLYLCIELLAACRGLAIGSEELSPQLIS 1489 R+MD LG+ + D + + S+ + G LYLCIELLA R L + SEE Sbjct: 419 RMMDNLGIPVSNIDG--LQNSAIFTSQNVKFGFLYLCIELLAGSRELVVLSEE------- 469 Query: 1490 EEDTWCCLLKKFSGPLAGALRYILVTSTNQHACEADIYCGVKGLQILATFPSCFLPTSKS 1669 + +T+C LL +S L A +L S ++ DI+ GVKGLQILA F P KS Sbjct: 470 KRETYCTLLHSYSTVLFNAFDSVLDVSADRCLPGPDIHIGVKGLQILAMFHLDVFPIPKS 529 Query: 1670 IFENILNVFTSNLSVGWEEQLLWKLSLKALMQIGTFIKKCHDSEKVICYMTIVVEKLASL 1849 FENIL F S + G+ +LW +LKAL IG+F++ +SEK + Y + VV+K L Sbjct: 530 TFENILKKFMSIIIEGFNRTVLWNAALKALFHIGSFVQNFSESEKAMSYRSFVVDKTMEL 589 Query: 1850 IPLEDSSMPLSLQLEAIFEIGTIGLDFMLRVIQGIEEAISANFFEAFVKGNLKSVEILVP 2029 + L+D ++P SL++EA+ +IG G+ ML ++QG+E A+ AN E V NLKS EI Sbjct: 590 LSLDDIALPFSLKVEALSDIGMTGMKNMLTILQGLEGALFANLSE--VHRNLKSSEIAAQ 647 Query: 2030 LLECYSSKVLPWSHKSEGFEDVALRFSVNIWNQMENNMTFNAGVEGKVLLDKMMTTMRLA 2209 LLECYS K+LPW H++ G E+ ++F+V IW+Q N M FN+ E K LLD MM ++++ Sbjct: 648 LLECYSCKLLPWIHENGGAEESIVQFTVEIWSQAGNCMDFNSPFEEKGLLDSMMKALKVS 707 Query: 2210 VAGCSEYDQGLIVQKAYGVLSSTSFFPLKESISFSVPIKLDDLQLTQDLESFSYRDECII 2389 V CS Q I+QKAY +LSS + F L + + L LT S RDE I+ Sbjct: 708 VGCCSVESQNAIIQKAYSILSSHTKFQLND---------VGRLPLTPGAYDISPRDEGIL 758 Query: 2390 SLFASVIIALRPQTSLLNVRIILKLFMTGLLKGHVSAAQALGSMINKLPLMINTTEASSE 2569 LFASVIIALRP+T + N+ +L+LF+ LLKG V AQALGSM+NKL N E S E Sbjct: 759 LLFASVIIALRPKTHIPNIGGLLQLFIITLLKGVVPVAQALGSMVNKLIPKSNGAEKSGE 818 Query: 2570 CTLEEAIDIIFKMGLWSVCGSDPLWKCSAIDDSNEG-----HVNLYRNVALQSNAMVGLA 2734 LEEA+DIIF +W +D L +C+ + +E + + + LQ+NA+ GLA Sbjct: 819 FPLEEALDIIFNTKIW-FSSTDMLQRCNGSSNGSEMVLTDLCLGITNDRLLQTNAICGLA 877 Query: 2735 WIGKGMIMRGHEKVKDIMMILLRCLLSRSNMRTSPLQQDMLGECSEHDIHPIVMRSAADA 2914 WIGKG+++RGHEK+KDI MI + CL+S + PL + L E P+ + AADA Sbjct: 878 WIGKGLLLRGHEKIKDITMIFIECLMS-DRKSSLPLIEGSLASTEEQKSDPLPRKCAADA 936 Query: 2915 FHVLLSDSDVCLNKRFHATIRPLYKQHFFSSMMPVLLSSLKECDSSTTRSMLYRAIGHVI 3094 FH+L+SD++VCLN++FHATIRPLYKQ FFS+++P+ + DS +RS LYRA HV+ Sbjct: 937 FHILMSDAEVCLNRKFHATIRPLYKQRFFSTILPIFQQLITRSDSPLSRSFLYRAFAHVM 996 Query: 3095 SGTPLAAVVTEAKKXXXXXXXXXXXXXXXXXXXXXTYSLLLVLSGILMDENGKETITENV 3274 S TPL +V EAKK Y LLLVLSGIL ++NG+E + EN Sbjct: 997 SDTPLIVIVNEAKKLIPVLLDCLSLLTEDIEDKDILYGLLLVLSGILTEKNGQEAVIENA 1056 Query: 3275 HIIINHIIGLVSYPHMMLVRETAIQCLVAMTGLPYTRIYPMRLQLLQALSKALDDPKRTV 3454 HIIIN +I LV YP LVRETAIQCLVA++ LP+ RIYP+R Q+L+A+ K LDD KR+V Sbjct: 1057 HIIINCLIKLVDYPQKTLVRETAIQCLVALSMLPHARIYPLRTQVLRAIPKCLDDTKRSV 1116 Query: 3455 RQEAVRCRQAWASIAST 3505 R+EAV+CRQAWASI S+ Sbjct: 1117 RREAVKCRQAWASITSS 1133