BLASTX nr result
ID: Angelica22_contig00001459
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica22_contig00001459 (3877 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI36057.3| unnamed protein product [Vitis vinifera] 1180 0.0 ref|XP_002515963.1| DNA repair/transcription protein met18/mms19... 1039 0.0 ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair ... 1022 0.0 ref|XP_003546956.1| PREDICTED: uncharacterized protein LOC100788... 989 0.0 ref|XP_003597674.1| MMS19 nucleotide excision repair protein-lik... 985 0.0 >emb|CBI36057.3| unnamed protein product [Vitis vinifera] Length = 1146 Score = 1180 bits (3053), Expect = 0.0 Identities = 631/1146 (55%), Positives = 793/1146 (69%), Gaps = 20/1146 (1%) Frame = +1 Query: 73 MGDSTHCVKHIESFVDSSSTPXXXXXXXXXXXXLVNNDIITLEALVREMGLYLTTTDSFL 252 M + ++IES+VDSS + L+ NDI+TLE LV EMG+YLTTTD+ + Sbjct: 1 MAQLSQLTQYIESYVDSSRSSTQQAASVDAIAYLLKNDILTLETLVTEMGMYLTTTDNII 60 Query: 253 RARGTLLLAESLVCLASKPLENETVHSLIGFFTERLVDWKALHGALIGCLCLLRRKGNVG 432 R RG LLLAE L LASKPL+N T+HSLI FFT+RL DW+AL GALIGCL L++RK N+G Sbjct: 61 RTRGILLLAELLTRLASKPLDNVTIHSLISFFTDRLADWRALRGALIGCLALMKRKSNMG 120 Query: 433 IVQGSKALAVAQCFLQNVQVQSLRESDRKXXXXXXXXXXXXYPNDVVHLEDVIIPGICES 612 V + A AVAQ +L+NVQVQSL + DRK YP V L D ++ GIC + Sbjct: 121 RVTDNDARAVAQAYLENVQVQSLGQHDRKLCFEILECLLDHYPESVASLGDDLVYGICGA 180 Query: 613 IDGEKDPECLMLAFQIVEALVHLYPDPTGPLASYATDLFENLSCYFPIHFTHPKGEDIDV 792 IDGEKDP CLML F IVE L L+PDP+GPLAS+A DLF+ L CYFPIHFTHP+GED+DV Sbjct: 181 IDGEKDPRCLMLTFHIVEILARLFPDPSGPLASFAGDLFDILGCYFPIHFTHPQGEDVDV 240 Query: 793 KKEELSRTLMMAFASTPLFEPFAIXXXXXXXXXXXXXAKVESLKYLIYCSDKYGAARMAK 972 K+++LSR LM+AF+ST LFEPFAI AKV+SLKYL C KYG RM K Sbjct: 241 KRDDLSRALMLAFSSTTLFEPFAIPLLLEKLSSSLPLAKVDSLKYLSNCLLKYGDDRMTK 300 Query: 973 HVKVMWSALKDIIYTSPXXXXXXXXXXX-DGMSFEESHTVTEALVLLQMVIHQNSNLLLD 1149 HV+ +W ++KD I+ S D + F+E+ VTEA++LLQ VI +NS L L Sbjct: 301 HVEAIWFSVKDAIFCSEQEPMLSLASELLDHVGFQENEIVTEAIILLQKVILENSGLSLS 360 Query: 1150 LIVSDEDMKRTVDNIHQFKEYNGITLQSKQKLHAVGCILYLSAKASIGSCNRVFEGFFQR 1329 LIV D+D+ V+ + F+ YN I LQSK KL A+G ILY+SAKASI CNRVFE FF R Sbjct: 361 LIVGDKDINTIVNTVTSFRSYNDIPLQSKHKLCAIGRILYVSAKASITCCNRVFESFFFR 420 Query: 1330 LIHALGISLENS--------DAPLSSPCNFGATYLCVELLAGCRALVVGTEGNTSTNVLA 1485 L+ LG+S+ NS D S NFGA YLC+ELLA CR LVVG+E TS +V A Sbjct: 421 LMDTLGLSVRNSSGDCLPNFDYVFSERLNFGALYLCIELLAACRDLVVGSEELTSKSVSA 480 Query: 1486 QEAWSNMLCSYGTSLSKIFSFTLVTGQDGSTQNAYLHYGVKGLQILATFPGNFYPLSASI 1665 QE+W ML S+ + L K FS L D A ++ GVKGLQILATFPG F P+S SI Sbjct: 481 QESWCCMLHSFSSLLMKAFSSVLDASTDKDAYEADIYSGVKGLQILATFPGEFLPISKSI 540 Query: 1666 YESILLKLISVVTLNFKDALLWKLALTALVEIGLFVDKSEDTEKAQSFDAIVVDRIASLL 1845 +E++LL IS++ +F LLWKLAL ALV+IG F+D+ ++EKA S++ IVV++I SL+ Sbjct: 541 FENVLLTFISIIVEDFNKTLLWKLALKALVQIGSFIDRFHESEKALSYNYIVVEKIVSLM 600 Query: 1846 LCDDLTMPSQLKLEIISSIGTTGLSYMQKILQGXXXXXXXXXXGVYVNGDPELLETATSL 2025 DD +P QL+LE IS IGTTGL+ M KI+QG VYV+G+ + + A L Sbjct: 601 FLDDFGLPFQLRLEAISDIGTTGLNVMLKIVQGLEDAIFANLSEVYVHGNLKSAKIAVQL 660 Query: 2026 LECYSCKVLPWFDSIGGHEEVQFHFALNIWDQIENNMSFCISSQEQKLLPAAMTAMQYAV 2205 LECYS K+LP G E+V FA+NIW+QIEN+M+F + +QE +LL A MTAM+ AV Sbjct: 661 LECYSNKLLPGIHGAGDFEDVLSRFAVNIWNQIENSMAFSVGAQENELLNATMTAMKLAV 720 Query: 2206 GRCLVESQTVIVNRAFGMLSSTTSIPVKDLMNETTPVKVGGLKSNDGLDCFSCRDKWIIS 2385 G C SQ I+ +A+ +LSS S + + M T V++ GL+ L+CFSCRDKW+IS Sbjct: 721 GSCSEGSQGKIIKKAYSVLSSCPSFTLMESMPITGTVQLEGLQHTQDLECFSCRDKWVIS 780 Query: 2386 LFASVVVALRPQAHITNVKMIFQLFLTTQLYGHVPSAQALGSIINKLPVKVENTDPSKVF 2565 LFAS ++A+RPQ HI N++++ LF+T L GHVP+AQALGS++NKL K + S Sbjct: 781 LFASAIIAVRPQTHIPNIRVVLHLFMTNLLKGHVPAAQALGSMVNKLCPKSNGVEISSTC 840 Query: 2566 SLVEAMDMIFSSRVWILCNEGPSK-CLVAGNNNEIGFRNSLLVQSH-------AIEGLAW 2721 +L +A+D+IF++ +W N GP K C G +NE+G N L S+ AIEGLAW Sbjct: 841 TLEDALDIIFNTSLWDSHNHGPLKRCSGIGVDNEMGLANLCLSASNCQLLQVCAIEGLAW 900 Query: 2722 IGKGLLMRGHEKVKDIVVTLLSCLLS--NYEQGLL-SIMRSAADAFHILISDSEACLNKR 2892 IGKGLL+RGHEKVKDI + L CLLS N EQ +L S+ +SAADAFH+L+SDSE CLNKR Sbjct: 901 IGKGLLLRGHEKVKDITMIFLRCLLSKNNQEQDVLPSVAKSAADAFHVLMSDSEICLNKR 960 Query: 2893 LHATIRPLYKQRLFSIVLPVILSLISKADSSIQREMLYRAFANVVCGTPLSAILSEAKKL 3072 HA IRPLYKQR FS VLP+++S ++++ S R MLYRA A+++ TPL A+LSEAKK+ Sbjct: 961 FHANIRPLYKQRFFSSVLPILVSSMAESRLSNTRSMLYRALAHIISDTPLIAVLSEAKKI 1020 Query: 3073 IPILVDSLSVLSEDVLHKDIVYSILLVLSGILTDKKRSEVVVENAHIIIRRLIALVSYPH 3252 IPIL+DSLS+LS L KDI+Y++LLVLSGIL DK E VVENAH+II LI LV YPH Sbjct: 1021 IPILLDSLSILSTYNLDKDILYNLLLVLSGILMDKNGQETVVENAHVIINCLIGLVGYPH 1080 Query: 3253 MMVTRETAIQCLTAMSELPHARIYPMRTQVLLSISKTLDDPKRAVRQEAVKCRQAWASIA 3432 MMV RETAIQCL AMS LPHARIYPMRTQVL S+ K LDDPKRAVR EAV+CRQAWASIA Sbjct: 1081 MMVVRETAIQCLVAMSRLPHARIYPMRTQVLRSVQKALDDPKRAVRHEAVRCRQAWASIA 1140 Query: 3433 *KTLHF 3450 ++LHF Sbjct: 1141 SRSLHF 1146 >ref|XP_002515963.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] gi|223544868|gb|EEF46383.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] Length = 1174 Score = 1039 bits (2686), Expect = 0.0 Identities = 564/1176 (47%), Positives = 761/1176 (64%), Gaps = 50/1176 (4%) Frame = +1 Query: 73 MGDSTHCVKHIESFVDSSSTPXXXXXXXXXXXXLVNNDIITLEALVREMGLYLTTTDSFL 252 M + T ++IES+VD+S + L+ ND +T+ +LV+EM +YLTTTD + Sbjct: 1 MAEPTQLTQYIESYVDASRSLSQQAASLDAIVLLLKNDAVTIGSLVKEMEMYLTTTDDII 60 Query: 253 RARGTLLLAESLVCLASKPLENETVHSLIGFFTERLVDWKALHGALIGCLCLLRRKGNVG 432 RARG LLL E+L L+SKPL+N T+HSLI FFTERL DW+AL GAL+GCL L+RR+ N G Sbjct: 61 RARGILLLGEALSHLSSKPLDNTTIHSLIAFFTERLADWRALRGALVGCLALIRRRSN-G 119 Query: 433 IVQGSKALAVAQCFLQNVQVQSLRESDRKXXXXXXXXXXXXYPNDVVHLEDVIIPGICES 612 I+ G A VA+ +LQN+QVQSL + DRK P V L + +I GICE+ Sbjct: 120 IITGIDAKVVAESYLQNLQVQSLAQYDRKLCFELLECLLENCPAAVASLGEDLIYGICEA 179 Query: 613 IDGEKDPECLMLAFQIVEALVHLYPDPTGPLASYATDLFENLSCYFPIHFTHPKGEDIDV 792 IDGEKDP+CLML F IVE L L+PDP+GP +S+A D+F L CYFPIHFTHPK ED+DV Sbjct: 180 IDGEKDPQCLMLTFHIVEVLGKLFPDPSGPFSSFAGDIFSILGCYFPIHFTHPKAEDVDV 239 Query: 793 KKEELSRTLMMAFASTPLFEPFAIXXXXXXXXXXXXXAKVESLKYLIYCSDKYGAARMAK 972 K+++LSR LM+AF+STPLFEPFA+ AKV+SLKYL YC+ K+ A R+A+ Sbjct: 240 KRDDLSRALMLAFSSTPLFEPFAMPLLLEKLSSSLPTAKVDSLKYLSYCTLKFRADRIAE 299 Query: 973 HVKVMWSALKDIIYTS-PXXXXXXXXXXXDGMSFEESHTVTEALVLLQMVIHQNSNLLLD 1149 H +WS+LKD IY+S D E++ TEAL+LL+ +I QN+N L Sbjct: 300 HAGAIWSSLKDAIYSSGEEPMLSSDLESVDSPGSEKNEIATEALLLLENLIVQNNNFFLS 359 Query: 1150 LIVSDEDMKRTVDNIHQFKEYNGITLQSKQKLHAVGCILYLSAKASIGSCNRVFEGFFQR 1329 +I+SDE++K + I +K YN I+LQSKQKLH VG ILY+ AK S+ SCNR+FE +F R Sbjct: 360 MIISDEEVKMIFNTITSYKSYNEISLQSKQKLHMVGRILYVCAKVSVSSCNRIFESYFPR 419 Query: 1330 LIHALGISLENSDAPLSSP--C------NFGATYLCVELLAGCRALVVGTEGNTSTNVLA 1485 L+ ALGI +EN+ S C N+G+ YL ++LL CR L ++ S + Sbjct: 420 LMEALGILVENTSGACHSNENCVKAKQPNYGSFYLSIKLLGACRDLSTSSDNLASQCIST 479 Query: 1486 QEAWSNMLCSYGTSLSKIFSFTLVTGQDGSTQNAYLHYGVKGLQILATFPGNFYPLSASI 1665 E + +L + TSL++ FS L T G Q+ ++ GVKGLQILATFPG + LS Sbjct: 480 NETYCCLLQRFSTSLTETFSAALATSTSGPAQDVDMYLGVKGLQILATFPGGYLFLSKLT 539 Query: 1666 YESILLKLISVVTLNFKDALLWKLALTALVEIGLFVDKSEDTEKAQSFDAIVVDRIASLL 1845 +++IL+ +S++T++F LLW AL ALV+IG FV +++K S+ IVV ++ L Sbjct: 540 FDNILMTFLSIITVDFNKTLLWNQALKALVQIGSFVHGCNESDKEMSYVDIVVGKMILLA 599 Query: 1846 LCDDLTMPSQLKLEIISSIGTTGLSYMQKILQGXXXXXXXXXXGVYV------------- 1986 D +MP LKL ISSIG +G YM K+ G +YV Sbjct: 600 SSPDFSMPWSLKLTAISSIGMSGQKYMLKVFLGLEEAIRANLAEIYVCMIKKKIYVLYSC 659 Query: 1987 --NGDPELLETATSLLECYSCKVLPWFDSIGGHEEVQFHFALNIWDQIENNMSFCISSQ- 2157 G+ + + LLECYS ++LPW G EEV F +N+W+QIEN +F ++ Sbjct: 660 LVQGNLKSAKILLQLLECYSDELLPWIQKTEGFEEVLMQFVVNLWNQIENFNAFTVAFHG 719 Query: 2158 EQKLLPAAMTAMQYAVGRCLVESQTVIVNRAFGMLSSTTSIPVKDLMNETTPVKVGGLKS 2337 ++ LL A M M+ AV C VESQ VI+ +A+G+LSS+T +P+K+ ++E + V++ ++ Sbjct: 720 KESLLDAIMKVMKDAVAFCSVESQNVIIYKAYGVLSSSTFLPLKESLSENS-VQLECFRA 778 Query: 2338 NDGLDCFSCRDKWIISLFASVVVALRPQAHITNVKMIFQLFLTTQLYGHVPSAQALGSII 2517 +D S RD+WI SLFASV++ALRPQ HI N +++ LF+T L GHV +A+ALGS++ Sbjct: 779 IQQMDRLSSRDEWIHSLFASVIIALRPQTHIPNTRIVLHLFITALLKGHVTTAEALGSLV 838 Query: 2518 NKLPVKVENTDPSKVFSLVEAMDMIFSSRVWILCNEGPSKCLVAGNNNE--------IGF 2673 NKL K + S ++ EAMD+IFS + G S N + + Sbjct: 839 NKLDQKSNDACISGDCTIEEAMDIIFSINLLCSFGNGSSGRFDRTRNGDEMDLIKLCLDA 898 Query: 2674 RNSLLVQSHAIEGLAWIGKGLLMRGHEKVKDIVVTLLSCLLSNYEQGLL----------- 2820 N ++ AI GLAWIGKGLLMRGHEKVKDI + L+CLLS+ E G Sbjct: 899 PNLAWIKIPAIVGLAWIGKGLLMRGHEKVKDITMVFLNCLLSDGEIGASPLKHGSLENNG 958 Query: 2821 ------SIMRSAADAFHILISDSEACLNKRLHATIRPLYKQRLFSIVLPVILSLISKADS 2982 S+M+SA+DAF IL+SDSE CLN++ HA +RPLYKQR FS ++P++ LI+K+DS Sbjct: 959 EQDMQQSVMKSASDAFQILMSDSELCLNRKYHAIVRPLYKQRFFSSIMPILYPLITKSDS 1018 Query: 2983 SIQREMLYRAFANVVCGTPLSAILSEAKKLIPILVDSLSVLSEDVLHKDIVYSILLVLSG 3162 S + +LYRAFA+V+ TPLS I ++AKKL+P+L+D L++L +DVL KDI+Y +LLVLSG Sbjct: 1019 SFSKSLLYRAFAHVISDTPLSVISNDAKKLVPVLLDGLTLLGKDVLDKDIMYGLLLVLSG 1078 Query: 3163 ILTDKKRSEVVVENAHIIIRRLIALVSYPHMMVTRETAIQCLTAMSELPHARIYPMRTQV 3342 ILTD E V+ENAHIII+ LI LV+YPHMM+ RETA+QCL AMSELPH RIYP+R QV Sbjct: 1079 ILTDTNGKEAVIENAHIIIKCLIELVAYPHMMLIRETAVQCLVAMSELPHTRIYPVRIQV 1138 Query: 3343 LLSISKTLDDPKRAVRQEAVKCRQAWASIA*KTLHF 3450 L +ISK LDDPKRAVRQEAV+CRQAWASIA ++LH+ Sbjct: 1139 LQAISKALDDPKRAVRQEAVRCRQAWASIASRSLHY 1174 >ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Cucumis sativus] Length = 1147 Score = 1022 bits (2643), Expect = 0.0 Identities = 563/1156 (48%), Positives = 752/1156 (65%), Gaps = 30/1156 (2%) Frame = +1 Query: 73 MGDSTHCVKHIESFVDSSSTPXXXXXXXXXXXXLVNNDIITLEALVREMGLYLTTTDSFL 252 M + ++ESFVD S TP LV N+++T+E LVREMG+YLT TD+ + Sbjct: 1 MAELCKLTHYVESFVDVSRTPSQQATSLETITSLVKNNVLTIETLVREMGMYLTITDNII 60 Query: 253 RARGTLLLAESLVCLASKPLENETVHSLIGFFTERLVDWKALHGALIGCLCLLRRKGNVG 432 R RG LLL E L CLASKPL++ T+HSLI FFTERL DWKAL GAL+GCL L+RRK NVG Sbjct: 61 RGRGILLLGELLACLASKPLDSATIHSLIAFFTERLADWKALRGALVGCLALMRRKTNVG 120 Query: 433 IVQGSKALAVAQCFLQNVQVQSLRESDRKXXXXXXXXXXXXYPNDVVHLEDVIIPGICES 612 + + A +VAQ + QN+QVQSL + DRK YP+ VV L D ++ GICE+ Sbjct: 121 SISQNDAKSVAQSYFQNLQVQSLGQHDRKLSFELLACLLEHYPDAVVSLGDDLVYGICEA 180 Query: 613 IDGEKDPECLMLAFQIVEALVHLYPDPTGPLASYATDLFENLSCYFPIHFTHPKGEDIDV 792 IDGEKDP CL+L F+IVE + L+PDPTG LAS ++DLFE L CYFPIHFTH K EDIDV Sbjct: 181 IDGEKDPHCLLLTFRIVELVAKLFPDPTGALASSSSDLFEFLGCYFPIHFTHGKEEDIDV 240 Query: 793 KKEELSRTLMMAFASTPLFEPFAIXXXXXXXXXXXXXAKVESLKYLIYCSDKYGAARMAK 972 ++ +LS LM AF+STPLFEPFAI AK++SLKYL C+ KYGA RM K Sbjct: 241 RRNDLSHALMRAFSSTPLFEPFAIPLLLEKLSSSLPLAKIDSLKYLSDCTVKYGADRMKK 300 Query: 973 HVKVMWSALKDIIYTS-PXXXXXXXXXXXDGMSFEESHTVTEALVLLQMVIHQNSNLLLD 1149 H + +WS++K+II+TS + SF+E+ TEAL LLQ ++ ++ L L Sbjct: 301 HSEAIWSSVKEIIFTSIGQPNLSINTESLNSPSFQENEMTTEALRLLQKMVVASNGLFLT 360 Query: 1150 LIVSDEDMKRTVDNIHQFKEYNGITLQSKQKLHAVGCILYLSAKASIGSCNRVFEGFFQR 1329 LI++DED+K + ++ + Y LQS+Q+L+AVG ILY SA AS+ SC+ VFE +F R Sbjct: 361 LIINDEDVKDIFNILNIYTCYKDFPLQSRQRLNAVGHILYTSASASVASCDHVFESYFHR 420 Query: 1330 LIHALGISLENSDAPLSSP---CNFGATYLCVELLAGCRALVVGTEGNTSTNVLAQEAWS 1500 L+ +GIS++ SP NFGA YLC+E++A CR L+V ++ NT + +E Sbjct: 421 LLDFMGISVDQYHNDKISPIRNLNFGALYLCIEVIAACRNLIVSSDENTCS---VKEKSY 477 Query: 1501 NMLCSYGTSLSKIFSFTLVTGQDGSTQNAYLHYGVKGLQILATFPGNFYPLSASIYESIL 1680 +ML + S+ ++ S T +A + VKGL L+TFP P+S I+E IL Sbjct: 478 SMLQIFSCSVVQLLSSTFSGIVKRDLHDAEFYCAVKGLLNLSTFPVGSSPVSRVIFEDIL 537 Query: 1681 LKLISVVTLNFKDALLWKLALTALVEIGLFVDKSEDTEKAQSFDAIVVDRIASLLLCDDL 1860 L+ +S +T+NFK LW AL AL IG FVDK + ++QS+ IVV++IA + D Sbjct: 538 LEFMSFITVNFKFGSLWNHALKALQHIGSFVDKYPGSVESQSYMHIVVEKIALMFSPHDE 597 Query: 1861 TMPSQLKLEIISSIGTTGLSYMQKILQGXXXXXXXXXXGVYVNGDPELLETATSLLECYS 2040 +P LKLE+ IG TG SYM KI+ G VYV G+ + +E SLL+CYS Sbjct: 598 VLPLMLKLEMAVDIGRTGRSYMLKIVGGIEETIFYNLSEVYVYGNSKSVEIVLSLLDCYS 657 Query: 2041 CKVLPWFDSIGGHEEVQFHFALNIWDQIENNMSFCISSQE--QKLLPAAMTAMQYAVGRC 2214 K+LPWFD G EEV FALNIWDQIE +F S + Q LL A M A++ +V C Sbjct: 658 TKILPWFDEAGDFEEVILRFALNIWDQIEKCSTFSTSMDKCIQVLLDATMMALKLSVRSC 717 Query: 2215 LVESQTVIVNRAFGMLSSTTSIPVKDLMNETTPVKVGGLKSNDGLDCFSCRDKWIISLFA 2394 ESQ +IV +AF +L +++ P+K ++ T PV++ GL+ D + RD+WI+SLFA Sbjct: 718 SKESQNIIVQKAFNVLLTSSFSPLKVTLSNTIPVQMEGLQFLQQKDNPTSRDEWILSLFA 777 Query: 2395 SVVVALRPQAHITNVKMIFQLFLTTQLYGHVPSAQALGSIINKLPVKVENTDPSKVFSLV 2574 SV +ALRPQ H+ +V++I +L + + G VP+AQALGS+INKL VK + + S SL Sbjct: 778 SVTIALRPQVHVPDVRLIIRLLMLSTTRGCVPAAQALGSMINKLSVKSDKVEVSSYVSLE 837 Query: 2575 EAMDMIFSSRVWILCNEGPSKCLVAGNNNEI-------GFRNSLLVQSHAIEGLAWIGKG 2733 EA+D+IF + L NE G+ +E+ S L+Q HA+ GL+WIGKG Sbjct: 838 EAIDIIFKTEFRCLHNES------TGDGSEMFLTDLCSSIEKSSLLQVHAVVGLSWIGKG 891 Query: 2734 LLMRGHEKVKDIVVTLLSCLLS----------------NYEQGL-LSIMRSAADAFHILI 2862 LL+ GH+KV+DI + L L+S + E L ++M+ AA+AFHIL+ Sbjct: 892 LLLCGHDKVRDITMVFLQLLVSKSRTDASPLQQFKLEKDNETSLDFAVMKGAAEAFHILM 951 Query: 2863 SDSEACLNKRLHATIRPLYKQRLFSIVLPVILSLISKADSSIQREMLYRAFANVVCGTPL 3042 SDSEACLN++ HA +RPLYKQR FS ++P+ +L+SK+D+S+ R MLY+A+A+V+ TPL Sbjct: 952 SDSEACLNRKFHAIVRPLYKQRFFSTMMPIFQTLVSKSDTSLSRYMLYQAYAHVISDTPL 1011 Query: 3043 SAILSEAKKLIPILVDSLSVLSEDVLHKDIVYSILLVLSGILTDKKRSEVVVENAHIIIR 3222 +AILS+AKK IP+L+D L LS + ++KD+VYS+LLVLSGIL DK E V ENAH I+ Sbjct: 1012 TAILSDAKKFIPMLLDGLLTLSVNGINKDVVYSLLLVLSGILMDKNGQEAVTENAHKIVD 1071 Query: 3223 RLIALVSYPHMMVTRETAIQCLTAMSELPHARIYPMRTQVLLSISKTLDDPKRAVRQEAV 3402 L L + HMM+ RETAIQCL A+SELPHARIYPMR QVL ISK LDDPKR+VRQEAV Sbjct: 1072 CLAGLTDFSHMMLVRETAIQCLVAVSELPHARIYPMRRQVLHVISKALDDPKRSVRQEAV 1131 Query: 3403 KCRQAWASIA*KTLHF 3450 +CRQAWASIA ++LHF Sbjct: 1132 RCRQAWASIASRSLHF 1147 >ref|XP_003546956.1| PREDICTED: uncharacterized protein LOC100788537 [Glycine max] Length = 1135 Score = 989 bits (2558), Expect = 0.0 Identities = 543/1145 (47%), Positives = 726/1145 (63%), Gaps = 26/1145 (2%) Frame = +1 Query: 73 MGDSTHCVKHIESFVDSSSTPXXXXXXXXXXXXLVNNDIITLEALVREMGLYLTTTDSFL 252 M ++T +HIES+VDSSSTP LVN D + LEALVRE+ +YLTTTD+ + Sbjct: 1 MAETTQLTRHIESYVDSSSTPAQQASSLNAVASLVNTDALPLEALVRELEMYLTTTDNVV 60 Query: 253 RARGTLLLAESLVCLASKPLENETVHSLIGFFTERLVDWKALHGALIGCLCLLRRKGNVG 432 RARG LLLAE + + SKPL + T+HSL+GFF +RL DW+A+ GAL+GCL L+RRK VG Sbjct: 61 RARGILLLAEVMTRIESKPLNSATIHSLVGFFKDRLADWRAVQGALVGCLALIRRKSVVG 120 Query: 433 IVQGSKALAVAQCFLQNVQVQSLRESDRKXXXXXXXXXXXXYPNDVVHLEDVIIPGICES 612 +V S A +AQ FLQ +QVQSL + DRK Y + V L + +I GICE+ Sbjct: 121 MVTDSDATTIAQSFLQYMQVQSLGQYDRKLCFELLDCLLERYFDAVTTLGEDLIYGICEA 180 Query: 613 IDGEKDPECLMLAFQIVEALVHLYPDPTGPLASYATDLFENLSCYFPIHFTHPKGEDIDV 792 ID EKDP+CL LAF IV +L L PD + LASYA D+F+ L YFPIHFTHP D V Sbjct: 181 IDAEKDPDCLKLAFHIVASLAQLNPDSSSLLASYAKDVFDILEPYFPIHFTHPSSGDTHV 240 Query: 793 KKEELSRTLMMAFASTPLFEPFAIXXXXXXXXXXXXXAKVESLKYLIYCSDKYGAARMAK 972 ++++LS +LM AF+STPLFEPF I AK++SLKYL CS KYGA R+AK Sbjct: 241 QRDDLSTSLMSAFSSTPLFEPFVIPLLLEKLSSSLHSAKIDSLKYLRVCSSKYGAERIAK 300 Query: 973 HVKVMWSALKDIIYT-SPXXXXXXXXXXXDGMSFEESHTVTEALVLLQMVIHQNSNLLLD 1149 + +WS+LKD + T DG+ F E+ V EAL LLQ +I QNS+LL+ Sbjct: 301 YAGAIWSSLKDTLSTYLGEPDFSFTIAPVDGIGFPENEFVIEALSLLQQLIAQNSSLLVS 360 Query: 1150 LIVSDEDMKRTVDNIHQFKEYNGITLQSKQKLHAVGCILYLSAKASIGSCNRVFEGFFQR 1329 LI+ DED+ I ++ Y+ I +Q K+KLHA+G ILY+++K +I SCN +FE F R Sbjct: 361 LIIDDEDVNTIFSTITSYETYDAIPVQEKKKLHAIGRILYITSKTTISSCNAMFESLFTR 420 Query: 1330 LIHALGISLE--NSDAPLSSPCNFGATYLCVELLAGCRALVVGTEGNTSTNVLAQEAWSN 1503 ++ LG S+ N D S FG YLC+ELLAGCR L+VG+E V E Sbjct: 421 MMDNLGFSVRFPNGDISPSQRLKFGFLYLCIELLAGCRELIVGSEEPALQYVFEHETCCT 480 Query: 1504 MLCSYGTSLSKIFSFTLVTGQDGSTQNAYLHYGVKGLQILATFPGNFYPLSASIYESILL 1683 ML S+ T L F L D + + GVKGLQILA F + +P+ SI+E+IL Sbjct: 481 MLHSFSTPLFNAFGSVLAVSADRGPLDPDTYVGVKGLQILAMFHSDVFPIQKSIFENILK 540 Query: 1684 KLISVVTLNFKDALLWKLALTALVEIGLFVDKSEDTEKAQSFDAIVVDRIASLLLCDDLT 1863 K +S++ +F +LW+ AL AL +G F K ++EKA S+ +VV++I +L DD+T Sbjct: 541 KFMSIIIEDFNKTILWEAALKALHHVGSFFQKFCESEKAMSYRNLVVEKIVEILSLDDIT 600 Query: 1864 MPSQLKLEIISSIGTTGLSYMQKILQGXXXXXXXXXXGVYVNGDPELLETATSLLECYSC 2043 + LK+E + +IG TG+ M ILQG VYV+ + E A LLECYSC Sbjct: 601 LSFSLKVEALLNIGKTGMKNMLTILQGLGRAVFANLSKVYVHRNLRSSEIAVQLLECYSC 660 Query: 2044 KVLPWFDSIGGHEEVQFHFALNIWDQIENNMSFCISSQEQKLLPAAMTAMQYAVGRCLVE 2223 ++LPW GG E+ FA++IW Q N M + + LL A M AM+ +VG C VE Sbjct: 661 QLLPWIHENGGSEDFVMQFAVDIWSQAGNCMDLSTPFEGKGLLDAMMKAMRLSVGSCSVE 720 Query: 2224 SQTVIVNRAFGMLSSTTSIPVKDLMNETTPVKVGGLKSNDGLDCFSCRDKWIISLFASVV 2403 SQ +I+ +A+ +LSS T+ +K++ E P+ G S RD+ IISLFASVV Sbjct: 721 SQNLIIRKAYSVLSSHTNFQLKEV--ERLPLTPGKYD-------ISLRDEGIISLFASVV 771 Query: 2404 VALRPQAHITNVKMIFQLFLTTQLYGHVPSAQALGSIINKLPVKVENTDPSKVFSLVEAM 2583 +A+ P+ +I N++++ LF+ T L G VP AQALGSI+NKL + S +L EA+ Sbjct: 772 IAVCPKTYIPNIRVLVHLFIITLLRGVVPVAQALGSILNKLVSTSSTAENSSDLTLEEAL 831 Query: 2584 DMIFSSRVWILCNEGPSKCLVAGNNNE-------IGFRNSLLVQSHAIEGLAWIGKGLLM 2742 D IF++++ + +C N NE +G N ++Q +AI GL+W+GKGLL+ Sbjct: 832 DAIFNTKISFSSTDMLQRCNGTSNGNEMVFTDICLGIANDRMLQINAICGLSWMGKGLLL 891 Query: 2743 RGHEKVKDIVVTLLSCLLS--------------NYEQGL--LSIMRSAADAFHILISDSE 2874 RGHEK+KDI + + CL+S N E+ + L +++ A DAFH+L+SDSE Sbjct: 892 RGHEKIKDITMIFMECLISGTKSASPLIKDSLENTEEQIQDLLVIKCATDAFHVLMSDSE 951 Query: 2875 ACLNKRLHATIRPLYKQRLFSIVLPVILSLISKADSSIQREMLYRAFANVVCGTPLSAIL 3054 CLN++ HATIRPLYKQR FS V+P++ +I+K+ SS+ R LYRAFA+++ TP+ AI+ Sbjct: 952 VCLNRKFHATIRPLYKQRFFSSVMPILQQIITKSHSSLSRSFLYRAFAHIMSDTPMVAIV 1011 Query: 3055 SEAKKLIPILVDSLSVLSEDVLHKDIVYSILLVLSGILTDKKRSEVVVENAHIIIRRLIA 3234 SEAKKLIP+L+D LS+L+E + KD++Y +LLVLSGIL +K E VVENAHIII LI Sbjct: 1012 SEAKKLIPVLLDCLSMLTE-IQDKDMLYGLLLVLSGILMEKNGQEAVVENAHIIINCLIK 1070 Query: 3235 LVSYPHMMVTRETAIQCLTAMSELPHARIYPMRTQVLLSISKTLDDPKRAVRQEAVKCRQ 3414 LV YPH M+ RETAIQCL A+SELPHARIYPMRTQVL +ISK LDD KRAVR EAVKCRQ Sbjct: 1071 LVGYPHKMLVRETAIQCLVALSELPHARIYPMRTQVLRAISKCLDDSKRAVRHEAVKCRQ 1130 Query: 3415 AWASI 3429 WAS+ Sbjct: 1131 TWASM 1135 >ref|XP_003597674.1| MMS19 nucleotide excision repair protein-like protein [Medicago truncatula] gi|355486722|gb|AES67925.1| MMS19 nucleotide excision repair protein-like protein [Medicago truncatula] Length = 1140 Score = 985 bits (2547), Expect = 0.0 Identities = 541/1151 (47%), Positives = 728/1151 (63%), Gaps = 31/1151 (2%) Frame = +1 Query: 73 MGDSTHCVKHIESFVDSSSTPXXXXXXXXXXXXLVNNDIITLEALVREMGLYLTTTDSFL 252 M ++T +HIES+VDSSSTP L+ + +TLEALVRE+ +YLT+TD+ + Sbjct: 1 MAETTQLTRHIESYVDSSSTPSHQVASLDAIVLLIKTNALTLEALVRELDMYLTSTDTLI 60 Query: 253 RARGTLLLAESLVCLASKPLENETVHSLIGFFTERLVDWKALHGALIGCLCLLRRKGNVG 432 R+RG LLLAE L + S L+++T+HSL+GFF ER+ DWKA+ GAL+GCL L+RRK VG Sbjct: 61 RSRGILLLAEVLTRINSISLDSKTIHSLVGFFKERMADWKAVRGALVGCLALIRRKSVVG 120 Query: 433 IVQGSKALAVAQCFLQNVQVQSLRESDRKXXXXXXXXXXXXYPNDVVHLEDVIIPGICES 612 +V GS A ++ Q FLQ++QVQSL DRK + + + LE+ +I GICE+ Sbjct: 121 MVTGSDAKSITQSFLQHLQVQSLGLYDRKLCFELLDYLLEHHVDSIASLEEDLIFGICEA 180 Query: 613 IDGEKDPECLMLAFQIVEALVHLYPDPTGPLASYATDLFENLSCYFPIHFTHPKGEDIDV 792 ID EKDPECLMLAF IVE+L LYPDP+G LAS+A D+F+ L YFPIHFTH D V Sbjct: 181 IDAEKDPECLMLAFHIVESLARLYPDPSGLLASFARDVFDLLEPYFPIHFTHQTSGDTHV 240 Query: 793 KKEELSRTLMMAFASTPLFEPFAIXXXXXXXXXXXXXAKVESLKYLIYCSDKYGAARMAK 972 ++++LSRTLM AFASTP+FEPF I AK++SL+YL CS KYGA R+AK Sbjct: 241 QRDDLSRTLMSAFASTPVFEPFVIPLLLQKLSSSLHSAKIDSLQYLRVCSSKYGAERIAK 300 Query: 973 HVKVMWSALKDIIYT-SPXXXXXXXXXXXDGMSFEESHTVTEALVLLQMVIHQNSNLLLD 1149 + +WS+LKD +YT DG++F +S V EAL LLQ +I QNS+ L+ Sbjct: 301 YTGAIWSSLKDTLYTYLGEPDFSFTLAPTDGINFPKSEVVVEALSLLQQLIVQNSSQLVS 360 Query: 1150 LIVSDEDMKRTVDNIHQFKEYNGITLQSKQKLHAVGCILYLSAKASIGSCNRVFEGFFQR 1329 LI+ DED+ ++I ++ Y+ I++Q K+KLHA+G ILY+SAK SI SCN VF+ F R Sbjct: 361 LIIDDEDVNFITNSIASYEMYDTISVQEKKKLHAIGRILYISAKTSIPSCNAVFQSLFLR 420 Query: 1330 LIHALGIS------LENSDAPLSSPCNFGATYLCVELLAGCRALVVGTEGNTSTNVLAQE 1491 ++ LG S L+N S NFG YLC+ELL+GCR LV+ ++ +E Sbjct: 421 MMDKLGFSASNIDGLQNGGILASQSVNFGFLYLCIELLSGCRELVILSDEK-------RE 473 Query: 1492 AWSNMLCSYGTSLSKIFSFTLVTGQDGSTQNAYLHYGVKGLQILATFPGNFYPLSASIYE 1671 + +L S L F L D + ++ GVKGLQILA F + +P+ S +E Sbjct: 474 TYCTILHSSSAVLFNAFGSVLAVTADRCPSHPDIYIGVKGLQILAMFHLDVFPIPKSTFE 533 Query: 1672 SILLKLISVVTLNFKDALLWKLALTALVEIGLFVDKSEDTEKAQSFDAIVVDRIASLLLC 1851 +IL K +S++ +F +LW L AL IG FV K ++EKA S+ + VVD+I +L Sbjct: 534 NILKKFMSIIIEDFGQTVLWNATLKALFHIGSFVQKFSESEKAMSYRSFVVDKIMEMLSL 593 Query: 1852 DDLTMPSQLKLEIISSIGTTGLSYMQKILQGXXXXXXXXXXGVYVNGDPELLETATSLLE 2031 DD+ +P LK+E + IG TG+ M ILQ V+ N ETA LLE Sbjct: 594 DDIALPFSLKVEALYDIGMTGMKNMLTILQAMEGAIFTNLSEVHSNLTSH--ETAVQLLE 651 Query: 2032 CYSCKVLPWFDSIGGHEEVQFHFALNIWDQIENNMSFCISSQEQKLLPAAMTAMQYAVGR 2211 CYSCK+LPW GG EE FA++IW+Q N M F +++ LL A M AM+ +VG Sbjct: 652 CYSCKLLPWILKNGGAEEFVVQFAVDIWNQAGNCMDFNSPFEDKGLLDAMMKAMKVSVGC 711 Query: 2212 CLVESQTVIVNRAFGMLSSTTSIPVKDLMNETTPVKVGGLKSNDGLDCFSCRDKWIISLF 2391 C VESQ V + +A+ LSS T + D VG L G S RD+ I+ LF Sbjct: 712 CSVESQNVTIQKAYSTLSSHTKFQLND---------VGRLPLTSGKYDISPRDEGILLLF 762 Query: 2392 ASVVVALRPQAHITNVKMIFQLFLTTQLYGHVPSAQALGSIINKLPVKVENTDPSKVFSL 2571 ASV++ALRP+ HI N++ + LF+ T L G VP AQALGS++NKL K + S +L Sbjct: 763 ASVIIALRPKTHIPNIRGLLHLFIITLLKGVVPVAQALGSMLNKLTSKSNGAEKSDELTL 822 Query: 2572 VEAMDMIFSSRVWILCNEGPSKCLVAGNNNEI-------GFRNSLLVQSHAIEGLAWIGK 2730 EA+D+IF++++W N + N ++I G N L+QS+AI GL+WIGK Sbjct: 823 EEALDIIFNTKIWFSSNNMLQIYNGSSNGSDIVLTDLCLGITNDRLLQSNAICGLSWIGK 882 Query: 2731 GLLMRGHEKVKDIVVTLLSCLLSNYEQGLLSIM-----------------RSAADAFHIL 2859 GLL+RGHEK+KDI + CL+S+ + + ++ + A +AFH+L Sbjct: 883 GLLLRGHEKIKDITMIFTECLISDRRKTSVPLVEGSLENTEKQKCDPLARKCATEAFHVL 942 Query: 2860 ISDSEACLNKRLHATIRPLYKQRLFSIVLPVILSLISKADSSIQREMLYRAFANVVCGTP 3039 +SD+E CLN++ HAT+RPLYKQR FS ++P+ L LIS++DS + R +L RAFA+V+ TP Sbjct: 943 MSDAEDCLNRKFHATVRPLYKQRFFSSMMPIFLQLISRSDSLLSRSLLLRAFAHVMSDTP 1002 Query: 3040 LSAILSEAKKLIPILVDSLSVLSEDVLHKDIVYSILLVLSGILTDKKRSEVVVENAHIII 3219 L IL+EAKKLIP+L+D L +L+ED+ KDI+Y +LLVLSG+LT+K E V+ENAHIII Sbjct: 1003 LIVILNEAKKLIPVLLDCLFMLTEDIQDKDILYGLLLVLSGMLTEKNGQEAVIENAHIII 1062 Query: 3220 RRLIALVSYPHMMVTRETAIQCLTAMSELPHARIYPMRTQVLLSISKTLDDPKRAVRQEA 3399 LI LV YPH + RETAIQCL A+SELPH RIYP+RTQVL +I K LDD KR+VR EA Sbjct: 1063 NGLIKLVDYPHKTLVRETAIQCLVALSELPHVRIYPLRTQVLQAIFKCLDDTKRSVRNEA 1122 Query: 3400 VKCRQAWASIA 3432 VKCRQAWASIA Sbjct: 1123 VKCRQAWASIA 1133