BLASTX nr result
ID: Sinomenium21_contig00012987
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00012987
         (1406 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CBI36057.3| unnamed protein product [Vitis vinifera]               440   e-121
ref|XP_006385450.1| hypothetical protein POPTR_0003s04720g [Popu...    381   e-103
ref|XP_007024314.1| MMS19 nucleotide excision repair protein, pu...    379   e-102
ref|XP_007024313.1| MMS19 nucleotide excision repair protein, pu...    379   e-102
ref|XP_007024312.1| MMS19 nucleotide excision repair protein, pu...    379   e-102
ref|XP_007024310.1| MMS19 nucleotide excision repair protein, pu...    379   e-102
ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair ...    372   e-100
ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair ...    372   e-100
ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citr...    370   e-100
ref|XP_002515963.1| DNA repair/transcription protein met18/mms19...    370   e-100
ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prun...    361   4e-97
ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304...    342   2e-91
gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis]      335   2e-89
ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair ...    329   2e-87
ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair ...    324   5e-86
ref|XP_006595125.1| PREDICTED: DNA repair/transcription protein ...    313   2e-82
ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein ...    313   2e-82
ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair ...    311   4e-82
gb|EYU21515.1| hypothetical protein MIMGU_mgv1a000493mg [Mimulus...
310 1e-81 ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phas... 306 1e-80 >emb|CBI36057.3| unnamed protein product [Vitis vinifera] Length = 1146 Score = 440 bits (1132), Expect = e-121 Identities = 253/476 (53%), Positives = 314/476 (65%), Gaps = 8/476 (1%) Frame = -1 Query: 1406 SEENSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFL 1227 S NS+ C+ + V SE NFGALYLCIELL ACRDLV+G E + + V E C + Sbjct: 428 SVRNSSGDCLPNFDYVFSERLNFGALYLCIELLAACRDLVVGSEELTSKSVSAQESWCCM 487 Query: 1226 LQRFSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILII 1047 L FS L AF S+L +T +D +A +Y GVKGLQILATFP FL ISKS+FE +L+ Sbjct: 488 LHSFSSLLMKAFSSVLDASTDKDAYEADIYSGVKGLQILATFPGEFLPISKSIFENVLLT 547 Query: 1046 FMSIITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAM 867 F+SII TLLWK LKAL+QIG FI++F +SE SY IVVEKIVSL+FLDD + Sbjct: 548 FISIIVEDFNKTLLWKLALKALVQIGSFIDRFHESEKALSYNYIVVEKIVSLMFLDDFGL 607 Query: 866 PLPLLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNK 690 P L LEA+SDIGTT L ML++ QGLEDAI ANL E V GN K I +LECYSNK Sbjct: 608 PFQLRLEAISDIGTTGLNVMLKIVQGLEDAIFANLSEVYVHGNLKSAKIAVQLLECYSNK 667 Query: 689 VLPWFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSE 510 +LP G F+DV RF++N+WNQIE+S F +G Q NELLN M M+LAVG CSE Sbjct: 668 LLPGIHGAGDFEDVLSRFAVNIWNQIENSMAFSVGAQ--ENELLNATMTAMKLAVGSCSE 725 Query: 509 DNQVLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASV 330 +Q I++KA+ VLSS L +SM + ++LE LQ TQ+ FSCRD+W ISLFAS Sbjct: 726 GSQGKIIKKAYSVLSSCPSFTLMESMPITGTVQLEGLQHTQDLECFSCRDKWVISLFASA 785 Query: 329 IVALRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKL--PSNNMGNLNSCTVEDA 156 I+A+RPQT + N+RV+L F T LLKGHVP+AQALGS++NKL SN + ++CT+EDA Sbjct: 786 IIAVRPQTHIPNIRVVLHLFMTNLLKGHVPAAQALGSMVNKLCPKSNGVEISSTCTLEDA 845 Query: 155 LSIIFEIGLF-----GNIPSWKFHPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 L IIF L+ G + VD N + L NLC S L+Q +I GLAW Sbjct: 846 LDIIFNTSLWDSHNHGPLKRCSGIGVD-NEMGLANLCLSASNCQLLQVCAIEGLAW 900 >ref|XP_006385450.1| hypothetical protein POPTR_0003s04720g [Populus trichocarpa] 
gi|550342418|gb|ERP63247.1| hypothetical protein POPTR_0003s04720g [Populus trichocarpa] Length = 913 Score = 381 bits (979), Expect = e-103 Identities = 226/478 (47%), Positives = 301/478 (62%), Gaps = 13/478 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 N + C + C+ S+ PN G+LYLC+ELL ACRDLV+ + Q V E C LLQR Sbjct: 199 NGSGTCSFNDDCIISKRPNHGSLYLCVELLGACRDLVISSGDLASQCVSANETWCCLLQR 258 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 FS L+ F S LAT+T + A VY GVKGLQILATFP +L +SKS E+IL+ F+S Sbjct: 259 FSTSLSKIFSSTLATSTDKPAHDADVYLGVKGLQILATFPGGYLLVSKSTCESILMTFVS 318 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 IIT TLLWK ++KAL+QIG+FI ++SE SYM IVV+KIVS+I D+ +P Sbjct: 319 IITVDFNKTLLWKLSVKALVQIGLFIHGSNESEKSMSYMDIVVQKIVSMISSDNHDIPFQ 378 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLP 681 L LEA+SDIGT+ L++ML++ GL++ I ANL A V+GN K ++ +LECYSN++LP Sbjct: 379 LQLEAISDIGTSGLQYMLKIVTGLQEVIRANL--AEVQGNVKSAKVIIHLLECYSNELLP 436 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 W F++V L+F +++WNQIE+ F GI K ELL+ M M+LAV CS ++Q Sbjct: 437 WIQKYEVFEEVLLQFVVSIWNQIENCMAFPDGIFEK--ELLDATMKVMKLAVASCSVESQ 494 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 +I+ KA+ VLSS+ FL KDS+ S+ +LEEL+ TQE + FS RDEW SLF SVI+A Sbjct: 495 NIIIDKAYTVLSSSTFLSTKDSLS-SLQAQLEELEDTQETNKFSSRDEWIHSLFISVIIA 553 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKLPSNNMGNLNS--CTVEDALSI 147 L PQT + N+R +L F LKG+V +AQALGS++NKL G S CT E+A+ I Sbjct: 554 LHPQTRIPNIRTVLHFLMIVFLKGYVTAAQALGSLVNKLDLKTSGTEYSGGCTFEEAMDI 613 Query: 146 IF----------EIGLFGNIPSWKFHPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 IF G G W + L NLC S L++ +SI+GLAW Sbjct: 614 IFGKNLSSSDHVSAGRSGITGYW-------SETGLTNLCLGAANSGLLEIHSIVGLAW 664 >ref|XP_007024314.1| MMS19 nucleotide excision repair protein, putative isoform 5 [Theobroma cacao] gi|508779680|gb|EOY26936.1| MMS19 nucleotide excision repair 
protein, putative isoform 5 [Theobroma cacao] Length = 1157 Score = 379 bits (974), Expect = e-102 Identities = 221/476 (46%), Positives = 304/476 (63%), Gaps = 11/476 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NS+ SD S + + N GALYL IELL ACRD++ E T E +LL+ Sbjct: 430 NSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRS 489 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 FS LT AFCS + T +D A VY GVKGL ILATFP +L ISK VFE IL+ F+S Sbjct: 490 FSSSLTKAFCSA-SICTSEDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVS 548 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 I+T TLLWK LKAL+QIG FIEK +SE E SY+ +VVEKIVS L D ++P P Sbjct: 549 IVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFP 608 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNS-KVDILGPVLECYSNKVLP 681 L LEA+S+IGT+ +ML+V +GLE+AI ANL E V G+S +I+ +L+CYS+KV+P Sbjct: 609 LRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIP 668 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 W GFD+V L+F+I++WNQIE S +F+ K+ E+L+ MM M+LAV CSE+NQ Sbjct: 669 WIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKI-EVLDVMMKAMKLAVASCSEENQ 727 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 +IVQK++ +LSS+ PLK+ + E Q+ Q ++ S RDEW +SLFA+V++A Sbjct: 728 NIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQVDNS-SSRDEWILSLFAAVVIA 780 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKLPSNNMGNLNSCTVEDALSIIF 141 + P+T + N++ +L F TTLLKG+V +AQALGS++NKL + G CT+E+ + II Sbjct: 781 VHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGLESAGVQTDCTLEEVMDIIL 840 Query: 140 EIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + L W FH +++++LINLC S G +Q ++I+GLAW Sbjct: 841 NLSL------WIFHSNSSADIQAKMTSAHDISLINLCSSIGSCTSLQIHAIVGLAW 890 >ref|XP_007024313.1| MMS19 nucleotide excision repair protein, putative isoform 4 [Theobroma cacao] gi|508779679|gb|EOY26935.1| MMS19 nucleotide excision repair protein, putative isoform 4 [Theobroma cacao] 
Length = 1136 Score = 379 bits (974), Expect = e-102 Identities = 221/476 (46%), Positives = 304/476 (63%), Gaps = 11/476 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NS+ SD S + + N GALYL IELL ACRD++ E T E +LL+ Sbjct: 430 NSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRS 489 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 FS LT AFCS + T +D A VY GVKGL ILATFP +L ISK VFE IL+ F+S Sbjct: 490 FSSSLTKAFCSA-SICTSEDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVS 548 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 I+T TLLWK LKAL+QIG FIEK +SE E SY+ +VVEKIVS L D ++P P Sbjct: 549 IVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFP 608 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNS-KVDILGPVLECYSNKVLP 681 L LEA+S+IGT+ +ML+V +GLE+AI ANL E V G+S +I+ +L+CYS+KV+P Sbjct: 609 LRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIP 668 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 W GFD+V L+F+I++WNQIE S +F+ K+ E+L+ MM M+LAV CSE+NQ Sbjct: 669 WIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKI-EVLDVMMKAMKLAVASCSEENQ 727 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 +IVQK++ +LSS+ PLK+ + E Q+ Q ++ S RDEW +SLFA+V++A Sbjct: 728 NIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQVDNS-SSRDEWILSLFAAVVIA 780 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKLPSNNMGNLNSCTVEDALSIIF 141 + P+T + N++ +L F TTLLKG+V +AQALGS++NKL + G CT+E+ + II Sbjct: 781 VHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGLESAGVQTDCTLEEVMDIIL 840 Query: 140 EIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + L W FH +++++LINLC S G +Q ++I+GLAW Sbjct: 841 NLSL------WIFHSNSSADIQAKMTSAHDISLINLCSSIGSCTSLQIHAIVGLAW 890 >ref|XP_007024312.1| MMS19 nucleotide excision repair protein, putative isoform 3 [Theobroma cacao] gi|508779678|gb|EOY26934.1| MMS19 nucleotide excision repair protein, putative isoform 3 [Theobroma cacao] Length = 1062 Score = 379 bits (974), Expect = 
e-102 Identities = 221/476 (46%), Positives = 304/476 (63%), Gaps = 11/476 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NS+ SD S + + N GALYL IELL ACRD++ E T E +LL+ Sbjct: 430 NSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRS 489 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 FS LT AFCS + T +D A VY GVKGL ILATFP +L ISK VFE IL+ F+S Sbjct: 490 FSSSLTKAFCSA-SICTSEDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVS 548 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 I+T TLLWK LKAL+QIG FIEK +SE E SY+ +VVEKIVS L D ++P P Sbjct: 549 IVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFP 608 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNS-KVDILGPVLECYSNKVLP 681 L LEA+S+IGT+ +ML+V +GLE+AI ANL E V G+S +I+ +L+CYS+KV+P Sbjct: 609 LRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIP 668 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 W GFD+V L+F+I++WNQIE S +F+ K+ E+L+ MM M+LAV CSE+NQ Sbjct: 669 WIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKI-EVLDVMMKAMKLAVASCSEENQ 727 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 +IVQK++ +LSS+ PLK+ + E Q+ Q ++ S RDEW +SLFA+V++A Sbjct: 728 NIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQVDNS-SSRDEWILSLFAAVVIA 780 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKLPSNNMGNLNSCTVEDALSIIF 141 + P+T + N++ +L F TTLLKG+V +AQALGS++NKL + G CT+E+ + II Sbjct: 781 VHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGLESAGVQTDCTLEEVMDIIL 840 Query: 140 EIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + L W FH +++++LINLC S G +Q ++I+GLAW Sbjct: 841 NLSL------WIFHSNSSADIQAKMTSAHDISLINLCSSIGSCTSLQIHAIVGLAW 890 >ref|XP_007024310.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|590619491|ref|XP_007024311.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] gi|508779676|gb|EOY26932.1| MMS19 nucleotide excision repair protein, putative isoform 1 
[Theobroma cacao] gi|508779677|gb|EOY26933.1| MMS19 nucleotide excision repair protein, putative isoform 1 [Theobroma cacao] Length = 1149 Score = 379 bits (974), Expect = e-102 Identities = 221/476 (46%), Positives = 304/476 (63%), Gaps = 11/476 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NS+ SD S + + N GALYL IELL ACRD++ E T E +LL+ Sbjct: 430 NSSGNLSSDDSIMIPKRYNHGALYLSIELLSACRDVIASSETIIAASAHTEETWSYLLRS 489 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 FS LT AFCS + T +D A VY GVKGL ILATFP +L ISK VFE IL+ F+S Sbjct: 490 FSSSLTKAFCSA-SICTSEDSHDADVYFGVKGLLILATFPEGYLLISKPVFEKILMTFVS 548 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 I+T TLLWK LKAL+QIG FIEK +SE E SY+ +VVEKIVS L D ++P P Sbjct: 549 IVTVDYSNTLLWKLALKALVQIGSFIEKCHESEKEPSYLGLVVEKIVSFSSLGDFSIPFP 608 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNS-KVDILGPVLECYSNKVLP 681 L LEA+S+IGT+ +ML+V +GLE+AI ANL E V G+S +I+ +L+CYS+KV+P Sbjct: 609 LRLEALSEIGTSGKSYMLKVVEGLEEAIYANLSEVYVHGSSNSAEIVTQLLKCYSDKVIP 668 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 W GFD+V L+F+I++WNQIE S +F+ K+ E+L+ MM M+LAV CSE+NQ Sbjct: 669 WIQCAKGFDEVPLQFAIHIWNQIELSMVFNATQTNKI-EVLDVMMKAMKLAVASCSEENQ 727 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 +IVQK++ +LSS+ PLK+ + E Q+ Q ++ S RDEW +SLFA+V++A Sbjct: 728 NIIVQKSYHILSSSTSFPLKEL------FRQESFQIVQVDNS-SSRDEWILSLFAAVVIA 780 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKLPSNNMGNLNSCTVEDALSIIF 141 + P+T + N++ +L F TTLLKG+V +AQALGS++NKL + G CT+E+ + II Sbjct: 781 VHPETYVPNIKPLLYLFMTTLLKGNVVTAQALGSVVNKLGLESAGVQTDCTLEEVMDIIL 840 Query: 140 EIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + L W FH +++++LINLC S G +Q ++I+GLAW Sbjct: 841 NLSL------WIFHSNSSADIQAKMTSAHDISLINLCSSIGSCTSLQIHAIVGLAW 890 >ref|XP_006465695.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X2 [Citrus sinensis] Length = 1151 
Score = 372 bits (955), Expect = e-100 Identities = 218/478 (45%), Positives = 294/478 (61%), Gaps = 13/478 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NST C + V + N GALYLCIEL+ ACR+L+ E E LLQ Sbjct: 428 NSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAPANERWYCLLQS 487 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 +S L A S L T+ +D + +VY GVKGL IL TF L IS S+FE IL+ F S Sbjct: 488 YSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFRGGSLIISNSIFENILLTFTS 547 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 II + E TLLWK LKAL+ IG FI++F++SE SYM +V+EKIVSL D +MP P Sbjct: 548 IIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSLASSHDFSMPFP 607 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLP 681 L LEA+S+IG T ++L++ QGLE+A+ ANL E V GN K +++ +LECYSNKVLP Sbjct: 608 LKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQLLECYSNKVLP 667 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 GGF++V LRF++N+WN IE S F + K LL+ M M+LAVG CS ++Q Sbjct: 668 RIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEK--GLLDATMKAMKLAVGSCSVESQ 725 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 ++ QKAF VLS + PL+D+ ++P+ L E QLTQE S R+ W SLFASVI+A Sbjct: 726 NIVFQKAFTVLSLGTYFPLEDAAS-NIPILLNEFQLTQETSISSSREAWICSLFASVIIA 784 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNK--LPSNNMGNLNSCTVEDALSI 147 RPQT + NVR+++R F TTLLKG+VP+AQALGS++NK L SN +CT+E+A+ I Sbjct: 785 ARPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGNCTLEEAMDI 844 Query: 146 IFEIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 IF+ L W F+ + +++ L ++C +Q ++I GLAW Sbjct: 845 IFDSKL------WSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLAW 896 >ref|XP_006465694.1| PREDICTED: MMS19 nucleotide excision repair protein homolog isoform X1 [Citrus sinensis] Length = 1155 Score = 372 bits (955), Expect = e-100 Identities = 218/478 (45%), Positives = 294/478 (61%), Gaps = 13/478 (2%) Frame = -1 Query: 1397 
NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NST C + V + N GALYLCIEL+ ACR+L+ E E LLQ Sbjct: 428 NSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAPANERWYCLLQS 487 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 +S L A S L T+ +D + +VY GVKGL IL TF L IS S+FE IL+ F S Sbjct: 488 YSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFRGGSLIISNSIFENILLTFTS 547 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 II + E TLLWK LKAL+ IG FI++F++SE SYM +V+EKIVSL D +MP P Sbjct: 548 IIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSLASSHDFSMPFP 607 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLP 681 L LEA+S+IG T ++L++ QGLE+A+ ANL E V GN K +++ +LECYSNKVLP Sbjct: 608 LKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQLLECYSNKVLP 667 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 GGF++V LRF++N+WN IE S F + K LL+ M M+LAVG CS ++Q Sbjct: 668 RIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEK--GLLDATMKAMKLAVGSCSVESQ 725 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 ++ QKAF VLS + PL+D+ ++P+ L E QLTQE S R+ W SLFASVI+A Sbjct: 726 NIVFQKAFTVLSLGTYFPLEDAAS-NIPILLNEFQLTQETSISSSREAWICSLFASVIIA 784 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNK--LPSNNMGNLNSCTVEDALSI 147 RPQT + NVR+++R F TTLLKG+VP+AQALGS++NK L SN +CT+E+A+ I Sbjct: 785 ARPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGNCTLEEAMDI 844 Query: 146 IFEIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 IF+ L W F+ + +++ L ++C +Q ++I GLAW Sbjct: 845 IFDSKL------WSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLAW 896 >ref|XP_006426876.1| hypothetical protein CICLE_v10024743mg [Citrus clementina] gi|557528866|gb|ESR40116.1| hypothetical protein CICLE_v10024743mg [Citrus clementina] Length = 1155 Score = 370 bits (951), Expect = e-100 Identities = 217/478 (45%), Positives = 294/478 (61%), Gaps = 13/478 (2%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NST C + V + N 
GALYLCIEL+ ACR+L+ E E LLQ Sbjct: 428 NSTQDCFPNDGNVLRGKLNHGALYLCIELMTACRELMASSEEFKSVAAPANERWYCLLQS 487 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 +S L A S L T+ +D + +VY GVKGL IL TF L IS S+FE IL+ F S Sbjct: 488 YSASLAKALRSTLETSANEDSYETNVYFGVKGLLILGTFSGGSLIISNSIFENILLTFTS 547 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 II + E TLLWK LKAL+ IG FI++F++SE SYM +V+EKIVSL D +MP P Sbjct: 548 IIISEFENTLLWKLALKALVHIGSFIDRFNESEKALSYMDVVIEKIVSLASSHDFSMPFP 607 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLP 681 L LEA+S+IG T ++L++ QGLE+A+ ANL E V GN K +++ +LECYSNKVLP Sbjct: 608 LKLEAISEIGATGRNYLLKIVQGLEEAVCANLYEVLVHGNPKSAEVVVQLLECYSNKVLP 667 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 GGF++V LRF++N+WN IE S F + K LL+ M M+LAVG CS ++Q Sbjct: 668 RIHEIGGFEEVLLRFAVNIWNLIEKSVTFSSQVHEK--GLLDATMKAMKLAVGSCSVESQ 725 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 ++ QKAF VLS + PL+D+ ++P++L E QLTQE S R+ W SLFASVI+A Sbjct: 726 NIVFQKAFTVLSLGTYFPLEDAAS-NIPIQLNEFQLTQETSISSSREAWICSLFASVIIA 784 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNK--LPSNNMGNLNSCTVEDALSI 147 PQT + NVR+++R F TTLLKG+VP+AQALGS++NK L SN +CT+E+A+ I Sbjct: 785 ACPQTHIPNVRLVIRLFMTTLLKGNVPAAQALGSMVNKLGLKSNGTEVHGNCTLEEAMDI 844 Query: 146 IFEIGLFGNIPSWKFHP----------VDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 IF+ L W F+ + +++ L ++C +Q ++I GLAW Sbjct: 845 IFDSKL------WSFNDSVTLRSNGGLENGSSIGLTDICRGATNIRSLQVHAIAGLAW 896 >ref|XP_002515963.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] gi|223544868|gb|EEF46383.1| DNA repair/transcription protein met18/mms19, putative [Ricinus communis] Length = 1174 Score = 370 bits (950), Expect = e-100 Identities = 219/488 (44%), Positives = 303/488 (62%), Gaps = 22/488 (4%) Frame = -1 Query: 1400 ENSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQ 1221 EN++ C S+ +CV +++PN+G+ YL I+LL ACRDL + + Q + T E C LLQ Sbjct: 429 
ENTSGACHSNENCVKAKQPNYGSFYLSIKLLGACRDLSTSSDNLASQCISTNETYCCLLQ 488 Query: 1220 RFSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFM 1041 RFS LT F + LAT+T +Y GVKGLQILATFP +L +SK F+ IL+ F+ Sbjct: 489 RFSTSLTETFSAALATSTSGPAQDVDMYLGVKGLQILATFPGGYLFLSKLTFDNILMTFL 548 Query: 1040 SIITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPL 861 SIIT TLLW LKAL+QIG F+ ++S+ E SY+ IVV K++ L D +MP Sbjct: 549 SIITVDFNKTLLWNQALKALVQIGSFVHGCNESDKEMSYVDIVVGKMILLASSPDFSMPW 608 Query: 860 PLLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLE---------------ASVEGNSK-V 729 L L A+S IG + K+ML+V GLE+AI ANL E V+GN K Sbjct: 609 SLKLTAISSIGMSGQKYMLKVFLGLEEAIRANLAEIYVCMIKKKIYVLYSCLVQGNLKSA 668 Query: 728 DILGPVLECYSNKVLPWFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKM 549 IL +LECYS+++LPW T GF++V ++F +N+WNQIE+ F + GK LL+ + Sbjct: 669 KILLQLLECYSDELLPWIQKTEGFEEVLMQFVVNLWNQIENFNAFTVAFHGK-ESLLDAI 727 Query: 548 MMTMRLAVGGCSEDNQVLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFS 369 M M+ AV CS ++Q +I+ KA+GVLSS+ FLPLK+S+ + ++LE + Q+ S Sbjct: 728 MKVMKDAVAFCSVESQNVIIYKAYGVLSSSTFLPLKESLSEN-SVQLECFRAIQQMDRLS 786 Query: 368 CRDEWAISLFASVIVALRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKL--PSN 195 RDEW SLFASVI+ALRPQT + N R++L F T LLKGHV +A+ALGS++NKL SN Sbjct: 787 SRDEWIHSLFASVIIALRPQTHIPNTRIVLHLFITALLKGHVTTAEALGSLVNKLDQKSN 846 Query: 194 NMGNLNSCTVEDALSIIFEIGL---FGNIPSWKFHPV-DSNNVALINLCDSEGKSNLVQS 27 + CT+E+A+ IIF I L FGN S +F + + + LI LC ++ Sbjct: 847 DACISGDCTIEEAMDIIFSINLLCSFGNGSSGRFDRTRNGDEMDLIKLCLDAPNLAWIKI 906 Query: 26 NSIIGLAW 3 +I+GLAW Sbjct: 907 PAIVGLAW 914 >ref|XP_007217541.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica] gi|462413691|gb|EMJ18740.1| hypothetical protein PRUPE_ppa023072mg [Prunus persica] Length = 1158 Score = 361 bits (927), Expect = 4e-97 Identities = 214/485 (44%), Positives = 299/485 (61%), Gaps = 17/485 (3%) Frame = -1 Query: 1406 SEENSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTC-F 1230 S NS C + + S++ NFGALYLC+EL+ ACRDL++ + + + T ++TC + Sbjct: 428 
SVTNSAGDCTLNENTFPSKKFNFGALYLCVELIAACRDLIMRSKDLAPKP-DTPQETCRY 486 Query: 1229 LLQRFSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILI 1050 +LQ F+ L AF S LAT + A +Y VKGLQILATFP FL ISK +F IL Sbjct: 487 MLQSFADSLVNAFSSSLATNANEVAHGADIYFKVKGLQILATFPGDFLPISKFLFANILT 546 Query: 1049 IFMSIITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSA 870 I MSII LLWK LKAL+ IG F++ + +SE YM VV+K VSL+ DD Sbjct: 547 ILMSIILVDFNKILLWKLVLKALVHIGSFVDVYHESEKALGYMGAVVDKTVSLVSRDDVK 606 Query: 869 MPLPLLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSN 693 MP L LEA S+IG + ML++ QG+E+AI A L + V GN K + +LECY N Sbjct: 607 MPFSLKLEAASEIGASGRNHMLKIVQGMEEAIVAKLSD-YVHGNLKSAEKTIQLLECYCN 665 Query: 692 KVLPWFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCS 513 K+L W TGG ++V LRF IN+WN +ES K D IQ + ELL+ MM M+LA+G CS Sbjct: 666 KILSWINETGGLEEVLLRFVINIWNCVESCK--DFSIQVQEEELLDATMMAMKLAIGSCS 723 Query: 512 EDNQVLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHT----------FSCR 363 E++Q +I+ KA+ V+SS+I +P K+S+D + ++LEEL ++++ FS R Sbjct: 724 EESQNIIIHKAYSVISSSISIPFKESLDATSSIQLEELSVSEQIDNSSHRDDQIDKFSLR 783 Query: 362 DEWAISLFASVIVALRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKL--PSNNM 189 DEW +S FASVI+A+RP+ + NV+ IL F TT+LKG VP+AQALGS++NKL SN Sbjct: 784 DEWILSHFASVIIAVRPKAQIVNVKGILHLFMTTVLKGCVPAAQALGSVINKLGTKSNET 843 Query: 188 GNLNSCTVEDALSIIFEIGLFGNIPSWKFHPVDSNN---VALINLCDSEGKSNLVQSNSI 18 N CT+E+A+ +IF L+ + S N V L +LC + L++ +++ Sbjct: 844 ANSIDCTLEEAVDMIFRTKLWNLNENGVLRTCGSGNGSKVGLTDLCLGFSSNKLLRVHAV 903 Query: 17 IGLAW 3 +GLAW Sbjct: 904 VGLAW 908 >ref|XP_004302857.1| PREDICTED: uncharacterized protein LOC101304108 [Fragaria vesca subsp. 
vesca] Length = 1149 Score = 342 bits (877), Expect = 2e-91 Identities = 206/471 (43%), Positives = 276/471 (58%), Gaps = 6/471 (1%) Frame = -1 Query: 1397 NSTWGCISDVSCVCSEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQR 1218 NS+ C + S+ FGALY C+E + ACRDL++ ++ E C +LQ Sbjct: 431 NSSKDCTLKENSFSSKRFKFGALYFCVEFIAACRDLIMRTNDHDEKFGTADETCCCMLQS 490 Query: 1217 FSGQLTGAFCSILATTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMS 1038 + L AFC+ LA + A +Y VKGLQ+LATFP FL+I K++FE +L MS Sbjct: 491 SAPTLITAFCTTLAQISCNVADDADIYFKVKGLQMLATFPGYFLQIPKAMFENVLKTLMS 550 Query: 1037 IITAGLETTLLWKHTLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLP 858 II + LLWK LKAL IG F++ +SE SY VVEK +SL DD +P P Sbjct: 551 IILVDFDKPLLWKLALKALAHIGSFVDVHLESEKAQSYTSFVVEKTISLP-QDDFDVPFP 609 Query: 857 LLLEAMSDIGTTCLKFMLRVNQGLEDAISANLLEASVEGNSKV-DILGPVLECYSNKVLP 681 L LEA+ +IG + MLR+ QGLEDAI ANL + + G+ K + +LECYSNK++ Sbjct: 610 LKLEAVFEIGASRPNHMLRIIQGLEDAIVANLSKTFIHGDLKAAEKTIQLLECYSNKIIS 669 Query: 680 WFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQ 501 W GG ++V RF I++WN +E K +Q K LL+ M M+LAVG CSE++Q Sbjct: 670 WIDENGGLEEVLCRFVISIWNCLERCKDSSNQVQDK--GLLDATMTAMKLAVGSCSEESQ 727 Query: 500 VLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVA 321 +I+QKA+G LSS I +P KDS D S KLE L L ++ S RDEW SLFASVI+A Sbjct: 728 NIIIQKAYGALSSGISIPFKDSTDDSSLAKLETLHLFEQLDKLSPRDEWIFSLFASVIIA 787 Query: 320 LRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKL--PSNNMGNLNSCTVEDALSI 147 +RP+TP+ N + IL F T L+KG P+AQALGS++NKL SN + +CT+E+A+ I Sbjct: 788 MRPRTPIANAKGILHLFMTALVKGCTPAAQALGSVINKLGIQSNEITISTACTLEEAMGI 847 Query: 146 IFEIGLFG---NIPSWKFHPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 IF L+ N S NV L LC + L+Q + I GLAW Sbjct: 848 IFRSKLWNIGENGVLRGSGTSHSRNVGLTELCLGVSSNKLLQVHVITGLAW 898 >gb|EXB74582.1| hypothetical protein L484_026279 [Morus notabilis] Length = 1210 Score = 335 bits (860), Expect = 2e-89 Identities = 201/455 (44%), Positives = 275/455 (60%), Gaps = 8/455 (1%) Frame = -1 Query: 1343 NFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILATTTK 
1164 NFGALYLC+ELL ACRDLV+ + + E C +LQ F L A CSIL TT Sbjct: 479 NFGALYLCMELLAACRDLVIYSRELASNSIPAHETFCCILQSFCVSLIDALCSILETTAN 538 Query: 1163 QDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKHTLKA 984 + +Y V+ LQILATFP L IS +VF+ IL MSII LWK LKA Sbjct: 539 EGADDVDIYLRVRSLQILATFPEDLLAISDNVFKNILTTLMSIIFKDFNQKFLWKLALKA 598 Query: 983 LMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCLKFML 804 L+ IG F+ ++ +SE SY IVVEK+VS + +D+ +P PL LEA+S+IG + ML Sbjct: 599 LVHIGSFVSRY-ESEKAQSYNSIVVEKMVSWVSVDNCTLPFPLKLEAVSEIGASGRNHML 657 Query: 803 RVNQGLEDAISANLLEASVEGN-SKVDILGPVLECYSNKVLPWFLNTGGFDDVALRFSIN 627 + QGLE AI + + + V GN S ++ +L+ YS KV+PW T G +++ LRF+ N Sbjct: 658 NIVQGLEGAIFSYVSDFYVHGNVSSAEVAIQLLQFYSEKVIPWIHETEGLEEILLRFATN 717 Query: 626 MWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSNIFLP 447 +W+ +ES ++ +Q K LL+ +MM M+L VG CSE+ Q +I+QKA+ VLSSN L Sbjct: 718 IWDHVESWISCNVEVQEK--GLLDAIMMAMKLTVGSCSEEIQYIILQKAYTVLSSNTSLL 775 Query: 446 LKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQNVRVILRFFA 267 LK S S+P++LEE QL Q S RDE +SLFASVI+A+RP+T + N++ IL F Sbjct: 776 LKKSSLTSIPVQLEESQLIQHVDNISHRDELVLSLFASVIIAVRPRTEIPNMKEILYLFL 835 Query: 266 TTLLKGHVPSAQALGSILNKLPSNNMGN--LNSCTVEDALSIIFEIGLFGNIPSWKFHPV 93 TTLL+GHVPSAQALGS++NK + T+EDA+ IIF+ SW F Sbjct: 836 TTLLRGHVPSAQALGSMINKFDTKAKSTEISRESTLEDAMDIIFK------TKSWFFRDN 889 Query: 92 D-----SNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + N + L +LC + +Q ++I+GLAW Sbjct: 890 EVLQRNGNGMGLKDLCLGLMNNIQLQVHAIVGLAW 924 >ref|XP_004141784.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Cucumis sativus] Length = 1147 Score = 329 bits (843), Expect = 2e-87 Identities = 188/453 (41%), Positives = 280/453 (61%), Gaps = 6/453 (1%) Frame = -1 Query: 1343 NFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCF-LLQRFSGQLTGAFCSILATTT 1167 NFGALYLCIE++ ACR+L++ S + +V++ + +LQ FS + S + Sbjct: 444 NFGALYLCIEVIAACRNLIVS----SDENTCSVKEKSYSMLQIFSCSVVQLLSSTFSGIV 499 Query: 1166 KQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKHTLK 987 K+D+ A YC VKGL L+TFP +S+ 
+FE IL+ FMS IT + LW H LK Sbjct: 500 KRDLHDAEFYCAVKGLLNLSTFPVGSSPVSRVIFEDILLEFMSFITVNFKFGSLWNHALK 559 Query: 986 ALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCLKFM 807 AL IG F++K+ S SYM IVVEKI + D +PL L LE DIG T +M Sbjct: 560 ALQHIGSFVDKYPGSVESQSYMHIVVEKIALMFSPHDEVLPLMLKLEMAVDIGRTGRSYM 619 Query: 806 LRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLPWFLNTGGFDDVALRFSI 630 L++ G+E+ I NL E V GNSK V+I+ +L+CYS K+LPWF G F++V LRF++ Sbjct: 620 LKIVGGIEETIFYNLSEVYVYGNSKSVEIVLSLLDCYSTKILPWFDEAGDFEEVILRFAL 679 Query: 629 NMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSNIFL 450 N+W+QIE F + + LL+ MM ++L+V CS+++Q +IVQKAF VL ++ F Sbjct: 680 NIWDQIEKCSTFSTSMDKCIQVLLDATMMALKLSVRSCSKESQNIIVQKAFNVLLTSSFS 739 Query: 449 PLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQNVRVILRFF 270 PLK ++ ++P+++E LQ Q+ + RDEW +SLFASV +ALRPQ + +VR+I+R Sbjct: 740 PLKVTLSNTIPVQMEGLQFLQQKDNPTSRDEWILSLFASVTIALRPQVHVPDVRLIIRLL 799 Query: 269 ATTLLKGHVPSAQALGSILNKL--PSNNMGNLNSCTVEDALSIIF--EIGLFGNIPSWKF 102 + +G VP+AQALGS++NKL S+ + + ++E+A+ IIF E N + Sbjct: 800 MLSTTRGCVPAAQALGSMINKLSVKSDKVEVSSYVSLEEAIDIIFKTEFRCLHNESTG-- 857 Query: 101 HPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 D + + L +LC S KS+L+Q ++++GL+W Sbjct: 858 ---DGSEMFLTDLCSSIEKSSLLQVHAVVGLSW 887 >ref|XP_004236399.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum lycopersicum] Length = 1153 Score = 324 bits (831), Expect = 5e-86 Identities = 187/448 (41%), Positives = 273/448 (60%), Gaps = 1/448 (0%) Frame = -1 Query: 1343 NFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILATTTK 1164 NFGALYLC+ELL ACR LV+ + + + + C +L FS L F ++ + Sbjct: 459 NFGALYLCVELLAACRQLVVSSDEVASAHDLARDSWCQILHSFSTSLCNVFFCLIRASCV 518 Query: 1163 QDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKHTLKA 984 + A+VY VKGL+ILATFP F+ +SK ++E IL+ SII + LWK LKA Sbjct: 519 ESTRNAYVYAAVKGLEILATFPGSFISVSKLMYENILLTLTSIIESEFNKKFLWKAALKA 578 Query: 983 LMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCLKFML 804 L++I +F+ K+ + E S+ IV +KIVSLI DD MP L LEA+ 
DIG T FML Sbjct: 579 LVEISLFVNKYHEDEKAASFNSIVKQKIVSLISSDDLNMPQSLKLEAVFDIGLTGKNFML 638 Query: 803 RVNQGLEDAISANLLEASVEGNSKV-DILGPVLECYSNKVLPWFLNTGGFDDVALRFSIN 627 V LE ISANL E V G+ ++ + +LECYSNKVLPWF GG D+V+L F++N Sbjct: 639 SVVSELEKTISANLSEILVHGDRRLAGLTAGLLECYSNKVLPWFHVNGGADEVSLSFAVN 698 Query: 626 MWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSNIFLP 447 ++ ++E + + +GK ELL M M+ A+ CS ++Q ++QKA V+ +N F Sbjct: 699 IFTKMEHNTSLSLEAEGK--ELLGATMAAMKQAMTCCSVESQEKVLQKAIDVMETNSFF- 755 Query: 446 LKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQNVRVILRFFA 267 +++ L ++ QL Q SC+DEW ISLFASV++ALRPQT + N+R++L+ A Sbjct: 756 FSNNLILGTDLFNKKTQLGQTSEGLSCQDEWIISLFASVVIALRPQTQIPNIRLLLQLLA 815 Query: 266 TTLLKGHVPSAQALGSILNKLPSNNMGNLNSCTVEDALSIIFEIGLFGNIPSWKFHPVDS 87 TLL+GH+PSAQALGS++NKLP N C++++ + ++ + L+ NI K Sbjct: 816 MTLLEGHIPSAQALGSLVNKLPLNIS---EDCSLKELIDMLLKNVLWRNISIGK-EGNHG 871 Query: 86 NNVALINLCDSEGKSNLVQSNSIIGLAW 3 + VA+ NL +S+ + S+++IGLAW Sbjct: 872 DAVAMSNL-----RSSSLNSHAVIGLAW 894 >ref|XP_006595125.1| PREDICTED: DNA repair/transcription protein mms19-like isoform X2 [Glycine max] Length = 1013 Score = 313 bits (801), Expect = 2e-82 Identities = 191/455 (41%), Positives = 269/455 (59%), Gaps = 4/455 (0%) Frame = -1 Query: 1355 SEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILA 1176 S+ FG LY+CIELL CR+L++G + + Q V E C +L RFS L AF S+LA Sbjct: 319 SQRVKFGFLYVCIELLAGCRELIVGSDEPALQYVFEHETCCTMLHRFSTPLFNAFGSVLA 378 Query: 1175 TTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKH 996 + + Y GVKGLQILA F I KSVFE IL FMSII T+LW+ Sbjct: 379 VSADRCPLDPDTYIGVKGLQILAMFGSDVFPIQKSVFENILKKFMSIIVEDFNKTILWEA 438 Query: 995 TLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCL 816 LKAL Q+G F++KF +SE SY +VVEKIV ++ LDD +P L LEA+S+IG T + Sbjct: 439 ALKALYQVGSFVQKFHESEKAMSYRNLVVEKIVEILSLDDITLPFSLELEALSNIGMTGM 498 Query: 815 KFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLPWFLNTGGFDDVALR 639 K ML + QGL A+ +NL + V N + DI +LECYS ++LPW GG +D ++ Sbjct: 499 
KNMLTILQGLGRAVFSNLSKVHVHRNLRSSDIAVQLLECYSCQLLPWIHENGGSEDFVMQ 558 Query: 638 FSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSN 459 F +++W+Q + D + LL+ +M M+L+VG C+ ++Q LI+QKA+ VLSS+ Sbjct: 559 FVVDIWSQ--AGNCMDFSTLFEEKGLLDAIMKAMKLSVGSCAVESQNLIIQKAYCVLSSH 616 Query: 458 IFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQNVRVIL 279 +F ++E L LT + S RDE ISLFASV++A+ P+T + N RV++ Sbjct: 617 --------TNFQQLKEVERLPLTPGNYNISLRDEGLISLFASVVIAVFPKTYIPNKRVLM 668 Query: 278 RFFATTLLKGH-VPSAQALGSILNKL--PSNNMGNLNSCTVEDALSIIFEIGLFGNIPSW 108 F TLL+G VP AQALGSILNKL SN+ N + T+E+AL +IF + + S Sbjct: 669 HLFIITLLRGGVVPVAQALGSILNKLVSTSNSAENSSDLTLEEALDVIFNTKI--SFSST 726 Query: 107 KFHPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + N + L ++C ++Q N+I GL+W Sbjct: 727 DNGRSNGNEMVLTDICLGIANDRMLQINAICGLSW 761 >ref|XP_006595124.1| PREDICTED: DNA repair/transcription protein mms19-like isoform X1 [Glycine max] Length = 1132 Score = 313 bits (801), Expect = 2e-82 Identities = 191/455 (41%), Positives = 269/455 (59%), Gaps = 4/455 (0%) Frame = -1 Query: 1355 SEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILA 1176 S+ FG LY+CIELL CR+L++G + + Q V E C +L RFS L AF S+LA Sbjct: 438 SQRVKFGFLYVCIELLAGCRELIVGSDEPALQYVFEHETCCTMLHRFSTPLFNAFGSVLA 497 Query: 1175 TTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKH 996 + + Y GVKGLQILA F I KSVFE IL FMSII T+LW+ Sbjct: 498 VSADRCPLDPDTYIGVKGLQILAMFGSDVFPIQKSVFENILKKFMSIIVEDFNKTILWEA 557 Query: 995 TLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCL 816 LKAL Q+G F++KF +SE SY +VVEKIV ++ LDD +P L LEA+S+IG T + Sbjct: 558 ALKALYQVGSFVQKFHESEKAMSYRNLVVEKIVEILSLDDITLPFSLELEALSNIGMTGM 617 Query: 815 KFMLRVNQGLEDAISANLLEASVEGNSK-VDILGPVLECYSNKVLPWFLNTGGFDDVALR 639 K ML + QGL A+ +NL + V N + DI +LECYS ++LPW GG +D ++ Sbjct: 618 KNMLTILQGLGRAVFSNLSKVHVHRNLRSSDIAVQLLECYSCQLLPWIHENGGSEDFVMQ 677 Query: 638 FSINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSN 459 F +++W+Q + D + LL+ +M M+L+VG C+ ++Q LI+QKA+ VLSS+ Sbjct: 678 
FVVDIWSQ--AGNCMDFSTLFEEKGLLDAIMKAMKLSVGSCAVESQNLIIQKAYCVLSSH 735 Query: 458 IFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQNVRVIL 279 +F ++E L LT + S RDE ISLFASV++A+ P+T + N RV++ Sbjct: 736 --------TNFQQLKEVERLPLTPGNYNISLRDEGLISLFASVVIAVFPKTYIPNKRVLM 787 Query: 278 RFFATTLLKGH-VPSAQALGSILNKL--PSNNMGNLNSCTVEDALSIIFEIGLFGNIPSW 108 F TLL+G VP AQALGSILNKL SN+ N + T+E+AL +IF + + S Sbjct: 788 HLFIITLLRGGVVPVAQALGSILNKLVSTSNSAENSSDLTLEEALDVIFNTKI--SFSST 845 Query: 107 KFHPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 + N + L ++C ++Q N+I GL+W Sbjct: 846 DNGRSNGNEMVLTDICLGIANDRMLQINAICGLSW 880 >ref|XP_006343144.1| PREDICTED: MMS19 nucleotide excision repair protein homolog [Solanum tuberosum] Length = 1170 Score = 311 bits (797), Expect = 4e-82 Identities = 188/478 (39%), Positives = 270/478 (56%), Gaps = 31/478 (6%) Frame = -1 Query: 1343 NFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILATTTK 1164 NFGALYLC+ELL ACR LV+ + + + + C +L+ F L F ++ + Sbjct: 446 NFGALYLCVELLAACRQLVVSSDEVASAHDLARDSWCQILRSFCTSLCNVFFCLIRASCV 505 Query: 1163 QDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKHTLKA 984 + A+VY VKGL+IL TFP F+ +SK ++E IL+ SII + LWK LKA Sbjct: 506 ESTWNAYVYAAVKGLEILGTFPGSFISVSKLMYENILLTLTSIIESDFNKKFLWKAALKA 565 Query: 983 LMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCLKFML 804 L++I +F+ K+ + E + IV +KIVSLI DD MP L LEA+ DIG T FM Sbjct: 566 LVEISLFVNKYHEDEKAAIFNSIVKQKIVSLISSDDLNMPQSLKLEAIFDIGLTGKSFMH 625 Query: 803 RVNQGLEDAISANLLEA------------------------------SVEGNSKVDILGP 714 V LE ISANL E V G+ ++ L P Sbjct: 626 SVVSELEKTISANLSEILVRVLIETSRLLLTYHMHRLFNFGALFLLLQVHGDRRLAGLTP 685 Query: 713 -VLECYSNKVLPWFLNTGGFDDVALRFSINMWNQIESSKLFDMGIQGKVNELLNKMMMTM 537 +LECYSNKVLPWF GG D+V+L F+IN++ ++E++ + +GK ELL M M Sbjct: 686 GLLECYSNKVLPWFHGNGGADEVSLSFAINIFTKMENNSSLSLEAKGK--ELLGATMAAM 743 Query: 536 RLAVGGCSEDNQVLIVQKAFGVLSSNIFLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDE 357 + A+ GCS ++Q ++QKA V+ ++ F L + + L ++ QL Q SCRDE Sbjct: 744 
KQAMTGCSVESQEKVLQKAIDVMETSSFF-LSNDLILGTDLFNKKTQLGQTSEGLSCRDE 802 Query: 356 WAISLFASVIVALRPQTPLQNVRVILRFFATTLLKGHVPSAQALGSILNKLPSNNMGNLN 177 W SLFASV++ALRPQT + N+R++L+ A TLL+GH+PSAQALGS++NKLP N Sbjct: 803 WITSLFASVVIALRPQTQIPNIRLLLQLLAMTLLEGHIPSAQALGSLVNKLPLNIS---E 859 Query: 176 SCTVEDALSIIFEIGLFGNIPSWKFHPVDSNNVALINLCDSEGKSNLVQSNSIIGLAW 3 C++E+ + +F+ ++ NI K D VA+ NL + N + S+++IG AW Sbjct: 860 DCSLEELIDTLFKNVMWRNISIGK-EGNDGGAVAMSNL-----RLNSLNSHAVIGFAW 911 >gb|EYU21515.1| hypothetical protein MIMGU_mgv1a000493mg [Mimulus guttatus] Length = 1120 Score = 310 bits (794), Expect = 1e-81 Identities = 192/463 (41%), Positives = 274/463 (59%), Gaps = 14/463 (3%) Frame = -1 Query: 1349 EPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILATT 1170 E FGA+YLC ELL A R L L + + + + +L FS L AF ++L + Sbjct: 432 ECKFGAIYLCTELLAASRYLTLSLDNCTLDPDFSRQTWHVMLSNFSKSLEKAFIALLRSN 491 Query: 1169 TKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKHTL 990 + A+VY GVKGLQILATFP FL +SKS+++ IL+ +SI+T+ T LW L Sbjct: 492 VADNAESAYVYFGVKGLQILATFPESFLPVSKSIYDDILLELVSIVTSSGSKTFLWTLAL 551 Query: 989 KALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCLKF 810 KAL++IG FI K S S+ IVVEKIVSLI DDSA+PL L L+A+ +IG T Sbjct: 552 KALVEIGFFINKCPGSGKAASFESIVVEKIVSLISSDDSALPLSLKLQAVFEIGETRKDI 611 Query: 809 MLRVNQGLEDAISANLLEASVEGN-SKVDILGPVLECYSNKVLPWFLNTGGFDDVALRFS 633 MLRV Q L++AIS E + GN +++ +L+ Y+ KVLPWFL GG +++ L F+ Sbjct: 612 MLRVVQALDEAISTKFSEVNDHGNHESYNMIVKLLDTYTQKVLPWFLEIGGSEEIPLNFA 671 Query: 632 INMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSNIF 453 + +W+++E+S+ ++ + +L M M+ AVG CS++NQ +I+ KAFG+L S Sbjct: 672 LGIWDKMETSRFLNVNPLQIASGVLGATMTAMKSAVGSCSKENQEIIISKAFGILFS--- 728 Query: 452 LPLKDSMDFSVP--------LKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQ 297 S DF P +K +ELQ T + RD+W SLFASV++ALRPQT + Sbjct: 729 -----STDFGSPGFKSGNDIVKEDELQQT---NNNVGRDKWLTSLFASVVIALRPQTIIP 780 Query: 296 NVRVILRFFATTLLKGHVPSAQALGSILNKLP--SNNMGNLNSCTVEDALSIIF-EIGLF 126 N +++L+ F T+LL GHVPSA ALGS++NKLP N M + S T+ 
+A+ IIF + Sbjct: 781 NGKMVLQLFITSLLNGHVPSAHALGSLVNKLPLEINGMDSSTSFTLNEAMDIIFHSFNIL 840 Query: 125 GNIPSWKFHPVDSNNVALINLCDSEGKSNLVQS--NSIIGLAW 3 GN S +D ++ L L +QS N+++GLAW Sbjct: 841 GNDGS----GIDFGSLRLNTL--------RIQSAINTVVGLAW 871 >ref|XP_007150605.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris] gi|561023869|gb|ESW22599.1| hypothetical protein PHAVU_005G166100g [Phaseolus vulgaris] Length = 1145 Score = 306 bits (785), Expect = 1e-80 Identities = 189/456 (41%), Positives = 266/456 (58%), Gaps = 5/456 (1%) Frame = -1 Query: 1355 SEEPNFGALYLCIELLVACRDLVLGFEGGSQQQVITVEDTCFLLQRFSGQLTGAFCSILA 1176 S+ G LYLCIELLV R+L++G + + Q VI E C +L FS L AF +LA Sbjct: 444 SQRVKIGFLYLCIELLVGFRELIVGSKEPALQYVIEHETCCTMLHSFSSSLFNAFGLVLA 503 Query: 1175 TTTKQDICKAHVYCGVKGLQILATFPRCFLRISKSVFETILIIFMSIITAGLETTLLWKH 996 + + Y GVKGLQILA F + KS+FE IL FMSII +LW+ Sbjct: 504 ESADRCPLDPDTYIGVKGLQILAMFHSDVFSMQKSIFENILKKFMSIIIEDFNKKILWEA 563 Query: 995 TLKALMQIGIFIEKFSDSEGETSYMIIVVEKIVSLIFLDDSAMPLPLLLEAMSDIGTTCL 816 LKAL +G F+++F +SE SY +VVEKIV +FLDD +P L +EA+S+IG T + Sbjct: 564 ALKALCHVGSFVQEFHESEKAMSYGSLVVEKIVEFLFLDDIIVPFSLKVEALSNIGMTGM 623 Query: 815 KFMLRVNQGLEDAISANLLEASVEGNSKVDILGPVLECYSNKVLPWFLNTGGFDDVALRF 636 K ML QG+ A+ ANL + + S +I +LECYS K+LPW GG +D AL+F Sbjct: 624 KNMLTSLQGMRKAVFANLSKVHTDLRSS-EIAVQLLECYSCKLLPWTHENGGSEDFALQF 682 Query: 635 SINMWNQIESSKLFDMGIQGKVNELLNKMMMTMRLAVGGCSEDNQVLIVQKAFGVLSSNI 456 ++++W+Q + + + K LL +M M+L+VG CS ++Q LI+QKA+ +LSS Sbjct: 683 AVDIWSQAGNCMVSSTSFEEK--GLLYALMKAMKLSVGICSVESQNLIIQKAYSILSSRT 740 Query: 455 FLPLKDSMDFSVPLKLEELQLTQEFHTFSCRDEWAISLFASVIVALRPQTPLQNVRVILR 276 LK+ LE L L+ + S DEW ISLFASV++A+ P+T + N+RV++ Sbjct: 741 NFQLKE---------LERLPLSPGKYNISLTDEWIISLFASVVIAVCPKTLIPNIRVLVN 791 Query: 275 FFATTLLKGHVPSAQALGSILNKL--PSNNMGNLNSCTVEDALSIIFEIGL-FGNIPSWK 105 F TLL+G VP AQALGS+LNKL SN+ N + T+E+AL IF + F +I + Sbjct: 792 LFIVTLLRGIVPVAQALGSLLNKLVSTSNSAENSSDITLEEALDAIFNTKIWFSSIDILQ 851 Query: 104 FHPVDSN--NVALINLCDSEGKSNLVQSNSIIGLAW 3 SN + L ++C 
L+Q N+I GL+W Sbjct: 852 RCNGTSNGKEIVLTDICLGFANDKLLQINAICGLSW 887
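The per-hit statistics in the records above (Score, Expect, Identities, Positives, Gaps, Frame) can be pulled out of this flattened text with a regular expression. The following is a minimal sketch, not part of BLAST itself: the names HSP_STATS and parse_hsp_stats are hypothetical, the pattern is written only against the field layout seen in this report, and it assumes that an Expect value printed without a mantissa (such as e-121) means 1e-121.

```python
import re

# Hypothetical pattern, written against the statistics layout seen in this report.
# The Gaps field is treated as optional, since an ungapped alignment may omit it.
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?"
    r"\s+Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsp_stats(text):
    """Return one dict per statistics block found in the report text."""
    results = []
    for m in HSP_STATS.finditer(text):
        evalue = m.group("evalue")
        # Assumption: a bare exponent such as "e-121" stands for "1e-121".
        if evalue.startswith(("e", "E")):
            evalue = "1" + evalue
        length = int(m.group("length"))
        results.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(evalue),
            "identities": int(m.group("ident")),
            "positives": int(m.group("pos")),
            "gaps": int(m.group("gaps") or 0),
            "align_length": length,
            # Unrounded fraction; the percentages printed in the report are whole numbers.
            "identity": int(m.group("ident")) / length,
            "frame": int(m.group("frame")),
        })
    return results


# Statistics block copied from the Glycine max hit (XP_006595125.1) above.
sample = (
    "Score = 313 bits (801), Expect = 2e-82 "
    "Identities = 191/455 (41%), Positives = 269/455 (59%), "
    "Gaps = 4/455 (0%) Frame = -1"
)

for hsp in parse_hsp_stats(sample):
    print(hsp)
```

Because the pattern tolerates any run of whitespace between fields, it should work on both the flattened text shown here and the original multi-line layout. For anything beyond quick inspection, re-running the search with a tabular or XML output format is likely more robust than parsing the pairwise report.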