BLASTX nr result
ID: Akebia27_contig00016351
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00016351 (1196 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CBI15085.3| unnamed protein product [Vitis vinifera]              342   2e-91
ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vini...   342   2e-91
ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citr...   339   1e-90
ref|XP_007036109.1| DNA glycosylase superfamily protein isoform ...   337   5e-90
ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Cit...   337   6e-90
ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Cit...   337   6e-90
ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis...   335   3e-89
ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Frag...   325   3e-86
ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Popu...   322   2e-85
gb|EXB42063.1| Protein ROS1 [Morus notabilis]                         320   6e-85
ref|XP_007036108.1| DNA glycosylase superfamily protein isoform ...   318   4e-84
ref|XP_007036110.1| DNA glycosylase superfamily protein isoform ...   317   8e-84
ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Cit...   316   1e-83
ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp....   316   1e-83
ref|XP_007155390.1| hypothetical protein PHAVU_003G197200g [Phas...   315   2e-83
ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802...   315   2e-83
ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago...   315   2e-83
ref|XP_006291571.1| hypothetical protein CARUB_v10017731mg [Caps...   313   1e-82
ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutr...
312 2e-82 ref|XP_006836744.1| hypothetical protein AMTR_s00088p00146000 [A... 312 2e-82 >emb|CBI15085.3| unnamed protein product [Vitis vinifera] Length = 310 Score = 342 bits (877), Expect = 2e-91 Identities = 177/289 (61%), Positives = 209/289 (72%), Gaps = 11/289 (3%) Frame = -2 Query: 835 MQRNRKRKNLHSIS-----KPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQE 671 MQR+RKRK S S + PRPT EC++VRD LL LHGFPQ Sbjct: 1 MQRSRKRKQEESSSCSKESATKSARNDVVVDPYPSHPRPTPVECRAVRDDLLALHGFPQR 60 Query: 670 FAKYRRTDY------LNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNT 509 F KYR+ +P G P +L S+ + QKESVLDGL+SI+LSQNT Sbjct: 61 FEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIILSQNT 120 Query: 508 TDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGK 329 TD NS RAFASLKS FPTW+DVLAA+ K +EN+I+CGGLAVTKASCIK +L+ LLE+KGK Sbjct: 121 TDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGK 180 Query: 328 LCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPA 149 LCLEYLR ++ DEIK EL FKGIGPKTVACVLMFHLQ+DDFPVDTHV +I KA+GWVPA Sbjct: 181 LCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIGWVPA 240 Query: 148 SSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVS 2 +DR+K+YLHLN+RIP++LKFDLNCLL THGKLC CT+K NQ+ K S Sbjct: 241 VADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHECTQKGANQKRKES 289 >ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vinifera] Length = 310 Score = 342 bits (877), Expect = 2e-91 Identities = 177/289 (61%), Positives = 209/289 (72%), Gaps = 11/289 (3%) Frame = -2 Query: 835 MQRNRKRKNLHSIS-----KPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQE 671 MQR+RKRK S S + PRPT EC++VRD LL LHGFPQ Sbjct: 1 MQRSRKRKQEESSSCSKESATKSARNDVVVDPYPSHPRPTPVECRAVRDDLLALHGFPQR 60 Query: 670 FAKYRRTDY------LNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNT 509 F KYR+ +P G P +L S+ + QKESVLDGL+SI+LSQNT Sbjct: 61 FEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIILSQNT 120 Query: 508 TDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGK 329 TD NS RAFASLKS FPTW+DVLAA+ K +EN+I+CGGLAVTKASCIK +L+ LLE+KGK Sbjct: 121 
TDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGK 180 Query: 328 LCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPA 149 LCLEYLR ++ DEIK EL FKGIGPKTVACVLMFHLQ+DDFPVDTHV +I KA+GWVPA Sbjct: 181 LCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIGWVPA 240 Query: 148 SSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVS 2 +DR+K+YLHLN+RIP++LKFDLNCLL THGKLC CT+K NQ+ K S Sbjct: 241 VADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHECTQKGANQKRKES 289 >ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citrus clementina] gi|557542005|gb|ESR52983.1| hypothetical protein CICLE_v10021561mg [Citrus clementina] Length = 281 Score = 339 bits (870), Expect = 1e-90 Identities = 172/278 (61%), Positives = 203/278 (73%) Frame = -2 Query: 835 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 656 MQ++RKRK Q++ RPT++EC+ +RD LL LHGFP EF KYR Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 655 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 476 + S P + S E ++ESVLDGL+ LLSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDM------SEYDEGEEESVLDGLVKTLLSQNTTEANSLKAFAS 106 Query: 475 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 296 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 295 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 116 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 115 NKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVS 2 N+RIP +LKFDLNCLL THGKLC+ C KK GN+Q K S Sbjct: 227 NQRIPKELKFDLNCLLYTHGKLCRNCIKKGGNRQRKES 264 >ref|XP_007036109.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] gi|508773354|gb|EOY20610.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] Length = 292 Score = 337 bits (865), Expect = 5e-90 Identities = 172/275 (62%), Positives = 204/275 (74%) Frame = -2 Query: 838 
KMQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 659 KMQ++RKRK L I RPT EC+SVRD LL LHGFP EF KY Sbjct: 2 KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60 Query: 658 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 479 R + +EP+ KSEP + + + +ESVLDGL+ +LSQNTT+ NS +AFA Sbjct: 61 RHQRLIK-------TEPTIDAKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113 Query: 478 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 299 SLKS FPTWEDVLAAE K +EN+I+CGGLA KASCIKN+L L E+KGKLC EYLR +S Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173 Query: 298 ADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLH 119 DEIKAEL FKG+GPKTVACVLMF+LQQDDFPVDTHVF I +A+GWVPA++DR+K+YLH Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLH 233 Query: 118 LNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQ 14 LN+RIPNKLKFDLNCLL THGKLC++CT K +QQ Sbjct: 234 LNRRIPNKLKFDLNCLLYTHGKLCRKCTMKGSSQQ 268 >ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Citrus sinensis] Length = 278 Score = 337 bits (864), Expect = 6e-90 Identities = 170/278 (61%), Positives = 204/278 (73%) Frame = -2 Query: 835 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 656 MQ++RKRK Q++ RPT++EC+ +RD LL LHGFP EF KYR Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 655 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 476 + S P + + + E ++ESVLDGL+ +LSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106 Query: 475 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 296 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 295 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 116 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 115 NKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVS 2 N+RIP 
+LKFDLNCLL THGKLC+ C KK GN+Q K S Sbjct: 227 NQRIPKELKFDLNCLLYTHGKLCRNCIKKGGNRQRKES 264 >ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Citrus sinensis] Length = 281 Score = 337 bits (864), Expect = 6e-90 Identities = 170/278 (61%), Positives = 204/278 (73%) Frame = -2 Query: 835 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 656 MQ++RKRK Q++ RPT++EC+ +RD LL LHGFP EF KYR Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 655 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 476 + S P + + + E ++ESVLDGL+ +LSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106 Query: 475 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 296 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 295 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 116 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 115 NKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVS 2 N+RIP +LKFDLNCLL THGKLC+ C KK GN+Q K S Sbjct: 227 NQRIPKELKFDLNCLLYTHGKLCRNCIKKGGNRQRKES 264 >ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis] gi|223550571|gb|EEF52058.1| Endonuclease III, putative [Ricinus communis] Length = 291 Score = 335 bits (858), Expect = 3e-89 Identities = 170/280 (60%), Positives = 204/280 (72%), Gaps = 2/280 (0%) Frame = -2 Query: 835 MQRNRKRK--NLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAK 662 MQ+NRKRK + + +K PRPT +EC +RDSLL HGFPQEFAK Sbjct: 1 MQKNRKRKLKSAETETKSAKINNGNKEEPYPTHPRPTPEECLCIRDSLLAFHGFPQEFAK 60 Query: 661 YRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAF 482 YR+ D ++ S + + +E+VLDGL+ +LSQNTT+ NS RAF Sbjct: 61 YRKQRLGGDD------------DNKSSDVNSDTAEETVLDGLVKTVLSQNTTEVNSQRAF 108 Query: 481 ASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGM 302 +LKS+FPTW+DVLAAE 
K++EN+I+CGGLA KASCIKN+L LLEKKGK+CLEYLR M Sbjct: 109 DNLKSDFPTWQDVLAAEPKWIENAIRCGGLAPAKASCIKNILNCLLEKKGKICLEYLRDM 168 Query: 301 SADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYL 122 S DEIKAEL FKG+GPKTVACVLMFHLQQ+DFPVDTHVF I KA+GWVP +DR K+YL Sbjct: 169 SVDEIKAELSQFKGVGPKTVACVLMFHLQQEDFPVDTHVFEIAKALGWVPEVADRNKTYL 228 Query: 121 HLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVS 2 HLN+RIPN+LKFDLNCLL THGKLC++C KK GNQ K S Sbjct: 229 HLNQRIPNELKFDLNCLLYTHGKLCRKCIKKRGNQSRKES 268 >ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Fragaria vesca subsp. vesca] Length = 286 Score = 325 bits (832), Expect = 3e-86 Identities = 173/282 (61%), Positives = 206/282 (73%), Gaps = 4/282 (1%) Frame = -2 Query: 835 MQRNRKRKN-LHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 659 M +NRKRK + P++ RPT +EC SVRD LL LHGFP+EFAKY Sbjct: 1 MPKNRKRKEQAEADHNPKLPTKTTPKDPYPNHARPTREECVSVRDDLLALHGFPKEFAKY 60 Query: 658 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 479 R + +SNG + V SEP +KESVLDGL+ LLSQNTT++NS +AFA Sbjct: 61 REQRLSSQ--ASNGHDND--VSSEPLD-----EKESVLDGLVRTLLSQNTTESNSLKAFA 111 Query: 478 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 299 SLKS FPTWE+VLAA+ + +E++I+CGGLA TKASCIKN+L+ LLEKK KLCLEYLR +S Sbjct: 112 SLKSAFPTWEEVLAADSQSLESAIRCGGLAKTKASCIKNMLSCLLEKKEKLCLEYLRDLS 171 Query: 298 ADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLH 119 DEIKAEL FKGIGPKTVACVLMF LQQDDFPVDTHV+ I KAM WVP +DR K+YLH Sbjct: 172 VDEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHVYEIAKAMAWVPVGADRNKTYLH 231 Query: 118 LNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGN---QQNKVS 2 LN+ IP++LKFDLNCLL THGKLC++C KK G+ QQ K S Sbjct: 232 LNQWIPDELKFDLNCLLYTHGKLCRKCIKKGGSTGKQQEKES 273 >ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa] gi|550322300|gb|EEF05691.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa] Length = 306 Score = 322 bits (825), Expect = 2e-85 Identities = 169/293 (57%), Positives = 208/293 (70%), Gaps = 17/293 (5%) Frame = -2 Query: 835 
MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXP-------RPTSQECQSVRDSLLTLHGFP 677 MQ KRK H + KP+ RPT +EC+++RDSLL HGFP Sbjct: 1 MQTGHKRKQQHEL-KPRTNKKSAETISNIKEEEPFPTHARPTPEECRAIRDSLLAFHGFP 59 Query: 676 QEFAKYRRT-DYL--------NPDYSSN-GSEPSQLVKSEPSSTSQEPQKESVLDGLISI 527 QEFAKYR+ YL +P +N + +VK E +E ++ESVLDGL+ Sbjct: 60 QEFAKYRKQRPYLITLQDKEESPHLINNCDGKNDNVVKVEEE---EEEEEESVLDGLVKT 116 Query: 526 LLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRL 347 +LSQNTT+ NS RAF +LKS FPTWE+VLAAE KF+E++I+CGGLA TKA+CI+N+L+ L Sbjct: 117 VLSQNTTEVNSQRAFLNLKSAFPTWENVLAAESKFIEDAIRCGGLAPTKAACIRNILSSL 176 Query: 346 LEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKA 167 +EK G+LCLEYLR + EIKAEL FKGIGPKTVACVLMF+LQ+DDFPVDTHVF I KA Sbjct: 177 MEKNGRLCLEYLRDLPVAEIKAELSHFKGIGPKTVACVLMFNLQKDDFPVDTHVFEIAKA 236 Query: 166 MGWVPASSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNK 8 +GWVP +DR K+YLHLN RIP +LKFDLNCLL THGKLC++CTKK G+QQ K Sbjct: 237 IGWVPPVADRNKTYLHLNHRIPKELKFDLNCLLYTHGKLCRKCTKKSGSQQRK 289 >gb|EXB42063.1| Protein ROS1 [Morus notabilis] Length = 308 Score = 320 bits (821), Expect = 6e-85 Identities = 161/245 (65%), Positives = 185/245 (75%) Frame = -2 Query: 736 PTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQK 557 PT +C++VRD LL LHGFPQEFAKYRR + NG E K Sbjct: 68 PTPDQCRAVRDDLLALHGFPQEFAKYRR----QKPTTDNGEESES--------------K 109 Query: 556 ESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKA 377 ESVLDGL+ +LSQNTT+ANS RAFASLKS FPTWE VL A+ K +E++I+CGGLA KA Sbjct: 110 ESVLDGLVMTVLSQNTTEANSQRAFASLKSAFPTWEQVLNADSKCIEDAIRCGGLAPKKA 169 Query: 376 SCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPV 197 SCIKN L LLE+KGKLCLEYL S DE+KAEL FKGIGPKTVACVLMFHLQQDDFPV Sbjct: 170 SCIKNTLRSLLERKGKLCLEYLLDFSVDEVKAELSCFKGIGPKTVACVLMFHLQQDDFPV 229 Query: 196 DTHVFRITKAMGWVPASSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQ 17 DTHVF I KA+GW+PA +DR K+YLHLN+RIPN+LKFDLNCLL THGK+C++C KK G+Q Sbjct: 230 DTHVFEIAKALGWLPAGADRNKAYLHLNQRIPNELKFDLNCLLYTHGKMCRKCIKKGGSQ 289 Query: 16 QNKVS 2 K S Sbjct: 290 IKKGS 
294 >ref|XP_007036108.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508773353|gb|EOY20609.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 446 Score = 318 bits (814), Expect = 4e-84 Identities = 172/299 (57%), Positives = 202/299 (67%), Gaps = 24/299 (8%) Frame = -2 Query: 838 KMQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 659 KMQ++RKRK L I RPT EC+SVRD LL LHGFP EF KY Sbjct: 2 KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60 Query: 658 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 479 R + +EP+ KSEP + + + +ESVLDGL+ +LSQNTT+ NS +AFA Sbjct: 61 RHQRLIK-------TEPTIDAKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113 Query: 478 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 299 SLKS FPTWEDVLAAE K +EN+I+CGGLA KASCIKN+L L E+KGKLC EYLR +S Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173 Query: 298 ADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLH 119 DEIKAEL FKG+GPKTVACVLMF+LQQDDFPVDTHVF I +A+GWVPA++DR+K+YLH Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLH 233 Query: 118 LNKRIPNKLKFDLNCLLVTH--------GKL----------------CQRCTKKWGNQQ 14 LN+RIPNKLKFDLNCLL TH GK CQ C KK+ N Q Sbjct: 234 LNRRIPNKLKFDLNCLLYTHDGQGTVEAGKTVKEKSVTRKLEKRKYECQFCLKKFTNSQ 292 >ref|XP_007036110.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] gi|508773355|gb|EOY20611.1| DNA glycosylase superfamily protein isoform 3 [Theobroma cacao] Length = 264 Score = 317 bits (811), Expect = 8e-84 Identities = 163/260 (62%), Positives = 192/260 (73%) Frame = -2 Query: 838 KMQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKY 659 KMQ++RKRK L I RPT EC+SVRD LL LHGFP EF KY Sbjct: 2 KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60 Query: 658 RRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 479 R + +EP+ KSEP + + + +ESVLDGL+ +LSQNTT+ NS +AFA Sbjct: 61 RHQRLIK-------TEPTIDAKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113 
Query: 478 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 299 SLKS FPTWEDVLAAE K +EN+I+CGGLA KASCIKN+L L E+KGKLC EYLR +S Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173 Query: 298 ADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLH 119 DEIKAEL FKG+GPKTVACVLMF+LQQDDFPVDTHVF I +A+GWVPA++DR+K+YLH Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLH 233 Query: 118 LNKRIPNKLKFDLNCLLVTH 59 LN+RIPNKLKFDLNCLL TH Sbjct: 234 LNRRIPNKLKFDLNCLLYTH 253 >ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Citrus sinensis] Length = 258 Score = 316 bits (810), Expect = 1e-83 Identities = 163/272 (59%), Positives = 196/272 (72%) Frame = -2 Query: 835 MQRNRKRKNLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEFAKYR 656 MQ++RKRK Q++ RPT++EC+ +RD LL LHGFP EF KYR Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 655 RTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 476 + S P + + + E ++ESVLDGL+ +LSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106 Query: 475 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 296 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 295 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 116 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 115 NKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGN 20 N+RIP +LKFDLNCLL THG + R K GN Sbjct: 227 NQRIPKELKFDLNCLLYTHGNILPRA--KEGN 256 >ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297321706|gb|EFH52127.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 294 Score = 316 bits (809), Expect = 1e-83 Identities = 160/272 (58%), Positives = 199/272 (73%), Gaps = 4/272 (1%) Frame = -2 Query: 835 MQRNRKRKNLHS----ISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 668 M + +KRK L+ P IK RPT++EC+ VRD+LL+LHGFP EF Sbjct: 1 MSKAQKRKRLNQDDGESKTPAIKSTVDGSNPYPTLLRPTAEECREVRDALLSLHGFPPEF 60 Query: 667 AKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTR 488 A YRR L + +G + +KSEP ++E ESVLDGL+ ILLSQNTT++NS R Sbjct: 61 ANYRR-QRLRSLSAVDGHDTQCTMKSEPLDEAEE---ESVLDGLVKILLSQNTTESNSQR 116 Query: 487 AFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLR 308 AFASLK+ FP WEDVLAAE K +E++I+CGGLA KA CIKN+L RL ++G LCLEYLR Sbjct: 117 AFASLKAAFPNWEDVLAAESKSIESAIRCGGLAPKKAVCIKNILNRLQTERGVLCLEYLR 176 Query: 307 GMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKS 128 G+S +E+K EL FKGIGPKTV+CVLMF+LQ +DFPVDTHVF I KA+GWVP ++DR K+ Sbjct: 177 GLSVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKT 236 Query: 127 YLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTK 32 Y+HLN+RIP++LKFDLNCLL THGKLC C K Sbjct: 237 YVHLNRRIPDELKFDLNCLLYTHGKLCSNCKK 268 >ref|XP_007155390.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris] gi|561028744|gb|ESW27384.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris] Length = 282 Score = 315 bits (807), Expect = 2e-83 Identities = 155/244 (63%), Positives = 183/244 (75%) Frame = -2 Query: 739 RPTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQ 560 RPT +EC++VRD+LL LHG P E AKYR+ LN EP Sbjct: 41 RPTPEECEAVRDTLLALHGIPPELAKYRKLQPLNDAVQPESPEP---------------- 84 Query: 559 KESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTK 380 VLDGL+ +LSQNTT+ANS +AF SLKS+FPTWE V AE K VEN+I+CGGLA TK Sbjct: 85 ---VLDGLVRTVLSQNTTEANSQKAFVSLKSSFPTWEHVFGAESKDVENAIRCGGLAPTK 141 Query: 379 ASCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFP 200 ASCIKN+L L E++G+LCLEYLR +S DE KAEL FKGIGPKTVACVLMF+LQQDDFP Sbjct: 142 ASCIKNMLRCLRERRGQLCLEYLRDLSVDEAKAELSLFKGIGPKTVACVLMFNLQQDDFP 201 Query: 199 
VDTHVFRITKAMGWVPASSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGN 20 VDTH+F I+K MGWVP+ +DR KSYLHLN+RIPN+LKFDLNCL+ THGKLC++C+ K GN Sbjct: 202 VDTHIFEISKTMGWVPSVADRNKSYLHLNQRIPNELKFDLNCLMFTHGKLCRKCSSKKGN 261 Query: 19 QQNK 8 QQ K Sbjct: 262 QQGK 265 >ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802952 [Glycine max] Length = 284 Score = 315 bits (807), Expect = 2e-83 Identities = 160/244 (65%), Positives = 188/244 (77%) Frame = -2 Query: 739 RPTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQ 560 RPT QEC++VRD+LL LHG P E AKYR+ L P EP QL EP Sbjct: 41 RPTPQECEAVRDTLLALHGIPPELAKYRK---LPPS-----DEPVQLQPPEP-------- 84 Query: 559 KESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTK 380 VLDGL+ +LSQNTT+ANS +AFASLKS+FP+WE VL AE K VEN+I+CGGLA TK Sbjct: 85 ---VLDGLVRTVLSQNTTEANSQKAFASLKSSFPSWEQVLWAESKDVENAIRCGGLAPTK 141 Query: 379 ASCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFP 200 ASCIKN+L L E++G+LCLEYLR +S DE+KAEL FKGIGPKTVACVLMF+LQQDDFP Sbjct: 142 ASCIKNVLRCLRERRGELCLEYLRDLSVDEVKAELSLFKGIGPKTVACVLMFNLQQDDFP 201 Query: 199 VDTHVFRITKAMGWVPASSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGN 20 VDTH+F I K MGWVPA ++R KSYLHLN+R+PN+LKFDLNCLL THGKLC +C+ K GN Sbjct: 202 VDTHIFEIAKTMGWVPAVANRNKSYLHLNQRVPNELKFDLNCLLYTHGKLCHQCSGKKGN 261 Query: 19 QQNK 8 +Q K Sbjct: 262 KQGK 265 >ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula] gi|355509971|gb|AES91113.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula] Length = 280 Score = 315 bits (807), Expect = 2e-83 Identities = 165/285 (57%), Positives = 200/285 (70%), Gaps = 9/285 (3%) Frame = -2 Query: 835 MQRNRKRK---------NLHSISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHG 683 M++ RKRK N +S+ PQIK PT QEC +RD+LL+LHG Sbjct: 1 MEKKRKRKVKTERDGDRNPNSVQVPQIKTENPKNPFPSHSA-PTPQECLEIRDNLLSLHG 59 Query: 682 FPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTD 503 P E AKYR KS+ ++ + EP E+VLDGL+ +LSQNTT+ Sbjct: 60 IPPELAKYR--------------------KSQQTNDTVEPP-ETVLDGLVRTILSQNTTE 98 Query: 502 
ANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLC 323 ANS +AFASLKS FPTWE V AE K +EN+I+CGGLA TKA CIKNLL+ LLE+KGK+C Sbjct: 99 ANSNKAFASLKSLFPTWEHVHGAESKELENAIRCGGLAPTKAKCIKNLLSCLLERKGKMC 158 Query: 322 LEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASS 143 LEYLR +S DE+KAEL FKGIGPKTV+CVLMF+LQ DDFPVDTH+F I K MGWVPA++ Sbjct: 159 LEYLRDLSVDEVKAELSLFKGIGPKTVSCVLMFNLQLDDFPVDTHIFEIAKTMGWVPAAA 218 Query: 142 DREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNK 8 DR K+YLHLN+RIP++LKFDLNCLL THGKLC C+ K GN+Q K Sbjct: 219 DRNKTYLHLNQRIPDELKFDLNCLLYTHGKLCSNCSSKRGNKQQK 263 >ref|XP_006291571.1| hypothetical protein CARUB_v10017731mg [Capsella rubella] gi|482560278|gb|EOA24469.1| hypothetical protein CARUB_v10017731mg [Capsella rubella] Length = 298 Score = 313 bits (801), Expect = 1e-82 Identities = 162/281 (57%), Positives = 201/281 (71%), Gaps = 5/281 (1%) Frame = -2 Query: 835 MQRNRKRKNLHS----ISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 668 M + +KRK L+ P IK RPT++EC+ VRD+LL+LHGFP EF Sbjct: 1 MSKAQKRKRLNQGDGESKTPVIKSAVDGGDPYPALLRPTAEECRDVRDALLSLHGFPPEF 60 Query: 667 AKYRRTDY-LNPDYSSNGSEPSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANST 491 A YRR L +G++ + VK EP ++E ESVLDGL+ ILLSQNTT++NS Sbjct: 61 ASYRRKRLRLFSAVDDHGTQCT--VKPEPLDEAEE---ESVLDGLVKILLSQNTTESNSL 115 Query: 490 RAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYL 311 RAFASLK+ FP WEDVLAAE +EN+I+CGGLA KA CIKN+L RL +KG LCLEYL Sbjct: 116 RAFASLKAAFPKWEDVLAAESISIENAIRCGGLAPKKAVCIKNILNRLQNEKGVLCLEYL 175 Query: 310 RGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREK 131 R +S DE+K+EL FKG+GPKTV+CVLMF+LQ +DFPVDTHVF I KA+GWVP ++DR K Sbjct: 176 RSLSVDEVKSELSQFKGVGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNK 235 Query: 130 SYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTKKWGNQQNK 8 +Y+HLN+RIP++LKFDLNCLL THGKLC C K G + K Sbjct: 236 TYVHLNRRIPDELKFDLNCLLYTHGKLCSNCKKNVGKPKAK 276 >ref|XP_006404333.1| hypothetical protein EUTSA_v10010580mg [Eutrema salsugineum] gi|557105452|gb|ESQ45786.1| hypothetical protein EUTSA_v10010580mg [Eutrema 
salsugineum] Length = 302 Score = 312 bits (799), Expect = 2e-82 Identities = 157/275 (57%), Positives = 196/275 (71%), Gaps = 7/275 (2%) Frame = -2 Query: 835 MQRNRKRKNLH----SISKPQIKXXXXXXXXXXXXPRPTSQECQSVRDSLLTLHGFPQEF 668 M +++KR LH P K RPTS EC+ VRD+LL+LHGFP EF Sbjct: 1 MSKSQKRTRLHLDDGDSKTPATKSTVYGGDPYPSHLRPTSDECRDVRDALLSLHGFPPEF 60 Query: 667 AKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEPQ---KESVLDGLISILLSQNTTDAN 497 YRR L + +G +KSEP + + + +E+VLDGL+ ILLSQNTT+ N Sbjct: 61 DSYRR-QRLRSSSAVDGYHTHCTMKSEPLEAANDEKDEIEETVLDGLVKILLSQNTTEIN 119 Query: 496 STRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLE 317 S RAFASLK+ FP WEDVL AE K +EN+I+CGGLA KA CIKN+L+RL ++G+LCLE Sbjct: 120 SQRAFASLKAAFPKWEDVLGAEPKSIENAIRCGGLAPKKAVCIKNILSRLQSERGRLCLE 179 Query: 316 YLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDR 137 YLRG+S +E+K EL FKGIGPKTV+CVLMF+LQ +DFPVDTHVF I KA+GWVP ++DR Sbjct: 180 YLRGLSVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDTHVFEIAKAIGWVPKTADR 239 Query: 136 EKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTK 32 K+Y+HLN+RIP++LKFDLNCLL THGKLC C K Sbjct: 240 NKTYVHLNRRIPDELKFDLNCLLYTHGKLCSNCKK 274 >ref|XP_006836744.1| hypothetical protein AMTR_s00088p00146000 [Amborella trichopoda] gi|548839304|gb|ERM99597.1| hypothetical protein AMTR_s00088p00146000 [Amborella trichopoda] Length = 305 Score = 312 bits (799), Expect = 2e-82 Identities = 156/238 (65%), Positives = 187/238 (78%), Gaps = 2/238 (0%) Frame = -2 Query: 739 RPTSQECQSVRDSLLTLHGFPQEFAKYRRTDYLNPDYSSNGSEPSQLVKSEPSSTSQEP- 563 RPT QEC VRD+L++LHGFP+EFA++RR + + D E Q + P Sbjct: 53 RPTPQECLIVRDALISLHGFPEEFAEFRRKEAVVND----SFEEKQQKLDDEGEVRIAPL 108 Query: 562 -QKESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAV 386 Q SVLDGL+S++LSQNTTD NS RAF SLK FPTWEDV AAE K V N+IKCGGLA Sbjct: 109 IQGGSVLDGLVSVILSQNTTDVNSRRAFESLKLAFPTWEDVHAAESKSVVNTIKCGGLAE 168 Query: 385 TKASCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDD 206 TKASCIKN+L+ LLE+KGK+CL+YLR M D+IKAEL FKG+GPKTVACVLMF+LQ+DD Sbjct: 169 
TKASCIKNILSALLEQKGKICLDYLREMPIDKIKAELRHFKGVGPKTVACVLMFYLQKDD 228 Query: 205 FPVDTHVFRITKAMGWVPASSDREKSYLHLNKRIPNKLKFDLNCLLVTHGKLCQRCTK 32 FPVDTHVFRI KA+GWVP+ ++REK+YLHLN +IP+ LKFDLNCLLVTHGK C++CTK Sbjct: 229 FPVDTHVFRIVKAIGWVPSEANREKAYLHLNSQIPDDLKFDLNCLLVTHGKHCEKCTK 286