BLASTX nr result
ID: Akebia25_contig00028350
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00028350 (1535 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citr...   343   9e-92
ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Cit...   341   5e-91
ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis...   337   1e-89
emb|CBI15085.3| unnamed protein product [Vitis vinifera]              335   2e-89
ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vini...   335   2e-89
ref|XP_007036109.1| DNA glycosylase superfamily protein isoform ...   333   1e-88
ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Cit...   330   1e-87
gb|EXB42063.1| Protein ROS1 [Morus notabilis]                         322   2e-85
ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Popu...   322   2e-85
ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Frag...   317   7e-84
ref|XP_007155390.1| hypothetical protein PHAVU_003G197200g [Phas...   317   1e-83
ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago...   312   3e-82
ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802...   309   2e-81
ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Cit...   306   2e-80
ref|XP_004508835.1| PREDICTED: protein ROS1-like [Cicer arietinu...   306   2e-80
ref|XP_006291571.1| hypothetical protein CARUB_v10017731mg [Caps...   306   2e-80
ref|NP_566893.1| DNA glycosylase superfamily protein [Arabidopsi...   305   4e-80
ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp....   304   6e-80
ref|XP_006836744.1| hypothetical protein AMTR_s00088p00146000 [A...
302 2e-79 ref|XP_007036108.1| DNA glycosylase superfamily protein isoform ... 301 5e-79 >ref|XP_006439743.1| hypothetical protein CICLE_v10021561mg [Citrus clementina] gi|557542005|gb|ESR52983.1| hypothetical protein CICLE_v10021561mg [Citrus clementina] Length = 281 Score = 343 bits (881), Expect = 9e-92 Identities = 175/289 (60%), Positives = 207/289 (71%) Frame = -3 Query: 1431 MQRNRKRKNLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKYX 1252 MQ++RKRK Q++ PT++EC+ +RD LL LHGFP EF KY Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 1251 XXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 1072 + S E ++ESVLDGL+ LLSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDM------SEYDEGEEESVLDGLVKTLLSQNTTEANSLKAFAS 106 Query: 1071 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 892 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 891 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 712 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 711 NKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYC 565 N+RIPK+LKFDLNCLL THGKLC+ C KK GN+Q K SA + CPL +YC Sbjct: 227 NQRIPKELKFDLNCLLYTHGKLCRNCIKKGGNRQRKESAGNLCPLLNYC 275 >ref|XP_006476718.1| PREDICTED: protein ROS1-like isoform X1 [Citrus sinensis] Length = 281 Score = 341 bits (875), Expect = 5e-91 Identities = 173/289 (59%), Positives = 208/289 (71%) Frame = -3 Query: 1431 MQRNRKRKNLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKYX 1252 MQ++RKRK Q++ PT++EC+ +RD LL LHGFP EF KY Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 1251 XXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 1072 + + + E ++ESVLDGL+ +LSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106 Query: 1071 
LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 892 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 891 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 712 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 711 NKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYC 565 N+RIPK+LKFDLNCLL THGKLC+ C KK GN+Q K SA + CPL +YC Sbjct: 227 NQRIPKELKFDLNCLLYTHGKLCRNCIKKGGNRQRKESAGNLCPLLNYC 275 >ref|XP_002511456.1| Endonuclease III, putative [Ricinus communis] gi|223550571|gb|EEF52058.1| Endonuclease III, putative [Ricinus communis] Length = 291 Score = 337 bits (863), Expect = 1e-89 Identities = 172/297 (57%), Positives = 208/297 (70%), Gaps = 2/297 (0%) Frame = -3 Query: 1431 MQRNRKRK--NLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAK 1258 MQ+NRKRK + + +K+ PT +EC +RDSLL HGFPQEFAK Sbjct: 1 MQKNRKRKLKSAETETKSAKINNGNKEEPYPTHPRPTPEECLCIRDSLLAFHGFPQEFAK 60 Query: 1257 YXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAF 1078 Y ++ S + + +E+VLDGL+ +LSQNTT+ NS RAF Sbjct: 61 YRKQRLGGDDD------------NKSSDVNSDTAEETVLDGLVKTVLSQNTTEVNSQRAF 108 Query: 1077 ASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGM 898 +LKS+FPTW+DVLAAE K++EN+I+CGGLA KASCIKN+L LLEKKGK+CLEYLR M Sbjct: 109 DNLKSDFPTWQDVLAAEPKWIENAIRCGGLAPAKASCIKNILNCLLEKKGKICLEYLRDM 168 Query: 897 SADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYL 718 S DEIKAEL FKG+GPKTVACVLMFHLQQ+DFPVDTHVF I KA+GWVP +DR K+YL Sbjct: 169 SVDEIKAELSQFKGVGPKTVACVLMFHLQQEDFPVDTHVFEIAKALGWVPEVADRNKTYL 228 Query: 717 HLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYCFSTDLK 547 HLN+RIP +LKFDLNCLL THGKLC++C KK GNQ K S CPL YC S+ +K Sbjct: 229 HLNQRIPNELKFDLNCLLYTHGKLCRKCIKKRGNQSRKESHDDSCPLLSYCNSSSVK 285 >emb|CBI15085.3| unnamed protein product [Vitis vinifera] Length = 310 Score = 335 bits (860), Expect = 2e-89 Identities = 
177/300 (59%), Positives = 206/300 (68%), Gaps = 11/300 (3%) Frame = -3 Query: 1431 MQRNRKRKNLHSIS-----KTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQE 1267 MQR+RKRK S S T+ PT EC++VRD LL LHGFPQ Sbjct: 1 MQRSRKRKQEESSSCSKESATKSARNDVVVDPYPSHPRPTPVECRAVRDDLLALHGFPQR 60 Query: 1266 FAKYXXXXXXXXXXXXXXXXXS------QLVKSEPSSTSQEPQKESVLDGLISILLSQNT 1105 F KY +L S+ + QKESVLDGL+SI+LSQNT Sbjct: 61 FEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIILSQNT 120 Query: 1104 TDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGK 925 TD NS RAFASLKS FPTW+DVLAA+ K +EN+I+CGGLAVTKASCIK +L+ LLE+KGK Sbjct: 121 TDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGK 180 Query: 924 LCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPA 745 LCLEYLR ++ DEIK EL FKGIGPKTVACVLMFHLQ+DDFPVDTHV +I KA+GWVPA Sbjct: 181 LCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIGWVPA 240 Query: 744 SSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYC 565 +DR+K+YLHLN+RIP +LKFDLNCLL THGKLC CT+K NQ+ K S CPL YC Sbjct: 241 VADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHECTQKGANQKRKESHESSCPLLTYC 300 >ref|XP_002283633.1| PREDICTED: endonuclease III-like [Vitis vinifera] Length = 310 Score = 335 bits (860), Expect = 2e-89 Identities = 177/300 (59%), Positives = 206/300 (68%), Gaps = 11/300 (3%) Frame = -3 Query: 1431 MQRNRKRKNLHSIS-----KTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQE 1267 MQR+RKRK S S T+ PT EC++VRD LL LHGFPQ Sbjct: 1 MQRSRKRKQEESSSCSKESATKSARNDVVVDPYPSHPRPTPVECRAVRDDLLALHGFPQR 60 Query: 1266 FAKYXXXXXXXXXXXXXXXXXS------QLVKSEPSSTSQEPQKESVLDGLISILLSQNT 1105 F KY +L S+ + QKESVLDGL+SI+LSQNT Sbjct: 61 FEKYRKLRLPPLPHTSSPGLDGGGGTPVKLDPSDGDDVNGSSQKESVLDGLVSIILSQNT 120 Query: 1104 TDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGK 925 TD NS RAFASLKS FPTW+DVLAA+ K +EN+I+CGGLAVTKASCIK +L+ LLE+KGK Sbjct: 121 TDVNSQRAFASLKSAFPTWQDVLAADSKSIENAIRCGGLAVTKASCIKKMLSCLLERKGK 180 Query: 924 LCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPA 745 LCLEYLR ++ DEIK EL FKGIGPKTVACVLMFHLQ+DDFPVDTHV +I 
KA+GWVPA Sbjct: 181 LCLEYLRDLTVDEIKTELSHFKGIGPKTVACVLMFHLQRDDFPVDTHVIQIGKAIGWVPA 240 Query: 744 SSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYC 565 +DR+K+YLHLN+RIP +LKFDLNCLL THGKLC CT+K NQ+ K S CPL YC Sbjct: 241 VADRKKAYLHLNRRIPDELKFDLNCLLFTHGKLCHECTQKGANQKRKESHESSCPLLTYC 300 >ref|XP_007036109.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] gi|508773354|gb|EOY20610.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] Length = 292 Score = 333 bits (854), Expect = 1e-88 Identities = 173/298 (58%), Positives = 206/298 (69%) Frame = -3 Query: 1434 KMQRNRKRKNLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKY 1255 KMQ++RKRK L I PT EC+SVRD LL LHGFP EF KY Sbjct: 2 KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60 Query: 1254 XXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 1075 KSEP + + + +ESVLDGL+ +LSQNTT+ NS +AFA Sbjct: 61 RHQRLIKTEPTID-------AKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113 Query: 1074 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 895 SLKS FPTWEDVLAAE K +EN+I+CGGLA KASCIKN+L L E+KGKLC EYLR +S Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173 Query: 894 ADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLH 715 DEIKAEL FKG+GPKTVACVLMF+LQQDDFPVDTHVF I +A+GWVPA++DR+K+YLH Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLH 233 Query: 714 LNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYCFSTDLKEI 541 LN+RIP KLKFDLNCLL THGKLC++CT K +QQ CPL YC ++ + +I Sbjct: 234 LNRRIPNKLKFDLNCLLYTHGKLCRKCTMKGSSQQKSARNDDSCPLCTYCKNSSVNKI 291 >ref|XP_006476719.1| PREDICTED: protein ROS1-like isoform X2 [Citrus sinensis] Length = 278 Score = 330 bits (846), Expect = 1e-87 Identities = 169/284 (59%), Positives = 203/284 (71%) Frame = -3 Query: 1431 MQRNRKRKNLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKYX 1252 MQ++RKRK Q++ PT++EC+ +RD LL LHGFP EF KY Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 1251 
XXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 1072 + + + E ++ESVLDGL+ +LSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106 Query: 1071 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 892 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 891 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 712 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 711 NKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCP 580 N+RIPK+LKFDLNCLL THGKLC+ C KK GN+Q K SA + P Sbjct: 227 NQRIPKELKFDLNCLLYTHGKLCRNCIKKGGNRQRKESAGNILP 270 >gb|EXB42063.1| Protein ROS1 [Morus notabilis] Length = 308 Score = 322 bits (826), Expect = 2e-85 Identities = 161/257 (62%), Positives = 187/257 (72%) Frame = -3 Query: 1329 TSQECQSVRDSLLTLHGFPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKE 1150 T +C++VRD LL LHGFPQEFAKY K + + KE Sbjct: 69 TPDQCRAVRDDLLALHGFPQEFAKYRRQ------------------KPTTDNGEESESKE 110 Query: 1149 SVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKAS 970 SVLDGL+ +LSQNTT+ANS RAFASLKS FPTWE VL A+ K +E++I+CGGLA KAS Sbjct: 111 SVLDGLVMTVLSQNTTEANSQRAFASLKSAFPTWEQVLNADSKCIEDAIRCGGLAPKKAS 170 Query: 969 CIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVD 790 CIKN L LLE+KGKLCLEYL S DE+KAEL FKGIGPKTVACVLMFHLQQDDFPVD Sbjct: 171 CIKNTLRSLLERKGKLCLEYLLDFSVDEVKAELSCFKGIGPKTVACVLMFHLQQDDFPVD 230 Query: 789 THVFRITKAMGWVPASSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQ 610 THVF I KA+GW+PA +DR K+YLHLN+RIP +LKFDLNCLL THGK+C++C KK G+Q Sbjct: 231 THVFEIAKALGWLPAGADRNKAYLHLNQRIPNELKFDLNCLLYTHGKMCRKCIKKGGSQI 290 Query: 609 NKVSAAHPCPLSDYCFS 559 K S+ CPL YC S Sbjct: 291 KKGSSDDSCPLLHYCKS 307 >ref|XP_002321564.2| hypothetical protein POPTR_0015s08260g [Populus trichocarpa] gi|550322300|gb|EEF05691.2| hypothetical protein POPTR_0015s08260g [Populus 
trichocarpa] Length = 306 Score = 322 bits (826), Expect = 2e-85 Identities = 157/262 (59%), Positives = 195/262 (74%), Gaps = 7/262 (2%) Frame = -3 Query: 1329 TSQECQSVRDSLLTLHGFPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTS------ 1168 T +EC+++RDSLL HGFPQEFAKY + + + + Sbjct: 41 TPEECRAIRDSLLAFHGFPQEFAKYRKQRPYLITLQDKEESPHLINNCDGKNDNVVKVEE 100 Query: 1167 -QEPQKESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGG 991 +E ++ESVLDGL+ +LSQNTT+ NS RAF +LKS FPTWE+VLAAE KF+E++I+CGG Sbjct: 101 EEEEEEESVLDGLVKTVLSQNTTEVNSQRAFLNLKSAFPTWENVLAAESKFIEDAIRCGG 160 Query: 990 LAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQ 811 LA TKA+CI+N+L+ L+EK G+LCLEYLR + EIKAEL FKGIGPKTVACVLMF+LQ Sbjct: 161 LAPTKAACIRNILSSLMEKNGRLCLEYLRDLPVAEIKAELSHFKGIGPKTVACVLMFNLQ 220 Query: 810 QDDFPVDTHVFRITKAMGWVPASSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCT 631 +DDFPVDTHVF I KA+GWVP +DR K+YLHLN RIPK+LKFDLNCLL THGKLC++CT Sbjct: 221 KDDFPVDTHVFEIAKAIGWVPPVADRNKTYLHLNHRIPKELKFDLNCLLYTHGKLCRKCT 280 Query: 630 KKWGNQQNKVSAAHPCPLSDYC 565 KK G+QQ K + CPL +YC Sbjct: 281 KKSGSQQRKETHDDSCPLLNYC 302 >ref|XP_004299588.1| PREDICTED: DEMETER-like protein 3-like [Fragaria vesca subsp. 
vesca] Length = 286 Score = 317 bits (813), Expect = 7e-84 Identities = 172/296 (58%), Positives = 204/296 (68%), Gaps = 7/296 (2%) Frame = -3 Query: 1431 MQRNRKRKN-LHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKY 1255 M +NRKRK + ++ PT +EC SVRD LL LHGFP+EFAKY Sbjct: 1 MPKNRKRKEQAEADHNPKLPTKTTPKDPYPNHARPTREECVSVRDDLLALHGFPKEFAKY 60 Query: 1254 XXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEP--QKESVLDGLISILLSQNTTDANSTRA 1081 Q + S EP +KESVLDGL+ LLSQNTT++NS +A Sbjct: 61 REQRLSS-----------QASNGHDNDVSSEPLDEKESVLDGLVRTLLSQNTTESNSLKA 109 Query: 1080 FASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRG 901 FASLKS FPTWE+VLAA+ + +E++I+CGGLA TKASCIKN+L+ LLEKK KLCLEYLR Sbjct: 110 FASLKSAFPTWEEVLAADSQSLESAIRCGGLAKTKASCIKNMLSCLLEKKEKLCLEYLRD 169 Query: 900 MSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSY 721 +S DEIKAEL FKGIGPKTVACVLMF LQQDDFPVDTHV+ I KAM WVP +DR K+Y Sbjct: 170 LSVDEIKAELSHFKGIGPKTVACVLMFQLQQDDFPVDTHVYEIAKAMAWVPVGADRNKTY 229 Query: 720 LHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGN---QQNKVSA-AHPCPLSDYC 565 LHLN+ IP +LKFDLNCLL THGKLC++C KK G+ QQ K S ++ CPL YC Sbjct: 230 LHLNQWIPDELKFDLNCLLYTHGKLCRKCIKKGGSTGKQQEKESEDSNSCPLLRYC 285 >ref|XP_007155390.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris] gi|561028744|gb|ESW27384.1| hypothetical protein PHAVU_003G197200g [Phaseolus vulgaris] Length = 282 Score = 317 bits (811), Expect = 1e-83 Identities = 157/260 (60%), Positives = 191/260 (73%), Gaps = 1/260 (0%) Frame = -3 Query: 1329 TSQECQSVRDSLLTLHGFPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQK- 1153 T +EC++VRD+LL LHG P E AKY K +P + + +P+ Sbjct: 43 TPEECEAVRDTLLALHGIPPELAKYR--------------------KLQPLNDAVQPESP 82 Query: 1152 ESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKA 973 E VLDGL+ +LSQNTT+ANS +AF SLKS+FPTWE V AE K VEN+I+CGGLA TKA Sbjct: 83 EPVLDGLVRTVLSQNTTEANSQKAFVSLKSSFPTWEHVFGAESKDVENAIRCGGLAPTKA 142 Query: 972 SCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPV 793 SCIKN+L L E++G+LCLEYLR +S DE KAEL FKGIGPKTVACVLMF+LQQDDFPV Sbjct: 143 
SCIKNMLRCLRERRGQLCLEYLRDLSVDEAKAELSLFKGIGPKTVACVLMFNLQQDDFPV 202 Query: 792 DTHVFRITKAMGWVPASSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQ 613 DTH+F I+K MGWVP+ +DR KSYLHLN+RIP +LKFDLNCL+ THGKLC++C+ K GNQ Sbjct: 203 DTHIFEISKTMGWVPSVADRNKSYLHLNQRIPNELKFDLNCLMFTHGKLCRKCSSKKGNQ 262 Query: 612 QNKVSAAHPCPLSDYCFSTD 553 Q K CPL +YC +D Sbjct: 263 QGKKGNDKSCPLLNYCKESD 282 >ref|XP_003608916.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula] gi|355509971|gb|AES91113.1| Ultraviolet N-glycosylase/AP lyase [Medicago truncatula] Length = 280 Score = 312 bits (799), Expect = 3e-82 Identities = 166/297 (55%), Positives = 201/297 (67%), Gaps = 9/297 (3%) Frame = -3 Query: 1431 MQRNRKRK---------NLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHG 1279 M++ RKRK N +S+ QIK T QEC +RD+LL+LHG Sbjct: 1 MEKKRKRKVKTERDGDRNPNSVQVPQIKTENPKNPFPSHSAP-TPQECLEIRDNLLSLHG 59 Query: 1278 FPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTD 1099 P E AKY KS+ ++ + EP E+VLDGL+ +LSQNTT+ Sbjct: 60 IPPELAKYR--------------------KSQQTNDTVEPP-ETVLDGLVRTILSQNTTE 98 Query: 1098 ANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLC 919 ANS +AFASLKS FPTWE V AE K +EN+I+CGGLA TKA CIKNLL+ LLE+KGK+C Sbjct: 99 ANSNKAFASLKSLFPTWEHVHGAESKELENAIRCGGLAPTKAKCIKNLLSCLLERKGKMC 158 Query: 918 LEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASS 739 LEYLR +S DE+KAEL FKGIGPKTV+CVLMF+LQ DDFPVDTH+F I K MGWVPA++ Sbjct: 159 LEYLRDLSVDEVKAELSLFKGIGPKTVSCVLMFNLQLDDFPVDTHIFEIAKTMGWVPAAA 218 Query: 738 DREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDY 568 DR K+YLHLN+RIP +LKFDLNCLL THGKLC C+ K GN+Q K CPL +Y Sbjct: 219 DRNKTYLHLNQRIPDELKFDLNCLLYTHGKLCSNCSSKRGNKQQKKFNDSSCPLLNY 275 >ref|XP_003525486.1| PREDICTED: uncharacterized protein LOC100802952 [Glycine max] Length = 284 Score = 309 bits (791), Expect = 2e-81 Identities = 157/254 (61%), Positives = 188/254 (74%) Frame = -3 Query: 1329 TSQECQSVRDSLLTLHGFPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKE 1150 T QEC++VRD+LL LHG P E AKY +L S+ Q P E Sbjct: 43 
TPQECEAVRDTLLALHGIPPELAKYR-----------------KLPPSDEPVQLQPP--E 83 Query: 1149 SVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKAS 970 VLDGL+ +LSQNTT+ANS +AFASLKS+FP+WE VL AE K VEN+I+CGGLA TKAS Sbjct: 84 PVLDGLVRTVLSQNTTEANSQKAFASLKSSFPSWEQVLWAESKDVENAIRCGGLAPTKAS 143 Query: 969 CIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVD 790 CIKN+L L E++G+LCLEYLR +S DE+KAEL FKGIGPKTVACVLMF+LQQDDFPVD Sbjct: 144 CIKNVLRCLRERRGELCLEYLRDLSVDEVKAELSLFKGIGPKTVACVLMFNLQQDDFPVD 203 Query: 789 THVFRITKAMGWVPASSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQ 610 TH+F I K MGWVPA ++R KSYLHLN+R+P +LKFDLNCLL THGKLC +C+ K GN+Q Sbjct: 204 THIFEIAKTMGWVPAVANRNKSYLHLNQRVPNELKFDLNCLLYTHGKLCHQCSGKKGNKQ 263 Query: 609 NKVSAAHPCPLSDY 568 K + CPL +Y Sbjct: 264 GKKCDDNSCPLLNY 277 >ref|XP_006476720.1| PREDICTED: protein ROS1-like isoform X3 [Citrus sinensis] Length = 258 Score = 306 bits (784), Expect = 2e-80 Identities = 160/272 (58%), Positives = 192/272 (70%) Frame = -3 Query: 1431 MQRNRKRKNLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKYX 1252 MQ++RKRK Q++ PT++EC+ +RD LL LHGFP EF KY Sbjct: 1 MQKSRKRK--------QVEVTETRQDPYPTHSRPTAEECRGIRDELLALHGFPPEFVKYR 52 Query: 1251 XXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFAS 1072 + + + E ++ESVLDGL+ +LSQNTT+ANS +AFAS Sbjct: 53 NQRLKHNMTRDKNSVPLDMNEYD------EGEEESVLDGLVKTVLSQNTTEANSLKAFAS 106 Query: 1071 LKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMSA 892 LKS FPTWE VLAAE K +EN+I+CGGLA TKA+CIKN+L LLE KGKLCLEYLRG+S Sbjct: 107 LKSTFPTWEHVLAAEQKCIENAIRCGGLAPTKAACIKNILKCLLESKGKLCLEYLRGLSI 166 Query: 891 DEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLHL 712 DEIKAEL F+GIGPKTVACVLMFHLQQDDFPVDTHVF I+KA+GWVP ++DR K+YLHL Sbjct: 167 DEIKAELSRFRGIGPKTVACVLMFHLQQDDFPVDTHVFEISKAIGWVPTAADRNKTYLHL 226 Query: 711 NKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGN 616 N+RIPK+LKFDLNCLL THG + R K GN Sbjct: 227 NQRIPKELKFDLNCLLYTHGNILPRA--KEGN 256 >ref|XP_004508835.1| PREDICTED: protein ROS1-like [Cicer arietinum] gi|502152248|ref|XP_004508836.1| 
PREDICTED: protein ROS1-like [Cicer arietinum] Length = 285 Score = 306 bits (783), Expect = 2e-80 Identities = 161/298 (54%), Positives = 195/298 (65%), Gaps = 10/298 (3%) Frame = -3 Query: 1431 MQRNRKRK---------NLHSISKTQIKXXXXXXXXXXXXXXP-TSQECQSVRDSLLTLH 1282 M++ RKRK N S+ +QI+ T QEC +RD+LL LH Sbjct: 1 MEKKRKRKQEAKRNEERNAKSVKASQIQTENENLKEPFPSHSGPTPQECLDIRDTLLALH 60 Query: 1281 GFPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTT 1102 G P E AKY + + T E+VLDGL+ +LSQNTT Sbjct: 61 GLPPELAKYRKS------------------QQQTDDTINPDPPETVLDGLVRTILSQNTT 102 Query: 1101 DANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKL 922 ++NS +AFASLKS+FPTWE V AE K +EN+I+CGGLA TKASCIKNLL LLEK+GK Sbjct: 103 ESNSNKAFASLKSSFPTWEHVHGAESKELENAIRCGGLAPTKASCIKNLLRCLLEKRGKF 162 Query: 921 CLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPAS 742 CLEYLR +S +IKAEL FKGIGPKTVACVLMF+LQQDDFPVDTH+F I K +GWVPA Sbjct: 163 CLEYLRDLSVAQIKAELSLFKGIGPKTVACVLMFNLQQDDFPVDTHIFEIAKTIGWVPAV 222 Query: 741 SDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDY 568 +DR K+YLHLN+RIP +LKFDLNCLL THGK C +C+ K GN+Q K + CPL +Y Sbjct: 223 ADRNKTYLHLNQRIPNELKFDLNCLLYTHGKFCSKCSSKRGNKQQKKFNDNSCPLLNY 280 >ref|XP_006291571.1| hypothetical protein CARUB_v10017731mg [Capsella rubella] gi|482560278|gb|EOA24469.1| hypothetical protein CARUB_v10017731mg [Capsella rubella] Length = 298 Score = 306 bits (783), Expect = 2e-80 Identities = 163/293 (55%), Positives = 197/293 (67%), Gaps = 4/293 (1%) Frame = -3 Query: 1431 MQRNRKRKNLHS---ISKTQ-IKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEF 1264 M + +KRK L+ SKT IK PT++EC+ VRD+LL+LHGFP EF Sbjct: 1 MSKAQKRKRLNQGDGESKTPVIKSAVDGGDPYPALLRPTAEECRDVRDALLSLHGFPPEF 60 Query: 1263 AKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTR 1084 A Y VK EP ++E ESVLDGL+ ILLSQNTT++NS R Sbjct: 61 ASYRRKRLRLFSAVDDHGTQCT-VKPEPLDEAEE---ESVLDGLVKILLSQNTTESNSLR 116 Query: 1083 AFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLR 904 AFASLK+ FP WEDVLAAE +EN+I+CGGLA KA CIKN+L RL +KG 
LCLEYLR Sbjct: 117 AFASLKAAFPKWEDVLAAESISIENAIRCGGLAPKKAVCIKNILNRLQNEKGVLCLEYLR 176 Query: 903 GMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKS 724 +S DE+K+EL FKG+GPKTV+CVLMF+LQ +DFPVDTHVF I KA+GWVP ++DR K+ Sbjct: 177 SLSVDEVKSELSQFKGVGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKT 236 Query: 723 YLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPLSDYC 565 Y+HLN+RIP +LKFDLNCLL THGKLC C K G + K A P D C Sbjct: 237 YVHLNRRIPDELKFDLNCLLYTHGKLCSNCKKNVGKPKAKAKAKEASPSPDNC 289 >ref|NP_566893.1| DNA glycosylase superfamily protein [Arabidopsis thaliana] gi|332644814|gb|AEE78335.1| DNA glycosylase superfamily protein [Arabidopsis thaliana] Length = 293 Score = 305 bits (781), Expect = 4e-80 Identities = 156/289 (53%), Positives = 197/289 (68%), Gaps = 4/289 (1%) Frame = -3 Query: 1431 MQRNRKRKNLHSI---SKTQI-KXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEF 1264 M + +KRK L+ SKT K PT++EC+ VRD+LL+LHGFP EF Sbjct: 1 MSKAQKRKRLNKYDGESKTPANKSTVDGGNPYPTLLRPTAEECRDVRDALLSLHGFPPEF 60 Query: 1263 AKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTR 1084 A Y L S T E ++ESVLDGL+ ILLSQNTT++NS R Sbjct: 61 ANYRRQRLRSFSAVDDHDTQCNL----KSETLNETEEESVLDGLVKILLSQNTTESNSQR 116 Query: 1083 AFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLR 904 AFASLK+ FP W+DVL AE K +EN+I+CGGLA KA CIKN+L RL ++G+LCLEYLR Sbjct: 117 AFASLKATFPKWDDVLNAESKSIENAIRCGGLAPKKAVCIKNILNRLQNERGRLCLEYLR 176 Query: 903 GMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKS 724 G+S +E+K EL FKG+GPKTV+CVLMF+LQ +DFPVDTHVF I KA+GWVP ++DR K+ Sbjct: 177 GLSVEEVKTELSHFKGVGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKT 236 Query: 723 YLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHPCPL 577 Y+HLN++IP +LKFDLNCLL THGK+C C K + KV++ CPL Sbjct: 237 YVHLNRKIPDELKFDLNCLLYTHGKICSNCKKNVAKPKAKVASPDDCPL 285 >ref|XP_002875868.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297321706|gb|EFH52127.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 294 Score = 304 bits (779), Expect = 6e-80 Identities = 163/291 (56%), Positives = 199/291 (68%), Gaps = 6/291 (2%) Frame = -3 Query: 1431 MQRNRKRKNLHSI---SKTQ-IKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEF 1264 M + +KRK L+ SKT IK PT++EC+ VRD+LL+LHGFP EF Sbjct: 1 MSKAQKRKRLNQDDGESKTPAIKSTVDGSNPYPTLLRPTAEECREVRDALLSLHGFPPEF 60 Query: 1263 AKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTR 1084 A Y + KSEP ++E ESVLDGL+ ILLSQNTT++NS R Sbjct: 61 ANYRRQRLRSLSAVDGHDTQCTM-KSEPLDEAEE---ESVLDGLVKILLSQNTTESNSQR 116 Query: 1083 AFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLR 904 AFASLK+ FP WEDVLAAE K +E++I+CGGLA KA CIKN+L RL ++G LCLEYLR Sbjct: 117 AFASLKAAFPNWEDVLAAESKSIESAIRCGGLAPKKAVCIKNILNRLQTERGVLCLEYLR 176 Query: 903 GMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKS 724 G+S +E+K EL FKGIGPKTV+CVLMF+LQ +DFPVDTHVF I KA+GWVP ++DR K+ Sbjct: 177 GLSVEEVKTELSHFKGIGPKTVSCVLMFNLQHNDFPVDTHVFEIAKALGWVPKTADRNKT 236 Query: 723 YLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGNQQNKVSAAHP--CPL 577 Y+HLN+RIP +LKFDLNCLL THGKLC C K + K A P CPL Sbjct: 237 YVHLNRRIPDELKFDLNCLLYTHGKLCSNCKKTVAKPKAKARVASPDECPL 287 >ref|XP_006836744.1| hypothetical protein AMTR_s00088p00146000 [Amborella trichopoda] gi|548839304|gb|ERM99597.1| hypothetical protein AMTR_s00088p00146000 [Amborella trichopoda] Length = 305 Score = 302 bits (774), Expect = 2e-79 Identities = 156/256 (60%), Positives = 187/256 (73%), Gaps = 2/256 (0%) Frame = -3 Query: 1329 TSQECQSVRDSLLTLHGFPQEFAKYXXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEP--Q 1156 T QEC VRD+L++LHGFP+EFA++ Q + P Q Sbjct: 55 TPQECLIVRDALISLHGFPEEFAEFRRKEAVVNDSFEEK----QQKLDDEGEVRIAPLIQ 110 Query: 1155 KESVLDGLISILLSQNTTDANSTRAFASLKSNFPTWEDVLAAELKFVENSIKCGGLAVTK 976 SVLDGL+S++LSQNTTD NS RAF SLK FPTWEDV AAE K V N+IKCGGLA TK Sbjct: 111 GGSVLDGLVSVILSQNTTDVNSRRAFESLKLAFPTWEDVHAAESKSVVNTIKCGGLAETK 170 Query: 975 ASCIKNLLTRLLEKKGKLCLEYLRGMSADEIKAELLGFKGIGPKTVACVLMFHLQQDDFP 796 ASCIKN+L+ LLE+KGK+CL+YLR M D+IKAEL FKG+GPKTVACVLMF+LQ+DDFP Sbjct: 171 
ASCIKNILSALLEQKGKICLDYLREMPIDKIKAELRHFKGVGPKTVACVLMFYLQKDDFP 230 Query: 795 VDTHVFRITKAMGWVPASSDREKSYLHLNKRIPKKLKFDLNCLLVTHGKLCQRCTKKWGN 616 VDTHVFRI KA+GWVP+ ++REK+YLHLN +IP LKFDLNCLLVTHGK C++CTK Sbjct: 231 VDTHVFRIVKAIGWVPSEANREKAYLHLNSQIPDDLKFDLNCLLVTHGKHCEKCTKGHRA 290 Query: 615 QQNKVSAAHPCPLSDY 568 Q+ + + CPLS Y Sbjct: 291 QRTPLGS---CPLSSY 303 >ref|XP_007036108.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508773353|gb|EOY20609.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 446 Score = 301 bits (771), Expect = 5e-79 Identities = 167/299 (55%), Positives = 194/299 (64%), Gaps = 24/299 (8%) Frame = -3 Query: 1434 KMQRNRKRKNLHSISKTQIKXXXXXXXXXXXXXXPTSQECQSVRDSLLTLHGFPQEFAKY 1255 KMQ++RKRK L I PT EC+SVRD LL LHGFP EF KY Sbjct: 2 KMQKSRKRKQL-GIDGHSKTPKITTEEPYPSHHRPTPDECRSVRDELLALHGFPAEFLKY 60 Query: 1254 XXXXXXXXXXXXXXXXXSQLVKSEPSSTSQEPQKESVLDGLISILLSQNTTDANSTRAFA 1075 KSEP + + + +ESVLDGL+ +LSQNTT+ NS +AFA Sbjct: 61 RHQRLIKTEPTID-------AKSEPLNNNYDDGEESVLDGLVKTVLSQNTTELNSQKAFA 113 Query: 1074 SLKSNFPTWEDVLAAELKFVENSIKCGGLAVTKASCIKNLLTRLLEKKGKLCLEYLRGMS 895 SLKS FPTWEDVLAAE K +EN+I+CGGLA KASCIKN+L L E+KGKLC EYLR +S Sbjct: 114 SLKSAFPTWEDVLAAESKNLENAIRCGGLAPRKASCIKNVLRCLHERKGKLCFEYLRDLS 173 Query: 894 ADEIKAELLGFKGIGPKTVACVLMFHLQQDDFPVDTHVFRITKAMGWVPASSDREKSYLH 715 DEIKAEL FKG+GPKTVACVLMF+LQQDDFPVDTHVF I +A+GWVPA++DR+K+YLH Sbjct: 174 IDEIKAELSNFKGVGPKTVACVLMFNLQQDDFPVDTHVFEIARAIGWVPATADRKKTYLH 233 Query: 714 LNKRIPKKLKFDLNCLLVTH--------GKL----------------CQRCTKKWGNQQ 610 LN+RIP KLKFDLNCLL TH GK CQ C KK+ N Q Sbjct: 234 LNRRIPNKLKFDLNCLLYTHDGQGTVEAGKTVKEKSVTRKLEKRKYECQFCLKKFTNSQ 292