BLASTX nr result
ID: Ephedra28_contig00005243
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00005243 (1467 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [R...  338   4e-90
gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus...  334   5e-89
ref|XP_002312220.1| methyladenine glycosylase family protein [Po...  334   5e-89
ref|XP_002315089.2| methyladenine glycosylase family protein [Po...  333   8e-89
ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...  331   5e-88
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...  331   5e-88
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...  327   6e-87
gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus pe...  325   4e-86
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...  324   7e-86
ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246...  323   1e-85
ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614...  321   6e-85
ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594...  319   2e-84
ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...  317   1e-83
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]  316   1e-83
ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citr...  316   2e-83
ref|XP_002299382.2| hypothetical protein POPTR_0001s12320g [Popu...  315   2e-83
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...  315   3e-83
ref|XP_002515807.1| DNA-3-methyladenine glycosylase, putative [R...
315 3e-83 ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago tr... 315 3e-83 ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298... 315 4e-83 >ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223530365|gb|EEF32255.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 403 Score = 338 bits (866), Expect = 4e-90 Identities = 190/427 (44%), Positives = 266/427 (62%), Gaps = 25/427 (5%) Frame = -3 Query: 1234 SSNTMSGSKVVEIGDRPVLQPRVPKANEGNNATLTGTKTMPKKSLKTTL-SPPSRP---- 1070 +++ + S + +I RPVLQP+ + TL ++ K S K+ + PP+ P Sbjct: 17 ANHHIPASTIAKINGRPVLQPKSDQV-----PTLERRNSLKKNSPKSPIIQPPAAPLPLL 71 Query: 1069 PKNTNVG--HPKTLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRAND 896 P T + P +L+ PP SP K S R P LKR ND Sbjct: 72 PTTTTIKPKQPSSLS-PPISPKLK---------------------SPRPP---ALKRGND 106 Query: 895 TTSLNTSAD---------AKAPLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS- 746 +LN+SA+ + +SK SSP PV++ + +++ S Sbjct: 107 LNTLNSSAEKFLTPRKAVSTTLKKSKKSSPATPVVAETCT-----------VLNYSSSLI 155 Query: 745 VRSPGSLAAARKAELAEMCAQRKMKISHYGRKQGTPKAQKVVEDLPNDEGV--------K 590 V +PGS+AAAR+ +A M QRK++ +HYGR K+++ + +P D + Sbjct: 156 VEAPGSIAAARREHVATMQEQRKLRTAHYGRVNSGSKSKRDAKIVPVDSPAATAVPQEER 215 Query: 589 RCGFITSQSDPAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAF 410 RC FIT SDP +VAYHDQEWGVPVHDDKMLFELLVL GAQ+G +W ++L KREAFREAF Sbjct: 216 RCSFITPSSDPIYVAYHDQEWGVPVHDDKMLFELLVLTGAQIGSDWTSVLKKREAFREAF 275 Query: 409 GGFDPEIVATFNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVN 230 GFD EIVA F+EKK +++AEYG M+++++RG+V+N+ +IL++ KEFGSFD+Y+W FVN Sbjct: 276 SGFDAEIVAKFSEKKTTSISAEYG-MEISQVRGVVDNSNRILQVKKEFGSFDKYLWGFVN 334 Query: 229 YSPIVNKYKYARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFR 50 + PI +Y+ + +PVKTSK+ETISKD+VKR FR+VGPT+MHSFMQAAGL+NDHL++C R Sbjct: 335 HKPITTQYRSSNKIPVKTSKSETISKDMVKRGFRYVGPTVMHSFMQAAGLSNDHLISCSR 394 Query: 49 HEECIAL 29 H +C+AL Sbjct: 395 HHQCLAL 401 >gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris] 
Length = 405 Score = 334 bits (857), Expect = 5e-89 Identities = 196/417 (47%), Positives = 253/417 (60%), Gaps = 13/417 (3%) Frame = -3 Query: 1240 TNSSNTMSGSKVVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRP 1070 T +S M V I RPVLQP RVP N+ K P KSL SPPS P Sbjct: 20 TTTSTVMPS--VARINGRPVLQPTCNRVPNLERRNSIK----KVQPPKSL----SPPSPP 69 Query: 1069 PKNTNVGHPKTLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTT 890 + KT TPP SP SK S R+P +KR ND Sbjct: 70 LSS------KTSLTPPVSPKSK---------------------SPRLP---AVKRGNDNN 99 Query: 889 SLNTSADAKAPLESKASSPKCP-VISVSVKGAANGRKKPRKSMSFDGSSVR-SPGSLAAA 716 LNTS + A +S + +P S S K + S S+ S + SPGS+AA Sbjct: 100 GLNTSYEKIAIPKSSSKAPTLERKKSKSFKEGSCAPASTEASFSYASSLITDSPGSIAAV 159 Query: 715 RKAELAEMCAQRKMKISHYGRKQGTPKAQKVVEDLPND--------EGVKRCGFITSQSD 560 R+ ++A AQRKMKI+HYGR + K ++VV P+ E KRC FIT+ SD Sbjct: 160 RREQMALQQAQRKMKIAHYGRSKSA-KFERVVPLDPSTTTLTSKPTEEEKRCSFITANSD 218 Query: 559 PAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVAT 380 P ++AYHD+EWGVPVHDDKMLFELLVL GAQVG +W + L KR+ FR AF FD E VA Sbjct: 219 PIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRQDFRAAFSDFDAETVAN 278 Query: 379 FNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKY 200 +K++ ++++EYG+ D++++RG+V+NA QILEI K+FGSFD+YIW FVN+ PI +YK+ Sbjct: 279 LTDKQMMSISSEYGI-DISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKF 337 Query: 199 ARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 +PVKTSK+E+ISKD+V+R +RFVGPT++HSFMQAAGLTNDHL+ C RH +C L Sbjct: 338 GHKIPVKTSKSESISKDMVRRGYRFVGPTVVHSFMQAAGLTNDHLITCHRHLQCTLL 394 >ref|XP_002312220.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|118486806|gb|ABK95238.1| unknown [Populus trichocarpa] gi|222852040|gb|EEE89587.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 403 Score = 334 bits (857), Expect = 5e-89 Identities = 192/407 (47%), Positives = 253/407 (62%), Gaps = 17/407 (4%) Frame = -3 Query: 1207 VVEIGDRPVLQPRVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKTLAT 1028 V I RPVLQP + TL ++ K + K++ PP 
PP +N + A+ Sbjct: 18 VARINGRPVLQPTCNLVS-----TLERRNSLKKTAPKSSPPPPPPPPTFSNKTNK---AS 69 Query: 1027 PPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKAPLES 848 PP SP SK S R+P +KR +D SLN+S++ + Sbjct: 70 PPLSPMSK---------------------SPRLP---AIKRGSDANSLNSSSEKVVIPRN 105 Query: 847 KASSPKCP-VISVSVKGAANGRKKPRK----SMSFDGSS-VRSPGSLAAARKAELAEMCA 686 +P S S K ++ GR S+S+ S V +PGS+AA R+ ++A A Sbjct: 106 TTKTPTLERKKSKSFKESSVGRGVHSSFIEASLSYSSSLIVEAPGSIAAVRREQMALQHA 165 Query: 685 QRKMKISHYGRKQGTPKAQKVVEDLPNDEGV-----------KRCGFITSQSDPAHVAYH 539 QRKM+I+HYGR + +VV PND + KRC FIT+ SDP +VAYH Sbjct: 166 QRKMRIAHYGRSKSARFEDQVV---PNDSSISMATKTDQEEEKRCSFITANSDPIYVAYH 222 Query: 538 DQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIA 359 D+EWGVPVHDDKMLFELLVL GAQVG +W +IL KR+ FR+AF GFD EIVA +EK+I Sbjct: 223 DEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIVANISEKQIM 282 Query: 358 AVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVK 179 +++AEYG+ D++++RG+V+N+ +ILEI KEFGSFDRYIW+FVN PI YK+ +PVK Sbjct: 283 SISAEYGI-DMSRVRGVVDNSNRILEIKKEFGSFDRYIWTFVNNKPISTSYKFGHKIPVK 341 Query: 178 TSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEEC 38 TSK+ETISKD+V+R FRFVGPT++HSFMQAAGLTNDHL+ C RH C Sbjct: 342 TSKSETISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITCHRHLPC 388 >ref|XP_002315089.2| methyladenine glycosylase family protein [Populus trichocarpa] gi|550330066|gb|EEF01260.2| methyladenine glycosylase family protein [Populus trichocarpa] Length = 411 Score = 333 bits (855), Expect = 8e-89 Identities = 194/416 (46%), Positives = 251/416 (60%), Gaps = 26/416 (6%) Frame = -3 Query: 1207 VVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKT 1037 V I RPVLQP RVP N+ T K+ P PP PP + N + Sbjct: 18 VARINGRPVLQPTCNRVPTLERHNSLKKTAPKSPPPPP------PPLPPPTSANKTNK-- 69 Query: 1036 LATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSAD---- 869 A+PP SP SK S R+P +KR +D SLN+S+D Sbjct: 70 -ASPPLSPKSK---------------------SPRLP---AIKRGSDANSLNSSSDKVVI 104 Query: 868 
----AKAPLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS-VRSPGSLAAARKAE 704 AK P+ + S SV G+ S+S+ S V +PGS+AA R+ + Sbjct: 105 PRSTAKTPILERKKSKSFKETSV---GSGALSSSIEASLSYSSSLIVEAPGSIAAVRREQ 161 Query: 703 LAEMCAQRKMKISHYGRKQGTPKAQKVVE-------DLPNDEGVKRCGFITSQS------ 563 +A AQRKM+I+HYGR + + KVV DE KRC FIT+ S Sbjct: 162 MALQHAQRKMRIAHYGRSKSSRFEAKVVPVDSSINVTTKTDEEEKRCSFITANSGKEKYE 221 Query: 562 -DPAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIV 386 +P +VAYHD+EWGVPVHDDKMLFELLVL GAQVG +W +IL KR+ FR+AF GFD EIV Sbjct: 222 MNPIYVAYHDKEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIV 281 Query: 385 ATFNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKY 206 A EK++ +++AEYG+ +++++RG+V+N+K+ILEI KEFGSFDRYIW+FVN P N+Y Sbjct: 282 ANITEKQMMSISAEYGI-EISRVRGVVDNSKRILEIKKEFGSFDRYIWTFVNNKPFSNQY 340 Query: 205 KYARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEEC 38 K+ +PVKTSK+ETISKD+V+R FRFVGPT++HSFMQA GLTNDHL+ C RH C Sbjct: 341 KFGHKIPVKTSKSETISKDMVRRGFRFVGPTMVHSFMQAVGLTNDHLITCHRHLPC 396 >ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343248|gb|EEE78698.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 420 Score = 331 bits (848), Expect = 5e-88 Identities = 190/423 (44%), Positives = 251/423 (59%), Gaps = 14/423 (3%) Frame = -3 Query: 1237 NSSNTMSGSKVVEIGDRPVLQPR---VPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPP 1067 N S + + + +I RPVLQP+ VP N+ P + P +P Sbjct: 9 NQSTSNIATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAGPPVPLMQPA 68 Query: 1066 KN---TNVGHPKTLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRAND 896 N T P L+ PP SP K S R P +KR N+ Sbjct: 69 CNAAGTKTRLPSALS-PPISPKLK---------------------SPRPP---AVKRGNE 103 Query: 895 TTSLNTSADAKAPLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS---VRSPGSL 725 LNTSA+ K + + S K + G + + SS V +PGS+ Sbjct: 104 PGGLNTSAE-KVLTPRSTTKVTTSTVKKSKKSSTAGVPHSVDTFAMKYSSSLLVEAPGSI 162 Query: 724 AAARKAELAEMCAQRKMKISHYGRKQGTPKAQKVVE-DLPNDEGV----KRCGFITSQSD 560 AAAR+ ++A M QRKM+I+HYGR + K+V + P + KRC FIT SD Sbjct: 163 
AAARREQVAVMQEQRKMRIAHYGRTKSAKYQGKIVPANSPATSTITREEKRCSFITPNSD 222 Query: 559 PAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVAT 380 P +VAYHD+EWGVPVHDDK+LFELL L GAQVG W ++L KREAFREAF GFD EIVA Sbjct: 223 PVYVAYHDEEWGVPVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAK 282 Query: 379 FNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKY 200 F EKKIA+++AEYGL D++++RG+V+N+ +ILE+ +EFGSFD Y+W +VN+ PI +YK Sbjct: 283 FTEKKIASISAEYGL-DISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKS 341 Query: 199 ARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCHH 20 + +PVKTSK+ETISKD+VKR FRFVGPT++HSFMQA GL+NDHL+ C RH +CIAL Sbjct: 342 CQKIPVKTSKSETISKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQ 401 Query: 19 VAK 11 + + Sbjct: 402 LPR 404 >ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343247|gb|EEE78699.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 417 Score = 331 bits (848), Expect = 5e-88 Identities = 190/423 (44%), Positives = 251/423 (59%), Gaps = 14/423 (3%) Frame = -3 Query: 1237 NSSNTMSGSKVVEIGDRPVLQPR---VPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPP 1067 N S + + + +I RPVLQP+ VP N+ P + P +P Sbjct: 9 NQSTSNIATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAGPPVPLMQPA 68 Query: 1066 KN---TNVGHPKTLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRAND 896 N T P L+ PP SP K S R P +KR N+ Sbjct: 69 CNAAGTKTRLPSALS-PPISPKLK---------------------SPRPP---AVKRGNE 103 Query: 895 TTSLNTSADAKAPLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS---VRSPGSL 725 LNTSA+ K + + S K + G + + SS V +PGS+ Sbjct: 104 PGGLNTSAE-KVLTPRSTTKVTTSTVKKSKKSSTAGVPHSVDTFAMKYSSSLLVEAPGSI 162 Query: 724 AAARKAELAEMCAQRKMKISHYGRKQGTPKAQKVVE-DLPNDEGV----KRCGFITSQSD 560 AAAR+ ++A M QRKM+I+HYGR + K+V + P + KRC FIT SD Sbjct: 163 AAARREQVAVMQEQRKMRIAHYGRTKSAKYQGKIVPANSPATSTITREEKRCSFITPNSD 222 Query: 559 PAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVAT 380 P +VAYHD+EWGVPVHDDK+LFELL L GAQVG W ++L KREAFREAF GFD EIVA Sbjct: 223 
PVYVAYHDEEWGVPVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAK 282 Query: 379 FNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKY 200 F EKKIA+++AEYGL D++++RG+V+N+ +ILE+ +EFGSFD Y+W +VN+ PI +YK Sbjct: 283 FTEKKIASISAEYGL-DISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKS 341 Query: 199 ARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCHH 20 + +PVKTSK+ETISKD+VKR FRFVGPT++HSFMQA GL+NDHL+ C RH +CIAL Sbjct: 342 CQKIPVKTSKSETISKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQ 401 Query: 19 VAK 11 + + Sbjct: 402 LPR 404 >ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max] Length = 400 Score = 327 bits (839), Expect = 6e-87 Identities = 192/418 (45%), Positives = 253/418 (60%), Gaps = 16/418 (3%) Frame = -3 Query: 1234 SSNTMSGSKVVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPK 1064 ++ T + V I RPVLQP RVP N+ K P KSL SPPS P Sbjct: 19 ATTTTAKPSVARINGRPVLQPTCNRVPNLERRNSIK----KVAPAKSL----SPPSPPLP 70 Query: 1063 NTNVGHPKTLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSL 884 + KT TPP SP SK S R+P KR ND L Sbjct: 71 S------KTSLTPPVSPKSK---------------------SPRLP---ATKRGNDNNGL 100 Query: 883 NTSADAKAPLESKASSPKCPVI----SVSVKGAANGRKKPRKSMSFDGSSVR-SPGSLAA 719 N+S + + SS K P + S S K + S+S+ S + SPGS+AA Sbjct: 101 NSSYEK---IVIPRSSIKTPTLERKKSKSFKEGSCVSASIEASLSYSSSLITDSPGSIAA 157 Query: 718 ARKAELAEMCAQRKMKISHYGRKQGTPKAQKVVEDLPND--------EGVKRCGFITSQS 563 R+ ++A AQRKMKI+HYGR + K ++VV P++ E KRC FIT+ S Sbjct: 158 VRREQMALQQAQRKMKIAHYGRSKSA-KFERVVPLDPSNTSLASKPTEEEKRCSFITANS 216 Query: 562 DPAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVA 383 DP ++AYHD+EWGVPVHDDKMLFELLVL GAQVG +W + L KR FR AF FD E VA Sbjct: 217 DPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVA 276 Query: 382 TFNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYK 203 +K++ ++++EYG+ D++++RG+V+NA QILEI K+FGSFD+YIW FVN+ P+ +YK Sbjct: 277 NLTDKQMMSISSEYGI-DISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPLSTQYK 335 Query: 202 YARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 
29 + +PVKTSK+E+ISKD+V+R FR+VGPT++HSFMQA+GLTNDHL+ C RH +C L Sbjct: 336 FGHKIPVKTSKSESISKDMVRRGFRYVGPTVVHSFMQASGLTNDHLITCHRHLQCTLL 393 >gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica] Length = 426 Score = 325 bits (832), Expect = 4e-86 Identities = 188/425 (44%), Positives = 248/425 (58%), Gaps = 32/425 (7%) Frame = -3 Query: 1207 VVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKT 1037 V I RPVLQP RVP + N+ T P T S S P+ +N + Sbjct: 18 VARINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLPTSSASSTSPRISNKA--SS 75 Query: 1036 LATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSAD---- 869 L TPP SP SK S R P +KR ND LN+S++ Sbjct: 76 LLTPPISPKSK---------------------SPRPP---AIKRGNDPNGLNSSSEKVVT 111 Query: 868 ----AKAPLESKASSPKCPVISVSVKGAA--------------NGRKKPRKSMSFDGSSV 743 +A + + S SV V GA+ + S+S+ S + Sbjct: 112 PGGTTRAKILERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLI 171 Query: 742 -RSPGSLAAARKAELAEMCAQRKMKISHYGRKQGTPKAQKVVEDLPND------EGVKRC 584 +PGS+AA R+ ++A AQRKM+I+HYGR + + V D + E KRC Sbjct: 172 TEAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIEAKGAEEEKRC 231 Query: 583 GFITSQSDPAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGG 404 FIT+ SDP +VAYHD+EWGVPVHDDKMLFELLVL GAQVG +W +IL KR+ FR AF Sbjct: 232 SFITANSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSD 291 Query: 403 FDPEIVATFNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYS 224 FD EIVA F +K++ ++ +EYG+ D++++RG+V+N+ +ILEI KEFGSFD+YIW FVN Sbjct: 292 FDAEIVANFTDKQMVSIGSEYGI-DISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQK 350 Query: 223 PIVNKYKYARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHE 44 PI +YK +PVKTSK+E+ISKD+V+R FRFVGPT++HSFMQA+GLTNDHL+ C RH Sbjct: 351 PISPQYKLGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCHRHL 410 Query: 43 ECIAL 29 +C L Sbjct: 411 QCTLL 415 >ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max] Length = 400 Score = 324 bits (830), Expect = 7e-86 Identities = 191/409 (46%), Positives = 246/409 (60%), Gaps = 16/409 (3%) Frame = -3 Query: 1207 
VVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKT 1037 V I RPVLQP RVP N+ K P KSL SPPS P + KT Sbjct: 23 VARINGRPVLQPTCNRVPNLERRNSIK----KVAPPKSL----SPPSPPLPS------KT 68 Query: 1036 LATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKAP 857 TPP SP K S R+P KR ND LN+S + Sbjct: 69 SLTPPVSPKLK---------------------SPRLP---ATKRGNDNNGLNSSYEK--- 101 Query: 856 LESKASSPKCPVI----SVSVKGAANGRKKPRKSMSFDGSSVR-SPGSLAAARKAELAEM 692 + SS K P + S S K + S+S+ S + SPGS+AA R+ ++A Sbjct: 102 IVIPRSSTKTPTLERKKSKSFKEGSCVSASIEASLSYSSSLITDSPGSIAAVRREQMALQ 161 Query: 691 CAQRKMKISHYGRKQGTPKAQKVVEDLPND--------EGVKRCGFITSQSDPAHVAYHD 536 AQRKMKI+HYGR + K ++VV P++ E KRC FIT SDP ++AYHD Sbjct: 162 QAQRKMKIAHYGRSKSA-KFERVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHD 220 Query: 535 QEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAA 356 +EWGVPVHDDKMLFELLVL GAQVG +W + L KR FR AF FD E VA +K++ + Sbjct: 221 EEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMS 280 Query: 355 VNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKT 176 +++EYG+ D++++RG+V+NA QILEI K+FGSFD+YIW FVN+ PI +YK+ +PVKT Sbjct: 281 ISSEYGI-DISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKT 339 Query: 175 SKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 SK+E+ISKD+V+R FRFVGPT++HSFMQ +GLTNDHL+ C RH +C L Sbjct: 340 SKSESISKDMVRRGFRFVGPTVVHSFMQTSGLTNDHLITCHRHLQCTLL 388 >ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246304 [Solanum lycopersicum] Length = 395 Score = 323 bits (827), Expect = 1e-85 Identities = 183/410 (44%), Positives = 244/410 (59%), Gaps = 13/410 (3%) Frame = -3 Query: 1219 SGSKVVEIGDRPVLQPRVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPK 1040 S + +I RPVLQP N L + KK+ T + P +T V Sbjct: 11 SAQTLSQINGRPVLQPH------SNIVPLYERRNSLKKTTHT--AAPVTANGSTKVKMSS 62 Query: 1039 TLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKA 860 + TPP SP K S R+P +KR N+ S+ A+ Sbjct: 63 S-TTPPVSPKMK---------------------SPRLP---AIKRGNNIDPNGLSSSAEK 97 Query: 859 
PLESKASSPKCPVISVSVKGAANGRKKP----RKSMSFDGSS-VRSPGSLAAARKAELAE 695 + K ++ K P++ K ++ G P S+ + S V +PGS+AAAR+ ++A Sbjct: 98 IVTPKGTANKAPILLKKPKKSSGGLASPSSVENSSLKYSSSLIVEAPGSIAAARREQVAI 157 Query: 694 MCAQRKMKISHYGRKQGTPKAQKVVE--------DLPNDEGVKRCGFITSQSDPAHVAYH 539 QRKMKI+HYGR + KV +PN KRC FIT SDP ++AYH Sbjct: 158 AQVQRKMKIAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREDKRCSFITPNSDPLYIAYH 217 Query: 538 DQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIA 359 D+EWGVPVHDD +LFELLVL GAQVG +W ++L KR+ FR+AF GFDPEIV+ +NEKKI Sbjct: 218 DEEWGVPVHDDNLLFELLVLTGAQVGSDWTSVLKKRQEFRDAFSGFDPEIVSKYNEKKIT 277 Query: 358 AVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVK 179 + + EYG+ ++++IRG V+N+ +ILEI K FGSFD+Y+W FVN PI +YK +PVK Sbjct: 278 STSVEYGI-ELSQIRGAVDNSTRILEIKKTFGSFDKYLWGFVNNKPIATQYKACNKIPVK 336 Query: 178 TSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 TSK+ETISKD+VKR FR+VGPT++HSFMQAAGLTNDHL+ C RH C+AL Sbjct: 337 TSKSETISKDMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLPCVAL 386 >ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614205 [Citrus sinensis] Length = 375 Score = 321 bits (822), Expect = 6e-85 Identities = 182/405 (44%), Positives = 241/405 (59%), Gaps = 11/405 (2%) Frame = -3 Query: 1201 EIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNV-GHPKTL 1034 +I RPVLQP +VP + N+ TG+ PK + T N N K+L Sbjct: 13 QINGRPVLQPTSNQVPSLEKRNSIKKTGS---PKSPITTD---------NVNSKSFTKSL 60 Query: 1033 ATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKAPL 854 +PP SP K P +KR ND LNTSA+ Sbjct: 61 LSPPVSPKLKS------------------------PRPAAVKRGNDPNVLNTSAEKIMTP 96 Query: 853 ESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS-VRSPGSLAAARKAELAEMCAQRK 677 + AS K P K + +D S V +PGS+AAAR+ +A M QRK Sbjct: 97 KKLASLVKKP-------------KNVGVAPCYDSSLIVEAPGSIAAARREHVAIMQEQRK 143 Query: 676 MKISHYGRKQGT------PKAQKVVEDLPNDEGVKRCGFITSQSDPAHVAYHDQEWGVPV 515 ++I+HYGR + P ND KRC FIT SDP +VAYHD+EWGVPV Sbjct: 144 LRIAHYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPIYVAYHDEEWGVPV 203 Query: 514 
HDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAAVNAEYGL 335 HDDK+LFELLVL AQVG +W ++L KR+AFREAF GFD E+VA F EKK+ +++A Y + Sbjct: 204 HDDKLLFELLVLTAAQVGSDWTSVLKKRQAFREAFSGFDAEVVAKFTEKKMTSLSANYAI 263 Query: 334 MDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKTSKAETIS 155 D++++RG+V+N+ +ILE+ K+FGSFD+Y+W FVN+ PI +Y+ ++ +PVKTSK+E IS Sbjct: 264 -DLSQVRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKPINTQYRSSQKIPVKTSKSEAIS 322 Query: 154 KDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCHH 20 KD+VK+ FRFVGPT++HSFMQAAGLTNDHL+ C RH +C AL H Sbjct: 323 KDMVKKGFRFVGPTVIHSFMQAAGLTNDHLITCTRHLQCTALASH 367 >ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594852 [Solanum tuberosum] Length = 395 Score = 319 bits (818), Expect = 2e-84 Identities = 178/404 (44%), Positives = 239/404 (59%), Gaps = 13/404 (3%) Frame = -3 Query: 1201 EIGDRPVLQPRVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKTLATPP 1022 +I RPVLQP N L + KK+ T S + + TPP Sbjct: 17 QINGRPVLQPH------SNIVPLYERRNSLKKTTNTAASVTANGSTKVKTS---SSTTPP 67 Query: 1021 TSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKAPLESKA 842 SP K S R+P +KR N+ S+ A+ + K Sbjct: 68 VSPKMK---------------------SPRLP---AIKRGNNIDPNGLSSSAEKIVTPKG 103 Query: 841 SSPKCPVISVSVKGAANGRKKP----RKSMSFDGSS-VRSPGSLAAARKAELAEMCAQRK 677 ++ K P++ K ++ G P S+ + S V +PGS+AAAR+ ++A QRK Sbjct: 104 TANKAPILLKKPKKSSGGLASPPYVENSSLKYSSSLIVEAPGSIAAARREQVAIAQVQRK 163 Query: 676 MKISHYGRKQGTPKAQKVVE--------DLPNDEGVKRCGFITSQSDPAHVAYHDQEWGV 521 MKI+HYGR + KV +PN KRC FIT SDP ++AYHD+EWGV Sbjct: 164 MKIAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREEKRCSFITPNSDPLYIAYHDEEWGV 223 Query: 520 PVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAAVNAEY 341 PVHDD +LFELLVL GAQVG +W ++L KR+ FR+AF GFDPEIV+ +NEKKI + + EY Sbjct: 224 PVHDDNLLFELLVLTGAQVGSDWTSVLRKRQEFRDAFSGFDPEIVSKYNEKKITSTSVEY 283 Query: 340 GLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKTSKAET 161 G+ ++++IRG V+N+ +ILEI K F SF++Y+W FVN PI +YK +PVKTSK+ET Sbjct: 284 GI-ELSQIRGAVDNSTRILEIKKTFDSFNKYLWGFVNNKPIATQYKACNKIPVKTSKSET 342 Query: 160 
ISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 ISKD+VKR FR+VGPT++HSFMQAAGLTNDHL+ C RH +C+AL Sbjct: 343 ISKDMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLQCMAL 386 >ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera] gi|297738175|emb|CBI27376.3| unnamed protein product [Vitis vinifera] Length = 398 Score = 317 bits (811), Expect = 1e-83 Identities = 179/405 (44%), Positives = 240/405 (59%), Gaps = 11/405 (2%) Frame = -3 Query: 1201 EIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKTLA 1031 +I RP LQP R+P ++ K PK T+ P S PP T + KT Sbjct: 20 QINGRPALQPTCNRIPSLERHHSFK----KISPKSP--TSPLPASPPPPTTIINTTKTKP 73 Query: 1030 --TPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKAP 857 TPP SP+ K P + LKR ND LN+S + Sbjct: 74 SLTPPASPNLKS------------------------PRQPALKRGNDPNGLNSSLEKVLT 109 Query: 856 LESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS---VRSPGSLAAARKAELAEMCA 686 S P K + G + S + SS V +PGS+AAAR+ ++A M Sbjct: 110 PRGTTKSSSSPK---KTKKCSAGLAPSSDTSSLNYSSSLIVEAPGSIAAARREQMAIMQV 166 Query: 685 QRKMKISHYGRKQGTPKAQKVVEDLP---NDEGVKRCGFITSQSDPAHVAYHDQEWGVPV 515 QRKM+I+HYGR + +K+ P KRC FIT SDP++V YHD+EWGVPV Sbjct: 167 QRKMRIAHYGRTKSAKYEEKIGPVDPLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPV 226 Query: 514 HDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAAVNAEYGL 335 HDDK LFELLV+ GAQVG +W +L KR+ +R+A G+D EIV F+EKKI +++A YG+ Sbjct: 227 HDDKRLFELLVMTGAQVGSDWTTVLKKRQEYRDALSGYDAEIVGKFSEKKITSISAYYGI 286 Query: 334 MDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKTSKAETIS 155 D++++RG+V+N+ +ILEI +EFGSF +YIW FVN+ PI +YK +PVKTSK+E+IS Sbjct: 287 -DLSQVRGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQYKSCHKIPVKTSKSESIS 345 Query: 154 KDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCHH 20 KD+V+R FR VGPT+++SFMQAAGLTNDHL++C RH +CIAL H Sbjct: 346 KDMVRRGFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSSH 390 >emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera] Length = 398 Score = 316 bits (810), Expect = 1e-83 Identities = 179/405 (44%), Positives = 240/405 (59%), Gaps = 11/405 (2%) Frame = -3 
Query: 1201 EIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKTLA 1031 +I RP LQP R+P ++ K PK T+ P S PP T + KT Sbjct: 20 QINGRPALQPTCNRIPSLERHHSFK----KISPKSP--TSPLPASLPPPTTIINTTKTKP 73 Query: 1030 --TPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKAP 857 TPP SP+ K P + LKR ND LN+S + Sbjct: 74 SLTPPASPNLKS------------------------PRQPALKRGNDPNGLNSSLEKVLT 109 Query: 856 LESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS---VRSPGSLAAARKAELAEMCA 686 S P K + G + S + SS V +PGS+AAAR+ ++A M Sbjct: 110 PRGTTKSSSSPK---KTKKCSAGLAPSSDTSSLNYSSSFIVEAPGSIAAARREQMAIMQV 166 Query: 685 QRKMKISHYGRKQGTPKAQKVVEDLP---NDEGVKRCGFITSQSDPAHVAYHDQEWGVPV 515 QRKM+I+HYGR + +K+ P KRC FIT SDP++V YHD+EWGVPV Sbjct: 167 QRKMRIAHYGRTKSAKYEEKISPVDPLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPV 226 Query: 514 HDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAAVNAEYGL 335 HDDK LFELLV+ GAQVG +W +L KR+ +R+AF G+D EIV F+EKKI +++A YG+ Sbjct: 227 HDDKRLFELLVMTGAQVGSDWTTVLKKRQEYRDAFSGYDAEIVGKFSEKKITSISAYYGI 286 Query: 334 MDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKTSKAETIS 155 D++++RG+V+N+ +ILEI +EFGSF +YIW FVN+ PI + K +PVKTSK+E+IS Sbjct: 287 -DLSQVRGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQXKSCHKIPVKTSKSESIS 345 Query: 154 KDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCHH 20 KD+V+R FR VGPT+++SFMQAAGLTNDHL++C RH +CIAL H Sbjct: 346 KDMVRRGFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSSH 390 >ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] gi|557551187|gb|ESR61816.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] Length = 375 Score = 316 bits (809), Expect = 2e-83 Identities = 183/407 (44%), Positives = 241/407 (59%), Gaps = 13/407 (3%) Frame = -3 Query: 1201 EIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHP---K 1040 +I RPVLQP +VP + + S+K T SP S P NV K Sbjct: 13 QINGRPVLQPTSNQVPSLEK-------------RSSIKKTGSPKS-PITTNNVNSKSFTK 58 Query: 1039 TLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKA 860 +L +PP SP K P +KR ND LNTSA+ Sbjct: 59 
SLLSPPVSPKLKS------------------------PRPAAVKRGNDPNVLNTSAE--- 91 Query: 859 PLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS-VRSPGSLAAARKAELAEMCAQ 683 K +PK ++ VK N P +D S V +PGS+AAAR+ +A M Q Sbjct: 92 ----KIMTPK--KLASFVKKPKNAEVAP----CYDSSLIVEAPGSIAAARREHVAIMQEQ 141 Query: 682 RKMKISHYGRKQGT------PKAQKVVEDLPNDEGVKRCGFITSQSDPAHVAYHDQEWGV 521 RK++I+HYGR + P ND KRC FIT SDP +VAYHD+EWGV Sbjct: 142 RKLRIAHYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPKYVAYHDEEWGV 201 Query: 520 PVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAAVNAEY 341 PVHDDK+LFELLVL AQVG +W ++L KR AFREAF GFD E+VA F EKKI +++A Y Sbjct: 202 PVHDDKLLFELLVLTAAQVGSDWTSVLKKRRAFREAFSGFDAEVVAKFTEKKITSLSANY 261 Query: 340 GLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKTSKAET 161 + D++++RG+V+N+ +ILE+ K+FGSFD+Y+W FVN+ I +Y+ ++ +P KTSK+E Sbjct: 262 AI-DLSQVRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKTINTQYRSSQKIPAKTSKSEA 320 Query: 160 ISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCHH 20 ISKD+VK+ FRFVGPT++HSFMQAAGL+NDHL+ C RH +C AL H Sbjct: 321 ISKDMVKKGFRFVGPTVIHSFMQAAGLSNDHLITCTRHLQCTALASH 367 >ref|XP_002299382.2| hypothetical protein POPTR_0001s12320g [Populus trichocarpa] gi|550347083|gb|EEE84187.2| hypothetical protein POPTR_0001s12320g [Populus trichocarpa] Length = 373 Score = 315 bits (808), Expect = 2e-83 Identities = 193/434 (44%), Positives = 245/434 (56%), Gaps = 19/434 (4%) Frame = -3 Query: 1273 CSFLLTLLGFFTNSSNTMSGSKVVEIGDRPVLQPR---VPKANEGNNATLTGTKTMPKKS 1103 CSF L S+N ++ + + +I RPVLQP+ VP N+ K P KS Sbjct: 2 CSFKFRL----HRSANNIA-TPIAKINGRPVLQPKSNQVPSLERRNSLK----KNSPAKS 52 Query: 1102 LKTTLSPPSRPP----------KNTNVGHPKTLATPPTSPSSKIHNAXXXXXXXXXXXXX 953 T P + PP T P L+ PP SP K Sbjct: 53 --PTQEPAAVPPIPLMQPAGNAAGTKTKQPSGLS-PPISPKLKS---------------- 93 Query: 952 XKQLSKRIPERIVLKRANDTTSLNTSADAK-APLESKASSPKCPVISVSVKGAANGRKKP 776 P +KR ND LNTSA+ PLES Sbjct: 94 --------PVLPAVKRGNDPDGLNTSAEKVWTPLES------------------------ 121 Query: 775 RKSMSFDGSSVRSPGSLAAARKAELAEMCAQRKMKISHYGRKQGTPKAQKVVE-DLPNDE 599 PGS+AAAR+ +A M QRKM+I+HYGR + KVV 
D P Sbjct: 122 -------------PGSIAAARREHVAVMQEQRKMRIAHYGRTKSAKYHGKVVPADSPATN 168 Query: 598 GV----KRCGFITSQSDPAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKR 431 + KRC FIT SDP +VAYHD+EWGVPVHDDKMLFELLVL GAQVG +W ++L KR Sbjct: 169 TISREEKRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLTGAQVGSDWTSVLKKR 228 Query: 430 EAFREAFGGFDPEIVATFNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDR 251 EAFREAF GFD E+VA F EKKIA+++AEYG+ D +++RG+V+N+ +I+E+ +EFGSFD+ Sbjct: 229 EAFREAFSGFDAEVVAKFTEKKIASISAEYGI-DTSQVRGVVDNSNKIMEVKREFGSFDK 287 Query: 250 YIWSFVNYSPIVNKYKYARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTND 71 Y+W +VN+ PI +YK + +PVKTSK+ETISKD+VKR FRFVGPT++HSFMQA GL ND Sbjct: 288 YLWEYVNHKPIFTQYKSCQKIPVKTSKSETISKDMVKRGFRFVGPTVIHSFMQAGGLRND 347 Query: 70 HLVNCFRHEECIAL 29 HL+ C RH + AL Sbjct: 348 HLITCPRHLQYTAL 361 >gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 409 Score = 315 bits (807), Expect = 3e-83 Identities = 180/405 (44%), Positives = 240/405 (59%), Gaps = 12/405 (2%) Frame = -3 Query: 1207 VVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKK-SLKTTLSPPSRPPKNTNVGHPK 1040 V I RPVLQP RVP + N+ + P SL +TL P+ N G K Sbjct: 18 VARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSLASTL--PATSATVGNGGRAK 75 Query: 1039 TLATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADAKA 860 TPP SP SK P +KR +D +LNTS++ Sbjct: 76 ASLTPPISPKSKS------------------------PRPAAIKRGSDPNALNTSSEKVM 111 Query: 859 PLESKASSPKCPVISVSVKGAANGRKK-PRKSMSFDGSS-VRSPGSLAAARKAELAEMCA 686 + + + +G NG S+S+ S V +PGS+AA R+ ++A A Sbjct: 112 TPRNITKTLERKKSKSFKEGMGNGLSSWIEPSLSYSSSLIVEAPGSIAAVRREQMALQQA 171 Query: 685 QRKMKISHYGRKQGTPKAQKVVEDLPN------DEGVKRCGFITSQSDPAHVAYHDQEWG 524 QRKMKI+HYGR + KVV + DE KRC FIT SDP +VAYHD+EWG Sbjct: 172 QRKMKIAHYGRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSDPVYVAYHDEEWG 231 Query: 523 VPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIAAVNAE 344 VPVHDD MLFELLVL GAQVG +W +IL KR+ FR+AF GFD E VA F +K++ +++E Sbjct: 232 VPVHDDSMLFELLVLSGAQVGSDWISILKKRQDFRDAFSGFDAETVAKFTDKEMTTISSE 291 Query: 343 
YGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVKTSKAE 164 YG+ D++++ G+V+N+ +ILE+ +FGSFD+YIW FVN+ I +YK+ +PVKTSK+E Sbjct: 292 YGI-DISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVNHKAISTQYKFGHKIPVKTSKSE 350 Query: 163 TISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 +ISKD+++R FR VGPT++HSFMQAAGLTNDHL+ C RH C L Sbjct: 351 SISKDMLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHRHLPCTLL 395 >ref|XP_002515807.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223545076|gb|EEF46588.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 404 Score = 315 bits (807), Expect = 3e-83 Identities = 181/410 (44%), Positives = 236/410 (57%), Gaps = 17/410 (4%) Frame = -3 Query: 1207 VVEIGDRPVLQPRVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKTLAT 1028 V I RPVLQP N T K K + PP PP + T Sbjct: 24 VARINGRPVLQPTC-------NHVPTPDKRSSFKKMSLNCPPPPPPPSSPPSSTFDDKTT 76 Query: 1027 PPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSAD------- 869 P SP SK S R P +KR +D LN S++ Sbjct: 77 TPVSPKSK---------------------SPRPP---AIKRGSDPNGLNASSEKVVIPSN 112 Query: 868 -AKAPLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSS-VRSPGSLAAARKAELAE 695 ++ P + S S ++ S+ + S V SPGS+AA R+ ++A Sbjct: 113 NSRTPRLERKKSKSFKETSAGTGLFSSSASSAEASLHYSSSLIVESPGSIAAVRREQMAF 172 Query: 694 MCAQRKMKISHYGRKQGTP-KAQKV--VEDLPN-----DEGVKRCGFITSQSDPAHVAYH 539 AQRKM+I+HYGR + +A V ++ L N DE KRC FIT SDP +VAYH Sbjct: 173 QHAQRKMRIAHYGRSKSAKFEANNVFPIDSLTNISTKSDEEEKRCNFITPNSDPIYVAYH 232 Query: 538 DQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEKKIA 359 D+EWGVPV DDK+LFELLVL GAQVG +W +IL KR+ FR+AF GFD EIVA F EK + Sbjct: 233 DEEWGVPVRDDKLLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEIVADFTEKHMI 292 Query: 358 AVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNVPVK 179 +++ EYG+ D+ ++RG+V+N+ ++LEI KEFGSF +YIW+FVN PI +YK+ +PVK Sbjct: 293 SISTEYGI-DINRVRGVVDNSNRVLEIKKEFGSFSKYIWAFVNNKPISTQYKFGHKIPVK 351 Query: 178 TSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 TSK+E+ISKD+V+R FRFVGPT++HSFMQAAGLTNDHL+ C RH C L Sbjct: 352 
TSKSESISKDMVRRGFRFVGPTMVHSFMQAAGLTNDHLITCHRHLPCTLL 401 >ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago truncatula] gi|355484972|gb|AES66175.1| DNA-3-methyladenine glycosylase [Medicago truncatula] Length = 390 Score = 315 bits (807), Expect = 3e-83 Identities = 180/413 (43%), Positives = 240/413 (58%), Gaps = 20/413 (4%) Frame = -3 Query: 1207 VVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKT 1037 V I RPVLQP VP N+ KKS +LSP P K + Sbjct: 21 VARINGRPVLQPTCNHVPNLERRNSI---------KKSTPKSLSPLPLPNKTNT-----S 66 Query: 1036 LATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSADA--- 866 TPP SP K S + +KR ND LN S + Sbjct: 67 SLTPPISPKPK---------------------SPTSTRPLAIKRGNDNNGLNLSCEKISI 105 Query: 865 -----KAPLESKASSPKCPVISVSVKGAANGRKKPRKSMSFDGSSVR-SPGSLAAARKAE 704 K P + S S ++ A S+S+ S + SPGS+AA R+ + Sbjct: 106 PKNIMKTPTLERKKSKSFKEGSFGIEAA---------SLSYSSSLITDSPGSIAAVRREQ 156 Query: 703 LAEMCAQRKMKISHYGRKQGTPKAQKV--------VEDLPNDEGVKRCGFITSQSDPAHV 548 +A AQRKMKI+HYGR + K ++V ++ ++ KRC FIT+ SDP ++ Sbjct: 157 VALQQAQRKMKIAHYGRSKSA-KFERVFPIDPSSALDSKTTNQEEKRCSFITTNSDPIYI 215 Query: 547 AYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVATFNEK 368 AYHD+EWGVPVHDDKMLFELL+L GAQVG +W + L KR FR AF FD EIVA +K Sbjct: 216 AYHDEEWGVPVHDDKMLFELLILSGAQVGSDWTSTLKKRLDFRAAFSEFDAEIVANLTDK 275 Query: 367 KIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYKYARNV 188 ++ ++++EYG+ D++K+RG+V+NA QIL++ K FGSFD+YIW FVN+ PI N+YK+ + Sbjct: 276 QMMSISSEYGI-DISKVRGVVDNANQILQVRKGFGSFDKYIWGFVNHKPISNQYKFGHKI 334 Query: 187 PVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIAL 29 PVKTSK+E+ISKD++KR FR+VGPT++HSFMQAAGLTNDHL+ C RH +C L Sbjct: 335 PVKTSKSESISKDMIKRGFRYVGPTVVHSFMQAAGLTNDHLITCHRHLQCTLL 387 >ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298985 [Fragaria vesca subsp. 
vesca] Length = 410 Score = 315 bits (806), Expect = 4e-83 Identities = 184/421 (43%), Positives = 244/421 (57%), Gaps = 25/421 (5%) Frame = -3 Query: 1207 VVEIGDRPVLQP---RVPKANEGNNATLTGTKTMPKKSLKTTLSPPSRPPKNTNVGHPKT 1037 V I RPVLQP RVP + N+ T P L S + P +T + Sbjct: 18 VSRINGRPVLQPTCNRVPSLDRRNSLKKLSTPPPPPLPLSNASSTSTSPRISTKA----S 73 Query: 1036 LATPPTSPSSKIHNAXXXXXXXXXXXXXXKQLSKRIPERIVLKRANDTTSLNTSAD---- 869 L TPP SP SK S R P + + ND LN+S++ Sbjct: 74 LTTPPVSPKSK---------------------SPRPPA--IKRSGNDPNGLNSSSEKVVT 110 Query: 868 ------AKAPLESKASSPKCPVISVSVKGAANGRKKPR-------KSMSFDGSSV-RSPG 731 AK K+ S K V GA N R S+S+ S + +PG Sbjct: 111 PGGTTRAKVLERKKSKSFKLGV------GADNAHDHGRLSSASIEASLSYSSSLITEAPG 164 Query: 730 SLAAARKAELAEMCAQRKMKISHYGRKQGTPKAQ----KVVEDLPNDEGVKRCGFITSQS 563 ++AA R+ ++A AQRKM+I+HYGR + +E +E KRC FIT+ S Sbjct: 165 TIAAGRREQMALQHAQRKMRIAHYGRSNSANFERVAPIDTMEAKGGEEDHKRCSFITANS 224 Query: 562 DPAHVAYHDQEWGVPVHDDKMLFELLVLMGAQVGMNWPAILNKREAFREAFGGFDPEIVA 383 DP +VAYHDQEWGVPVHDDKMLFELLVL GAQVG +W +IL KR+ FR+AF GFD E VA Sbjct: 225 DPIYVAYHDQEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEAVA 284 Query: 382 TFNEKKIAAVNAEYGLMDVAKIRGMVENAKQILEIVKEFGSFDRYIWSFVNYSPIVNKYK 203 +K++ ++ +EYG+ D++++RG+V+N+ +ILE+ +EFGSF +YIW FVN+ PI +YK Sbjct: 285 NLTDKQMISICSEYGI-DISRVRGVVDNSNRILEVKREFGSFHKYIWGFVNHKPISPQYK 343 Query: 202 YARNVPVKTSKAETISKDLVKRNFRFVGPTIMHSFMQAAGLTNDHLVNCFRHEECIALCH 23 +PVKTSK+E+ISKD+V+R FRFVGPT++HSFMQA+GLTNDHL C RH +C L Sbjct: 344 QGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLTTCHRHLQCTLLAA 403 Query: 22 H 20 H Sbjct: 404 H 404