BLASTX nr result
ID: Mentha26_contig00014559
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00014559
         (864 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU28970.1| hypothetical protein MIMGU_mgv1a005017mg [Mimulus...   140   6e-31
ref|XP_006394952.1| hypothetical protein EUTSA_v10003560mg [Eutr...   132   2e-28
gb|EXC02372.1| hypothetical protein L484_006666 [Morus notabilis]     129   2e-27
gb|EXB95528.1| hypothetical protein L484_002543 [Morus notabilis]     129   2e-27
ref|NP_198117.2| PWWP domain-containing protein [Arabidopsis tha...   127   7e-27
dbj|BAH30603.1| hypothetical protein [Arabidopsis thaliana]           127   7e-27
ref|XP_007020229.1| Tudor/PWWP/MBT superfamily protein, putative...   126   1e-26
ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792...   125   2e-26
ref|XP_006344642.1| PREDICTED: uncharacterized protein LOC102596...   124   6e-26
ref|XP_006382497.1| PWWP domain-containing family protein [Popul...   124   6e-26
ref|XP_006286941.1| hypothetical protein CARUB_v10000086mg, part...   124   6e-26
ref|XP_006472071.1| PREDICTED: uncharacterized protein LOC102607...   123   8e-26
ref|XP_007208117.1| hypothetical protein PRUPE_ppa000687mg [Prun...   120   5e-25
ref|XP_003535335.1| PREDICTED: uncharacterized protein LOC100812...   120   9e-25
ref|XP_006433394.1| hypothetical protein CICLE_v10000070mg [Citr...   119   1e-24
ref|XP_004230219.1| PREDICTED: uncharacterized protein LOC101248...   119   1e-24
ref|XP_002882413.1| PWWP domain-containing protein [Arabidopsis ...   119   2e-24
ref|XP_003626260.1| DNA (cytosine-5)-methyltransferase 3A [Medic...   119   2e-24
ref|XP_006408078.1| hypothetical protein EUTSA_v10019994mg [Eutr...
117 7e-24 ref|XP_003553721.1| PREDICTED: uncharacterized protein LOC100805... 116 1e-23 >gb|EYU28970.1| hypothetical protein MIMGU_mgv1a005017mg [Mimulus guttatus] Length = 500 Score = 140 bits (353), Expect = 6e-31 Identities = 105/257 (40%), Positives = 128/257 (49%), Gaps = 16/257 (6%) Frame = +3 Query: 3 GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 GASLPSGA+LRA+FARFGPLDH++TRV+W+T Sbjct: 276 GASLPSGAELRARFARFGPLDHASTRVYWKT----------------------------- 306 Query: 183 RNVRAYIREKLVEG-----EPVKVQKE----AAPPNEQRTAARIPXXXXXXXXXXXXXXX 335 NV+ Y+R+ E PVKVQKE PP + T P Sbjct: 307 -NVKCYLRDSEAEAAESEPPPVKVQKEDVDQRTPPAKIATQQLPPPPPGQQSLQLKSCLK 365 Query: 336 XXXXXTNEEVGNGNGRGT--RVKFVLGGEGA---EQVSSYPEV-GSSYTHSSSTDVTTAT 497 EE GNGNGRG RVKF+LGG+ + EQVSS+ E SS T S+S TT + Sbjct: 366 KPIG--GEEGGNGNGRGNTPRVKFILGGDKSSKTEQVSSFAEADSSSSTTSASASYTTHS 423 Query: 498 KIMPTK-FGQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKNDISQQLLNLLTRCRD 674 + +K + + T P K+ L NDISQ+LLNLLTRC D Sbjct: 424 MDLSSKNLPKFNAPTLPNTTTSHRQIHPHHHQFQKIPINIPLATNDISQELLNLLTRCSD 483 Query: 675 VVNNLTGALGHVPYHSL 725 VVNNLTGALG+VPYHSL Sbjct: 484 VVNNLTGALGYVPYHSL 500 >ref|XP_006394952.1| hypothetical protein EUTSA_v10003560mg [Eutrema salsugineum] gi|557091591|gb|ESQ32238.1| hypothetical protein EUTSA_v10003560mg [Eutrema salsugineum] Length = 1082 Score = 132 bits (332), Expect = 2e-28 Identities = 97/269 (36%), Positives = 128/269 (47%), Gaps = 28/269 (10%) Frame = +3 Query: 3 GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A +A G++ LFGN Sbjct: 815 GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNTLFGN 874 Query: 183 RNVRAYIRE----KLVEGEPVKVQKEAAPPN---EQRTAARIPXXXXXXXXXXXXXXXXX 341 NVR ++R+ K EP +++ P + +Q P Sbjct: 875 VNVRYFLRDVDTPKPEPHEPENAKEDDEPQSQWLDQAPPLHQPILPPPNINLKSCLKKPV 934 Query: 342 XXXTNEEV-GNGNGRGTRVKFVLGGE----GAEQVSSYPEVGSSYTHSSSTDVTTATKIM 506 +N GNGN RVKF+LGGE A S+ G S + SSS+ T AT+ Sbjct: 935 
DEQSNSSSNGNGNRGTARVKFMLGGEQNSIKATTEPSFSNRGPSASSSSSSS-TIATEFF 993 Query: 507 PTKFGQ------------DSIVTTPQLQKXXXXXXXXXXXXVKMGGVE----QLPKNDIS 638 KF + PQ K V + DIS Sbjct: 994 SKKFQNVVHHHQQPSTLPPILPLPPQYSKPIKTVDHVEPPMPPFRNVRGPSPVVGAGDIS 1053 Query: 639 QQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 Q+LNLL++C DVV N+TG LG+VPYH L Sbjct: 1054 HQMLNLLSKCNDVVANVTGLLGYVPYHPL 1082 >gb|EXC02372.1| hypothetical protein L484_006666 [Morus notabilis] Length = 1198 Score = 129 bits (323), Expect = 2e-27 Identities = 97/285 (34%), Positives = 136/285 (47%), Gaps = 46/285 (16%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+FARFGP+D S RVFW++ TCR+V+ +K+DA+AA FA +++LFG Sbjct: 914 SLPSPAELKARFARFGPMDQSGLRVFWKSSTCRVVFLHKSDAQAACRFAAANNSLFGTPG 973 Query: 189 VRAYIREKLV-----------EGEPVKVQ----KEAA---PPNEQRTAARIPXXXXXXXX 314 +R Y RE +G+ + + K+ A P+ T +P Sbjct: 974 MRCYTREVEAPATEAPESGKGQGDDISLDTPRTKDTAVLQRPSSITTKQPLPQAAVQLKS 1033 Query: 315 XXXXXXXXXXXXTNEEV--GNGNGRGT-RVKFVLGGE-----------------GAEQVS 434 V G+GN RGT RVKF+L GE + + Sbjct: 1034 CLKKAATDESGQQGTGVGGGSGNSRGTPRVKFMLDGEDSSSRVEQSLMAGNRNNSSNNSA 1093 Query: 435 SYPEVGS-SYTHSSSTDVTTATKIMPTKFGQ-----DSIVTTPQLQKXXXXXXXXXXXXV 596 S+P+ G+ S ++SSST + A F + I+ TPQL K Sbjct: 1094 SFPDGGAPSSSNSSSTSTSVAMDFSVRNFQKVISQSPPILPTPQLAKTPLNNLHHLEMIA 1153 Query: 597 KMGGVEQL--PKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 + P DISQQ+L+LLTRC DVV N+T LG+VPYH L Sbjct: 1154 PPRNTTSIAPPTVDISQQMLSLLTRCNDVVTNVTSLLGYVPYHPL 1198 >gb|EXB95528.1| hypothetical protein L484_002543 [Morus notabilis] Length = 1196 Score = 129 bits (323), Expect = 2e-27 Identities = 97/285 (34%), Positives = 136/285 (47%), Gaps = 46/285 (16%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+FARFGP+D S RVFW++ TCR+V+ +K+DA+AA FA +++LFG Sbjct: 912 SLPSPAELKARFARFGPMDQSGLRVFWKSSTCRVVFLHKSDAQAACRFAAANNSLFGTPG 971 Query: 189 VRAYIREKLV-----------EGEPVKVQ----KEAA---PPNEQRTAARIPXXXXXXXX 314 +R Y RE +G+ + + K+ A P+ T +P Sbjct: 
972 MRCYTREVEAPATEAPESGKGQGDDISLDTTRTKDTAVLQRPSSITTKQPLPQAAVQLKS 1031 Query: 315 XXXXXXXXXXXXTNEEV--GNGNGRGT-RVKFVLGGE-----------------GAEQVS 434 V G+GN RGT RVKF+L GE + + Sbjct: 1032 CLKKAATDESGQQGTGVGGGSGNSRGTPRVKFMLDGEDSSSRVEQSLMAGNRNNSSNNSA 1091 Query: 435 SYPEVGS-SYTHSSSTDVTTATKIMPTKFGQ-----DSIVTTPQLQKXXXXXXXXXXXXV 596 S+P+ G+ S ++SSST + A F + I+ TPQL K Sbjct: 1092 SFPDGGAPSSSNSSSTSTSVAMDFSVRNFQKVISQSPPILPTPQLAKTPLNNLHHLEMIA 1151 Query: 597 KMGGVEQL--PKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 + P DISQQ+L+LLTRC DVV N+T LG+VPYH L Sbjct: 1152 PPRNTTSIAPPTVDISQQMLSLLTRCNDVVTNVTSLLGYVPYHPL 1196 >ref|NP_198117.2| PWWP domain-containing protein [Arabidopsis thaliana] gi|332006328|gb|AED93711.1| PWWP domain-containing protein [Arabidopsis thaliana] Length = 1072 Score = 127 bits (318), Expect = 7e-27 Identities = 88/280 (31%), Positives = 129/280 (46%), Gaps = 39/280 (13%) Frame = +3 Query: 3 GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A +A G++ LFGN Sbjct: 798 GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNTLFGN 857 Query: 183 RNVRAYIRE-------------KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXX 323 NV+ ++R+ + EP + APP Q T +P Sbjct: 858 VNVKYFLRDVDAPKAEPREPENTKEDDEPQSQWLDQAPPLHQPT---LPPPNVNLKSCLK 914 Query: 324 XXXXXXXXXTNEEVGNGNGRGTRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTATKI 503 +N GNGN RVKF+LGGE ++ + T + ++ ++++ Sbjct: 915 KPVDDPSSSSNN--GNGNRAAVRVKFMLGGEENSSKANTEPPQVTMTLNRNSGPSSSSSS 972 Query: 504 MPTKFGQDSI-----------VTTPQLQKXXXXXXXXXXXXVK---------------MG 605 +P +F T P + +K G Sbjct: 973 VPMEFVSKKFQNVVHHQQLPPSTLPPILPLPPQYTKPQQLPIKPVDHVEPPMPPSRNFRG 1032 Query: 606 GVEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 + + DIS Q+LNLL++C +VV N+TG LG+VPYH L Sbjct: 1033 PIPAVSAGDISHQMLNLLSKCNEVVANVTGLLGYVPYHPL 1072 >dbj|BAH30603.1| hypothetical protein [Arabidopsis thaliana] Length = 1063 Score = 127 bits (318), Expect = 7e-27 Identities = 88/280 (31%), Positives = 129/280 (46%), Gaps = 39/280 (13%) Frame = +3 Query: 3 
GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A +A G++ LFGN Sbjct: 789 GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNTLFGN 848 Query: 183 RNVRAYIRE-------------KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXX 323 NV+ ++R+ + EP + APP Q T +P Sbjct: 849 VNVKYFLRDVDAPKAEPREPENTKEDDEPQSQWLDQAPPLHQPT---LPPPNVNLKSCLK 905 Query: 324 XXXXXXXXXTNEEVGNGNGRGTRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTATKI 503 +N GNGN RVKF+LGGE ++ + T + ++ ++++ Sbjct: 906 KPVDDPSSSSNN--GNGNRAAVRVKFMLGGEENSSKANTEPPQVTMTLNRNSGPSSSSSS 963 Query: 504 MPTKFGQDSI-----------VTTPQLQKXXXXXXXXXXXXVK---------------MG 605 +P +F T P + +K G Sbjct: 964 VPMEFVSKKFQNVVHHQQLPPSTLPPILPLPPQYTKPQQLPIKPVDHVEPPMPPSRNFRG 1023 Query: 606 GVEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 + + DIS Q+LNLL++C +VV N+TG LG+VPYH L Sbjct: 1024 PIPAVSAGDISHQMLNLLSKCNEVVANVTGLLGYVPYHPL 1063 >ref|XP_007020229.1| Tudor/PWWP/MBT superfamily protein, putative [Theobroma cacao] gi|508725557|gb|EOY17454.1| Tudor/PWWP/MBT superfamily protein, putative [Theobroma cacao] Length = 1133 Score = 126 bits (316), Expect = 1e-26 Identities = 95/278 (34%), Positives = 132/278 (47%), Gaps = 39/278 (14%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+F RFG LD SA RVFW++ TCR+V+++K DA+AA +A G+++LFGN N Sbjct: 860 SLPSVAELKARFGRFGSLDQSAIRVFWKSSTCRVVFRHKLDAQAAYRYANGNNSLFGNVN 919 Query: 189 VRAYIREKLVEGEPVKV--------------QKEAAPPNEQRTAARIPXXXXXXXXXXXX 326 VR ++R VE V+V P +R+A +P Sbjct: 920 VRYHVRS--VEAPAVEVPDFDKARGDDTASETMRVKDPAVERSAPILP--HQPLPQSTVL 975 Query: 327 XXXXXXXXTNEEVGNGN----GRGT-RVKFVLGGEGAEQVSSYP-------EVGSSYTHS 470 T +E G G+ GRGT RVKF+LGGE + +S+ Sbjct: 976 LKSCLKKPTADEAGQGSGGNGGRGTARVKFMLGGEETSRGEQLMVGNRNNFNNNASFADG 1035 Query: 471 SSTDVT------TATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMG---GVEQLP 623 +T + K++P I PQ K + + +P Sbjct: 1036 GATSIAMEFNSKNFQKVVPPSSSPSPIHPIPQYGKAPANNLHHTEVAPRNSHNLNTQTIP 1095 Query: 624 KN----DISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 
725 DISQQ+L+LLTRC DVV N+TG LG+VPYH L Sbjct: 1096 PGTASIDISQQMLSLLTRCNDVVTNVTGLLGYVPYHPL 1133 >ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792700 [Glycine max] Length = 1056 Score = 125 bits (314), Expect = 2e-26 Identities = 89/264 (33%), Positives = 122/264 (46%), Gaps = 25/264 (9%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+FARFGP+D S RVFW+T TCR+V+ +K DA++A +AL + +LFGN Sbjct: 797 SLPSVAELKARFARFGPIDQSGLRVFWKTSTCRVVFLHKVDAQSAYKYALANQSLFGNVG 856 Query: 189 VRAYIREKLVEGEPVKVQKEAAPPNEQRTAARIP-----------XXXXXXXXXXXXXXX 335 ++ ++RE V +A N + R+ Sbjct: 857 MKCFLREFGDASSEVSEAAKARGDNGANESPRVKDPAVVQRQSSVSAQQPLPQPMIQLKS 916 Query: 336 XXXXXTNEEVGNGNGRG------TRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTAT 497 T +E+G G G G RVKF+LGGE SS E +S V+ A Sbjct: 917 ILKKSTGDELGQGTGNGGSSKGTPRVKFMLGGE----ESSRGEQLMVGNRNSFNSVSFAD 972 Query: 498 KIMPTKFGQDSIVTTP-QLQK-------XXXXXXXXXXXXVKMGGVEQLPKNDISQQLLN 653 P+ D P Q +K + P DISQQ+++ Sbjct: 973 GGAPSSVAMDFNTPPPTQFKKIPQQNLHNSEMAPRNTPNFINATASATAPTVDISQQMIS 1032 Query: 654 LLTRCRDVVNNLTGALGHVPYHSL 725 LLTRC D+VNNLT LG+VPYH L Sbjct: 1033 LLTRCNDIVNNLTSLLGYVPYHPL 1056 >ref|XP_006344642.1| PREDICTED: uncharacterized protein LOC102596406 [Solanum tuberosum] Length = 1016 Score = 124 bits (310), Expect = 6e-26 Identities = 88/262 (33%), Positives = 127/262 (48%), Gaps = 23/262 (8%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 +LPS ++L+A+FARFG LDHSATRVFW++ TCRLVYQY+ A A FA S NLFGN N Sbjct: 763 ALPSISELKARFARFGALDHSATRVFWKSSTCRLVYQYRDHAVQAFRFASASTNLFGNTN 822 Query: 189 VRAYIREKLVEGEPVKVQKE----AAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXXTN 356 VR IRE E + + K + P ++ +R Sbjct: 823 VRCSIREVAAEAQDTEATKNDSGGTSAPKDRAADSR--SSGKPGQLKSCLKKPPGEEGPT 880 Query: 357 EEVGNGNGRGT-RVKFVLGGEG------AEQVSSYPEV--------GSSYTHSSSTDVTT 491 + GNG+ RGT RVKF+LG E EQ++ V GS+ + S+ + T+ Sbjct: 881 IDGGNGSNRGTPRVKFMLGAEDNINRDRGEQMNDIKNVNNTSSIADGSASSSSNINNYTS 940 Query: 492 
ATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKM----GGVEQLPKNDISQQLLNLL 659 + ++P + TT ++ P+ + SQ +L+LL Sbjct: 941 QSSMLP-------LPTTAHYANAPNDIHFALQAPHRIAPNYNNQVSAPEANFSQHMLSLL 993 Query: 660 TRCRDVVNNLTGALGHVPYHSL 725 T+C D+V +LT LG+ PY+ L Sbjct: 994 TKCSDIVTDLTNLLGYFPYNGL 1015 >ref|XP_006382497.1| PWWP domain-containing family protein [Populus trichocarpa] gi|550337858|gb|ERP60294.1| PWWP domain-containing family protein [Populus trichocarpa] Length = 1021 Score = 124 bits (310), Expect = 6e-26 Identities = 92/279 (32%), Positives = 130/279 (46%), Gaps = 40/279 (14%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS AQL+AKFARFG +D SA RVFW++ CR+V++ K DA+AAL +A+G+ +LFGN N Sbjct: 745 SLPSAAQLKAKFARFGSIDQSAIRVFWKSSQCRVVFRRKLDAQAALRYAVGNKSLFGNVN 804 Query: 189 VRAYIRE-----------KLVEGEPVKVQ-KEAAPPNEQRTAARI---PXXXXXXXXXXX 323 VR +RE + G+ V +A P +R AA P Sbjct: 805 VRYNLREVGAPASEAPESEKSRGDDTSVDATQAKDPLVERQAAAFAHQPPSQSAGQLKSI 864 Query: 324 XXXXXXXXXTNEEVGNGNGRGTRVKFVLGGEGAEQ-----VSSYPEVGSSYTHSSSTDVT 488 GNG GRGTRVKF+LGGE + V + ++ + + T Sbjct: 865 LKKPNGEEAVPVPGGNG-GRGTRVKFILGGEETNRGEQMMVGNRNNFNNNASFADGGAPT 923 Query: 489 TATKI--------------------MPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGG 608 T + +PT+F D + + + G Sbjct: 924 TTVAMDFSSKNFQKVIPPSPLPILPLPTQFANDPLNNSHHHTEVPPRNLHNFIIPPPSSG 983 Query: 609 VEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 P DISQQ+L+LLT C D+V +++G LG++PYH L Sbjct: 984 -PSTPSMDISQQMLSLLTTCNDLVTSVSGLLGYMPYHPL 1021 >ref|XP_006286941.1| hypothetical protein CARUB_v10000086mg, partial [Capsella rubella] gi|482555647|gb|EOA19839.1| hypothetical protein CARUB_v10000086mg, partial [Capsella rubella] Length = 1109 Score = 124 bits (310), Expect = 6e-26 Identities = 94/281 (33%), Positives = 131/281 (46%), Gaps = 40/281 (14%) Frame = +3 Query: 3 GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A +A G+++LFGN Sbjct: 834 GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNSLFGN 893 
Query: 183 RNVRAYIRE----KLVEGEPVKVQ---------KEAAPPNEQRTAARIPXXXXXXXXXXX 323 NV+ ++R+ K EP + ++ APP Q +P Sbjct: 894 VNVKYFLRDVDAPKAEPREPENTKEDDETQSQWQDQAPPLHQPI---LPPPNVNLKSCLK 950 Query: 324 XXXXXXXXXTNEEVGNGNGRGTRVKFVLGG-EGAEQVSSYP----EVGSSYTHSSSTDVT 488 +N GN N RVKF+LGG E + + S+ P S+ SS+ + Sbjct: 951 KPVDDPSSSSNN--GNSNRGSVRVKFMLGGEENSSKTSTEPPQPVTTASNRNSGSSSSSS 1008 Query: 489 TATKIMPTKF------GQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLP--------- 623 A + + KF Q T P + + VE P Sbjct: 1009 VAMEFVSKKFQNVVHHQQLPPSTLPPILPLPPQYSKPHVPIKPVDHVEPPPMPPIRNNFR 1068 Query: 624 -------KNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 DIS Q+LNLL++C +VV N+TG LG+VPYH L Sbjct: 1069 GQSQAVSSGDISHQMLNLLSKCNEVVANVTGLLGYVPYHPL 1109 >ref|XP_006472071.1| PREDICTED: uncharacterized protein LOC102607628 isoform X2 [Citrus sinensis] Length = 1143 Score = 123 bits (309), Expect = 8e-26 Identities = 91/268 (33%), Positives = 130/268 (48%), Gaps = 29/268 (10%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+F RFG LD SA RVFW+++TCR+V+++KADA+AA +A G++ LFGN Sbjct: 897 SLPSAAELKARFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVK 956 Query: 189 VRAYIREKLVEGEPV----KVQKEAA----PPNEQRTAARIPXXXXXXXXXXXXXXXXXX 344 VR +RE V KV+ + + P + A R Sbjct: 957 VRYILREVEAPAPEVPDFDKVRGDESSYETPRIKDPVADRPTPAPGLLPQPNIQLKSCLK 1016 Query: 345 XXTNEE-----VGNGNGRGTRVKFVLGGEGA---EQV-------------SSYPEVGSSY 461 ++E +GNG RVKF+LGGE + EQ+ +S+ + G++ Sbjct: 1017 KPASDEGGQVAMGNGTKGTARVKFMLGGEESNRGEQMMVGNRNNFNNNNNASFADGGAAS 1076 Query: 462 THSSSTDVTTATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKNDISQ 641 + S + D T + + TP + P DISQ Sbjct: 1077 SSSVAMDFNTPPR-------NSHNLNTPTISPPPPP--------------PSAPSIDISQ 1115 Query: 642 QLLNLLTRCRDVVNNLTGALGHVPYHSL 725 Q+L+LLTRC DVV N+TG LG+VPYH L Sbjct: 1116 QMLSLLTRCNDVVTNVTGLLGYVPYHPL 1143 >ref|XP_007208117.1| hypothetical protein PRUPE_ppa000687mg [Prunus persica] gi|462403759|gb|EMJ09316.1| hypothetical protein PRUPE_ppa000687mg [Prunus persica] Length = 1036 Score = 120 bits 
(302), Expect = 5e-25 Identities = 100/308 (32%), Positives = 134/308 (43%), Gaps = 69/308 (22%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+AKFARFGP+D S RVFW++ TCR+V+ +K+DA+AAL FA + +LFGN + Sbjct: 735 SLPSPAELKAKFARFGPMDQSGLRVFWKSATCRVVFLHKSDAQAALKFATANSSLFGNFS 794 Query: 189 VRAYIREKLVEGEPVKVQKEAAPPNE--------------------QRTAARIPXXXXXX 308 VR IRE V G V + P+E Q+ A +P Sbjct: 795 VRCQIRE--VGGPEVPDSGKGDNPSEIPRVKDSSVGQSPAMASALRQQQQALLPQSAVQL 852 Query: 309 XXXXXXXXXXXXXXTNEEVGNGNGRGT-RVKFVLGGEGAEQ------------------- 428 GNGN +GT RVKF+LGGE + + Sbjct: 853 KSILKKSSGEEQGGQVTTGGNGNSKGTARVKFMLGGEESSRSTDQFMMAGNRNNFNNNNS 912 Query: 429 VSSYPEVGSSYTHSSSTD-------------------VTTATKIMPTKFGQDSIVTTPQL 551 +S+ + G + HSSST +++ I+P G PQ Sbjct: 913 SASFAD-GGAAAHSSSTSSIAMDFNTRNFQKVNAPPTFSSSPPILPPPLGPP---LPPQY 968 Query: 552 QKXXXXXXXXXXXXVKMGGVEQ----------LPKNDISQQLLNLLTRCRDVVNNLTGAL 701 K + Q P DIS Q+L+LLTRC DVV N+ G L Sbjct: 969 AKPPHNKFPQHHSEMAPPRNSQHLNTPTAFPSAPSVDISHQMLSLLTRCNDVVANVKGLL 1028 Query: 702 GHVPYHSL 725 G+VPYH L Sbjct: 1029 GYVPYHPL 1036 >ref|XP_003535335.1| PREDICTED: uncharacterized protein LOC100812480 [Glycine max] Length = 1045 Score = 120 bits (300), Expect = 9e-25 Identities = 85/274 (31%), Positives = 127/274 (46%), Gaps = 35/274 (12%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+FARFGP+D S RVFW+T TCR+V+ +K DA++A +AL + +LFGN Sbjct: 772 SLPSVAELKARFARFGPIDQSGLRVFWKTSTCRVVFLHKVDAQSAYKYALANQSLFGNVG 831 Query: 189 VRAYIREKLVEGEPVKVQKEAAPPN--------------EQRTAARIPXXXXXXXXXXXX 326 V+ ++RE V +A N +++++A+ P Sbjct: 832 VKCFLREFGDASSEVSEAAKARGDNGANESPRVKNPAVVQRQSSAQQPLPQPTIQLKSIL 891 Query: 327 XXXXXXXXTNEEVGNGNGRGT-RVKFVLGGE----GAEQVSSYPEVGSSYTHSSSTDVTT 491 G+ +GT RVKF+LGGE G + + +S + + ++ Sbjct: 892 KKSTADEPGQLTGNGGSSKGTPRVKFMLGGEESSRGEQLMVGNRNSFNSVSFADGGAPSS 951 Query: 492 ATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKN-------------- 629 +K Q +I P + E P+N Sbjct: 952 
VAMDFNSKNVQKAISQPPLPNTPPPPTQFTKILQHNLHNSEMAPRNTPNFINATTSATAP 1011 Query: 630 --DISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 DISQQ+++LLTRC D+VNNLT LG+VPYH L Sbjct: 1012 TVDISQQMISLLTRCNDIVNNLTSLLGYVPYHPL 1045 >ref|XP_006433394.1| hypothetical protein CICLE_v10000070mg [Citrus clementina] gi|568836067|ref|XP_006472070.1| PREDICTED: uncharacterized protein LOC102607628 isoform X1 [Citrus sinensis] gi|557535516|gb|ESR46634.1| hypothetical protein CICLE_v10000070mg [Citrus clementina] Length = 1179 Score = 119 bits (299), Expect = 1e-24 Identities = 93/283 (32%), Positives = 132/283 (46%), Gaps = 44/283 (15%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+F RFG LD SA RVFW+++TCR+V+++KADA+AA +A G++ LFGN Sbjct: 897 SLPSAAELKARFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVK 956 Query: 189 VRAYIREKLVEGEPV----KVQKEAA----PPNEQRTAARIPXXXXXXXXXXXXXXXXXX 344 VR +RE V KV+ + + P + A R Sbjct: 957 VRYILREVEAPAPEVPDFDKVRGDESSYETPRIKDPVADRPTPAPGLLPQPNIQLKSCLK 1016 Query: 345 XXTNEE-----VGNGNGRGTRVKFVLGGEGA---EQV-------------SSYPEVGSSY 461 ++E +GNG RVKF+LGGE + EQ+ +S+ + G++ Sbjct: 1017 KPASDEGGQVAMGNGTKGTARVKFMLGGEESNRGEQMMVGNRNNFNNNNNASFADGGAAS 1076 Query: 462 THSSSTDVTTAT--KIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGG--------- 608 + S + D + K++P I Q K Sbjct: 1077 SSSVAMDFNSKNFQKVVPPFSSSLGIPPHSQYAKPLYNNTHLTDVAPPRNSHNLNTPTIS 1136 Query: 609 ----VEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 P DISQQ+L+LLTRC DVV N+TG LG+VPYH L Sbjct: 1137 PPPPPPSAPSIDISQQMLSLLTRCNDVVTNVTGLLGYVPYHPL 1179 >ref|XP_004230219.1| PREDICTED: uncharacterized protein LOC101248143 [Solanum lycopersicum] Length = 1011 Score = 119 bits (298), Expect = 1e-24 Identities = 88/258 (34%), Positives = 122/258 (47%), Gaps = 19/258 (7%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 +LPS ++L+A+FARFG LDHSATRVFW++ TCRLVY Y+ A A FA S NLFGN N Sbjct: 757 ALPSISELKARFARFGALDHSATRVFWKSSTCRLVYLYRNHAVQAFRFASASTNLFGNTN 816 Query: 189 
VRAYIREKLVEGEPVKVQKE----AAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXXTN 356 VR IRE E + + K + P + +R T Sbjct: 817 VRCSIREVTAEAQDPETTKNDSGGTSAPKDGSADSRSSGKAGQLKSCLKKPPGEEGPTT- 875 Query: 357 EEVGNGNGRGT-RVKFVLGGEG------AEQVSSYPEVGSSYTHSSSTDVTTATKIMPTK 515 + GNG+ RGT RVKF+LG E EQ++ V + T S + ++T + Sbjct: 876 -DGGNGSNRGTPRVKFMLGAEDNINRDRGEQMNDIKNVNN--TSSIADGSASSTSNINNY 932 Query: 516 FGQDSIVTTPQLQKXXXXXXXXXXXXVK--------MGGVEQLPKNDISQQLLNLLTRCR 671 Q S+++ P V + + SQQ+L LLT+C Sbjct: 933 TSQLSMLSLPSTAHYVNAPNDIHLALQAPLRNAPNYNNQVSSATEANFSQQMLALLTKCS 992 Query: 672 DVVNNLTGALGHVPYHSL 725 D+V +LT LG+ PY+ L Sbjct: 993 DIVTDLTNLLGYFPYNGL 1010 >ref|XP_002882413.1| PWWP domain-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297328253|gb|EFH58672.1| PWWP domain-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 887 Score = 119 bits (297), Expect = 2e-24 Identities = 85/247 (34%), Positives = 117/247 (47%), Gaps = 6/247 (2%) Frame = +3 Query: 3 GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 G SLPS A L+A+F RFG LD SA RV W++ CR++++YK DA+ AL +A GS+++FGN Sbjct: 644 GTSLPSTALLKARFGRFGQLDQSAIRVSWKSSICRVIFKYKLDAQTALRYASGSNSIFGN 703 Query: 183 RNVRAYIREKLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXXTNEE 362 NV ++R+ +++ A +E Sbjct: 704 VNVTYFLRDMKASSASGDHEQKKAKADEPIIEPLNQWLEKAPPVHQPNIQLKSCLKKPGN 763 Query: 363 VGNGNGRGTRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTATKIMPTKFGQDSIVTT 542 GNGN R RVKF+LG E S +Y SSS+ V T+ S T Sbjct: 764 NGNGNHRTVRVKFMLGEETETPFSVSGRNNGNYASSSSSSVAMEYVSENTQNMVPS--TL 821 Query: 543 PQLQKXXXXXXXXXXXXVKMGGVEQLPKN------DISQQLLNLLTRCRDVVNNLTGALG 704 P + ++ VE P N DIS Q++ LLTRC DVV+N+T LG Sbjct: 822 PPILPLSSQDSEPKPVNNQVNHVEP-PINPSQLTVDISLQMMELLTRCNDVVSNVTCLLG 880 Query: 705 HVPYHSL 725 +VPYH L Sbjct: 881 YVPYHFL 887 >ref|XP_003626260.1| DNA (cytosine-5)-methyltransferase 3A [Medicago truncatula] gi|124360021|gb|ABN08037.1| PWWP [Medicago truncatula] gi|355501275|gb|AES82478.1| DNA (cytosine-5)-methyltransferase 3A [Medicago truncatula] Length = 1114 Score 
= 119 bits (297), Expect = 2e-24 Identities = 83/275 (30%), Positives = 120/275 (43%), Gaps = 36/275 (13%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+FARFGP+D S R+FW++ TCR+V+ YK+DA+AA F++G+ +LFG+ Sbjct: 840 SLPSVAELKARFARFGPMDQSGFRIFWKSSTCRVVFLYKSDAQAAYKFSVGNPSLFGSTG 899 Query: 189 VRAYIRE---------KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXXXXXXXX 341 V +RE K+ + + P + + Sbjct: 900 VTCLLREIGDSASEATKVRGDDGINETPRVKDPAVAQKQTSVSSQKPLLPQPTIQLKSIL 959 Query: 342 XXXTNEEVGNGNGRG------TRVKFVLGGEGAEQ----------------VSSYPEVGS 455 T +E G G G G +RVKF+L GE + + + P V Sbjct: 960 KKSTGDESGQGTGNGSSSKGNSRVKFMLVGEESNRGEPLMVGNKNNNANLSDAGAPSVAM 1019 Query: 456 SYTHSSSTDVTTATKIMPTKFGQDSIVTTPQ-----LQKXXXXXXXXXXXXVKMGGVEQL 620 + + VTT T P + TPQ + + Sbjct: 1020 DFISKNIQKVTTTTSQPPLLPTPPQFLKTPQHNLRNSELATTSRNNPNFNSTTTASSATV 1079 Query: 621 PKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 DIS Q++ LLTRC DVV +LTG LG+VPYH L Sbjct: 1080 TSVDISHQMITLLTRCSDVVTDLTGLLGYVPYHPL 1114 >ref|XP_006408078.1| hypothetical protein EUTSA_v10019994mg [Eutrema salsugineum] gi|557109224|gb|ESQ49531.1| hypothetical protein EUTSA_v10019994mg [Eutrema salsugineum] Length = 980 Score = 117 bits (292), Expect = 7e-24 Identities = 88/252 (34%), Positives = 117/252 (46%), Gaps = 11/252 (4%) Frame = +3 Query: 3 GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182 G SLPS AQL+A+F RFG LD SA RV W++ CR+V+ YK DA+ AL +A GS +LFGN Sbjct: 738 GTSLPSTAQLKARFGRFGQLDQSAIRVLWKSSICRVVFLYKLDAQTALRYASGSHSLFGN 797 Query: 183 RNVRAYIRE----KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXX 350 NV ++R+ EG K K P E + Sbjct: 798 VNVTYFLRDVEAPYASEGHEPKKAKTGEPILEPLSQWIDRAQPPVHQSFNIQPKSCLKKP 857 Query: 351 TNEEVGNGNGRGTRVKFVLGGE--GAEQVSSYPEVGSSYTHSSSTDVTTATKIMPTKFGQ 524 N GNGN RV+F+LGG+ G + S G+ + SSS + T Sbjct: 858 GNN--GNGNRGKARVRFMLGGKETGTPFLDSSKNNGNHSSSSSSVAIEFVT-------NN 908 Query: 525 DSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKN-----DISQQLLNLLTRCRDVVNNL 689 + P L K+ +E K DIS+Q++ LL C DVV+N+ Sbjct: 909 
TQNMVPPNLHPIPWKNSKRKPVNNKVDHLEPPLKPSECRVDISEQIMELLLWCNDVVSNV 968 Query: 690 TGALGHVPYHSL 725 TG LG+VPYH L Sbjct: 969 TGFLGYVPYHPL 980 >ref|XP_003553721.1| PREDICTED: uncharacterized protein LOC100805944 [Glycine max] Length = 1075 Score = 116 bits (290), Expect = 1e-23 Identities = 89/284 (31%), Positives = 130/284 (45%), Gaps = 45/284 (15%) Frame = +3 Query: 9 SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188 SLPS A+L+A+FARFGP+D S RVFW + TCR+V+ +K DA+AA +++GS +LFG+ Sbjct: 799 SLPSIAELKARFARFGPMDQSGFRVFWNSSTCRVVFLHKVDAQAAYKYSVGSQSLFGSVG 858 Query: 189 VRAYIREKLVEGEPVKVQKEAAPPNEQRTAARIP-------------XXXXXXXXXXXXX 329 VR ++RE G+ EAA A P Sbjct: 859 VRFFLRE---FGDSAPEVSEAAKARADDGANETPRVKDPAGIHRQTLVSSQQPLLQPIQL 915 Query: 330 XXXXXXXTNEEVGNGNGRG------TRVKFVLGGEGA---EQVSSYPEVGSSYTHSSSTD 482 T ++ G G G +RVKF+LGGE + +Q++S GS ++++ Sbjct: 916 KSCLKKSTGDDSGQVTGNGSSSKGNSRVKFMLGGEESSRGDQLTS----GSRNNFNNASF 971 Query: 483 VTTATKIMPTKFGQDSI--VT----TPQLQKXXXXXXXXXXXXVKMGGVEQLPKN----- 629 + T F ++ VT P + ++ + P+N Sbjct: 972 ADAGAPPVATDFNSKNVQKVTLQPPLPPILPLPTQFIKSPQHNLRNSELAMAPRNSPNFI 1031 Query: 630 ------------DISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725 DISQ ++NLLTRC D+V NLTG LG+VPYH L Sbjct: 1032 NTIASAATATTVDISQPMINLLTRCSDIVTNLTGLLGYVPYHPL 1075