BLASTX nr result

ID: Mentha26_contig00014559 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00014559
         (864 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU28970.1| hypothetical protein MIMGU_mgv1a005017mg [Mimulus...   140   6e-31
ref|XP_006394952.1| hypothetical protein EUTSA_v10003560mg [Eutr...   132   2e-28
gb|EXC02372.1| hypothetical protein L484_006666 [Morus notabilis]     129   2e-27
gb|EXB95528.1| hypothetical protein L484_002543 [Morus notabilis]     129   2e-27
ref|NP_198117.2| PWWP domain-containing protein [Arabidopsis tha...   127   7e-27
dbj|BAH30603.1| hypothetical protein [Arabidopsis thaliana]           127   7e-27
ref|XP_007020229.1| Tudor/PWWP/MBT superfamily protein, putative...   126   1e-26
ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792...   125   2e-26
ref|XP_006344642.1| PREDICTED: uncharacterized protein LOC102596...   124   6e-26
ref|XP_006382497.1| PWWP domain-containing family protein [Popul...   124   6e-26
ref|XP_006286941.1| hypothetical protein CARUB_v10000086mg, part...   124   6e-26
ref|XP_006472071.1| PREDICTED: uncharacterized protein LOC102607...   123   8e-26
ref|XP_007208117.1| hypothetical protein PRUPE_ppa000687mg [Prun...   120   5e-25
ref|XP_003535335.1| PREDICTED: uncharacterized protein LOC100812...   120   9e-25
ref|XP_006433394.1| hypothetical protein CICLE_v10000070mg [Citr...   119   1e-24
ref|XP_004230219.1| PREDICTED: uncharacterized protein LOC101248...   119   1e-24
ref|XP_002882413.1| PWWP domain-containing protein [Arabidopsis ...   119   2e-24
ref|XP_003626260.1| DNA (cytosine-5)-methyltransferase 3A [Medic...   119   2e-24
ref|XP_006408078.1| hypothetical protein EUTSA_v10019994mg [Eutr...   117   7e-24
ref|XP_003553721.1| PREDICTED: uncharacterized protein LOC100805...   116   1e-23

>gb|EYU28970.1| hypothetical protein MIMGU_mgv1a005017mg [Mimulus guttatus]
          Length = 500

 Score =  140 bits (353), Expect = 6e-31
 Identities = 105/257 (40%), Positives = 128/257 (49%), Gaps = 16/257 (6%)
 Frame = +3

Query: 3   GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
           GASLPSGA+LRA+FARFGPLDH++TRV+W+T                             
Sbjct: 276 GASLPSGAELRARFARFGPLDHASTRVYWKT----------------------------- 306

Query: 183 RNVRAYIREKLVEG-----EPVKVQKE----AAPPNEQRTAARIPXXXXXXXXXXXXXXX 335
            NV+ Y+R+   E       PVKVQKE      PP +  T    P               
Sbjct: 307 -NVKCYLRDSEAEAAESEPPPVKVQKEDVDQRTPPAKIATQQLPPPPPGQQSLQLKSCLK 365

Query: 336 XXXXXTNEEVGNGNGRGT--RVKFVLGGEGA---EQVSSYPEV-GSSYTHSSSTDVTTAT 497
                  EE GNGNGRG   RVKF+LGG+ +   EQVSS+ E   SS T S+S   TT +
Sbjct: 366 KPIG--GEEGGNGNGRGNTPRVKFILGGDKSSKTEQVSSFAEADSSSSTTSASASYTTHS 423

Query: 498 KIMPTK-FGQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKNDISQQLLNLLTRCRD 674
             + +K   + +  T P                 K+     L  NDISQ+LLNLLTRC D
Sbjct: 424 MDLSSKNLPKFNAPTLPNTTTSHRQIHPHHHQFQKIPINIPLATNDISQELLNLLTRCSD 483

Query: 675 VVNNLTGALGHVPYHSL 725
           VVNNLTGALG+VPYHSL
Sbjct: 484 VVNNLTGALGYVPYHSL 500


>ref|XP_006394952.1| hypothetical protein EUTSA_v10003560mg [Eutrema salsugineum]
            gi|557091591|gb|ESQ32238.1| hypothetical protein
            EUTSA_v10003560mg [Eutrema salsugineum]
          Length = 1082

 Score =  132 bits (332), Expect = 2e-28
 Identities = 97/269 (36%), Positives = 128/269 (47%), Gaps = 28/269 (10%)
 Frame = +3

Query: 3    GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
            G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A  +A G++ LFGN
Sbjct: 815  GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNTLFGN 874

Query: 183  RNVRAYIRE----KLVEGEPVKVQKEAAPPN---EQRTAARIPXXXXXXXXXXXXXXXXX 341
             NVR ++R+    K    EP   +++  P +   +Q      P                 
Sbjct: 875  VNVRYFLRDVDTPKPEPHEPENAKEDDEPQSQWLDQAPPLHQPILPPPNINLKSCLKKPV 934

Query: 342  XXXTNEEV-GNGNGRGTRVKFVLGGE----GAEQVSSYPEVGSSYTHSSSTDVTTATKIM 506
               +N    GNGN    RVKF+LGGE     A    S+   G S + SSS+  T AT+  
Sbjct: 935  DEQSNSSSNGNGNRGTARVKFMLGGEQNSIKATTEPSFSNRGPSASSSSSSS-TIATEFF 993

Query: 507  PTKFGQ------------DSIVTTPQLQKXXXXXXXXXXXXVKMGGVE----QLPKNDIS 638
              KF                +   PQ  K                 V      +   DIS
Sbjct: 994  SKKFQNVVHHHQQPSTLPPILPLPPQYSKPIKTVDHVEPPMPPFRNVRGPSPVVGAGDIS 1053

Query: 639  QQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
             Q+LNLL++C DVV N+TG LG+VPYH L
Sbjct: 1054 HQMLNLLSKCNDVVANVTGLLGYVPYHPL 1082


>gb|EXC02372.1| hypothetical protein L484_006666 [Morus notabilis]
          Length = 1198

 Score =  129 bits (323), Expect = 2e-27
 Identities = 97/285 (34%), Positives = 136/285 (47%), Gaps = 46/285 (16%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+FARFGP+D S  RVFW++ TCR+V+ +K+DA+AA  FA  +++LFG   
Sbjct: 914  SLPSPAELKARFARFGPMDQSGLRVFWKSSTCRVVFLHKSDAQAACRFAAANNSLFGTPG 973

Query: 189  VRAYIREKLV-----------EGEPVKVQ----KEAA---PPNEQRTAARIPXXXXXXXX 314
            +R Y RE              +G+ + +     K+ A    P+   T   +P        
Sbjct: 974  MRCYTREVEAPATEAPESGKGQGDDISLDTPRTKDTAVLQRPSSITTKQPLPQAAVQLKS 1033

Query: 315  XXXXXXXXXXXXTNEEV--GNGNGRGT-RVKFVLGGE-----------------GAEQVS 434
                            V  G+GN RGT RVKF+L GE                  +   +
Sbjct: 1034 CLKKAATDESGQQGTGVGGGSGNSRGTPRVKFMLDGEDSSSRVEQSLMAGNRNNSSNNSA 1093

Query: 435  SYPEVGS-SYTHSSSTDVTTATKIMPTKFGQ-----DSIVTTPQLQKXXXXXXXXXXXXV 596
            S+P+ G+ S ++SSST  + A       F +       I+ TPQL K             
Sbjct: 1094 SFPDGGAPSSSNSSSTSTSVAMDFSVRNFQKVISQSPPILPTPQLAKTPLNNLHHLEMIA 1153

Query: 597  KMGGVEQL--PKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                   +  P  DISQQ+L+LLTRC DVV N+T  LG+VPYH L
Sbjct: 1154 PPRNTTSIAPPTVDISQQMLSLLTRCNDVVTNVTSLLGYVPYHPL 1198


>gb|EXB95528.1| hypothetical protein L484_002543 [Morus notabilis]
          Length = 1196

 Score =  129 bits (323), Expect = 2e-27
 Identities = 97/285 (34%), Positives = 136/285 (47%), Gaps = 46/285 (16%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+FARFGP+D S  RVFW++ TCR+V+ +K+DA+AA  FA  +++LFG   
Sbjct: 912  SLPSPAELKARFARFGPMDQSGLRVFWKSSTCRVVFLHKSDAQAACRFAAANNSLFGTPG 971

Query: 189  VRAYIREKLV-----------EGEPVKVQ----KEAA---PPNEQRTAARIPXXXXXXXX 314
            +R Y RE              +G+ + +     K+ A    P+   T   +P        
Sbjct: 972  MRCYTREVEAPATEAPESGKGQGDDISLDTTRTKDTAVLQRPSSITTKQPLPQAAVQLKS 1031

Query: 315  XXXXXXXXXXXXTNEEV--GNGNGRGT-RVKFVLGGE-----------------GAEQVS 434
                            V  G+GN RGT RVKF+L GE                  +   +
Sbjct: 1032 CLKKAATDESGQQGTGVGGGSGNSRGTPRVKFMLDGEDSSSRVEQSLMAGNRNNSSNNSA 1091

Query: 435  SYPEVGS-SYTHSSSTDVTTATKIMPTKFGQ-----DSIVTTPQLQKXXXXXXXXXXXXV 596
            S+P+ G+ S ++SSST  + A       F +       I+ TPQL K             
Sbjct: 1092 SFPDGGAPSSSNSSSTSTSVAMDFSVRNFQKVISQSPPILPTPQLAKTPLNNLHHLEMIA 1151

Query: 597  KMGGVEQL--PKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                   +  P  DISQQ+L+LLTRC DVV N+T  LG+VPYH L
Sbjct: 1152 PPRNTTSIAPPTVDISQQMLSLLTRCNDVVTNVTSLLGYVPYHPL 1196


>ref|NP_198117.2| PWWP domain-containing protein [Arabidopsis thaliana]
            gi|332006328|gb|AED93711.1| PWWP domain-containing
            protein [Arabidopsis thaliana]
          Length = 1072

 Score =  127 bits (318), Expect = 7e-27
 Identities = 88/280 (31%), Positives = 129/280 (46%), Gaps = 39/280 (13%)
 Frame = +3

Query: 3    GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
            G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A  +A G++ LFGN
Sbjct: 798  GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNTLFGN 857

Query: 183  RNVRAYIRE-------------KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXX 323
             NV+ ++R+                + EP     + APP  Q T   +P           
Sbjct: 858  VNVKYFLRDVDAPKAEPREPENTKEDDEPQSQWLDQAPPLHQPT---LPPPNVNLKSCLK 914

Query: 324  XXXXXXXXXTNEEVGNGNGRGTRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTATKI 503
                     +N   GNGN    RVKF+LGGE     ++      + T + ++  ++++  
Sbjct: 915  KPVDDPSSSSNN--GNGNRAAVRVKFMLGGEENSSKANTEPPQVTMTLNRNSGPSSSSSS 972

Query: 504  MPTKFGQDSI-----------VTTPQLQKXXXXXXXXXXXXVK---------------MG 605
            +P +F                 T P +              +K                G
Sbjct: 973  VPMEFVSKKFQNVVHHQQLPPSTLPPILPLPPQYTKPQQLPIKPVDHVEPPMPPSRNFRG 1032

Query: 606  GVEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
             +  +   DIS Q+LNLL++C +VV N+TG LG+VPYH L
Sbjct: 1033 PIPAVSAGDISHQMLNLLSKCNEVVANVTGLLGYVPYHPL 1072


>dbj|BAH30603.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1063

 Score =  127 bits (318), Expect = 7e-27
 Identities = 88/280 (31%), Positives = 129/280 (46%), Gaps = 39/280 (13%)
 Frame = +3

Query: 3    GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
            G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A  +A G++ LFGN
Sbjct: 789  GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNTLFGN 848

Query: 183  RNVRAYIRE-------------KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXX 323
             NV+ ++R+                + EP     + APP  Q T   +P           
Sbjct: 849  VNVKYFLRDVDAPKAEPREPENTKEDDEPQSQWLDQAPPLHQPT---LPPPNVNLKSCLK 905

Query: 324  XXXXXXXXXTNEEVGNGNGRGTRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTATKI 503
                     +N   GNGN    RVKF+LGGE     ++      + T + ++  ++++  
Sbjct: 906  KPVDDPSSSSNN--GNGNRAAVRVKFMLGGEENSSKANTEPPQVTMTLNRNSGPSSSSSS 963

Query: 504  MPTKFGQDSI-----------VTTPQLQKXXXXXXXXXXXXVK---------------MG 605
            +P +F                 T P +              +K                G
Sbjct: 964  VPMEFVSKKFQNVVHHQQLPPSTLPPILPLPPQYTKPQQLPIKPVDHVEPPMPPSRNFRG 1023

Query: 606  GVEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
             +  +   DIS Q+LNLL++C +VV N+TG LG+VPYH L
Sbjct: 1024 PIPAVSAGDISHQMLNLLSKCNEVVANVTGLLGYVPYHPL 1063


>ref|XP_007020229.1| Tudor/PWWP/MBT superfamily protein, putative [Theobroma cacao]
            gi|508725557|gb|EOY17454.1| Tudor/PWWP/MBT superfamily
            protein, putative [Theobroma cacao]
          Length = 1133

 Score =  126 bits (316), Expect = 1e-26
 Identities = 95/278 (34%), Positives = 132/278 (47%), Gaps = 39/278 (14%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+F RFG LD SA RVFW++ TCR+V+++K DA+AA  +A G+++LFGN N
Sbjct: 860  SLPSVAELKARFGRFGSLDQSAIRVFWKSSTCRVVFRHKLDAQAAYRYANGNNSLFGNVN 919

Query: 189  VRAYIREKLVEGEPVKV--------------QKEAAPPNEQRTAARIPXXXXXXXXXXXX 326
            VR ++R   VE   V+V                    P  +R+A  +P            
Sbjct: 920  VRYHVRS--VEAPAVEVPDFDKARGDDTASETMRVKDPAVERSAPILP--HQPLPQSTVL 975

Query: 327  XXXXXXXXTNEEVGNGN----GRGT-RVKFVLGGEGAEQVSSYP-------EVGSSYTHS 470
                    T +E G G+    GRGT RVKF+LGGE   +               +S+   
Sbjct: 976  LKSCLKKPTADEAGQGSGGNGGRGTARVKFMLGGEETSRGEQLMVGNRNNFNNNASFADG 1035

Query: 471  SSTDVT------TATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMG---GVEQLP 623
             +T +          K++P       I   PQ  K             +       + +P
Sbjct: 1036 GATSIAMEFNSKNFQKVVPPSSSPSPIHPIPQYGKAPANNLHHTEVAPRNSHNLNTQTIP 1095

Query: 624  KN----DISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                  DISQQ+L+LLTRC DVV N+TG LG+VPYH L
Sbjct: 1096 PGTASIDISQQMLSLLTRCNDVVTNVTGLLGYVPYHPL 1133


>ref|XP_003555609.1| PREDICTED: uncharacterized protein LOC100792700 [Glycine max]
          Length = 1056

 Score =  125 bits (314), Expect = 2e-26
 Identities = 89/264 (33%), Positives = 122/264 (46%), Gaps = 25/264 (9%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+FARFGP+D S  RVFW+T TCR+V+ +K DA++A  +AL + +LFGN  
Sbjct: 797  SLPSVAELKARFARFGPIDQSGLRVFWKTSTCRVVFLHKVDAQSAYKYALANQSLFGNVG 856

Query: 189  VRAYIREKLVEGEPVKVQKEAAPPNEQRTAARIP-----------XXXXXXXXXXXXXXX 335
            ++ ++RE       V    +A   N    + R+                           
Sbjct: 857  MKCFLREFGDASSEVSEAAKARGDNGANESPRVKDPAVVQRQSSVSAQQPLPQPMIQLKS 916

Query: 336  XXXXXTNEEVGNGNGRG------TRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTAT 497
                 T +E+G G G G       RVKF+LGGE     SS  E       +S   V+ A 
Sbjct: 917  ILKKSTGDELGQGTGNGGSSKGTPRVKFMLGGE----ESSRGEQLMVGNRNSFNSVSFAD 972

Query: 498  KIMPTKFGQDSIVTTP-QLQK-------XXXXXXXXXXXXVKMGGVEQLPKNDISQQLLN 653
               P+    D     P Q +K                   +        P  DISQQ+++
Sbjct: 973  GGAPSSVAMDFNTPPPTQFKKIPQQNLHNSEMAPRNTPNFINATASATAPTVDISQQMIS 1032

Query: 654  LLTRCRDVVNNLTGALGHVPYHSL 725
            LLTRC D+VNNLT  LG+VPYH L
Sbjct: 1033 LLTRCNDIVNNLTSLLGYVPYHPL 1056


>ref|XP_006344642.1| PREDICTED: uncharacterized protein LOC102596406 [Solanum tuberosum]
          Length = 1016

 Score =  124 bits (310), Expect = 6e-26
 Identities = 88/262 (33%), Positives = 127/262 (48%), Gaps = 23/262 (8%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            +LPS ++L+A+FARFG LDHSATRVFW++ TCRLVYQY+  A  A  FA  S NLFGN N
Sbjct: 763  ALPSISELKARFARFGALDHSATRVFWKSSTCRLVYQYRDHAVQAFRFASASTNLFGNTN 822

Query: 189  VRAYIREKLVEGEPVKVQKE----AAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXXTN 356
            VR  IRE   E +  +  K      + P ++   +R                        
Sbjct: 823  VRCSIREVAAEAQDTEATKNDSGGTSAPKDRAADSR--SSGKPGQLKSCLKKPPGEEGPT 880

Query: 357  EEVGNGNGRGT-RVKFVLGGEG------AEQVSSYPEV--------GSSYTHSSSTDVTT 491
             + GNG+ RGT RVKF+LG E        EQ++    V        GS+ + S+  + T+
Sbjct: 881  IDGGNGSNRGTPRVKFMLGAEDNINRDRGEQMNDIKNVNNTSSIADGSASSSSNINNYTS 940

Query: 492  ATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKM----GGVEQLPKNDISQQLLNLL 659
             + ++P       + TT                  ++          P+ + SQ +L+LL
Sbjct: 941  QSSMLP-------LPTTAHYANAPNDIHFALQAPHRIAPNYNNQVSAPEANFSQHMLSLL 993

Query: 660  TRCRDVVNNLTGALGHVPYHSL 725
            T+C D+V +LT  LG+ PY+ L
Sbjct: 994  TKCSDIVTDLTNLLGYFPYNGL 1015


>ref|XP_006382497.1| PWWP domain-containing family protein [Populus trichocarpa]
            gi|550337858|gb|ERP60294.1| PWWP domain-containing family
            protein [Populus trichocarpa]
          Length = 1021

 Score =  124 bits (310), Expect = 6e-26
 Identities = 92/279 (32%), Positives = 130/279 (46%), Gaps = 40/279 (14%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS AQL+AKFARFG +D SA RVFW++  CR+V++ K DA+AAL +A+G+ +LFGN N
Sbjct: 745  SLPSAAQLKAKFARFGSIDQSAIRVFWKSSQCRVVFRRKLDAQAALRYAVGNKSLFGNVN 804

Query: 189  VRAYIRE-----------KLVEGEPVKVQ-KEAAPPNEQRTAARI---PXXXXXXXXXXX 323
            VR  +RE           +   G+   V   +A  P  +R AA     P           
Sbjct: 805  VRYNLREVGAPASEAPESEKSRGDDTSVDATQAKDPLVERQAAAFAHQPPSQSAGQLKSI 864

Query: 324  XXXXXXXXXTNEEVGNGNGRGTRVKFVLGGEGAEQ-----VSSYPEVGSSYTHSSSTDVT 488
                          GNG GRGTRVKF+LGGE   +     V +     ++ + +     T
Sbjct: 865  LKKPNGEEAVPVPGGNG-GRGTRVKFILGGEETNRGEQMMVGNRNNFNNNASFADGGAPT 923

Query: 489  TATKI--------------------MPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGG 608
            T   +                    +PT+F  D +  +    +                G
Sbjct: 924  TTVAMDFSSKNFQKVIPPSPLPILPLPTQFANDPLNNSHHHTEVPPRNLHNFIIPPPSSG 983

Query: 609  VEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                P  DISQQ+L+LLT C D+V +++G LG++PYH L
Sbjct: 984  -PSTPSMDISQQMLSLLTTCNDLVTSVSGLLGYMPYHPL 1021


>ref|XP_006286941.1| hypothetical protein CARUB_v10000086mg, partial [Capsella rubella]
            gi|482555647|gb|EOA19839.1| hypothetical protein
            CARUB_v10000086mg, partial [Capsella rubella]
          Length = 1109

 Score =  124 bits (310), Expect = 6e-26
 Identities = 94/281 (33%), Positives = 131/281 (46%), Gaps = 40/281 (14%)
 Frame = +3

Query: 3    GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
            G SLPS A L+A+F RFG LD SA RVFW++ TCR+V+ YKADA+ A  +A G+++LFGN
Sbjct: 834  GTSLPSAALLKARFGRFGLLDQSAIRVFWKSSTCRVVFLYKADAQTAFRYATGNNSLFGN 893

Query: 183  RNVRAYIRE----KLVEGEPVKVQ---------KEAAPPNEQRTAARIPXXXXXXXXXXX 323
             NV+ ++R+    K    EP   +         ++ APP  Q     +P           
Sbjct: 894  VNVKYFLRDVDAPKAEPREPENTKEDDETQSQWQDQAPPLHQPI---LPPPNVNLKSCLK 950

Query: 324  XXXXXXXXXTNEEVGNGNGRGTRVKFVLGG-EGAEQVSSYP----EVGSSYTHSSSTDVT 488
                     +N   GN N    RVKF+LGG E + + S+ P       S+    SS+  +
Sbjct: 951  KPVDDPSSSSNN--GNSNRGSVRVKFMLGGEENSSKTSTEPPQPVTTASNRNSGSSSSSS 1008

Query: 489  TATKIMPTKF------GQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLP--------- 623
             A + +  KF       Q    T P +                +  VE  P         
Sbjct: 1009 VAMEFVSKKFQNVVHHQQLPPSTLPPILPLPPQYSKPHVPIKPVDHVEPPPMPPIRNNFR 1068

Query: 624  -------KNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                     DIS Q+LNLL++C +VV N+TG LG+VPYH L
Sbjct: 1069 GQSQAVSSGDISHQMLNLLSKCNEVVANVTGLLGYVPYHPL 1109


>ref|XP_006472071.1| PREDICTED: uncharacterized protein LOC102607628 isoform X2 [Citrus
            sinensis]
          Length = 1143

 Score =  123 bits (309), Expect = 8e-26
 Identities = 91/268 (33%), Positives = 130/268 (48%), Gaps = 29/268 (10%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+F RFG LD SA RVFW+++TCR+V+++KADA+AA  +A G++ LFGN  
Sbjct: 897  SLPSAAELKARFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVK 956

Query: 189  VRAYIREKLVEGEPV----KVQKEAA----PPNEQRTAARIPXXXXXXXXXXXXXXXXXX 344
            VR  +RE       V    KV+ + +    P  +   A R                    
Sbjct: 957  VRYILREVEAPAPEVPDFDKVRGDESSYETPRIKDPVADRPTPAPGLLPQPNIQLKSCLK 1016

Query: 345  XXTNEE-----VGNGNGRGTRVKFVLGGEGA---EQV-------------SSYPEVGSSY 461
               ++E     +GNG     RVKF+LGGE +   EQ+             +S+ + G++ 
Sbjct: 1017 KPASDEGGQVAMGNGTKGTARVKFMLGGEESNRGEQMMVGNRNNFNNNNNASFADGGAAS 1076

Query: 462  THSSSTDVTTATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKNDISQ 641
            + S + D  T  +           + TP +                       P  DISQ
Sbjct: 1077 SSSVAMDFNTPPR-------NSHNLNTPTISPPPPP--------------PSAPSIDISQ 1115

Query: 642  QLLNLLTRCRDVVNNLTGALGHVPYHSL 725
            Q+L+LLTRC DVV N+TG LG+VPYH L
Sbjct: 1116 QMLSLLTRCNDVVTNVTGLLGYVPYHPL 1143


>ref|XP_007208117.1| hypothetical protein PRUPE_ppa000687mg [Prunus persica]
            gi|462403759|gb|EMJ09316.1| hypothetical protein
            PRUPE_ppa000687mg [Prunus persica]
          Length = 1036

 Score =  120 bits (302), Expect = 5e-25
 Identities = 100/308 (32%), Positives = 134/308 (43%), Gaps = 69/308 (22%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+AKFARFGP+D S  RVFW++ TCR+V+ +K+DA+AAL FA  + +LFGN +
Sbjct: 735  SLPSPAELKAKFARFGPMDQSGLRVFWKSATCRVVFLHKSDAQAALKFATANSSLFGNFS 794

Query: 189  VRAYIREKLVEGEPVKVQKEAAPPNE--------------------QRTAARIPXXXXXX 308
            VR  IRE  V G  V    +   P+E                    Q+  A +P      
Sbjct: 795  VRCQIRE--VGGPEVPDSGKGDNPSEIPRVKDSSVGQSPAMASALRQQQQALLPQSAVQL 852

Query: 309  XXXXXXXXXXXXXXTNEEVGNGNGRGT-RVKFVLGGEGAEQ------------------- 428
                               GNGN +GT RVKF+LGGE + +                   
Sbjct: 853  KSILKKSSGEEQGGQVTTGGNGNSKGTARVKFMLGGEESSRSTDQFMMAGNRNNFNNNNS 912

Query: 429  VSSYPEVGSSYTHSSSTD-------------------VTTATKIMPTKFGQDSIVTTPQL 551
             +S+ + G +  HSSST                     +++  I+P   G       PQ 
Sbjct: 913  SASFAD-GGAAAHSSSTSSIAMDFNTRNFQKVNAPPTFSSSPPILPPPLGPP---LPPQY 968

Query: 552  QKXXXXXXXXXXXXVKMGGVEQ----------LPKNDISQQLLNLLTRCRDVVNNLTGAL 701
             K            +      Q           P  DIS Q+L+LLTRC DVV N+ G L
Sbjct: 969  AKPPHNKFPQHHSEMAPPRNSQHLNTPTAFPSAPSVDISHQMLSLLTRCNDVVANVKGLL 1028

Query: 702  GHVPYHSL 725
            G+VPYH L
Sbjct: 1029 GYVPYHPL 1036


>ref|XP_003535335.1| PREDICTED: uncharacterized protein LOC100812480 [Glycine max]
          Length = 1045

 Score =  120 bits (300), Expect = 9e-25
 Identities = 85/274 (31%), Positives = 127/274 (46%), Gaps = 35/274 (12%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+FARFGP+D S  RVFW+T TCR+V+ +K DA++A  +AL + +LFGN  
Sbjct: 772  SLPSVAELKARFARFGPIDQSGLRVFWKTSTCRVVFLHKVDAQSAYKYALANQSLFGNVG 831

Query: 189  VRAYIREKLVEGEPVKVQKEAAPPN--------------EQRTAARIPXXXXXXXXXXXX 326
            V+ ++RE       V    +A   N              +++++A+ P            
Sbjct: 832  VKCFLREFGDASSEVSEAAKARGDNGANESPRVKNPAVVQRQSSAQQPLPQPTIQLKSIL 891

Query: 327  XXXXXXXXTNEEVGNGNGRGT-RVKFVLGGE----GAEQVSSYPEVGSSYTHSSSTDVTT 491
                           G+ +GT RVKF+LGGE    G + +       +S + +     ++
Sbjct: 892  KKSTADEPGQLTGNGGSSKGTPRVKFMLGGEESSRGEQLMVGNRNSFNSVSFADGGAPSS 951

Query: 492  ATKIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKN-------------- 629
                  +K  Q +I   P                  +   E  P+N              
Sbjct: 952  VAMDFNSKNVQKAISQPPLPNTPPPPTQFTKILQHNLHNSEMAPRNTPNFINATTSATAP 1011

Query: 630  --DISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
              DISQQ+++LLTRC D+VNNLT  LG+VPYH L
Sbjct: 1012 TVDISQQMISLLTRCNDIVNNLTSLLGYVPYHPL 1045


>ref|XP_006433394.1| hypothetical protein CICLE_v10000070mg [Citrus clementina]
            gi|568836067|ref|XP_006472070.1| PREDICTED:
            uncharacterized protein LOC102607628 isoform X1 [Citrus
            sinensis] gi|557535516|gb|ESR46634.1| hypothetical
            protein CICLE_v10000070mg [Citrus clementina]
          Length = 1179

 Score =  119 bits (299), Expect = 1e-24
 Identities = 93/283 (32%), Positives = 132/283 (46%), Gaps = 44/283 (15%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+F RFG LD SA RVFW+++TCR+V+++KADA+AA  +A G++ LFGN  
Sbjct: 897  SLPSAAELKARFGRFGSLDQSAIRVFWKSFTCRVVFKHKADAQAAYKYANGNNTLFGNVK 956

Query: 189  VRAYIREKLVEGEPV----KVQKEAA----PPNEQRTAARIPXXXXXXXXXXXXXXXXXX 344
            VR  +RE       V    KV+ + +    P  +   A R                    
Sbjct: 957  VRYILREVEAPAPEVPDFDKVRGDESSYETPRIKDPVADRPTPAPGLLPQPNIQLKSCLK 1016

Query: 345  XXTNEE-----VGNGNGRGTRVKFVLGGEGA---EQV-------------SSYPEVGSSY 461
               ++E     +GNG     RVKF+LGGE +   EQ+             +S+ + G++ 
Sbjct: 1017 KPASDEGGQVAMGNGTKGTARVKFMLGGEESNRGEQMMVGNRNNFNNNNNASFADGGAAS 1076

Query: 462  THSSSTDVTTAT--KIMPTKFGQDSIVTTPQLQKXXXXXXXXXXXXVKMGG--------- 608
            + S + D  +    K++P       I    Q  K                          
Sbjct: 1077 SSSVAMDFNSKNFQKVVPPFSSSLGIPPHSQYAKPLYNNTHLTDVAPPRNSHNLNTPTIS 1136

Query: 609  ----VEQLPKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                    P  DISQQ+L+LLTRC DVV N+TG LG+VPYH L
Sbjct: 1137 PPPPPPSAPSIDISQQMLSLLTRCNDVVTNVTGLLGYVPYHPL 1179


>ref|XP_004230219.1| PREDICTED: uncharacterized protein LOC101248143 [Solanum
            lycopersicum]
          Length = 1011

 Score =  119 bits (298), Expect = 1e-24
 Identities = 88/258 (34%), Positives = 122/258 (47%), Gaps = 19/258 (7%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            +LPS ++L+A+FARFG LDHSATRVFW++ TCRLVY Y+  A  A  FA  S NLFGN N
Sbjct: 757  ALPSISELKARFARFGALDHSATRVFWKSSTCRLVYLYRNHAVQAFRFASASTNLFGNTN 816

Query: 189  VRAYIREKLVEGEPVKVQKE----AAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXXTN 356
            VR  IRE   E +  +  K      + P +    +R                      T 
Sbjct: 817  VRCSIREVTAEAQDPETTKNDSGGTSAPKDGSADSRSSGKAGQLKSCLKKPPGEEGPTT- 875

Query: 357  EEVGNGNGRGT-RVKFVLGGEG------AEQVSSYPEVGSSYTHSSSTDVTTATKIMPTK 515
             + GNG+ RGT RVKF+LG E        EQ++    V +  T S +    ++T  +   
Sbjct: 876  -DGGNGSNRGTPRVKFMLGAEDNINRDRGEQMNDIKNVNN--TSSIADGSASSTSNINNY 932

Query: 516  FGQDSIVTTPQLQKXXXXXXXXXXXXVK--------MGGVEQLPKNDISQQLLNLLTRCR 671
              Q S+++ P                             V    + + SQQ+L LLT+C 
Sbjct: 933  TSQLSMLSLPSTAHYVNAPNDIHLALQAPLRNAPNYNNQVSSATEANFSQQMLALLTKCS 992

Query: 672  DVVNNLTGALGHVPYHSL 725
            D+V +LT  LG+ PY+ L
Sbjct: 993  DIVTDLTNLLGYFPYNGL 1010


>ref|XP_002882413.1| PWWP domain-containing protein [Arabidopsis lyrata subsp. lyrata]
            gi|297328253|gb|EFH58672.1| PWWP domain-containing
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 887

 Score =  119 bits (297), Expect = 2e-24
 Identities = 85/247 (34%), Positives = 117/247 (47%), Gaps = 6/247 (2%)
 Frame = +3

Query: 3    GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
            G SLPS A L+A+F RFG LD SA RV W++  CR++++YK DA+ AL +A GS+++FGN
Sbjct: 644  GTSLPSTALLKARFGRFGQLDQSAIRVSWKSSICRVIFKYKLDAQTALRYASGSNSIFGN 703

Query: 183  RNVRAYIREKLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXXTNEE 362
             NV  ++R+          +++ A  +E                                
Sbjct: 704  VNVTYFLRDMKASSASGDHEQKKAKADEPIIEPLNQWLEKAPPVHQPNIQLKSCLKKPGN 763

Query: 363  VGNGNGRGTRVKFVLGGEGAEQVSSYPEVGSSYTHSSSTDVTTATKIMPTKFGQDSIVTT 542
             GNGN R  RVKF+LG E     S       +Y  SSS+ V        T+    S  T 
Sbjct: 764  NGNGNHRTVRVKFMLGEETETPFSVSGRNNGNYASSSSSSVAMEYVSENTQNMVPS--TL 821

Query: 543  PQLQKXXXXXXXXXXXXVKMGGVEQLPKN------DISQQLLNLLTRCRDVVNNLTGALG 704
            P +               ++  VE  P N      DIS Q++ LLTRC DVV+N+T  LG
Sbjct: 822  PPILPLSSQDSEPKPVNNQVNHVEP-PINPSQLTVDISLQMMELLTRCNDVVSNVTCLLG 880

Query: 705  HVPYHSL 725
            +VPYH L
Sbjct: 881  YVPYHFL 887


>ref|XP_003626260.1| DNA (cytosine-5)-methyltransferase 3A [Medicago truncatula]
            gi|124360021|gb|ABN08037.1| PWWP [Medicago truncatula]
            gi|355501275|gb|AES82478.1| DNA
            (cytosine-5)-methyltransferase 3A [Medicago truncatula]
          Length = 1114

 Score =  119 bits (297), Expect = 2e-24
 Identities = 83/275 (30%), Positives = 120/275 (43%), Gaps = 36/275 (13%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+FARFGP+D S  R+FW++ TCR+V+ YK+DA+AA  F++G+ +LFG+  
Sbjct: 840  SLPSVAELKARFARFGPMDQSGFRIFWKSSTCRVVFLYKSDAQAAYKFSVGNPSLFGSTG 899

Query: 189  VRAYIRE---------KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXXXXXXXX 341
            V   +RE         K+   + +        P   +    +                  
Sbjct: 900  VTCLLREIGDSASEATKVRGDDGINETPRVKDPAVAQKQTSVSSQKPLLPQPTIQLKSIL 959

Query: 342  XXXTNEEVGNGNGRG------TRVKFVLGGEGAEQ----------------VSSYPEVGS 455
               T +E G G G G      +RVKF+L GE + +                 +  P V  
Sbjct: 960  KKSTGDESGQGTGNGSSSKGNSRVKFMLVGEESNRGEPLMVGNKNNNANLSDAGAPSVAM 1019

Query: 456  SYTHSSSTDVTTATKIMPTKFGQDSIVTTPQ-----LQKXXXXXXXXXXXXVKMGGVEQL 620
             +   +   VTT T   P        + TPQ      +                     +
Sbjct: 1020 DFISKNIQKVTTTTSQPPLLPTPPQFLKTPQHNLRNSELATTSRNNPNFNSTTTASSATV 1079

Query: 621  PKNDISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
               DIS Q++ LLTRC DVV +LTG LG+VPYH L
Sbjct: 1080 TSVDISHQMITLLTRCSDVVTDLTGLLGYVPYHPL 1114


>ref|XP_006408078.1| hypothetical protein EUTSA_v10019994mg [Eutrema salsugineum]
            gi|557109224|gb|ESQ49531.1| hypothetical protein
            EUTSA_v10019994mg [Eutrema salsugineum]
          Length = 980

 Score =  117 bits (292), Expect = 7e-24
 Identities = 88/252 (34%), Positives = 117/252 (46%), Gaps = 11/252 (4%)
 Frame = +3

Query: 3    GASLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGN 182
            G SLPS AQL+A+F RFG LD SA RV W++  CR+V+ YK DA+ AL +A GS +LFGN
Sbjct: 738  GTSLPSTAQLKARFGRFGQLDQSAIRVLWKSSICRVVFLYKLDAQTALRYASGSHSLFGN 797

Query: 183  RNVRAYIRE----KLVEGEPVKVQKEAAPPNEQRTAARIPXXXXXXXXXXXXXXXXXXXX 350
             NV  ++R+       EG   K  K   P  E  +                         
Sbjct: 798  VNVTYFLRDVEAPYASEGHEPKKAKTGEPILEPLSQWIDRAQPPVHQSFNIQPKSCLKKP 857

Query: 351  TNEEVGNGNGRGTRVKFVLGGE--GAEQVSSYPEVGSSYTHSSSTDVTTATKIMPTKFGQ 524
             N   GNGN    RV+F+LGG+  G   + S    G+  + SSS  +   T         
Sbjct: 858  GNN--GNGNRGKARVRFMLGGKETGTPFLDSSKNNGNHSSSSSSVAIEFVT-------NN 908

Query: 525  DSIVTTPQLQKXXXXXXXXXXXXVKMGGVEQLPKN-----DISQQLLNLLTRCRDVVNNL 689
               +  P L               K+  +E   K      DIS+Q++ LL  C DVV+N+
Sbjct: 909  TQNMVPPNLHPIPWKNSKRKPVNNKVDHLEPPLKPSECRVDISEQIMELLLWCNDVVSNV 968

Query: 690  TGALGHVPYHSL 725
            TG LG+VPYH L
Sbjct: 969  TGFLGYVPYHPL 980


>ref|XP_003553721.1| PREDICTED: uncharacterized protein LOC100805944 [Glycine max]
          Length = 1075

 Score =  116 bits (290), Expect = 1e-23
 Identities = 89/284 (31%), Positives = 130/284 (45%), Gaps = 45/284 (15%)
 Frame = +3

Query: 9    SLPSGAQLRAKFARFGPLDHSATRVFWETYTCRLVYQYKADAEAALGFALGSDNLFGNRN 188
            SLPS A+L+A+FARFGP+D S  RVFW + TCR+V+ +K DA+AA  +++GS +LFG+  
Sbjct: 799  SLPSIAELKARFARFGPMDQSGFRVFWNSSTCRVVFLHKVDAQAAYKYSVGSQSLFGSVG 858

Query: 189  VRAYIREKLVEGEPVKVQKEAAPPNEQRTAARIP-------------XXXXXXXXXXXXX 329
            VR ++RE    G+      EAA       A   P                          
Sbjct: 859  VRFFLRE---FGDSAPEVSEAAKARADDGANETPRVKDPAGIHRQTLVSSQQPLLQPIQL 915

Query: 330  XXXXXXXTNEEVGNGNGRG------TRVKFVLGGEGA---EQVSSYPEVGSSYTHSSSTD 482
                   T ++ G   G G      +RVKF+LGGE +   +Q++S    GS    ++++ 
Sbjct: 916  KSCLKKSTGDDSGQVTGNGSSSKGNSRVKFMLGGEESSRGDQLTS----GSRNNFNNASF 971

Query: 483  VTTATKIMPTKFGQDSI--VT----TPQLQKXXXXXXXXXXXXVKMGGVEQLPKN----- 629
                   + T F   ++  VT     P +              ++   +   P+N     
Sbjct: 972  ADAGAPPVATDFNSKNVQKVTLQPPLPPILPLPTQFIKSPQHNLRNSELAMAPRNSPNFI 1031

Query: 630  ------------DISQQLLNLLTRCRDVVNNLTGALGHVPYHSL 725
                        DISQ ++NLLTRC D+V NLTG LG+VPYH L
Sbjct: 1032 NTIASAATATTVDISQPMINLLTRCSDIVTNLTGLLGYVPYHPL 1075

