BLASTX nr result

ID: Mentha26_contig00027142 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00027142
         (644 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU32028.1| hypothetical protein MIMGU_mgv1a001944mg [Mimulus...   145   1e-32
ref|XP_007051995.1| Nucleotidyltransferase family protein isofor...   141   2e-31
ref|XP_007051994.1| Nucleotidyltransferase family protein isofor...   141   2e-31
ref|XP_007051993.1| Nucleotidyltransferase family protein isofor...   141   2e-31
ref|XP_007051992.1| Nucleotidyltransferase family protein isofor...   141   2e-31
ref|XP_007051991.1| Nucleotidyltransferase family protein isofor...   141   2e-31
ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus co...   129   6e-28
dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]                         124   2e-26
ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603...   122   9e-26
ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244...   119   1e-24
ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Popu...   114   3e-23
gb|EXC11712.1| Poly(A) RNA polymerase cid11 [Morus notabilis]         109   8e-22
ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611...   107   2e-21
ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, part...   107   2e-21
ref|XP_006295859.1| hypothetical protein CARUB_v10024989mg [Caps...   100   4e-19
ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, part...    98   2e-18
ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arab...    96   9e-18
ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidop...    96   9e-18
ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313...    95   2e-17
ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prun...    94   4e-17

>gb|EYU32028.1| hypothetical protein MIMGU_mgv1a001944mg [Mimulus guttatus]
          Length = 735

 Score =  145 bits (365), Expect = 1e-32
 Identities = 84/186 (45%), Positives = 112/186 (60%)
 Frame = +3

Query: 87  TSNDRVKLGDGGSKTAVAPPPGFLSNSKDVRNREPGYGRRASDVNGDKGKGNSGQLHKND 266
           + N+R   GD GS  A+APP    +N K+V NRE GY  R  D   DKGKGNSG  +KN 
Sbjct: 249 SGNERRNQGDNGSHRALAPPGFSSNNMKNVGNREHGYVTRNPDNYVDKGKGNSGGSYKNG 308

Query: 267 RLSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEEKKANNDGSEMDDLEN 446
            +SN ++ PG   G  IH      +E+  K      G  + + +  +A    S+M+ +E+
Sbjct: 309 GVSNPINSPGSMMG--IH------VEDGGKGKELRFGGQNNKNQGDRAQ---SKMNGIED 357

Query: 447 QVDSLGIEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPL 626
           Q+ SLGIEEESG  + KKK+  DK+YRSD RG+WIMGQRMR +K QT CR DI+R +   
Sbjct: 358 QMGSLGIEEESGETSDKKKNPHDKEYRSDQRGQWIMGQRMRHVKMQTACRKDIDRFNSQF 417

Query: 627 LALVES 644
           L + ES
Sbjct: 418 LTVFES 423


>ref|XP_007051995.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao]
           gi|508704256|gb|EOX96152.1| Nucleotidyltransferase
           family protein isoform 5 [Theobroma cacao]
          Length = 635

 Score =  141 bits (355), Expect = 2e-31
 Identities = 89/214 (41%), Positives = 121/214 (56%), Gaps = 4/214 (1%)
 Frame = +3

Query: 15  TTGYEANERNSRTVMRAQNHVRSITSNDRVKLGDGGSKTAVAPPPGFLSNSKDVR-NREP 191
           T+ Y    RNS    + Q H  S             S  A   PPGFL   +    NR+ 
Sbjct: 205 TSPYVFQHRNSGDRGKQQQHGGSYRPTP--------SPEARRSPPGFLGKPRGGGGNRDF 256

Query: 192 GYGRRASDVNGDKGKGNSGQLHKNDR--LSNQLDFPGLPAGSSIHSASTFDIEESMKQLH 365
           G  RR  + N DK K    Q   ++   LS QLD PG PAGS++ S S  DIEES+ +LH
Sbjct: 257 GNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH 316

Query: 366 AESGEDSRRGEEKKANNDGSEMDDLENQV-DSLGIEEESGGKNTKKKHHRDKDYRSDDRG 542
           ++ G D     +K    DG E+D++  Q+ +SL IE+ES  KN KK+H R+K+ R D+RG
Sbjct: 317 SDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRG 376

Query: 543 KWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           + ++ QRMR++KRQ  CR+DI+RL+ P LAL ES
Sbjct: 377 QRLLSQRMRMLKRQMECRSDIHRLNAPFLALYES 410


>ref|XP_007051994.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma
           cacao] gi|508704255|gb|EOX96151.1|
           Nucleotidyltransferase family protein isoform 4, partial
           [Theobroma cacao]
          Length = 585

 Score =  141 bits (355), Expect = 2e-31
 Identities = 89/214 (41%), Positives = 121/214 (56%), Gaps = 4/214 (1%)
 Frame = +3

Query: 15  TTGYEANERNSRTVMRAQNHVRSITSNDRVKLGDGGSKTAVAPPPGFLSNSKDVR-NREP 191
           T+ Y    RNS    + Q H  S             S  A   PPGFL   +    NR+ 
Sbjct: 205 TSPYVFQHRNSGDRGKQQQHGGSYRPTP--------SPEARRSPPGFLGKPRGGGGNRDF 256

Query: 192 GYGRRASDVNGDKGKGNSGQLHKNDR--LSNQLDFPGLPAGSSIHSASTFDIEESMKQLH 365
           G  RR  + N DK K    Q   ++   LS QLD PG PAGS++ S S  DIEES+ +LH
Sbjct: 257 GNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH 316

Query: 366 AESGEDSRRGEEKKANNDGSEMDDLENQV-DSLGIEEESGGKNTKKKHHRDKDYRSDDRG 542
           ++ G D     +K    DG E+D++  Q+ +SL IE+ES  KN KK+H R+K+ R D+RG
Sbjct: 317 SDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRG 376

Query: 543 KWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           + ++ QRMR++KRQ  CR+DI+RL+ P LAL ES
Sbjct: 377 QRLLSQRMRMLKRQMECRSDIHRLNAPFLALYES 410


>ref|XP_007051993.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma
           cacao] gi|508704254|gb|EOX96150.1|
           Nucleotidyltransferase family protein isoform 3, partial
           [Theobroma cacao]
          Length = 584

 Score =  141 bits (355), Expect = 2e-31
 Identities = 89/214 (41%), Positives = 121/214 (56%), Gaps = 4/214 (1%)
 Frame = +3

Query: 15  TTGYEANERNSRTVMRAQNHVRSITSNDRVKLGDGGSKTAVAPPPGFLSNSKDVR-NREP 191
           T+ Y    RNS    + Q H  S             S  A   PPGFL   +    NR+ 
Sbjct: 205 TSPYVFQHRNSGDRGKQQQHGGSYRPTP--------SPEARRSPPGFLGKPRGGGGNRDF 256

Query: 192 GYGRRASDVNGDKGKGNSGQLHKNDR--LSNQLDFPGLPAGSSIHSASTFDIEESMKQLH 365
           G  RR  + N DK K    Q   ++   LS QLD PG PAGS++ S S  DIEES+ +LH
Sbjct: 257 GNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH 316

Query: 366 AESGEDSRRGEEKKANNDGSEMDDLENQV-DSLGIEEESGGKNTKKKHHRDKDYRSDDRG 542
           ++ G D     +K    DG E+D++  Q+ +SL IE+ES  KN KK+H R+K+ R D+RG
Sbjct: 317 SDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRG 376

Query: 543 KWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           + ++ QRMR++KRQ  CR+DI+RL+ P LAL ES
Sbjct: 377 QRLLSQRMRMLKRQMECRSDIHRLNAPFLALYES 410


>ref|XP_007051992.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao]
           gi|508704253|gb|EOX96149.1| Nucleotidyltransferase
           family protein isoform 2 [Theobroma cacao]
          Length = 621

 Score =  141 bits (355), Expect = 2e-31
 Identities = 89/214 (41%), Positives = 121/214 (56%), Gaps = 4/214 (1%)
 Frame = +3

Query: 15  TTGYEANERNSRTVMRAQNHVRSITSNDRVKLGDGGSKTAVAPPPGFLSNSKDVR-NREP 191
           T+ Y    RNS    + Q H  S             S  A   PPGFL   +    NR+ 
Sbjct: 205 TSPYVFQHRNSGDRGKQQQHGGSYRPTP--------SPEARRSPPGFLGKPRGGGGNRDF 256

Query: 192 GYGRRASDVNGDKGKGNSGQLHKNDR--LSNQLDFPGLPAGSSIHSASTFDIEESMKQLH 365
           G  RR  + N DK K    Q   ++   LS QLD PG PAGS++ S S  DIEES+ +LH
Sbjct: 257 GNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH 316

Query: 366 AESGEDSRRGEEKKANNDGSEMDDLENQV-DSLGIEEESGGKNTKKKHHRDKDYRSDDRG 542
           ++ G D     +K    DG E+D++  Q+ +SL IE+ES  KN KK+H R+K+ R D+RG
Sbjct: 317 SDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRG 376

Query: 543 KWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           + ++ QRMR++KRQ  CR+DI+RL+ P LAL ES
Sbjct: 377 QRLLSQRMRMLKRQMECRSDIHRLNAPFLALYES 410


>ref|XP_007051991.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao]
           gi|508704252|gb|EOX96148.1| Nucleotidyltransferase
           family protein isoform 1 [Theobroma cacao]
          Length = 722

 Score =  141 bits (355), Expect = 2e-31
 Identities = 89/214 (41%), Positives = 121/214 (56%), Gaps = 4/214 (1%)
 Frame = +3

Query: 15  TTGYEANERNSRTVMRAQNHVRSITSNDRVKLGDGGSKTAVAPPPGFLSNSKDVR-NREP 191
           T+ Y    RNS    + Q H  S             S  A   PPGFL   +    NR+ 
Sbjct: 205 TSPYVFQHRNSGDRGKQQQHGGSYRPTP--------SPEARRSPPGFLGKPRGGGGNRDF 256

Query: 192 GYGRRASDVNGDKGKGNSGQLHKNDR--LSNQLDFPGLPAGSSIHSASTFDIEESMKQLH 365
           G  RR  + N DK K    Q   ++   LS QLD PG PAGS++ S S  DIEES+ +LH
Sbjct: 257 GNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATDIEESLLELH 316

Query: 366 AESGEDSRRGEEKKANNDGSEMDDLENQV-DSLGIEEESGGKNTKKKHHRDKDYRSDDRG 542
           ++ G D     +K    DG E+D++  Q+ +SL IE+ES  KN KK+H R+K+ R D+RG
Sbjct: 317 SDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRREKESRIDNRG 376

Query: 543 KWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           + ++ QRMR++KRQ  CR+DI+RL+ P LAL ES
Sbjct: 377 QRLLSQRMRMLKRQMECRSDIHRLNAPFLALYES 410


>ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus communis]
           gi|223548935|gb|EEF50424.1| poly(A) polymerase cid,
           putative [Ricinus communis]
          Length = 696

 Score =  129 bits (325), Expect = 6e-28
 Identities = 80/190 (42%), Positives = 107/190 (56%), Gaps = 22/190 (11%)
 Frame = +3

Query: 141 PPPGFLSNSKDVRNREPGYGRRASDVNGDKGKGNSGQLHKNDR----------------- 269
           PPPGF +  +   N +    RR  D N +K KGN  +L K +                  
Sbjct: 244 PPPGFSNKPRGGGNMDHVSRRRELDHNVNKEKGNHSELSKRNAFLSSESKSLRDGNGSRD 303

Query: 270 --LSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEEKKANNDGSEMDDL- 440
             L+ QLD PG PAGS++HS S  DIEES+   +AE  ED +        NDG ++DD+ 
Sbjct: 304 LGLTRQLDHPGPPAGSNLHSVSALDIEESLLNFNAEMVEDGK--------NDGHDLDDVG 355

Query: 441 ENQVDSLGIEEESGGKNTKK--KHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRL 614
           E   D+L +E ES GKN  K  +H RDK+ RSD+RG+ I+ QRMR++KRQ  CR DI+RL
Sbjct: 356 EELADTLLLEGESEGKNDNKQNRHSRDKESRSDNRGQQILSQRMRMLKRQMECRRDIDRL 415

Query: 615 SGPLLALVES 644
           +   LA+ ES
Sbjct: 416 NVSFLAIYES 425


>dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]
          Length = 748

 Score =  124 bits (311), Expect = 2e-26
 Identities = 78/190 (41%), Positives = 107/190 (56%), Gaps = 22/190 (11%)
 Frame = +3

Query: 141 PPPGFLSNSKDVRNREPGYGRRASDVNGDKGKGNSGQLHKNDRL---------------- 272
           PPPGF +  +   N +    RR  D N +K KGN G+L   + L                
Sbjct: 255 PPPGFSNKPRGGGNWDYVSRRRELDYNVNKEKGNQGELSNRNALFSSEDKIPRDGDRSRD 314

Query: 273 ---SNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEEKKANNDGSEMDDL- 440
              + QLD PG PAGS+++S S  D+E SM  + AE  ED +        ++G E+D+  
Sbjct: 315 LGLTGQLDRPGPPAGSNLYSVSAADVELSMLNVEAEVVEDGK--------DEGRELDEAG 366

Query: 441 ENQVDSLGIEEESGGKNTKK--KHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRL 614
           E  VDSL +E ES GKN KK  +H R+K+ RSD+RG+  + QRMR++KRQ  CR DI+RL
Sbjct: 367 EELVDSLLLEGESDGKNDKKQNRHSREKESRSDNRGQRTLSQRMRMLKRQMECRRDIDRL 426

Query: 615 SGPLLALVES 644
           + P LA+ ES
Sbjct: 427 NAPFLAIYES 436


>ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603223 [Solanum tuberosum]
          Length = 775

 Score =  122 bits (306), Expect = 9e-26
 Identities = 80/206 (38%), Positives = 111/206 (53%), Gaps = 31/206 (15%)
 Frame = +3

Query: 120 GSKTAVAPPPGFLSN--SKDVRNREPGYGRRASDVN------GDKGKGNSGQLHKN---- 263
           G+     PPPGF S   S+D  +          ++N        K +  S  L +N    
Sbjct: 261 GTVRGAVPPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGLNHKYERESKHLTRNGKNY 320

Query: 264 ------DRLSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEE-------- 401
                  R+  QLD P  PAGS +HS    D+E+S  +LH   GED+  GEE        
Sbjct: 321 AIGSDDQRVFRQLDSPVPPAGSKLHSVLGSDVEDSTLELH---GEDAESGEETVSGMRNV 377

Query: 402 --KKANNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDDRGKWIMGQRM 566
             + +    S++D+L E+ + SLG+E+E   ++ KKKHH  RDKDYRSD RG +I+GQRM
Sbjct: 378 LGRSSAQGQSDLDELGEHVISSLGLEDEPDERSDKKKHHASRDKDYRSDKRGAYILGQRM 437

Query: 567 RIMKRQTTCRNDINRLSGPLLALVES 644
           R++KRQ  CR+DINR++G  LA  ES
Sbjct: 438 RMLKRQIACRSDINRMNGAFLATFES 463


>ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244121 [Solanum
           lycopersicum]
          Length = 775

 Score =  119 bits (297), Expect = 1e-24
 Identities = 78/206 (37%), Positives = 110/206 (53%), Gaps = 31/206 (15%)
 Frame = +3

Query: 120 GSKTAVAPPPGFLSN--SKDVRNREPGYGRRASDVN------GDKGKGNSGQLHKN---- 263
           G+   V PPPGF S   S+D  +          ++N        K +  S  L +N    
Sbjct: 261 GTVRGVVPPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGLNHKYERESKHLSRNGKNY 320

Query: 264 ------DRLSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEE-------- 401
                  R+  +LD P  PAGS +HS    D+E+S  +L    GED+  GEE        
Sbjct: 321 AIGSDDQRVFRRLDSPVPPAGSKLHSVLASDVEDSTLELR---GEDAESGEETVSVMRDV 377

Query: 402 --KKANNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDDRGKWIMGQRM 566
             + +    SE+D+L E+ + SLG+E+E   ++ KK HH  RDKDYRSD RG +I+GQRM
Sbjct: 378 LGRSSAQGQSELDELGEHVISSLGLEDEPNERSDKKNHHASRDKDYRSDKRGAYILGQRM 437

Query: 567 RIMKRQTTCRNDINRLSGPLLALVES 644
           R++KRQ  CR+DINR++G  LA  +S
Sbjct: 438 RMLKRQIACRSDINRMNGAFLATFQS 463


>ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa]
           gi|550345065|gb|EEE80585.2| hypothetical protein
           POPTR_0002s15230g [Populus trichocarpa]
          Length = 728

 Score =  114 bits (285), Expect = 3e-23
 Identities = 73/180 (40%), Positives = 108/180 (60%), Gaps = 11/180 (6%)
 Frame = +3

Query: 138 APPPGFLSNSKDVRNREPGYGRRASDVN-----GDKGKGNSGQLHKNDR-----LSNQLD 287
           +PPPGF +  +   N + G  RR  ++N     GD  + N+ ++ +++      L+ QLD
Sbjct: 247 SPPPGFSNKPRGGGNWDYGSRRRELELNITRENGDYSEMNNEKVRRSEGSVELGLTRQLD 306

Query: 288 FPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEEKKANNDGSEMDDL-ENQVDSLG 464
            PG PAGS++HS    +I ES+  L  E+GED +        +DG E+DDL E  VDSL 
Sbjct: 307 RPGPPAGSNLHSVLGSEIGESLINLDGENGEDGK--------DDGGELDDLGEELVDSLL 358

Query: 465 IEEESGGKNTKKKHHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           +  +S GK  KK+ +  K+ RSD+RGK I+ QRMR++K+QT C  DI+RL+   LA+ ES
Sbjct: 359 LNGQSEGKKDKKQSN--KESRSDNRGKKILSQRMRMLKKQTQCCLDIDRLNAAFLAIYES 416


>gb|EXC11712.1| Poly(A) RNA polymerase cid11 [Morus notabilis]
          Length = 703

 Score =  109 bits (272), Expect = 8e-22
 Identities = 81/216 (37%), Positives = 113/216 (52%), Gaps = 15/216 (6%)
 Frame = +3

Query: 42  NSRTVMRAQNHVRSITSNDRVKLGDGGSKTAVAPPPGFLSNSKDV--------RNREPGY 197
           N   ++     + S +S++ V+ G+   +    PPPGF S  K           N   G 
Sbjct: 192 NFNNLVDRSRRLSSNSSSNAVRQGNYEHQRT-NPPPGFRSKPKRTGLNHSIGGENSVSGD 250

Query: 198 GRRASDVN----GDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSASTFDIEESMKQLH 365
             R  DV     G +G G+ G       LS QLD PG P+GS++ S    D+EESM +L 
Sbjct: 251 LMRTRDVLAEDIGIRGDGSRGL-----ELSAQLDRPGPPSGSNLRSVLASDVEESMMKLE 305

Query: 366 AESGEDSRRGEEKKANNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDD 536
           +++ E             G E+DD+ +  VDSL IE+ES  KN  KKH   RDKD RSD 
Sbjct: 306 SDAVEVG----------GGHEIDDIGQRLVDSLLIEDESDDKNETKKHKNSRDKDSRSDS 355

Query: 537 RGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           RG+ ++ QRMR+ KRQ  CR+DI+RL    +A+V+S
Sbjct: 356 RGQRLLSQRMRVYKRQMRCRSDIDRLDDAFIAIVKS 391


>ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611932 [Citrus sinensis]
          Length = 699

 Score =  107 bits (268), Expect = 2e-21
 Identities = 78/184 (42%), Positives = 99/184 (53%), Gaps = 16/184 (8%)
 Frame = +3

Query: 141 PPPGFLSNSKDVRNREPGYGRRASDVNGDK-GKGNSGQLHKNDR--LSNQLDFPGLPAGS 311
           PPPGF   S   R    G  RR  + N D   +  S  +   +   L+ QLD PG P+GS
Sbjct: 207 PPPGF---SNKARVGGSGNSRRGFEHNVDMINRFTSSAVEGGNGVGLTRQLDRPGPPSGS 263

Query: 312 SIHSASTFDIEESMKQLHAESGE-----DSRRGEEKKANNDGSEMDDL-ENQVDSLGIEE 473
           ++HS S  DIEES+  L  E  E     D RR      +  G +MDD  E+ VDSL  ++
Sbjct: 264 NLHSVSALDIEESLLDLRREGRERHLGLDKRRENGPGYSQGGDDMDDFGEDLVDSLLPDD 323

Query: 474 ESGGKN-----TKKKHH--RDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLA 632
           ES  KN       KKH   RDK+ RSD+RGK ++ QRMR +K Q  CR DI RL+ P LA
Sbjct: 324 ESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIECRADIGRLNAPFLA 383

Query: 633 LVES 644
           + ES
Sbjct: 384 IYES 387


>ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina]
           gi|557547469|gb|ESR58447.1| hypothetical protein
           CICLE_v10023615mg, partial [Citrus clementina]
          Length = 1046

 Score =  107 bits (268), Expect = 2e-21
 Identities = 78/184 (42%), Positives = 99/184 (53%), Gaps = 16/184 (8%)
 Frame = +3

Query: 141 PPPGFLSNSKDVRNREPGYGRRASDVNGDK-GKGNSGQLHKNDR--LSNQLDFPGLPAGS 311
           PPPGF   S   R    G  RR  + N D   +  S  +   +   L+ QLD PG P+GS
Sbjct: 238 PPPGF---SNKARVGGSGNSRRGFEHNVDMINRFTSSAVEGGNGVGLTRQLDRPGPPSGS 294

Query: 312 SIHSASTFDIEESMKQLHAESGE-----DSRRGEEKKANNDGSEMDDL-ENQVDSLGIEE 473
           ++HS S  DIEES+  L  E  E     D RR      +  G +MDD  E+ VDSL  ++
Sbjct: 295 NLHSVSALDIEESLLDLRREGRERHLGLDKRRENGPGYSQGGDDMDDFGEDLVDSLLPDD 354

Query: 474 ESGGKN-----TKKKHH--RDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLA 632
           ES  KN       KKH   RDK+ RSD+RGK ++ QRMR +K Q  CR DI RL+ P LA
Sbjct: 355 ESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIECRADIGRLNAPFLA 414

Query: 633 LVES 644
           + ES
Sbjct: 415 IYES 418


>ref|XP_006295859.1| hypothetical protein CARUB_v10024989mg [Capsella rubella]
           gi|482564567|gb|EOA28757.1| hypothetical protein
           CARUB_v10024989mg [Capsella rubella]
          Length = 764

 Score =  100 bits (249), Expect = 4e-19
 Identities = 73/212 (34%), Positives = 104/212 (49%), Gaps = 36/212 (16%)
 Frame = +3

Query: 117 GGSKTAVAPPPGFLSN---------SKD--------VRNREPGYGRRAS-DVNGDKGKGN 242
           G   T   PPPGF SN         SKD         RN +      ++ +   D+ +G 
Sbjct: 233 GFKSTPTPPPPGFSSNQRGWDMNLGSKDDDRGIGSFQRNHDRAMWEHSNLNAEADRLRGL 292

Query: 243 SGQLHKNDRLSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEE------- 401
           S Q      LS Q+D PG P G+S+HS ST D   S   L+ E+   S R +E       
Sbjct: 293 SLQNESKFNLSQQIDHPGPPKGTSLHSVSTADAANSFSMLNKEARGGSERKDELGQLSKM 352

Query: 402 KKANNDGS-----EMDDL-ENQVDSLGIEEESGGKNTK-----KKHHRDKDYRSDDRGKW 548
           K+  N+ S     E+DD  E+ VDSL +E ++  K+ K      K  R+K+ R D+RG+W
Sbjct: 353 KREGNEKSGPGDDEIDDFGEDIVDSLLLEVDTDDKDAKDGKKNSKTSREKESRVDNRGRW 412

Query: 549 IMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           ++ QR+R  K    CRNDI+R   P +A+ +S
Sbjct: 413 LLSQRLRERKMYMACRNDIHRYDAPFMAVYKS 444


>ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, partial [Populus
           trichocarpa] gi|550323667|gb|ERP53113.1| hypothetical
           protein POPTR_0014s06910g, partial [Populus trichocarpa]
          Length = 497

 Score = 97.8 bits (242), Expect = 2e-18
 Identities = 78/226 (34%), Positives = 109/226 (48%), Gaps = 23/226 (10%)
 Frame = +3

Query: 36  ERNSRTVMRAQNHVRSI-------TSNDRVKLGDGGSKTAVAPPPGFLSNSKDVR---NR 185
           ERN     +A +H  +        +S  R  L     +   +PPPGF +  +      N 
Sbjct: 210 ERNRHLEKQANSHSTNFEVRQPGASSGGRGNLHKEQHQNYKSPPPGFSNKPRGGGGGGNW 269

Query: 186 EPGYGRRA------------SDVNGDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSAS 329
           + G  RR             S++N +K + N G +    R + QLD PG P GS++HS  
Sbjct: 270 DHGGRRRELEHTMYREKGDYSELNNEKARRNEGSVEV--RFTRQLDRPGPPPGSNLHSVL 327

Query: 330 TFDIEESMKQLHAESGEDSRRGEEKKANNDGSEMDDL-ENQVDSLGIEEESGGKNTKKKH 506
             +I+ES+  L  E               DG  +DDL E  +DSL +E ES GK  KK+ 
Sbjct: 328 GSEIKESLINLDGE---------------DGGLLDDLGEELMDSLLLEGESDGKKDKKQS 372

Query: 507 HRDKDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
              K+ RSD RG  I+ QRMR++KRQ  CR DI+RL+   LA+ ES
Sbjct: 373 --SKESRSDSRGHNILSQRMRMLKRQMQCRLDIDRLNAAFLAIYES 416


>ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp.
           lyrata] gi|297326027|gb|EFH56447.1| hypothetical protein
           ARALYDRAFT_483698 [Arabidopsis lyrata subsp. lyrata]
          Length = 757

 Score = 95.9 bits (237), Expect = 9e-18
 Identities = 71/220 (32%), Positives = 104/220 (47%), Gaps = 43/220 (19%)
 Frame = +3

Query: 114 DGGSKTAVAPPPGFLSN---------SKDV--------RNREPGYGRRAS--------DV 218
           +G    +  PPPGF SN         SKD         RN +   G  +           
Sbjct: 218 NGRGFKSTPPPPGFSSNQRGRDMNLTSKDDDRGMGSFHRNHDQAMGEHSKFWDQSVNFSA 277

Query: 219 NGDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGE 398
             D+ +G S Q      LS Q+D PGLP G+S+HS S  D  +S   L+ E+   S R E
Sbjct: 278 EADRLRGLSIQNDSKFNLSQQIDHPGLPKGTSLHSVSAADAADSFSMLNKEARGGSERKE 337

Query: 399 E--------KKANNDGSEMDDL-----ENQVDSLGIEEESGGKNTK-----KKHHRDKDY 524
           E        ++ N +   +DD      E+ V SL +E+E+G K+ K      K  R+KD 
Sbjct: 338 ELGRLSKGKREGNANSGPVDDEIEDFGEDIVKSLLLEDETGEKDAKDGKKDSKTSREKDS 397

Query: 525 RSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           R D+RG+ ++GQ+ R++K    CRNDI+R     +A+ +S
Sbjct: 398 RMDNRGQRLLGQKARMVKMYMACRNDIHRYDASFIAVYKS 437


>ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
           gi|13430538|gb|AAK25891.1|AF360181_1 unknown protein
           [Arabidopsis thaliana] gi|14532746|gb|AAK64074.1|
           unknown protein [Arabidopsis thaliana]
           gi|20197056|gb|AAC06161.2| expressed protein
           [Arabidopsis thaliana] gi|330255483|gb|AEC10577.1|
           Nucleotidyltransferase family protein [Arabidopsis
           thaliana]
          Length = 764

 Score = 95.9 bits (237), Expect = 9e-18
 Identities = 67/214 (31%), Positives = 101/214 (47%), Gaps = 36/214 (16%)
 Frame = +3

Query: 111 GDGGSKTAVAPPPGFLSNSKD------VRNREPGYGRRASDVNGDKGK------------ 236
           G G   T   PPPGF SN +        ++ + G GR      G+  K            
Sbjct: 231 GRGLKSTPPPPPPGFSSNQRGWDMSLGSKDDDRGMGRNHDQAMGEHSKVWNQSVDFSAEA 290

Query: 237 ----GNSGQLHKNDRLSNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEE- 401
               G S Q      LS Q+D PG P G+S+HS S  D  +S   L+ E+     R EE 
Sbjct: 291 NRLRGLSIQNESKFNLSQQIDHPGPPKGASLHSVSAADAADSFSMLNKEARRGGERREEL 350

Query: 402 -------KKANNDGSEMDDL-ENQVDSLGIEEESGGKNTK-----KKHHRDKDYRSDDRG 542
                  ++ N +  E++D  E+ V SL +E+E+G K+        K  R+K+ R D+RG
Sbjct: 351 GQLSKAKREGNANSDEIEDFGEDIVKSLLLEDETGEKDANDGKKDSKTSREKESRVDNRG 410

Query: 543 KWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           + ++GQ+ R++K    CRNDI+R     +A+ +S
Sbjct: 411 QRLLGQKARMVKMYMACRNDIHRYDATFIAIYKS 444


>ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313262 [Fragaria vesca
           subsp. vesca]
          Length = 699

 Score = 95.1 bits (235), Expect = 2e-17
 Identities = 66/163 (40%), Positives = 85/163 (52%), Gaps = 1/163 (0%)
 Frame = +3

Query: 159 SNSKDVRNREPGY-GRRASDVNGDKGKGNSGQLHKNDRLSNQLDFPGLPAGSSIHSASTF 335
           S+S   RNRE  +   R   + G+ G G  G       LS QLD PG PAG+++HS S  
Sbjct: 241 SSSGFARNREGSFDNERVRRLAGEDG-GMRGNGDGRKGLSAQLDRPGPPAGTNLHSVSAS 299

Query: 336 DIEESMKQLHAESGEDSRRGEEKKANNDGSEMDDLENQVDSLGIEEESGGKNTKKKHHRD 515
           +IEESM             GE  + ++DG E       V    +EEE   K   K+HH  
Sbjct: 300 EIEESMMNFDG--------GERARKDSDGVE------DVGQHSLEEERDDKIEGKQHH-- 343

Query: 516 KDYRSDDRGKWIMGQRMRIMKRQTTCRNDINRLSGPLLALVES 644
           KD RSDDRG+  + QRMR  KRQT CR DI+R + P L + +S
Sbjct: 344 KDSRSDDRGQHQLSQRMRSYKRQTLCRFDIDRFNAPFLEIFDS 386


>ref|XP_007220905.1| hypothetical protein PRUPE_ppa002004mg [Prunus persica]
           gi|462417367|gb|EMJ22104.1| hypothetical protein
           PRUPE_ppa002004mg [Prunus persica]
          Length = 730

 Score = 94.0 bits (232), Expect = 4e-17
 Identities = 68/197 (34%), Positives = 96/197 (48%), Gaps = 29/197 (14%)
 Frame = +3

Query: 141 PPPGFLSNSKDVRNREPGYGRRASDVNGDKGKGNSGQLHKN-------DRL--------- 272
           PPPGF +NS+   N + G  RR  + N D+ + +S +  +N       +R+         
Sbjct: 250 PPPGFGNNSRGGGNWDSGSRRRDFEHNVDRERQSSSEFVRNRDASFEDERVRRLASEDSR 309

Query: 273 -----------SNQLDFPGLPAGSSIHSASTFDIEESMKQLHAESGEDSRRGEEKKANND 419
                      S QLD PG P G+++HSAS  +IE+SM  L  E              +D
Sbjct: 310 IRGNGARGLGFSAQLDDPGPPTGANLHSASASEIEKSMMNLQHEK-------------DD 356

Query: 420 GSEMDDLENQVDSLGIEEESGGKNTKKKHH--RDKDYRSDDRGKWIMGQRMRIMKRQTTC 593
            +E DD                KN  K+HH  R+KD RSD+RG+ ++ QRMRI K Q  C
Sbjct: 357 KNEEDD----------------KNEAKQHHNSREKDSRSDNRGQHLLSQRMRIFKSQMQC 400

Query: 594 RNDINRLSGPLLALVES 644
           R DI+RL+ P LA+ +S
Sbjct: 401 RFDIDRLNAPFLAIYDS 417

