BLASTX nr result

ID: Mentha29_contig00028272 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00028272
         (835 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU42166.1| hypothetical protein MIMGU_mgv1a000072mg [Mimulus...   287   3e-75
gb|EYU26433.1| hypothetical protein MIMGU_mgv1a0195131mg, partia...   256   1e-65
emb|CBI36804.3| unnamed protein product [Vitis vinifera]              192   2e-46
ref|XP_002280635.2| PREDICTED: uncharacterized protein LOC100263...   191   3e-46
ref|XP_004231275.1| PREDICTED: uncharacterized protein LOC101266...   188   2e-45
ref|XP_006344628.1| PREDICTED: DNA polymerase zeta catalytic sub...   185   2e-44
ref|XP_006344627.1| PREDICTED: DNA polymerase zeta catalytic sub...   185   2e-44
ref|XP_007206444.1| hypothetical protein PRUPE_ppa000111mg [Prun...   184   5e-44
ref|XP_007030809.1| Recovery protein 3 isoform 3 [Theobroma caca...   183   8e-44
ref|XP_007030808.1| Recovery protein 3 isoform 2, partial [Theob...   183   8e-44
ref|XP_007030807.1| Recovery protein 3 isoform 1 [Theobroma caca...   183   8e-44
ref|XP_002512387.1| DNA polymerase zeta catalytic subunit, putat...   182   1e-43
gb|EXC54611.1| Retrovirus-related Pol polyprotein from transposo...   182   1e-43
gb|EXC02386.1| Retrovirus-related Pol polyprotein from transposo...   182   1e-43
ref|XP_006391268.1| hypothetical protein EUTSA_v10017996mg [Eutr...   166   1e-38
gb|AAC18785.1| Similar to putative DNA polymerase gb|M29683 from...   162   1e-37
gb|AAG52299.1|AC011020_6 putative DNA polymerase zeta catalytic ...   162   1e-37
ref|NP_176917.2| DNA polymerase zeta subunit [Arabidopsis thalia...   162   1e-37
ref|NP_001185344.1| DNA polymerase zeta subunit [Arabidopsis tha...   162   1e-37
ref|XP_004494955.1| PREDICTED: DNA polymerase zeta catalytic sub...   160   7e-37

>gb|EYU42166.1| hypothetical protein MIMGU_mgv1a000072mg [Mimulus guttatus]
          Length = 1914

 Score =  287 bits (735), Expect = 3e-75
 Identities = 163/277 (58%), Positives = 191/277 (68%), Gaps = 11/277 (3%)
 Frame = -1

Query: 799  NEGSNLGMSIQGRAAADDLLPFFKRNFQEEQPPC--VSPRKLKPVDNHEAVMGVPVLYQN 626
            N G  +G S++GR   D  LPFF  N  EE+     VSPR  + +D+HE VMGVPV++QN
Sbjct: 865  NGGGKVGPSLEGRFE-DGCLPFFSTNSLEEEEKLQGVSPRNCEHIDSHELVMGVPVMHQN 923

Query: 625  DGSPLYMLTPAMSPPSKQSVDRWLSSD---ILRHKIDSPSQFLPISDGL------SQGSQ 473
            DGS L+MLTPA+SPPS++SVDRWLS D   I   K+D+P   LPIS G       SQGSQ
Sbjct: 924  DGSYLFMLTPAVSPPSRESVDRWLSFDCDNISERKLDAP--ILPISKGFPGDIVDSQGSQ 981

Query: 472  ADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKRKIETSCSPDISQIS 293
            ADD K                  +QL E NQ N   E K   E K+KI T  S DISQIS
Sbjct: 982  ADDKKSDF---------------NQLHELNQGNCHTEVKTLNEAKKKISTGFSQDISQIS 1026

Query: 292  GPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQ 113
            GP K +RLTPLSQIGFRDPASVGQGQQLTL+S+EV A+SRGDL+PDPRFDA+NVIVLV+Q
Sbjct: 1027 GPDKTVRLTPLSQIGFRDPASVGQGQQLTLISIEVLAESRGDLRPDPRFDAVNVIVLVIQ 1086

Query: 112  EDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            ED ES  DT++LLRCD   ++KDLDAVSESK FVF E
Sbjct: 1087 EDEESALDTHILLRCDFDYVEKDLDAVSESKLFVFTE 1123


>gb|EYU26433.1| hypothetical protein MIMGU_mgv1a0195131mg, partial [Mimulus guttatus]
          Length = 1755

 Score =  256 bits (653), Expect = 1e-65
 Identities = 150/279 (53%), Positives = 180/279 (64%), Gaps = 2/279 (0%)
 Frame = -1

Query: 832  EGNKQAALFLRNEGSNLGMSIQGRAAADDLLPFFKRNFQEEQPPC--VSPRKLKPVDNHE 659
            E   + ++   N GS  G S++GR   D  +PFF  N  EE+     VSPR  + +D+HE
Sbjct: 734  EPESEGSVLPGNGGSKAGASLEGRFE-DGCIPFFSTNSLEEEEELQGVSPRNCEYIDSHE 792

Query: 658  AVMGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLSQG 479
             VMGVPV++QNDGS LYMLTPA+SPPS++SVD            +SP             
Sbjct: 793  LVMGVPVMHQNDGSYLYMLTPAVSPPSRESVDS-----------NSP------------- 828

Query: 478  SQADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKRKIETSCSPDISQ 299
                     L  SAS    E+    +QL E NQ+N   E K   E K+KI T  S DISQ
Sbjct: 829  ---------LTSSASY---EEKSDFNQLHELNQDNCHTEVKIRNEAKKKISTGFSQDISQ 876

Query: 298  ISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLV 119
            ISGP K +RLTPLSQIGFRDPASVGQGQQLTL+S+EV A+SRGDL+PDPRFDA+NVIVLV
Sbjct: 877  ISGPDKMVRLTPLSQIGFRDPASVGQGQQLTLISIEVLAESRGDLRPDPRFDAVNVIVLV 936

Query: 118  LQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            +QED ES  DT +LLR DS N++KDLDAVSESK FVF E
Sbjct: 937  IQEDEESALDTRILLRYDSVNVEKDLDAVSESKLFVFTE 975


>emb|CBI36804.3| unnamed protein product [Vitis vinifera]
          Length = 1732

 Score =  192 bits (487), Expect = 2e-46
 Identities = 123/283 (43%), Positives = 166/283 (58%), Gaps = 5/283 (1%)
 Frame = -1

Query: 835  DEGNKQ----AALFLRNEGSNLGMSIQGRAAADDLLPFFKRNFQEEQPPCVSPRKLKPVD 668
            DE  KQ    A+  L N      M  QG    D+ +PFF  + QEE+   V  +    ++
Sbjct: 705  DERLKQTKASASSCLSNSPFEHEMVFQG-TILDEFIPFFVGDCQEEKK--VWNKCYNDLN 761

Query: 667  NHEAV-MGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDG 491
            NH+ V MGVP  YQNDGS LY+LTP  SPPS   V RWL  D      D+ ++ LP+   
Sbjct: 762  NHQEVGMGVPTHYQNDGSFLYLLTPVFSPPSADCVHRWLLHD----DTDTSAEPLPVGSV 817

Query: 490  LSQGSQADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKRKIETSCSP 311
                   D     + ++ +   ++K    D++PE+ Q    +        K K  T+CS 
Sbjct: 818  SHVKPVLDQQNHEIHDNLN---AKKNAFHDKVPEKTQVKGNI-------MKVKKCTNCSQ 867

Query: 310  DISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINV 131
            DISQISGP +K + TPLSQIGFRDPASVG GQQ+TLLS+E+QA+SRGDL+PDPR+DAINV
Sbjct: 868  DISQISGPEEKSKPTPLSQIGFRDPASVGGGQQVTLLSIEIQAESRGDLRPDPRYDAINV 927

Query: 130  IVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            IVL++QED++S  + +VL R +    ++ LD +S  K  V  E
Sbjct: 928  IVLLIQEDDDSALEVFVLCRSNIEPCQRKLDGISGCKVLVSSE 970


>ref|XP_002280635.2| PREDICTED: uncharacterized protein LOC100263126 [Vitis vinifera]
          Length = 2002

 Score =  191 bits (485), Expect = 3e-46
 Identities = 113/251 (45%), Positives = 155/251 (61%), Gaps = 1/251 (0%)
 Frame = -1

Query: 751 DDLLPFFKRNFQEEQPPCVSPRKLKPVDNHEAV-MGVPVLYQNDGSPLYMLTPAMSPPSK 575
           D+ +PFF  + QEE+   V  +    ++NH+ V MGVP  YQNDGS LY+LTP  SPPS 
Sbjct: 150 DEFIPFFVGDCQEEKK--VWNKCYNDLNNHQEVGMGVPTHYQNDGSFLYLLTPVFSPPSA 207

Query: 574 QSVDRWLSSDILRHKIDSPSQFLPISDGLSQGSQADDSKFILPESASMPISEKTPKRDQL 395
             V RWL  D      D+ ++ LP+          D     + ++ +   ++K    D++
Sbjct: 208 DCVHRWLLHD----DTDTSAEPLPVGSVSHVKPVLDQQNHEIHDNLN---AKKNAFHDKV 260

Query: 394 PERNQENITMEAKAFTETKRKIETSCSPDISQISGPGKKIRLTPLSQIGFRDPASVGQGQ 215
           PE+ Q    +        K K  T+CS DISQISGP +K + TPLSQIGFRDPASVG GQ
Sbjct: 261 PEKTQVKGNI-------MKVKKCTNCSQDISQISGPEEKSKPTPLSQIGFRDPASVGGGQ 313

Query: 214 QLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNESVFDTYVLLRCDSANLKKDLDA 35
           Q+TLLS+E+QA+SRGDL+PDPR+DAINVIVL++QED++S  + +VL R +    ++ LD 
Sbjct: 314 QVTLLSIEIQAESRGDLRPDPRYDAINVIVLLIQEDDDSALEVFVLCRSNIEPCQRKLDG 373

Query: 34  VSESKTFVFQE 2
           +S  K  V  E
Sbjct: 374 ISGCKVLVSSE 384


>ref|XP_004231275.1| PREDICTED: uncharacterized protein LOC101266467 [Solanum
            lycopersicum]
          Length = 2734

 Score =  188 bits (478), Expect = 2e-45
 Identities = 121/267 (45%), Positives = 159/267 (59%), Gaps = 17/267 (6%)
 Frame = -1

Query: 751  DDLLPFFKRN-FQEEQPPCVSPRKLKPVDNHEAVMGVPVLYQNDGSPLYMLTPAMSPPSK 575
            D+  PFF+ N   +E+    +      V   + ++GVPV YQNDGS LYMLTP  SPP  
Sbjct: 884  DECPPFFEGNCLVKEKISSANCGTSNYVPCQDNLLGVPVHYQNDGSYLYMLTPVYSPPRS 943

Query: 574  QSVDRWLSSD-ILRHKID--SPSQFLP----ISDGL--SQGSQADDSKFILPESASMPIS 422
            +SV RWLS D ++  K+D  S     P     SD +  SQ SQ+      L  S S P  
Sbjct: 944  ESVRRWLSLDYVVSSKMDVVSAPPVYPSTKVCSDHIAESQDSQSTFCDQPLMYSGSEPNP 1003

Query: 421  EKTPKRDQLPERNQENIT-MEAKAFTETKRKIETSCSP------DISQISGPGKKIRLTP 263
             +     +  E+N   +  +   A  +   +I   C P      D+SQISGP +K RLTP
Sbjct: 1004 NQLQANKKCQEKNGVQMNPVVPDARIKQDEEIILKCEPSMRGSQDLSQISGPDRKSRLTP 1063

Query: 262  LSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNESVFDTY 83
            LSQ GFRDPAS+G GQQLT+LSLEVQA+SRGDL+PDPRFDA+ +IVLV QED++   DT+
Sbjct: 1064 LSQTGFRDPASIGCGQQLTILSLEVQAESRGDLRPDPRFDAVRIIVLVFQEDDDFGSDTH 1123

Query: 82   VLLRCDSANLKKDLDAVSESKTFVFQE 2
            VLL C+  +++++LD VSE K   F E
Sbjct: 1124 VLLHCNGESVQRNLDGVSECKVLTFIE 1150


>ref|XP_006344628.1| PREDICTED: DNA polymerase zeta catalytic subunit-like isoform X2
            [Solanum tuberosum]
          Length = 1747

 Score =  185 bits (469), Expect = 2e-44
 Identities = 120/270 (44%), Positives = 157/270 (58%), Gaps = 20/270 (7%)
 Frame = -1

Query: 751  DDLLPFFKRNF----QEEQPPCVSPRKLKPVDNHEAVMGVPVLYQNDGSPLYMLTPAMSP 584
            D+  PFF+ N     +     C +   +   DN   ++GVPV YQNDGS LYMLTP  SP
Sbjct: 692  DECPPFFEGNCLVGEKISSANCGTSNYVPCQDN---LLGVPVHYQNDGSYLYMLTPVYSP 748

Query: 583  PSKQSVDRWLS---SDILRHKIDSPSQFLP----ISDGL--SQGSQADDSKFILPESASM 431
            P  +SV RWLS   +D  +  + S     P     SD +  SQ SQ+      L +SAS 
Sbjct: 749  PQSESVRRWLSLDCADSSKMDVVSGPPVYPSTKVCSDHIAESQDSQSTFCDQPLMDSASE 808

Query: 430  PISEKTPKRDQLPERNQENIT-MEAKAFTETKRKIETSCSP------DISQISGPGKKIR 272
            P   +     +  E N   +  +   A  +   +I   C P      D+SQISGP +K R
Sbjct: 809  PNPNQLQANKKYQEINSVQMNPVVPDARIKKDEEIILKCEPSMRGSQDLSQISGPDRKSR 868

Query: 271  LTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNESVF 92
            LTPLSQ GFRDPAS+G GQQLT LS+EVQA+SRGDL+PDPRFDA+ +IVLV QED++   
Sbjct: 869  LTPLSQTGFRDPASIGCGQQLTKLSIEVQAESRGDLRPDPRFDAVRIIVLVFQEDDDFRS 928

Query: 91   DTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            DT+VLL C+  +++++LD VSE K   F E
Sbjct: 929  DTHVLLHCNGESVQRNLDGVSECKVLTFIE 958


>ref|XP_006344627.1| PREDICTED: DNA polymerase zeta catalytic subunit-like isoform X1
            [Solanum tuberosum]
          Length = 1976

 Score =  185 bits (469), Expect = 2e-44
 Identities = 120/270 (44%), Positives = 157/270 (58%), Gaps = 20/270 (7%)
 Frame = -1

Query: 751  DDLLPFFKRNF----QEEQPPCVSPRKLKPVDNHEAVMGVPVLYQNDGSPLYMLTPAMSP 584
            D+  PFF+ N     +     C +   +   DN   ++GVPV YQNDGS LYMLTP  SP
Sbjct: 921  DECPPFFEGNCLVGEKISSANCGTSNYVPCQDN---LLGVPVHYQNDGSYLYMLTPVYSP 977

Query: 583  PSKQSVDRWLS---SDILRHKIDSPSQFLP----ISDGL--SQGSQADDSKFILPESASM 431
            P  +SV RWLS   +D  +  + S     P     SD +  SQ SQ+      L +SAS 
Sbjct: 978  PQSESVRRWLSLDCADSSKMDVVSGPPVYPSTKVCSDHIAESQDSQSTFCDQPLMDSASE 1037

Query: 430  PISEKTPKRDQLPERNQENIT-MEAKAFTETKRKIETSCSP------DISQISGPGKKIR 272
            P   +     +  E N   +  +   A  +   +I   C P      D+SQISGP +K R
Sbjct: 1038 PNPNQLQANKKYQEINSVQMNPVVPDARIKKDEEIILKCEPSMRGSQDLSQISGPDRKSR 1097

Query: 271  LTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNESVF 92
            LTPLSQ GFRDPAS+G GQQLT LS+EVQA+SRGDL+PDPRFDA+ +IVLV QED++   
Sbjct: 1098 LTPLSQTGFRDPASIGCGQQLTKLSIEVQAESRGDLRPDPRFDAVRIIVLVFQEDDDFRS 1157

Query: 91   DTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            DT+VLL C+  +++++LD VSE K   F E
Sbjct: 1158 DTHVLLHCNGESVQRNLDGVSECKVLTFIE 1187


>ref|XP_007206444.1| hypothetical protein PRUPE_ppa000111mg [Prunus persica]
            gi|462402086|gb|EMJ07643.1| hypothetical protein
            PRUPE_ppa000111mg [Prunus persica]
          Length = 1771

 Score =  184 bits (466), Expect = 5e-44
 Identities = 116/268 (43%), Positives = 155/268 (57%), Gaps = 3/268 (1%)
 Frame = -1

Query: 796  EGSNLGMSIQGRAAADDLLPFFKRNFQEE---QPPCVSPRKLKPVDNHEAVMGVPVLYQN 626
            E  N      GRA  D+  PFF R+ Q+E   Q  CV   + +   + E+VMGVP+ YQ 
Sbjct: 750  ESKNASSLYDGRAT-DEFCPFFVRDCQDEREIQNKCV---RSESSSHQESVMGVPIHYQT 805

Query: 625  DGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLSQGSQADDSKFILP 446
            DGS LY+LTPA +PPS ++V RWLSSD            LPI   L QGSQ +       
Sbjct: 806  DGSYLYLLTPATTPPSAKNVCRWLSSD-------EKDDVLPI---LHQGSQENHGNH--- 852

Query: 445  ESASMPISEKTPKRDQLPERNQENITMEAKAFTETKRKIETSCSPDISQISGPGKKIRLT 266
                        +R ++ +R  + + ++  +            S D SQISGP  + + T
Sbjct: 853  ----------ETERTEIVQREGDAVKVQTCS----------EYSQDSSQISGPDGRSKPT 892

Query: 265  PLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNESVFDT 86
            PLSQIGFRDPASVG GQQLTLLS+EVQA+SRGDL+PDPRFDAIN+I L +Q D++S+ + 
Sbjct: 893  PLSQIGFRDPASVGGGQQLTLLSVEVQAESRGDLRPDPRFDAINLISLAIQNDSDSIVEI 952

Query: 85   YVLLRCDSANLKKDLDAVSESKTFVFQE 2
            +VLL   + + ++ LD +S  K  VF E
Sbjct: 953  FVLLHSKAESSQRILDGISGCKVLVFYE 980


>ref|XP_007030809.1| Recovery protein 3 isoform 3 [Theobroma cacao]
            gi|590643463|ref|XP_007030810.1| Recovery protein 3
            isoform 3 [Theobroma cacao] gi|508719414|gb|EOY11311.1|
            Recovery protein 3 isoform 3 [Theobroma cacao]
            gi|508719415|gb|EOY11312.1| Recovery protein 3 isoform 3
            [Theobroma cacao]
          Length = 1590

 Score =  183 bits (464), Expect = 8e-44
 Identities = 123/289 (42%), Positives = 168/289 (58%), Gaps = 21/289 (7%)
 Frame = -1

Query: 805  LRNEGSNLGMSIQGRAAADDLLPFFKRNFQEE---QPPCVSPRKLKPVDNHEAVMGVPVL 635
            L NE +  G S  GRA  D++LPFF R  +EE   Q  C+         + EA +GVP+ 
Sbjct: 516  LFNEENCQGTS--GRAL-DEVLPFFSRGCEEEKEVQNKCLGNNNSN--FHQEAALGVPIH 570

Query: 634  YQNDGSPLYMLTPAMSPPSKQSVDRWLSSDI--LRHKIDSPSQFLPISDG------LSQG 479
            YQNDGS LY+LTP  SPPS  SV RWLS D      + ++ S   P   G       S+ 
Sbjct: 571  YQNDGSFLYLLTPVSSPPSPDSVYRWLSCDEEGSHRQSNAVSAESPSLTGSTECLIASEN 630

Query: 478  SQADDSKFILPESASM-----PISEKTPKRDQLPERNQENITMEAKAFTETKRKIET--S 320
            S   +    L +S+S       + +  P+++ +     ++ + E++   +++  I T  +
Sbjct: 631  SSPVNCNEALTKSSSKYHMTSMLEQGHPEKNMVLGSEVKSCSNESRTPCQSEENIRTVNA 690

Query: 319  C---SPDISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPR 149
            C   S D+SQISGP  K R TPLSQIGFRDPASVG GQQLTLLSLEV  +SRGDL+PDPR
Sbjct: 691  CADGSQDMSQISGPDGKSRPTPLSQIGFRDPASVGAGQQLTLLSLEVHTESRGDLRPDPR 750

Query: 148  FDAINVIVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            FDA+NV+ L +Q DN+S  + +VLL   +   +++LD +   K FVF E
Sbjct: 751  FDAVNVVALAIQNDNDSETEVHVLLYSKTGFYQRNLDGIFGLKVFVFSE 799


>ref|XP_007030808.1| Recovery protein 3 isoform 2, partial [Theobroma cacao]
            gi|508719413|gb|EOY11310.1| Recovery protein 3 isoform 2,
            partial [Theobroma cacao]
          Length = 1425

 Score =  183 bits (464), Expect = 8e-44
 Identities = 123/289 (42%), Positives = 168/289 (58%), Gaps = 21/289 (7%)
 Frame = -1

Query: 805  LRNEGSNLGMSIQGRAAADDLLPFFKRNFQEE---QPPCVSPRKLKPVDNHEAVMGVPVL 635
            L NE +  G S  GRA  D++LPFF R  +EE   Q  C+         + EA +GVP+ 
Sbjct: 707  LFNEENCQGTS--GRAL-DEVLPFFSRGCEEEKEVQNKCLGNNNSN--FHQEAALGVPIH 761

Query: 634  YQNDGSPLYMLTPAMSPPSKQSVDRWLSSDI--LRHKIDSPSQFLPISDG------LSQG 479
            YQNDGS LY+LTP  SPPS  SV RWLS D      + ++ S   P   G       S+ 
Sbjct: 762  YQNDGSFLYLLTPVSSPPSPDSVYRWLSCDEEGSHRQSNAVSAESPSLTGSTECLIASEN 821

Query: 478  SQADDSKFILPESASM-----PISEKTPKRDQLPERNQENITMEAKAFTETKRKIET--S 320
            S   +    L +S+S       + +  P+++ +     ++ + E++   +++  I T  +
Sbjct: 822  SSPVNCNEALTKSSSKYHMTSMLEQGHPEKNMVLGSEVKSCSNESRTPCQSEENIRTVNA 881

Query: 319  C---SPDISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPR 149
            C   S D+SQISGP  K R TPLSQIGFRDPASVG GQQLTLLSLEV  +SRGDL+PDPR
Sbjct: 882  CADGSQDMSQISGPDGKSRPTPLSQIGFRDPASVGAGQQLTLLSLEVHTESRGDLRPDPR 941

Query: 148  FDAINVIVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            FDA+NV+ L +Q DN+S  + +VLL   +   +++LD +   K FVF E
Sbjct: 942  FDAVNVVALAIQNDNDSETEVHVLLYSKTGFYQRNLDGIFGLKVFVFSE 990


>ref|XP_007030807.1| Recovery protein 3 isoform 1 [Theobroma cacao]
            gi|508719412|gb|EOY11309.1| Recovery protein 3 isoform 1
            [Theobroma cacao]
          Length = 2035

 Score =  183 bits (464), Expect = 8e-44
 Identities = 123/289 (42%), Positives = 168/289 (58%), Gaps = 21/289 (7%)
 Frame = -1

Query: 805  LRNEGSNLGMSIQGRAAADDLLPFFKRNFQEE---QPPCVSPRKLKPVDNHEAVMGVPVL 635
            L NE +  G S  GRA  D++LPFF R  +EE   Q  C+         + EA +GVP+ 
Sbjct: 961  LFNEENCQGTS--GRAL-DEVLPFFSRGCEEEKEVQNKCLGNNNSN--FHQEAALGVPIH 1015

Query: 634  YQNDGSPLYMLTPAMSPPSKQSVDRWLSSDI--LRHKIDSPSQFLPISDG------LSQG 479
            YQNDGS LY+LTP  SPPS  SV RWLS D      + ++ S   P   G       S+ 
Sbjct: 1016 YQNDGSFLYLLTPVSSPPSPDSVYRWLSCDEEGSHRQSNAVSAESPSLTGSTECLIASEN 1075

Query: 478  SQADDSKFILPESASM-----PISEKTPKRDQLPERNQENITMEAKAFTETKRKIET--S 320
            S   +    L +S+S       + +  P+++ +     ++ + E++   +++  I T  +
Sbjct: 1076 SSPVNCNEALTKSSSKYHMTSMLEQGHPEKNMVLGSEVKSCSNESRTPCQSEENIRTVNA 1135

Query: 319  C---SPDISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPR 149
            C   S D+SQISGP  K R TPLSQIGFRDPASVG GQQLTLLSLEV  +SRGDL+PDPR
Sbjct: 1136 CADGSQDMSQISGPDGKSRPTPLSQIGFRDPASVGAGQQLTLLSLEVHTESRGDLRPDPR 1195

Query: 148  FDAINVIVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            FDA+NV+ L +Q DN+S  + +VLL   +   +++LD +   K FVF E
Sbjct: 1196 FDAVNVVALAIQNDNDSETEVHVLLYSKTGFYQRNLDGIFGLKVFVFSE 1244


>ref|XP_002512387.1| DNA polymerase zeta catalytic subunit, putative [Ricinus communis]
           gi|223548348|gb|EEF49839.1| DNA polymerase zeta
           catalytic subunit, putative [Ricinus communis]
          Length = 2066

 Score =  182 bits (463), Expect = 1e-43
 Identities = 115/276 (41%), Positives = 151/276 (54%), Gaps = 19/276 (6%)
 Frame = -1

Query: 772 IQGRAAADDLLPFFKRNFQEEQPPCVSPRKLKPV----DNHEAVMGVPVLYQNDGSPLYM 605
           +    A  +LLPFF+ + QE++   V   K  P     D  EA+MGVP  YQNDGS LY+
Sbjct: 143 LSAEKALGELLPFFEGDCQEKK---VVQNKALPNTNSNDQQEAIMGVPTHYQNDGSLLYL 199

Query: 604 LTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLSQGSQADDSKFILPESASMPI 425
           LTP  SPPS   V RWL       + D+    L I    S  + + DS  +  ++ SM +
Sbjct: 200 LTPIYSPPSADCVYRWL-------RCDNEDVLLSIG---SPETGSHDSSRVYGDNISMEL 249

Query: 424 SEKTPKR---DQLPERNQENITMEAKAFTE------------TKRKIETSCSPDISQISG 290
              +  R   DQ+ +   + I  E    T+             K    T CS D+SQISG
Sbjct: 250 RSVSNVRLIEDQVQQEEHQIINSEFHPNTDELQRPLHHKENNAKLNACTECSIDLSQISG 309

Query: 289 PGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQE 110
           P ++ R TPLSQIGFRDPAS G GQQLT+LS+EVQA+SRGDL+PDPRFDAIN + L  Q 
Sbjct: 310 PNERSRPTPLSQIGFRDPASTGAGQQLTMLSIEVQAESRGDLRPDPRFDAINTVALAFQN 369

Query: 109 DNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
           DN+S  +  VLL  +  +  +  D +S +K   F E
Sbjct: 370 DNDSTVEVQVLLHSNKESYARSSDGLSVNKVLYFSE 405


>gb|EXC54611.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus
            notabilis]
          Length = 1609

 Score =  182 bits (462), Expect = 1e-43
 Identities = 112/272 (41%), Positives = 148/272 (54%), Gaps = 20/272 (7%)
 Frame = -1

Query: 757  AADDLLPFFKRNFQEE---QPPCVSPRKLKPVDNHEAVMGVPVLYQNDGSPLYMLTPAMS 587
            A DD LPFF  + Q E   Q  C++  +      HEA +GVP  YQNDGS  Y+LTP +S
Sbjct: 1134 ALDDFLPFFMEDCQGEAEIQTKCITVNE--STSEHEAAVGVPAHYQNDGSYSYLLTPLVS 1191

Query: 586  PPSKQSVDRWLS---SDILRHKIDSPSQFLPISDGLSQGSQADDSKFILPESASMPISEK 416
            PPS ++V RWLS   S     K+  P+     S   +    +    +    + S  IS +
Sbjct: 1192 PPSSKNVKRWLSIGTSVEENTKLRKPTSHKGTSSTPNCSPSSSPDDYNKASTGSGSISHQ 1251

Query: 415  TPKRDQLPERN--------------QENITMEAKAFTETKRKIETSCSPDISQISGPGKK 278
               + Q P+ N               E   M  +     K K  + CS DISQISGPG +
Sbjct: 1252 PYMKAQGPQDNCDTNNVDTIRTLACNEETAMVQREGNSAKVKAYSDCSLDISQISGPGGR 1311

Query: 277  IRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNES 98
             + TPLSQIGFRDPASVG GQQLTLLS+EVQ  SRGDL PDPRFDA+NVI L +  D++ 
Sbjct: 1312 SKPTPLSQIGFRDPASVGAGQQLTLLSIEVQVASRGDLLPDPRFDAVNVITLAVHNDSDF 1371

Query: 97   VFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
              + +VLL   + N +++LD ++     +F E
Sbjct: 1372 DVELHVLLHSKAENCQRNLDGITGCTLLIFYE 1403


>gb|EXC02386.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Morus
            notabilis]
          Length = 1854

 Score =  182 bits (462), Expect = 1e-43
 Identities = 112/272 (41%), Positives = 148/272 (54%), Gaps = 20/272 (7%)
 Frame = -1

Query: 757  AADDLLPFFKRNFQEE---QPPCVSPRKLKPVDNHEAVMGVPVLYQNDGSPLYMLTPAMS 587
            A DD LPFF  + Q E   Q  C++  +      HEA +GVP  YQNDGS  Y+LTP +S
Sbjct: 1391 ALDDFLPFFMEDCQGEAEIQTKCITVNE--STSEHEAAVGVPAHYQNDGSYSYLLTPLVS 1448

Query: 586  PPSKQSVDRWLS---SDILRHKIDSPSQFLPISDGLSQGSQADDSKFILPESASMPISEK 416
            PPS ++V RWLS   S     K+  P+     S   +    +    +    + S  IS +
Sbjct: 1449 PPSSKNVKRWLSIGTSVEENTKLRKPTSHKGTSSTPNCSPSSSPDDYNKASTGSGSISHQ 1508

Query: 415  TPKRDQLPERN--------------QENITMEAKAFTETKRKIETSCSPDISQISGPGKK 278
               + Q P+ N               E   M  +     K K  + CS DISQISGPG +
Sbjct: 1509 PYMKAQGPQDNCDTNNVDTIRTLACNEETAMVQREGNSAKVKAYSDCSLDISQISGPGGR 1568

Query: 277  IRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNES 98
             + TPLSQIGFRDPASVG GQQLTLLS+EVQ  SRGDL PDPRFDA+NVI L +  D++ 
Sbjct: 1569 SKPTPLSQIGFRDPASVGAGQQLTLLSIEVQVASRGDLLPDPRFDAVNVITLAVHNDSDF 1628

Query: 97   VFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
              + +VLL   + N +++LD ++     +F E
Sbjct: 1629 DVELHVLLHSKAENCQRNLDGITGCTLLIFYE 1660


>ref|XP_006391268.1| hypothetical protein EUTSA_v10017996mg [Eutrema salsugineum]
            gi|557087702|gb|ESQ28554.1| hypothetical protein
            EUTSA_v10017996mg [Eutrema salsugineum]
          Length = 1887

 Score =  166 bits (420), Expect = 1e-38
 Identities = 96/224 (42%), Positives = 136/224 (60%), Gaps = 5/224 (2%)
 Frame = -1

Query: 658  AVMGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLSQG 479
            A +G+P+ + NDGS LY+LTPA SPPS  SV +W+S+      ID+  Q  P+ D  +  
Sbjct: 891  ASLGIPLHHLNDGSNLYLLTPAFSPPSVDSVSQWISNHKGELTIDAEKQ--PLGDDHASA 948

Query: 478  SQADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKRK-----IETSCS 314
            S              M + EK  + + +   ++ N   E++   E+KRK     ++TS S
Sbjct: 949  SNV------------MSVFEKVEQHNNVFVNSESNAHTESEIDHESKRKFLNLNLQTSVS 996

Query: 313  PDISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAIN 134
             ++SQIS P  K   TPLSQIGFRDPAS+G GQQLT+LS+EV A+SRGDL+PDPRFD++N
Sbjct: 997  QEMSQISAPEGKSGSTPLSQIGFRDPASMGAGQQLTVLSIEVHAESRGDLRPDPRFDSVN 1056

Query: 133  VIVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            VI LV+Q DN    + +VL+       ++++D +S  K  VF E
Sbjct: 1057 VIALVVQNDNSFAAEVFVLVFSPDRIYQRNVDGLSGCKLSVFLE 1100


>gb|AAC18785.1| Similar to putative DNA polymerase gb|M29683 from S. cerevisiae
            [Arabidopsis thaliana]
          Length = 1894

 Score =  162 bits (411), Expect = 1e-37
 Identities = 96/223 (43%), Positives = 140/223 (62%), Gaps = 6/223 (2%)
 Frame = -1

Query: 652  MGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLS-QGS 476
            +G+P+ + NDGS LY+LTPA SPPS  SV +W+S+D     IDS  Q  P+ D  + +G+
Sbjct: 889  LGIPLHHLNDGSNLYLLTPAFSPPSVDSVLQWISNDKGDSNIDSEKQ--PLRDNHNDRGA 946

Query: 475  QADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKR-----KIETSCSP 311
               D   +   S  + +SE   + + L   ++ N   E++   + K       ++ S S 
Sbjct: 947  SFTD---LASASNVVSVSEHVEQHNNLFVNSESNAYTESEIDLKPKGTFLNLNLQASVSQ 1003

Query: 310  DISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINV 131
            ++SQISGP  K   TPLSQ+GFRDPAS+G GQQLT+LS+EV A+SRGDL+PDPRFD++NV
Sbjct: 1004 ELSQISGPDGKSGPTPLSQMGFRDPASMGAGQQLTILSIEVHAESRGDLRPDPRFDSVNV 1063

Query: 130  IVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            I LV+Q D+  V + +VLL    +  ++++D +S  K  VF E
Sbjct: 1064 IALVVQNDDSFVAEVFVLLFSPDSIDQRNVDGLSGCKLSVFLE 1106


>gb|AAG52299.1|AC011020_6 putative DNA polymerase zeta catalytic subunit [Arabidopsis thaliana]
          Length = 1871

 Score =  162 bits (411), Expect = 1e-37
 Identities = 96/223 (43%), Positives = 140/223 (62%), Gaps = 6/223 (2%)
 Frame = -1

Query: 652  MGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLS-QGS 476
            +G+P+ + NDGS LY+LTPA SPPS  SV +W+S+D     IDS  Q  P+ D  + +G+
Sbjct: 889  LGIPLHHLNDGSNLYLLTPAFSPPSVDSVLQWISNDKGDSNIDSEKQ--PLRDNHNDRGA 946

Query: 475  QADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKR-----KIETSCSP 311
               D   +   S  + +SE   + + L   ++ N   E++   + K       ++ S S 
Sbjct: 947  SFTD---LASASNVVSVSEHVEQHNNLFVNSESNAYTESEIDLKPKGTFLNLNLQASVSQ 1003

Query: 310  DISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINV 131
            ++SQISGP  K   TPLSQ+GFRDPAS+G GQQLT+LS+EV A+SRGDL+PDPRFD++NV
Sbjct: 1004 ELSQISGPDGKSGPTPLSQMGFRDPASMGAGQQLTILSIEVHAESRGDLRPDPRFDSVNV 1063

Query: 130  IVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            I LV+Q D+  V + +VLL    +  ++++D +S  K  VF E
Sbjct: 1064 IALVVQNDDSFVAEVFVLLFSPDSIDQRNVDGLSGCKLSVFLE 1106


>ref|NP_176917.2| DNA polymerase zeta subunit [Arabidopsis thaliana]
            gi|75138548|sp|Q766Z3.1|REV3_ARATH RecName: Full=DNA
            polymerase zeta catalytic subunit; AltName: Full=Protein
            reversionless 3-like; Short=AtREV3
            gi|34330129|dbj|BAC82450.1| catalytic subunit of
            polymerase zeta [Arabidopsis thaliana]
            gi|332196534|gb|AEE34655.1| DNA polymerase zeta subunit
            [Arabidopsis thaliana]
          Length = 1890

 Score =  162 bits (411), Expect = 1e-37
 Identities = 96/223 (43%), Positives = 140/223 (62%), Gaps = 6/223 (2%)
 Frame = -1

Query: 652  MGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLS-QGS 476
            +G+P+ + NDGS LY+LTPA SPPS  SV +W+S+D     IDS  Q  P+ D  + +G+
Sbjct: 885  LGIPLHHLNDGSNLYLLTPAFSPPSVDSVLQWISNDKGDSNIDSEKQ--PLRDNHNDRGA 942

Query: 475  QADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKR-----KIETSCSP 311
               D   +   S  + +SE   + + L   ++ N   E++   + K       ++ S S 
Sbjct: 943  SFTD---LASASNVVSVSEHVEQHNNLFVNSESNAYTESEIDLKPKGTFLNLNLQASVSQ 999

Query: 310  DISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINV 131
            ++SQISGP  K   TPLSQ+GFRDPAS+G GQQLT+LS+EV A+SRGDL+PDPRFD++NV
Sbjct: 1000 ELSQISGPDGKSGPTPLSQMGFRDPASMGAGQQLTILSIEVHAESRGDLRPDPRFDSVNV 1059

Query: 130  IVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            I LV+Q D+  V + +VLL    +  ++++D +S  K  VF E
Sbjct: 1060 IALVVQNDDSFVAEVFVLLFSPDSIDQRNVDGLSGCKLSVFLE 1102


>ref|NP_001185344.1| DNA polymerase zeta subunit [Arabidopsis thaliana]
            gi|332196535|gb|AEE34656.1| DNA polymerase zeta subunit
            [Arabidopsis thaliana]
          Length = 1916

 Score =  162 bits (411), Expect = 1e-37
 Identities = 96/223 (43%), Positives = 140/223 (62%), Gaps = 6/223 (2%)
 Frame = -1

Query: 652  MGVPVLYQNDGSPLYMLTPAMSPPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLS-QGS 476
            +G+P+ + NDGS LY+LTPA SPPS  SV +W+S+D     IDS  Q  P+ D  + +G+
Sbjct: 911  LGIPLHHLNDGSNLYLLTPAFSPPSVDSVLQWISNDKGDSNIDSEKQ--PLRDNHNDRGA 968

Query: 475  QADDSKFILPESASMPISEKTPKRDQLPERNQENITMEAKAFTETKR-----KIETSCSP 311
               D   +   S  + +SE   + + L   ++ N   E++   + K       ++ S S 
Sbjct: 969  SFTD---LASASNVVSVSEHVEQHNNLFVNSESNAYTESEIDLKPKGTFLNLNLQASVSQ 1025

Query: 310  DISQISGPGKKIRLTPLSQIGFRDPASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINV 131
            ++SQISGP  K   TPLSQ+GFRDPAS+G GQQLT+LS+EV A+SRGDL+PDPRFD++NV
Sbjct: 1026 ELSQISGPDGKSGPTPLSQMGFRDPASMGAGQQLTILSIEVHAESRGDLRPDPRFDSVNV 1085

Query: 130  IVLVLQEDNESVFDTYVLLRCDSANLKKDLDAVSESKTFVFQE 2
            I LV+Q D+  V + +VLL    +  ++++D +S  K  VF E
Sbjct: 1086 IALVVQNDDSFVAEVFVLLFSPDSIDQRNVDGLSGCKLSVFLE 1128


>ref|XP_004494955.1| PREDICTED: DNA polymerase zeta catalytic subunit-like [Cicer
            arietinum]
          Length = 1914

 Score =  160 bits (404), Expect = 7e-37
 Identities = 102/259 (39%), Positives = 137/259 (52%), Gaps = 7/259 (2%)
 Frame = -1

Query: 757  AADDLLPFFKRNFQEEQPP---CVSPRKLKPVDNHEAVMGVPVLYQNDGSPLYMLTPAMS 587
            A D  LP   +N Q++  P   CV+              GV   YQNDGS LY+LTP + 
Sbjct: 873  ALDVFLPNSAKNSQKQMEPWNKCVTK-----THKFSGTKGVATYYQNDGSHLYLLTPNIL 927

Query: 586  PPSKQSVDRWLSSDILRHKIDSPSQFLPISDGLSQGSQADDSKFILPESASMPISEKTPK 407
            PPS  SV RWL  D    + D+  Q +P            D     P+     +S+    
Sbjct: 928  PPSASSVQRWLFCD--EREPDAEDQDVPKCTSEHPLRHTPDQMHQEPDVEDKDVSKCASG 985

Query: 406  RDQLPERNQENIT-MEAKAFTETKRKIETSC---SPDISQISGPGKKIRLTPLSQIGFRD 239
                PE  Q+  T  +    +E + +   +C   S DISQISGP +K   TPLSQ+GFRD
Sbjct: 986  PPLRPELYQDAGTEKKLTCISEGQTERIEACIDGSQDISQISGPDEKSSFTPLSQVGFRD 1045

Query: 238  PASVGQGQQLTLLSLEVQADSRGDLKPDPRFDAINVIVLVLQEDNESVFDTYVLLRCDSA 59
            PASVG+GQQLTLLS+EV A+SRGDL PDP+FD IN++ L  Q D +++ +  VLL     
Sbjct: 1046 PASVGRGQQLTLLSIEVLAESRGDLLPDPQFDGINIVALGFQNDGDAIIEVLVLLHSKYF 1105

Query: 58   NLKKDLDAVSESKTFVFQE 2
            + ++ LD +S  K  VF +
Sbjct: 1106 SCQRSLDGLSGCKVLVFND 1124

