BLASTX nr result

ID: Zingiber25_contig00030310 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00030310
         (1951 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001062551.1| Os09g0101800 [Oryza sativa Japonica Group] g...   327   1e-86
emb|CBI27315.3| unnamed protein product [Vitis vinifera]              326   3e-86
ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268...   326   3e-86
tpg|DAA39571.1| TPA: hypothetical protein ZEAMMB73_434435 [Zea m...   325   6e-86
ref|XP_004956280.1| PREDICTED: uncharacterized protein LOC101754...   320   1e-84
gb|EEE69170.1| hypothetical protein OsJ_28335 [Oryza sativa Japo...   313   2e-82
ref|XP_006660407.1| PREDICTED: uncharacterized protein LOC102720...   312   3e-82
gb|EMJ28449.1| hypothetical protein PRUPE_ppa017973mg [Prunus pe...   299   3e-78
ref|XP_002512056.1| conserved hypothetical protein [Ricinus comm...   294   1e-76
ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628...   289   3e-75
ref|XP_006856397.1| hypothetical protein AMTR_s00047p00207890 [A...   285   5e-74
gb|EOY01640.1| Histone-lysine N-methyltransferase ATX1, putative...   283   2e-73
gb|EOY01639.1| Histone-lysine N-methyltransferase ATX1, putative...   282   3e-73
ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, part...   277   1e-71
ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305...   273   2e-70
ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp....   272   3e-70
gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thal...   271   1e-69
gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis ...   271   1e-69
ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana] g...   271   1e-69
ref|NP_173957.2| DNA binding protein [Arabidopsis thaliana] gi|3...   271   1e-69

>ref|NP_001062551.1| Os09g0101800 [Oryza sativa Japonica Group]
            gi|51091867|dbj|BAD36680.1| unknown protein [Oryza sativa
            Japonica Group] gi|113630784|dbj|BAF24465.1| Os09g0101800
            [Oryza sativa Japonica Group]
          Length = 1099

 Score =  327 bits (838), Expect = 1e-86
 Identities = 178/409 (43%), Positives = 254/409 (62%), Gaps = 8/409 (1%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDD-LQISVICGIRESDERFVFIYKVPLKDQGEICPYFIG 535
            +L GCY  P PVL I+L    +  L I V+CG+ ES ERF+++Y +  KDQ E  PYF+G
Sbjct: 679  DLMGCYLHPMPVLSIVLNTKNNSSLLIYVLCGLLESCERFLYVYTIVPKDQQETAPYFVG 738

Query: 536  YTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVC 715
            YT L+L  L     GN  ++ S LQFTPDGQ +V + +IR P CR Q + CSCS+C    
Sbjct: 739  YTPLLLSSLERSCTGNLPFERSGLQFTPDGQFLVLLGSIRMPYCRKQIIDCSCSLCKLDQ 798

Query: 716  CEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKS 895
            CE+N + I  V +GYV     L    SLSCI + EPNY++  E    L +W+M   W   
Sbjct: 799  CEDNYLKIVSVDLGYVSLLTKLMAYGSLSCILICEPNYIVTVEDGRNLHIWMMAAGWRII 858

Query: 896  LEEFVLPSFDYL-RPAVELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIF 1072
             EE+V+PS   +    +EL+ +PKS  L+VGH+G GSF LW+ISKR LL+ F  PG ++F
Sbjct: 859  SEEYVIPSSGNVGNSIIELRRMPKSSTLIVGHDGTGSFSLWDISKRTLLATFTAPGIIVF 918

Query: 1073 QILPIGMFNFQDEINTSSCQNM-KLMQEISHHRVA-----EAVATPLQDDNAVWILVSAD 1234
            QI P+   + Q++I  +S  ++ + ++EI+   V+     E++ +P   D A+WIL+S+ 
Sbjct: 919  QIRPVVSCSLQEDIILASVSDIERRLREITVTGVSRKADKESILSP-GKDTAIWILISSA 977

Query: 1235 TDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGV 1414
            + ++ Q     KE     NA WR ALL    ++MG ++D RA++VDV   +GF GTH G+
Sbjct: 978  SVAEYQSDLRAKEH----NARWRLALLANKTLIMGTILDPRATAVDVCGNHGFAGTHGGL 1033

Query: 1415 LYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVADEKCRLLILRQ 1561
            LY WELS+G+KL+      SG + C AVD+KSGVVAVAD+ C+L++  Q
Sbjct: 1034 LYAWELSSGRKLAGTQCFNSGRVSCVAVDAKSGVVAVADDGCQLVLYSQ 1082


>emb|CBI27315.3| unnamed protein product [Vitis vinifera]
          Length = 1177

 Score =  326 bits (835), Expect = 3e-86
 Identities = 173/410 (42%), Positives = 249/410 (60%), Gaps = 12/410 (2%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGY 538
            EL GCY  P PVL +LL   ED++ I V+CG+    +  +FIYKV +K+     P F+GY
Sbjct: 762  ELVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQSPTFVGY 821

Query: 539  TSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCC 718
            T ++LP L   + G        LQFTPDGQS+V +++I+ P CR Q + C CS C   C 
Sbjct: 822  TPIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCSACKLECF 881

Query: 719  EENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSL 898
            EENA+ I ++ +G++     L T++S+ CI V EPN+++A E SG L VW+MN  WS   
Sbjct: 882  EENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMNSTWSVQT 941

Query: 899  EEFVLPSFDYLRPA-VELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIFQ 1075
            E+F++P++D + P  VELK +PK   L+VGH+G G F LW+IS+R+L+SRF  P   IF+
Sbjct: 942  EDFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAMPSISIFE 1001

Query: 1076 ILPIGMFNFQDEINTSSCQNMKLMQEISHHRVAEAVATPLQDDN-----------AVWIL 1222
             +PI +F+FQ E+  SS  ++ L   I+    A  +     ++N           AVW+L
Sbjct: 1002 FIPISLFSFQSEVPLSSNPDVDL--HINKIMAATKMWFSKHNENYTFLPLGGESIAVWLL 1059

Query: 1223 VSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGT 1402
            VS  +DSD Q  N   + Q      WR ALLV+N+V++G  +D RA+++  S G+G IGT
Sbjct: 1060 VSTLSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGHGIIGT 1119

Query: 1403 HDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVADEKCRLLI 1552
            HDG++Y WELSTG KL +L   + G       DS+S V AVA +  +LL+
Sbjct: 1120 HDGLVYMWELSTGTKLGSLHYFKGGVSCIATDDSRSDVFAVAGDGGQLLV 1169


>ref|XP_002273558.1| PREDICTED: uncharacterized protein LOC100268093 [Vitis vinifera]
          Length = 1242

 Score =  326 bits (835), Expect = 3e-86
 Identities = 173/410 (42%), Positives = 249/410 (60%), Gaps = 12/410 (2%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGY 538
            EL GCY  P PVL +LL   ED++ I V+CG+    +  +FIYKV +K+     P F+GY
Sbjct: 827  ELVGCYVHPMPVLSVLLNTREDEIHICVLCGLLVDKDTILFIYKVTIKEPRLQSPTFVGY 886

Query: 539  TSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCC 718
            T ++LP L   + G        LQFTPDGQS+V +++I+ P CR Q + C CS C   C 
Sbjct: 887  TPIILPTLKDRSGGEVALDRFGLQFTPDGQSLVLLNSIKTPYCREQKIPCLCSACKLECF 946

Query: 719  EENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSL 898
            EENA+ I ++ +G++     L T++S+ CI V EPN+++A E SG L VW+MN  WS   
Sbjct: 947  EENAIKIVQIKLGFLSVVEKLKTVDSVQCILVCEPNHLVAVEESGRLHVWVMNSTWSVQT 1006

Query: 899  EEFVLPSFDYLRPA-VELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIFQ 1075
            E+F++P++D + P  VELK +PK   L+VGH+G G F LW+IS+R+L+SRF  P   IF+
Sbjct: 1007 EDFIIPTYDCVSPCIVELKRIPKCAPLVVGHHGFGEFSLWDISQRILISRFAMPSISIFE 1066

Query: 1076 ILPIGMFNFQDEINTSSCQNMKLMQEISHHRVAEAVATPLQDDN-----------AVWIL 1222
             +PI +F+FQ E+  SS  ++ L   I+    A  +     ++N           AVW+L
Sbjct: 1067 FIPISLFSFQSEVPLSSNPDVDL--HINKIMAATKMWFSKHNENYTFLPLGGESIAVWLL 1124

Query: 1223 VSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGT 1402
            VS  +DSD Q  N   + Q      WR ALLV+N+V++G  +D RA+++  S G+G IGT
Sbjct: 1125 VSTLSDSDTQHDNQMNDCQTNPVGWWRLALLVKNMVILGSALDPRAAAIGASAGHGIIGT 1184

Query: 1403 HDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVADEKCRLLI 1552
            HDG++Y WELSTG KL +L   + G       DS+S V AVA +  +LL+
Sbjct: 1185 HDGLVYMWELSTGTKLGSLHYFKGGVSCIATDDSRSDVFAVAGDGGQLLV 1234


>tpg|DAA39571.1| TPA: hypothetical protein ZEAMMB73_434435 [Zea mays]
          Length = 1159

 Score =  325 bits (832), Expect = 6e-86
 Identities = 198/570 (34%), Positives = 303/570 (53%), Gaps = 50/570 (8%)
 Frame = +2

Query: 2    SNSAVVSQTQNVTDYVDVNVKHSEFHDHSVNRTEENDSTSKPHPPFQFCQQ--AYQGPVN 175
            S   V+   Q+ T+   ++ K + FH  S   T+ +D+TS      +F  +  A++ P N
Sbjct: 578  STLLVMRDGQHHTEVPAIDQKENRFHSVSYKCTKSDDNTSFHSENVEFVDKHVAFESPDN 637

Query: 176  CV----PSQNIEVTELLSFSDEKPATRN----------IRHE------------QTNICQ 277
             +     SQ    TE     D   A             IR++            + N+C+
Sbjct: 638  GMYSSDGSQRAITTEGWPAGDGVKADEENLLGKVEECQIRYKNGNKNTILSVYGEGNVCE 697

Query: 278  RQPNS--NDQFDDSLFRAV-------------RLDEVLDESFELAGCYRQPKPVLFILLR 412
              P    ND F      A+             R     D   EL GCY  P PVL ++L 
Sbjct: 698  HIPTKGENDVFHHQPSHALSTTNCTHGLVSEDRTQARPDHHLELVGCYLHPMPVLSVMLN 757

Query: 413  PYE-DDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTM 589
                + L I V+CG+ ES +RF+++Y +  KDQ ++ P F+GYT L+LP L   + GN +
Sbjct: 758  TKNHNSLYIYVLCGLLESCQRFLYVYSINPKDQKDVSPCFVGYTPLVLPTLDHSSTGNFL 817

Query: 590  YKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLP 769
            +  S L FTPDGQ +V +S+I+ P CR Q++ C C +C    CE+N++ I  V  GY   
Sbjct: 818  FGRSGLHFTPDGQFLVLLSSIKIPSCRMQNIDCLCPVCNLCQCEDNSLKIVSVNSGYASL 877

Query: 770  SATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPAVEL 949
               L    ++SCI ++EPNY++A E S  L +W M   WS+  E++++PS   + P +EL
Sbjct: 878  VTNLMPYGTVSCILIFEPNYIVAIEDSRNLHIWEMVDGWSEISEQYMIPSLGNMGPILEL 937

Query: 950  KTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSC 1129
              +PK+  L++GH+G G F LW+ISKR LL+ F  PGN +FQILP+G+ + Q++I  +  
Sbjct: 938  TRMPKNTSLIIGHDGEGGFCLWDISKRTLLATFAAPGNTVFQILPVGLCSLQEDIIHAPV 997

Query: 1130 QNM-KLMQEIS-----HHRVAEAVATPLQDDNAVWILVSADTDSDPQVVNHPKERQPTSN 1291
             ++ K +Q I+        V     TP + D AVW+L+S  + ++ Q     KE    ++
Sbjct: 998  DDIDKNLQVITVGDLYRKNVRVNFVTPPRQDIAVWVLISCASVAEYQHDLQAKE----NS 1053

Query: 1292 ASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQ 1471
            A WR ALL +  V +G+++D R +++D    YG+ GTH G+LY WELS+G+K++      
Sbjct: 1054 ARWRFALLAKKRVFIGNVLDTRITALDACGNYGYAGTHGGLLYLWELSSGRKVTGTQCFN 1113

Query: 1472 SGSIMCTAVDSKSGVVAVADEKCRLLILRQ 1561
            SG + C +VDSKSG VAV D  C +L+  Q
Sbjct: 1114 SGRVSCVSVDSKSGAVAVTDGGCHVLLYTQ 1143


>ref|XP_004956280.1| PREDICTED: uncharacterized protein LOC101754000 [Setaria italica]
          Length = 1120

 Score =  320 bits (821), Expect = 1e-84
 Identities = 173/409 (42%), Positives = 251/409 (61%), Gaps = 8/409 (1%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDD-LQISVICGIRESDERFVFIYKVPLKDQGEICPYFIG 535
            EL GCY  P PVL I+L       L I V+CG+ ES +R V++Y V  KDQ +  P F+G
Sbjct: 693  ELVGCYLHPMPVLSIMLNTKNHSRLYIYVLCGLLESYQRSVYVYTVT-KDQQDAPPCFVG 751

Query: 536  YTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVC 715
            YT+L+LP L   + GN     S L FTPDGQ +V +S IR P CR Q++ C CS+C    
Sbjct: 752  YTTLLLPSLDQSSAGNISLARSGLHFTPDGQFVVLLSCIRIPFCRMQNIDCLCSVCKMGR 811

Query: 716  CEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKS 895
            CE+N++ I  V +GYV     L    ++SCI + EPNY++A E S  L +W M   WS+ 
Sbjct: 812  CEDNSLKIVSVNLGYVSLVTKLLPNGTVSCILICEPNYIVACEDSRNLHIWEMVNGWSEI 871

Query: 896  LEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIF 1072
             E++V+PS   + P+V EL+ +PKS  L++GH+G G F LW+ISKR LL+ F  PGN++F
Sbjct: 872  SEQYVIPSLGNVGPSVLELRRMPKSHSLIMGHDGAGGFCLWDISKRTLLAIFAAPGNIVF 931

Query: 1073 QILPIGMFNFQDEINTSSCQNMK------LMQEISHHRVAEAVATPLQDDNAVWILVSAD 1234
            QILP+G+ + Q++I  +   ++        +  +S     E+  TP ++D AVW+L+S+ 
Sbjct: 932  QILPVGLCSLQEDIVHAPVDDIDKKLRGITISGMSRKIDQESFMTPPREDIAVWVLISSA 991

Query: 1235 TDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGV 1414
            + ++ Q     K      NA WR ALL +  ++MG+++D R +++D S  YGF GTH G+
Sbjct: 992  SVAEYQCDLQTK----VHNARWRLALLAKKRIIMGNILDTRVTALDASGNYGFAGTHGGL 1047

Query: 1415 LYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVADEKCRLLILRQ 1561
            LY WELS+G+KL+       G + C AVD+KSG VAV D  C++L+  Q
Sbjct: 1048 LYLWELSSGRKLTGTQCFNRGPVSCVAVDAKSGAVAVTDGGCQVLLYTQ 1096


>gb|EEE69170.1| hypothetical protein OsJ_28335 [Oryza sativa Japonica Group]
          Length = 1106

 Score =  313 bits (802), Expect = 2e-82
 Identities = 178/434 (41%), Positives = 254/434 (58%), Gaps = 33/434 (7%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDD-LQISVICGIRESDERFVFIYKVPLKDQGEICPYFIG 535
            +L GCY  P PVL I+L    +  L I V+CG+ ES ERF+++Y +  KDQ E  PYF+G
Sbjct: 661  DLMGCYLHPMPVLSIVLNTKNNSSLLIYVLCGLLESCERFLYVYTIVPKDQQETAPYFVG 720

Query: 536  YTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVC 715
            YT L+L  L     GN  ++ S LQFTPDGQ +V + +IR P CR Q + CSCS+C    
Sbjct: 721  YTPLLLSSLERSCTGNLPFERSGLQFTPDGQFLVLLGSIRMPYCRKQIIDCSCSLCKLDQ 780

Query: 716  CEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKS 895
            CE+N + I  V +GYV     L    SLSCI + EPNY++  E    L +W+M   W   
Sbjct: 781  CEDNYLKIVSVDLGYVSLLTKLMAYGSLSCILICEPNYIVTVEDGRNLHIWMMAAGWRII 840

Query: 896  LEEFVLPSFDYL-RPAVELKTVPKSDCLLVGHNGIGSFGL-------------------- 1012
             EE+V+PS   +    +EL+ +PKS  L+VGH+G GSF L                    
Sbjct: 841  SEEYVIPSSGNVGNSIIELRRMPKSSTLIVGHDGTGSFSLCNALPDSEIPKMDVVLVNDK 900

Query: 1013 -----WNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSCQNM-KLMQEISHHRVA 1174
                 W+ISKR LL+ F  PG ++FQI P+   + Q++I  +S  ++ + ++EI+   V+
Sbjct: 901  YVLQEWDISKRTLLATFTAPGIIVFQIRPVVSCSLQEDIILASVSDIERRLREITVTGVS 960

Query: 1175 -----EAVATPLQDDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMG 1339
                 E++ +P   D A+WIL+S+ + ++ Q     KE     NA WR ALL    ++MG
Sbjct: 961  RKADKESILSP-GKDTAIWILISSASVAEYQSDLRAKEH----NARWRLALLANKTLIMG 1015

Query: 1340 HMVDFRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVV 1519
             ++D RA++VDV   +GF GTH G+LY WELS+G+KL+      SG + C AVD+KSGVV
Sbjct: 1016 TILDPRATAVDVCGNHGFAGTHGGLLYAWELSSGRKLAGTQCFNSGRVSCVAVDAKSGVV 1075

Query: 1520 AVADEKCRLLILRQ 1561
            AVAD+ C+L++  Q
Sbjct: 1076 AVADDGCQLVLYSQ 1089


>ref|XP_006660407.1| PREDICTED: uncharacterized protein LOC102720352 [Oryza brachyantha]
          Length = 1115

 Score =  312 bits (800), Expect = 3e-82
 Identities = 174/412 (42%), Positives = 245/412 (59%), Gaps = 11/412 (2%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDD-LQISVICGIRESDERFVFIYKVPLKDQGEICPYFIG 535
            +L G Y  P PVL I L    +  L I V+CG  +S +RF+++Y +  KDQ E  PYF+ 
Sbjct: 694  DLMGSYLHPMPVLSITLNTKNNSSLLIYVLCGFLDSCQRFLYVYNIIPKDQQETKPYFVS 753

Query: 536  YTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVC 715
            YT L+L  +     GN  ++ S LQFTPDGQ +VF  TIR P CR QS+ CSCS+C    
Sbjct: 754  YTPLLLSSMERSCTGNLPFERSGLQFTPDGQFLVFFGTIRMPFCRRQSIDCSCSLCKLNQ 813

Query: 716  CEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKS 895
             E N + I  V +GYV     L    +LSCI + EPNY++  E  G L +W+M   W   
Sbjct: 814  YEVNCLKIVSVNLGYVSLLTKLIACGTLSCILICEPNYIVTVEDGGKLHIWMMAAGWRMI 873

Query: 896  LEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIF 1072
             EE+V+PSF  +  ++ EL+ +PK+  L++GH+G GSF LW+I+KR +L+ F  PG +IF
Sbjct: 874  SEEYVIPSFSNVGHSILELRRMPKNSNLIIGHDGAGSFCLWDIAKRTILATFTAPGIIIF 933

Query: 1073 QILPIGMFNFQDEINTSS-CQNMKLMQEISHHRVAEAV----ATPLQDDNAVWILVS--- 1228
            QILP+   + Q++I  +S   + K ++EI+   V+  V          D A+WIL+S   
Sbjct: 934  QILPVVSCSLQEDIILASFSDSEKRLREITISGVSRKVDNESILSSGKDTAIWILISSAS 993

Query: 1229 -ADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTH 1405
             A+  SD +   H        NA WR ALL    V MG ++D RA++V+    +GF GTH
Sbjct: 994  VAEYQSDLRANEH--------NARWRLALLANKTVFMGSILDPRATAVEACGNHGFTGTH 1045

Query: 1406 DGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVADEKCRLLILRQ 1561
             G+LY WELS+G+KL+       G + C AVD+KSGVVAVAD++C+LL+  Q
Sbjct: 1046 GGLLYAWELSSGRKLAGTQCFNRGRVSCVAVDAKSGVVAVADDECQLLLYSQ 1097


>gb|EMJ28449.1| hypothetical protein PRUPE_ppa017973mg [Prunus persica]
          Length = 1170

 Score =  299 bits (766), Expect = 3e-78
 Identities = 174/456 (38%), Positives = 246/456 (53%), Gaps = 11/456 (2%)
 Frame = +2

Query: 230  KPATRNIRHEQTNICQRQPNSNDQFDDSLFRAVRLDEVLDESFELAGCYRQPKPVLFILL 409
            +P       +Q       PNS     DS   ++ L+  L  S E  G Y    PVL +LL
Sbjct: 718  EPNDTETSQKQGTGLMHDPNSVPHSSDSKPHSMELNNELTGSLEFVGRYSHQNPVLSVLL 777

Query: 410  RPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTM 589
                 ++ + V+CG     +  +FIYKV +++    CP F+G+TS+ LP+      G   
Sbjct: 778  SAKGTEIYVCVLCGPLVDKDGSLFIYKVAIEEPRVGCPSFVGHTSVTLPIRKD-YFGRIA 836

Query: 590  YKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLP 769
             + S LQFTPDGQ +V + +I+ P CR  S+HC CS CTS C EEN V I +V +GYV  
Sbjct: 837  LERSSLQFTPDGQYLVLLDSIKTPYCRQGSIHCLCSTCTSNCSEENTVKIVQVRLGYVSK 896

Query: 770  SATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPA-VE 946
             A+L  ++SL CI V EPN ++A   SG L +W+MN  WS  +E FVLP+ D + P  VE
Sbjct: 897  VASLKAVDSLECILVCEPNNLVAVGESGRLHLWVMNSTWSAQIENFVLPAEDCISPGIVE 956

Query: 947  LKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSS 1126
            LK +P    ++VGHNG G F LW+ISK +L+SRF    + I Q +P+ +F ++ +   SS
Sbjct: 957  LKRIPNCTHIVVGHNGFGEFSLWDISKCILVSRFSAASSSICQFVPVSLFTWRIKCPVSS 1016

Query: 1127 CQNMKLMQEISHHRVAEAVATPLQ-------DDNAVWILVSADTDSDPQVVNHPKERQPT 1285
                    +I  H + E VA           +D AVW+LVS+ +DSD Q      +    
Sbjct: 1017 ------YSDIEEH-INELVAATSNNQFSLEGEDIAVWLLVSSSSDSDAQQDYVSDDCDSN 1069

Query: 1286 SNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLR 1465
                WR AL+V+N+V+ G  +D RA+ +  S G G  GT DG++Y WELSTG K   +  
Sbjct: 1070 PMGRWRLALMVKNMVIFGSALDPRAAVIGASAGQGICGTCDGLVYMWELSTGNKFGAMHH 1129

Query: 1466 VQSGSIMCTAVDS---KSGVVAVADEKCRLLILRQK 1564
             + GS+ C A D      G VAVA +   L+ L  +
Sbjct: 1130 FKGGSVSCIATDDSRPSPGAVAVAGDNQLLVFLHSE 1165


>ref|XP_002512056.1| conserved hypothetical protein [Ricinus communis]
            gi|223549236|gb|EEF50725.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1246

 Score =  294 bits (752), Expect = 1e-76
 Identities = 164/412 (39%), Positives = 233/412 (56%), Gaps = 10/412 (2%)
 Frame = +2

Query: 332  LDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQG 511
            L   LD   E  GCY  P PVL +L+R   +++ I V+CG+    +R +F+YK+ ++   
Sbjct: 752  LTNELDGIVEFLGCYFHPMPVLSLLVRRKGNEIYICVLCGLLVEKDRTLFLYKLAIEGPR 811

Query: 512  EICPYFIGYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCS 691
              CP FIG+TS+  P   G       ++ S LQ TPDGQ +V + + RAP CR   + C 
Sbjct: 812  IGCPCFIGHTSVTWPSSTGIFGREISFERSGLQLTPDGQCLVLLGSTRAPCCREGRLECL 871

Query: 692  CSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWL 871
            CS C S C   N V I +V  GYV     L T +SL CI V EP++++AA  +  L +W 
Sbjct: 872  CSACASDCFGSNGVKIVQVKAGYVSVLVKLKTNDSLQCILVCEPDHLVAAGENSRLHLWT 931

Query: 872  MNCNWSKSLEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRF 1048
            MN  WS   EEF + S DY  P + ELK +PK   L++GH+G G F LW+ISKR+ +S+F
Sbjct: 932  MNSVWSAPTEEFTIQSNDYTSPCIMELKRIPKCTSLVIGHDGFGEFTLWDISKRIFVSKF 991

Query: 1049 LNPGNLIFQILPIGMFNFQDEINTSSCQN--------MKLMQEISHHRVAEAVATPLQDD 1204
             +P N + Q  PI +F++Q E++  S  N        M   +  S H +  ++     +D
Sbjct: 992  SSPSNSVHQFSPISLFHWQREVHGLSYSNVEAHVNRLMDATKMFSGHSINHSLP---HED 1048

Query: 1205 NAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDG 1384
             A+W LVS   DSD          Q      WR ALL++N +++G  +D RA+++  S G
Sbjct: 1049 IAIWFLVSTAPDSDALHDYGSSHSQINPVGYWRLALLMKNSLILGSALDPRAAAIGTSAG 1108

Query: 1385 YGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAV-DSKSGVVAVADEK 1537
            +G IGT DG++Y WEL TGKKL  L + + GS  C A  DS SGV+A+AD+K
Sbjct: 1109 HGIIGTLDGLVYMWELLTGKKLGTLHKFKGGSASCIATDDSGSGVLAIADDK 1160


>ref|XP_006484353.1| PREDICTED: uncharacterized protein LOC102628159 [Citrus sinensis]
          Length = 1252

 Score =  289 bits (739), Expect = 3e-75
 Identities = 170/453 (37%), Positives = 253/453 (55%), Gaps = 9/453 (1%)
 Frame = +2

Query: 197  EVTELLSFSDEKPATRNIRHEQTNICQRQPNSNDQFDDSLFRAVRLDEVLDE---SFELA 367
            E+ E L+F++   +  + + E +       N+ +    S  +  +  E ++E   +F+L 
Sbjct: 777  ELDEQLNFAEFNSSVVSQKQEISGCEYTSSNAKESQVSSDLKLQKNVECINELAGTFDLM 836

Query: 368  GCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGYTSL 547
            GCY  P P+L +LL    D + + V CG     +R +FIY V +++     P  +G+TS+
Sbjct: 837  GCYFFPLPILSVLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGNPSCVGHTSV 896

Query: 548  MLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCCEEN 727
            MLP L          + SC  FTPDGQ +V + +++ P CR     C CS CTS   +EN
Sbjct: 897  MLPFLKDNFGREIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCSTCTSHRLDEN 956

Query: 728  AVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSLEEF 907
            AV I KV  GYV   A L T + + CI V EP ++IA   SG L +W MN +WS  +EE 
Sbjct: 957  AVKIVKVKPGYVSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNSSWSAQVEEC 1016

Query: 908  VLPSFDYLRPA-VELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIFQILP 1084
            ++P  D + P  VE+K +PK   L+VGHNG G FG+W+ISKRVL+SRF      I+Q  P
Sbjct: 1017 IIPINDCIYPCIVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAARASIYQFFP 1076

Query: 1085 IGMFNFQDEINTSSCQNMKLMQE-----ISHHRVAEAVATPLQDDNAVWILVSADTDSDP 1249
            I +F++Q   + S   +++L         S H    +    + +D+A+W+LVS  +DSD 
Sbjct: 1077 INLFSWQRNGSVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLLVSTISDSDA 1136

Query: 1250 QVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGVLYKWE 1429
            Q     ++ Q      WR ALLV+N V++G  +D RAS++  S G G IGT+DG++Y WE
Sbjct: 1137 QHNCMSRDCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGTNDGLVYAWE 1196

Query: 1430 LSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVA 1528
            LS+G KL  L   + G++ C A D  SG+ A+A
Sbjct: 1197 LSSGNKLGILHHFKGGTVSCIATDD-SGLQALA 1228


>ref|XP_006856397.1| hypothetical protein AMTR_s00047p00207890 [Amborella trichopoda]
            gi|548860257|gb|ERN17864.1| hypothetical protein
            AMTR_s00047p00207890 [Amborella trichopoda]
          Length = 1532

 Score =  285 bits (729), Expect = 5e-74
 Identities = 157/406 (38%), Positives = 236/406 (58%), Gaps = 8/406 (1%)
 Frame = +2

Query: 359  ELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGY 538
            EL GCY  P PVL + +    D + I VICG+ E  ER +F+Y+V +++  + CP F+GY
Sbjct: 1120 ELIGCYEHPNPVLSLFMSTQGDIVYICVICGLVERRERSIFVYEVLVREMNQSCPSFLGY 1179

Query: 539  TSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCC 718
            T + LP    P       + S +Q TPDGQ++V + TI+AP CR Q ++C CSMC+S+ C
Sbjct: 1180 TMISLPSQGEPFDIKIATERSWIQLTPDGQALVLLDTIQAPACREQDLNCLCSMCSSI-C 1238

Query: 719  EENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSL 898
            E NAV I  V +GYV     + T+E   CI V  P++++A+  SG L V LMN +WS   
Sbjct: 1239 EPNAVKIVAVKLGYVSALTKVKTMEIAYCILVCAPSFLVASGASGRLHVLLMNSSWSTIA 1298

Query: 899  EEFVLPSFDYLRPA--VELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIF 1072
            EEFVLPS +   PA  +EL+ +P+   L++GHN  G FG W+IS+RVLL RF      + 
Sbjct: 1299 EEFVLPSSNQ-TPASIIELRKIPQCSYLVIGHNSHGGFGFWDISRRVLLGRFSGFECSVD 1357

Query: 1073 QILPIGMFNFQDE------INTSSCQNMKLMQEISHHRVAEAVATPLQDDNAVWILVSAD 1234
             +LPIG+F+ Q        +     ++  +M          +V+    ++ AVW+L+S  
Sbjct: 1358 HVLPIGLFSRQKRDPMITGLMEDELRDEAIMATEDWISGGYSVSNLEGNEIAVWLLISTA 1417

Query: 1235 TDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGV 1414
            +    Q+   P++        WR ALLV N VVMG +++ RA +  +  GYG IGTH+G+
Sbjct: 1418 SHPGEQLSQLPEDSDGNHFGCWRLALLVNNAVVMGTILNPRAKAAAILAGYGIIGTHEGL 1477

Query: 1415 LYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVADEKCRLLI 1552
            +Y WELS+G+K++N L    G +     D+ S ++AV  ++C +L+
Sbjct: 1478 VYMWELSSGRKVAN-LHSFDGGVSSIVADTSSSILAVVGDECNVLL 1522


>gb|EOY01640.1| Histone-lysine N-methyltransferase ATX1, putative isoform 2
            [Theobroma cacao] gi|508709744|gb|EOY01641.1|
            Histone-lysine N-methyltransferase ATX1, putative isoform
            2 [Theobroma cacao]
          Length = 1128

 Score =  283 bits (724), Expect = 2e-73
 Identities = 172/427 (40%), Positives = 246/427 (57%), Gaps = 16/427 (3%)
 Frame = +2

Query: 320  RAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPL 499
            R V L+  L     L G Y  P P+  + L    +++ I V+CG+    +R +F+Y+V +
Sbjct: 707  RDVELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICVLCGLLVDKDRTLFLYRVSI 766

Query: 500  KDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSC-LQFTPDGQSIVFVSTIRAPLCRTQ 676
            ++    CP F+GYTS+ L         + +    C LQFTPDGQ +V +  I+ P CR  
Sbjct: 767  EEPSIGCPSFVGYTSVTLTF-------SEIDSERCGLQFTPDGQCLVLLDGIKTPYCREG 819

Query: 677  SMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGT 856
             + C CS+C+S C  EN V I +V  GYV   A L T+ES+ CI V E NY++AA  SG 
Sbjct: 820  IIDCICSICSSGCSNENGVKIVQVNHGYVSLVAKLETVESVQCILVCENNYLVAAGTSGR 879

Query: 857  LRVWLMNCNWSKSLEEFVLPSFDYLRP-AVELKTVPKSDCLLVGHNGIGSFGLWNISKRV 1033
            L +W+MN  WS   EEF+LP+ D L P  VELK +PK   L++GHNGIG F +W+I KR+
Sbjct: 880  LHLWVMNSTWSAWTEEFILPAGDCLSPCVVELKRIPKCARLVIGHNGIGEFVVWDILKRL 939

Query: 1034 LLSRFLNPGNLIFQILPIGMFNFQ------------DEINTSSCQNMKLMQEISHHRVAE 1177
            +LSRF   GN I Q LPI +F++Q            DEI T++    K++   S H+  +
Sbjct: 940  ILSRFSASGNPIKQFLPISLFSWQPVFSYADMNGRIDEIFTTT----KIL--FSEHK--D 991

Query: 1178 AVATPLQ-DDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDF 1354
                PL+ +D A+W+L+S  +D + Q    P   Q     SWR ALLV++ V++G  +D 
Sbjct: 992  CFFPPLEGEDIALWLLLSTVSDFEDQYERLPSNCQANPARSWRLALLVKDRVILGSTLDP 1051

Query: 1355 RASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDS-KSGVVAVAD 1531
            RA+++  S  +G IG  DG++Y WELSTG +L  L   + GS+ C A D  +  VVAVA 
Sbjct: 1052 RAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLHHFKGGSVSCIATDDLRPDVVAVAA 1111

Query: 1532 EKCRLLI 1552
            +  +LLI
Sbjct: 1112 DDGQLLI 1118


>gb|EOY01639.1| Histone-lysine N-methyltransferase ATX1, putative isoform 1
            [Theobroma cacao]
          Length = 1329

 Score =  282 bits (722), Expect = 3e-73
 Identities = 173/436 (39%), Positives = 245/436 (56%), Gaps = 25/436 (5%)
 Frame = +2

Query: 320  RAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPL 499
            R V L+  L     L G Y  P P+  + L    +++ I V+CG+    +R +F+Y+V +
Sbjct: 892  RDVELNCDLRGIVNLVGSYFHPLPISSVWLCTKGNEIHICVLCGLLVDKDRTLFLYRVSI 951

Query: 500  KDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSC----------LQFTPDGQSIVFVST 649
            ++    CP F+GYTS+ L        G      S           LQFTPDGQ +V +  
Sbjct: 952  EEPSIGCPSFVGYTSVTLTFSEVSFGGRICCNSSAIFIIDSERCGLQFTPDGQCLVLLDG 1011

Query: 650  IRAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNY 829
            I+ P CR   + C CS+C+S C  EN V I +V  GYV   A L T+ES+ CI V E NY
Sbjct: 1012 IKTPYCREGIIDCICSICSSGCSNENGVKIVQVNHGYVSLVAKLETVESVQCILVCENNY 1071

Query: 830  VIAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRP-AVELKTVPKSDCLLVGHNGIGSF 1006
            ++AA  SG L +W+MN  WS   EEF+LP+ D L P  VELK +PK   L++GHNGIG F
Sbjct: 1072 LVAAGTSGRLHLWVMNSTWSAWTEEFILPAGDCLSPCVVELKRIPKCARLVIGHNGIGEF 1131

Query: 1007 GLWNISKRVLLSRFLNPGNLIFQILPIGMFNFQ------------DEINTSSCQNMKLMQ 1150
             +W+I KR++LSRF   GN I Q LPI +F++Q            DEI T++    K++ 
Sbjct: 1132 VVWDILKRLILSRFSASGNPIKQFLPISLFSWQPVFSYADMNGRIDEIFTTT----KIL- 1186

Query: 1151 EISHHRVAEAVATPLQ-DDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNI 1327
              S H+  +    PL+ +D A+W+L+S  +D + Q    P   Q     SWR ALLV++ 
Sbjct: 1187 -FSEHK--DCFFPPLEGEDIALWLLLSTVSDFEDQYERLPSNCQANPARSWRLALLVKDR 1243

Query: 1328 VVMGHMVDFRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDS- 1504
            V++G  +D RA+++  S  +G IG  DG++Y WELSTG +L  L   + GS+ C A D  
Sbjct: 1244 VILGSTLDPRAAAIGASFDHGIIGRDDGLVYMWELSTGTRLGVLHHFKGGSVSCIATDDL 1303

Query: 1505 KSGVVAVADEKCRLLI 1552
            +  VVAVA +  +LLI
Sbjct: 1304 RPDVVAVAADDGQLLI 1319


>ref|XP_006437884.1| hypothetical protein CICLE_v10033741mg, partial [Citrus clementina]
            gi|557540080|gb|ESR51124.1| hypothetical protein
            CICLE_v10033741mg, partial [Citrus clementina]
          Length = 1177

 Score =  277 bits (709), Expect = 1e-71
 Identities = 162/430 (37%), Positives = 240/430 (55%), Gaps = 9/430 (2%)
 Frame = +2

Query: 197  EVTELLSFSDEKPATRNIRHEQTNICQRQPNSNDQFDDSLFRAVRLDEVLDE---SFELA 367
            E+ E L+F++   +  + + E +       N+ +    S  +  +  E ++E   +F+L 
Sbjct: 744  ELDEQLNFAEFNSSVVSQKQEISGCEYTSSNAKESQVSSDLKLQKNVECINELAGTFDLM 803

Query: 368  GCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFIGYTSL 547
            GCY  P P+L +LL    D + + V CG     +R +FIY V +++     P  +G+TS+
Sbjct: 804  GCYFFPLPILSVLLSTTGDKIYVCVSCGFLVDKKRTLFIYTVDIQEPRVGNPSCVGHTSV 863

Query: 548  MLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSVCCEEN 727
            MLP L          + SC  FTPDGQ +V + +++ P CR     C CS CTS   +EN
Sbjct: 864  MLPFLKDNFGREIALERSCALFTPDGQYLVLLDSMKTPYCREGRSDCLCSTCTSHRLDEN 923

Query: 728  AVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSKSLEEF 907
            AV I KV  GYV   A L T + + CI V EP ++IA   SG L +W MN +WS  +EE 
Sbjct: 924  AVKIVKVNPGYVSVVAKLKTDDCVQCILVCEPKHLIAVGESGKLHLWEMNSSWSAQVEEC 983

Query: 908  VLPSFDYLRPA-VELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLIFQILP 1084
            ++P  D + P  VE+K +PK   L+VGHNG G FG+W+ISKRVL+SRF      I+Q  P
Sbjct: 984  IIPINDCIYPCIVEMKRIPKCAPLVVGHNGFGEFGIWDISKRVLVSRFSAARASIYQFFP 1043

Query: 1085 IGMFNFQDEINTSSCQNMKLMQE-----ISHHRVAEAVATPLQDDNAVWILVSADTDSDP 1249
            I +F++Q   + S   +++L         S H    +    + +D+A+W+LVS  +DSD 
Sbjct: 1044 INLFSWQRNGSVSMDASLELTNTATTSLFSKHSEKSSFCPSVGEDSAIWLLVSTISDSDA 1103

Query: 1250 QVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGVLYKWE 1429
            Q     ++ Q      WR ALLV+N V++G  +D RAS++  S G G IGT+DG++Y WE
Sbjct: 1104 QHNCMSRDCQKNPVRFWRLALLVKNRVILGSPLDPRASAIGASSGLGIIGTNDGLVYAWE 1163

Query: 1430 LSTGKKLSNL 1459
            LS+G KL  L
Sbjct: 1164 LSSGNKLGIL 1173


>ref|XP_004298357.1| PREDICTED: uncharacterized protein LOC101305752 [Fragaria vesca
            subsp. vesca]
          Length = 1259

 Score =  273 bits (698), Expect = 2e-70
 Identities = 168/466 (36%), Positives = 249/466 (53%), Gaps = 2/466 (0%)
 Frame = +2

Query: 173  NCVPSQNIEVTELLSFSDEKPATRNIRHEQTNICQRQPNSNDQFDDSLFRAVRLDEVLDE 352
            N V  + +    LL F D + +     H+Q       PNS     ++       +  L  
Sbjct: 797  NQVDKKVVGNENLLQFIDSETS-----HKQGPSFSYDPNSIPFSSNTKPHKKEHNNGLAG 851

Query: 353  SFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDERFVFIYKVPLKDQGEICPYFI 532
              E  GCY QP PVL +LL      + +SV+CG+    +  +FIYKV +++        +
Sbjct: 852  ILEFVGCYTQPVPVLSVLLSTKGRYIYVSVLCGLLVGKDVSLFIYKVAIEEPMVGHSSLV 911

Query: 533  GYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTIRAPLCRTQSMHCSCSMCTSV 712
            G+TSL LP L   + G  + +  CLQF PDGQ +V +  IR P CR    HC C+ C S 
Sbjct: 912  GHTSLTLPDLTDYSNGMALER-FCLQFIPDGQCLVLLDKIRTPFCRQGKTHCLCTTCASS 970

Query: 713  CCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYVIAAELSGTLRVWLMNCNWSK 892
            C EE+AV I +V +GYV     L   +S  CI V EPN +++   SG L +W+M+  WS 
Sbjct: 971  CSEEDAVKIVQVKLGYVSLVTRLKAAQSQRCILVCEPNNLVSVGKSGRLHLWVMDSTWSA 1030

Query: 893  SLEEFVLPSFDYLRP-AVELKTVPKSDCLLVGHNGIGSFGLWNISKRVLLSRFLNPGNLI 1069
             +E  V+PS D + P  V+LK +P    L+VGHNG G F LW+I+K + +SRF  P   I
Sbjct: 1031 QMEYIVMPSEDCISPGVVDLKRIPNCTHLIVGHNGYGEFSLWDITKCIFVSRFSAPSGSI 1090

Query: 1070 FQILPIGMFNFQDEINTSSCQNMKLMQEISHHRVAEAVATPLQDDNAVWILVSADTDSDP 1249
             Q +PI +F +Q   + SS   M+         +++ +++   +D A+ +LV   +DSD 
Sbjct: 1091 CQFVPISLFAWQMNFHASSHFEMEEHVNQMMASISKTLSSYEGEDVAICLLV-LSSDSDA 1149

Query: 1250 QVVNHPKERQPTSNASWRPALLVRNIVVMGHMVDFRASSVDVSDGYGFIGTHDGVLYKWE 1429
            Q         P     WR AL+V+NIV++G  +D RAS +  S G G  GT DG++Y WE
Sbjct: 1150 QHDYELGNCHPNPVGRWRLALMVKNIVILGTALDSRASVIGASAGQGICGTCDGLVYTWE 1209

Query: 1430 LSTGKKLSNLLRVQSGSIMCTA-VDSKSGVVAVADEKCRLLILRQK 1564
            LS+G KL  +   + GS+ C +  DS+SG VA+A +  ++L+ R +
Sbjct: 1210 LSSGTKLGTMHHFKGGSVSCISNDDSRSGAVAIAGDN-QVLVYRSR 1254


>ref|XP_002893407.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297339249|gb|EFH69666.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 1194

 Score =  272 bits (696), Expect = 3e-70
 Identities = 167/491 (34%), Positives = 266/491 (54%), Gaps = 11/491 (2%)
 Frame = +2

Query: 113  STSKPHPPFQFCQQAYQGPVNCVPSQNIEVTELLSFSDEKPATRNIRHEQTNICQRQPNS 292
            S+S P   F+ CQ       N   +  I+V+E  S   +     N   ++T++ +   +S
Sbjct: 720  SSSFPASKFEDCQ------ANIGEALGIQVSEPPSTKSQ--CKENTSEKRTSVQEFPASS 771

Query: 293  NDQFDDSLFRAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDER 472
            N + +    R V+++  + ++ EL GCY  P PV  +LL+   +++ I V+    E   R
Sbjct: 772  NLEIN----RDVKINNEMGKTVELLGCYFHPMPVSSVLLKSAGNEIYICVLSFATEDRVR 827

Query: 473  FVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTI 652
             +F+YK+  K   +  P  IG+T  +LP++   + GN   + S L FTPDG  ++ +  I
Sbjct: 828  TLFMYKMSAKAPSKGFPSIIGHTPAILPIVDDKSGGNRTLEISNLHFTPDGLHLILIGNI 887

Query: 653  RAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYV 832
            + P CR +   CSC +CTS C EENAV I +V  G+V     L   +S+ C+ V +PN +
Sbjct: 888  KTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNL 947

Query: 833  IAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFG 1009
            IAA  SG L VW MN +WS S EE V+ +   +   + ELK +PK   L++GHNGIG F 
Sbjct: 948  IAAVKSGNLIVWAMNSHWSGSTEESVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFT 1007

Query: 1010 LWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSCQN----------MKLMQEIS 1159
            +W+ISKR L+SRF++P NLIF+ +P  +F +    + S+ ++          +   + I+
Sbjct: 1008 IWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHPVHSHSTIEDHVDMILAATKLWFSKGIN 1067

Query: 1160 HHRVAEAVATPLQDDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMG 1339
            +  +  A       D A+W+LVS D +SD +      +R  +    WR ALLV+N +++G
Sbjct: 1068 NKTLVPAEV----KDTAIWLLVSTDLESDAKC-----DRVESPARCWRLALLVKNQLILG 1118

Query: 1340 HMVDFRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVV 1519
            + +D RA       G+G  GT DG++Y W+LSTG KL +L   +   + C + D  S  +
Sbjct: 1119 NQLDPRADVAGTISGHGVAGTLDGLVYMWDLSTGAKLGSLHDFKGQRVSCISTDD-SRNI 1177

Query: 1520 AVADEKCRLLI 1552
             +A E  +LL+
Sbjct: 1178 CIASEDGQLLV 1188


>gb|AAG50686.1|AC079829_19 hypothetical protein [Arabidopsis thaliana]
          Length = 1196

 Score =  271 bits (692), Expect = 1e-69
 Identities = 168/487 (34%), Positives = 265/487 (54%), Gaps = 7/487 (1%)
 Frame = +2

Query: 113  STSKPHPPFQFCQQAYQGPVNCVPSQNIEVTELLSFSDEKPATRNIRHEQTNICQRQPNS 292
            S+S P   F+ CQ       N      I+V+E    S E     N   + T++ +   +S
Sbjct: 722  SSSFPASKFEDCQ------ANIGEELGIQVSE--PPSTESQYKENTSEKCTSVQEFPASS 773

Query: 293  NDQFDDSLFRAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDER 472
            N + +    R V+++  ++++ EL GCY  P PV  +LLR   +++ I V+    E   R
Sbjct: 774  NLKLN----RDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVR 829

Query: 473  FVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTI 652
             +F+YK+  +   +  P  IG+T  +LP++   + GN   + S L FTPDG  ++    I
Sbjct: 830  TLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNI 889

Query: 653  RAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYV 832
            + P CR +   CSC +CTS C EENAV I +V  G+V     L   +S+ C+ V +PN +
Sbjct: 890  KTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNL 949

Query: 833  IAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFG 1009
            IAA  SG L VW MN +WS   EE+V+ +   +   + ELK +PK   L++GHNGIG F 
Sbjct: 950  IAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFT 1009

Query: 1010 LWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSCQ-NMKLMQEISHHRVAEAV- 1183
            +W+ISKR L+SRF++P NLIF+ +P  +F +    + S+ + N+ ++   +    ++ V 
Sbjct: 1010 IWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHPVHSHSTIEDNVDMILAATKLWFSKGVN 1069

Query: 1184 ---ATPLQ-DDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVD 1351
                 P +  D A+W+LVS D DSD +      +R  +    WR ALLV++ +++G  +D
Sbjct: 1070 NKTLVPAEVKDTAIWLLVSTDLDSDAKC-----DRVESPVRCWRLALLVKDQLILGSQLD 1124

Query: 1352 FRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVAD 1531
             RA       G+G  GT DG++Y W+LSTG KL +L   +   + C + D  S  + +A 
Sbjct: 1125 PRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-SRNICIAS 1183

Query: 1532 EKCRLLI 1552
            E  +LL+
Sbjct: 1184 EDGQLLV 1190


>gb|AAF98582.1|AC013427_25 This gene may be cut off [Arabidopsis thaliana]
          Length = 554

 Score =  271 bits (692), Expect = 1e-69
 Identities = 168/487 (34%), Positives = 265/487 (54%), Gaps = 7/487 (1%)
 Frame = +2

Query: 113  STSKPHPPFQFCQQAYQGPVNCVPSQNIEVTELLSFSDEKPATRNIRHEQTNICQRQPNS 292
            S+S P   F+ CQ       N      I+V+E    S E     N   + T++ +   +S
Sbjct: 80   SSSFPASKFEDCQ------ANIGEELGIQVSE--PPSTESQYKENTSEKCTSVQEFPASS 131

Query: 293  NDQFDDSLFRAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDER 472
            N + +    R V+++  ++++ EL GCY  P PV  +LLR   +++ I V+    E   R
Sbjct: 132  NLKLN----RDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVR 187

Query: 473  FVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTI 652
             +F+YK+  +   +  P  IG+T  +LP++   + GN   + S L FTPDG  ++    I
Sbjct: 188  TLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNI 247

Query: 653  RAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYV 832
            + P CR +   CSC +CTS C EENAV I +V  G+V     L   +S+ C+ V +PN +
Sbjct: 248  KTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNL 307

Query: 833  IAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFG 1009
            IAA  SG L VW MN +WS   EE+V+ +   +   + ELK +PK   L++GHNGIG F 
Sbjct: 308  IAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFT 367

Query: 1010 LWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSCQ-NMKLMQEISHHRVAEAV- 1183
            +W+ISKR L+SRF++P NLIF+ +P  +F +    + S+ + N+ ++   +    ++ V 
Sbjct: 368  IWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHPVHSHSTIEDNVDMILAATKLWFSKGVN 427

Query: 1184 ---ATPLQ-DDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVD 1351
                 P +  D A+W+LVS D DSD +      +R  +    WR ALLV++ +++G  +D
Sbjct: 428  NKTLVPAEVKDTAIWLLVSTDLDSDAKC-----DRVESPVRCWRLALLVKDQLILGSQLD 482

Query: 1352 FRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVAD 1531
             RA       G+G  GT DG++Y W+LSTG KL +L   +   + C + D  S  + +A 
Sbjct: 483  PRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-SRNICIAS 541

Query: 1532 EKCRLLI 1552
            E  +LL+
Sbjct: 542  EDGQLLV 548


>ref|NP_001185099.1| DNA binding protein [Arabidopsis thaliana]
            gi|332192557|gb|AEE30678.1| DNA binding protein
            [Arabidopsis thaliana]
          Length = 1194

 Score =  271 bits (692), Expect = 1e-69
 Identities = 168/487 (34%), Positives = 265/487 (54%), Gaps = 7/487 (1%)
 Frame = +2

Query: 113  STSKPHPPFQFCQQAYQGPVNCVPSQNIEVTELLSFSDEKPATRNIRHEQTNICQRQPNS 292
            S+S P   F+ CQ       N      I+V+E    S E     N   + T++ +   +S
Sbjct: 720  SSSFPASKFEDCQ------ANIGEELGIQVSE--PPSTESQYKENTSEKCTSVQEFPASS 771

Query: 293  NDQFDDSLFRAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDER 472
            N + +    R V+++  ++++ EL GCY  P PV  +LLR   +++ I V+    E   R
Sbjct: 772  NLKLN----RDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVR 827

Query: 473  FVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTI 652
             +F+YK+  +   +  P  IG+T  +LP++   + GN   + S L FTPDG  ++    I
Sbjct: 828  TLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNI 887

Query: 653  RAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYV 832
            + P CR +   CSC +CTS C EENAV I +V  G+V     L   +S+ C+ V +PN +
Sbjct: 888  KTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNL 947

Query: 833  IAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFG 1009
            IAA  SG L VW MN +WS   EE+V+ +   +   + ELK +PK   L++GHNGIG F 
Sbjct: 948  IAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFT 1007

Query: 1010 LWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSCQ-NMKLMQEISHHRVAEAV- 1183
            +W+ISKR L+SRF++P NLIF+ +P  +F +    + S+ + N+ ++   +    ++ V 
Sbjct: 1008 IWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHPVHSHSTIEDNVDMILAATKLWFSKGVN 1067

Query: 1184 ---ATPLQ-DDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVD 1351
                 P +  D A+W+LVS D DSD +      +R  +    WR ALLV++ +++G  +D
Sbjct: 1068 NKTLVPAEVKDTAIWLLVSTDLDSDAKC-----DRVESPVRCWRLALLVKDQLILGSQLD 1122

Query: 1352 FRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVAD 1531
             RA       G+G  GT DG++Y W+LSTG KL +L   +   + C + D  S  + +A 
Sbjct: 1123 PRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-SRNICIAS 1181

Query: 1532 EKCRLLI 1552
            E  +LL+
Sbjct: 1182 EDGQLLV 1188


>ref|NP_173957.2| DNA binding protein [Arabidopsis thaliana]
            gi|332192556|gb|AEE30677.1| DNA binding protein
            [Arabidopsis thaliana]
          Length = 1189

 Score =  271 bits (692), Expect = 1e-69
 Identities = 168/487 (34%), Positives = 265/487 (54%), Gaps = 7/487 (1%)
 Frame = +2

Query: 113  STSKPHPPFQFCQQAYQGPVNCVPSQNIEVTELLSFSDEKPATRNIRHEQTNICQRQPNS 292
            S+S P   F+ CQ       N      I+V+E    S E     N   + T++ +   +S
Sbjct: 715  SSSFPASKFEDCQ------ANIGEELGIQVSE--PPSTESQYKENTSEKCTSVQEFPASS 766

Query: 293  NDQFDDSLFRAVRLDEVLDESFELAGCYRQPKPVLFILLRPYEDDLQISVICGIRESDER 472
            N + +    R V+++  ++++ EL GCY  P PV  +LLR   +++ I V+    E   R
Sbjct: 767  NLKLN----RDVKINNEMEKTVELLGCYFHPMPVSSVLLRTVGNEIYILVLSFATEDRVR 822

Query: 473  FVFIYKVPLKDQGEICPYFIGYTSLMLPLLPGPAIGNTMYKGSCLQFTPDGQSIVFVSTI 652
             +F+YK+  +   +  P  IG+T  +LP++   + GN   + S L FTPDG  ++    I
Sbjct: 823  TLFMYKMSAEAPSKGFPSIIGHTPAILPIVDDKSSGNGTLEISNLHFTPDGLHLILTGNI 882

Query: 653  RAPLCRTQSMHCSCSMCTSVCCEENAVLIGKVFMGYVLPSATLTTIESLSCIAVYEPNYV 832
            + P CR +   CSC +CTS C EENAV I +V  G+V     L   +S+ C+ V +PN +
Sbjct: 883  KTPYCRKRETDCSCLICTSACFEENAVRIVQVKTGHVSLVTKLQADDSVQCVVVCDPNNL 942

Query: 833  IAAELSGTLRVWLMNCNWSKSLEEFVLPSFDYLRPAV-ELKTVPKSDCLLVGHNGIGSFG 1009
            IAA  SG L VW MN +WS   EE+V+ +   +   + ELK +PK   L++GHNGIG F 
Sbjct: 943  IAAVKSGNLIVWAMNSHWSGPTEEYVILANPCISSCIMELKKIPKCPHLVIGHNGIGEFT 1002

Query: 1010 LWNISKRVLLSRFLNPGNLIFQILPIGMFNFQDEINTSSCQ-NMKLMQEISHHRVAEAV- 1183
            +W+ISKR L+SRF++P NLIF+ +P  +F +    + S+ + N+ ++   +    ++ V 
Sbjct: 1003 IWDISKRSLVSRFVSPSNLIFEFIPTSLFAWHPVHSHSTIEDNVDMILAATKLWFSKGVN 1062

Query: 1184 ---ATPLQ-DDNAVWILVSADTDSDPQVVNHPKERQPTSNASWRPALLVRNIVVMGHMVD 1351
                 P +  D A+W+LVS D DSD +      +R  +    WR ALLV++ +++G  +D
Sbjct: 1063 NKTLVPAEVKDTAIWLLVSTDLDSDAKC-----DRVESPVRCWRLALLVKDQLILGSQLD 1117

Query: 1352 FRASSVDVSDGYGFIGTHDGVLYKWELSTGKKLSNLLRVQSGSIMCTAVDSKSGVVAVAD 1531
             RA       G+G  GT DG++Y W+LSTG KL +L   +   + C + D  S  + +A 
Sbjct: 1118 PRADVAGTISGHGVAGTLDGLVYMWDLSTGTKLGSLHDFKGQRVSCISTDD-SRNICIAS 1176

Query: 1532 EKCRLLI 1552
            E  +LL+
Sbjct: 1177 EDGQLLV 1183


Top