BLASTX nr result

ID: Rehmannia22_contig00002719 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00002719
         (2385 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006349328.1| PREDICTED: YTH domain family protein 1-like ...   623   e-175
ref|XP_004230452.1| PREDICTED: uncharacterized protein LOC101267...   605   e-170
emb|CBI29706.3| unnamed protein product [Vitis vinifera]              602   e-169
ref|XP_002262918.1| PREDICTED: uncharacterized protein LOC100249...   601   e-169
emb|CAN72774.1| hypothetical protein VITISV_026284 [Vitis vinifera]   582   e-163
gb|EMJ23995.1| hypothetical protein PRUPE_ppa003557mg [Prunus pe...   551   e-154
ref|XP_006431655.1| hypothetical protein CICLE_v10000713mg [Citr...   522   e-145
ref|XP_006471138.1| PREDICTED: uncharacterized protein LOC102630...   522   e-145
ref|XP_006431654.1| hypothetical protein CICLE_v10000713mg [Citr...   502   e-139
ref|XP_006471139.1| PREDICTED: uncharacterized protein LOC102630...   502   e-139
ref|XP_002526452.1| yth domain-containing protein, putative [Ric...   499   e-138
gb|EOX97053.1| Yth domain-containing protein, putative isoform 1...   496   e-137
gb|EOX97056.1| Yth domain-containing protein, putative isoform 4...   493   e-136
ref|XP_006385033.1| hypothetical protein POPTR_0004s23250g [Popu...   491   e-136
gb|EXB29044.1| hypothetical protein L484_018461 [Morus notabilis]     490   e-135
gb|EOX97055.1| Yth domain-containing protein, putative isoform 3...   490   e-135
gb|EOX97054.1| Yth domain-containing protein, putative isoform 2...   476   e-131
ref|XP_006389534.1| hypothetical protein POPTR_0022s00680g [Popu...   475   e-131
gb|EOX97058.1| Yth domain-containing protein, putative isoform 6...   472   e-130
ref|XP_002331108.1| predicted protein [Populus trichocarpa]           445   e-122

>ref|XP_006349328.1| PREDICTED: YTH domain family protein 1-like [Solanum tuberosum]
          Length = 570

 Score =  623 bits (1607), Expect = e-175
 Identities = 319/581 (54%), Positives = 409/581 (70%), Gaps = 8/581 (1%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVDQ 526
            M+GEK IE  E + PG  SDPS KL EK++   K+G  S+SVSS+ AV     GI   DQ
Sbjct: 1    MAGEKIIEKPEAVAPGLKSDPSNKLIEKDLVSKKDGKASDSVSSLGAVTGIKGGI---DQ 57

Query: 527  PPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPGF 706
            PPA+EQG                NGT+ Q D+Q ++NA      G+QSDN SLLYY+PG+
Sbjct: 58   PPAAEQGAYYPPTSYCDYYYPGYNGTYNQLDEQAYFNA------GVQSDNGSLLYYMPGY 111

Query: 707  GPYATGYMGVDGKPTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARSGSVKPV 886
             PY+ G++G DGK  Y S+ YLQQP SYGSD+ PCY++ S Y  +++   A + G+VK  
Sbjct: 112  NPYSAGFVGGDGKQPYPSSGYLQQPVSYGSDSMPCYTWGSPYCADITNSAAPKPGNVKST 171

Query: 887  MGPSGSGKSNGYNIPKTNTNFSSKA--LPFNSKVQQS---SNFSKSIYQNQSLKPLNKLG 1051
             G +GS KSNG+N  KTN++FSSK+  + FN K + S   SN  KS++Q Q   P+NK  
Sbjct: 172  FGRNGSVKSNGFNSTKTNSSFSSKSSTVLFNPKSRPSTAMSNPPKSVHQAQPFNPVNKFQ 231

Query: 1052 SSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGR-EILRNA 1228
            S  Q+ GLMKG++    + S+T++N G F  Y+P+N Q+N ++W  N R K R    RN 
Sbjct: 232  SDVQSGGLMKGFHLVGDYPSYTSQNQGFFMPYDPINCQTNSRMWNGNYRIKPRGNFTRNG 291

Query: 1229 DLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVIKSY 1408
              EA  EL RGPR++ R+  P   +AE +     ++ +KYN ++F+ +YDNAKFY+IKSY
Sbjct: 292  VFEATNELPRGPRANGRS-VPSKPSAEEDQLVPTVQREKYNKEDFKTQYDNAKFYIIKSY 350

Query: 1409 SEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFVGVA 1588
            SEDD+HKC+KYDVWSSTPNGNKKLD AF EA+AK+S TG+ CPVFLFFSVNGSGQF+GVA
Sbjct: 351  SEDDIHKCVKYDVWSSTPNGNKKLDTAFVEAEAKSSGTGSSCPVFLFFSVNGSGQFLGVA 410

Query: 1589 EMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRDTQE 1768
            EM+GQVDF++NMDFWQLDKW+GFFP+KWHI+KDVPNT  RHIILENN+NRPVTYSRDTQE
Sbjct: 411  EMVGQVDFNRNMDFWQLDKWSGFFPLKWHIVKDVPNTQFRHIILENNDNRPVTYSRDTQE 470

Query: 1769 IGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYKKQF 1948
            IGLK+GLEML+I K+YSE+TS+LDDFNFYE REK LKAKR++KPA Q +  +  D  KQF
Sbjct: 471  IGLKEGLEMLNILKNYSEKTSILDDFNFYEKREKVLKAKRSSKPAIQADVYEKADSLKQF 530

Query: 1949 EAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSSI 2071
            + G+    E+  KT            TKNLS+NS+P KSS+
Sbjct: 531  KGGDKVL-EEELKTNSADPTAPLISLTKNLSINSRPFKSSV 570


>ref|XP_004230452.1| PREDICTED: uncharacterized protein LOC101267743 [Solanum
            lycopersicum]
          Length = 563

 Score =  605 bits (1561), Expect = e-170
 Identities = 309/573 (53%), Positives = 400/573 (69%), Gaps = 8/573 (1%)
 Frame = +2

Query: 377  ASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVDQPPASEQGV 550
            A+E + PG  SDPS KL EK++   K+G  ++SV+S+ AV          DQPPA+EQG 
Sbjct: 2    AAEAVAPGLKSDPSNKLIEKDLVSKKDGKAADSVASLGAVTSIKCEN---DQPPAAEQGA 58

Query: 551  XXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPGFGPYATGYM 730
                           NGT+ Q D+Q ++NA      G+QSDN SLLYY+PG+ PY+ G++
Sbjct: 59   YYPPTSYCDYYYPGYNGTYNQLDEQAYFNA------GVQSDNGSLLYYMPGYNPYSAGFV 112

Query: 731  GVDGKPTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARSGSVKPVMGPSGSGK 910
            G DGK  Y S+ YLQQP SYGSD+ PCY++ S Y  +++   A +SG+VK   G +GS K
Sbjct: 113  GGDGKQPYPSSGYLQQPVSYGSDSMPCYTWGSPYCADITNSAAPKSGNVKSTFGRNGSVK 172

Query: 911  SNGYNIPKTNTNFSSK--ALPFNSKVQQS---SNFSKSIYQNQSLKPLNKLGSSFQTTGL 1075
            SNG+N  KTN++FSSK   + FN K + +   SN  KS +Q Q   P+NK  S  Q+ GL
Sbjct: 173  SNGFNSTKTNSSFSSKNSTVLFNPKSRPATAMSNPPKSFHQAQPFNPVNKFQSDVQSGGL 232

Query: 1076 MKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGR-EILRNADLEALTEL 1252
            MKG++    + S+T++N G F  Y+P+N Q+N ++W  N R K R    RN   EA  EL
Sbjct: 233  MKGFHLVGDYPSYTSQNQGFFMPYDPINCQTNSRMWNGNYRAKPRGNFTRNGVFEATNEL 292

Query: 1253 TRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVIKSYSEDDVHKC 1432
             RGPR++ R+  P   +AE +    A++ +KYN ++F+ +YDNAKFY+IKSYSEDD+HKC
Sbjct: 293  PRGPRANGRS-VPSKPSAEEDQLVPAVQREKYNKEDFKTQYDNAKFYIIKSYSEDDIHKC 351

Query: 1433 IKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFVGVAEMIGQVDF 1612
            +KYDVWSSTPNGNKKLD AF E++AK S TG+ CPVFLFFSVNGSGQF+GVAEM+GQVDF
Sbjct: 352  VKYDVWSSTPNGNKKLDTAFVESEAKASGTGSSCPVFLFFSVNGSGQFLGVAEMVGQVDF 411

Query: 1613 SKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRDTQEIGLKQGLE 1792
            ++NMDFWQLDKW+GFFP+KWHI+KDVPNT  RHIILENN+NRPVTYSRDTQEIGLK+GLE
Sbjct: 412  NRNMDFWQLDKWSGFFPLKWHIVKDVPNTQFRHIILENNDNRPVTYSRDTQEIGLKEGLE 471

Query: 1793 MLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYKKQFEAGEGTAG 1972
            ML+I K+YSE+TS+LDDFNFYE REK LKAKR++KP  Q +  +  D  KQF+ G+    
Sbjct: 472  MLNILKNYSEKTSILDDFNFYEKREKVLKAKRSSKPVIQADAYEKADSLKQFKGGDKVL- 530

Query: 1973 EQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSSI 2071
            E+  KT            TKNLS+NS+P KSS+
Sbjct: 531  EEELKTNSTDPTAPLVSLTKNLSINSRPFKSSV 563


>emb|CBI29706.3| unnamed protein product [Vitis vinifera]
          Length = 708

 Score =  602 bits (1551), Expect = e-169
 Identities = 326/592 (55%), Positives = 413/592 (69%), Gaps = 19/592 (3%)
 Frame = +2

Query: 350  KMSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ-VV 520
            +M+ EK  E SE +  G  SD   KL+++++  GK+GI S+S SS+++ GDA + ++   
Sbjct: 100  EMAAEKTFETSEQVTMGLKSDTFTKLTKQDVVSGKDGIPSDSTSSLTSSGDATASVKGET 159

Query: 521  DQPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLP 700
            DQ   +EQGV               NG   Q DD G+YNA G  YTG+QSDN  ++YYLP
Sbjct: 160  DQESVAEQGVYYPPTSCYNYYYPGYNGALNQSDDHGYYNADGS-YTGVQSDN-GMVYYLP 217

Query: 701  GFGPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAAR 865
            G+ PYA+G  MGVDG+    P Y S+ YLQQP  YG++A PCYS+DSTY G+ + GT A 
Sbjct: 218  GYNPYASGTLMGVDGQCVSQPPYFSSGYLQQPVPYGTEAVPCYSWDSTYVGDAANGTNAN 277

Query: 866  SGSVKPVMGPSGSGKSNGYNIPKTNTNFSSK-ALPFNSKVQQS---SNFSKSIYQNQSLK 1033
             G++K    P+ S K+N +   K N   ++K +LPF+SK  QS   SNFSKSI+Q+Q LK
Sbjct: 278  FGNIKSGSRPTASAKANNFPSMKANGTVANKYSLPFDSKSHQSAAPSNFSKSIFQSQPLK 337

Query: 1034 PLNK---LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYK 1204
            PLNK   LGS F   G  KG+NP SKFSSFTN+  G F     +N++ N + W  N +YK
Sbjct: 338  PLNKASHLGSDFPA-GFAKGFNPVSKFSSFTNQKQGFFPHNGVMNYRPNSRAWNGNEKYK 396

Query: 1205 GREIL-RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDN 1381
             RE   RN   E+ TELT GPR+ +RN +P  +A E E  GL +  D+YNLQ+F+ EY+N
Sbjct: 397  LREKSNRNGHFESSTELTCGPRARNRN-APLNSATEKEELGLMVRRDQYNLQDFQTEYEN 455

Query: 1382 AKFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVN 1561
            AKFYVIKS+SEDD+HKCIKYDVW+STPNGNKKLDAAF +A+AK +ETGTK P+FLFFSVN
Sbjct: 456  AKFYVIKSFSEDDIHKCIKYDVWASTPNGNKKLDAAFHDAEAKANETGTKFPIFLFFSVN 515

Query: 1562 GSGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRP 1741
            GSGQFVGVAEM+GQVDF+K+MDFWQLDKWNGFFP+KWHI+KD+PN+ LRHI LE+NENR 
Sbjct: 516  GSGQFVGVAEMVGQVDFNKDMDFWQLDKWNGFFPVKWHIVKDIPNSQLRHITLESNENRS 575

Query: 1742 VTYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAK--PASQTN 1915
            VTY+RDTQEIGLKQG+EML IFK+YS RTS+ DDFNFYENREKSL A+R++K  P SQ  
Sbjct: 576  VTYTRDTQEIGLKQGVEMLKIFKNYSARTSMFDDFNFYENREKSLHARRSSKPPPPSQME 635

Query: 1916 GPKNGDYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNS-QPLKSS 2068
               +GD   +   GE    E+ A+T            TKNLSL++  P K+S
Sbjct: 636  IYGSGDDLPKHLHGEERKTEEPARTSRSHDPKSLINLTKNLSLSTPHPPKNS 687


>ref|XP_002262918.1| PREDICTED: uncharacterized protein LOC100249242 [Vitis vinifera]
          Length = 608

 Score =  601 bits (1550), Expect = e-169
 Identities = 326/591 (55%), Positives = 412/591 (69%), Gaps = 19/591 (3%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ-VVD 523
            M+ EK  E SE +  G  SD   KL+++++  GK+GI S+S SS+++ GDA + ++   D
Sbjct: 1    MAAEKTFETSEQVTMGLKSDTFTKLTKQDVVSGKDGIPSDSTSSLTSSGDATASVKGETD 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q   +EQGV               NG   Q DD G+YNA G  YTG+QSDN  ++YYLPG
Sbjct: 61   QESVAEQGVYYPPTSCYNYYYPGYNGALNQSDDHGYYNADGS-YTGVQSDN-GMVYYLPG 118

Query: 704  FGPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARS 868
            + PYA+G  MGVDG+    P Y S+ YLQQP  YG++A PCYS+DSTY G+ + GT A  
Sbjct: 119  YNPYASGTLMGVDGQCVSQPPYFSSGYLQQPVPYGTEAVPCYSWDSTYVGDAANGTNANF 178

Query: 869  GSVKPVMGPSGSGKSNGYNIPKTNTNFSSK-ALPFNSKVQQS---SNFSKSIYQNQSLKP 1036
            G++K    P+ S K+N +   K N   ++K +LPF+SK  QS   SNFSKSI+Q+Q LKP
Sbjct: 179  GNIKSGSRPTASAKANNFPSMKANGTVANKYSLPFDSKSHQSAAPSNFSKSIFQSQPLKP 238

Query: 1037 LNK---LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKG 1207
            LNK   LGS F   G  KG+NP SKFSSFTN+  G F     +N++ N + W  N +YK 
Sbjct: 239  LNKASHLGSDFPA-GFAKGFNPVSKFSSFTNQKQGFFPHNGVMNYRPNSRAWNGNEKYKL 297

Query: 1208 REIL-RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNA 1384
            RE   RN   E+ TELT GPR+ +RN +P  +A E E  GL +  D+YNLQ+F+ EY+NA
Sbjct: 298  REKSNRNGHFESSTELTCGPRARNRN-APLNSATEKEELGLMVRRDQYNLQDFQTEYENA 356

Query: 1385 KFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNG 1564
            KFYVIKS+SEDD+HKCIKYDVW+STPNGNKKLDAAF +A+AK +ETGTK P+FLFFSVNG
Sbjct: 357  KFYVIKSFSEDDIHKCIKYDVWASTPNGNKKLDAAFHDAEAKANETGTKFPIFLFFSVNG 416

Query: 1565 SGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPV 1744
            SGQFVGVAEM+GQVDF+K+MDFWQLDKWNGFFP+KWHI+KD+PN+ LRHI LE+NENR V
Sbjct: 417  SGQFVGVAEMVGQVDFNKDMDFWQLDKWNGFFPVKWHIVKDIPNSQLRHITLESNENRSV 476

Query: 1745 TYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAK--PASQTNG 1918
            TY+RDTQEIGLKQG+EML IFK+YS RTS+ DDFNFYENREKSL A+R++K  P SQ   
Sbjct: 477  TYTRDTQEIGLKQGVEMLKIFKNYSARTSMFDDFNFYENREKSLHARRSSKPPPPSQMEI 536

Query: 1919 PKNGDYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNS-QPLKSS 2068
              +GD   +   GE    E+ A+T            TKNLSL++  P K+S
Sbjct: 537  YGSGDDLPKHLHGEERKTEEPARTSRSHDPKSLINLTKNLSLSTPHPPKNS 587


>emb|CAN72774.1| hypothetical protein VITISV_026284 [Vitis vinifera]
          Length = 812

 Score =  582 bits (1501), Expect = e-163
 Identities = 312/548 (56%), Positives = 391/548 (71%), Gaps = 21/548 (3%)
 Frame = +2

Query: 353  MSGEKNIEASEL---IVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ- 514
            M+ EK  E   L   +  G  SD   KL+++++  GK+GI S+S SS+++ GDA + ++ 
Sbjct: 1    MAAEKTFETCILAYKVTMGLKSDTFTKLTKQDVVSGKDGIPSDSTSSLTSSGDATASVKG 60

Query: 515  VVDQPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYY 694
              DQ   +EQGV               NG   Q DD G+YNA G  YTG+QSDN  ++YY
Sbjct: 61   ETDQESVAEQGVYYPPTSCYNYYYPGYNGALNQSDDHGYYNADGS-YTGVQSDN-GMVYY 118

Query: 695  LPGFGPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTA 859
            LPG+ PYA+G  MGVDG+    P Y S+ YLQQP  YG++A PCYS+DSTY G+ + GT 
Sbjct: 119  LPGYNPYASGTLMGVDGQCVSQPPYFSSGYLQQPVPYGTEAVPCYSWDSTYVGDAANGTN 178

Query: 860  ARSGSVKPVMGPSGSGKSNGYNIPKTNTNFSSK-ALPFNSKVQQS---SNFSKSIYQNQS 1027
            A  G++K    P+ S K+N +   K N   ++K +LPF+SK + S   SNFSKSI+Q+Q 
Sbjct: 179  ANFGNIKSGSRPTASAKANNFPSMKANGTVANKYSLPFDSKSRXSAAPSNFSKSIFQSQP 238

Query: 1028 LKPLNK---LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCR 1198
            LKPLNK   LGS F   G  KG+NP SKFSSFTN+  G F     +N++ N + W  N +
Sbjct: 239  LKPLNKASHLGSDFPA-GFAKGFNPVSKFSSFTNQKQGFFPHNGVMNYRPNSRAWNGNEK 297

Query: 1199 YKGREIL-RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEY 1375
            YK RE   RN   E+ TELT GPR+ +RN SP  +A E E  GL +  D+YNLQ+F+ EY
Sbjct: 298  YKLREKSNRNGHFESSTELTCGPRARNRN-SPLNSATEKEELGLMVRRDQYNLQDFQTEY 356

Query: 1376 DNAKFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFS 1555
            +NAKFYVIKS+SEDD+HKCIKYDVW+STPNGNKKLDAAF +A+AK +ETGTK P+FLFFS
Sbjct: 357  ENAKFYVIKSFSEDDIHKCIKYDVWASTPNGNKKLDAAFHDAEAKANETGTKFPIFLFFS 416

Query: 1556 VNGSGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNEN 1735
            VNGSGQFVGVAEM+GQVDF+K+MDFWQLDKWNGFFP+KWHI+KD+PN+ LRHI LE+NEN
Sbjct: 417  VNGSGQFVGVAEMVGQVDFNKDMDFWQLDKWNGFFPVKWHIVKDIPNSQLRHITLESNEN 476

Query: 1736 RPVTYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAK--PASQ 1909
            R VTY+RDTQEIGLKQG+EML IFK+YS RTS+ DDFNFYENREKSL A+R++K  P SQ
Sbjct: 477  RSVTYTRDTQEIGLKQGVEMLKIFKNYSARTSMFDDFNFYENREKSLHARRSSKPPPPSQ 536

Query: 1910 TNGPKNGD 1933
                 NGD
Sbjct: 537  MEIYGNGD 544


>gb|EMJ23995.1| hypothetical protein PRUPE_ppa003557mg [Prunus persica]
          Length = 566

 Score =  551 bits (1421), Expect = e-154
 Identities = 298/582 (51%), Positives = 390/582 (67%), Gaps = 14/582 (2%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ-VVD 523
            M+GEK IE  E I     +D    LS++E+  GK+GI S+ + S+S++ D+ S I+   D
Sbjct: 1    MAGEKKIEKGEPISTVLTADSVTGLSQQEVVSGKDGIPSDPIPSMSSLLDSNSSIKGETD 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E GV               NG+F Q DD G+ +A    +TG+QSDN S++YYLPG
Sbjct: 61   QDSVGEHGVYYQPTSCYNYYYPGYNGSFTQMDDHGYLHA-NNQHTGVQSDNGSMVYYLPG 119

Query: 704  FGPYATG-YMGVDGKPT-----YASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAAR 865
            + PYA G  MG+DG+       + S+ Y+Q P SYGS+A PCYS+D+T+ G+VST   + 
Sbjct: 120  YNPYAPGTLMGIDGQGVGQQQYFPSSGYMQPPVSYGSEAAPCYSWDTTFVGDVSTAANSG 179

Query: 866  SGSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK 1045
             G+VK   G +   KS G+   KTN N  S+             FSKS+   Q  K LNK
Sbjct: 180  FGNVKVGPGSAALSKSGGFISTKTNGNLHSR-------------FSKSLPHTQPFKSLNK 226

Query: 1046 ---LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREI 1216
               LG+ F + GL+KGYNPA +FSSF N+  G F     +N++SN ++   N R+K RE 
Sbjct: 227  VSHLGNDF-SAGLLKGYNPAGRFSSFANQKYGLFPPNGHMNYKSNARILNGNDRFKSREN 285

Query: 1217 L-RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFY 1393
              RN D E+ TELTRGPRS +++ +P  +A E E     +  D+YNL +F+ +Y+ AKFY
Sbjct: 286  YNRNEDFESSTELTRGPRSRNKS-APLDSAIEKEELSFTVHRDQYNLPDFQTDYEKAKFY 344

Query: 1394 VIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQ 1573
            VIKSYSEDDVHK IKYDVW+STPNGNKKLDA+FR+A++K+ ETGT+CP+FLFFSVNGSGQ
Sbjct: 345  VIKSYSEDDVHKSIKYDVWASTPNGNKKLDASFRDAESKSRETGTQCPIFLFFSVNGSGQ 404

Query: 1574 FVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYS 1753
            F+G+AEM GQVDF+K+MDFWQ+DKW+GFFP+KWH+IKD+PNT LRHIILENN+NRPVT++
Sbjct: 405  FIGLAEMAGQVDFNKDMDFWQVDKWSGFFPVKWHVIKDIPNTQLRHIILENNDNRPVTFT 464

Query: 1754 RDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPAS-QTNGPKNG 1930
            RDTQEIGLKQGLEML+IFK Y+ +TS+LDDF FYE+REKSLKAKR++KPA+ +     N 
Sbjct: 465  RDTQEIGLKQGLEMLNIFKSYTAKTSLLDDFIFYEDREKSLKAKRSSKPATLKMETYDNN 524

Query: 1931 DYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQP 2056
            D  K   +G     ++SA              TKNLSLN  P
Sbjct: 525  DITKHINSGGRNVDDESAGIRMASDRASLISLTKNLSLNGCP 566


>ref|XP_006431655.1| hypothetical protein CICLE_v10000713mg [Citrus clementina]
            gi|567878195|ref|XP_006431656.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
            gi|557533777|gb|ESR44895.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
            gi|557533778|gb|ESR44896.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
          Length = 572

 Score =  522 bits (1345), Expect = e-145
 Identities = 289/584 (49%), Positives = 379/584 (64%), Gaps = 12/584 (2%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ-VVD 523
            M+GEKNI   E +      +P  K + +++  GK+G  S+S +S++  G A SG++  +D
Sbjct: 1    MAGEKNIIKDEPVATELKGNPISKSTGQDVASGKDGAASDSTASMATSGHAASGMKGEID 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E G                NG+F Q D+ G+ +  G  ++G+ SDN SLLYYLPG
Sbjct: 61   QESVGEYGAQNPSTVHYNYYYPGSNGSFSQVDNNGYIHTDGS-HSGVHSDNGSLLYYLPG 119

Query: 704  FGPYATGYMGVDGK-----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARS 868
            + PY+T  +GVDG+     P ++S+ YLQ P SYGS+  PCYS+DSTY  ++  G A   
Sbjct: 120  YDPYST-LVGVDGQCVGQQPYFSSSGYLQHPVSYGSEVMPCYSWDSTYVADIQNGNAVGF 178

Query: 869  GSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNKL 1048
            G+ K   G +   KSNG N  K N  F++K              SKS Y  QS KP++K+
Sbjct: 179  GNEK-YGGSTAFAKSNGLNSVKKNGCFTNKV-------------SKSSY-TQSTKPVSKV 223

Query: 1049 GS--SFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGRE-IL 1219
                S  + G +KG NP   FS+F+N+  G F   N VN+ +N ++W  N RYK R+   
Sbjct: 224  TQLDSDLSAGFLKGSNPLGNFSAFSNQKQGFFP--NMVNYSTNGRMWNGNDRYKSRDKFS 281

Query: 1220 RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVI 1399
            R   L   TEL RGPR+++++ S +++  + E P   +  D+YNL +F+VEY+  KFYVI
Sbjct: 282  RAGGLGMPTELIRGPRAENKSASLEISDKK-EVPSPTVSRDQYNLPDFQVEYEKVKFYVI 340

Query: 1400 KSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFV 1579
            KSYSEDD+HKCIKYDVWSSTPNGNKKLDA F EA+AK  ETGT+CP+FLFFSVNGSGQFV
Sbjct: 341  KSYSEDDIHKCIKYDVWSSTPNGNKKLDATFNEAEAKADETGTRCPIFLFFSVNGSGQFV 400

Query: 1580 GVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRD 1759
            G+AEM+G+VDF+K+MDFWQLDKWNGFFP+KWH+IKDVPNTLLRHI LENNEN+PVT+SRD
Sbjct: 401  GLAEMMGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDVPNTLLRHITLENNENKPVTHSRD 460

Query: 1760 TQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPAS-QTNGPKNGDY 1936
            TQEIGLKQGLEML IFK YS +TS+LDDFNFYEN+E+S   K+++KPA+ Q +   + D+
Sbjct: 461  TQEIGLKQGLEMLKIFKSYSAKTSLLDDFNFYENKERSFHGKKSSKPATLQMDIFNDDDF 520

Query: 1937 KKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
             KQ ++ E    E S               TKNLSL     K S
Sbjct: 521  TKQIKSAEKEFDEDSIS---------IINLTKNLSLKPCTQKKS 555


>ref|XP_006471138.1| PREDICTED: uncharacterized protein LOC102630620 isoform X1 [Citrus
            sinensis]
          Length = 572

 Score =  522 bits (1344), Expect = e-145
 Identities = 293/586 (50%), Positives = 382/586 (65%), Gaps = 14/586 (2%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ-VVD 523
            M+GEKNI   E +      +P  K + +++  GK+G  S+S +S++  G A SG++  +D
Sbjct: 1    MAGEKNIIKDEPVATELKGNPISKSTGQDVASGKDGAASDSTASMATSGHAASGMKGEID 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E G                NG+F Q D+ G+ +  G  ++G+ SDN SLLYYLPG
Sbjct: 61   QESVGEYGAQNPSTVHYNYYYPGSNGSFSQVDNNGYIHTDGS-HSGVHSDNGSLLYYLPG 119

Query: 704  FGPYATGYMGVDGK-----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARS 868
            + PY+T  +GVDG+     P ++S+ YLQ P SYGS+  PCYS+DSTY  ++  G A   
Sbjct: 120  YDPYST-IVGVDGQCVGQQPYFSSSGYLQHPVSYGSEVMPCYSWDSTYVADIQNGNAVGF 178

Query: 869  GSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK- 1045
            G+ K   G +   KSNG N  K N  F++K              SKS Y  QS KP++K 
Sbjct: 179  GNEK-YGGSTAFAKSNGLNSVKKNGCFTNKV-------------SKSSY-TQSTKPVSKV 223

Query: 1046 --LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGRE-I 1216
              LGS   + G +KG +P   FS+F+N+  G F   N VN+ +N ++W  N RYK R+  
Sbjct: 224  TQLGSDL-SAGFLKGSDPLGNFSAFSNQKQGFFP--NMVNYSTNGRMWNGNDRYKSRDKF 280

Query: 1217 LRNADLEALTELTRGPRSDSRNNSPKLA-AAEVESPGLAIETDKYNLQEFRVEYDNAKFY 1393
             R   L   TEL RGPR+++++ S +++   EV SP   +  D+YNL +F+VEY+ AKFY
Sbjct: 281  SRAGGLGMPTELIRGPRAENKSASLEISDKKEVLSP--TVSRDQYNLPDFQVEYEKAKFY 338

Query: 1394 VIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQ 1573
            VIKSYSEDD+HKCIKYDVWSSTPNGNKKLDA F EA+AK  ETGT+CP+FLFFSVNGSGQ
Sbjct: 339  VIKSYSEDDIHKCIKYDVWSSTPNGNKKLDATFNEAEAKADETGTRCPIFLFFSVNGSGQ 398

Query: 1574 FVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYS 1753
            FVG+AEM+G+VDF+K+MDFWQLDKWNGFFP+KWH+IKDVPNTLLRHI LENNEN+PVT+S
Sbjct: 399  FVGLAEMMGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDVPNTLLRHITLENNENKPVTHS 458

Query: 1754 RDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPAS-QTNGPKNG 1930
            RDTQEIGLKQGLEML IFK YS +TS+LDDFNFYEN+E+S   K+++KPA+ Q +   + 
Sbjct: 459  RDTQEIGLKQGLEMLKIFKSYSAKTSLLDDFNFYENKERSFHGKKSSKPATLQMDIFNDD 518

Query: 1931 DYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            D+ KQ ++ E    E S               TKNLSL     K S
Sbjct: 519  DFTKQIKSAEKEFDEDSIS---------IINLTKNLSLKPCTQKKS 555


>ref|XP_006431654.1| hypothetical protein CICLE_v10000713mg [Citrus clementina]
            gi|557533776|gb|ESR44894.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
          Length = 528

 Score =  502 bits (1293), Expect = e-139
 Identities = 274/536 (51%), Positives = 352/536 (65%), Gaps = 10/536 (1%)
 Frame = +2

Query: 491  GDAMSGIQ-VVDQPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQ 667
            G A SG++  +DQ    E G                NG+F Q D+ G+ +  G  ++G+ 
Sbjct: 5    GHAASGMKGEIDQESVGEYGAQNPSTVHYNYYYPGSNGSFSQVDNNGYIHTDGS-HSGVH 63

Query: 668  SDNTSLLYYLPGFGPYATGYMGVDGK-----PTYASTEYLQQPCSYGSDAFPCYSYDSTY 832
            SDN SLLYYLPG+ PY+T  +GVDG+     P ++S+ YLQ P SYGS+  PCYS+DSTY
Sbjct: 64   SDNGSLLYYLPGYDPYST-LVGVDGQCVGQQPYFSSSGYLQHPVSYGSEVMPCYSWDSTY 122

Query: 833  SGNVSTGTAARSGSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSI 1012
              ++  G A   G+ K   G +   KSNG N  K N  F++K              SKS 
Sbjct: 123  VADIQNGNAVGFGNEK-YGGSTAFAKSNGLNSVKKNGCFTNKV-------------SKSS 168

Query: 1013 YQNQSLKPLNKLGS--SFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWY 1186
            Y  QS KP++K+    S  + G +KG NP   FS+F+N+  G F   N VN+ +N ++W 
Sbjct: 169  Y-TQSTKPVSKVTQLDSDLSAGFLKGSNPLGNFSAFSNQKQGFFP--NMVNYSTNGRMWN 225

Query: 1187 NNCRYKGRE-ILRNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEF 1363
             N RYK R+   R   L   TEL RGPR+++++ S +++  + E P   +  D+YNL +F
Sbjct: 226  GNDRYKSRDKFSRAGGLGMPTELIRGPRAENKSASLEISDKK-EVPSPTVSRDQYNLPDF 284

Query: 1364 RVEYDNAKFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVF 1543
            +VEY+  KFYVIKSYSEDD+HKCIKYDVWSSTPNGNKKLDA F EA+AK  ETGT+CP+F
Sbjct: 285  QVEYEKVKFYVIKSYSEDDIHKCIKYDVWSSTPNGNKKLDATFNEAEAKADETGTRCPIF 344

Query: 1544 LFFSVNGSGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILE 1723
            LFFSVNGSGQFVG+AEM+G+VDF+K+MDFWQLDKWNGFFP+KWH+IKDVPNTLLRHI LE
Sbjct: 345  LFFSVNGSGQFVGLAEMMGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDVPNTLLRHITLE 404

Query: 1724 NNENRPVTYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPA 1903
            NNEN+PVT+SRDTQEIGLKQGLEML IFK YS +TS+LDDFNFYEN+E+S   K+++KPA
Sbjct: 405  NNENKPVTHSRDTQEIGLKQGLEMLKIFKSYSAKTSLLDDFNFYENKERSFHGKKSSKPA 464

Query: 1904 S-QTNGPKNGDYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            + Q +   + D+ KQ ++ E    E S               TKNLSL     K S
Sbjct: 465  TLQMDIFNDDDFTKQIKSAEKEFDEDSIS---------IINLTKNLSLKPCTQKKS 511


>ref|XP_006471139.1| PREDICTED: uncharacterized protein LOC102630620 isoform X2 [Citrus
            sinensis]
          Length = 528

 Score =  502 bits (1292), Expect = e-139
 Identities = 278/538 (51%), Positives = 355/538 (65%), Gaps = 12/538 (2%)
 Frame = +2

Query: 491  GDAMSGIQ-VVDQPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQ 667
            G A SG++  +DQ    E G                NG+F Q D+ G+ +  G  ++G+ 
Sbjct: 5    GHAASGMKGEIDQESVGEYGAQNPSTVHYNYYYPGSNGSFSQVDNNGYIHTDGS-HSGVH 63

Query: 668  SDNTSLLYYLPGFGPYATGYMGVDGK-----PTYASTEYLQQPCSYGSDAFPCYSYDSTY 832
            SDN SLLYYLPG+ PY+T  +GVDG+     P ++S+ YLQ P SYGS+  PCYS+DSTY
Sbjct: 64   SDNGSLLYYLPGYDPYST-IVGVDGQCVGQQPYFSSSGYLQHPVSYGSEVMPCYSWDSTY 122

Query: 833  SGNVSTGTAARSGSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSI 1012
              ++  G A   G+ K   G +   KSNG N  K N  F++K              SKS 
Sbjct: 123  VADIQNGNAVGFGNEK-YGGSTAFAKSNGLNSVKKNGCFTNKV-------------SKSS 168

Query: 1013 YQNQSLKPLNK---LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVW 1183
            Y  QS KP++K   LGS   + G +KG +P   FS+F+N+  G F   N VN+ +N ++W
Sbjct: 169  Y-TQSTKPVSKVTQLGSDL-SAGFLKGSDPLGNFSAFSNQKQGFFP--NMVNYSTNGRMW 224

Query: 1184 YNNCRYKGRE-ILRNADLEALTELTRGPRSDSRNNSPKLA-AAEVESPGLAIETDKYNLQ 1357
              N RYK R+   R   L   TEL RGPR+++++ S +++   EV SP   +  D+YNL 
Sbjct: 225  NGNDRYKSRDKFSRAGGLGMPTELIRGPRAENKSASLEISDKKEVLSP--TVSRDQYNLP 282

Query: 1358 EFRVEYDNAKFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCP 1537
            +F+VEY+ AKFYVIKSYSEDD+HKCIKYDVWSSTPNGNKKLDA F EA+AK  ETGT+CP
Sbjct: 283  DFQVEYEKAKFYVIKSYSEDDIHKCIKYDVWSSTPNGNKKLDATFNEAEAKADETGTRCP 342

Query: 1538 VFLFFSVNGSGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHII 1717
            +FLFFSVNGSGQFVG+AEM+G+VDF+K+MDFWQLDKWNGFFP+KWH+IKDVPNTLLRHI 
Sbjct: 343  IFLFFSVNGSGQFVGLAEMMGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDVPNTLLRHIT 402

Query: 1718 LENNENRPVTYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAK 1897
            LENNEN+PVT+SRDTQEIGLKQGLEML IFK YS +TS+LDDFNFYEN+E+S   K+++K
Sbjct: 403  LENNENKPVTHSRDTQEIGLKQGLEMLKIFKSYSAKTSLLDDFNFYENKERSFHGKKSSK 462

Query: 1898 PAS-QTNGPKNGDYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            PA+ Q +   + D+ KQ ++ E    E S               TKNLSL     K S
Sbjct: 463  PATLQMDIFNDDDFTKQIKSAEKEFDEDSIS---------IINLTKNLSLKPCTQKKS 511


>ref|XP_002526452.1| yth domain-containing protein, putative [Ricinus communis]
            gi|223534232|gb|EEF35947.1| yth domain-containing
            protein, putative [Ricinus communis]
          Length = 582

 Score =  499 bits (1286), Expect = e-138
 Identities = 279/563 (49%), Positives = 354/563 (62%), Gaps = 21/563 (3%)
 Frame = +2

Query: 443  GKEGIQSNSVSSVSAVGDAMSGIQ-----------VVDQPPASEQGVXXXXXXXXXXXXX 589
            GK+GI S S SS+S+ GDA S I+             D+     Q +             
Sbjct: 29   GKDGIPSYSSSSISSSGDATSNIKGEADGISSIKGEADKESVGVQNIYNPPTSNYNYYYP 88

Query: 590  XXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPGFGPYATGYM-GVDGK-----PT 751
              NG F Q DD G++ A G  + G+QSDN S++YYLPG+ PYA+G + GV+G+     P 
Sbjct: 89   GYNGPFPQLDDHGYFQADGS-HVGMQSDNGSVVYYLPGYNPYASGALIGVEGQSIGQQPY 147

Query: 752  YASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARSGSVKPVMGPSGSGKSNGYNIP 931
            ++S+ YLQ P SYGS A PCYS+DSTY+G+VS G+AA         G  GS KSNG N  
Sbjct: 148  FSSSGYLQHPVSYGSAAVPCYSWDSTYAGDVSNGSAAFGN------GKYGSAKSNGLNSM 201

Query: 932  KTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK---LGSSFQTTGLMKGYNPASK 1102
            K+N N   K+             SKS Y  Q  +PLNK   LGS F + GLMKGY+    
Sbjct: 202  KSNGNIGGKS-------------SKSNYM-QPNRPLNKVSPLGSDF-SAGLMKGYHHVGN 246

Query: 1103 FSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGRE-ILRNADLEALTELTRGPRSDSR 1279
            FSSF+    G  +    +N++ N ++W  N R + R+   +  D EA +ELT GPR+   
Sbjct: 247  FSSFSAHKQGPLSHNGTMNYRQNGRMWNGNDRNRPRDKFYKTNDFEASSELTCGPRAS-- 304

Query: 1280 NNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVIKSYSEDDVHKCIKYDVWSST 1459
            N    L ++  E     +  D+YN  +F+ EY NAKFYVIKSY+EDD+HK IKY VW+ST
Sbjct: 305  NKISPLDSSAKEDLAFTVCRDQYNQADFKTEYKNAKFYVIKSYNEDDIHKSIKYAVWAST 364

Query: 1460 PNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFVGVAEMIGQVDFSKNMDFWQL 1639
            PNGNKKLDAAF EA+ ++SETGTKCP+FLFFSVNGSGQFVG+AEM+GQVDF K+MDFWQL
Sbjct: 365  PNGNKKLDAAFCEAEQRSSETGTKCPIFLFFSVNGSGQFVGLAEMVGQVDFEKDMDFWQL 424

Query: 1640 DKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRDTQEIGLKQGLEMLSIFKDYS 1819
            DKW+GFFP+KWH+IKD+PN  LRHIILENN+ RPVT+SRDTQEIG +QGLEML+IFK YS
Sbjct: 425  DKWSGFFPVKWHVIKDIPNNQLRHIILENNDKRPVTFSRDTQEIGFEQGLEMLNIFKGYS 484

Query: 1820 ERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYKKQFEAGEGTAGEQSAKTXXX 1999
             + S+LDDFNFYENRE S+  K N     +     NGD+ K  ++GE    E+ + T   
Sbjct: 485  SKASLLDDFNFYENRETSVDRKSNKLATLRMEINNNGDFPKHPKSGE-RKHEEESWTKKT 543

Query: 2000 XXXXXXXXXTKNLSLNSQPLKSS 2068
                     TKNLSLN    KS+
Sbjct: 544  SNPSSLINLTKNLSLNGYSQKSN 566


>gb|EOX97053.1| Yth domain-containing protein, putative isoform 1 [Theobroma cacao]
          Length = 573

 Score =  496 bits (1278), Expect = e-137
 Identities = 290/583 (49%), Positives = 367/583 (62%), Gaps = 11/583 (1%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVD- 523
            M+GEK  +  E +     S+   KL+E+++  GK G+ S+  S++S+     SG++    
Sbjct: 1    MAGEKMTDNPEPVSAVLKSEVVAKLAEQDVPSGKVGMPSDLTSTMSSSTYPSSGVKGESH 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E GV               NG+  Q DD  ++  A G +TG+QS+N SL+YY+PG
Sbjct: 61   QDLVGEPGVNQPTSFYNYYYPGY-NGSLVQSDDNSYF-LANGSHTGMQSENGSLVYYMPG 118

Query: 704  FGPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARS 868
            + PYATG  MGVDG+      Y S+ Y Q P SYGS+A PCY +DSTY+G V  G     
Sbjct: 119  YNPYATGTLMGVDGQCVGQQPYFSSGYFQPPVSYGSEAMPCYIWDSTYAGEVLNGNVDGF 178

Query: 869  GSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK- 1045
            G+V    G S   KSNG+N  K+N    +K LP ++               Q +K LNK 
Sbjct: 179  GNVNYGSG-SAFAKSNGFNSLKSNGLVGTK-LPKST-------------HTQPIKALNKG 223

Query: 1046 --LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREIL 1219
              LGS         GY+PA K  SF N+  G F    P+N++ N + W  N RYK     
Sbjct: 224  PHLGSDLSAGSY--GYHPAGKSPSFNNQKEGLFQHNGPMNYRLNGRGWNQNDRYKKSN-- 279

Query: 1220 RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVI 1399
            R+ D +   E+TRGPR+ +R      ++ + E  GL +  DKYN  +F+ EYDNAKF+VI
Sbjct: 280  RDFDFQNSAEVTRGPRAWNRVLD---SSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVI 336

Query: 1400 KSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFV 1579
            KSYSEDDVHK +KYDVWSSTPNGN+KLDAAF EA+A+ SETGTK P+FL FSVNGSGQFV
Sbjct: 337  KSYSEDDVHKSMKYDVWSSTPNGNRKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFV 396

Query: 1580 GVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRD 1759
            G+AEMIG+VDF+K+MDFWQLDKWNGFFP+KWH+IKD+PN  L HIILENNENR VTYSRD
Sbjct: 397  GLAEMIGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRD 456

Query: 1760 TQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYK 1939
            TQEIGLKQGLEML+IFK YS ++S+LDDF FYENREK+L AK+N KP +  N  K  D+ 
Sbjct: 457  TQEIGLKQGLEMLNIFKRYSAKSSLLDDFGFYENREKTLNAKKNYKPVTLRN--KEDDFT 514

Query: 1940 KQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            KQ +AGE    E   +T            TKNLSLN   LK+S
Sbjct: 515  KQTKAGERRVEEDLRRTKKTSDATSLINLTKNLSLNGCTLKNS 557


>gb|EOX97056.1| Yth domain-containing protein, putative isoform 4 [Theobroma cacao]
            gi|508705161|gb|EOX97057.1| Yth domain-containing
            protein, putative isoform 4 [Theobroma cacao]
          Length = 548

 Score =  493 bits (1268), Expect = e-136
 Identities = 286/582 (49%), Positives = 363/582 (62%), Gaps = 10/582 (1%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVDQ 526
            M+GEK  +  E +     S+   KL+E+++  GK G+ S+  S++S+     SG++    
Sbjct: 1    MAGEKMTDNPEPVSAVLKSEVVAKLAEQDVPSGKVGMPSDLTSTMSSSTYPSSGVK---- 56

Query: 527  PPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPGF 706
                                   NG+  Q DD  ++  A G +TG+QS+N SL+YY+PG+
Sbjct: 57   ---------------------GYNGSLVQSDDNSYF-LANGSHTGMQSENGSLVYYMPGY 94

Query: 707  GPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARSG 871
             PYATG  MGVDG+      Y S+ Y Q P SYGS+A PCY +DSTY+G V  G     G
Sbjct: 95   NPYATGTLMGVDGQCVGQQPYFSSGYFQPPVSYGSEAMPCYIWDSTYAGEVLNGNVDGFG 154

Query: 872  SVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK-- 1045
            +V    G S   KSNG+N  K+N    +K LP ++               Q +K LNK  
Sbjct: 155  NVNYGSG-SAFAKSNGFNSLKSNGLVGTK-LPKST-------------HTQPIKALNKGP 199

Query: 1046 -LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREILR 1222
             LGS         GY+PA K  SF N+  G F    P+N++ N + W  N RYK     R
Sbjct: 200  HLGSDLSAGSY--GYHPAGKSPSFNNQKEGLFQHNGPMNYRLNGRGWNQNDRYKKSN--R 255

Query: 1223 NADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVIK 1402
            + D +   E+TRGPR+ +R      ++ + E  GL +  DKYN  +F+ EYDNAKF+VIK
Sbjct: 256  DFDFQNSAEVTRGPRAWNRVLD---SSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIK 312

Query: 1403 SYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFVG 1582
            SYSEDDVHK +KYDVWSSTPNGN+KLDAAF EA+A+ SETGTK P+FL FSVNGSGQFVG
Sbjct: 313  SYSEDDVHKSMKYDVWSSTPNGNRKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVG 372

Query: 1583 VAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRDT 1762
            +AEMIG+VDF+K+MDFWQLDKWNGFFP+KWH+IKD+PN  L HIILENNENR VTYSRDT
Sbjct: 373  LAEMIGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDT 432

Query: 1763 QEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYKK 1942
            QEIGLKQGLEML+IFK YS ++S+LDDF FYENREK+L AK+N KP +  N  K  D+ K
Sbjct: 433  QEIGLKQGLEMLNIFKRYSAKSSLLDDFGFYENREKTLNAKKNYKPVTLRN--KEDDFTK 490

Query: 1943 QFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            Q +AGE    E   +T            TKNLSLN   LK+S
Sbjct: 491  QTKAGERRVEEDLRRTKKTSDATSLINLTKNLSLNGCTLKNS 532


>ref|XP_006385033.1| hypothetical protein POPTR_0004s23250g [Populus trichocarpa]
            gi|550341801|gb|ERP62830.1| hypothetical protein
            POPTR_0004s23250g [Populus trichocarpa]
          Length = 593

 Score =  491 bits (1264), Expect = e-136
 Identities = 279/599 (46%), Positives = 367/599 (61%), Gaps = 30/599 (5%)
 Frame = +2

Query: 362  EKNIEASELIVPGPISDPSIKLSEKEMGKEGIQSNSVSSVSAVGDAMSGIQVV------- 520
            EK +E +  +V  P+ +  +       GK+GI S+S  ++ + G   S  +V        
Sbjct: 8    EKRLEPNP-VVAKPVDNNVVS------GKDGIPSDSTPTILSSGSGASDTKVNGSAASTT 60

Query: 521  ----DQPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLL 688
                DQ P                     +G+F   DD G+Y A G  + G+QSDN S++
Sbjct: 61   KKEGDQEP---HAAFVPPTSSYNYQYPGYSGSFTPLDDHGYYQADGS-HMGMQSDNGSMV 116

Query: 689  YYLPGFGPYATG-YMGVDGK-----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVST 850
            YY P + PYA+G  +GV+G+     P ++S+ YLQ P SYG +  PCYS+DSTY G+VS 
Sbjct: 117  YYWPSY-PYASGTVVGVEGQSVAQQPYFSSSGYLQHPVSYGLETMPCYSWDSTYVGDVSN 175

Query: 851  GTAARSGSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSL 1030
            G A      K   G +   KS+G+N  K+N+N  SK             FSK +Y  Q  
Sbjct: 176  GNAGFENG-KSGSGSTAFAKSSGFNSVKSNSNVGSK-------------FSKPMY-TQPA 220

Query: 1031 KPLNK---LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRY 1201
            +P+ K   LGS F + GL KGY P  KF  FT +  G F    P+N++ N ++W  N R 
Sbjct: 221  RPMTKVSPLGSDF-SAGLYKGYQPMGKFPPFTGQKQGPFPHSGPLNYRQNVRMWNGNYRN 279

Query: 1202 KGREIL-RNADLEALTELTRGPRS--------DSRNNSPKLAAAEVESPGLAIETDKYNL 1354
            K R+   RN D E  TELTRGPR+        DS  N+  L ++  +  G A+  ++YNL
Sbjct: 280  KPRDRFNRNGDFENQTELTRGPRASIKNAPLDDSVKNNAPLDSSVKDMLGFAMHKEQYNL 339

Query: 1355 QEFRVEYDNAKFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKC 1534
             +F +EY NAKF+VIKSY+EDD+HK IKYDVW+STPNGNKKLDAAF  A+  +SETGTKC
Sbjct: 340  PDFEIEYSNAKFFVIKSYNEDDIHKSIKYDVWASTPNGNKKLDAAFHNAEEVSSETGTKC 399

Query: 1535 PVFLFFSVNGSGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHI 1714
            P+FLFFSVNGSGQFVG+AEM+GQVDF+K+MDFWQ+DKWNGFFP+KWH+IKD+PN  LRHI
Sbjct: 400  PIFLFFSVNGSGQFVGLAEMVGQVDFNKDMDFWQIDKWNGFFPVKWHVIKDIPNGQLRHI 459

Query: 1715 ILENNENRPVTYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNA 1894
            +LENN+   VT+SRDTQEIGL++GLEML+IFK YS +TS+LDDFNFYENREKSL  K++ 
Sbjct: 460  VLENNDGHSVTFSRDTQEIGLEKGLEMLNIFKSYSAKTSMLDDFNFYENREKSLNTKKSN 519

Query: 1895 KPAS-QTNGPKNGDYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            KPA+ +    +N D+ K   A E    E  ++             TKNLSLN    KS+
Sbjct: 520  KPATLRMEIFENSDFPKH-TAAEEKISEDDSRAKKTTNPSTLINLTKNLSLNGHNQKSN 577


>gb|EXB29044.1| hypothetical protein L484_018461 [Morus notabilis]
          Length = 549

 Score =  490 bits (1262), Expect = e-135
 Identities = 277/581 (47%), Positives = 367/581 (63%), Gaps = 15/581 (2%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQ-VVD 523
            M+GEK IE+SE +V    SDP   ++E++   GK+G+ SN ++++S   D    I+   D
Sbjct: 1    MAGEKKIESSEPVVTLLKSDPVTAVAEQDAAKGKDGVPSNLITAISTSKDVTPSIKGTTD 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E GV               NG+F Q DD G+++A  G  TG+QSDN SL++Y P 
Sbjct: 61   QGSVGEHGVYGPPYNYYLPGY---NGSFAQVDDHGYFHA-NGSNTGLQSDNGSLVFYYP- 115

Query: 704  FGPYATG-YMGVDGKPT-----YASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAAR 865
               Y +G  MGVDG+       ++S+ Y Q P SYGS+A  CYS+D T+   V  G +  
Sbjct: 116  ---YTSGPIMGVDGQGIGQQQYFSSSGYHQPPVSYGSEAMSCYSWDPTFGKEVPNGASGG 172

Query: 866  SGSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK 1045
              + K  +  +G  +SN +N  K+N + +SK             FSK +   Q +K LNK
Sbjct: 173  FPNAKSGLRSTGLARSNAFNSTKSNGSITSK-------------FSKPLLPTQPVKSLNK 219

Query: 1046 ---LGSSFQTT-GLMKGYNP--ASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKG 1207
               LGS F T  GL+KGY      +F+SF+N+  G F      N++   ++W  N R   
Sbjct: 220  VPHLGSDFSTAAGLLKGYPQPQVGRFASFSNQKQGVFPYTGFSNYKQYGRIWSGNDR--- 276

Query: 1208 REILRNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAK 1387
                 N D EA  ELTRGPRS  RN     +++E E  GLA+  D+YNL +F+ +  NAK
Sbjct: 277  -----NGDFEASAELTRGPRS--RNKDLLDSSSEKEELGLAVRRDQYNLPDFQTDNVNAK 329

Query: 1388 FYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGS 1567
            FYVIKSYSEDDVHK IKYDVW+STPNGNKKLD++F +A+AK+SE G  CP+FLFFSVNGS
Sbjct: 330  FYVIKSYSEDDVHKSIKYDVWASTPNGNKKLDSSFHDAEAKSSEMGKNCPIFLFFSVNGS 389

Query: 1568 GQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVT 1747
            GQFVG+AEMIGQVDF+K+MDFWQ+DKW+GFFP++WHI+KDVPNT LRHIILENN+N+PVT
Sbjct: 390  GQFVGIAEMIGQVDFNKDMDFWQVDKWSGFFPVRWHIVKDVPNTQLRHIILENNDNKPVT 449

Query: 1748 YSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKN 1927
            ++RDTQEIGLKQGLEML+IFK Y+ +T++LDDFNFYE+RE+SL+AKR++KPA+       
Sbjct: 450  FTRDTQEIGLKQGLEMLNIFKSYTAKTTLLDDFNFYESREQSLQAKRSSKPATL---KME 506

Query: 1928 GDYKKQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNS 2050
            G Y +      G   E                 TKNLSL++
Sbjct: 507  GIYNENDFTKRGNEVESGGAKMTSDRASSLINLTKNLSLSA 547


>gb|EOX97055.1| Yth domain-containing protein, putative isoform 3 [Theobroma cacao]
          Length = 572

 Score =  490 bits (1261), Expect = e-135
 Identities = 289/583 (49%), Positives = 366/583 (62%), Gaps = 11/583 (1%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVD- 523
            M+GEK  +  E +     S+   KL+E+++  GK G+ S+  S++S+     SG++    
Sbjct: 1    MAGEKMTDNPEPVSAVLKSEVVAKLAEQDVPSGKVGMPSDLTSTMSSSTYPSSGVKGESH 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E GV               NG+  Q DD  ++  A G +TG+QS+N SL+YY+PG
Sbjct: 61   QDLVGEPGVNQPTSFYNYYYPGY-NGSLVQSDDNSYF-LANGSHTGMQSENGSLVYYMPG 118

Query: 704  FGPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARS 868
            + PYATG  MGVDG+      Y S+ Y Q P SYGS+A PCY +DSTY+G V  G     
Sbjct: 119  YNPYATGTLMGVDGQCVGQQPYFSSGYFQPPVSYGSEAMPCYIWDSTYAGEVLNGNVDGF 178

Query: 869  GSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK- 1045
            G+V    G S   KSNG+N  K+N    +K LP ++               Q +K LNK 
Sbjct: 179  GNVNYGSG-SAFAKSNGFNSLKSNGLVGTK-LPKST-------------HTQPIKALNKG 223

Query: 1046 --LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREIL 1219
              LGS         GY+PA K  SF N+  G F    P+N++ N + W  N RYK     
Sbjct: 224  PHLGSDLSAGSY--GYHPAGKSPSFNNQKEGLFQHNGPMNYRLNGRGWNQNDRYKKSN-- 279

Query: 1220 RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVI 1399
            R+ D +   E+TRGPR+ +R      ++ + E  GL +  DKYN  +F+ EYDNAKF+VI
Sbjct: 280  RDFDFQNSAEVTRGPRAWNRVLD---SSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVI 336

Query: 1400 KSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFV 1579
            KSYSEDDVHK +KYDVWSSTPNGN+KLDAAF EA+A+ SETGTK P+FL FSVNGSGQFV
Sbjct: 337  KSYSEDDVHKSMKYDVWSSTPNGNRKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFV 396

Query: 1580 GVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRD 1759
            G+AEMIG+VDF+K+MDFWQLDKWNGFFP+KWH+IKD+PN  L HIILENNENR VTYSRD
Sbjct: 397  GLAEMIGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRD 456

Query: 1760 TQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYK 1939
            TQEIGLKQGLEML+IFK YS ++S+LDDF FYENREK+L AK+N KP +  N  K  D+ 
Sbjct: 457  TQEIGLKQGLEMLNIFKRYSAKSSLLDDFGFYENREKTLNAKKNYKPVTLRN--KEDDF- 513

Query: 1940 KQFEAGEGTAGEQSAKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
             Q +AGE    E   +T            TKNLSLN   LK+S
Sbjct: 514  TQTKAGERRVEEDLRRTKKTSDATSLINLTKNLSLNGCTLKNS 556


>gb|EOX97054.1| Yth domain-containing protein, putative isoform 2, partial [Theobroma
            cacao]
          Length = 524

 Score =  476 bits (1225), Expect = e-131
 Identities = 277/547 (50%), Positives = 352/547 (64%), Gaps = 11/547 (2%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVD- 523
            M+GEK  +  E +     S+   KL+E+++  GK G+ S+  S++S+     SG++    
Sbjct: 1    MAGEKMTDNPEPVSAVLKSEVVAKLAEQDVPSGKVGMPSDLTSTMSSSTYPSSGVKGESH 60

Query: 524  QPPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPG 703
            Q    E GV               NG+  Q DD  ++  A G +TG+QS+N SL+YY+PG
Sbjct: 61   QDLVGEPGVNQPTSFYNYYYPGY-NGSLVQSDDNSYF-LANGSHTGMQSENGSLVYYMPG 118

Query: 704  FGPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARS 868
            + PYATG  MGVDG+      Y S+ Y Q P SYGS+A PCY +DSTY+G V  G     
Sbjct: 119  YNPYATGTLMGVDGQCVGQQPYFSSGYFQPPVSYGSEAMPCYIWDSTYAGEVLNGNVDGF 178

Query: 869  GSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK- 1045
            G+V    G S   KSNG+N  K+N    +K LP ++               Q +K LNK 
Sbjct: 179  GNVNYGSG-SAFAKSNGFNSLKSNGLVGTK-LPKST-------------HTQPIKALNKG 223

Query: 1046 --LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREIL 1219
              LGS         GY+PA K  SF N+  G F    P+N++ N + W  N RYK     
Sbjct: 224  PHLGSDLSAGSY--GYHPAGKSPSFNNQKEGLFQHNGPMNYRLNGRGWNQNDRYKKSN-- 279

Query: 1220 RNADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVI 1399
            R+ D +   E+TRGPR+ +R      ++ + E  GL +  DKYN  +F+ EYDNAKF+VI
Sbjct: 280  RDFDFQNSAEVTRGPRAWNRVLD---SSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVI 336

Query: 1400 KSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFV 1579
            KSYSEDDVHK +KYDVWSSTPNGN+KLDAAF EA+A+ SETGTK P+FL FSVNGSGQFV
Sbjct: 337  KSYSEDDVHKSMKYDVWSSTPNGNRKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFV 396

Query: 1580 GVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRD 1759
            G+AEMIG+VDF+K+MDFWQLDKWNGFFP+KWH+IKD+PN  L HIILENNENR VTYSRD
Sbjct: 397  GLAEMIGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRD 456

Query: 1760 TQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYK 1939
            TQEIGLKQGLEML+IFK YS ++S+LDDF FYENREK+L AK+N KP +  N  K  D+ 
Sbjct: 457  TQEIGLKQGLEMLNIFKRYSAKSSLLDDFGFYENREKTLNAKKNYKPVTLRN--KEDDF- 513

Query: 1940 KQFEAGE 1960
             Q +AGE
Sbjct: 514  TQTKAGE 520


>ref|XP_006389534.1| hypothetical protein POPTR_0022s00680g [Populus trichocarpa]
            gi|550312357|gb|ERP48448.1| hypothetical protein
            POPTR_0022s00680g [Populus trichocarpa]
          Length = 581

 Score =  475 bits (1222), Expect = e-131
 Identities = 271/569 (47%), Positives = 353/569 (62%), Gaps = 27/569 (4%)
 Frame = +2

Query: 443  GKEGIQSNSVSSVSAVGDAMSGIQV-----------VDQPPASEQGVXXXXXXXXXXXXX 589
            G +G+ S+S  ++SA G+ +S  +V            DQ P +                 
Sbjct: 28   GIDGLPSDSTPTISASGNGVSDTKVNGSAVSITKREADQEPNAASSYSYQYPGY------ 81

Query: 590  XXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPGFGPYATG-YMGVDGK-----PT 751
              +G+  Q DDQ +Y A G   TG+QSDN S++YY P + PYA+G  +GVDG+     P 
Sbjct: 82   --SGSSTQLDDQVYYQADGSQ-TGMQSDNGSMVYYWPSY-PYASGTVVGVDGQSVAQQPY 137

Query: 752  YASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARSGSVKPVMGPSGSGKSNGYNIP 931
            ++S+ YLQ P SYG +A PCYS+DS Y G+VS G A      K   G +   +SNG+N  
Sbjct: 138  FSSSGYLQHPVSYGLEAMPCYSWDSAYVGDVSNGNAVFENG-KGGSGSTAFAQSNGFNST 196

Query: 932  KTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNKLGSSFQTTGLMKGYNPASKFSS 1111
            K+N N  SK              SK +Y  Q + P    GS F + GL KGY P  KF  
Sbjct: 197  KSNGNIGSK-------------ISKPMY-TQLVSPS---GSDF-SAGLFKGYQPMGKFPP 238

Query: 1112 FTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREIL-RNADLEALTELTRGPRSDSRN-- 1282
            FT++ PG F    P+N++ N ++W  N R   R+   +N D E  TELTRGPR+ ++N  
Sbjct: 239  FTSQKPGPFPHNGPLNYRQNGRMWTGNYRNISRDRFNKNYDFENQTELTRGPRASNKNAP 298

Query: 1283 ------NSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVIKSYSEDDVHKCIKYD 1444
                   +  L ++  +  G+A+  ++YNL +F  EY NAKF+VIKSYSEDD+HK IKYD
Sbjct: 299  LDLLVNKNASLDSSVKDELGIAMRKEQYNLPDFETEYANAKFFVIKSYSEDDIHKSIKYD 358

Query: 1445 VWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFVGVAEMIGQVDFSKNM 1624
            VW+STPNGNKKLDAAF  A+  +S+TG KCP+FLFFSVNGSGQFVG AEM+GQVDF+K+M
Sbjct: 359  VWASTPNGNKKLDAAFHNAEEVSSDTGYKCPIFLFFSVNGSGQFVGFAEMVGQVDFNKDM 418

Query: 1625 DFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRDTQEIGLKQGLEMLSI 1804
            DFWQ+DKWNGFFP+KWH++KD+PN  LRHI+LENN+   VT+SRDTQEI LKQGLEML+I
Sbjct: 419  DFWQIDKWNGFFPVKWHVVKDIPNGHLRHIVLENNDGHSVTFSRDTQEIVLKQGLEMLNI 478

Query: 1805 FKDYSERTSVLDDFNFYENREKSLKAKRNAKPAS-QTNGPKNGDYKKQFEAGEGTAGEQS 1981
            FK YS +TS+LDDFNFYE REKSL  K+  KPA+ Q    KNGD+     A EG + E  
Sbjct: 479  FKSYSAKTSLLDDFNFYEKREKSLNTKKGNKPATLQMEIFKNGDF-AHTTAEEGIS-EDD 536

Query: 1982 AKTXXXXXXXXXXXXTKNLSLNSQPLKSS 2068
            ++T            TKNLSL+    KS+
Sbjct: 537  SRTKKTTNPSSLINLTKNLSLSGHIQKSN 565


>gb|EOX97058.1| Yth domain-containing protein, putative isoform 6, partial [Theobroma
            cacao]
          Length = 499

 Score =  472 bits (1215), Expect = e-130
 Identities = 273/546 (50%), Positives = 348/546 (63%), Gaps = 10/546 (1%)
 Frame = +2

Query: 353  MSGEKNIEASELIVPGPISDPSIKLSEKEM--GKEGIQSNSVSSVSAVGDAMSGIQVVDQ 526
            M+GEK  +  E +     S+   KL+E+++  GK G+ S+  S++S+     SG++    
Sbjct: 1    MAGEKMTDNPEPVSAVLKSEVVAKLAEQDVPSGKVGMPSDLTSTMSSSTYPSSGVK---- 56

Query: 527  PPASEQGVXXXXXXXXXXXXXXXNGTFGQPDDQGFYNAAGGPYTGIQSDNTSLLYYLPGF 706
                                   NG+  Q DD  ++  A G +TG+QS+N SL+YY+PG+
Sbjct: 57   ---------------------GYNGSLVQSDDNSYF-LANGSHTGMQSENGSLVYYMPGY 94

Query: 707  GPYATG-YMGVDGK----PTYASTEYLQQPCSYGSDAFPCYSYDSTYSGNVSTGTAARSG 871
             PYATG  MGVDG+      Y S+ Y Q P SYGS+A PCY +DSTY+G V  G     G
Sbjct: 95   NPYATGTLMGVDGQCVGQQPYFSSGYFQPPVSYGSEAMPCYIWDSTYAGEVLNGNVDGFG 154

Query: 872  SVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFSKSIYQNQSLKPLNK-- 1045
            +V    G S   KSNG+N  K+N    +K LP ++               Q +K LNK  
Sbjct: 155  NVNYGSG-SAFAKSNGFNSLKSNGLVGTK-LPKST-------------HTQPIKALNKGP 199

Query: 1046 -LGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVWYNNCRYKGREILR 1222
             LGS         GY+PA K  SF N+  G F    P+N++ N + W  N RYK     R
Sbjct: 200  HLGSDLSAGSY--GYHPAGKSPSFNNQKEGLFQHNGPMNYRLNGRGWNQNDRYKKSN--R 255

Query: 1223 NADLEALTELTRGPRSDSRNNSPKLAAAEVESPGLAIETDKYNLQEFRVEYDNAKFYVIK 1402
            + D +   E+TRGPR+ +R      ++ + E  GL +  DKYN  +F+ EYDNAKF+VIK
Sbjct: 256  DFDFQNSAEVTRGPRAWNRVLD---SSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIK 312

Query: 1403 SYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTSETGTKCPVFLFFSVNGSGQFVG 1582
            SYSEDDVHK +KYDVWSSTPNGN+KLDAAF EA+A+ SETGTK P+FL FSVNGSGQFVG
Sbjct: 313  SYSEDDVHKSMKYDVWSSTPNGNRKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVG 372

Query: 1583 VAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPNTLLRHIILENNENRPVTYSRDT 1762
            +AEMIG+VDF+K+MDFWQLDKWNGFFP+KWH+IKD+PN  L HIILENNENR VTYSRDT
Sbjct: 373  LAEMIGKVDFNKDMDFWQLDKWNGFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDT 432

Query: 1763 QEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSLKAKRNAKPASQTNGPKNGDYKK 1942
            QEIGLKQGLEML+IFK YS ++S+LDDF FYENREK+L AK+N KP +  N  K  D+  
Sbjct: 433  QEIGLKQGLEMLNIFKRYSAKSSLLDDFGFYENREKTLNAKKNYKPVTLRN--KEDDF-T 489

Query: 1943 QFEAGE 1960
            Q +AGE
Sbjct: 490  QTKAGE 495


>ref|XP_002331108.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score =  445 bits (1145), Expect = e-122
 Identities = 231/430 (53%), Positives = 295/430 (68%), Gaps = 15/430 (3%)
 Frame = +2

Query: 662  IQSDNTSLLYYLPGFGPYATG-YMGVDGK-----PTYASTEYLQQPCSYGSDAFPCYSYD 823
            +QSDN S++YY P + PYA+G  +GVDG+     P ++S+ YLQ P SYG +A PCYS+D
Sbjct: 1    MQSDNGSMVYYWPSY-PYASGTVVGVDGQSVAQQPYFSSSGYLQHPVSYGLEAMPCYSWD 59

Query: 824  STYSGNVSTGTAARSGSVKPVMGPSGSGKSNGYNIPKTNTNFSSKALPFNSKVQQSSNFS 1003
            S Y G+VS G A      K   G +   +SNG+N  K+N N  SK              S
Sbjct: 60   SAYVGDVSNGNAVFENG-KGGSGSTAFAQSNGFNSTKSNGNIGSK-------------IS 105

Query: 1004 KSIYQNQSLKPLNKLGSSFQTTGLMKGYNPASKFSSFTNRNPGAFTQYNPVNHQSNCKVW 1183
            K +Y  Q + P    GS F + GL KGY P  KF  FT++ PG F    P+N++ N ++W
Sbjct: 106  KPMY-TQLVSPS---GSDF-SAGLFKGYQPMGKFPPFTSQKPGPFPHNGPLNYRQNGRMW 160

Query: 1184 YNNCRYKGREIL-RNADLEALTELTRGPRSDSRN--------NSPKLAAAEVESPGLAIE 1336
              N R   R+   +N D E  TELTRGPR+ ++N         +  L ++  +  G+A+ 
Sbjct: 161  TGNYRNISRDRFNKNYDFENQTELTRGPRASNKNAPLDLLVNKNASLDSSVKDELGIAMR 220

Query: 1337 TDKYNLQEFRVEYDNAKFYVIKSYSEDDVHKCIKYDVWSSTPNGNKKLDAAFREADAKTS 1516
             ++YNL +F  EY NAKF+VIKSYSEDD+HK IKYDVW+STPNGNKKLDAAF  A+  +S
Sbjct: 221  KEQYNLPDFETEYANAKFFVIKSYSEDDIHKSIKYDVWASTPNGNKKLDAAFHNAEEVSS 280

Query: 1517 ETGTKCPVFLFFSVNGSGQFVGVAEMIGQVDFSKNMDFWQLDKWNGFFPIKWHIIKDVPN 1696
            +TG KCP+FLFFSVNGSGQFVG AEM+GQVDF+K+MDFWQ+DKWNGFFP+KWH++KD+PN
Sbjct: 281  DTGYKCPIFLFFSVNGSGQFVGFAEMVGQVDFNKDMDFWQIDKWNGFFPVKWHVVKDIPN 340

Query: 1697 TLLRHIILENNENRPVTYSRDTQEIGLKQGLEMLSIFKDYSERTSVLDDFNFYENREKSL 1876
              LRHI+LENN+   VT+SRDTQEI LKQGLEML+IFK YS +TS+LDDFNFYE REKSL
Sbjct: 341  GHLRHIVLENNDGHSVTFSRDTQEIVLKQGLEMLNIFKSYSAKTSLLDDFNFYEKREKSL 400

Query: 1877 KAKRNAKPAS 1906
              K+  KPA+
Sbjct: 401  NTKKGNKPAT 410

