BLASTX nr result

ID: Akebia24_contig00004437 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00004437
         (2007 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI29706.3| unnamed protein product [Vitis vinifera]              436   e-119
ref|XP_002262918.1| PREDICTED: uncharacterized protein LOC100249...   436   e-119
emb|CAN72774.1| hypothetical protein VITISV_026284 [Vitis vinifera]   422   e-115
ref|XP_007222796.1| hypothetical protein PRUPE_ppa003557mg [Prun...   411   e-112
gb|AHN05781.1| YTH domain-contained RNA binding protein 10 [Malu...   397   e-108
ref|XP_006385033.1| hypothetical protein POPTR_0004s23250g [Popu...   390   e-106
ref|XP_002526452.1| yth domain-containing protein, putative [Ric...   390   e-105
ref|XP_006389534.1| hypothetical protein POPTR_0022s00680g [Popu...   377   e-101
gb|EXB29044.1| hypothetical protein L484_018461 [Morus notabilis]     376   e-101
ref|XP_007041225.1| Yth domain-containing protein, putative isof...   369   4e-99
ref|XP_007041222.1| Yth domain-containing protein, putative isof...   369   4e-99
ref|XP_006471138.1| PREDICTED: uncharacterized protein LOC102630...   367   1e-98
ref|XP_007041224.1| Yth domain-containing protein, putative isof...   367   1e-98
ref|XP_006349328.1| PREDICTED: YTH domain family protein 1-like ...   366   2e-98
ref|XP_004230452.1| PREDICTED: uncharacterized protein LOC101267...   363   2e-97
ref|XP_006431655.1| hypothetical protein CICLE_v10000713mg [Citr...   362   5e-97
ref|XP_006471139.1| PREDICTED: uncharacterized protein LOC102630...   355   6e-95
ref|XP_006431654.1| hypothetical protein CICLE_v10000713mg [Citr...   350   2e-93
ref|XP_007041227.1| Yth domain-containing protein, putative isof...   350   2e-93
ref|XP_007041223.1| Yth domain-containing protein, putative isof...   350   2e-93
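
Note on reading the summary rows above: this legacy report format writes an E-value of 1e-119 as "e-119", dropping the leading "1", so the column cannot be converted to a number directly. The following is a minimal Python sketch of one way to split a summary row and normalise the E-value. The names ROW, parse_evalue and parse_summary_row are hypothetical helpers introduced here for illustration; they are not part of BLAST or of any library.

```python
import re

# Hypothetical helper pattern (an assumption, not a BLAST API):
# sequence id, free-text description, bit score, E-value.
ROW = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\S+)\s*$")


def parse_evalue(text):
    """Convert an E-value string to float, restoring a dropped leading '1'."""
    return float("1" + text if text.startswith("e") else text)


def parse_summary_row(line):
    """Split one summary row into (id, description, bits, evalue), or None."""
    match = ROW.match(line)
    if match is None:
        return None
    seq_id, description, bits, evalue = match.groups()
    return seq_id, description, int(bits), parse_evalue(evalue)


row = parse_summary_row(
    "emb|CBI29706.3| unnamed protein product [Vitis vinifera]"
    "              436   e-119"
)
print(row)
```

Descriptions that end in "..." were truncated by the report itself; the sketch keeps them as they are rather than trying to restore the full title.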

>emb|CBI29706.3| unnamed protein product [Vitis vinifera]
          Length = 708

 Score =  436 bits (1122), Expect = e-119
 Identities = 247/467 (52%), Positives = 292/467 (62%), Gaps = 4/467 (0%)
 Frame = -1

Query: 1629 SINRNGPVVTHVSLRQLDSNSRMSAGSSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDY 1450
            S+  NG V    SL   DS S  SA  S F KS  QSQPLK L + S             
Sbjct: 298  SMKANGTVANKYSL-PFDSKSHQSAAPSNFSKSIFQSQPLKPLNKAS------------- 343

Query: 1449 SSMKVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTY 1270
                   LGSDF  AG A+G++PV KFSS TNQ QG FPHN              G M Y
Sbjct: 344  ------HLGSDF-PAGFAKGFNPVSKFSSFTNQKQGFFPHN--------------GVMNY 382

Query: 1269 RPNGRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFE 1090
            RPN R W GN++++L                                  REK NRNG+FE
Sbjct: 383  RPNSRAWNGNEKYKL----------------------------------REKSNRNGHFE 408

Query: 1089 TFREPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDD 910
            +  E   GPRAR    PL+S+ E  E  + +RR++YN+QDFQT YENAKF+VIKS+SEDD
Sbjct: 409  SSTELTCGPRARNRNAPLNSATEKEELGLMVRRDQYNLQDFQTEYENAKFYVIKSFSEDD 468

Query: 909  IHKSIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIG 730
            IHK IKY VWASTPNGNKKLDA F+D EAK+ E+G   PIFLFFSVNGSGQF+G+AEM+G
Sbjct: 469  IHKCIKYDVWASTPNGNKKLDAAFHDAEAKANETGTKFPIFLFFSVNGSGQFVGVAEMVG 528

Query: 729  PVDFKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLK 550
             VDF KDMDFWQ DKWNGFFP+KWHI+KDIPNS+LRHI LE+NENR VTYTRDTQEIGLK
Sbjct: 529  QVDFNKDMDFWQLDKWNGFFPVKWHIVKDIPNSQLRHITLESNENRSVTYTRDTQEIGLK 588

Query: 549  QGLEMLTIFKSYTAITSMLDDFKFYEDQEKSLQARSNKAVSP--HMETYGNKFP-QKQFE 379
            QG+EML IFK+Y+A TSM DDF FYE++EKSL AR +    P   ME YG+     K   
Sbjct: 589  QGVEMLKIFKNYSARTSMFDDFNFYENREKSLHARRSSKPPPPSQMEIYGSGDDLPKHLH 648

Query: 378  GVRKSDQASMRAQRSDSKMSMVVNLTKNLSLNS-HLPKAGAGKDSME 241
            G  +  +   R  RS    S+ +NLTKNLSL++ H PK  +G +  E
Sbjct: 649  GEERKTEEPARTSRSHDPKSL-INLTKNLSLSTPHPPKNSSGLNPTE 694
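
Note on the coordinates of the alignment above: with Frame = -1 the query is translated from the reverse strand, so the Query coordinates run downwards (1629 to 241) while the Sbjct coordinates run upwards. The Python sketch below checks that these numbers are consistent with each other, assuming the usual convention that minus frame -1 starts at the last nucleotide of the query. The function names are hypothetical and introduced only for this illustration.

```python
QUERY_LENGTH = 2007  # "(2007 letters)" in the report header


def minus_frame(query_length, highest_coord):
    """Frame label of a minus-strand hit whose first codon starts at highest_coord."""
    return -((query_length - highest_coord) % 3 + 1)


def codons_spanned(q_from, q_to):
    """Number of whole codons between two 1-based, inclusive query coordinates."""
    nucleotides = abs(q_from - q_to) + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return nucleotides // 3


frame = minus_frame(QUERY_LENGTH, 1629)   # first Query coordinate of the hit
codons = codons_spanned(1629, 241)        # first and last Query coordinates
columns = codons + 4                      # plus the four '-' characters in the Query lines
print(frame, codons, columns)
```

Under these assumptions the computed frame agrees with the reported Frame = -1, and the codon count plus the four gap characters visible in the Query lines gives 467, the denominator of the Identities and Positives fields.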


>ref|XP_002262918.1| PREDICTED: uncharacterized protein LOC100249242 [Vitis vinifera]
          Length = 608

 Score =  436 bits (1122), Expect = e-119
 Identities = 247/467 (52%), Positives = 292/467 (62%), Gaps = 4/467 (0%)
 Frame = -1

Query: 1629 SINRNGPVVTHVSLRQLDSNSRMSAGSSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDY 1450
            S+  NG V    SL   DS S  SA  S F KS  QSQPLK L + S             
Sbjct: 198  SMKANGTVANKYSL-PFDSKSHQSAAPSNFSKSIFQSQPLKPLNKAS------------- 243

Query: 1449 SSMKVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTY 1270
                   LGSDF  AG A+G++PV KFSS TNQ QG FPHN              G M Y
Sbjct: 244  ------HLGSDF-PAGFAKGFNPVSKFSSFTNQKQGFFPHN--------------GVMNY 282

Query: 1269 RPNGRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFE 1090
            RPN R W GN++++L                                  REK NRNG+FE
Sbjct: 283  RPNSRAWNGNEKYKL----------------------------------REKSNRNGHFE 308

Query: 1089 TFREPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDD 910
            +  E   GPRAR    PL+S+ E  E  + +RR++YN+QDFQT YENAKF+VIKS+SEDD
Sbjct: 309  SSTELTCGPRARNRNAPLNSATEKEELGLMVRRDQYNLQDFQTEYENAKFYVIKSFSEDD 368

Query: 909  IHKSIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIG 730
            IHK IKY VWASTPNGNKKLDA F+D EAK+ E+G   PIFLFFSVNGSGQF+G+AEM+G
Sbjct: 369  IHKCIKYDVWASTPNGNKKLDAAFHDAEAKANETGTKFPIFLFFSVNGSGQFVGVAEMVG 428

Query: 729  PVDFKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLK 550
             VDF KDMDFWQ DKWNGFFP+KWHI+KDIPNS+LRHI LE+NENR VTYTRDTQEIGLK
Sbjct: 429  QVDFNKDMDFWQLDKWNGFFPVKWHIVKDIPNSQLRHITLESNENRSVTYTRDTQEIGLK 488

Query: 549  QGLEMLTIFKSYTAITSMLDDFKFYEDQEKSLQARSNKAVSP--HMETYGNKFP-QKQFE 379
            QG+EML IFK+Y+A TSM DDF FYE++EKSL AR +    P   ME YG+     K   
Sbjct: 489  QGVEMLKIFKNYSARTSMFDDFNFYENREKSLHARRSSKPPPPSQMEIYGSGDDLPKHLH 548

Query: 378  GVRKSDQASMRAQRSDSKMSMVVNLTKNLSLNS-HLPKAGAGKDSME 241
            G  +  +   R  RS    S+ +NLTKNLSL++ H PK  +G +  E
Sbjct: 549  GEERKTEEPARTSRSHDPKSL-INLTKNLSLSTPHPPKNSSGLNPTE 594


>emb|CAN72774.1| hypothetical protein VITISV_026284 [Vitis vinifera]
          Length = 812

 Score =  422 bits (1084), Expect = e-115
 Identities = 230/411 (55%), Positives = 267/411 (64%), Gaps = 2/411 (0%)
 Frame = -1

Query: 1629 SINRNGPVVTHVSLRQLDSNSRMSAGSSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDY 1450
            S+  NG V    SL   DS SR SA  S F KS  QSQPLK L + S             
Sbjct: 201  SMKANGTVANKYSL-PFDSKSRXSAAPSNFSKSIFQSQPLKPLNKAS------------- 246

Query: 1449 SSMKVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTY 1270
                   LGSDF  AG A+G++PV KFSS TNQ QG FPHN              G M Y
Sbjct: 247  ------HLGSDF-PAGFAKGFNPVSKFSSFTNQKQGFFPHN--------------GVMNY 285

Query: 1269 RPNGRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFE 1090
            RPN R W GN++++L                                  REK NRNG+FE
Sbjct: 286  RPNSRAWNGNEKYKL----------------------------------REKSNRNGHFE 311

Query: 1089 TFREPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDD 910
            +  E   GPRAR   +PL+S+ E  E  + +RR++YN+QDFQT YENAKF+VIKS+SEDD
Sbjct: 312  SSTELTCGPRARNRNSPLNSATEKEELGLMVRRDQYNLQDFQTEYENAKFYVIKSFSEDD 371

Query: 909  IHKSIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIG 730
            IHK IKY VWASTPNGNKKLDA F+D EAK+ E+G   PIFLFFSVNGSGQF+G+AEM+G
Sbjct: 372  IHKCIKYDVWASTPNGNKKLDAAFHDAEAKANETGTKFPIFLFFSVNGSGQFVGVAEMVG 431

Query: 729  PVDFKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLK 550
             VDF KDMDFWQ DKWNGFFP+KWHI+KDIPNS+LRHI LE+NENR VTYTRDTQEIGLK
Sbjct: 432  QVDFNKDMDFWQLDKWNGFFPVKWHIVKDIPNSQLRHITLESNENRSVTYTRDTQEIGLK 491

Query: 549  QGLEMLTIFKSYTAITSMLDDFKFYEDQEKSLQARSNKAVSP--HMETYGN 403
            QG+EML IFK+Y+A TSM DDF FYE++EKSL AR +    P   ME YGN
Sbjct: 492  QGVEMLKIFKNYSARTSMFDDFNFYENREKSLHARRSSKPPPPSQMEIYGN 542


>ref|XP_007222796.1| hypothetical protein PRUPE_ppa003557mg [Prunus persica]
            gi|462419732|gb|EMJ23995.1| hypothetical protein
            PRUPE_ppa003557mg [Prunus persica]
          Length = 566

 Score =  411 bits (1057), Expect = e-112
 Identities = 229/424 (54%), Positives = 271/424 (63%), Gaps = 2/424 (0%)
 Frame = -1

Query: 1548 SYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGLAEGYHPVRKF 1369
            S F KS P +QP KSL +VS                    LG+DF SAGL +GY+P  +F
Sbjct: 209  SRFSKSLPHTQPFKSLNKVS-------------------HLGNDF-SAGLLKGYNPAGRF 248

Query: 1368 SSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGEKFNRNGNLET 1189
            SS  NQ  G FP NG M               Y+ N R   GNDRF+             
Sbjct: 249  SSFANQKYGLFPPNGHMN--------------YKSNARILNGNDRFK------------- 281

Query: 1188 SREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNPLSSSAENVEF 1009
                                 +RE +NRN +FE+  E  RGPR+R    PL S+ E  E 
Sbjct: 282  ---------------------SRENYNRNEDFESSTELTRGPRSRNKSAPLDSAIEKEEL 320

Query: 1008 VISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGNKKLDAVFYDT 829
              ++ R++YN+ DFQT YE AKF+VIKSYSEDD+HKSIKY VWASTPNGNKKLDA F D 
Sbjct: 321  SFTVHRDQYNLPDFQTDYEKAKFYVIKSYSEDDVHKSIKYDVWASTPNGNKKLDASFRDA 380

Query: 828  EAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKWHII 649
            E+KS E+G  CPIFLFFSVNGSGQF+GLAEM G VDF KDMDFWQ DKW+GFFP+KWH+I
Sbjct: 381  ESKSRETGTQCPIFLFFSVNGSGQFIGLAEMAGQVDFNKDMDFWQVDKWSGFFPVKWHVI 440

Query: 648  KDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITSMLDDFKFYED 469
            KDIPN++LRHIILENN+NRPVT+TRDTQEIGLKQGLEML IFKSYTA TS+LDDF FYED
Sbjct: 441  KDIPNTQLRHIILENNDNRPVTFTRDTQEIGLKQGLEMLNIFKSYTAKTSLLDDFIFYED 500

Query: 468  QEKSLQA-RSNKAVSPHMETYGNKFPQKQF-EGVRKSDQASMRAQRSDSKMSMVVNLTKN 295
            +EKSL+A RS+K  +  METY N    K    G R  D  S   + +  + S+ ++LTKN
Sbjct: 501  REKSLKAKRSSKPATLKMETYDNNDITKHINSGGRNVDDESAGIRMASDRASL-ISLTKN 559

Query: 294  LSLN 283
            LSLN
Sbjct: 560  LSLN 563


>gb|AHN05781.1| YTH domain-contained RNA binding protein 10 [Malus domestica]
          Length = 517

 Score =  397 bits (1021), Expect = e-108
 Identities = 224/426 (52%), Positives = 267/426 (62%), Gaps = 2/426 (0%)
 Frame = -1

Query: 1551 SSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGLAEGYHPVRK 1372
            +S F K  PQ+QP KSL                    KVP LG+DF SAGL +GY P  +
Sbjct: 163  TSKFSKPLPQTQPGKSLN-------------------KVPHLGNDF-SAGLLKGYPPAGR 202

Query: 1371 FSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGEKFNRNGNLE 1192
            FSS TNQ  G FP                    Y+ NGR   GNDRF+            
Sbjct: 203  FSSYTNQKYGLFPPTSQTY--------------YQSNGRILNGNDRFK------------ 236

Query: 1191 TSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNPLSSSAENVE 1012
                                  +REK+NRN +FE+  E  RGPR+R   +PL S  E  E
Sbjct: 237  ----------------------SREKYNRNEDFESSAELTRGPRSRNKSSPLDSPIEKEE 274

Query: 1011 FVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGNKKLDAVFYD 832
              I++RR+KYN+ DFQT YE AKF+VIKSYSEDD+HKSIKY VWASTPNGNKKLDA F+D
Sbjct: 275  LGITVRRDKYNLPDFQTGYEKAKFYVIKSYSEDDVHKSIKYDVWASTPNGNKKLDAAFHD 334

Query: 831  TEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKWHI 652
            +E K  E+    PIFLFFSVNGSGQF+G+AE IG VDF KDMDFWQ DKW+GFFP+KWH+
Sbjct: 335  SELKLRETNTQYPIFLFFSVNGSGQFVGVAERIGQVDFNKDMDFWQVDKWSGFFPVKWHV 394

Query: 651  IKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITSMLDDFKFYE 472
            IKDIPN +LRHIILENN+NRPVT+TRDTQEIG KQGLE+L IFKSYTA TS+LDDF FYE
Sbjct: 395  IKDIPNPQLRHIILENNDNRPVTFTRDTQEIGHKQGLELLNIFKSYTAKTSLLDDFTFYE 454

Query: 471  DQEKSLQA-RSNKAVSPHMETYGNKFPQKQFE-GVRKSDQASMRAQRSDSKMSMVVNLTK 298
            ++EKSLQA RSNK  +  ME Y N    K    G R  D      +    +M  +++LTK
Sbjct: 455  NREKSLQAKRSNKPATLKMENYDNGDITKHMNAGGRNID-----VESGGMRMGSLISLTK 509

Query: 297  NLSLNS 280
            N++LN+
Sbjct: 510  NMNLNA 515


>ref|XP_006385033.1| hypothetical protein POPTR_0004s23250g [Populus trichocarpa]
            gi|550341801|gb|ERP62830.1| hypothetical protein
            POPTR_0004s23250g [Populus trichocarpa]
          Length = 593

 Score =  390 bits (1003), Expect = e-106
 Identities = 219/443 (49%), Positives = 277/443 (62%), Gaps = 3/443 (0%)
 Frame = -1

Query: 1572 NSRMSAGSSYFPKSTPQSQPLKSLKEVS--FSRPQIFGMGMDYSSMKVPQLGSDFRSAGL 1399
            N +  +GS+ F KS+  +  +KS   V   FS+P         +  KV  LGSDF SAGL
Sbjct: 182  NGKSGSGSTAFAKSSGFNS-VKSNSNVGSKFSKPMYTQPARPMT--KVSPLGSDF-SAGL 237

Query: 1398 AEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGE 1219
             +GY P+ KF   T Q QGPFPH              +GP+ YR N R W GN R +  +
Sbjct: 238  YKGYQPMGKFPPFTGQKQGPFPH--------------SGPLNYRQNVRMWNGNYRNKPRD 283

Query: 1218 KFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNP 1039
            +FNRNG+ E     NQ +  R         P  +    N                    P
Sbjct: 284  RFNRNGDFE-----NQTELTRGPRASIKNAPLDDSVKNNA-------------------P 319

Query: 1038 LSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGN 859
            L SS +++    ++ + +YN+ DF+  Y NAKFFVIKSY+EDDIHKSIKY VWASTPNGN
Sbjct: 320  LDSSVKDM-LGFAMHKEQYNLPDFEIEYSNAKFFVIKSYNEDDIHKSIKYDVWASTPNGN 378

Query: 858  KKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWN 679
            KKLDA F++ E  S+E+G  CPIFLFFSVNGSGQF+GLAEM+G VDF KDMDFWQ DKWN
Sbjct: 379  KKLDAAFHNAEEVSSETGTKCPIFLFFSVNGSGQFVGLAEMVGQVDFNKDMDFWQIDKWN 438

Query: 678  GFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITS 499
            GFFP+KWH+IKDIPN +LRHI+LENN+   VT++RDTQEIGL++GLEML IFKSY+A TS
Sbjct: 439  GFFPVKWHVIKDIPNGQLRHIVLENNDGHSVTFSRDTQEIGLEKGLEMLNIFKSYSAKTS 498

Query: 498  MLDDFKFYEDQEKSLQA-RSNKAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSDSKM 322
            MLDDF FYE++EKSL   +SNK  +  ME + N    K      K  +   RA+++ +  
Sbjct: 499  MLDDFNFYENREKSLNTKKSNKPATLRMEIFENSDFPKHTAAEEKISEDDSRAKKT-TNP 557

Query: 321  SMVVNLTKNLSLNSHLPKAGAGK 253
            S ++NLTKNLSLN H  K+ + K
Sbjct: 558  STLINLTKNLSLNGHNQKSNSVK 580


>ref|XP_002526452.1| yth domain-containing protein, putative [Ricinus communis]
            gi|223534232|gb|EEF35947.1| yth domain-containing
            protein, putative [Ricinus communis]
          Length = 582

 Score =  390 bits (1001), Expect = e-105
 Identities = 210/401 (52%), Positives = 261/401 (65%), Gaps = 1/401 (0%)
 Frame = -1

Query: 1440 KVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPN 1261
            KV  LGSDF SAGL +GYH V  FSS +        H        QGP +HNG M YR N
Sbjct: 225  KVSPLGSDF-SAGLMKGYHHVGNFSSFS-------AHK-------QGPLSHNGTMNYRQN 269

Query: 1260 GRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFR 1081
            GR W GNDR R                                   R+KF +  +FE   
Sbjct: 270  GRMWNGNDRNR----------------------------------PRDKFYKTNDFEASS 295

Query: 1080 EPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHK 901
            E   GPRA    +PL SSA+  +   ++ R++YN  DF+T Y+NAKF+VIKSY+EDDIHK
Sbjct: 296  ELTCGPRASNKISPLDSSAKE-DLAFTVCRDQYNQADFKTEYKNAKFYVIKSYNEDDIHK 354

Query: 900  SIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVD 721
            SIKY VWASTPNGNKKLDA F + E +S+E+G  CPIFLFFSVNGSGQF+GLAEM+G VD
Sbjct: 355  SIKYAVWASTPNGNKKLDAAFCEAEQRSSETGTKCPIFLFFSVNGSGQFVGLAEMVGQVD 414

Query: 720  FKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGL 541
            F+KDMDFWQ DKW+GFFP+KWH+IKDIPN++LRHIILENN+ RPVT++RDTQEIG +QGL
Sbjct: 415  FEKDMDFWQLDKWSGFFPVKWHVIKDIPNNQLRHIILENNDKRPVTFSRDTQEIGFEQGL 474

Query: 540  EMLTIFKSYTAITSMLDDFKFYEDQEKSLQARSNKAVSPHMETYGN-KFPQKQFEGVRKS 364
            EML IFK Y++  S+LDDF FYE++E S+  +SNK  +  ME   N  FP+    G RK 
Sbjct: 475  EMLNIFKGYSSKASLLDDFNFYENRETSVDRKSNKLATLRMEINNNGDFPKHPKSGERKH 534

Query: 363  DQASMRAQRSDSKMSMVVNLTKNLSLNSHLPKAGAGKDSME 241
            ++ S    +  S  S ++NLTKNLSLN +  K+ + K  +E
Sbjct: 535  EEESW--TKKTSNPSSLINLTKNLSLNGYSQKSNSIKKPIE 573


>ref|XP_006389534.1| hypothetical protein POPTR_0022s00680g [Populus trichocarpa]
            gi|550312357|gb|ERP48448.1| hypothetical protein
            POPTR_0022s00680g [Populus trichocarpa]
          Length = 581

 Score =  377 bits (968), Expect = e-101
 Identities = 205/408 (50%), Positives = 254/408 (62%), Gaps = 12/408 (2%)
 Frame = -1

Query: 1452 YSSMKVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMT 1273
            Y+ +  P  GSDF SAGL +GY P+ KF   T+Q  GPFPH              NGP+ 
Sbjct: 211  YTQLVSPS-GSDF-SAGLFKGYQPMGKFPPFTSQKPGPFPH--------------NGPLN 254

Query: 1272 YRPNGRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNF 1093
            YR NGR W GN                                   R  +R++FN+N +F
Sbjct: 255  YRQNGRMWTGN----------------------------------YRNISRDRFNKNYDF 280

Query: 1092 ETFREPNRGPRARGVRNPLS---------SSAENVEFVISLRRNKYNVQDFQTVYENAKF 940
            E   E  RGPRA     PL           S+   E  I++R+ +YN+ DF+T Y NAKF
Sbjct: 281  ENQTELTRGPRASNKNAPLDLLVNKNASLDSSVKDELGIAMRKEQYNLPDFETEYANAKF 340

Query: 939  FVIKSYSEDDIHKSIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSG 760
            FVIKSYSEDDIHKSIKY VWASTPNGNKKLDA F++ E  S+++G  CPIFLFFSVNGSG
Sbjct: 341  FVIKSYSEDDIHKSIKYDVWASTPNGNKKLDAAFHNAEEVSSDTGYKCPIFLFFSVNGSG 400

Query: 759  QFLGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTY 580
            QF+G AEM+G VDF KDMDFWQ DKWNGFFP+KWH++KDIPN  LRHI+LENN+   VT+
Sbjct: 401  QFVGFAEMVGQVDFNKDMDFWQIDKWNGFFPVKWHVVKDIPNGHLRHIVLENNDGHSVTF 460

Query: 579  TRDTQEIGLKQGLEMLTIFKSYTAITSMLDDFKFYEDQEKSLQA-RSNKAVSPHMETY-- 409
            +RDTQEI LKQGLEML IFKSY+A TS+LDDF FYE +EKSL   + NK  +  ME +  
Sbjct: 461  SRDTQEIVLKQGLEMLNIFKSYSAKTSLLDDFNFYEKREKSLNTKKGNKPATLQMEIFKN 520

Query: 408  GNKFPQKQFEGVRKSDQASMRAQRSDSKMSMVVNLTKNLSLNSHLPKA 265
            G+       EG+ + D  +    +  +  S ++NLTKNLSL+ H+ K+
Sbjct: 521  GDFAHTTAEEGISEDDSRT----KKTTNPSSLINLTKNLSLSGHIQKS 564


>gb|EXB29044.1| hypothetical protein L484_018461 [Morus notabilis]
          Length = 549

 Score =  376 bits (966), Expect = e-101
 Identities = 222/438 (50%), Positives = 279/438 (63%), Gaps = 6/438 (1%)
 Frame = -1

Query: 1572 NSRMSAGS--SYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSA-G 1402
            NS  S GS  S F K    +QP+KSL                    KVP LGSDF +A G
Sbjct: 192  NSTKSNGSITSKFSKPLLPTQPVKSLN-------------------KVPHLGSDFSTAAG 232

Query: 1401 LAEGYHP--VRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFR 1228
            L +GY    V +F+S +NQ QG FP+ G         F++     Y+  GR W GNDR  
Sbjct: 233  LLKGYPQPQVGRFASFSNQKQGVFPYTG---------FSN-----YKQYGRIWSGNDR-- 276

Query: 1227 LGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGV 1048
                                    NG+FE S                  E  RGPR+R  
Sbjct: 277  ------------------------NGDFEAS-----------------AELTRGPRSRN- 294

Query: 1047 RNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTP 868
            ++ L SS+E  E  +++RR++YN+ DFQT   NAKF+VIKSYSEDD+HKSIKY VWASTP
Sbjct: 295  KDLLDSSSEKEELGLAVRRDQYNLPDFQTDNVNAKFYVIKSYSEDDVHKSIKYDVWASTP 354

Query: 867  NGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQD 688
            NGNKKLD+ F+D EAKS+E G +CPIFLFFSVNGSGQF+G+AEMIG VDF KDMDFWQ D
Sbjct: 355  NGNKKLDSSFHDAEAKSSEMGKNCPIFLFFSVNGSGQFVGIAEMIGQVDFNKDMDFWQVD 414

Query: 687  KWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTA 508
            KW+GFFP++WHI+KD+PN++LRHIILENN+N+PVT+TRDTQEIGLKQGLEML IFKSYTA
Sbjct: 415  KWSGFFPVRWHIVKDVPNTQLRHIILENNDNKPVTFTRDTQEIGLKQGLEMLNIFKSYTA 474

Query: 507  ITSMLDDFKFYEDQEKSLQA-RSNKAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSD 331
             T++LDDF FYE +E+SLQA RS+K  +  ME   N   +  F   R ++  S  A+ + 
Sbjct: 475  KTTLLDDFNFYESREQSLQAKRSSKPATLKMEGIYN---ENDFT-KRGNEVESGGAKMTS 530

Query: 330  SKMSMVVNLTKNLSLNSH 277
             + S ++NLTKNLSL+++
Sbjct: 531  DRASSLINLTKNLSLSAN 548


>ref|XP_007041225.1| Yth domain-containing protein, putative isoform 4 [Theobroma cacao]
            gi|590681985|ref|XP_007041226.1| Yth domain-containing
            protein, putative isoform 4 [Theobroma cacao]
            gi|508705160|gb|EOX97056.1| Yth domain-containing
            protein, putative isoform 4 [Theobroma cacao]
            gi|508705161|gb|EOX97057.1| Yth domain-containing
            protein, putative isoform 4 [Theobroma cacao]
          Length = 548

 Score =  369 bits (946), Expect = 4e-99
 Identities = 217/447 (48%), Positives = 271/447 (60%), Gaps = 3/447 (0%)
 Frame = -1

Query: 1572 NSRMSAG--SSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGL 1399
            NS  S G   +  PKST  +QP+K+L                    K P LGSD  SAG 
Sbjct: 171  NSLKSNGLVGTKLPKST-HTQPIKALN-------------------KGPHLGSDL-SAG- 208

Query: 1398 AEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGE 1219
            + GYHP  K  S  NQ +G F HNG              PM YR NGR W  NDR++   
Sbjct: 209  SYGYHPAGKSPSFNNQKEGLFQHNG--------------PMNYRLNGRGWNQNDRYK--- 251

Query: 1218 KFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNP 1039
                                              K NR+ +F+   E  RGPRA      
Sbjct: 252  ----------------------------------KSNRDFDFQNSAEVTRGPRAWN--RV 275

Query: 1038 LSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGN 859
            L SS +  +  ++L ++KYN  DFQT Y+NAKFFVIKSYSEDD+HKS+KY VW+STPNGN
Sbjct: 276  LDSSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIKSYSEDDVHKSMKYDVWSSTPNGN 335

Query: 858  KKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWN 679
            +KLDA F++ EA+ +E+G   PIFL FSVNGSGQF+GLAEMIG VDF KDMDFWQ DKWN
Sbjct: 336  RKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVGLAEMIGKVDFNKDMDFWQLDKWN 395

Query: 678  GFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITS 499
            GFFP+KWH+IKDIPN +L HIILENNENR VTY+RDTQEIGLKQGLEML IFK Y+A +S
Sbjct: 396  GFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDTQEIGLKQGLEMLNIFKRYSAKSS 455

Query: 498  MLDDFKFYEDQEKSLQARSN-KAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSDSKM 322
            +LDDF FYE++EK+L A+ N K V+  +    + F ++   G R+ ++  +R  +  S  
Sbjct: 456  LLDDFGFYENREKTLNAKKNYKPVT--LRNKEDDFTKQTKAGERRVEE-DLRRTKKTSDA 512

Query: 321  SMVVNLTKNLSLNSHLPKAGAGKDSME 241
            + ++NLTKNLSLN    K  A K+ +E
Sbjct: 513  TSLINLTKNLSLNGCTLKNSAVKNPIE 539


>ref|XP_007041222.1| Yth domain-containing protein, putative isoform 1 [Theobroma cacao]
            gi|508705157|gb|EOX97053.1| Yth domain-containing
            protein, putative isoform 1 [Theobroma cacao]
          Length = 573

 Score =  369 bits (946), Expect = 4e-99
 Identities = 217/447 (48%), Positives = 271/447 (60%), Gaps = 3/447 (0%)
 Frame = -1

Query: 1572 NSRMSAG--SSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGL 1399
            NS  S G   +  PKST  +QP+K+L                    K P LGSD  SAG 
Sbjct: 196  NSLKSNGLVGTKLPKST-HTQPIKALN-------------------KGPHLGSDL-SAG- 233

Query: 1398 AEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGE 1219
            + GYHP  K  S  NQ +G F HNG              PM YR NGR W  NDR++   
Sbjct: 234  SYGYHPAGKSPSFNNQKEGLFQHNG--------------PMNYRLNGRGWNQNDRYK--- 276

Query: 1218 KFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNP 1039
                                              K NR+ +F+   E  RGPRA      
Sbjct: 277  ----------------------------------KSNRDFDFQNSAEVTRGPRAWN--RV 300

Query: 1038 LSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGN 859
            L SS +  +  ++L ++KYN  DFQT Y+NAKFFVIKSYSEDD+HKS+KY VW+STPNGN
Sbjct: 301  LDSSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIKSYSEDDVHKSMKYDVWSSTPNGN 360

Query: 858  KKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWN 679
            +KLDA F++ EA+ +E+G   PIFL FSVNGSGQF+GLAEMIG VDF KDMDFWQ DKWN
Sbjct: 361  RKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVGLAEMIGKVDFNKDMDFWQLDKWN 420

Query: 678  GFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITS 499
            GFFP+KWH+IKDIPN +L HIILENNENR VTY+RDTQEIGLKQGLEML IFK Y+A +S
Sbjct: 421  GFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDTQEIGLKQGLEMLNIFKRYSAKSS 480

Query: 498  MLDDFKFYEDQEKSLQARSN-KAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSDSKM 322
            +LDDF FYE++EK+L A+ N K V+  +    + F ++   G R+ ++  +R  +  S  
Sbjct: 481  LLDDFGFYENREKTLNAKKNYKPVT--LRNKEDDFTKQTKAGERRVEE-DLRRTKKTSDA 537

Query: 321  SMVVNLTKNLSLNSHLPKAGAGKDSME 241
            + ++NLTKNLSLN    K  A K+ +E
Sbjct: 538  TSLINLTKNLSLNGCTLKNSAVKNPIE 564


>ref|XP_006471138.1| PREDICTED: uncharacterized protein LOC102630620 isoform X1 [Citrus
            sinensis]
          Length = 572

 Score =  367 bits (941), Expect = 1e-98
 Identities = 237/605 (39%), Positives = 314/605 (51%), Gaps = 45/605 (7%)
 Frame = -1

Query: 1965 GTSNSKVREKVDTSLKSTSLANLEEQDVASGRECKTSGSASSISLMGDANFVMEGESDQQ 1786
            G  N    E V T LK   ++    QDVASG++   S S +S++  G A   M+GE DQ+
Sbjct: 3    GEKNIIKDEPVATELKGNPISKSTGQDVASGKDGAASDSTASMATSGHAASGMKGEIDQE 62

Query: 1785 PDAEKG--------HDVAYP-------EIKDNAYFL---------EGNGSEIPHXXXXXX 1678
               E G        ++  YP       ++ +N Y             NGS + +      
Sbjct: 63   SVGEYGAQNPSTVHYNYYYPGSNGSFSQVDNNGYIHTDGSHSGVHSDNGSLLYYLPGYDP 122

Query: 1677 XXXXXXXXXXXXXXLNSINRNGPVVTHVS---------------LRQLDSNSRMSAGSSY 1543
                              + +G +   VS               +  + + + +  G+  
Sbjct: 123  YSTIVGVDGQCVGQQPYFSSSGYLQHPVSYGSEVMPCYSWDSTYVADIQNGNAVGFGNEK 182

Query: 1542 FPKSTP--QSQPLKSLKEVSFSRPQIFGMGMDYSSM---KVPQLGSDFRSAGLAEGYHPV 1378
            +  ST   +S  L S+K+      ++       S+    KV QLGSD  SAG  +G  P+
Sbjct: 183  YGGSTAFAKSNGLNSVKKNGCFTNKVSKSSYTQSTKPVSKVTQLGSDL-SAGFLKGSDPL 241

Query: 1377 RKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGEKFNRNGN 1198
              FS+ +NQ QG FP+                 + Y  NGR W GNDR++          
Sbjct: 242  GNFSAFSNQKQGFFPNM----------------VNYSTNGRMWNGNDRYK---------- 275

Query: 1197 LETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNPLSSSAEN 1018
                                    +R+KF+R G      E  RGPRA      L  S + 
Sbjct: 276  ------------------------SRDKFSRAGGLGMPTELIRGPRAENKSASLEISDKK 311

Query: 1017 VEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGNKKLDAVF 838
                 ++ R++YN+ DFQ  YE AKF+VIKSYSEDDIHK IKY VW+STPNGNKKLDA F
Sbjct: 312  EVLSPTVSRDQYNLPDFQVEYEKAKFYVIKSYSEDDIHKCIKYDVWSSTPNGNKKLDATF 371

Query: 837  YDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKW 658
             + EAK+ E+G  CPIFLFFSVNGSGQF+GLAEM+G VDF KDMDFWQ DKWNGFFP+KW
Sbjct: 372  NEAEAKADETGTRCPIFLFFSVNGSGQFVGLAEMMGKVDFNKDMDFWQLDKWNGFFPVKW 431

Query: 657  HIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITSMLDDFKF 478
            H+IKD+PN+ LRHI LENNEN+PVT++RDTQEIGLKQGLEML IFKSY+A TS+LDDF F
Sbjct: 432  HVIKDVPNTLLRHITLENNENKPVTHSRDTQEIGLKQGLEMLKIFKSYSAKTSLLDDFNF 491

Query: 477  YEDQEKSLQA-RSNKAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSDSKMSMVVNLT 301
            YE++E+S    +S+K  +  M+ + +    KQ +   K           D     ++NLT
Sbjct: 492  YENKERSFHGKKSSKPATLQMDIFNDDDFTKQIKSAEK---------EFDEDSISIINLT 542

Query: 300  KNLSL 286
            KNLSL
Sbjct: 543  KNLSL 547


>ref|XP_007041224.1| Yth domain-containing protein, putative isoform 3 [Theobroma cacao]
            gi|508705159|gb|EOX97055.1| Yth domain-containing
            protein, putative isoform 3 [Theobroma cacao]
          Length = 572

 Score =  367 bits (941), Expect = 1e-98
 Identities = 218/447 (48%), Positives = 271/447 (60%), Gaps = 3/447 (0%)
 Frame = -1

Query: 1572 NSRMSAG--SSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGL 1399
            NS  S G   +  PKST  +QP+K+L                    K P LGSD  SAG 
Sbjct: 196  NSLKSNGLVGTKLPKST-HTQPIKALN-------------------KGPHLGSDL-SAG- 233

Query: 1398 AEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGE 1219
            + GYHP  K  S  NQ +G F HNG              PM YR NGR W  NDR++   
Sbjct: 234  SYGYHPAGKSPSFNNQKEGLFQHNG--------------PMNYRLNGRGWNQNDRYK--- 276

Query: 1218 KFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNP 1039
                                              K NR+ +F+   E  RGPRA      
Sbjct: 277  ----------------------------------KSNRDFDFQNSAEVTRGPRAWN--RV 300

Query: 1038 LSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGN 859
            L SS +  +  ++L ++KYN  DFQT Y+NAKFFVIKSYSEDD+HKS+KY VW+STPNGN
Sbjct: 301  LDSSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIKSYSEDDVHKSMKYDVWSSTPNGN 360

Query: 858  KKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWN 679
            +KLDA F++ EA+ +E+G   PIFL FSVNGSGQF+GLAEMIG VDF KDMDFWQ DKWN
Sbjct: 361  RKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVGLAEMIGKVDFNKDMDFWQLDKWN 420

Query: 678  GFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITS 499
            GFFP+KWH+IKDIPN +L HIILENNENR VTY+RDTQEIGLKQGLEML IFK Y+A +S
Sbjct: 421  GFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDTQEIGLKQGLEMLNIFKRYSAKSS 480

Query: 498  MLDDFKFYEDQEKSLQARSN-KAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSDSKM 322
            +LDDF FYE++EK+L A+ N K V+  +    + F Q +  G R+ ++  +R  +  S  
Sbjct: 481  LLDDFGFYENREKTLNAKKNYKPVT--LRNKEDDFTQTK-AGERRVEE-DLRRTKKTSDA 536

Query: 321  SMVVNLTKNLSLNSHLPKAGAGKDSME 241
            + ++NLTKNLSLN    K  A K+ +E
Sbjct: 537  TSLINLTKNLSLNGCTLKNSAVKNPIE 563


>ref|XP_006349328.1| PREDICTED: YTH domain family protein 1-like [Solanum tuberosum]
          Length = 570

 Score =  366 bits (939), Expect = 2e-98
 Identities = 207/458 (45%), Positives = 273/458 (59%), Gaps = 7/458 (1%)
 Frame = -1

Query: 1632 NSINRNGPVVTHVSLRQLDSNSRMSAGSS---YFPKSTPQ---SQPLKSLKEVSFSRPQI 1471
            ++  RNG V ++       +NS  S+ SS   + PKS P    S P KS+ +     P  
Sbjct: 170  STFGRNGSVKSN-GFNSTKTNSSFSSKSSTVLFNPKSRPSTAMSNPPKSVHQAQPFNP-- 226

Query: 1470 FGMGMDYSSMKVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPF-PHNGPMRNQIQGPF 1294
                       V +  SD +S GL +G+H V  + S T+Q QG F P++           
Sbjct: 227  -----------VNKFQSDVQSGGLMKGFHLVGDYPSYTSQNQGFFMPYD----------- 264

Query: 1293 AHNGPMTYRPNGRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREK 1114
                P+  + N R W GN R +                                   R  
Sbjct: 265  ----PINCQTNSRMWNGNYRIK----------------------------------PRGN 286

Query: 1113 FNRNGNFETFREPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFV 934
            F RNG FE   E  RGPRA G   P   SAE  + V +++R KYN +DF+T Y+NAKF++
Sbjct: 287  FTRNGVFEATNELPRGPRANGRSVPSKPSAEEDQLVPTVQREKYNKEDFKTQYDNAKFYI 346

Query: 933  IKSYSEDDIHKSIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQF 754
            IKSYSEDDIHK +KY VW+STPNGNKKLD  F + EAKS+ +G SCP+FLFFSVNGSGQF
Sbjct: 347  IKSYSEDDIHKCVKYDVWSSTPNGNKKLDTAFVEAEAKSSGTGSSCPVFLFFSVNGSGQF 406

Query: 753  LGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTR 574
            LG+AEM+G VDF ++MDFWQ DKW+GFFPLKWHI+KD+PN++ RHIILENN+NRPVTY+R
Sbjct: 407  LGVAEMVGQVDFNRNMDFWQLDKWSGFFPLKWHIVKDVPNTQFRHIILENNDNRPVTYSR 466

Query: 573  DTQEIGLKQGLEMLTIFKSYTAITSMLDDFKFYEDQEKSLQARSNKAVSPHMETYGNKFP 394
            DTQEIGLK+GLEML I K+Y+  TS+LDDF FYE +EK L+A+ +   +   + Y     
Sbjct: 467  DTQEIGLKEGLEMLNILKNYSEKTSILDDFNFYEKREKVLKAKRSSKPAIQADVYEKADS 526

Query: 393  QKQFEGVRKSDQASMRAQRSDSKMSMVVNLTKNLSLNS 280
             KQF+G  K  +  ++   +D   + +++LTKNLS+NS
Sbjct: 527  LKQFKGGDKVLEEELKTNSAD-PTAPLISLTKNLSINS 563


>ref|XP_004230452.1| PREDICTED: uncharacterized protein LOC101267743 [Solanum
            lycopersicum]
          Length = 563

 Score =  363 bits (931), Expect = 2e-97
 Identities = 207/458 (45%), Positives = 273/458 (59%), Gaps = 7/458 (1%)
 Frame = -1

Query: 1632 NSINRNGPVVTHVSLRQLDSNSRMSAGSS---YFPKSTP---QSQPLKSLKEVSFSRPQI 1471
            ++  RNG V ++       +NS  S+ +S   + PKS P    S P KS  +     P  
Sbjct: 163  STFGRNGSVKSN-GFNSTKTNSSFSSKNSTVLFNPKSRPATAMSNPPKSFHQAQPFNP-- 219

Query: 1470 FGMGMDYSSMKVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPF-PHNGPMRNQIQGPF 1294
                       V +  SD +S GL +G+H V  + S T+Q QG F P++           
Sbjct: 220  -----------VNKFQSDVQSGGLMKGFHLVGDYPSYTSQNQGFFMPYD----------- 257

Query: 1293 AHNGPMTYRPNGRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREK 1114
                P+  + N R W                               NGN+   R   R  
Sbjct: 258  ----PINCQTNSRMW-------------------------------NGNY---RAKPRGN 279

Query: 1113 FNRNGNFETFREPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFV 934
            F RNG FE   E  RGPRA G   P   SAE  + V +++R KYN +DF+T Y+NAKF++
Sbjct: 280  FTRNGVFEATNELPRGPRANGRSVPSKPSAEEDQLVPAVQREKYNKEDFKTQYDNAKFYI 339

Query: 933  IKSYSEDDIHKSIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQF 754
            IKSYSEDDIHK +KY VW+STPNGNKKLD  F ++EAK++ +G SCP+FLFFSVNGSGQF
Sbjct: 340  IKSYSEDDIHKCVKYDVWSSTPNGNKKLDTAFVESEAKASGTGSSCPVFLFFSVNGSGQF 399

Query: 753  LGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTR 574
            LG+AEM+G VDF ++MDFWQ DKW+GFFPLKWHI+KD+PN++ RHIILENN+NRPVTY+R
Sbjct: 400  LGVAEMVGQVDFNRNMDFWQLDKWSGFFPLKWHIVKDVPNTQFRHIILENNDNRPVTYSR 459

Query: 573  DTQEIGLKQGLEMLTIFKSYTAITSMLDDFKFYEDQEKSLQARSNKAVSPHMETYGNKFP 394
            DTQEIGLK+GLEML I K+Y+  TS+LDDF FYE +EK L+A+ +       + Y     
Sbjct: 460  DTQEIGLKEGLEMLNILKNYSEKTSILDDFNFYEKREKVLKAKRSSKPVIQADAYEKADS 519

Query: 393  QKQFEGVRKSDQASMRAQRSDSKMSMVVNLTKNLSLNS 280
             KQF+G  K  +  ++   +D   + +V+LTKNLS+NS
Sbjct: 520  LKQFKGGDKVLEEELKTNSTD-PTAPLVSLTKNLSINS 556


>ref|XP_006431655.1| hypothetical protein CICLE_v10000713mg [Citrus clementina]
            gi|567878195|ref|XP_006431656.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
            gi|557533777|gb|ESR44895.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
            gi|557533778|gb|ESR44896.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
          Length = 572

 Score =  362 bits (928), Expect = 5e-97
 Identities = 235/605 (38%), Positives = 313/605 (51%), Gaps = 45/605 (7%)
 Frame = -1

Query: 1965 GTSNSKVREKVDTSLKSTSLANLEEQDVASGRECKTSGSASSISLMGDANFVMEGESDQQ 1786
            G  N    E V T LK   ++    QDVASG++   S S +S++  G A   M+GE DQ+
Sbjct: 3    GEKNIIKDEPVATELKGNPISKSTGQDVASGKDGAASDSTASMATSGHAASGMKGEIDQE 62

Query: 1785 PDAEKG--------HDVAYP-------EIKDNAYFL---------EGNGSEIPHXXXXXX 1678
               E G        ++  YP       ++ +N Y             NGS + +      
Sbjct: 63   SVGEYGAQNPSTVHYNYYYPGSNGSFSQVDNNGYIHTDGSHSGVHSDNGSLLYYLPGYDP 122

Query: 1677 XXXXXXXXXXXXXXLNSINRNGPVVTHVS---------------LRQLDSNSRMSAGSSY 1543
                              + +G +   VS               +  + + + +  G+  
Sbjct: 123  YSTLVGVDGQCVGQQPYFSSSGYLQHPVSYGSEVMPCYSWDSTYVADIQNGNAVGFGNEK 182

Query: 1542 FPKSTP--QSQPLKSLKEVSFSRPQIFGMGMDYSSM---KVPQLGSDFRSAGLAEGYHPV 1378
            +  ST   +S  L S+K+      ++       S+    KV QL SD  SAG  +G +P+
Sbjct: 183  YGGSTAFAKSNGLNSVKKNGCFTNKVSKSSYTQSTKPVSKVTQLDSDL-SAGFLKGSNPL 241

Query: 1377 RKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGEKFNRNGN 1198
              FS+ +NQ QG FP+                 + Y  NGR W GNDR++          
Sbjct: 242  GNFSAFSNQKQGFFPNM----------------VNYSTNGRMWNGNDRYK---------- 275

Query: 1197 LETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNPLSSSAEN 1018
                                    +R+KF+R G      E  RGPRA      L  S + 
Sbjct: 276  ------------------------SRDKFSRAGGLGMPTELIRGPRAENKSASLEISDKK 311

Query: 1017 VEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGNKKLDAVF 838
                 ++ R++YN+ DFQ  YE  KF+VIKSYSEDDIHK IKY VW+STPNGNKKLDA F
Sbjct: 312  EVPSPTVSRDQYNLPDFQVEYEKVKFYVIKSYSEDDIHKCIKYDVWSSTPNGNKKLDATF 371

Query: 837  YDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWNGFFPLKW 658
             + EAK+ E+G  CPIFLFFSVNGSGQF+GLAEM+G VDF KDMDFWQ DKWNGFFP+KW
Sbjct: 372  NEAEAKADETGTRCPIFLFFSVNGSGQFVGLAEMMGKVDFNKDMDFWQLDKWNGFFPVKW 431

Query: 657  HIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITSMLDDFKF 478
            H+IKD+PN+ LRHI LENNEN+PVT++RDTQEIGLKQGLEML IFKSY+A TS+LDDF F
Sbjct: 432  HVIKDVPNTLLRHITLENNENKPVTHSRDTQEIGLKQGLEMLKIFKSYSAKTSLLDDFNF 491

Query: 477  YEDQEKSLQA-RSNKAVSPHMETYGNKFPQKQFEGVRKSDQASMRAQRSDSKMSMVVNLT 301
            YE++E+S    +S+K  +  M+ + +    KQ +   K           D     ++NLT
Sbjct: 492  YENKERSFHGKKSSKPATLQMDIFNDDDFTKQIKSAEK---------EFDEDSISIINLT 542

Query: 300  KNLSL 286
            KNLSL
Sbjct: 543  KNLSL 547


>ref|XP_006471139.1| PREDICTED: uncharacterized protein LOC102630620 isoform X2 [Citrus
            sinensis]
          Length = 528

 Score =  355 bits (910), Expect = 6e-95
 Identities = 194/386 (50%), Positives = 239/386 (61%), Gaps = 1/386 (0%)
 Frame = -1

Query: 1440 KVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPN 1261
            KV QLGSD  SAG  +G  P+  FS+ +NQ QG FP+                 + Y  N
Sbjct: 178  KVTQLGSDL-SAGFLKGSDPLGNFSAFSNQKQGFFPNM----------------VNYSTN 220

Query: 1260 GRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFR 1081
            GR W GNDR++                                  +R+KF+R G      
Sbjct: 221  GRMWNGNDRYK----------------------------------SRDKFSRAGGLGMPT 246

Query: 1080 EPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHK 901
            E  RGPRA      L  S +      ++ R++YN+ DFQ  YE AKF+VIKSYSEDDIHK
Sbjct: 247  ELIRGPRAENKSASLEISDKKEVLSPTVSRDQYNLPDFQVEYEKAKFYVIKSYSEDDIHK 306

Query: 900  SIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVD 721
             IKY VW+STPNGNKKLDA F + EAK+ E+G  CPIFLFFSVNGSGQF+GLAEM+G VD
Sbjct: 307  CIKYDVWSSTPNGNKKLDATFNEAEAKADETGTRCPIFLFFSVNGSGQFVGLAEMMGKVD 366

Query: 720  FKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGL 541
            F KDMDFWQ DKWNGFFP+KWH+IKD+PN+ LRHI LENNEN+PVT++RDTQEIGLKQGL
Sbjct: 367  FNKDMDFWQLDKWNGFFPVKWHVIKDVPNTLLRHITLENNENKPVTHSRDTQEIGLKQGL 426

Query: 540  EMLTIFKSYTAITSMLDDFKFYEDQEKSLQA-RSNKAVSPHMETYGNKFPQKQFEGVRKS 364
            EML IFKSY+A TS+LDDF FYE++E+S    +S+K  +  M+ + +    KQ +   K 
Sbjct: 427  EMLKIFKSYSAKTSLLDDFNFYENKERSFHGKKSSKPATLQMDIFNDDDFTKQIKSAEK- 485

Query: 363  DQASMRAQRSDSKMSMVVNLTKNLSL 286
                      D     ++NLTKNLSL
Sbjct: 486  --------EFDEDSISIINLTKNLSL 503


>ref|XP_006431654.1| hypothetical protein CICLE_v10000713mg [Citrus clementina]
            gi|557533776|gb|ESR44894.1| hypothetical protein
            CICLE_v10000713mg [Citrus clementina]
          Length = 528

 Score =  350 bits (897), Expect = 2e-93
 Identities = 192/386 (49%), Positives = 238/386 (61%), Gaps = 1/386 (0%)
 Frame = -1

Query: 1440 KVPQLGSDFRSAGLAEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPN 1261
            KV QL SD  SAG  +G +P+  FS+ +NQ QG FP+                 + Y  N
Sbjct: 178  KVTQLDSDL-SAGFLKGSNPLGNFSAFSNQKQGFFPNM----------------VNYSTN 220

Query: 1260 GRTWVGNDRFRLGEKFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFR 1081
            GR W GNDR++                                  +R+KF+R G      
Sbjct: 221  GRMWNGNDRYK----------------------------------SRDKFSRAGGLGMPT 246

Query: 1080 EPNRGPRARGVRNPLSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHK 901
            E  RGPRA      L  S +      ++ R++YN+ DFQ  YE  KF+VIKSYSEDDIHK
Sbjct: 247  ELIRGPRAENKSASLEISDKKEVPSPTVSRDQYNLPDFQVEYEKVKFYVIKSYSEDDIHK 306

Query: 900  SIKYGVWASTPNGNKKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVD 721
             IKY VW+STPNGNKKLDA F + EAK+ E+G  CPIFLFFSVNGSGQF+GLAEM+G VD
Sbjct: 307  CIKYDVWSSTPNGNKKLDATFNEAEAKADETGTRCPIFLFFSVNGSGQFVGLAEMMGKVD 366

Query: 720  FKKDMDFWQQDKWNGFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGL 541
            F KDMDFWQ DKWNGFFP+KWH+IKD+PN+ LRHI LENNEN+PVT++RDTQEIGLKQGL
Sbjct: 367  FNKDMDFWQLDKWNGFFPVKWHVIKDVPNTLLRHITLENNENKPVTHSRDTQEIGLKQGL 426

Query: 540  EMLTIFKSYTAITSMLDDFKFYEDQEKSLQA-RSNKAVSPHMETYGNKFPQKQFEGVRKS 364
            EML IFKSY+A TS+LDDF FYE++E+S    +S+K  +  M+ + +    KQ +   K 
Sbjct: 427  EMLKIFKSYSAKTSLLDDFNFYENKERSFHGKKSSKPATLQMDIFNDDDFTKQIKSAEK- 485

Query: 363  DQASMRAQRSDSKMSMVVNLTKNLSL 286
                      D     ++NLTKNLSL
Sbjct: 486  --------EFDEDSISIINLTKNLSL 503


>ref|XP_007041227.1| Yth domain-containing protein, putative isoform 6, partial [Theobroma
            cacao] gi|508705162|gb|EOX97058.1| Yth domain-containing
            protein, putative isoform 6, partial [Theobroma cacao]
          Length = 499

 Score =  350 bits (897), Expect = 2e-93
 Identities = 197/380 (51%), Positives = 236/380 (62%), Gaps = 2/380 (0%)
 Frame = -1

Query: 1572 NSRMSAG--SSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGL 1399
            NS  S G   +  PKST  +QP+K+L                    K P LGSD  SAG 
Sbjct: 171  NSLKSNGLVGTKLPKST-HTQPIKALN-------------------KGPHLGSDL-SAG- 208

Query: 1398 AEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGE 1219
            + GYHP  K  S  NQ +G F HNG              PM YR NGR W  NDR++   
Sbjct: 209  SYGYHPAGKSPSFNNQKEGLFQHNG--------------PMNYRLNGRGWNQNDRYK--- 251

Query: 1218 KFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNP 1039
                                              K NR+ +F+   E  RGPRA      
Sbjct: 252  ----------------------------------KSNRDFDFQNSAEVTRGPRAWN--RV 275

Query: 1038 LSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGN 859
            L SS +  +  ++L ++KYN  DFQT Y+NAKFFVIKSYSEDD+HKS+KY VW+STPNGN
Sbjct: 276  LDSSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIKSYSEDDVHKSMKYDVWSSTPNGN 335

Query: 858  KKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWN 679
            +KLDA F++ EA+ +E+G   PIFL FSVNGSGQF+GLAEMIG VDF KDMDFWQ DKWN
Sbjct: 336  RKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVGLAEMIGKVDFNKDMDFWQLDKWN 395

Query: 678  GFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITS 499
            GFFP+KWH+IKDIPN +L HIILENNENR VTY+RDTQEIGLKQGLEML IFK Y+A +S
Sbjct: 396  GFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDTQEIGLKQGLEMLNIFKRYSAKSS 455

Query: 498  MLDDFKFYEDQEKSLQARSN 439
            +LDDF FYE++EK+L A+ N
Sbjct: 456  LLDDFGFYENREKTLNAKKN 475


>ref|XP_007041223.1| Yth domain-containing protein, putative isoform 2, partial [Theobroma
            cacao] gi|508705158|gb|EOX97054.1| Yth domain-containing
            protein, putative isoform 2, partial [Theobroma cacao]
          Length = 524

 Score =  350 bits (897), Expect = 2e-93
 Identities = 197/380 (51%), Positives = 236/380 (62%), Gaps = 2/380 (0%)
 Frame = -1

Query: 1572 NSRMSAG--SSYFPKSTPQSQPLKSLKEVSFSRPQIFGMGMDYSSMKVPQLGSDFRSAGL 1399
            NS  S G   +  PKST  +QP+K+L                    K P LGSD  SAG 
Sbjct: 196  NSLKSNGLVGTKLPKST-HTQPIKALN-------------------KGPHLGSDL-SAG- 233

Query: 1398 AEGYHPVRKFSSLTNQIQGPFPHNGPMRNQIQGPFAHNGPMTYRPNGRTWVGNDRFRLGE 1219
            + GYHP  K  S  NQ +G F HNG              PM YR NGR W  NDR++   
Sbjct: 234  SYGYHPAGKSPSFNNQKEGLFQHNG--------------PMNYRLNGRGWNQNDRYK--- 276

Query: 1218 KFNRNGNLETSREPNQEKFNRNGNFETSREPNREKFNRNGNFETFREPNRGPRARGVRNP 1039
                                              K NR+ +F+   E  RGPRA      
Sbjct: 277  ----------------------------------KSNRDFDFQNSAEVTRGPRAWN--RV 300

Query: 1038 LSSSAENVEFVISLRRNKYNVQDFQTVYENAKFFVIKSYSEDDIHKSIKYGVWASTPNGN 859
            L SS +  +  ++L ++KYN  DFQT Y+NAKFFVIKSYSEDD+HKS+KY VW+STPNGN
Sbjct: 301  LDSSVKREDLGLTLCKDKYNPLDFQTEYDNAKFFVIKSYSEDDVHKSMKYDVWSSTPNGN 360

Query: 858  KKLDAVFYDTEAKSTESGISCPIFLFFSVNGSGQFLGLAEMIGPVDFKKDMDFWQQDKWN 679
            +KLDA F++ EA+ +E+G   PIFL FSVNGSGQF+GLAEMIG VDF KDMDFWQ DKWN
Sbjct: 361  RKLDAAFHEAEARESETGTKFPIFLLFSVNGSGQFVGLAEMIGKVDFNKDMDFWQLDKWN 420

Query: 678  GFFPLKWHIIKDIPNSKLRHIILENNENRPVTYTRDTQEIGLKQGLEMLTIFKSYTAITS 499
            GFFP+KWH+IKDIPN +L HIILENNENR VTY+RDTQEIGLKQGLEML IFK Y+A +S
Sbjct: 421  GFFPVKWHVIKDIPNKELSHIILENNENRSVTYSRDTQEIGLKQGLEMLNIFKRYSAKSS 480

Query: 498  MLDDFKFYEDQEKSLQARSN 439
            +LDDF FYE++EK+L A+ N
Sbjct: 481  LLDDFGFYENREKTLNAKKN 500

