BLASTX nr result

ID: Cocculus22_contig00003416 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00003416
         (2652 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN72739.1| hypothetical protein VITISV_027256 [Vitis vinifera]   621   e-175
ref|XP_003633728.1| PREDICTED: KH domain-containing protein At4g...   620   e-175
ref|XP_006482257.1| PREDICTED: KH domain-containing protein At4g...   605   e-170
ref|XP_006430781.1| hypothetical protein CICLE_v10011187mg [Citr...   603   e-169
ref|XP_007033279.1| RNA-binding KH domain-containing protein iso...   551   e-154
ref|XP_007033276.1| RNA-binding KH domain-containing protein iso...   550   e-153
ref|XP_004304132.1| PREDICTED: KH domain-containing protein At4g...   522   e-145
ref|XP_007033280.1| RNA-binding KH domain-containing protein iso...   515   e-143
ref|XP_007033277.1| RNA-binding KH domain-containing protein iso...   515   e-143
ref|XP_006430782.1| hypothetical protein CICLE_v10011187mg [Citr...   504   e-140
ref|XP_002516721.1| Poly(rC)-binding protein, putative [Ricinus ...   502   e-139
ref|XP_007217649.1| hypothetical protein PRUPE_ppa027198mg, part...   484   e-134
ref|XP_006358053.1| PREDICTED: KH domain-containing protein At4g...   482   e-133
ref|XP_004234567.1| PREDICTED: KH domain-containing protein At4g...   480   e-132
ref|XP_006383459.1| hypothetical protein POPTR_0005s15690g [Popu...   469   e-129
gb|AEV43362.1| poly C-binding protein [Citrus sinensis]               426   e-116
ref|XP_007033281.1| RNA-binding KH domain-containing protein iso...   378   e-102
ref|XP_006661902.1| PREDICTED: KH domain-containing protein At4g...   362   4e-97
gb|AAP54423.2| KH domain containing protein, expressed [Oryza sa...   353   2e-94
ref|NP_001064947.1| Os10g0495000 [Oryza sativa Japonica Group] g...   353   2e-94

>emb|CAN72739.1| hypothetical protein VITISV_027256 [Vitis vinifera]
          Length = 668

 Score =  621 bits (1601), Expect = e-175
 Identities = 339/607 (55%), Positives = 417/607 (68%), Gaps = 3/607 (0%)
 Frame = -2

Query: 2363 KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAK 2184
            KR    K+ R N+  +E S G    A+TVYRILCP+              KA+R+ET AK
Sbjct: 46   KRKGSNKRGRWNNSSHEQSFGNSQVADTVYRILCPSKKIGGVIGKGGGIVKALREETQAK 105

Query: 2183 IKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKV 2004
            I V ++VPG+DERV+II+S+ TK P+  +++ED E     E+E + M+PHCPAQ AL+KV
Sbjct: 106  ITVADSVPGSDERVIIIYSAPTKNPKEHNSNEDPER----EEEQDHMEPHCPAQDALMKV 161

Query: 2003 HERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRI 1824
            HERI+EE DL+G ++ EDD E+ VVTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+
Sbjct: 162  HERIIEE-DLFGGTEFEDDNENTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRV 220

Query: 1823 LPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXX 1653
            LP EHLP CAMSSDELVQISG   VAKKALYEVSTLL+QNPR   PP +           
Sbjct: 221  LPAEHLPTCAMSSDELVQISGKPAVAKKALYEVSTLLHQNPRKDKPPSSFPMSFGGQGFH 280

Query: 1652 XXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKAS 1473
                           MWS+RNS   G P MPW GG+ ++                 G+AS
Sbjct: 281  PPGASMGNMPPPGNPMWSNRNSNSQGVPPMPWMGGYRSQ-PSVVPGGFDGVHAGHGGEAS 339

Query: 1472 AEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWN 1293
             EF+MKILCPA KIG VIGKGG NVKQ+QQETGA+IHVED+  E++ERVI VS F+A WN
Sbjct: 340  GEFSMKILCPAGKIGGVIGKGGFNVKQLQQETGASIHVEDALAESEERVIRVSSFEALWN 399

Query: 1292 PQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRV 1113
            P+S TI+AILQLQ  T E S+KG +TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV
Sbjct: 400  PRSQTIEAILQLQNKTSEYSDKGGMTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRV 459

Query: 1112 ISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGF 933
             SK+ KP CAS DEELVQISGN  VA++AL EIASRLRVR L   +   EP P+GP  GF
Sbjct: 460  YSKEDKPKCASDDEELVQISGNFGVAKDALAEIASRLRVRCLRDANGGVEPAPVGPVPGF 519

Query: 932  GRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSME 753
            G                      G+E SKGG  E+EPQSYPV P+     GY N+NSSME
Sbjct: 520  GHPGKLPGGLPSSSGALGAGSSGGFELSKGGGLEYEPQSYPVPPAA---TGYHNVNSSME 576

Query: 752  VKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLL 573
             KIPN ++++V+G GG N++++ ++ GA+VKL DPQ+G SEC+VEI GSS+ + AAQ++L
Sbjct: 577  SKIPNNSVSSVIGMGGGNVANMSEMAGARVKLQDPQSGGSECVVEIRGSSEHLTAAQSIL 636

Query: 572  HNFIASA 552
              F+ASA
Sbjct: 637  QAFMASA 643


>ref|XP_003633728.1| PREDICTED: KH domain-containing protein At4g18375-like [Vitis
            vinifera] gi|296087281|emb|CBI33655.3| unnamed protein
            product [Vitis vinifera]
          Length = 676

 Score =  620 bits (1600), Expect = e-175
 Identities = 339/607 (55%), Positives = 416/607 (68%), Gaps = 3/607 (0%)
 Frame = -2

Query: 2363 KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAK 2184
            KR    K+ R N+  +E S G    A+TVYRILCP+              KA+R+ET AK
Sbjct: 18   KRKGSNKRGRWNNSSHEQSFGNSQVADTVYRILCPSKKIGGVIGKGGGIVKALREETQAK 77

Query: 2183 IKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKV 2004
            I V ++VPG+DERV+II+S+ TK P+   ++ED E     E+E + M+PHCPAQ AL+KV
Sbjct: 78   ITVADSVPGSDERVIIIYSAPTKNPKEHDSNEDPEM----EEEQDHMEPHCPAQDALMKV 133

Query: 2003 HERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRI 1824
            HERI+EE DL+G ++ EDD E+ VVTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+
Sbjct: 134  HERIIEE-DLFGGTEFEDDNENTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRV 192

Query: 1823 LPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXX 1653
            LP EHLP CAMSSDELVQISG   VAKKALYEVSTLL+QNPR   PP +           
Sbjct: 193  LPAEHLPTCAMSSDELVQISGKPAVAKKALYEVSTLLHQNPRKDKPPSSFPMSFGGQGFH 252

Query: 1652 XXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKAS 1473
                           MWS+RNS   G P MPW GG+ ++                 G+AS
Sbjct: 253  PPGASMGNMPPPGNPMWSNRNSNSQGVPPMPWMGGYRSQ-PSVVPGGFDGVHAGHGGEAS 311

Query: 1472 AEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWN 1293
             EF+MKILCPA KIG VIGKGG NVKQ+QQETGA+IHVED+  E++ERVI VS F+A WN
Sbjct: 312  GEFSMKILCPAGKIGGVIGKGGFNVKQLQQETGASIHVEDALAESEERVIRVSSFEALWN 371

Query: 1292 PQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRV 1113
            P+S TI+AILQLQ  T E S+KG +TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV
Sbjct: 372  PRSQTIEAILQLQNKTSEYSDKGGMTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRV 431

Query: 1112 ISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGF 933
             SK+ KP CAS DEELVQISGN  VA++AL EIASRLRVR L   +   EP P+GP  GF
Sbjct: 432  YSKEDKPKCASDDEELVQISGNFGVAKDALAEIASRLRVRCLRDANGGVEPAPVGPVPGF 491

Query: 932  GRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSME 753
            G                      G+E SKGG  E+EPQSYPV P+     GY N+NSSME
Sbjct: 492  GHPGKLPGGLPSSSGALGAGSSGGFELSKGGGLEYEPQSYPVPPAA---TGYHNVNSSME 548

Query: 752  VKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLL 573
             KIPN ++++V+G GG N++++ ++ GA+VKL DPQ+G SEC+VEI GSS+ + AAQ++L
Sbjct: 549  SKIPNNSVSSVIGMGGGNVANMSEMAGARVKLQDPQSGGSECVVEIRGSSEHLTAAQSIL 608

Query: 572  HNFIASA 552
              F+ASA
Sbjct: 609  QAFMASA 615


>ref|XP_006482257.1| PREDICTED: KH domain-containing protein At4g18375 [Citrus sinensis]
          Length = 715

 Score =  605 bits (1559), Expect = e-170
 Identities = 326/605 (53%), Positives = 405/605 (66%), Gaps = 5/605 (0%)
 Frame = -2

Query: 2354 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2175
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKGGNIVKSLREETQAKITV 82

Query: 2174 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 1995
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKKVNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 1994 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1815
            I+EE DL+G    +DD E++ VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1814 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1644
            + LP CAM++DELVQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDELVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1643 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1464
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1463 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1284
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1283 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISK 1104
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SK
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSK 437

Query: 1103 DGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRS 924
            D KP CAS DEELVQISGN  VA++ALTEIASRLR R+L   +  AEP P+GP Q  G +
Sbjct: 438  DDKPKCASEDEELVQISGNFGVAKDALTEIASRLRARTLRDANVGAEPAPVGPVQLVGAA 497

Query: 923  EGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPA--GYPNLNSSMEV 750
             G   R              GYE  +GG  ++EPQSYP  PS   P+  GYPN+NS+ E 
Sbjct: 498  GGLPSRGPLPSGPVGAGISGGYEPFRGGY-DYEPQSYPPPPSAPPPSATGYPNMNSAFEA 556

Query: 749  KIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLH 570
            +IPN A+ +V+G GGSNI ++G++ GA+VKL DP  G SECIV+I GSS+ +I+A     
Sbjct: 557  RIPNKAVGSVMGTGGSNIPNVGEVVGARVKLQDPHPGSSECIVDIRGSSEHLISAHGTYQ 616

Query: 569  NFIAS 555
            +F+ S
Sbjct: 617  SFMTS 621


>ref|XP_006430781.1| hypothetical protein CICLE_v10011187mg [Citrus clementina]
            gi|557532838|gb|ESR44021.1| hypothetical protein
            CICLE_v10011187mg [Citrus clementina]
          Length = 710

 Score =  603 bits (1555), Expect = e-169
 Identities = 325/603 (53%), Positives = 403/603 (66%), Gaps = 3/603 (0%)
 Frame = -2

Query: 2354 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2175
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKGGNIVKSLREETQAKITV 82

Query: 2174 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 1995
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKTQNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 1994 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1815
            I+EE DL+G    +DD E++ VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1814 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1644
            + LP CAM++DELVQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDELVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1643 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1464
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1463 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1284
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1283 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISK 1104
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SK
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSK 437

Query: 1103 DGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRS 924
            D KP CAS DEELVQISGN  VA++ALTEIASRLR R+L   +  AEP P+GP Q  G +
Sbjct: 438  DDKPKCASEDEELVQISGNFGVAKDALTEIASRLRARTLRDANVGAEPAPVGPVQLVGAA 497

Query: 923  EGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSMEVKI 744
             G   R              GYE  +GG  ++EPQSYP  PS     GYPN+NS+ E +I
Sbjct: 498  GGLPSRGPLPSGPVGAGISGGYEPFRGGY-DYEPQSYPPPPSA---MGYPNMNSAFEARI 553

Query: 743  PNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNF 564
            PN A+ +V+G GGSNI ++G++ GA+VKL DP  G SECIV+I GSS+ +I+A     +F
Sbjct: 554  PNKAVGSVMGTGGSNIPNVGEVAGARVKLQDPHPGSSECIVDIRGSSEHLISAHGTYQSF 613

Query: 563  IAS 555
            + S
Sbjct: 614  MTS 616


>ref|XP_007033279.1| RNA-binding KH domain-containing protein isoform 4 [Theobroma cacao]
            gi|508712308|gb|EOY04205.1| RNA-binding KH
            domain-containing protein isoform 4 [Theobroma cacao]
          Length = 706

 Score =  551 bits (1419), Expect = e-154
 Identities = 323/625 (51%), Positives = 396/625 (63%), Gaps = 6/625 (0%)
 Frame = -2

Query: 2321 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2142
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2141 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 1962
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 1961 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1782
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1781 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1611
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1610 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1431
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1430 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1251
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1250 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1071
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1070 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 891
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 890  XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 714
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 713  AGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNFIASAAHNYNT 534
             GGS I +  Q++GA+V+L DPQ G SE   E  GSS+ + AAQ++  +F+ S+  + N 
Sbjct: 564  TGGSTILNT-QVSGARVRLEDPQTGGSE---EFRGSSEHLTAAQSIFQSFMPSSGQSMNA 619

Query: 533  XXXXXXXXXXXQS--YNTQQQQSPY 465
                       QS   N   Q+SPY
Sbjct: 620  QQSSYQNLSVQQSSYLNMNAQRSPY 644


>ref|XP_007033276.1| RNA-binding KH domain-containing protein isoform 1 [Theobroma cacao]
            gi|590652914|ref|XP_007033278.1| RNA-binding KH
            domain-containing protein isoform 1 [Theobroma cacao]
            gi|508712305|gb|EOY04202.1| RNA-binding KH
            domain-containing protein isoform 1 [Theobroma cacao]
            gi|508712307|gb|EOY04204.1| RNA-binding KH
            domain-containing protein isoform 1 [Theobroma cacao]
          Length = 706

 Score =  550 bits (1417), Expect = e-153
 Identities = 322/625 (51%), Positives = 396/625 (63%), Gaps = 6/625 (0%)
 Frame = -2

Query: 2321 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2142
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2141 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 1962
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 1961 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1782
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1781 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1611
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1610 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1431
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1430 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1251
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1250 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1071
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1070 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 891
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 890  XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 714
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 713  AGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNFIASAAHNYNT 534
             GGS I +  +++GA+V+L DPQ G SE   E  GSS+ + AAQ++  +F+ S+  + N 
Sbjct: 564  TGGSTILN-SEVSGARVRLEDPQTGGSE---EFRGSSEHLTAAQSIFQSFMPSSGQSMNA 619

Query: 533  XXXXXXXXXXXQS--YNTQQQQSPY 465
                       QS   N   Q+SPY
Sbjct: 620  QQSSYQNLSVQQSSYLNMNAQRSPY 644


>ref|XP_004304132.1| PREDICTED: KH domain-containing protein At4g18375-like [Fragaria
            vesca subsp. vesca]
          Length = 760

 Score =  522 bits (1345), Expect = e-145
 Identities = 308/643 (47%), Positives = 390/643 (60%), Gaps = 6/643 (0%)
 Frame = -2

Query: 2375 VRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDE 2196
            V+  ++    +K   N+   E S+G     ETVYRILCP+              K++R+E
Sbjct: 15   VQFKRKGSSNQKGNLNNSNREQSSGNAQSLETVYRILCPSKKIGGVIGKGGGIIKSLREE 74

Query: 2195 THAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHA 2016
            T +KI V ++VPG+DERV+IIFS  TK  R  ++DED    +    E +P++PHC AQ A
Sbjct: 75   TRSKITVSDSVPGSDERVIIIFSPPTKISRKQNSDED----SHKADEQKPLEPHCAAQDA 130

Query: 2015 LLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGA 1836
            LLKVH+RIVEE DLY     +DD E N+V  RLLVP+N VGCLLGK G VIQ+LRSET A
Sbjct: 131  LLKVHDRIVEE-DLYDGVTFDDDNE-NIVVTRLLVPNNLVGCLLGKRGDVIQRLRSETRA 188

Query: 1835 NIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXX 1665
            NIR+LP + LP CAM +DELVQISG  DVAKKALYEVSTLL+QNPR   PP+        
Sbjct: 189  NIRVLPADQLPTCAMDTDELVQISGKPDVAKKALYEVSTLLHQNPRKDKPPLG--LAIPF 246

Query: 1664 XXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXX 1485
                               +W +R    HG P MPW G   N                  
Sbjct: 247  GGQGFHLRGAPNMFPPGNPIWPNREPS-HGMPPMPWIGECENHSSGYGRGGFDGDPAGHG 305

Query: 1484 GKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFD 1305
             +ASAEF +KILC A KIG VIGKGG NVKQ+Q+ETGANIHV+D++ +++ERVI VS F+
Sbjct: 306  VEASAEFFIKILCSAGKIGGVIGKGGFNVKQLQEETGANIHVQDASTDSEERVIRVSAFE 365

Query: 1304 APWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQA 1125
                P+S TI+AILQLQ    E+S+KGTITTRLLVPS+KVGC+LGQGG +I EMRRR QA
Sbjct: 366  VLRIPRSQTIEAILQLQNKASELSDKGTITTRLLVPSSKVGCILGQGGQVINEMRRRTQA 425

Query: 1124 DIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGP 945
            DIRV SKD +P CA  DEELVQISGN  VA++AL EI SRLR+R+L   +A  E  P+GP
Sbjct: 426  DIRVYSKDDRPKCADEDEELVQISGNFAVAKDALAEITSRLRIRTLRDTNAGEEHAPVGP 485

Query: 944  FQGFGRSEGFHDR-XXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNL 768
               FG       +                Y+H K G  E+EP+ YPV P+    +GY  +
Sbjct: 486  PPRFGPPGSLPVKGMPPPLSAVRAGSSGRYDHLKVGRHEYEPEGYPVPPAA---SGYQRV 542

Query: 767  NSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIA 588
            N +++ +IPN A+ +  G GGS++S IG++   +VK  D Q+G  E + EI G SDQ+ A
Sbjct: 543  NRALDSRIPNNAVGSFTGIGGSDVSHIGEVPRPRVKYQDSQSGGFEQVAEIRG-SDQLNA 601

Query: 587  AQNLLHNFIASAAHNYNT-XXXXXXXXXXXQSY-NTQQQQSPY 465
            AQN+L   +AS   N +              SY N    QSPY
Sbjct: 602  AQNILQALMASMGQNASAQPSSHHNANTRQGSYPNISDHQSPY 644


>ref|XP_007033280.1| RNA-binding KH domain-containing protein isoform 5 [Theobroma cacao]
            gi|508712309|gb|EOY04206.1| RNA-binding KH
            domain-containing protein isoform 5 [Theobroma cacao]
          Length = 591

 Score =  515 bits (1326), Expect = e-143
 Identities = 295/546 (54%), Positives = 354/546 (64%), Gaps = 4/546 (0%)
 Frame = -2

Query: 2321 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2142
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2141 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 1962
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 1961 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1782
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1781 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1611
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1610 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1431
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1430 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1251
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1250 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1071
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1070 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 891
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 890  XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 714
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 713  AGGSNI 696
             GGS I
Sbjct: 564  TGGSTI 569


>ref|XP_007033277.1| RNA-binding KH domain-containing protein isoform 2 [Theobroma cacao]
            gi|508712306|gb|EOY04203.1| RNA-binding KH
            domain-containing protein isoform 2 [Theobroma cacao]
          Length = 580

 Score =  515 bits (1326), Expect = e-143
 Identities = 295/546 (54%), Positives = 354/546 (64%), Gaps = 4/546 (0%)
 Frame = -2

Query: 2321 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2142
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2141 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 1962
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 1961 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1782
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1781 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1611
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1610 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1431
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1430 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1251
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1250 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1071
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1070 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 891
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 890  XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 714
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 713  AGGSNI 696
             GGS I
Sbjct: 564  TGGSTI 569


>ref|XP_006430782.1| hypothetical protein CICLE_v10011187mg [Citrus clementina]
            gi|557532839|gb|ESR44022.1| hypothetical protein
            CICLE_v10011187mg [Citrus clementina]
          Length = 528

 Score =  504 bits (1298), Expect = e-140
 Identities = 272/486 (55%), Positives = 330/486 (67%), Gaps = 3/486 (0%)
 Frame = -2

Query: 2354 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2175
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKGGNIVKSLREETQAKITV 82

Query: 2174 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 1995
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKTQNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 1994 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1815
            I+EE DL+G    +DD E++ VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1814 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1644
            + LP CAM++DELVQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDELVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1643 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1464
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1463 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1284
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1283 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISK 1104
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SK
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSK 437

Query: 1103 DGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRS 924
            D KP CAS DEELVQISGN  VA++ALTEIASRLR R+L   +  AEP P+GP Q  G +
Sbjct: 438  DDKPKCASEDEELVQISGNFGVAKDALTEIASRLRARTLRDANVGAEPAPVGPVQLVGAA 497

Query: 923  EGFHDR 906
             G   R
Sbjct: 498  GGLPSR 503


>ref|XP_002516721.1| Poly(rC)-binding protein, putative [Ricinus communis]
            gi|223544094|gb|EEF45619.1| Poly(rC)-binding protein,
            putative [Ricinus communis]
          Length = 537

 Score =  502 bits (1293), Expect = e-139
 Identities = 288/527 (54%), Positives = 341/527 (64%), Gaps = 2/527 (0%)
 Frame = -2

Query: 2363 KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAK 2184
            KR    +K + N+ G E S+G P   +TVYRILCP+              K +R+ET AK
Sbjct: 18   KRKGVTRKGKWNNSGREESSGNPLPVDTVYRILCPSRKIGGVIGKGGGIIKGLREETQAK 77

Query: 2183 IKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKV 2004
            I V + VPG+DERV+II+SS  K  R  +  EDL      E E + M+P+C AQ ALLKV
Sbjct: 78   ITVADPVPGSDERVIIIYSSPEKISRNHNDHEDLTM----ENEQDIMEPYCAAQDALLKV 133

Query: 2003 HERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRI 1824
            H+RIVEE DL+G    +DD E+  VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+
Sbjct: 134  HDRIVEE-DLFGGMTSDDDNENGFVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRV 192

Query: 1823 LPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPRP--PMNHXXXXXXXXXXX 1650
            LP +HLP CAMS+DELVQIS   DVAKKALYEVSTLL+QNPR   P +            
Sbjct: 193  LPADHLPTCAMSTDELVQISAKPDVAKKALYEVSTLLHQNPRKDKPPSVPMPYSGQSFHP 252

Query: 1649 XXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASA 1470
                          MW H NS  H  P MP  G +G++                  + SA
Sbjct: 253  PGGPMKNLPPLGSPMWPHHNSS-HSIPPMPIMGRYGSQSSGFGPGGFDDVPRGHVAEPSA 311

Query: 1469 EFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNP 1290
            EF+MKILC A KIG VIGKGGSNVK +QQ+TGA+IHVED++ E+DERVI VS  +A WNP
Sbjct: 312  EFSMKILCSAGKIGGVIGKGGSNVKVVQQDTGASIHVEDASAESDERVIRVSASEALWNP 371

Query: 1289 QSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVI 1110
            +S TIDAILQLQ  T + SEKGTITTRLLVPS+KVGC+LGQGG +I EMRRR QADIRV 
Sbjct: 372  RSQTIDAILQLQNKTSDFSEKGTITTRLLVPSSKVGCILGQGGQVINEMRRRTQADIRVY 431

Query: 1109 SKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFG 930
            SKD KP CAS DEELVQISG   VA++AL EIASRLRVR+L  V+A AEPGP+GP QGFG
Sbjct: 432  SKDEKPKCASEDEELVQISGKFGVAKDALAEIASRLRVRTLRDVNAGAEPGPVGPIQGFG 491

Query: 929  RSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAG 789
             +     R              GYE  +    E+E  SYPV P+  G
Sbjct: 492  AARSLPGR--GSSGMMGASSSGGYEPLRDTEHEYESHSYPVPPAAVG 536



 Score =  135 bits (341), Expect = 7e-29
 Identities = 101/328 (30%), Positives = 149/328 (45%), Gaps = 44/328 (13%)
 Frame = -2

Query: 1457 KILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAP------- 1299
            +ILCP+ KIG VIGKGG  +K +++ET A I V D    +DERVI++  + +P       
Sbjct: 48   RILCPSRKIGGVIGKGGGIIKGLREETQAKITVADPVPGSDERVIII--YSSPEKISRNH 105

Query: 1298 --------------WNPQSPTIDAILQLQ-----------VHTHEISEKGTITTRLLVPS 1194
                            P     DA+L++            + + + +E G +T RLLVP+
Sbjct: 106  NDHEDLTMENEQDIMEPYCAAQDALLKVHDRIVEEDLFGGMTSDDDNENGFVTARLLVPN 165

Query: 1193 NKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEI 1014
            N VGCLLG+ G +I  +R    A+IRV+  D  PTCA + +ELVQIS   +VA+ AL E+
Sbjct: 166  NMVGCLLGKRGDVIQRLRSETGANIRVLPADHLPTCAMSTDELVQISAKPDVAKKALYEV 225

Query: 1013 ASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIR 834
            ++ L     +       P    P+ G    + FH                   H+     
Sbjct: 226  STLLH----QNPRKDKPPSVPMPYSG----QSFHPPGGPMKNLPPLGSPMWPHHNSSHSI 277

Query: 833  EFEP--QSYPVQPSGAGPAGY----------PNLNSSMEVKIPNTAINAVLGAGGSNISS 690
               P    Y  Q SG GP G+          P+   SM++      I  V+G GGSN+  
Sbjct: 278  PPMPIMGRYGSQSSGFGPGGFDDVPRGHVAEPSAEFSMKILCSAGKIGGVIGKGGSNVKV 337

Query: 689  IGQLTGAKVKLNDPQAGVSECIVEIHGS 606
            + Q TGA + + D  A   E ++ +  S
Sbjct: 338  VQQDTGASIHVEDASAESDERVIRVSAS 365


>ref|XP_007217649.1| hypothetical protein PRUPE_ppa027198mg, partial [Prunus persica]
            gi|462413799|gb|EMJ18848.1| hypothetical protein
            PRUPE_ppa027198mg, partial [Prunus persica]
          Length = 576

 Score =  484 bits (1247), Expect = e-134
 Identities = 287/572 (50%), Positives = 357/572 (62%), Gaps = 4/572 (0%)
 Frame = -2

Query: 2390 RRNTRVRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXK 2211
            R N + +    N  +KK   N+   E         +TVYRILCP+              K
Sbjct: 12   RPNVQFKRKGGNNNYKKGNWNNSTREQLLENYQSLDTVYRILCPSKRIGGVIGKGGGIVK 71

Query: 2210 AMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHC 2031
            A+R+ET AKI V ++V G+DERV+IIFSS TK     S+ +  + D+S+E E EPM PHC
Sbjct: 72   ALREETRAKITVADSVLGSDERVIIIFSSPTKI----SSKQTNDGDSSEENELEPMDPHC 127

Query: 2030 PAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLR 1851
             AQ ALLKVH RIVEE DL+G    +DD E++VVTARLLVP+N VGCLLGKGG VIQ+LR
Sbjct: 128  AAQDALLKVHNRIVEE-DLFGGVTFDDDNENSVVTARLLVPNNMVGCLLGKGGDVIQRLR 186

Query: 1850 SETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHX 1680
            SET A+IR+LP + LP CAM +DELVQISG  DV K+ALYEVSTLL+QNPR   PP    
Sbjct: 187  SETSASIRVLPADQLPTCAMETDELVQISGKPDVTKRALYEVSTLLHQNPRKDKPPSGFP 246

Query: 1679 XXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXX 1500
                                    MWS+R    H    MP   G+GN             
Sbjct: 247  MPFWGQGFRPPGAPMTNVLPPGNPMWSNRAPS-HSTAPMPRMEGYGNCSSEFALGGFNGV 305

Query: 1499 XXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVIL 1320
                 G+A AEF+MKILC   KIG VIGKGG NVKQ+Q ETGA+IHV++++ +++ERVI 
Sbjct: 306  PPGHGGEALAEFSMKILCSPGKIGGVIGKGGFNVKQLQLETGASIHVQEASTDSEERVIH 365

Query: 1319 VSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMR 1140
            VS F+A WNP S TI+AIL+LQ  T E+S+KG ITTRLLVPS+KVGC+LGQGG +I EMR
Sbjct: 366  VSAFEAFWNPISQTIEAILELQNKTSELSDKGIITTRLLVPSSKVGCILGQGGQVINEMR 425

Query: 1139 RRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEP 960
            RR QADIRV SKD +P CA+ DEELVQISGN  VA++AL EI SRLRVR+L   +A AEP
Sbjct: 426  RRTQADIRVYSKDDRPRCAAEDEELVQISGNFGVAKDALAEITSRLRVRTLRDANAGAEP 485

Query: 959  GPMGPFQGFGRSEGF-HDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPA 783
              +GP  GFG +                      Y+  K G  E+E QSY + PS A  +
Sbjct: 486  ALVGPAHGFGLAGSVPGGGPRQPPSAVGAGSSSRYDPLKVGRHEYELQSYRI-PSTA--S 542

Query: 782  GYPNLNSSMEVKIPNTAINAVLGAGGSNISSI 687
            GYP++N ++E KIP   +++  G  G N S+I
Sbjct: 543  GYPSVNHALEGKIPKNTVDSFPGTRGGNTSTI 574


>ref|XP_006358053.1| PREDICTED: KH domain-containing protein At4g18375-like isoform X1
            [Solanum tuberosum] gi|565383832|ref|XP_006358054.1|
            PREDICTED: KH domain-containing protein At4g18375-like
            isoform X2 [Solanum tuberosum]
          Length = 705

 Score =  482 bits (1240), Expect = e-133
 Identities = 289/650 (44%), Positives = 374/650 (57%), Gaps = 7/650 (1%)
 Frame = -2

Query: 2393 YRRNTRVRLD----KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXX 2226
            Y+RN+  +      KR  G K ++ N   +E S+     ++T Y ILC +          
Sbjct: 4    YKRNSLKQRSNPQFKRKGGSKNAKLNFSSHERSSENSQSSDTTYCILCQSKKVGSVIGKG 63

Query: 2225 XXXXKAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEP 2046
                KA+R+ET AKI V ++VPG+D+R+VI+ S STK  R  + D++ +   + E E+  
Sbjct: 64   GSIIKALREETQAKITVADSVPGSDDRIVIVSSPSTKLARRQNNDKNNDNPETKE-ENCS 122

Query: 2045 MQPHCPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHV 1866
            M+PHC AQ ALLKVH RIVEE DL G  +  +DK   V+  RLLVP+N VGCLLG+ G V
Sbjct: 123  MEPHCAAQDALLKVHNRIVEE-DLQGVQN--EDKSETVIITRLLVPNNLVGCLLGRKGDV 179

Query: 1865 IQKLRSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---P 1695
            IQKLRSETGA+IR+L  EHLPACAM++DELVQISG   + KKALYEVSTLL+QNPR   P
Sbjct: 180  IQKLRSETGASIRVLSAEHLPACAMTTDELVQISGKPALVKKALYEVSTLLHQNPRKDKP 239

Query: 1694 PMNHXXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXX 1515
              +                          MWS   +  +G       GG+ N+       
Sbjct: 240  ISSFPMVHGAQGFHPPGPPMENMIPPGKPMWSQSKTNLNGMSPALGVGGYRNQLTGFGRA 299

Query: 1514 XXXXXXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEAD 1335
                      G+A  +F MKILC A KIG VIGKGG NVKQ+QQ+TGA IHVED   E+D
Sbjct: 300  DFDYGPPPSAGEAPGDFTMKILCSAAKIGGVIGKGGFNVKQLQQDTGAGIHVEDVAPESD 359

Query: 1334 ERVILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHI 1155
            ERVI VS  ++ W+P+S TIDAILQLQ  T E S+KG +TTRLL PSNKVGC++GQGG +
Sbjct: 360  ERVIRVSSLESFWDPRSRTIDAILQLQSKTSEFSDKGIVTTRLLFPSNKVGCIIGQGGQV 419

Query: 1154 ITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVH 975
            I EMRRR QADIRV+SKD KP CASADEELVQISG++ VA++AL EI+SRLR R L   +
Sbjct: 420  INEMRRRTQADIRVLSKDDKPRCASADEELVQISGSIGVAKDALVEISSRLRERCLRDAN 479

Query: 974  AAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSG 795
            +  E  P+ P  GF  SE F                  YEH KG +RE++  SYP  P  
Sbjct: 480  SKVESTPVRPLPGFVPSEDFRSGDPQRSGAMGAGSSRRYEHLKGAVREYDHPSYPDLPIA 539

Query: 794  AGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEI 615
                 + N+ S  E+K P+ +  +  G GG NI    +  G   +  DP++     I +I
Sbjct: 540  ---TRFSNIRSPPEMKFPDHSYGSAKGTGGYNID---EFAGRSARFQDPRSVGPGFIDDI 593

Query: 614  HGSSDQMIAAQNLLHNFIASAAHNYNTXXXXXXXXXXXQSYNTQQQQSPY 465
             G+SD M A  N+ H     ++ N+N               ++  QQ  Y
Sbjct: 594  RGTSDHMNAGHNVFH----GSSENFNAGRPTFQGYGSPAGQSSNIQQGAY 639


>ref|XP_004234567.1| PREDICTED: KH domain-containing protein At4g18375-like [Solanum
            lycopersicum]
          Length = 707

 Score =  480 bits (1236), Expect = e-132
 Identities = 285/615 (46%), Positives = 364/615 (59%), Gaps = 7/615 (1%)
 Frame = -2

Query: 2393 YRRNTRVRLD----KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXX 2226
            Y+RN+  +      KR  G K ++ N   +E S+     ++T Y ILC +          
Sbjct: 4    YKRNSLKQRSNPQFKRKGGSKNAKLNFSSHERSSENSQSSDTTYCILCQSKKVGSVIGKG 63

Query: 2225 XXXXKAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEP 2046
                KA+R+ET AKI V ++VPG+D+R+VII S STK  R  + D++ +   + E+ +  
Sbjct: 64   GSIIKALREETQAKITVADSVPGSDDRIVIISSPSTKLARRQNNDKNNDNPETKEENYS- 122

Query: 2045 MQPHCPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHV 1866
            M+PHC AQ ALLKVH RIVEE DL G  +  +DK   V+  RLLVP+N VGCLLG+ G V
Sbjct: 123  MEPHCAAQDALLKVHNRIVEE-DLLGVQN--EDKSEAVIITRLLVPNNLVGCLLGRKGDV 179

Query: 1865 IQKLRSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---P 1695
            IQKLRSETGA+IR+L  EHLPACAM++DELVQISG   + KKALYEVSTLL+QNPR   P
Sbjct: 180  IQKLRSETGASIRVLSAEHLPACAMTTDELVQISGKPALVKKALYEVSTLLHQNPRKDKP 239

Query: 1694 PMNHXXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXX 1515
              +                          MWS   +  +G P     GG+ N+       
Sbjct: 240  ISSFPMVHGAQGFHPPGPPMENMIPPGKPMWSQSKTNLNGMPPALGVGGYRNQLTGFGRA 299

Query: 1514 XXXXXXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEAD 1335
                      G+A  +F MKILC A KIG VIGKGG NVKQ+QQETGA IHVED   E+D
Sbjct: 300  DFDYGPPPSAGEAPGDFTMKILCSAAKIGGVIGKGGFNVKQLQQETGAGIHVEDVAPESD 359

Query: 1334 ERVILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHI 1155
            ERVI VS  ++ W+P+S TIDAILQLQ  T E S+KG +TTRLL PSNKVGC++GQGG +
Sbjct: 360  ERVIRVSSLESFWDPRSRTIDAILQLQSKTSEFSDKGIVTTRLLFPSNKVGCIIGQGGQV 419

Query: 1154 ITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVH 975
            I EMRRR QADIRV+SKD KP CASADEELVQISG++ VA++AL EI+SRLR R L   +
Sbjct: 420  INEMRRRTQADIRVLSKDEKPRCASADEELVQISGSIGVAKDALVEISSRLRERCLRDAN 479

Query: 974  AAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSG 795
            +  E   + P  GF  SE F  R               YEH KG +RE +  SYP  P  
Sbjct: 480  SKVESTHVRPLPGFVPSEDFRSRDPQRSGVMGAGSSRRYEHLKGAVRECDHPSYPDLPIA 539

Query: 794  AGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEI 615
                 + N  S  E+K P+ +     G GG NI    +  G+  +  DP++     + +I
Sbjct: 540  ---TRFSNTRSPPEMKFPDHSYGTAKGTGGYNID---EFAGSSARYQDPRSIGPGFVDDI 593

Query: 614  HGSSDQMIAAQNLLH 570
             G+SD+M A  N+ H
Sbjct: 594  RGTSDRMNAGHNVFH 608


>ref|XP_006383459.1| hypothetical protein POPTR_0005s15690g [Populus trichocarpa]
            gi|550339070|gb|ERP61256.1| hypothetical protein
            POPTR_0005s15690g [Populus trichocarpa]
          Length = 563

 Score =  469 bits (1207), Expect = e-129
 Identities = 261/488 (53%), Positives = 313/488 (64%), Gaps = 6/488 (1%)
 Frame = -2

Query: 2351 GFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVD 2172
            G   SR+  +            +TVYRILCP+              KA+R+ET +KI V 
Sbjct: 21   GVSSSRKGKWSDSHGEECSGDGDTVYRILCPSRKIGGVIGKGGNIVKALREETQSKITVA 80

Query: 2171 EAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERI 1992
            ++V G+DERV+II+SSS K PR    DE L    +   + E  +PHC AQ ALLKVH+RI
Sbjct: 81   DSVQGSDERVIIIYSSSDKPPRKMDGDEGLP---AGNGQQEAFEPHCAAQDALLKVHDRI 137

Query: 1991 VEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPE 1812
            VEE DL+G    +DD ++NVVTARLLVP+N VGC+LGK G VIQ+LRSETGANIR+LP +
Sbjct: 138  VEE-DLFGGMASDDDNDNNVVTARLLVPNNMVGCVLGKRGDVIQRLRSETGANIRVLPAD 196

Query: 1811 HLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPRP------PMNHXXXXXXXXXXX 1650
            HLP+CAM +DELVQISG   VAK+ALYE+S LL+QNPR       PM +           
Sbjct: 197  HLPSCAMDTDELVQISGKPAVAKRALYEISILLHQNPRKDKLPSVPMPYGGRTFHPPSDS 256

Query: 1649 XXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASA 1470
                           W HRNS    P SMPW G +GN                   + SA
Sbjct: 257  MANMLPPGNPM----WPHRNST---PHSMPWMGEYGNHPSEFGPGGFNGVPPGHGREPSA 309

Query: 1469 EFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNP 1290
            EF+MKILC   KIG VIGKGGSNVK +QQETGA+IHVED++ E++ER I VS F+  WNP
Sbjct: 310  EFSMKILCSTGKIGGVIGKGGSNVKIVQQETGASIHVEDASAESEERAIRVSAFEGLWNP 369

Query: 1289 QSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVI 1110
            +S TIDAILQLQ  T + SEKG I TRLLVPS+KVGC+LGQGG +I EMRRR QADIRV 
Sbjct: 370  RSQTIDAILQLQDKTSDFSEKGMIITRLLVPSSKVGCILGQGGQVINEMRRRLQADIRVY 429

Query: 1109 SKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFG 930
             K+ KP CAS DEELVQISGN  VA++AL EIASRLR R+L   +A  EPGP GP  GFG
Sbjct: 430  PKNDKPKCASDDEELVQISGNYGVAKDALAEIASRLRARTLRDANAGTEPGPAGPVPGFG 489

Query: 929  RSEGFHDR 906
             +     R
Sbjct: 490  PARNLPGR 497


>gb|AEV43362.1| poly C-binding protein [Citrus sinensis]
          Length = 569

 Score =  426 bits (1096), Expect = e-116
 Identities = 228/418 (54%), Positives = 283/418 (67%), Gaps = 3/418 (0%)
 Frame = -2

Query: 2354 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2175
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKAGNIVKSLREETQAKITV 82

Query: 2174 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 1995
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKTQNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 1994 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1815
            I+EE DL+G    +DD E++ +TARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTITARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1814 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1644
            + LP CAM++DE+VQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDEMVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1643 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1464
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1463 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1284
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1283 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVI 1110
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV+
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVV 435



 Score =  123 bits (308), Expect = 5e-25
 Identities = 96/323 (29%), Positives = 145/323 (44%), Gaps = 44/323 (13%)
 Frame = -2

Query: 1457 KILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAP------- 1299
            +ILCP+ KIG VIGK G+ VK +++ET A I V D+   ++ERVI++  + +P       
Sbjct: 50   RILCPSRKIGGVIGKAGNIVKSLREETQAKITVADTIPGSEERVIII--YSSPTKIAKTQ 107

Query: 1298 --------------WNPQSPTIDAILQLQ-----------VHTHEISEKGTITTRLLVPS 1194
                            P     DA+L++            + + + +E  TIT RLLVP+
Sbjct: 108  NKDDDSAAETKKESMEPHCAAQDALLKVHDRIIEEDLFGGMASDDDNENSTITARLLVPN 167

Query: 1193 NKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEI 1014
            N VGCLLG+ G +I  +R    A+IRV+  D  P CA   +E+VQISG  NVA+ AL E+
Sbjct: 168  NMVGCLLGKRGDVIQRLRSETGANIRVLPADRLPPCAMNTDEMVQISGKPNVAKRALYEV 227

Query: 1013 ASRLR------------VRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXX 870
            ++ L              ++  G +  + P PM      G S  +  R            
Sbjct: 228  STLLHQNPRKDKPPSSFPQAYGGQNFHSPPAPMADMHPLGNS-SWPARNSSLHGMPSTPW 286

Query: 869  XXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISS 690
              GY     G +     S  +     G  G  +   SM++      I  V+G GG N+  
Sbjct: 287  MGGY-----GDQPSRMGSGSINSCPPGQMGEVSAEFSMKILCSAGKIGGVIGKGGFNVKQ 341

Query: 689  IGQLTGAKVKLNDPQAGVSECIV 621
            + Q TGA + + D      E ++
Sbjct: 342  LQQETGASIHVEDAPTDSDERVI 364


>ref|XP_007033281.1| RNA-binding KH domain-containing protein isoform 6, partial
            [Theobroma cacao] gi|508712310|gb|EOY04207.1| RNA-binding
            KH domain-containing protein isoform 6, partial
            [Theobroma cacao]
          Length = 530

 Score =  378 bits (970), Expect = e-102
 Identities = 224/442 (50%), Positives = 269/442 (60%), Gaps = 6/442 (1%)
 Frame = -2

Query: 1772 QISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXXXMW 1602
            +ISG  DVAK+ALYEVSTLL+QNPR   PP++                           W
Sbjct: 39   KISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM-----W 93

Query: 1601 SHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKIGVV 1422
            SHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KIG V
Sbjct: 94   SHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKIGGV 153

Query: 1421 IGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQVHTH 1242
            IGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ  T 
Sbjct: 154  IGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQNKTS 213

Query: 1241 EISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADEELV 1062
            E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DEELV
Sbjct: 214  EFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDEELV 273

Query: 1061 QISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXXXXX 882
            QISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D         
Sbjct: 274  QISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAAAIG 333

Query: 881  XXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLGAGG 705
                        GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G GG
Sbjct: 334  PGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTGTGG 390

Query: 704  SNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNFIASAAHNYNTXXX 525
            S I +  +++GA+V+L DPQ G SE   E  GSS+ + AAQ++  +F+ S+  + N    
Sbjct: 391  STILN-SEVSGARVRLEDPQTGGSE---EFRGSSEHLTAAQSIFQSFMPSSGQSMNAQQS 446

Query: 524  XXXXXXXXQS--YNTQQQQSPY 465
                    QS   N   Q+SPY
Sbjct: 447  SYQNLSVQQSSYLNMNAQRSPY 468



 Score = 94.7 bits (234), Expect = 2e-16
 Identities = 68/202 (33%), Positives = 95/202 (47%)
 Frame = -2

Query: 2318 YEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERVV 2139
            + P+ G    AE   +ILCP               K ++ ET A I V++A   +DERV+
Sbjct: 127  FPPARGAEASAEFSMKILCPAGKIGGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVI 186

Query: 2138 IIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGASD 1959
             +           SA E L              P      A+L++  +  E         
Sbjct: 187  RV-----------SAIEGL------------WNPRSQTIDAILQLQNKTSEFS------- 216

Query: 1958 HEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSDE 1779
                 E   VT RLLVPS++VGC+LG+GGHVI ++R  T A+IR+   +  P CA   +E
Sbjct: 217  -----EKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDEE 271

Query: 1778 LVQISGASDVAKKALYEVSTLL 1713
            LVQISG   VAK AL E+++ L
Sbjct: 272  LVQISGNFGVAKDALAEIASRL 293


>ref|XP_006661902.1| PREDICTED: KH domain-containing protein At4g18375-like [Oryza
            brachyantha]
          Length = 773

 Score =  362 bits (930), Expect = 4e-97
 Identities = 211/474 (44%), Positives = 287/474 (60%), Gaps = 4/474 (0%)
 Frame = -2

Query: 2345 KKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEA 2166
            +K +R +  ++  +      ET+YRILCP               KA+RDET AKI+V ++
Sbjct: 25   RKRKRLNTKHDDGSMSSQPTETIYRILCPVKKIGSVLGRGGDVVKALRDETKAKIRVADS 84

Query: 2165 VPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVE 1986
            +PG DERV+IIF+     P  S  DE  +  ++D  ++  M+PHC AQ ALLK+H++I E
Sbjct: 85   IPGADERVIIIFNY----PSQSEDDEAAQNISTDGFQN--MKPHCFAQDALLKIHDKIAE 138

Query: 1985 EEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHL 1806
            +EDL+G  DHE  +  + VTAR+LVP NQVGCLLGKGG +IQ+LR+ETGA IR+LP E+L
Sbjct: 139  DEDLHGGIDHEKSETVDGVTARILVPGNQVGCLLGKGGSIIQQLRNETGAGIRVLPSENL 198

Query: 1805 PACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXX 1635
            P CA+ SDELVQISGA  + +KALYE+ST L+Q+PR   PP+                  
Sbjct: 199  PQCALKSDELVQISGAPSLVRKALYEISTRLHQHPRKDNPPLEEIIDASTQRKHQSPPQL 258

Query: 1634 XXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMK 1455
                     +    +      P +P    + N  LR                 + EF++K
Sbjct: 259  PHANPMLPHLHVDHS------PQIPLLDPYRNGPLRYHAG------------EAEEFSIK 300

Query: 1454 ILCPADKIGVVIGKGGSNVKQIQQETGANIHVED-STGEADERVILVSCFDAPWNPQSPT 1278
            ILC ++ IG VIGK G NV+Q++Q+TGA I V++     + ER+I+VS  + P +P SPT
Sbjct: 301  ILCASEHIGQVIGKSGGNVRQVEQQTGARIQVKEVGKNASGERLIVVSSQEIPDDPVSPT 360

Query: 1277 IDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDG 1098
            I+A++ L       +E   +TTRL+VPSNKVGC+LG+GG +ITEMRRR  A+IRV SK  
Sbjct: 361  IEALILLHSKVSAPAENRHLTTRLVVPSNKVGCILGEGGKVITEMRRRTGAEIRVYSKAD 420

Query: 1097 KPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQG 936
            KP   S DEELVQ++G   +AR ALTEIASRLR R+L    ++  P P  PF G
Sbjct: 421  KPKYLSFDEELVQVAGLPAIARGALTEIASRLRTRTLRDGSSSNNPPPFAPFDG 474



 Score = 72.8 bits (177), Expect = 8e-10
 Identities = 37/83 (44%), Positives = 56/83 (67%)
 Frame = -2

Query: 791 GPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIH 612
           G  GYP    S+E++IPN+ +  V+G GG+N++ I Q++GA+VKL +     SE IVEI 
Sbjct: 680 GFTGYPG--GSVELRIPNSYLETVIGVGGANLAEIRQISGARVKLLETHPASSESIVEIQ 737

Query: 611 GSSDQMIAAQNLLHNFIASAAHN 543
           G  DQ+ AAQ+LL  FI ++ ++
Sbjct: 738 GVPDQVKAAQSLLQGFIGASGNS 760


>gb|AAP54423.2| KH domain containing protein, expressed [Oryza sativa Japonica Group]
          Length = 677

 Score =  353 bits (907), Expect = 2e-94
 Identities = 206/490 (42%), Positives = 291/490 (59%), Gaps = 4/490 (0%)
 Frame = -2

Query: 2393 YRRNTRVRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXX 2214
            ++ NT  +    N    K +R +  ++         ET+YRILCP               
Sbjct: 9    HKSNTSRKRPHFNSDDGKRKRLNSRHDDGTISSEPIETIYRILCPVKKIGSVLGRGGDIV 68

Query: 2213 KAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPH 2034
            KA+RD T AKI+V +++PG DERV+IIF+ S++T  A+   +++ TD       E M+PH
Sbjct: 69   KALRDTTKAKIRVADSIPGADERVIIIFNYSSQTEEAA---QNISTDG-----FEDMKPH 120

Query: 2033 CPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKL 1854
            C AQ ALLK+H++I  +EDL+    HE  +  + V AR+LVP NQVGCLLGKGG +IQ+L
Sbjct: 121  CFAQDALLKIHDKIAADEDLHAGIVHEKSENVDDVIARILVPGNQVGCLLGKGGSIIQQL 180

Query: 1853 RSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNH 1683
            R++TGA IR+LP E+LP CA+ SDELVQISG+S + +KALYE+ST L+Q+PR   PP+  
Sbjct: 181  RNDTGAGIRVLPSENLPQCALKSDELVQISGSSSLVRKALYEISTRLHQHPRKDNPPLEE 240

Query: 1682 XXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXX 1503
                                     +    +      P +P    + N  L+        
Sbjct: 241  IIDASTQRKHQAPPQLPHANPMLPHLHVDHS------PQIPLLDPYRNRPLQ-------- 286

Query: 1502 XXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVED-STGEADERV 1326
                     + EF++KILC ++ IG VIGK G NV+Q++Q+TGA + V++     ++ER+
Sbjct: 287  ----YHSAEAEEFSIKILCASEHIGQVIGKSGGNVRQVEQQTGACVQVKEVGKNASEERL 342

Query: 1325 ILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITE 1146
            I+VS  + P +P SPTI+A++ L      ++E   +TTRL+VPSNKVGC++G+GG +ITE
Sbjct: 343  IVVSSQEIPDDPVSPTIEALILLHSKVSTLAENHHLTTRLVVPSNKVGCIIGEGGKVITE 402

Query: 1145 MRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAA 966
            MRRR  A+IRV SK  KP   S DEELVQ++G   +AR ALTEIASRLR R+L    ++ 
Sbjct: 403  MRRRTGAEIRVYSKADKPKYLSFDEELVQVAGLPAIARGALTEIASRLRTRTLRDGSSSN 462

Query: 965  EPGPMGPFQG 936
             P P  PF G
Sbjct: 463  NPTPFAPFDG 472


>ref|NP_001064947.1| Os10g0495000 [Oryza sativa Japonica Group] gi|22128716|gb|AAM92828.1|
            putative RNA-binding protein [Oryza sativa Japonica
            Group] gi|110289325|gb|ABB47821.2| KH domain containing
            protein, expressed [Oryza sativa Japonica Group]
            gi|113639556|dbj|BAF26861.1| Os10g0495000 [Oryza sativa
            Japonica Group] gi|215694845|dbj|BAG90036.1| unnamed
            protein product [Oryza sativa Japonica Group]
          Length = 762

 Score =  353 bits (907), Expect = 2e-94
 Identities = 206/490 (42%), Positives = 291/490 (59%), Gaps = 4/490 (0%)
 Frame = -2

Query: 2393 YRRNTRVRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXX 2214
            ++ NT  +    N    K +R +  ++         ET+YRILCP               
Sbjct: 9    HKSNTSRKRPHFNSDDGKRKRLNSRHDDGTISSEPIETIYRILCPVKKIGSVLGRGGDIV 68

Query: 2213 KAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPH 2034
            KA+RD T AKI+V +++PG DERV+IIF+ S++T  A+   +++ TD       E M+PH
Sbjct: 69   KALRDTTKAKIRVADSIPGADERVIIIFNYSSQTEEAA---QNISTDG-----FEDMKPH 120

Query: 2033 CPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKL 1854
            C AQ ALLK+H++I  +EDL+    HE  +  + V AR+LVP NQVGCLLGKGG +IQ+L
Sbjct: 121  CFAQDALLKIHDKIAADEDLHAGIVHEKSENVDDVIARILVPGNQVGCLLGKGGSIIQQL 180

Query: 1853 RSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNH 1683
            R++TGA IR+LP E+LP CA+ SDELVQISG+S + +KALYE+ST L+Q+PR   PP+  
Sbjct: 181  RNDTGAGIRVLPSENLPQCALKSDELVQISGSSSLVRKALYEISTRLHQHPRKDNPPLEE 240

Query: 1682 XXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXX 1503
                                     +    +      P +P    + N  L+        
Sbjct: 241  IIDASTQRKHQAPPQLPHANPMLPHLHVDHS------PQIPLLDPYRNRPLQ-------- 286

Query: 1502 XXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVED-STGEADERV 1326
                     + EF++KILC ++ IG VIGK G NV+Q++Q+TGA + V++     ++ER+
Sbjct: 287  ----YHSAEAEEFSIKILCASEHIGQVIGKSGGNVRQVEQQTGACVQVKEVGKNASEERL 342

Query: 1325 ILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITE 1146
            I+VS  + P +P SPTI+A++ L      ++E   +TTRL+VPSNKVGC++G+GG +ITE
Sbjct: 343  IVVSSQEIPDDPVSPTIEALILLHSKVSTLAENHHLTTRLVVPSNKVGCIIGEGGKVITE 402

Query: 1145 MRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAA 966
            MRRR  A+IRV SK  KP   S DEELVQ++G   +AR ALTEIASRLR R+L    ++ 
Sbjct: 403  MRRRTGAEIRVYSKADKPKYLSFDEELVQVAGLPAIARGALTEIASRLRTRTLRDGSSSN 462

Query: 965  EPGPMGPFQG 936
             P P  PF G
Sbjct: 463  NPTPFAPFDG 472



 Score = 77.8 bits (190), Expect = 2e-11
 Identities = 39/84 (46%), Positives = 60/84 (71%)
 Frame = -2

Query: 794 AGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEI 615
           +G  GYP    S+E +IPN+ + +V+GAGG N++ I Q++GA+VKL++   G SE IVEI
Sbjct: 668 SGLTGYPG--GSVEFRIPNSYLESVIGAGGVNLAEIRQISGARVKLHEAHPGSSESIVEI 725

Query: 614 HGSSDQMIAAQNLLHNFIASAAHN 543
            G  DQ+ AAQ+LL  FI +++++
Sbjct: 726 QGIPDQVKAAQSLLQGFIGASSNS 749

