BLASTX nr result

ID: Cocculus23_contig00004442 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00004442
         (2793 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN72739.1| hypothetical protein VITISV_027256 [Vitis vinifera]   621   e-175
ref|XP_003633728.1| PREDICTED: KH domain-containing protein At4g...   620   e-175
ref|XP_006482257.1| PREDICTED: KH domain-containing protein At4g...   605   e-170
ref|XP_006430781.1| hypothetical protein CICLE_v10011187mg [Citr...   603   e-169
ref|XP_007033279.1| RNA-binding KH domain-containing protein iso...   551   e-154
ref|XP_007033276.1| RNA-binding KH domain-containing protein iso...   550   e-153
ref|XP_004304132.1| PREDICTED: KH domain-containing protein At4g...   522   e-145
ref|XP_007033280.1| RNA-binding KH domain-containing protein iso...   515   e-143
ref|XP_007033277.1| RNA-binding KH domain-containing protein iso...   515   e-143
ref|XP_006430782.1| hypothetical protein CICLE_v10011187mg [Citr...   504   e-140
ref|XP_002516721.1| Poly(rC)-binding protein, putative [Ricinus ...   502   e-139
ref|XP_007217649.1| hypothetical protein PRUPE_ppa027198mg, part...   484   e-134
ref|XP_006358053.1| PREDICTED: KH domain-containing protein At4g...   482   e-133
ref|XP_004234567.1| PREDICTED: KH domain-containing protein At4g...   480   e-132
ref|XP_006383459.1| hypothetical protein POPTR_0005s15690g [Popu...   469   e-129
gb|AEV43362.1| poly C-binding protein [Citrus sinensis]               426   e-116
ref|XP_007033281.1| RNA-binding KH domain-containing protein iso...   378   e-102
ref|XP_006661902.1| PREDICTED: KH domain-containing protein At4g...   362   4e-97
gb|AAP54423.2| KH domain containing protein, expressed [Oryza sa...   353   2e-94
ref|NP_001064947.1| Os10g0495000 [Oryza sativa Japonica Group] g...   353   2e-94

>emb|CAN72739.1| hypothetical protein VITISV_027256 [Vitis vinifera]
          Length = 668

 Score =  621 bits (1601), Expect = e-175
 Identities = 339/607 (55%), Positives = 417/607 (68%), Gaps = 3/607 (0%)
 Frame = -3

Query: 2503 KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAK 2324
            KR    K+ R N+  +E S G    A+TVYRILCP+              KA+R+ET AK
Sbjct: 46   KRKGSNKRGRWNNSSHEQSFGNSQVADTVYRILCPSKKIGGVIGKGGGIVKALREETQAK 105

Query: 2323 IKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKV 2144
            I V ++VPG+DERV+II+S+ TK P+  +++ED E     E+E + M+PHCPAQ AL+KV
Sbjct: 106  ITVADSVPGSDERVIIIYSAPTKNPKEHNSNEDPER----EEEQDHMEPHCPAQDALMKV 161

Query: 2143 HERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRI 1964
            HERI+EE DL+G ++ EDD E+ VVTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+
Sbjct: 162  HERIIEE-DLFGGTEFEDDNENTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRV 220

Query: 1963 LPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXX 1793
            LP EHLP CAMSSDELVQISG   VAKKALYEVSTLL+QNPR   PP +           
Sbjct: 221  LPAEHLPTCAMSSDELVQISGKPAVAKKALYEVSTLLHQNPRKDKPPSSFPMSFGGQGFH 280

Query: 1792 XXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKAS 1613
                           MWS+RNS   G P MPW GG+ ++                 G+AS
Sbjct: 281  PPGASMGNMPPPGNPMWSNRNSNSQGVPPMPWMGGYRSQ-PSVVPGGFDGVHAGHGGEAS 339

Query: 1612 AEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWN 1433
             EF+MKILCPA KIG VIGKGG NVKQ+QQETGA+IHVED+  E++ERVI VS F+A WN
Sbjct: 340  GEFSMKILCPAGKIGGVIGKGGFNVKQLQQETGASIHVEDALAESEERVIRVSSFEALWN 399

Query: 1432 PQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRV 1253
            P+S TI+AILQLQ  T E S+KG +TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV
Sbjct: 400  PRSQTIEAILQLQNKTSEYSDKGGMTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRV 459

Query: 1252 ISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGF 1073
             SK+ KP CAS DEELVQISGN  VA++AL EIASRLRVR L   +   EP P+GP  GF
Sbjct: 460  YSKEDKPKCASDDEELVQISGNFGVAKDALAEIASRLRVRCLRDANGGVEPAPVGPVPGF 519

Query: 1072 GRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSME 893
            G                      G+E SKGG  E+EPQSYPV P+     GY N+NSSME
Sbjct: 520  GHPGKLPGGLPSSSGALGAGSSGGFELSKGGGLEYEPQSYPVPPAA---TGYHNVNSSME 576

Query: 892  VKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLL 713
             KIPN ++++V+G GG N++++ ++ GA+VKL DPQ+G SEC+VEI GSS+ + AAQ++L
Sbjct: 577  SKIPNNSVSSVIGMGGGNVANMSEMAGARVKLQDPQSGGSECVVEIRGSSEHLTAAQSIL 636

Query: 712  HNFIASA 692
              F+ASA
Sbjct: 637  QAFMASA 643


>ref|XP_003633728.1| PREDICTED: KH domain-containing protein At4g18375-like [Vitis
            vinifera] gi|296087281|emb|CBI33655.3| unnamed protein
            product [Vitis vinifera]
          Length = 676

 Score =  620 bits (1600), Expect = e-175
 Identities = 339/607 (55%), Positives = 416/607 (68%), Gaps = 3/607 (0%)
 Frame = -3

Query: 2503 KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAK 2324
            KR    K+ R N+  +E S G    A+TVYRILCP+              KA+R+ET AK
Sbjct: 18   KRKGSNKRGRWNNSSHEQSFGNSQVADTVYRILCPSKKIGGVIGKGGGIVKALREETQAK 77

Query: 2323 IKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKV 2144
            I V ++VPG+DERV+II+S+ TK P+   ++ED E     E+E + M+PHCPAQ AL+KV
Sbjct: 78   ITVADSVPGSDERVIIIYSAPTKNPKEHDSNEDPEM----EEEQDHMEPHCPAQDALMKV 133

Query: 2143 HERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRI 1964
            HERI+EE DL+G ++ EDD E+ VVTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+
Sbjct: 134  HERIIEE-DLFGGTEFEDDNENTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRV 192

Query: 1963 LPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXX 1793
            LP EHLP CAMSSDELVQISG   VAKKALYEVSTLL+QNPR   PP +           
Sbjct: 193  LPAEHLPTCAMSSDELVQISGKPAVAKKALYEVSTLLHQNPRKDKPPSSFPMSFGGQGFH 252

Query: 1792 XXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKAS 1613
                           MWS+RNS   G P MPW GG+ ++                 G+AS
Sbjct: 253  PPGASMGNMPPPGNPMWSNRNSNSQGVPPMPWMGGYRSQ-PSVVPGGFDGVHAGHGGEAS 311

Query: 1612 AEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWN 1433
             EF+MKILCPA KIG VIGKGG NVKQ+QQETGA+IHVED+  E++ERVI VS F+A WN
Sbjct: 312  GEFSMKILCPAGKIGGVIGKGGFNVKQLQQETGASIHVEDALAESEERVIRVSSFEALWN 371

Query: 1432 PQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRV 1253
            P+S TI+AILQLQ  T E S+KG +TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV
Sbjct: 372  PRSQTIEAILQLQNKTSEYSDKGGMTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRV 431

Query: 1252 ISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGF 1073
             SK+ KP CAS DEELVQISGN  VA++AL EIASRLRVR L   +   EP P+GP  GF
Sbjct: 432  YSKEDKPKCASDDEELVQISGNFGVAKDALAEIASRLRVRCLRDANGGVEPAPVGPVPGF 491

Query: 1072 GRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSME 893
            G                      G+E SKGG  E+EPQSYPV P+     GY N+NSSME
Sbjct: 492  GHPGKLPGGLPSSSGALGAGSSGGFELSKGGGLEYEPQSYPVPPAA---TGYHNVNSSME 548

Query: 892  VKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLL 713
             KIPN ++++V+G GG N++++ ++ GA+VKL DPQ+G SEC+VEI GSS+ + AAQ++L
Sbjct: 549  SKIPNNSVSSVIGMGGGNVANMSEMAGARVKLQDPQSGGSECVVEIRGSSEHLTAAQSIL 608

Query: 712  HNFIASA 692
              F+ASA
Sbjct: 609  QAFMASA 615


>ref|XP_006482257.1| PREDICTED: KH domain-containing protein At4g18375 [Citrus sinensis]
          Length = 715

 Score =  605 bits (1559), Expect = e-170
 Identities = 326/605 (53%), Positives = 405/605 (66%), Gaps = 5/605 (0%)
 Frame = -3

Query: 2494 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2315
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKGGNIVKSLREETQAKITV 82

Query: 2314 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 2135
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKKVNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 2134 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1955
            I+EE DL+G    +DD E++ VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1954 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1784
            + LP CAM++DELVQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDELVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1783 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1604
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1603 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1424
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1423 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISK 1244
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SK
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSK 437

Query: 1243 DGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRS 1064
            D KP CAS DEELVQISGN  VA++ALTEIASRLR R+L   +  AEP P+GP Q  G +
Sbjct: 438  DDKPKCASEDEELVQISGNFGVAKDALTEIASRLRARTLRDANVGAEPAPVGPVQLVGAA 497

Query: 1063 EGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPA--GYPNLNSSMEV 890
             G   R              GYE  +GG  ++EPQSYP  PS   P+  GYPN+NS+ E 
Sbjct: 498  GGLPSRGPLPSGPVGAGISGGYEPFRGGY-DYEPQSYPPPPSAPPPSATGYPNMNSAFEA 556

Query: 889  KIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLH 710
            +IPN A+ +V+G GGSNI ++G++ GA+VKL DP  G SECIV+I GSS+ +I+A     
Sbjct: 557  RIPNKAVGSVMGTGGSNIPNVGEVVGARVKLQDPHPGSSECIVDIRGSSEHLISAHGTYQ 616

Query: 709  NFIAS 695
            +F+ S
Sbjct: 617  SFMTS 621


>ref|XP_006430781.1| hypothetical protein CICLE_v10011187mg [Citrus clementina]
            gi|557532838|gb|ESR44021.1| hypothetical protein
            CICLE_v10011187mg [Citrus clementina]
          Length = 710

 Score =  603 bits (1555), Expect = e-169
 Identities = 325/603 (53%), Positives = 403/603 (66%), Gaps = 3/603 (0%)
 Frame = -3

Query: 2494 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2315
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKGGNIVKSLREETQAKITV 82

Query: 2314 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 2135
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKTQNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 2134 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1955
            I+EE DL+G    +DD E++ VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1954 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1784
            + LP CAM++DELVQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDELVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1783 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1604
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1603 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1424
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1423 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISK 1244
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SK
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSK 437

Query: 1243 DGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRS 1064
            D KP CAS DEELVQISGN  VA++ALTEIASRLR R+L   +  AEP P+GP Q  G +
Sbjct: 438  DDKPKCASEDEELVQISGNFGVAKDALTEIASRLRARTLRDANVGAEPAPVGPVQLVGAA 497

Query: 1063 EGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSMEVKI 884
             G   R              GYE  +GG  ++EPQSYP  PS     GYPN+NS+ E +I
Sbjct: 498  GGLPSRGPLPSGPVGAGISGGYEPFRGGY-DYEPQSYPPPPSA---MGYPNMNSAFEARI 553

Query: 883  PNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNF 704
            PN A+ +V+G GGSNI ++G++ GA+VKL DP  G SECIV+I GSS+ +I+A     +F
Sbjct: 554  PNKAVGSVMGTGGSNIPNVGEVAGARVKLQDPHPGSSECIVDIRGSSEHLISAHGTYQSF 613

Query: 703  IAS 695
            + S
Sbjct: 614  MTS 616


>ref|XP_007033279.1| RNA-binding KH domain-containing protein isoform 4 [Theobroma cacao]
            gi|508712308|gb|EOY04205.1| RNA-binding KH
            domain-containing protein isoform 4 [Theobroma cacao]
          Length = 706

 Score =  551 bits (1419), Expect = e-154
 Identities = 323/625 (51%), Positives = 396/625 (63%), Gaps = 6/625 (0%)
 Frame = -3

Query: 2461 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2282
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2281 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 2102
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 2101 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1922
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1921 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1751
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1750 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1571
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1570 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1391
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1390 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1211
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1210 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 1031
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 1030 XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 854
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 853  AGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNFIASAAHNYNT 674
             GGS I +  Q++GA+V+L DPQ G SE   E  GSS+ + AAQ++  +F+ S+  + N 
Sbjct: 564  TGGSTILNT-QVSGARVRLEDPQTGGSE---EFRGSSEHLTAAQSIFQSFMPSSGQSMNA 619

Query: 673  XXXXXXXXXXXQS--YNTQQQQSPY 605
                       QS   N   Q+SPY
Sbjct: 620  QQSSYQNLSVQQSSYLNMNAQRSPY 644


>ref|XP_007033276.1| RNA-binding KH domain-containing protein isoform 1 [Theobroma cacao]
            gi|590652914|ref|XP_007033278.1| RNA-binding KH
            domain-containing protein isoform 1 [Theobroma cacao]
            gi|508712305|gb|EOY04202.1| RNA-binding KH
            domain-containing protein isoform 1 [Theobroma cacao]
            gi|508712307|gb|EOY04204.1| RNA-binding KH
            domain-containing protein isoform 1 [Theobroma cacao]
          Length = 706

 Score =  550 bits (1417), Expect = e-153
 Identities = 322/625 (51%), Positives = 396/625 (63%), Gaps = 6/625 (0%)
 Frame = -3

Query: 2461 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2282
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2281 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 2102
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 2101 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1922
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1921 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1751
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1750 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1571
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1570 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1391
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1390 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1211
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1210 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 1031
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 1030 XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 854
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 853  AGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNFIASAAHNYNT 674
             GGS I +  +++GA+V+L DPQ G SE   E  GSS+ + AAQ++  +F+ S+  + N 
Sbjct: 564  TGGSTILN-SEVSGARVRLEDPQTGGSE---EFRGSSEHLTAAQSIFQSFMPSSGQSMNA 619

Query: 673  XXXXXXXXXXXQS--YNTQQQQSPY 605
                       QS   N   Q+SPY
Sbjct: 620  QQSSYQNLSVQQSSYLNMNAQRSPY 644


>ref|XP_004304132.1| PREDICTED: KH domain-containing protein At4g18375-like [Fragaria
            vesca subsp. vesca]
          Length = 760

 Score =  522 bits (1345), Expect = e-145
 Identities = 308/643 (47%), Positives = 390/643 (60%), Gaps = 6/643 (0%)
 Frame = -3

Query: 2515 VRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDE 2336
            V+  ++    +K   N+   E S+G     ETVYRILCP+              K++R+E
Sbjct: 15   VQFKRKGSSNQKGNLNNSNREQSSGNAQSLETVYRILCPSKKIGGVIGKGGGIIKSLREE 74

Query: 2335 THAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHA 2156
            T +KI V ++VPG+DERV+IIFS  TK  R  ++DED    +    E +P++PHC AQ A
Sbjct: 75   TRSKITVSDSVPGSDERVIIIFSPPTKISRKQNSDED----SHKADEQKPLEPHCAAQDA 130

Query: 2155 LLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGA 1976
            LLKVH+RIVEE DLY     +DD E N+V  RLLVP+N VGCLLGK G VIQ+LRSET A
Sbjct: 131  LLKVHDRIVEE-DLYDGVTFDDDNE-NIVVTRLLVPNNLVGCLLGKRGDVIQRLRSETRA 188

Query: 1975 NIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXX 1805
            NIR+LP + LP CAM +DELVQISG  DVAKKALYEVSTLL+QNPR   PP+        
Sbjct: 189  NIRVLPADQLPTCAMDTDELVQISGKPDVAKKALYEVSTLLHQNPRKDKPPLG--LAIPF 246

Query: 1804 XXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXX 1625
                               +W +R    HG P MPW G   N                  
Sbjct: 247  GGQGFHLRGAPNMFPPGNPIWPNREPS-HGMPPMPWIGECENHSSGYGRGGFDGDPAGHG 305

Query: 1624 GKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFD 1445
             +ASAEF +KILC A KIG VIGKGG NVKQ+Q+ETGANIHV+D++ +++ERVI VS F+
Sbjct: 306  VEASAEFFIKILCSAGKIGGVIGKGGFNVKQLQEETGANIHVQDASTDSEERVIRVSAFE 365

Query: 1444 APWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQA 1265
                P+S TI+AILQLQ    E+S+KGTITTRLLVPS+KVGC+LGQGG +I EMRRR QA
Sbjct: 366  VLRIPRSQTIEAILQLQNKASELSDKGTITTRLLVPSSKVGCILGQGGQVINEMRRRTQA 425

Query: 1264 DIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGP 1085
            DIRV SKD +P CA  DEELVQISGN  VA++AL EI SRLR+R+L   +A  E  P+GP
Sbjct: 426  DIRVYSKDDRPKCADEDEELVQISGNFAVAKDALAEITSRLRIRTLRDTNAGEEHAPVGP 485

Query: 1084 FQGFGRSEGFHDR-XXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNL 908
               FG       +                Y+H K G  E+EP+ YPV P+    +GY  +
Sbjct: 486  PPRFGPPGSLPVKGMPPPLSAVRAGSSGRYDHLKVGRHEYEPEGYPVPPAA---SGYQRV 542

Query: 907  NSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIA 728
            N +++ +IPN A+ +  G GGS++S IG++   +VK  D Q+G  E + EI G SDQ+ A
Sbjct: 543  NRALDSRIPNNAVGSFTGIGGSDVSHIGEVPRPRVKYQDSQSGGFEQVAEIRG-SDQLNA 601

Query: 727  AQNLLHNFIASAAHNYNT-XXXXXXXXXXXQSY-NTQQQQSPY 605
            AQN+L   +AS   N +              SY N    QSPY
Sbjct: 602  AQNILQALMASMGQNASAQPSSHHNANTRQGSYPNISDHQSPY 644


>ref|XP_007033280.1| RNA-binding KH domain-containing protein isoform 5 [Theobroma cacao]
            gi|508712309|gb|EOY04206.1| RNA-binding KH
            domain-containing protein isoform 5 [Theobroma cacao]
          Length = 591

 Score =  515 bits (1326), Expect = e-143
 Identities = 295/546 (54%), Positives = 354/546 (64%), Gaps = 4/546 (0%)
 Frame = -3

Query: 2461 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2282
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2281 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 2102
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 2101 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1922
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1921 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1751
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1750 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1571
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1570 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1391
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1390 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1211
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1210 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 1031
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 1030 XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 854
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 853  AGGSNI 836
             GGS I
Sbjct: 564  TGGSTI 569


>ref|XP_007033277.1| RNA-binding KH domain-containing protein isoform 2 [Theobroma cacao]
            gi|508712306|gb|EOY04203.1| RNA-binding KH
            domain-containing protein isoform 2 [Theobroma cacao]
          Length = 580

 Score =  515 bits (1326), Expect = e-143
 Identities = 295/546 (54%), Positives = 354/546 (64%), Gaps = 4/546 (0%)
 Frame = -3

Query: 2461 GYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERV 2282
            G+E  +G  N  +TVYR+LCP+              KA+R+ET AKI V ++V G DERV
Sbjct: 35   GHEQPSGNFNSGDTVYRVLCPSRKIGGVIGKGGSIVKALREETQAKITVGDSVLGCDERV 94

Query: 2281 VIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGAS 2102
            +II+SS  K  +  ++DED   + + + E   M+P C AQ ALLKVH++I E+ DL+G  
Sbjct: 95   IIIYSSPMKV-KTQNSDEDSRGE-NKKDEVVVMEPCCAAQDALLKVHDQIAED-DLFGGM 151

Query: 2101 DHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSD 1922
              +DD  + VVTARLLVP+N VGCLLGK G VIQ+LRSETGA+IRILP +HLPACAM++D
Sbjct: 152  ALDDDNGNTVVTARLLVPNNMVGCLLGKRGDVIQRLRSETGASIRILPADHLPACAMATD 211

Query: 1921 ELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXX 1751
            ELVQISG  DVAK+ALYEVSTLL+QNPR   PP++                         
Sbjct: 212  ELVQISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM--- 268

Query: 1750 XMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKI 1571
              WSHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KI
Sbjct: 269  --WSHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKI 326

Query: 1570 GVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQV 1391
            G VIGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ 
Sbjct: 327  GGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQN 386

Query: 1390 HTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADE 1211
             T E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DE
Sbjct: 387  KTSEFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDE 446

Query: 1210 ELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXX 1031
            ELVQISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D      
Sbjct: 447  ELVQISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAA 506

Query: 1030 XXXXXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLG 854
                           GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G
Sbjct: 507  AIGPGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTG 563

Query: 853  AGGSNI 836
             GGS I
Sbjct: 564  TGGSTI 569


>ref|XP_006430782.1| hypothetical protein CICLE_v10011187mg [Citrus clementina]
            gi|557532839|gb|ESR44022.1| hypothetical protein
            CICLE_v10011187mg [Citrus clementina]
          Length = 528

 Score =  504 bits (1298), Expect = e-140
 Identities = 272/486 (55%), Positives = 330/486 (67%), Gaps = 3/486 (0%)
 Frame = -3

Query: 2494 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2315
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKGGNIVKSLREETQAKITV 82

Query: 2314 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 2135
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKTQNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 2134 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1955
            I+EE DL+G    +DD E++ VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1954 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1784
            + LP CAM++DELVQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDELVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1783 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1604
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1603 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1424
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1423 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISK 1244
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SK
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSK 437

Query: 1243 DGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRS 1064
            D KP CAS DEELVQISGN  VA++ALTEIASRLR R+L   +  AEP P+GP Q  G +
Sbjct: 438  DDKPKCASEDEELVQISGNFGVAKDALTEIASRLRARTLRDANVGAEPAPVGPVQLVGAA 497

Query: 1063 EGFHDR 1046
             G   R
Sbjct: 498  GGLPSR 503


>ref|XP_002516721.1| Poly(rC)-binding protein, putative [Ricinus communis]
            gi|223544094|gb|EEF45619.1| Poly(rC)-binding protein,
            putative [Ricinus communis]
          Length = 537

 Score =  502 bits (1293), Expect = e-139
 Identities = 288/527 (54%), Positives = 341/527 (64%), Gaps = 2/527 (0%)
 Frame = -3

Query: 2503 KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAK 2324
            KR    +K + N+ G E S+G P   +TVYRILCP+              K +R+ET AK
Sbjct: 18   KRKGVTRKGKWNNSGREESSGNPLPVDTVYRILCPSRKIGGVIGKGGGIIKGLREETQAK 77

Query: 2323 IKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKV 2144
            I V + VPG+DERV+II+SS  K  R  +  EDL      E E + M+P+C AQ ALLKV
Sbjct: 78   ITVADPVPGSDERVIIIYSSPEKISRNHNDHEDLTM----ENEQDIMEPYCAAQDALLKV 133

Query: 2143 HERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRI 1964
            H+RIVEE DL+G    +DD E+  VTARLLVP+N VGCLLGK G VIQ+LRSETGANIR+
Sbjct: 134  HDRIVEE-DLFGGMTSDDDNENGFVTARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRV 192

Query: 1963 LPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPRP--PMNHXXXXXXXXXXX 1790
            LP +HLP CAMS+DELVQIS   DVAKKALYEVSTLL+QNPR   P +            
Sbjct: 193  LPADHLPTCAMSTDELVQISAKPDVAKKALYEVSTLLHQNPRKDKPPSVPMPYSGQSFHP 252

Query: 1789 XXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASA 1610
                          MW H NS  H  P MP  G +G++                  + SA
Sbjct: 253  PGGPMKNLPPLGSPMWPHHNSS-HSIPPMPIMGRYGSQSSGFGPGGFDDVPRGHVAEPSA 311

Query: 1609 EFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNP 1430
            EF+MKILC A KIG VIGKGGSNVK +QQ+TGA+IHVED++ E+DERVI VS  +A WNP
Sbjct: 312  EFSMKILCSAGKIGGVIGKGGSNVKVVQQDTGASIHVEDASAESDERVIRVSASEALWNP 371

Query: 1429 QSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVI 1250
            +S TIDAILQLQ  T + SEKGTITTRLLVPS+KVGC+LGQGG +I EMRRR QADIRV 
Sbjct: 372  RSQTIDAILQLQNKTSDFSEKGTITTRLLVPSSKVGCILGQGGQVINEMRRRTQADIRVY 431

Query: 1249 SKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFG 1070
            SKD KP CAS DEELVQISG   VA++AL EIASRLRVR+L  V+A AEPGP+GP QGFG
Sbjct: 432  SKDEKPKCASEDEELVQISGKFGVAKDALAEIASRLRVRTLRDVNAGAEPGPVGPIQGFG 491

Query: 1069 RSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAG 929
             +     R              GYE  +    E+E  SYPV P+  G
Sbjct: 492  AARSLPGR--GSSGMMGASSSGGYEPLRDTEHEYESHSYPVPPAAVG 536



 Score =  135 bits (341), Expect = 8e-29
 Identities = 101/328 (30%), Positives = 149/328 (45%), Gaps = 44/328 (13%)
 Frame = -3

Query: 1597 KILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAP------- 1439
            +ILCP+ KIG VIGKGG  +K +++ET A I V D    +DERVI++  + +P       
Sbjct: 48   RILCPSRKIGGVIGKGGGIIKGLREETQAKITVADPVPGSDERVIII--YSSPEKISRNH 105

Query: 1438 --------------WNPQSPTIDAILQLQ-----------VHTHEISEKGTITTRLLVPS 1334
                            P     DA+L++            + + + +E G +T RLLVP+
Sbjct: 106  NDHEDLTMENEQDIMEPYCAAQDALLKVHDRIVEEDLFGGMTSDDDNENGFVTARLLVPN 165

Query: 1333 NKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEI 1154
            N VGCLLG+ G +I  +R    A+IRV+  D  PTCA + +ELVQIS   +VA+ AL E+
Sbjct: 166  NMVGCLLGKRGDVIQRLRSETGANIRVLPADHLPTCAMSTDELVQISAKPDVAKKALYEV 225

Query: 1153 ASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIR 974
            ++ L     +       P    P+ G    + FH                   H+     
Sbjct: 226  STLLH----QNPRKDKPPSVPMPYSG----QSFHPPGGPMKNLPPLGSPMWPHHNSSHSI 277

Query: 973  EFEP--QSYPVQPSGAGPAGY----------PNLNSSMEVKIPNTAINAVLGAGGSNISS 830
               P    Y  Q SG GP G+          P+   SM++      I  V+G GGSN+  
Sbjct: 278  PPMPIMGRYGSQSSGFGPGGFDDVPRGHVAEPSAEFSMKILCSAGKIGGVIGKGGSNVKV 337

Query: 829  IGQLTGAKVKLNDPQAGVSECIVEIHGS 746
            + Q TGA + + D  A   E ++ +  S
Sbjct: 338  VQQDTGASIHVEDASAESDERVIRVSAS 365


>ref|XP_007217649.1| hypothetical protein PRUPE_ppa027198mg, partial [Prunus persica]
            gi|462413799|gb|EMJ18848.1| hypothetical protein
            PRUPE_ppa027198mg, partial [Prunus persica]
          Length = 576

 Score =  484 bits (1247), Expect = e-134
 Identities = 287/572 (50%), Positives = 357/572 (62%), Gaps = 4/572 (0%)
 Frame = -3

Query: 2530 RRNTRVRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXK 2351
            R N + +    N  +KK   N+   E         +TVYRILCP+              K
Sbjct: 12   RPNVQFKRKGGNNNYKKGNWNNSTREQLLENYQSLDTVYRILCPSKRIGGVIGKGGGIVK 71

Query: 2350 AMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHC 2171
            A+R+ET AKI V ++V G+DERV+IIFSS TK     S+ +  + D+S+E E EPM PHC
Sbjct: 72   ALREETRAKITVADSVLGSDERVIIIFSSPTKI----SSKQTNDGDSSEENELEPMDPHC 127

Query: 2170 PAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLR 1991
             AQ ALLKVH RIVEE DL+G    +DD E++VVTARLLVP+N VGCLLGKGG VIQ+LR
Sbjct: 128  AAQDALLKVHNRIVEE-DLFGGVTFDDDNENSVVTARLLVPNNMVGCLLGKGGDVIQRLR 186

Query: 1990 SETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHX 1820
            SET A+IR+LP + LP CAM +DELVQISG  DV K+ALYEVSTLL+QNPR   PP    
Sbjct: 187  SETSASIRVLPADQLPTCAMETDELVQISGKPDVTKRALYEVSTLLHQNPRKDKPPSGFP 246

Query: 1819 XXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXX 1640
                                    MWS+R    H    MP   G+GN             
Sbjct: 247  MPFWGQGFRPPGAPMTNVLPPGNPMWSNRAPS-HSTAPMPRMEGYGNCSSEFALGGFNGV 305

Query: 1639 XXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVIL 1460
                 G+A AEF+MKILC   KIG VIGKGG NVKQ+Q ETGA+IHV++++ +++ERVI 
Sbjct: 306  PPGHGGEALAEFSMKILCSPGKIGGVIGKGGFNVKQLQLETGASIHVQEASTDSEERVIH 365

Query: 1459 VSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMR 1280
            VS F+A WNP S TI+AIL+LQ  T E+S+KG ITTRLLVPS+KVGC+LGQGG +I EMR
Sbjct: 366  VSAFEAFWNPISQTIEAILELQNKTSELSDKGIITTRLLVPSSKVGCILGQGGQVINEMR 425

Query: 1279 RRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEP 1100
            RR QADIRV SKD +P CA+ DEELVQISGN  VA++AL EI SRLRVR+L   +A AEP
Sbjct: 426  RRTQADIRVYSKDDRPRCAAEDEELVQISGNFGVAKDALAEITSRLRVRTLRDANAGAEP 485

Query: 1099 GPMGPFQGFGRSEGF-HDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSGAGPA 923
              +GP  GFG +                      Y+  K G  E+E QSY + PS A  +
Sbjct: 486  ALVGPAHGFGLAGSVPGGGPRQPPSAVGAGSSSRYDPLKVGRHEYELQSYRI-PSTA--S 542

Query: 922  GYPNLNSSMEVKIPNTAINAVLGAGGSNISSI 827
            GYP++N ++E KIP   +++  G  G N S+I
Sbjct: 543  GYPSVNHALEGKIPKNTVDSFPGTRGGNTSTI 574


>ref|XP_006358053.1| PREDICTED: KH domain-containing protein At4g18375-like isoform X1
            [Solanum tuberosum] gi|565383832|ref|XP_006358054.1|
            PREDICTED: KH domain-containing protein At4g18375-like
            isoform X2 [Solanum tuberosum]
          Length = 705

 Score =  482 bits (1240), Expect = e-133
 Identities = 289/650 (44%), Positives = 374/650 (57%), Gaps = 7/650 (1%)
 Frame = -3

Query: 2533 YRRNTRVRLD----KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXX 2366
            Y+RN+  +      KR  G K ++ N   +E S+     ++T Y ILC +          
Sbjct: 4    YKRNSLKQRSNPQFKRKGGSKNAKLNFSSHERSSENSQSSDTTYCILCQSKKVGSVIGKG 63

Query: 2365 XXXXKAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEP 2186
                KA+R+ET AKI V ++VPG+D+R+VI+ S STK  R  + D++ +   + E E+  
Sbjct: 64   GSIIKALREETQAKITVADSVPGSDDRIVIVSSPSTKLARRQNNDKNNDNPETKE-ENCS 122

Query: 2185 MQPHCPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHV 2006
            M+PHC AQ ALLKVH RIVEE DL G  +  +DK   V+  RLLVP+N VGCLLG+ G V
Sbjct: 123  MEPHCAAQDALLKVHNRIVEE-DLQGVQN--EDKSETVIITRLLVPNNLVGCLLGRKGDV 179

Query: 2005 IQKLRSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---P 1835
            IQKLRSETGA+IR+L  EHLPACAM++DELVQISG   + KKALYEVSTLL+QNPR   P
Sbjct: 180  IQKLRSETGASIRVLSAEHLPACAMTTDELVQISGKPALVKKALYEVSTLLHQNPRKDKP 239

Query: 1834 PMNHXXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXX 1655
              +                          MWS   +  +G       GG+ N+       
Sbjct: 240  ISSFPMVHGAQGFHPPGPPMENMIPPGKPMWSQSKTNLNGMSPALGVGGYRNQLTGFGRA 299

Query: 1654 XXXXXXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEAD 1475
                      G+A  +F MKILC A KIG VIGKGG NVKQ+QQ+TGA IHVED   E+D
Sbjct: 300  DFDYGPPPSAGEAPGDFTMKILCSAAKIGGVIGKGGFNVKQLQQDTGAGIHVEDVAPESD 359

Query: 1474 ERVILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHI 1295
            ERVI VS  ++ W+P+S TIDAILQLQ  T E S+KG +TTRLL PSNKVGC++GQGG +
Sbjct: 360  ERVIRVSSLESFWDPRSRTIDAILQLQSKTSEFSDKGIVTTRLLFPSNKVGCIIGQGGQV 419

Query: 1294 ITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVH 1115
            I EMRRR QADIRV+SKD KP CASADEELVQISG++ VA++AL EI+SRLR R L   +
Sbjct: 420  INEMRRRTQADIRVLSKDDKPRCASADEELVQISGSIGVAKDALVEISSRLRERCLRDAN 479

Query: 1114 AAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSG 935
            +  E  P+ P  GF  SE F                  YEH KG +RE++  SYP  P  
Sbjct: 480  SKVESTPVRPLPGFVPSEDFRSGDPQRSGAMGAGSSRRYEHLKGAVREYDHPSYPDLPIA 539

Query: 934  AGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEI 755
                 + N+ S  E+K P+ +  +  G GG NI    +  G   +  DP++     I +I
Sbjct: 540  ---TRFSNIRSPPEMKFPDHSYGSAKGTGGYNID---EFAGRSARFQDPRSVGPGFIDDI 593

Query: 754  HGSSDQMIAAQNLLHNFIASAAHNYNTXXXXXXXXXXXQSYNTQQQQSPY 605
             G+SD M A  N+ H     ++ N+N               ++  QQ  Y
Sbjct: 594  RGTSDHMNAGHNVFH----GSSENFNAGRPTFQGYGSPAGQSSNIQQGAY 639


>ref|XP_004234567.1| PREDICTED: KH domain-containing protein At4g18375-like [Solanum
            lycopersicum]
          Length = 707

 Score =  480 bits (1236), Expect = e-132
 Identities = 285/615 (46%), Positives = 364/615 (59%), Gaps = 7/615 (1%)
 Frame = -3

Query: 2533 YRRNTRVRLD----KRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXX 2366
            Y+RN+  +      KR  G K ++ N   +E S+     ++T Y ILC +          
Sbjct: 4    YKRNSLKQRSNPQFKRKGGSKNAKLNFSSHERSSENSQSSDTTYCILCQSKKVGSVIGKG 63

Query: 2365 XXXXKAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEP 2186
                KA+R+ET AKI V ++VPG+D+R+VII S STK  R  + D++ +   + E+ +  
Sbjct: 64   GSIIKALREETQAKITVADSVPGSDDRIVIISSPSTKLARRQNNDKNNDNPETKEENYS- 122

Query: 2185 MQPHCPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHV 2006
            M+PHC AQ ALLKVH RIVEE DL G  +  +DK   V+  RLLVP+N VGCLLG+ G V
Sbjct: 123  MEPHCAAQDALLKVHNRIVEE-DLLGVQN--EDKSEAVIITRLLVPNNLVGCLLGRKGDV 179

Query: 2005 IQKLRSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---P 1835
            IQKLRSETGA+IR+L  EHLPACAM++DELVQISG   + KKALYEVSTLL+QNPR   P
Sbjct: 180  IQKLRSETGASIRVLSAEHLPACAMTTDELVQISGKPALVKKALYEVSTLLHQNPRKDKP 239

Query: 1834 PMNHXXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXX 1655
              +                          MWS   +  +G P     GG+ N+       
Sbjct: 240  ISSFPMVHGAQGFHPPGPPMENMIPPGKPMWSQSKTNLNGMPPALGVGGYRNQLTGFGRA 299

Query: 1654 XXXXXXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEAD 1475
                      G+A  +F MKILC A KIG VIGKGG NVKQ+QQETGA IHVED   E+D
Sbjct: 300  DFDYGPPPSAGEAPGDFTMKILCSAAKIGGVIGKGGFNVKQLQQETGAGIHVEDVAPESD 359

Query: 1474 ERVILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHI 1295
            ERVI VS  ++ W+P+S TIDAILQLQ  T E S+KG +TTRLL PSNKVGC++GQGG +
Sbjct: 360  ERVIRVSSLESFWDPRSRTIDAILQLQSKTSEFSDKGIVTTRLLFPSNKVGCIIGQGGQV 419

Query: 1294 ITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVH 1115
            I EMRRR QADIRV+SKD KP CASADEELVQISG++ VA++AL EI+SRLR R L   +
Sbjct: 420  INEMRRRTQADIRVLSKDEKPRCASADEELVQISGSIGVAKDALVEISSRLRERCLRDAN 479

Query: 1114 AAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXXXXGYEHSKGGIREFEPQSYPVQPSG 935
            +  E   + P  GF  SE F  R               YEH KG +RE +  SYP  P  
Sbjct: 480  SKVESTHVRPLPGFVPSEDFRSRDPQRSGVMGAGSSRRYEHLKGAVRECDHPSYPDLPIA 539

Query: 934  AGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEI 755
                 + N  S  E+K P+ +     G GG NI    +  G+  +  DP++     + +I
Sbjct: 540  ---TRFSNTRSPPEMKFPDHSYGTAKGTGGYNID---EFAGSSARYQDPRSIGPGFVDDI 593

Query: 754  HGSSDQMIAAQNLLH 710
             G+SD+M A  N+ H
Sbjct: 594  RGTSDRMNAGHNVFH 608


>ref|XP_006383459.1| hypothetical protein POPTR_0005s15690g [Populus trichocarpa]
            gi|550339070|gb|ERP61256.1| hypothetical protein
            POPTR_0005s15690g [Populus trichocarpa]
          Length = 563

 Score =  469 bits (1207), Expect = e-129
 Identities = 261/488 (53%), Positives = 313/488 (64%), Gaps = 6/488 (1%)
 Frame = -3

Query: 2491 GFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVD 2312
            G   SR+  +            +TVYRILCP+              KA+R+ET +KI V 
Sbjct: 21   GVSSSRKGKWSDSHGEECSGDGDTVYRILCPSRKIGGVIGKGGNIVKALREETQSKITVA 80

Query: 2311 EAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERI 2132
            ++V G+DERV+II+SSS K PR    DE L    +   + E  +PHC AQ ALLKVH+RI
Sbjct: 81   DSVQGSDERVIIIYSSSDKPPRKMDGDEGLP---AGNGQQEAFEPHCAAQDALLKVHDRI 137

Query: 2131 VEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPE 1952
            VEE DL+G    +DD ++NVVTARLLVP+N VGC+LGK G VIQ+LRSETGANIR+LP +
Sbjct: 138  VEE-DLFGGMASDDDNDNNVVTARLLVPNNMVGCVLGKRGDVIQRLRSETGANIRVLPAD 196

Query: 1951 HLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPRP------PMNHXXXXXXXXXXX 1790
            HLP+CAM +DELVQISG   VAK+ALYE+S LL+QNPR       PM +           
Sbjct: 197  HLPSCAMDTDELVQISGKPAVAKRALYEISILLHQNPRKDKLPSVPMPYGGRTFHPPSDS 256

Query: 1789 XXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASA 1610
                           W HRNS    P SMPW G +GN                   + SA
Sbjct: 257  MANMLPPGNPM----WPHRNST---PHSMPWMGEYGNHPSEFGPGGFNGVPPGHGREPSA 309

Query: 1609 EFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNP 1430
            EF+MKILC   KIG VIGKGGSNVK +QQETGA+IHVED++ E++ER I VS F+  WNP
Sbjct: 310  EFSMKILCSTGKIGGVIGKGGSNVKIVQQETGASIHVEDASAESEERAIRVSAFEGLWNP 369

Query: 1429 QSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVI 1250
            +S TIDAILQLQ  T + SEKG I TRLLVPS+KVGC+LGQGG +I EMRRR QADIRV 
Sbjct: 370  RSQTIDAILQLQDKTSDFSEKGMIITRLLVPSSKVGCILGQGGQVINEMRRRLQADIRVY 429

Query: 1249 SKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFG 1070
             K+ KP CAS DEELVQISGN  VA++AL EIASRLR R+L   +A  EPGP GP  GFG
Sbjct: 430  PKNDKPKCASDDEELVQISGNYGVAKDALAEIASRLRARTLRDANAGTEPGPAGPVPGFG 489

Query: 1069 RSEGFHDR 1046
             +     R
Sbjct: 490  PARNLPGR 497


>gb|AEV43362.1| poly C-binding protein [Citrus sinensis]
          Length = 569

 Score =  426 bits (1096), Expect = e-116
 Identities = 228/418 (54%), Positives = 283/418 (67%), Gaps = 3/418 (0%)
 Frame = -3

Query: 2494 IGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKV 2315
            +G KK   ++   E S G    A+TVYRILCP+              K++R+ET AKI V
Sbjct: 23   VGIKKGNWSNSSREQSFGNSQPADTVYRILCPSRKIGGVIGKAGNIVKSLREETQAKITV 82

Query: 2314 DEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHER 2135
             + +PG++ERV+II+SS TK  +  + D+D    ++ E + E M+PHC AQ ALLKVH+R
Sbjct: 83   ADTIPGSEERVIIIYSSPTKIAKTQNKDDD----SAAETKKESMEPHCAAQDALLKVHDR 138

Query: 2134 IVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPP 1955
            I+EE DL+G    +DD E++ +TARLLVP+N VGCLLGK G VIQ+LRSETGANIR+LP 
Sbjct: 139  IIEE-DLFGGMASDDDNENSTITARLLVPNNMVGCLLGKRGDVIQRLRSETGANIRVLPA 197

Query: 1954 EHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXX 1784
            + LP CAM++DE+VQISG  +VAK+ALYEVSTLL+QNPR   PP +              
Sbjct: 198  DRLPPCAMNTDEMVQISGKPNVAKRALYEVSTLLHQNPRKDKPPSSFPQAYGGQNFHSPP 257

Query: 1783 XXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEF 1604
                         W  RNS  HG PS PW GG+G++  R              G+ SAEF
Sbjct: 258  APMADMHPLGNSSWPARNSSLHGMPSTPWMGGYGDQPSRMGSGSINSCPPGQMGEVSAEF 317

Query: 1603 NMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQS 1424
            +MKILC A KIG VIGKGG NVKQ+QQETGA+IHVED+  ++DERVI  S F+  WNP+S
Sbjct: 318  SMKILCSAGKIGGVIGKGGFNVKQLQQETGASIHVEDAPTDSDERVIRASAFEGLWNPRS 377

Query: 1423 PTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVI 1250
             TIDAILQLQ  T E SEKGTITTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV+
Sbjct: 378  QTIDAILQLQNKTSEFSEKGTITTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVV 435



 Score =  123 bits (308), Expect = 5e-25
 Identities = 96/323 (29%), Positives = 145/323 (44%), Gaps = 44/323 (13%)
 Frame = -3

Query: 1597 KILCPADKIGVVIGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAP------- 1439
            +ILCP+ KIG VIGK G+ VK +++ET A I V D+   ++ERVI++  + +P       
Sbjct: 50   RILCPSRKIGGVIGKAGNIVKSLREETQAKITVADTIPGSEERVIII--YSSPTKIAKTQ 107

Query: 1438 --------------WNPQSPTIDAILQLQ-----------VHTHEISEKGTITTRLLVPS 1334
                            P     DA+L++            + + + +E  TIT RLLVP+
Sbjct: 108  NKDDDSAAETKKESMEPHCAAQDALLKVHDRIIEEDLFGGMASDDDNENSTITARLLVPN 167

Query: 1333 NKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEI 1154
            N VGCLLG+ G +I  +R    A+IRV+  D  P CA   +E+VQISG  NVA+ AL E+
Sbjct: 168  NMVGCLLGKRGDVIQRLRSETGANIRVLPADRLPPCAMNTDEMVQISGKPNVAKRALYEV 227

Query: 1153 ASRLR------------VRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXXXXXXXXX 1010
            ++ L              ++  G +  + P PM      G S  +  R            
Sbjct: 228  STLLHQNPRKDKPPSSFPQAYGGQNFHSPPAPMADMHPLGNS-SWPARNSSLHGMPSTPW 286

Query: 1009 XXGYEHSKGGIREFEPQSYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISS 830
              GY     G +     S  +     G  G  +   SM++      I  V+G GG N+  
Sbjct: 287  MGGY-----GDQPSRMGSGSINSCPPGQMGEVSAEFSMKILCSAGKIGGVIGKGGFNVKQ 341

Query: 829  IGQLTGAKVKLNDPQAGVSECIV 761
            + Q TGA + + D      E ++
Sbjct: 342  LQQETGASIHVEDAPTDSDERVI 364


>ref|XP_007033281.1| RNA-binding KH domain-containing protein isoform 6, partial
            [Theobroma cacao] gi|508712310|gb|EOY04207.1| RNA-binding
            KH domain-containing protein isoform 6, partial
            [Theobroma cacao]
          Length = 530

 Score =  378 bits (970), Expect = e-102
 Identities = 224/442 (50%), Positives = 269/442 (60%), Gaps = 6/442 (1%)
 Frame = -3

Query: 1912 QISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXXXXXXXXXXXMW 1742
            +ISG  DVAK+ALYEVSTLL+QNPR   PP++                           W
Sbjct: 39   KISGKRDVAKRALYEVSTLLHQNPRKDNPPLSFPVPHAGQNFPPPSAIPPSNPM-----W 93

Query: 1741 SHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMKILCPADKIGVV 1562
            SHRNS  H  PSMPW    GN                   +ASAEF+MKILCPA KIG V
Sbjct: 94   SHRNSSPHDIPSMPWMVPHGNRPSGFGPGSLSSFPPARGAEASAEFSMKILCPAGKIGGV 153

Query: 1561 IGKGGSNVKQIQQETGANIHVEDSTGEADERVILVSCFDAPWNPQSPTIDAILQLQVHTH 1382
            IGKGG NVKQ+QQETGA IHVED+T E+DERVI VS  +  WNP+S TIDAILQLQ  T 
Sbjct: 154  IGKGGFNVKQLQQETGAGIHVEDATIESDERVIRVSAIEGLWNPRSQTIDAILQLQNKTS 213

Query: 1381 EISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDGKPTCASADEELV 1202
            E SEKGT+TTRLLVPS+KVGC+LGQGGH+I EMRRR QADIRV SKD KP CAS DEELV
Sbjct: 214  EFSEKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDEELV 273

Query: 1201 QISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQGFGRSEGFHDRXXXXXXXX 1022
            QISGN  VA++AL EIASRLRVR+L  V+A AE  P+GP  GFG +    D         
Sbjct: 274  QISGNFGVAKDALAEIASRLRVRTLRDVNAGAETAPVGPVNGFGPARSMADGLPPAAAIG 333

Query: 1021 XXXXXXGYEHSKGGIREFEPQ-SYPVQPSGAGPAGYPNLNSSMEVKIPNTAINAVLGAGG 845
                        GG RE+EPQ +Y V P+      Y N+N ++E KI N   ++V G GG
Sbjct: 334  PGRSGGYESFRGGGGREYEPQNNYSVPPAA---VRYSNMNGALEAKILNNMSSSVTGTGG 390

Query: 844  SNISSIGQLTGAKVKLNDPQAGVSECIVEIHGSSDQMIAAQNLLHNFIASAAHNYNTXXX 665
            S I +  +++GA+V+L DPQ G SE   E  GSS+ + AAQ++  +F+ S+  + N    
Sbjct: 391  STILN-SEVSGARVRLEDPQTGGSE---EFRGSSEHLTAAQSIFQSFMPSSGQSMNAQQS 446

Query: 664  XXXXXXXXQS--YNTQQQQSPY 605
                    QS   N   Q+SPY
Sbjct: 447  SYQNLSVQQSSYLNMNAQRSPY 468



 Score = 94.7 bits (234), Expect = 2e-16
 Identities = 68/202 (33%), Positives = 95/202 (47%)
 Frame = -3

Query: 2458 YEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEAVPGTDERVV 2279
            + P+ G    AE   +ILCP               K ++ ET A I V++A   +DERV+
Sbjct: 127  FPPARGAEASAEFSMKILCPAGKIGGVIGKGGFNVKQLQQETGAGIHVEDATIESDERVI 186

Query: 2278 IIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVEEEDLYGASD 2099
             +           SA E L              P      A+L++  +  E         
Sbjct: 187  RV-----------SAIEGL------------WNPRSQTIDAILQLQNKTSEFS------- 216

Query: 2098 HEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHLPACAMSSDE 1919
                 E   VT RLLVPS++VGC+LG+GGHVI ++R  T A+IR+   +  P CA   +E
Sbjct: 217  -----EKGTVTTRLLVPSSKVGCILGQGGHVINEMRRRTQADIRVYSKDDKPKCASEDEE 271

Query: 1918 LVQISGASDVAKKALYEVSTLL 1853
            LVQISG   VAK AL E+++ L
Sbjct: 272  LVQISGNFGVAKDALAEIASRL 293


>ref|XP_006661902.1| PREDICTED: KH domain-containing protein At4g18375-like [Oryza
            brachyantha]
          Length = 773

 Score =  362 bits (930), Expect = 4e-97
 Identities = 211/474 (44%), Positives = 287/474 (60%), Gaps = 4/474 (0%)
 Frame = -3

Query: 2485 KKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXXKAMRDETHAKIKVDEA 2306
            +K +R +  ++  +      ET+YRILCP               KA+RDET AKI+V ++
Sbjct: 25   RKRKRLNTKHDDGSMSSQPTETIYRILCPVKKIGSVLGRGGDVVKALRDETKAKIRVADS 84

Query: 2305 VPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPHCPAQHALLKVHERIVE 2126
            +PG DERV+IIF+     P  S  DE  +  ++D  ++  M+PHC AQ ALLK+H++I E
Sbjct: 85   IPGADERVIIIFNY----PSQSEDDEAAQNISTDGFQN--MKPHCFAQDALLKIHDKIAE 138

Query: 2125 EEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKLRSETGANIRILPPEHL 1946
            +EDL+G  DHE  +  + VTAR+LVP NQVGCLLGKGG +IQ+LR+ETGA IR+LP E+L
Sbjct: 139  DEDLHGGIDHEKSETVDGVTARILVPGNQVGCLLGKGGSIIQQLRNETGAGIRVLPSENL 198

Query: 1945 PACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNHXXXXXXXXXXXXXXXX 1775
            P CA+ SDELVQISGA  + +KALYE+ST L+Q+PR   PP+                  
Sbjct: 199  PQCALKSDELVQISGAPSLVRKALYEISTRLHQHPRKDNPPLEEIIDASTQRKHQSPPQL 258

Query: 1774 XXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXXXXXXXXGKASAEFNMK 1595
                     +    +      P +P    + N  LR                 + EF++K
Sbjct: 259  PHANPMLPHLHVDHS------PQIPLLDPYRNGPLRYHAG------------EAEEFSIK 300

Query: 1594 ILCPADKIGVVIGKGGSNVKQIQQETGANIHVED-STGEADERVILVSCFDAPWNPQSPT 1418
            ILC ++ IG VIGK G NV+Q++Q+TGA I V++     + ER+I+VS  + P +P SPT
Sbjct: 301  ILCASEHIGQVIGKSGGNVRQVEQQTGARIQVKEVGKNASGERLIVVSSQEIPDDPVSPT 360

Query: 1417 IDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITEMRRRCQADIRVISKDG 1238
            I+A++ L       +E   +TTRL+VPSNKVGC+LG+GG +ITEMRRR  A+IRV SK  
Sbjct: 361  IEALILLHSKVSAPAENRHLTTRLVVPSNKVGCILGEGGKVITEMRRRTGAEIRVYSKAD 420

Query: 1237 KPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAAEPGPMGPFQG 1076
            KP   S DEELVQ++G   +AR ALTEIASRLR R+L    ++  P P  PF G
Sbjct: 421  KPKYLSFDEELVQVAGLPAIARGALTEIASRLRTRTLRDGSSSNNPPPFAPFDG 474



 Score = 72.8 bits (177), Expect = 8e-10
 Identities = 37/83 (44%), Positives = 56/83 (67%)
 Frame = -3

Query: 931 GPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEIH 752
           G  GYP    S+E++IPN+ +  V+G GG+N++ I Q++GA+VKL +     SE IVEI 
Sbjct: 680 GFTGYPG--GSVELRIPNSYLETVIGVGGANLAEIRQISGARVKLLETHPASSESIVEIQ 737

Query: 751 GSSDQMIAAQNLLHNFIASAAHN 683
           G  DQ+ AAQ+LL  FI ++ ++
Sbjct: 738 GVPDQVKAAQSLLQGFIGASGNS 760


>gb|AAP54423.2| KH domain containing protein, expressed [Oryza sativa Japonica Group]
          Length = 677

 Score =  353 bits (907), Expect = 2e-94
 Identities = 206/490 (42%), Positives = 291/490 (59%), Gaps = 4/490 (0%)
 Frame = -3

Query: 2533 YRRNTRVRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXX 2354
            ++ NT  +    N    K +R +  ++         ET+YRILCP               
Sbjct: 9    HKSNTSRKRPHFNSDDGKRKRLNSRHDDGTISSEPIETIYRILCPVKKIGSVLGRGGDIV 68

Query: 2353 KAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPH 2174
            KA+RD T AKI+V +++PG DERV+IIF+ S++T  A+   +++ TD       E M+PH
Sbjct: 69   KALRDTTKAKIRVADSIPGADERVIIIFNYSSQTEEAA---QNISTDG-----FEDMKPH 120

Query: 2173 CPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKL 1994
            C AQ ALLK+H++I  +EDL+    HE  +  + V AR+LVP NQVGCLLGKGG +IQ+L
Sbjct: 121  CFAQDALLKIHDKIAADEDLHAGIVHEKSENVDDVIARILVPGNQVGCLLGKGGSIIQQL 180

Query: 1993 RSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNH 1823
            R++TGA IR+LP E+LP CA+ SDELVQISG+S + +KALYE+ST L+Q+PR   PP+  
Sbjct: 181  RNDTGAGIRVLPSENLPQCALKSDELVQISGSSSLVRKALYEISTRLHQHPRKDNPPLEE 240

Query: 1822 XXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXX 1643
                                     +    +      P +P    + N  L+        
Sbjct: 241  IIDASTQRKHQAPPQLPHANPMLPHLHVDHS------PQIPLLDPYRNRPLQ-------- 286

Query: 1642 XXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVED-STGEADERV 1466
                     + EF++KILC ++ IG VIGK G NV+Q++Q+TGA + V++     ++ER+
Sbjct: 287  ----YHSAEAEEFSIKILCASEHIGQVIGKSGGNVRQVEQQTGACVQVKEVGKNASEERL 342

Query: 1465 ILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITE 1286
            I+VS  + P +P SPTI+A++ L      ++E   +TTRL+VPSNKVGC++G+GG +ITE
Sbjct: 343  IVVSSQEIPDDPVSPTIEALILLHSKVSTLAENHHLTTRLVVPSNKVGCIIGEGGKVITE 402

Query: 1285 MRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAA 1106
            MRRR  A+IRV SK  KP   S DEELVQ++G   +AR ALTEIASRLR R+L    ++ 
Sbjct: 403  MRRRTGAEIRVYSKADKPKYLSFDEELVQVAGLPAIARGALTEIASRLRTRTLRDGSSSN 462

Query: 1105 EPGPMGPFQG 1076
             P P  PF G
Sbjct: 463  NPTPFAPFDG 472


>ref|NP_001064947.1| Os10g0495000 [Oryza sativa Japonica Group] gi|22128716|gb|AAM92828.1|
            putative RNA-binding protein [Oryza sativa Japonica
            Group] gi|110289325|gb|ABB47821.2| KH domain containing
            protein, expressed [Oryza sativa Japonica Group]
            gi|113639556|dbj|BAF26861.1| Os10g0495000 [Oryza sativa
            Japonica Group] gi|215694845|dbj|BAG90036.1| unnamed
            protein product [Oryza sativa Japonica Group]
          Length = 762

 Score =  353 bits (907), Expect = 2e-94
 Identities = 206/490 (42%), Positives = 291/490 (59%), Gaps = 4/490 (0%)
 Frame = -3

Query: 2533 YRRNTRVRLDKRNIGFKKSRRNDYGYEPSAGIPNGAETVYRILCPTXXXXXXXXXXXXXX 2354
            ++ NT  +    N    K +R +  ++         ET+YRILCP               
Sbjct: 9    HKSNTSRKRPHFNSDDGKRKRLNSRHDDGTISSEPIETIYRILCPVKKIGSVLGRGGDIV 68

Query: 2353 KAMRDETHAKIKVDEAVPGTDERVVIIFSSSTKTPRASSADEDLETDASDEKEHEPMQPH 2174
            KA+RD T AKI+V +++PG DERV+IIF+ S++T  A+   +++ TD       E M+PH
Sbjct: 69   KALRDTTKAKIRVADSIPGADERVIIIFNYSSQTEEAA---QNISTDG-----FEDMKPH 120

Query: 2173 CPAQHALLKVHERIVEEEDLYGASDHEDDKESNVVTARLLVPSNQVGCLLGKGGHVIQKL 1994
            C AQ ALLK+H++I  +EDL+    HE  +  + V AR+LVP NQVGCLLGKGG +IQ+L
Sbjct: 121  CFAQDALLKIHDKIAADEDLHAGIVHEKSENVDDVIARILVPGNQVGCLLGKGGSIIQQL 180

Query: 1993 RSETGANIRILPPEHLPACAMSSDELVQISGASDVAKKALYEVSTLLYQNPR---PPMNH 1823
            R++TGA IR+LP E+LP CA+ SDELVQISG+S + +KALYE+ST L+Q+PR   PP+  
Sbjct: 181  RNDTGAGIRVLPSENLPQCALKSDELVQISGSSSLVRKALYEISTRLHQHPRKDNPPLEE 240

Query: 1822 XXXXXXXXXXXXXXXXXXXXXXXXXMWSHRNSGFHGPPSMPWGGGFGNEFLRXXXXXXXX 1643
                                     +    +      P +P    + N  L+        
Sbjct: 241  IIDASTQRKHQAPPQLPHANPMLPHLHVDHS------PQIPLLDPYRNRPLQ-------- 286

Query: 1642 XXXXXXGKASAEFNMKILCPADKIGVVIGKGGSNVKQIQQETGANIHVED-STGEADERV 1466
                     + EF++KILC ++ IG VIGK G NV+Q++Q+TGA + V++     ++ER+
Sbjct: 287  ----YHSAEAEEFSIKILCASEHIGQVIGKSGGNVRQVEQQTGACVQVKEVGKNASEERL 342

Query: 1465 ILVSCFDAPWNPQSPTIDAILQLQVHTHEISEKGTITTRLLVPSNKVGCLLGQGGHIITE 1286
            I+VS  + P +P SPTI+A++ L      ++E   +TTRL+VPSNKVGC++G+GG +ITE
Sbjct: 343  IVVSSQEIPDDPVSPTIEALILLHSKVSTLAENHHLTTRLVVPSNKVGCIIGEGGKVITE 402

Query: 1285 MRRRCQADIRVISKDGKPTCASADEELVQISGNVNVARNALTEIASRLRVRSLEGVHAAA 1106
            MRRR  A+IRV SK  KP   S DEELVQ++G   +AR ALTEIASRLR R+L    ++ 
Sbjct: 403  MRRRTGAEIRVYSKADKPKYLSFDEELVQVAGLPAIARGALTEIASRLRTRTLRDGSSSN 462

Query: 1105 EPGPMGPFQG 1076
             P P  PF G
Sbjct: 463  NPTPFAPFDG 472



 Score = 77.8 bits (190), Expect = 3e-11
 Identities = 39/84 (46%), Positives = 60/84 (71%)
 Frame = -3

Query: 934 AGPAGYPNLNSSMEVKIPNTAINAVLGAGGSNISSIGQLTGAKVKLNDPQAGVSECIVEI 755
           +G  GYP    S+E +IPN+ + +V+GAGG N++ I Q++GA+VKL++   G SE IVEI
Sbjct: 668 SGLTGYPG--GSVEFRIPNSYLESVIGAGGVNLAEIRQISGARVKLHEAHPGSSESIVEI 725

Query: 754 HGSSDQMIAAQNLLHNFIASAAHN 683
            G  DQ+ AAQ+LL  FI +++++
Sbjct: 726 QGIPDQVKAAQSLLQGFIGASSNS 749


Top