BLASTX nr result

ID: Akebia22_contig00021607 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00021607
         (1646 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   161   9e-37
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   160   1e-36
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   149   4e-33
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   148   6e-33
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   145   4e-32
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A...   138   6e-30
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   138   8e-30
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   137   1e-29
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   137   1e-29
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   137   2e-29
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   130   2e-27
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   127   1e-26
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   126   3e-26
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   123   2e-25
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   120   1e-24
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   120   1e-24
emb|CAN66828.1| hypothetical protein VITISV_015886 [Vitis vinifera]   115   6e-23
ref|XP_002312652.1| RNA recognition motif-containing family prot...   114   2e-22
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   113   3e-22
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   112   4e-22

>ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis
            vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED:
            uncharacterized protein LOC100268141 isoform 2 [Vitis
            vinifera]
          Length = 647

 Score =  161 bits (407), Expect = 9e-37
 Identities = 99/229 (43%), Positives = 110/229 (48%), Gaps = 2/229 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMG+GFD                      PSFP +NT+GL GVAPHVNPAFF   
Sbjct: 420  MHPQGMMGSGFDPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRG 479

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           GHHAGMWTDTS+GG WGG+EHGR T+E              GE  
Sbjct: 480  MAANGMGMMGATGMDGHHAGMWTDTSMGG-WGGEEHGRRTRESSYGGDDGASDYGYGEVN 538

Query: 1289 HER-GRSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYRDHRLX 1113
            HE+ GRSN  S EK+RGSERDWSGN                        E DGYRDHR  
Sbjct: 539  HEKVGRSNTASREKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQR 598

Query: 1112 XXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                     + ++DHRSRSRD DYGKRRRLPSE
Sbjct: 599  ERDFNNEDDWDRGQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 647


>ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica]
            gi|462422613|gb|EMJ26876.1| hypothetical protein
            PRUPE_ppa002814mg [Prunus persica]
          Length = 630

 Score =  160 bits (406), Expect = 1e-36
 Identities = 101/233 (43%), Positives = 109/233 (46%), Gaps = 6/233 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            M+PQ MMGAGFD                       SFP +NT+GL GVAPHVNPAFF   
Sbjct: 399  MNPQGMMGAGFDPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRG 458

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           GHHAGMW D S+GG WGGDEHGR T+E              GEA 
Sbjct: 459  MATNGMGMMGSSGMDGHHAGMWNDPSMGG-WGGDEHGRRTRESSYGGDDGASEYGYGEAN 517

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXE----ADGYRD 1125
            HE+G RSNAPS E++RGSERDWSGN                              D YRD
Sbjct: 518  HEKGGRSNAPSRERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRD 577

Query: 1124 HRLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            HR                           M EDDHRSRSRDVDYGKRRRLPSE
Sbjct: 578  HRQRERDVGYEDDWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 630


>ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|590695496|ref|XP_007044905.1| RNA-binding family
            protein isoform 1 [Theobroma cacao]
            gi|590695500|ref|XP_007044906.1| RNA-binding family
            protein isoform 1 [Theobroma cacao]
            gi|508708839|gb|EOY00736.1| RNA-binding family protein
            isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1|
            RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|508708841|gb|EOY00738.1| RNA-binding family protein
            isoform 1 [Theobroma cacao]
          Length = 652

 Score =  149 bits (376), Expect = 4e-33
 Identities = 94/231 (40%), Positives = 107/231 (46%), Gaps = 4/231 (1%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMGAGFD                      PSFP +NT+GL GVAPHVNPAFF   
Sbjct: 423  MHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRG 482

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G HAGMWTD S+GG WGGDEHGR T+E              G+A 
Sbjct: 483  MAPNGMGMMGASGMDGPHAGMWTDASMGG-WGGDEHGRRTRESSYGGEDGASEYGYGDAN 541

Query: 1289 HERGRSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDHR 1119
            HE+GRS+  S EK+R SER+WSGN                             D YR+HR
Sbjct: 542  HEKGRSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHR 601

Query: 1118 LXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                       M E++HRSRSRDVDYGK+RRLPSE
Sbjct: 602  HRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLPSE 652


>ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Solanum tuberosum]
            gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and
            polyadenylation specificity factor subunit CG7185-like
            isoform X2 [Solanum tuberosum]
          Length = 648

 Score =  148 bits (374), Expect = 6e-33
 Identities = 94/229 (41%), Positives = 106/229 (46%), Gaps = 2/229 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMG GFD                      P F  +N +GLPGVAPHVNPAFF   
Sbjct: 421  MHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRG 480

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G H GMWTDTS GGGWGG+EHGR T+E              GE +
Sbjct: 481  MAANGMGMMSAAGMDGPHPGMWTDTS-GGGWGGEEHGRRTRESSYGGEDNASEYGYGEVS 539

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYRDHRLX 1113
            H++G RS+A S EK+RGSERDWSGN                        E DGYRD+R  
Sbjct: 540  HDKGARSSAVSREKERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQK 599

Query: 1112 XXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                      QE+DHRSRSRD +YGKRRR PSE
Sbjct: 600  ERESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSRDTNYGKRRRAPSE 648


>ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|590695488|ref|XP_007044903.1| RNA-binding family
            protein isoform 1 [Theobroma cacao]
            gi|508708837|gb|EOY00734.1| RNA-binding family protein
            isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1|
            RNA-binding family protein isoform 1 [Theobroma cacao]
          Length = 653

 Score =  145 bits (367), Expect = 4e-32
 Identities = 93/231 (40%), Positives = 105/231 (45%), Gaps = 4/231 (1%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMGAGFD                      PSFP +NT+GL GVAPHVNPAFF   
Sbjct: 424  MHPQGMMGAGFDPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRG 483

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G H GMWTDTS+GG WGGDEHGR T+E              G+A 
Sbjct: 484  MAPNGMGMMGGPGMDGPHVGMWTDTSMGG-WGGDEHGRRTRESSYGGEDGASEYGYGDAN 542

Query: 1289 HERGRSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDHR 1119
            HE+GRS+  S EK+R S+R+WSGN                             D YR+HR
Sbjct: 543  HEKGRSSGASREKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHR 602

Query: 1118 LXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                       M E+  RSRSRDVDYGKRRRLPSE
Sbjct: 603  HRERDLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 653


>ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda]
            gi|548855834|gb|ERN13697.1| hypothetical protein
            AMTR_s00049p00146760 [Amborella trichopoda]
          Length = 659

 Score =  138 bits (348), Expect = 6e-30
 Identities = 92/234 (39%), Positives = 101/234 (43%), Gaps = 7/234 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXP---SFPTMNTVGLPGVAPHVNPAFF 1476
            +HPQ MMG+GFD                          SF  M TVGLPGVAPHVNPAFF
Sbjct: 430  LHPQGMMGSGFDPTYGAHLGRGSGYGGFSGPHFPGMLPSFSPMGTVGLPGVAPHVNPAFF 489

Query: 1475 XXXXXXXXXXXXXXXXXXGHHAGMWTDTSVGGG--WGGDEHGRTKEXXXXXXXXXXXXXX 1302
                              GHH GMW D+S+GGG  WG +EHGR                 
Sbjct: 490  GRGVSANGMGMMGSGAMDGHHGGMWGDSSMGGGVGWGNEEHGRRTRESSYGDDGASDYGY 549

Query: 1301 GEATHERG--RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYR 1128
            G+  HERG  RSN P  EKDRGSERDWS                            DGY 
Sbjct: 550  GDGGHERGGGRSN-PGREKDRGSERDWSSGPERRHRDDRDSDWDRDPRYKDEK---DGYS 605

Query: 1127 DHRLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            DHR                          MMQE+D RSRS+DVDYGKRRR+PSE
Sbjct: 606  DHRQRERDWDNEDDWDRGRTSSRSRSKSRMMQEEDQRSRSKDVDYGKRRRVPSE 659


>ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina]
            gi|557540375|gb|ESR51419.1| hypothetical protein
            CICLE_v10030915mg [Citrus clementina]
          Length = 658

 Score =  138 bits (347), Expect = 8e-30
 Identities = 94/232 (40%), Positives = 105/232 (45%), Gaps = 5/232 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ+MMG GFD                      PSFP +N +GL GVAPHVNPAFF   
Sbjct: 429  MHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRG 487

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G H GMWTD+S+GG W G+EHGR T+E              GEA 
Sbjct: 488  MAANGMGMMGSSGMDGPHPGMWTDSSMGG-WVGEEHGRRTRESSYGGDDGASDYGYGEAN 546

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDH 1122
            HE+G RS A S EKDRGSERDWSGN                             D YRD 
Sbjct: 547  HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 606

Query: 1121 RLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            R                           + ++DHRSRSRDVDYGKRRRLPSE
Sbjct: 607  RQRDRDSTYDDNWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658


>ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Citrus sinensis]
          Length = 655

 Score =  137 bits (346), Expect = 1e-29
 Identities = 94/232 (40%), Positives = 105/232 (45%), Gaps = 5/232 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ+MMG GFD                      PSFP +N +GL GVAPHVNPAFF   
Sbjct: 426  MHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRG 484

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G H GMWTD+S+GG W G+EHGR T+E              GEA 
Sbjct: 485  MAANGMGMMGSSGMDGPHPGMWTDSSMGG-WLGEEHGRRTRESSYGGDDGASDYGYGEAN 543

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDH 1122
            HE+G RS A S EKDRGSERDWSGN                             D YRD 
Sbjct: 544  HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 603

Query: 1121 RLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            R                           + ++DHRSRSRDVDYGKRRRLPSE
Sbjct: 604  RQRDRDSTYDDNWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655


>ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Citrus sinensis]
          Length = 658

 Score =  137 bits (346), Expect = 1e-29
 Identities = 94/232 (40%), Positives = 105/232 (45%), Gaps = 5/232 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ+MMG GFD                      PSFP +N +GL GVAPHVNPAFF   
Sbjct: 429  MHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRG 487

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G H GMWTD+S+GG W G+EHGR T+E              GEA 
Sbjct: 488  MAANGMGMMGSSGMDGPHPGMWTDSSMGG-WLGEEHGRRTRESSYGGDDGASDYGYGEAN 546

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDH 1122
            HE+G RS A S EKDRGSERDWSGN                             D YRD 
Sbjct: 547  HEKGARSTAASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 606

Query: 1121 RLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            R                           + ++DHRSRSRDVDYGKRRRLPSE
Sbjct: 607  RQRDRDSTYDDNWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658


>ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina]
            gi|567891321|ref|XP_006438181.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
            gi|557540376|gb|ESR51420.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
            gi|557540377|gb|ESR51421.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
          Length = 655

 Score =  137 bits (344), Expect = 2e-29
 Identities = 93/232 (40%), Positives = 105/232 (45%), Gaps = 5/232 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ+MMG GFD                      PSFP +N +GL GVAPHVNPAFF   
Sbjct: 426  MHPQNMMG-GFDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRG 484

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G H GMWTD+S+GG W G+EHGR T+E              GEA+
Sbjct: 485  MAANGMGMMGSSGMDGPHPGMWTDSSMGG-WVGEEHGRRTRESSYGGDDGASDYGYGEAS 543

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDH 1122
            HE+G RS   S EKDRGSERDWSGN                             D YRD 
Sbjct: 544  HEKGARSTTASREKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDR 603

Query: 1121 RLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            R                           + ++DHRSRSRDVDYGKRRRLPSE
Sbjct: 604  RQRDRDSTYDDNWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655


>ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao]
            gi|508708843|gb|EOY00740.1| RNA-binding family protein
            isoform 5, partial [Theobroma cacao]
          Length = 656

 Score =  130 bits (327), Expect = 2e-27
 Identities = 85/226 (37%), Positives = 99/226 (43%), Gaps = 4/226 (1%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMGAGFD                      PSFP +NT+GL GVAPHVNPAFF   
Sbjct: 423  MHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRG 482

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           G HAGMWTD S+GG WGGDEHGR T+E              G+A 
Sbjct: 483  MAPNGMGMMGASGMDGPHAGMWTDASMGG-WGGDEHGRRTRESSYGGEDGASEYGYGDAN 541

Query: 1289 HERGRSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDHR 1119
            HE+GRS+  S EK+R SER+WSGN                             D YR+HR
Sbjct: 542  HEKGRSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHR 601

Query: 1118 LXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRR 981
                                       M E++HRSRSRDV Y + +
Sbjct: 602  HRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647


>gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus
            notabilis]
          Length = 636

 Score =  127 bits (319), Expect = 1e-26
 Identities = 85/231 (36%), Positives = 98/231 (42%), Gaps = 4/231 (1%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            M+PQ MMG GFD                      PSFP +NT+G   VAPHVNPAFF   
Sbjct: 414  MNPQGMMGTGFDPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRG 473

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEAT 1290
                           GH  GMW D S+GG WGG+EHGR T+E              G+  
Sbjct: 474  MTNNGMGMVGSSLMDGHQGGMWNDPSIGG-WGGEEHGRRTRESSYGGDDGASEYGYGDTN 532

Query: 1289 HERGRSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDHR 1119
            HE+G        ++RGSERDWSGN                             DG RD+R
Sbjct: 533  HEKGG-------RERGSERDWSGNSERRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYR 585

Query: 1118 LXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                      ++QED HRSRSRDVDYGKRRRLPSE
Sbjct: 586  PKERELDYEDDWDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRRRLPSE 636


>ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis]
            gi|223546091|gb|EEF47594.1| RNA binding protein, putative
            [Ricinus communis]
          Length = 644

 Score =  126 bits (316), Expect = 3e-26
 Identities = 86/229 (37%), Positives = 100/229 (43%), Gaps = 4/229 (1%)
 Frame = -1

Query: 1640 PQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXXXX 1461
            PQSMM AGFD                      PSFP +N +GL GVAPHVNPAFF     
Sbjct: 417  PQSMMRAGFDPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMA 476

Query: 1460 XXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGRTKEXXXXXXXXXXXXXXGEATHER 1281
                         G +AGMW+DTS+GG WG +   RT+E              GE  HE+
Sbjct: 477  PNGMGMMGPSGMDGPNAGMWSDTSMGG-WGEEPGRRTRESSYGGDDGASEYGYGEVNHEK 535

Query: 1280 G-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEAD---GYRDHRLX 1113
            G RS+A S EK+R SERDWSGN                          +    YRDHR  
Sbjct: 536  GARSSAASREKERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQR 595

Query: 1112 XXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                     + E+D+RSRSRD DYGKRRRLPSE
Sbjct: 596  ERDSGYEDDWDRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRRLPSE 644


>ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa]
            gi|550329195|gb|ERP56065.1| hypothetical protein
            POPTR_0010s06150g [Populus trichocarpa]
          Length = 591

 Score =  123 bits (309), Expect = 2e-25
 Identities = 85/228 (37%), Positives = 97/228 (42%), Gaps = 1/228 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MH Q MMGAGFD                      PSFP +N++GL GVAPHVNPAFF   
Sbjct: 365  MHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARG 424

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGRTKEXXXXXXXXXXXXXXGEATH 1287
                           G + G W DTS+GG WG +   RT+E              GE  H
Sbjct: 425  MAPNGMGMMASSGMEGPNPGKWPDTSMGG-WGEEPGRRTRESSYDGDEGASEYGYGEGNH 483

Query: 1286 ERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYRDHRLXX 1110
            E+G RS+  S EK+R SERDWSGN                        E D YR HR   
Sbjct: 484  EKGARSSGASREKERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRE 543

Query: 1109 XXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                      E+D+RSRSRDVDYGKRRR PSE
Sbjct: 544  RDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 591


>ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao]
            gi|508708844|gb|EOY00741.1| RNA-binding family protein
            isoform 6 [Theobroma cacao]
          Length = 602

 Score =  120 bits (302), Expect = 1e-24
 Identities = 69/144 (47%), Positives = 78/144 (54%), Gaps = 1/144 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMGAGFD                      PSFP +NT+GL GVAPHVNPAFF   
Sbjct: 423  MHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRG 482

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHG-RTKEXXXXXXXXXXXXXXGEAT 1290
                           G HAGMWTD S+ GGWGGDEHG RT+E              G+A 
Sbjct: 483  MAPNGMGMMGASGMDGPHAGMWTDASM-GGWGGDEHGRRTRESSYGGEDGASEYGYGDAN 541

Query: 1289 HERGRSNAPSWEKDRGSERDWSGN 1218
            HE+GRS+  S EK+R SER+WSGN
Sbjct: 542  HEKGRSSGASREKERVSEREWSGN 565


>ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao]
            gi|508708842|gb|EOY00739.1| RNA-binding family protein
            isoform 4 [Theobroma cacao]
          Length = 697

 Score =  120 bits (302), Expect = 1e-24
 Identities = 69/144 (47%), Positives = 78/144 (54%), Gaps = 1/144 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMGAGFD                      PSFP +NT+GL GVAPHVNPAFF   
Sbjct: 423  MHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRG 482

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHG-RTKEXXXXXXXXXXXXXXGEAT 1290
                           G HAGMWTD S+ GGWGGDEHG RT+E              G+A 
Sbjct: 483  MAPNGMGMMGASGMDGPHAGMWTDASM-GGWGGDEHGRRTRESSYGGEDGASEYGYGDAN 541

Query: 1289 HERGRSNAPSWEKDRGSERDWSGN 1218
            HE+GRS+  S EK+R SER+WSGN
Sbjct: 542  HEKGRSSGASREKERVSEREWSGN 565


>emb|CAN66828.1| hypothetical protein VITISV_015886 [Vitis vinifera]
          Length = 168

 Score =  115 bits (288), Expect = 6e-23
 Identities = 69/153 (45%), Positives = 77/153 (50%), Gaps = 2/153 (1%)
 Frame = -1

Query: 1418 HHAGMWTDTSVGGGWGGDEHGR-TKEXXXXXXXXXXXXXXGEATHER-GRSNAPSWEKDR 1245
            HHAGMWTDTS+GG WGG+EHGR T+E              GE  HE+ GRSN  S EK+R
Sbjct: 17   HHAGMWTDTSMGG-WGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASREKER 75

Query: 1244 GSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYRDHRLXXXXXXXXXXXXXXXXX 1065
            GSERDWSGN                        E DGYRDHR                  
Sbjct: 76   GSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDRGQSS 135

Query: 1064 XXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                     + ++DHRSRSRD DYGKRRRLPSE
Sbjct: 136  SRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 168


>ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222852472|gb|EEE90019.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 619

 Score =  114 bits (284), Expect = 2e-22
 Identities = 82/228 (35%), Positives = 95/228 (41%), Gaps = 1/228 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            M PQ MMGAGFD                      PSFP +N++GL GVAPHVNPAFF   
Sbjct: 410  MPPQGMMGAGFDPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARG 469

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGRTKEXXXXXXXXXXXXXXGEATH 1287
                           G + GMW ++S  G  G  E+G                  GE  H
Sbjct: 470  MAPNGMGMMVSSGMDGPNPGMW-ESSYDGDEGASEYG-----------------YGEGNH 511

Query: 1286 ERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYRDHRLXX 1110
            E+G RS+  S EK+RGSERDWSGN                        E D YR HR   
Sbjct: 512  EKGARSSGASREKERGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRE 571

Query: 1109 XXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                      E+D+RSR+RDVDYGKRRRLPSE
Sbjct: 572  RDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSRTRDVDYGKRRRLPSE 619


>gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus]
          Length = 639

 Score =  113 bits (282), Expect = 3e-22
 Identities = 81/228 (35%), Positives = 99/228 (43%), Gaps = 1/228 (0%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            MHPQ MMG GFD                      P F  +N++GLPGVAPHVNPAFF   
Sbjct: 419  MHPQGMMGPGFDLAFMGRGGGYGGFSGPPFQGMLPPFQGVNSMGLPGVAPHVNPAFFGRG 478

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGRTKEXXXXXXXXXXXXXXGEATH 1287
                           G H+GMW D ++ GGWGG+EHGR  E              GE +H
Sbjct: 479  MNPNGMGMMGNPGMVGPHSGMWNDPNM-GGWGGEEHGR--ESSYGGEDNASEYGYGEGSH 535

Query: 1286 ERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEADGYRDHRLXX 1110
            ++  RS+A   EK+R SER++                           E D YR+HR   
Sbjct: 536  DKSVRSSAAPREKERTSEREY---PERKHREERENDGERNDRDSKYREEKDRYREHR-HK 591

Query: 1109 XXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
                                    +QE+DHRSRSRD DYGKRRR+PSE
Sbjct: 592  ERESGYDDDWDRGQSSRSRSRSGAVQEEDHRSRSRDADYGKRRRMPSE 639


>ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca
            subsp. vesca]
          Length = 646

 Score =  112 bits (281), Expect = 4e-22
 Identities = 80/232 (34%), Positives = 93/232 (40%), Gaps = 5/232 (2%)
 Frame = -1

Query: 1646 MHPQSMMGAGFDXXXXXXXXXXXXXXXXXXXXXXPSFPTMNTVGLPGVAPHVNPAFFXXX 1467
            M+   MMG GFD                      P FP +N +GL GVAPHVNPAFF   
Sbjct: 415  MNAPGMMGPGFDPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRG 474

Query: 1466 XXXXXXXXXXXXXXXGHHAGMWTDTSVGGGWGGDEHGRTKEXXXXXXXXXXXXXXG-EAT 1290
                           GHHA MW D S+ G  G ++  RT+E                EA 
Sbjct: 475  MATNGMGMMGSSGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEAN 534

Query: 1289 HERG-RSNAPSWEKDRGSERDWSGNXXXXXXXXXXXXXXXXXXXXXXXXEA---DGYRDH 1122
            HE+  RS+A   E++R SER+W+G                              D YRDH
Sbjct: 535  HEKPVRSSAAPRERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDH 594

Query: 1121 RLXXXXXXXXXXXXXXXXXXXXXXXXSMMQEDDHRSRSRDVDYGKRRRLPSE 966
            R                           M EDDHRSRSRDVDYGKRRRLPSE
Sbjct: 595  RRRERDVAYEDDRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 646

