BLASTX nr result

ID: Achyranthes22_contig00004215 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes22_contig00004215
         (2229 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma c...   306   3e-80
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   302   5e-79
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   300   1e-78
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   300   2e-78
gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma c...   300   2e-78
gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Th...   300   2e-78
gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma c...   300   2e-78
gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma c...   300   2e-78
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   297   1e-77
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   296   3e-77
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   295   4e-77
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   290   2e-75
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   288   7e-75
gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus pe...   278   6e-72
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   276   2e-71
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...   253   2e-64
ref|XP_002312652.1| RNA recognition motif-containing family prot...   248   8e-63
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   241   7e-61
ref|XP_002315647.1| RNA recognition motif-containing family prot...   241   7e-61
ref|XP_006417146.1| hypothetical protein EUTSA_v10007191mg [Eutr...   237   1e-59

>gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|508708838|gb|EOY00735.1| RNA-binding family protein
            isoform 1 [Theobroma cacao]
          Length = 653

 Score =  306 bits (783), Expect = 3e-80
 Identities = 170/339 (50%), Positives = 208/339 (61%), Gaps = 26/339 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQID+GDEEYGGA+K+QYQ SGAIPALADEEM+G           VNVGEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXX------------ 509
              R+EA   P  +G+  +QA+   AP+PR                               
Sbjct: 61   LQRSEAPPQPGGMGSTGLQAQKNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQD 120

Query: 510  -QHDASLSELGSANHISG-ALGQDKRI----HDVSLGNVSFQGPAHVAQNTATNAQDV-- 665
             Q   S  E+GS ++ SG ++ Q  R+     D  + N+ FQG +  +     +   V  
Sbjct: 121  GQPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQ 180

Query: 666  ------GKATNPVIGGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
                   ++ N   GGP  +  +  NQM  N N    H M+++N +RP ++NGPTMLFVG
Sbjct: 181  KIANVPAQSLNSGTGGPQGAPHVPPNQMGLNVN----HPMISENQVRPPIENGPTMLFVG 236

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG+VKEIKFFDERASGKSKGYCQVEFYD  +AA CKEGM+G++
Sbjct: 237  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYM 296

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRR 1124
            FNGRACVVAFASPQTLKQMGA+Y +KN         GRR
Sbjct: 297  FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRR 335



 Score =  216 bits (549), Expect = 4e-53
 Identities = 113/221 (51%), Positives = 129/221 (58%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG  YGGF GP FPGM+  FPAVN +GLAGVAPHVNPAFF             
Sbjct: 434  FDPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMG 493

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW D  MGGW  +EHG++TRESSYGG+DGASEYGYG+ N EK  R++ A 
Sbjct: 494  GPGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEK-GRSSGAS 552

Query: 1761 REKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYED 1931
            REKER S+REWSGNS                                 HR +ERD  Y+D
Sbjct: 553  REKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDD 612

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            D DRGQ          AM E+  RSRSRDV+YGKRRR PSE
Sbjct: 613  DLDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 653


>ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis
            vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED:
            uncharacterized protein LOC100268141 isoform 2 [Vitis
            vinifera]
          Length = 647

 Score =  302 bits (773), Expect = 5e-79
 Identities = 173/338 (51%), Positives = 206/338 (60%), Gaps = 25/338 (7%)
 Frame = +3

Query: 198  AEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRN 377
            AEEQ+DY DEEYGGA+K+ +Q  GAI ALAD+E++G           VNVGEGFLQ HR+
Sbjct: 2    AEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHRS 61

Query: 378  EAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLSE-------- 533
            EA  P   +  G  QA   + P  +                     +    E        
Sbjct: 62   EAPAPSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAV 121

Query: 534  ----LGSANHISG-ALGQDKRI----HDVSLGNVSFQGPAHVAQNTATNAQDV-GKATN- 680
                +GS +H+ G ++ Q  R+    HD  + N+ FQG   + Q T     DV GK  N 
Sbjct: 122  KGPEMGSTSHLDGPSVSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANE 181

Query: 681  --PVI----GGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELHWW 842
              PV+    GGP A   +++NQM  N N+  +  MVN+N IRP +DNG TMLFVGELHWW
Sbjct: 182  STPVLNSGTGGPRAVPQMLSNQMGMNVNV--NRPMVNENQIRPAVDNGATMLFVGELHWW 239

Query: 843  TTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRA 1022
            TTDAELESVLSQYG+VKEIKFFDERASGKSKGYCQVEFYDA AAA CKEGMNG+IFNGRA
Sbjct: 240  TTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRA 299

Query: 1023 CVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            CVVAFASPQTLKQMGA+Y +K          GRR  ND
Sbjct: 300  CVVAFASPQTLKQMGASYMNKT--QAQSQSQGRRPMND 335



 Score =  238 bits (607), Expect = 8e-60
 Identities = 119/218 (54%), Positives = 133/218 (61%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG  YGGFSG AFPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 430  FDPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMG 489

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                  H +GMW D  MGGW  EEHG++TRESSYGGDDGAS+YGYGE N EK  R+N A 
Sbjct: 490  ATGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTAS 549

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYEDDWD 1940
            REKER SER+WSGNS                              HR +ERD   EDDWD
Sbjct: 550  REKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWD 609

Query: 1941 RGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            RGQ          A+ ++DHRSRSRD +YGKRRR PSE
Sbjct: 610  RGQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 647


>ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Citrus sinensis]
          Length = 658

 Score =  300 bits (769), Expect = 1e-78
 Identities = 172/343 (50%), Positives = 209/343 (60%), Gaps = 26/343 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQIDY +EEYGGA+K+QYQ  GAIPALADEE++G           VNVG+G LQ
Sbjct: 1    MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXX----------QH 515
            F + EA  P + VGNG +Q +  + P+ +                             Q+
Sbjct: 61   FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120

Query: 516  DASLS----ELGSANHISGALGQDK-----RIHDVSLGNVSFQGPAHVAQNTATNAQDV- 665
            D  ++     +GS N+  GA    K       HD  + N+ FQG       T  +  ++ 
Sbjct: 121  DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMP 180

Query: 666  GKATN---PVIG-GPSASQG--IVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
            G+  N   PV+  G +  QG  I  NQM  N N+  +  MVN+N IRP ++NG TMLFVG
Sbjct: 181  GRVANEPAPVLNPGAAGPQGALIPANQMGVNINV--NRAMVNENQIRPPLENGGTMLFVG 238

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG+VKEIKFFDERASGKSKGYCQVEF+DA AAA CK+GMNGH+
Sbjct: 239  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHV 298

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            FNGR CVVAFASPQTLKQMGA+Y +KN         GRR  ND
Sbjct: 299  FNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMND 341



 Score =  230 bits (587), Expect = 2e-57
 Identities = 115/221 (52%), Positives = 136/221 (61%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGFSGP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 438  FDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMG 497

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW D+ MGGW  EEHG++TRESSYGGDDGAS+YGYGE N EK +R+ AA 
Sbjct: 498  SSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAAS 557

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH---RLKERDSGYED 1931
            REK+R SER+WSGN+                              +   R ++RDS Y+D
Sbjct: 558  REKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDD 617

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            +WDRG           A+ ++DHRSRSRDV+YGKRRR PSE
Sbjct: 618  NWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658


>ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina]
            gi|557540375|gb|ESR51419.1| hypothetical protein
            CICLE_v10030915mg [Citrus clementina]
          Length = 658

 Score =  300 bits (768), Expect = 2e-78
 Identities = 172/343 (50%), Positives = 209/343 (60%), Gaps = 26/343 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQIDY +EEYGGA+K+QYQ  GAIPALADEE++G           VNVG+G LQ
Sbjct: 1    MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXX----------QH 515
            F + EA  P + VGNG +Q +  + P+ +                             Q+
Sbjct: 61   FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120

Query: 516  DASLS----ELGSANHISGALGQDK-----RIHDVSLGNVSFQGPAHVAQNTATNAQDV- 665
            D  ++     +GS N+  GA    K       HD  + N+ FQG       T  +  ++ 
Sbjct: 121  DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMP 180

Query: 666  GKATN---PVIG-GPSASQG--IVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
            G+  N   PV+  G +  QG  I  NQM  N N+  +  MVN+N IRP ++NG TMLFVG
Sbjct: 181  GRVANEPAPVLNPGAAGPQGALIPANQMGVNINV--NRAMVNENQIRPPLENGGTMLFVG 238

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG+VKEIKFFDERASGKSKGYCQVEF+DA AAA CK+GMNGH+
Sbjct: 239  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHV 298

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            FNGR CVVAFASPQTLKQMGA+Y +KN         GRR  ND
Sbjct: 299  FNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMND 341



 Score =  231 bits (588), Expect = 1e-57
 Identities = 115/221 (52%), Positives = 136/221 (61%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGFSGP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 438  FDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMG 497

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW D+ MGGW  EEHG++TRESSYGGDDGAS+YGYGE N EK +R+ AA 
Sbjct: 498  SSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAAS 557

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH---RLKERDSGYED 1931
            REK+R SER+WSGN+                              +   R ++RDS Y+D
Sbjct: 558  REKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDD 617

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            +WDRG           A+ ++DHRSRSRDV+YGKRRR PSE
Sbjct: 618  NWDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658


>gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao]
          Length = 602

 Score =  300 bits (767), Expect = 2e-78
 Identities = 165/339 (48%), Positives = 204/339 (60%), Gaps = 26/339 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQID+GDEEYGG +K+QYQ SGAIPALADEEM+G           VNVGEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLS----- 530
              R+EA + P  +G+  ++A+   AP+PR                     + S       
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120

Query: 531  --------ELGSANHISGALGQDKR-----IHDVSLGNVSFQGPAHVAQNTATN------ 653
                    E+ S ++ SG+    K       HD  + N+ FQG    +     +      
Sbjct: 121  EQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQ 180

Query: 654  --AQDVGKATNPVIGGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
              A D  ++ N   GGP     +  NQM  N N    H ++N+N ++P ++NGPTMLFVG
Sbjct: 181  KIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFVG 236

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG++KEIKFFDE+ASGKSKGYCQVEFYD  +AA CKEGMNG++
Sbjct: 237  ELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYM 296

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRR 1124
            FNGRACVVAFASPQTLKQMGA+Y +KN         GRR
Sbjct: 297  FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRR 335



 Score =  177 bits (449), Expect = 2e-41
 Identities = 85/135 (62%), Positives = 96/135 (71%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++M RG GYGGF GP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 433  FDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMG 492

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                    +GMW DA MGGW  +EHG++TRESSYGG+DGASEYGYG+ N EK  R++ A 
Sbjct: 493  ASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEK-GRSSGAS 551

Query: 1761 REKERASEREWSGNS 1805
            REKER SEREWSGNS
Sbjct: 552  REKERVSEREWSGNS 566


>gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao]
          Length = 656

 Score =  300 bits (767), Expect = 2e-78
 Identities = 165/339 (48%), Positives = 204/339 (60%), Gaps = 26/339 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQID+GDEEYGG +K+QYQ SGAIPALADEEM+G           VNVGEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLS----- 530
              R+EA + P  +G+  ++A+   AP+PR                     + S       
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120

Query: 531  --------ELGSANHISGALGQDKR-----IHDVSLGNVSFQGPAHVAQNTATN------ 653
                    E+ S ++ SG+    K       HD  + N+ FQG    +     +      
Sbjct: 121  EQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQ 180

Query: 654  --AQDVGKATNPVIGGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
              A D  ++ N   GGP     +  NQM  N N    H ++N+N ++P ++NGPTMLFVG
Sbjct: 181  KIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFVG 236

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG++KEIKFFDE+ASGKSKGYCQVEFYD  +AA CKEGMNG++
Sbjct: 237  ELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYM 296

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRR 1124
            FNGRACVVAFASPQTLKQMGA+Y +KN         GRR
Sbjct: 297  FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRR 335



 Score =  211 bits (538), Expect = 8e-52
 Identities = 110/216 (50%), Positives = 127/216 (58%), Gaps = 3/216 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++M RG GYGGF GP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 433  FDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMG 492

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                    +GMW DA MGGW  +EHG++TRESSYGG+DGASEYGYG+ N EK  R++ A 
Sbjct: 493  ASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEK-GRSSGAS 551

Query: 1761 REKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYED 1931
            REKER SEREWSGNS                                 HR +ERD  Y+D
Sbjct: 552  REKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDD 611

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRR 2039
            DWDRGQ          AM E++HRSRSRDV Y + +
Sbjct: 612  DWDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647


>gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao]
          Length = 697

 Score =  300 bits (767), Expect = 2e-78
 Identities = 165/339 (48%), Positives = 204/339 (60%), Gaps = 26/339 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQID+GDEEYGG +K+QYQ SGAIPALADEEM+G           VNVGEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLS----- 530
              R+EA + P  +G+  ++A+   AP+PR                     + S       
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120

Query: 531  --------ELGSANHISGALGQDKR-----IHDVSLGNVSFQGPAHVAQNTATN------ 653
                    E+ S ++ SG+    K       HD  + N+ FQG    +     +      
Sbjct: 121  EQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQ 180

Query: 654  --AQDVGKATNPVIGGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
              A D  ++ N   GGP     +  NQM  N N    H ++N+N ++P ++NGPTMLFVG
Sbjct: 181  KIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFVG 236

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG++KEIKFFDE+ASGKSKGYCQVEFYD  +AA CKEGMNG++
Sbjct: 237  ELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYM 296

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRR 1124
            FNGRACVVAFASPQTLKQMGA+Y +KN         GRR
Sbjct: 297  FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRR 335



 Score =  194 bits (492), Expect = 2e-46
 Identities = 117/266 (43%), Positives = 132/266 (49%), Gaps = 48/266 (18%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++M RG GYGGF GP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 433  FDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMG 492

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                    +GMW DA MGGW  +EHG++TRESSYGG+DGASEYGYG+ N EK  R++ A 
Sbjct: 493  ASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEK-GRSSGAS 551

Query: 1761 REKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERD-SGYE 1928
            REKER SEREWSGNS                                 HR +ER+ SG  
Sbjct: 552  REKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHREREWSGNS 611

Query: 1929 D---------DWD-----------------------------------RGQXXXXXXXXX 1976
            D         DWD                                   RGQ         
Sbjct: 612  DRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDDLDRGQSSSRSRRRS 671

Query: 1977 XAMQEDDHRSRSRDVEYGKRRRAPSE 2054
             AM E+  RSRSRDV+YGKRRR PSE
Sbjct: 672  HAMPEEQRRSRSRDVDYGKRRRLPSE 697


>gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|508708840|gb|EOY00737.1| RNA-binding family protein
            isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1|
            RNA-binding family protein isoform 1 [Theobroma cacao]
          Length = 652

 Score =  300 bits (767), Expect = 2e-78
 Identities = 165/339 (48%), Positives = 204/339 (60%), Gaps = 26/339 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MD  AEEQID+GDEEYGG +K+QYQ SGAIPALADEEM+G           VNVGEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLS----- 530
              R+EA + P  +G+  ++A+   AP+PR                     + S       
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYPEKE 120

Query: 531  --------ELGSANHISGALGQDKR-----IHDVSLGNVSFQGPAHVAQNTATN------ 653
                    E+ S ++ SG+    K       HD  + N+ FQG    +     +      
Sbjct: 121  EQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQ 180

Query: 654  --AQDVGKATNPVIGGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVG 827
              A D  ++ N   GGP     +  NQM  N N    H ++N+N ++P ++NGPTMLFVG
Sbjct: 181  KIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFVG 236

Query: 828  ELHWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHI 1007
            ELHWWTTDAELESVLSQYG++KEIKFFDE+ASGKSKGYCQVEFYD  +AA CKEGMNG++
Sbjct: 237  ELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYM 296

Query: 1008 FNGRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRR 1124
            FNGRACVVAFASPQTLKQMGA+Y +KN         GRR
Sbjct: 297  FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRR 335



 Score =  227 bits (579), Expect = 1e-56
 Identities = 117/221 (52%), Positives = 134/221 (60%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++M RG GYGGF GP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 433  FDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMG 492

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                    +GMW DA MGGW  +EHG++TRESSYGG+DGASEYGYG+ N EK  R++ A 
Sbjct: 493  ASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEK-GRSSGAS 551

Query: 1761 REKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYED 1931
            REKER SEREWSGNS                                 HR +ERD  Y+D
Sbjct: 552  REKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDD 611

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            DWDRGQ          AM E++HRSRSRDV+YGK+RR PSE
Sbjct: 612  DWDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLPSE 652


>ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina]
            gi|567891321|ref|XP_006438181.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
            gi|557540376|gb|ESR51420.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
            gi|557540377|gb|ESR51421.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
          Length = 655

 Score =  297 bits (761), Expect = 1e-77
 Identities = 169/339 (49%), Positives = 208/339 (61%), Gaps = 26/339 (7%)
 Frame = +3

Query: 198  AEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRN 377
            AEEQIDY ++EYGGA+K+QYQ  GAIPALADEE++G           +NVG+G LQF + 
Sbjct: 2    AEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQP 61

Query: 378  EAQVPPSNVGNGVVQARTFNAPQPRQ----------XXXXXXXXXXXXXXXXXXQHDASL 527
            EA  P + VGNG +Q +  + P+ R                             Q+D  +
Sbjct: 62   EAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQV 121

Query: 528  S----ELGSANHISGALGQDK-----RIHDVSLGNVSFQGPAHVAQNTATNAQDV-GKAT 677
            +     +GS N+  GA    K       HD  + N+ FQG       T  +  ++ G+A 
Sbjct: 122  AVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRAA 181

Query: 678  N---PVIG-GPSASQG--IVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELHW 839
            N   PV+  G +  QG  I  NQM  NAN+  + +MVN+N IRP ++NG TMLFVGELHW
Sbjct: 182  NEPAPVLNPGAAGPQGALIPANQMGVNANV--NRVMVNENQIRPPLENGGTMLFVGELHW 239

Query: 840  WTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGR 1019
            WTTDAELESVLSQYG+ KEIKFFDERASGKSKGYCQVEF+DA AAA CK+GMNGH+FNGR
Sbjct: 240  WTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGR 299

Query: 1020 ACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
             CVVAFASPQTLKQMGA+Y +KN         G R  ND
Sbjct: 300  PCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMND 338



 Score =  229 bits (585), Expect = 3e-57
 Identities = 114/221 (51%), Positives = 136/221 (61%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGFSGP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 435  FDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMG 494

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW D+ MGGW  EEHG++TRESSYGGDDGAS+YGYGE + EK +R+  A 
Sbjct: 495  SSGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTAS 554

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH---RLKERDSGYED 1931
            REK+R SER+WSGN+                              +   R ++RDS Y+D
Sbjct: 555  REKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDD 614

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            +WDRGQ          A+ ++DHRSRSRDV+YGKRRR PSE
Sbjct: 615  NWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655


>ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Citrus sinensis]
          Length = 655

 Score =  296 bits (758), Expect = 3e-77
 Identities = 169/339 (49%), Positives = 207/339 (61%), Gaps = 26/339 (7%)
 Frame = +3

Query: 198  AEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRN 377
            AEEQIDY ++EYGGA+K+QYQ  GAIPALADEE++G           VNVG+G LQF + 
Sbjct: 2    AEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQP 61

Query: 378  EAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXX----------QHDASL 527
            EA  P + VGNG +Q +  + P+ R                             Q+D  +
Sbjct: 62   EAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQV 121

Query: 528  S----ELGSANHISGALGQDK-----RIHDVSLGNVSFQGPAHVAQNTATNAQDV-GKAT 677
            +     +GS N+  GA    K       HD  + N+ FQG       T  +  ++ G+  
Sbjct: 122  AVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVA 181

Query: 678  N---PVIG-GPSASQG--IVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELHW 839
            N   PV+  G +  QG  I  NQM  NAN+  + +MVN+N IRP ++NG TMLFVGELHW
Sbjct: 182  NEPAPVLNPGAAGPQGALIPANQMGVNANV--NRVMVNENQIRPPLENGGTMLFVGELHW 239

Query: 840  WTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGR 1019
            WTTDAELESVLSQYG+ KEIKFFDERASGKSKGYCQVEF+DA AAA CK+GMNGH+FNGR
Sbjct: 240  WTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGR 299

Query: 1020 ACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
             CVVAFASPQTLKQMGA+Y +KN         G R  ND
Sbjct: 300  PCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMND 338



 Score =  233 bits (593), Expect = 3e-58
 Identities = 116/221 (52%), Positives = 137/221 (61%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGFSGP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 435  FDPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMG 494

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW D+ MGGW  EEHG++TRESSYGGDDGAS+YGYGE N EK +R+ AA 
Sbjct: 495  SSGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAAS 554

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXH---RLKERDSGYED 1931
            REK+R SER+WSGN+                              +   R ++RDS Y+D
Sbjct: 555  REKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDD 614

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            +WDRGQ          A+ ++DHRSRSRDV+YGKRRR PSE
Sbjct: 615  NWDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655


>ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca
            subsp. vesca]
          Length = 646

 Score =  295 bits (756), Expect = 4e-77
 Identities = 164/330 (49%), Positives = 200/330 (60%), Gaps = 13/330 (3%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MDP  EEQIDY +EEYGGA+K+QYQ SGAIPALADEE +            VNVGEGFLQ
Sbjct: 1    MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPR-QXXXXXXXXXXXXXXXXXXQHDASLSELGS 542
             HR E  +PP+ VGNG +QA+  N P+ R Q                         +   
Sbjct: 61   MHRPEPPLPPAGVGNGGLQAQKNNVPEQRVQGGASQEVKNPGFSVEGKYSSVPEQKDQPP 120

Query: 543  ANHISGALGQDKRI----HDVSLGNVSFQGPAHVAQNTATNAQDV-GKATNPVI------ 689
             + +     Q  R+    HD  + N+ FQG A +  N   ++ D+ GK  N  I      
Sbjct: 121  VSVVPEMASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSMNSG 180

Query: 690  -GGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELHWWTTDAELES 866
              GP A Q +  NQM  N  +  +  MVN+N IRP ++NG   LFVGELHWWTTDAELE 
Sbjct: 181  SNGPPAVQQMPANQM--NMKINVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELEG 238

Query: 867  VLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASP 1046
            VLSQ+G++KEIKFFDERASGKSKGYCQV+FYD  AA+ CKEGM+G++FNGRACVVAFAS 
Sbjct: 239  VLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAFASS 298

Query: 1047 QTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            QTLKQMG +Y +K+         GRR  ND
Sbjct: 299  QTLKQMGDSYVNKSQGQVQTQPQGRRPMND 328



 Score =  221 bits (562), Expect = 1e-54
 Identities = 116/222 (52%), Positives = 128/222 (57%), Gaps = 4/222 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGF GP FPGM+  FP VN MGLAGVAPHVNPAFF             
Sbjct: 425  FDPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMG 484

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYG-YGEGNPEKSSRTNAA 1757
                  H + MW D  M GW  EE  ++TRESSYGGDDG SEYG YGE N EK  R++AA
Sbjct: 485  SSGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAA 544

Query: 1758 PREKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYE 1928
            PRE+ER SEREW+G S                                 HR +ERD  YE
Sbjct: 545  PRERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDHRRRERDVAYE 604

Query: 1929 DDWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            DD DRG           AM EDDHRSRSRDV+YGKRRR PSE
Sbjct: 605  DDRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 646


>gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus
            notabilis]
          Length = 636

 Score =  290 bits (741), Expect = 2e-75
 Identities = 175/340 (51%), Positives = 206/340 (60%), Gaps = 27/340 (7%)
 Frame = +3

Query: 198  AEEQIDYGDEEYGGARKVQYQSSG-AIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHR 374
            AE+ ID+ DEEYGGA+K QYQ SG AI ALADEE++G           VNVGEGFLQ  R
Sbjct: 2    AEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQR 61

Query: 375  NEAQVPPS--NVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASL------- 527
            +EA   P+   VGNG+ QA+  N P+PR+                     A         
Sbjct: 62   SEAPSLPAAAGVGNGL-QAQKRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQD 120

Query: 528  -------SELGSANHISGALGQDKRIHDVSLGNV--SFQGPAHVAQNTATNAQDV-GKAT 677
                   SE GS  +  GA G  K       G +   FQG   +  +   ++ D+ GK  
Sbjct: 121  GLKVDKKSEAGSMVYPDGASGSQK-------GRIVAGFQGSKPMLHSVGVDSSDIPGKMV 173

Query: 678  NPVIGGPSAS----QGIV---NNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELH 836
            N  I  P++     +GI+    NQ T NAN+   H +VN+N IRP ++NG TMLFVGELH
Sbjct: 174  NEPIQAPNSGGAGPRGILPMQGNQTTVNANVS--HPIVNENQIRPSIENGSTMLFVGELH 231

Query: 837  WWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNG 1016
            WWTTDAELESVLSQYG+VKEIKFFDERASGKSKGYCQVE+YDA AA  CKEGM+GH+FNG
Sbjct: 232  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNG 291

Query: 1017 RACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            RACVVAFASPQTLKQMGAAY SKN         GRR  ND
Sbjct: 292  RACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPIND 331



 Score =  220 bits (560), Expect = 2e-54
 Identities = 114/221 (51%), Positives = 127/221 (57%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGF+GPAFPGM+  FPAVN MG A VAPHVNPAFF             
Sbjct: 424  FDPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVG 483

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                  H  GMW D  +GGW  EEHG++TRESSYGGDDGASEYGYG+ N EK  R     
Sbjct: 484  SSLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR----- 538

Query: 1761 REKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYED 1931
               ER SER+WSGNS                                 +R KER+  YED
Sbjct: 539  ---ERGSERDWSGNSERRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYRPKERELDYED 595

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            DWDRGQ           +QED HRSRSRDV+YGKRRR PSE
Sbjct: 596  DWDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRRRLPSE 636


>ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis]
            gi|223546091|gb|EEF47594.1| RNA binding protein, putative
            [Ricinus communis]
          Length = 644

 Score =  288 bits (737), Expect = 7e-75
 Identities = 166/328 (50%), Positives = 197/328 (60%), Gaps = 15/328 (4%)
 Frame = +3

Query: 198  AEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRN 377
            A+EQIDY DEEYGGA+K+QYQ SGAIPALA+EEM G           VN+GE FLQ HR+
Sbjct: 2    ADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNIGENFLQMHRS 60

Query: 378  EAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXX------QHDASLSELG 539
            EA   P +VGNG  Q R  N  +                           + +    E+G
Sbjct: 61   EAPPAPPSVGNGGFQPRNSNDLRVESGGSQGLNIPGVAVESKYSTGTHFPEQNVKGPEIG 120

Query: 540  SANHISGA-LGQDKRI----HDVSLGNVSFQGPAHVAQNTATNAQDVG-KATNPVIGGPS 701
            S  +  G+ + Q  R+    +D    N+ FQG      N   +  D+  K +N     P+
Sbjct: 121  SVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPTPVPN 180

Query: 702  ASQGIVNNQMTA---NANMGADHMMVNDNIIRPQMDNGPTMLFVGELHWWTTDAELESVL 872
            A    V  Q+ A   N NM  +    N+N IRP ++NG TML+VGELHWWTTDAELE+VL
Sbjct: 181  AGVPRVIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDAELENVL 240

Query: 873  SQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASPQT 1052
            SQYG VKEIKFFDERASGKSKGYCQVEFYDA AAA CKEGMNGH+FNGRACVVAFAS QT
Sbjct: 241  SQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVAFASQQT 300

Query: 1053 LKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            LKQMGA+Y +KN         GRR  ND
Sbjct: 301  LKQMGASYMNKNQGQPQSQNQGRRPMND 328



 Score =  238 bits (608), Expect = 6e-60
 Identities = 123/221 (55%), Positives = 139/221 (62%), Gaps = 3/221 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRGAGYGGF+GP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 425  FDPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMG 484

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                    +GMW D  MGGW  EE G++TRESSYGGDDGASEYGYGE N EK +R++AA 
Sbjct: 485  PSGMDGPNAGMWSDTSMGGWG-EEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAAS 543

Query: 1761 REKERASEREWSGNS---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYED 1931
            REKERASER+WSGNS                                 HR +ERDSGYED
Sbjct: 544  REKERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYED 603

Query: 1932 DWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            DWDRGQ          A+ E+D+RSRSRD +YGKRRR PSE
Sbjct: 604  DWDRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRRLPSE 644


>gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica]
          Length = 630

 Score =  278 bits (712), Expect = 6e-72
 Identities = 159/328 (48%), Positives = 196/328 (59%), Gaps = 15/328 (4%)
 Frame = +3

Query: 198  AEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRN 377
            AEEQIDY DEEYGGA+K+QYQ SGAI ALADEE +            VNV EGFLQ HR+
Sbjct: 2    AEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHRS 61

Query: 378  EAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLSELGSANHIS 557
            EA +PP  VGNG +QA+  +  + R                     ++ +  +      S
Sbjct: 62   EAPLPPGGVGNGGLQAQKTDVTETR--------------VQAGVSQESKIPGVSVQGKYS 107

Query: 558  GAL-------GQDKRIHDVSLGNVSFQGPAHVAQNTATNAQDVGKAT--------NPVIG 692
             A+       GQ     +  LG+  + G   +  N   ++ D+   T        N    
Sbjct: 108  SAVAQFPEQQGQPPVAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTA 166

Query: 693  GPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELHWWTTDAELESVL 872
            GP+    +  NQ++   N  A+  M N+N IRP ++NG TMLFVGELHWWTTDAELESVL
Sbjct: 167  GPTGVTQMPTNQISIKVN--ANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVL 224

Query: 873  SQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASPQT 1052
            SQYG+VKEIKFFDERASGKSKGYCQVEF+D  AA  CKEGM+G++FNGRACVVAFASPQT
Sbjct: 225  SQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAFASPQT 284

Query: 1053 LKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            LKQMGA+Y SK+         GRR  N+
Sbjct: 285  LKQMGASYLSKSQGQTQSQQPGRRPMNE 312



 Score =  251 bits (642), Expect = 7e-64
 Identities = 126/222 (56%), Positives = 138/222 (62%), Gaps = 4/222 (1%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP++MGRG GYGGF GPAFPGM+S FPAVN MGLAGVAPHVNPAFF             
Sbjct: 409  FDPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMG 468

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                  H +GMW D  MGGW  +EHG++TRESSYGGDDGASEYGYGE N EK  R+NA  
Sbjct: 469  SSGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPS 528

Query: 1761 REKERASEREWSGNS----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYE 1928
            RE+ER SER+WSGNS                                  HR +ERD GYE
Sbjct: 529  RERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYE 588

Query: 1929 DDWDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            DDWDRGQ          AM EDDHRSRSRDV+YGKRRR PSE
Sbjct: 589  DDWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 630


>ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Solanum tuberosum]
            gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and
            polyadenylation specificity factor subunit CG7185-like
            isoform X2 [Solanum tuberosum]
          Length = 648

 Score =  276 bits (707), Expect = 2e-71
 Identities = 152/341 (44%), Positives = 196/341 (57%), Gaps = 24/341 (7%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            MDP A+EQ+DYGDEEYGG+ K+QY  SG IPALA++EM+G           VN+GEGFLQ
Sbjct: 1    MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60

Query: 366  FHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXX----------QH 515
              R+E  VP  + GNG  QA+  + P  R                             Q 
Sbjct: 61   LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQK 120

Query: 516  DASLSELGSANHISGALGQDKRIHDVSL------GNVSFQGPAHVAQNTAT--------N 653
               + E  +      A  Q  R   +++      GN  +QG   + Q            N
Sbjct: 121  GEPVVERETERPADAA--QKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKN 178

Query: 654  AQDVGKATNPVIGGPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGEL 833
            A +     N V+ GP     +  NQ+ ++ N+  ++ ++++   RP ++NG TMLFVGEL
Sbjct: 179  ASEATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGEL 238

Query: 834  HWWTTDAELESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFN 1013
            HWWTTDAELESVL+QYG VKEIKFFDERASGKSKGYCQVEF+D  +AA CKEGMNG+ FN
Sbjct: 239  HWWTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFN 298

Query: 1014 GRACVVAFASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            GRACVVAFA+PQT+KQMG++YA+K          GRR  N+
Sbjct: 299  GRACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNE 339



 Score =  238 bits (607), Expect = 8e-60
 Identities = 121/218 (55%), Positives = 135/218 (61%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDPSFMGRGAGYGGFSGPAFPGMM PF AVNPMGL GVAPHVNPAFF             
Sbjct: 431  FDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRGMAANGMGMMS 490

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW D   GGW  EEHG++TRESSYGG+D ASEYGYGE + +K +R++A  
Sbjct: 491  AAGMDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVSHDKGARSSAVS 550

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYEDDWD 1940
            REKER SER+WSGNS                              +R KER+S YE+D+D
Sbjct: 551  REKERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQKERESEYEEDYD 610

Query: 1941 RGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            RGQ          A QE+DHRSRSRD  YGKRRRAPSE
Sbjct: 611  RGQSSSRSRSKSRAAQEEDHRSRSRDTNYGKRRRAPSE 648


>gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea]
          Length = 508

 Score =  253 bits (646), Expect = 2e-64
 Identities = 145/322 (45%), Positives = 180/322 (55%), Gaps = 10/322 (3%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXG-VNVGEGFL 362
            M+P   EQ D+G+EEYGG +K+QY   GAIPALADEEMIG            VNVGE F+
Sbjct: 1    MEPMNGEQFDFGEEEYGGGQKMQYNQGGAIPALADEEMIGEEDDEYDDLYNDVNVGESFM 60

Query: 363  QFHRNEAQVPPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLSELGS 542
            Q  R ++Q+PP    N V  + T +   P +                  Q     + L +
Sbjct: 61   QVQRPDSQIPPFKAENRVNPSGTGDESIPSEEANASKYAGNRAFGPGALQFPEQKAGLNT 120

Query: 543  ANHISGALGQDKRIHDVSLGNVSFQGPAHVAQNTATNAQDVGKATNPVIGGPSASQGIVN 722
                S  + + + + +       +QG   VA N  T  +D  K  +  +G PS+    V 
Sbjct: 121  TEETSVTVDRSQTVRNSQTDQSGYQGS--VAPNNKT--EDQVKNMDKTVGDPSSINPNVG 176

Query: 723  NQMTANANMGADHMMVNDNIIRPQMD---------NGPTMLFVGELHWWTTDAELESVLS 875
                        +M  N N IRP  D         NG TML+VGELHWWTTDAE+ESVL 
Sbjct: 177  VGSKGAVPFNFMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWWTTDAEIESVLI 236

Query: 876  QYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASPQTL 1055
            QYGKVKEIKFFDERASGKSKGYCQVEF+D  AA  CKEGMNG++FNGRACVVAFA+PQT+
Sbjct: 237  QYGKVKEIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRACVVAFATPQTI 296

Query: 1056 KQMGAAYASKNXXXXXXXXXGR 1121
            KQMGA+Y ++N         GR
Sbjct: 297  KQMGASYMNRNQGQPQAQFPGR 318



 Score =  104 bits (260), Expect = 1e-19
 Identities = 56/98 (57%), Positives = 63/98 (64%), Gaps = 2/98 (2%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGG-FSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXX 1577
            FD +FMGRGAGYGG F+GPAFPGM+ PFPAVN +GL GVAPHVNPAFF            
Sbjct: 412  FDLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFGRGMAPNGMGMM 471

Query: 1578 XXXXXXXHPSGMWGDAGM-GGWPVEEHGQKTRESSYGG 1688
                     SG+W DA + GGW  EE G +  ESSYGG
Sbjct: 472  GPSGMGGPYSGLWNDASVGGGWGGEEQG-RGPESSYGG 508


>ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222852472|gb|EEE90019.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 619

 Score =  248 bits (633), Expect = 8e-63
 Identities = 148/333 (44%), Positives = 187/333 (56%), Gaps = 24/333 (7%)
 Frame = +3

Query: 210  IDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRNEAQV 389
            +DY +EE     K+QYQ SGAIPALA+EEM G           VNVGE FLQ H +EA  
Sbjct: 1    MDYEEEE-----KMQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54

Query: 390  PPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLS-----------EL 536
            PP+ VGNG  Q R  +  +                       +A              E 
Sbjct: 55   PPATVGNGGFQTRNAHESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVEA 114

Query: 537  GSANHISGA-LGQDKRI----HDVSLGNVSFQGPAHVAQNTATNAQDVGKATN------P 683
                 + G+ + Q  R+    HDV + N+ FQ    V      +  D+ +         P
Sbjct: 115  QDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEPLP 174

Query: 684  VIG--GPSASQGIVNNQMTANANMGADHMMVNDNIIRPQMDNGPTMLFVGELHWWTTDAE 857
            + G  GP  +  +  NQM  +A++  +  +VN+N +RP ++NG T L+VGELHWWTTDAE
Sbjct: 175  ITGSAGPRGAPQMQVNQMHMSADV--NRPVVNENQVRPPIENGSTTLYVGELHWWTTDAE 232

Query: 858  LESVLSQYGKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAF 1037
            LES  SQ+G+VKEIKFFDERASGKSKGYCQV+FY+A AAA CKEGMNGH+FNGR CVVAF
Sbjct: 233  LESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAF 292

Query: 1038 ASPQTLKQMGAAYASKNXXXXXXXXXGRRNTND 1136
            ASPQTLKQMGA+Y +K          GR + ND
Sbjct: 293  ASPQTLKQMGASYMNKTQGQPQTQSQGRGSMND 325



 Score =  199 bits (507), Expect = 3e-48
 Identities = 107/218 (49%), Positives = 120/218 (55%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP +MGRG GYGGF+GP FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 420  FDPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMV 479

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     GMW                  ESSY GD+GASEYGYGEGN EK +R++ A 
Sbjct: 480  SSGMDGPNPGMW------------------ESSYDGDEGASEYGYGEGNHEKGARSSGAS 521

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYEDDWD 1940
            REKER SER+WSGNS                              HR +ERDSGYEDD D
Sbjct: 522  REKERGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRD 581

Query: 1941 RGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            RG           A  E+D+RSR+RDV+YGKRRR PSE
Sbjct: 582  RGHSSSRARSRSRAAPEEDYRSRTRDVDYGKRRRLPSE 619


>ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa]
            gi|550329195|gb|ERP56065.1| hypothetical protein
            POPTR_0010s06150g [Populus trichocarpa]
          Length = 591

 Score =  241 bits (616), Expect = 7e-61
 Identities = 145/309 (46%), Positives = 178/309 (57%)
 Frame = +3

Query: 210  IDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRNEAQV 389
            +D+ +EE     K+QYQ SGAIPALA+EE+ G           VNVGE FLQ H +EA  
Sbjct: 1    MDFEEEE-----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54

Query: 390  PPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLSELGSANHISGALG 569
            PP+  GNG  Q R  NA + R                     +   S  G+        G
Sbjct: 55   PPATAGNGGFQTR--NAHESRVETGGSQVLATSGAGVAV---EGKYSNAGAHFPEQKQAG 109

Query: 570  QDKRIHDVSLGNVSFQGPAHVAQNTATNAQDVGKATNPVIGGPSASQGIVNNQMTANANM 749
                 +DV  G++ +   + VAQ  +               GP     +  NQM  NA++
Sbjct: 110  IGVEANDV--GSIGYGDGSSVAQKGSA--------------GPRGVPQMQVNQMNMNADV 153

Query: 750  GADHMMVNDNIIRPQMDNGPTMLFVGELHWWTTDAELESVLSQYGKVKEIKFFDERASGK 929
              +  +VN+N +RP ++NGPT L+VGELHWWTTDAELESV SQYG+VKEIKFFDERASGK
Sbjct: 154  --NRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFFDERASGK 211

Query: 930  SKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASPQTLKQMGAAYASKNXXXXXXX 1109
            SKGYCQV+FY+A AAA CKEGMN H+FNGR CVVAFAS QTLKQMGA+Y SK        
Sbjct: 212  SKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKTQGQPQPQ 271

Query: 1110 XXGRRNTND 1136
              GR + ND
Sbjct: 272  SQGRGSMND 280



 Score =  223 bits (567), Expect = 4e-55
 Identities = 116/218 (53%), Positives = 129/218 (59%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP +MGRG GYGGF G  FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 375  FDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMA 434

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                     G W D  MGGW  EE G++TRESSY GD+GASEYGYGEGN EK +R++ A 
Sbjct: 435  SSGMEGPNPGKWPDTSMGGWG-EEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGAS 493

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYEDDWD 1940
            REKER SER+WSGNS                              HR +ERDSGYEDD D
Sbjct: 494  REKERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRD 553

Query: 1941 RGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            RG           A  E+D+RSRSRDV+YGKRRR PSE
Sbjct: 554  RGHSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 591


>ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222864687|gb|EEF01818.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 573

 Score =  241 bits (616), Expect = 7e-61
 Identities = 145/309 (46%), Positives = 178/309 (57%)
 Frame = +3

Query: 210  IDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQFHRNEAQV 389
            +D+ +EE     K+QYQ SGAIPALA+EE+ G           VNVGE FLQ H +EA  
Sbjct: 1    MDFEEEE-----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54

Query: 390  PPSNVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLSELGSANHISGALG 569
            PP+  GNG  Q R  NA + R                     +   S  G+        G
Sbjct: 55   PPATAGNGGFQTR--NAHESRVETGGSQVLATSGAGVAV---EGKYSNAGAHFPEQKQAG 109

Query: 570  QDKRIHDVSLGNVSFQGPAHVAQNTATNAQDVGKATNPVIGGPSASQGIVNNQMTANANM 749
                 +DV  G++ +   + VAQ  +               GP     +  NQM  NA++
Sbjct: 110  IGVEANDV--GSIGYGDGSSVAQKGSA--------------GPRGVPQMQVNQMNMNADV 153

Query: 750  GADHMMVNDNIIRPQMDNGPTMLFVGELHWWTTDAELESVLSQYGKVKEIKFFDERASGK 929
              +  +VN+N +RP ++NGPT L+VGELHWWTTDAELESV SQYG+VKEIKFFDERASGK
Sbjct: 154  --NRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFFDERASGK 211

Query: 930  SKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASPQTLKQMGAAYASKNXXXXXXX 1109
            SKGYCQV+FY+A AAA CKEGMN H+FNGR CVVAFAS QTLKQMGA+Y SK        
Sbjct: 212  SKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKTQGQPQPQ 271

Query: 1110 XXGRRNTND 1136
              GR + ND
Sbjct: 272  SQGRGSMND 280



 Score =  198 bits (504), Expect = 7e-48
 Identities = 109/218 (50%), Positives = 123/218 (56%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP +MGRG GYGGF G  FPGM+  FPAVN MGLAGVAPHVNPAFF             
Sbjct: 375  FDPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFF------------- 421

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEYGYGEGNPEKSSRTNAAP 1760
                  +  GM   +GM G          +ESSY GD+GASEYGYGEGN EK +R++ A 
Sbjct: 422  ARGMAPNGMGMMASSGMEG------PNPGKESSYDGDEGASEYGYGEGNHEKGARSSGAS 475

Query: 1761 REKERASEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYEDDWD 1940
            REKER SER+WSGNS                              HR +ERDSGYEDD D
Sbjct: 476  REKERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRD 535

Query: 1941 RGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRRAPSE 2054
            RG           A  E+D+RSRSRDV+YGKRRR PSE
Sbjct: 536  RGHSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 573


>ref|XP_006417146.1| hypothetical protein EUTSA_v10007191mg [Eutrema salsugineum]
            gi|557094917|gb|ESQ35499.1| hypothetical protein
            EUTSA_v10007191mg [Eutrema salsugineum]
          Length = 578

 Score =  237 bits (605), Expect = 1e-59
 Identities = 143/309 (46%), Positives = 176/309 (56%), Gaps = 8/309 (2%)
 Frame = +3

Query: 186  MDPAAEEQIDYGDEEYGGARKVQYQSSGAIPALADEEMIGXXXXXXXXXXGVNVGEGFLQ 365
            M+P +EE + YG     G +K+ +Q SG IPALADEE++G           VNVGE F Q
Sbjct: 1    MNPMSEENVSYG-----GNQKLLHQGSGTIPALADEELMGEDDDYDDLYSDVNVGESFFQ 55

Query: 366  FHRNEAQVPPS--NVGNGVVQARTFNAPQPRQXXXXXXXXXXXXXXXXXXQHDASLSELG 539
             H ++ Q P      G+G +QA+  N  +PR                      +      
Sbjct: 56   AH-HQPQTPAQVGGTGSGNIQAQNSNVAEPRMANVSGVTVEGKYRNDGGHNGISGPETRS 114

Query: 540  SANHISGALGQDKRIHDVSLGNVSFQGPAHVAQNT---ATNAQDVGKAT--NPVIGGPSA 704
                 +   G      DV    V  QG   +  NT   + NA +V +    NP    P  
Sbjct: 115  DVYPQASPFGAKGSNIDVQSNKVIPQGSTSIVLNTHGFSGNAVNVPEPPVHNPYGAVPQG 174

Query: 705  SQGIVNNQMTANANMGADHMMVNDNIIRP-QMDNGPTMLFVGELHWWTTDAELESVLSQY 881
            +Q I  +QM AN N      MVN +  +P  +DNG TMLFVGELHWWTTDAE+ESVLSQY
Sbjct: 175  AQQIPVSQMNANPNA-----MVNRSPTQPFVVDNGNTMLFVGELHWWTTDAEIESVLSQY 229

Query: 882  GKVKEIKFFDERASGKSKGYCQVEFYDAGAAATCKEGMNGHIFNGRACVVAFASPQTLKQ 1061
            G+VKEIKFFDER SGKSKGYCQVEFYD+ AAA CKEGMNG +FNG+ACVVAFASP+TLKQ
Sbjct: 230  GRVKEIKFFDERVSGKSKGYCQVEFYDSAAAAACKEGMNGFVFNGKACVVAFASPETLKQ 289

Query: 1062 MGAAYASKN 1088
            MGA +  +N
Sbjct: 290  MGANFTGRN 298



 Score =  134 bits (336), Expect = 2e-28
 Identities = 87/216 (40%), Positives = 108/216 (50%), Gaps = 2/216 (0%)
 Frame = +3

Query: 1401 FDPSFMGRGAGYGGFSGPAFPGMMSPFPAVNPMGLAGVAPHVNPAFFXXXXXXXXXXXXX 1580
            FDP+FMGRG GYGGFSG A+PGM   +P VN MG+ G+APHVNPAFF             
Sbjct: 399  FDPTFMGRGGGYGGFSGLAYPGMPHSYPGVNAMGMVGIAPHVNPAFF----GTGMGTMGS 454

Query: 1581 XXXXXXHPSGMWGDAGMGGWPVEEHGQKTRESSYGGDDGASEY-GYGEGNPEKSSRTNAA 1757
                  H + MW +A  GG               GG++G SEY GY + N EK  + +  
Sbjct: 455  SGMNGAHAAAMWNEANGGG---------------GGEEGGSEYGGYEDENQEKEDKPS-- 497

Query: 1758 PREKERA-SEREWSGNSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXHRLKERDSGYEDD 1934
             R+KERA +EREWS +S                               + ++RDS   D+
Sbjct: 498  -RDKERATTEREWSESS------------GDRRHKSHREEKDSHREYKQQRDRDS---DE 541

Query: 1935 WDRGQXXXXXXXXXXAMQEDDHRSRSRDVEYGKRRR 2042
            +DRGQ           M EDDHRSRSRD +YGKRRR
Sbjct: 542  YDRGQ-SSMKSRSRSRMAEDDHRSRSRDADYGKRRR 576


Top