BLASTX nr result
ID: Akebia24_contig00016714
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00016714
         (756 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   317   4e-84
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   305   1e-80
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   286   4e-75
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   284   2e-74
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   284   3e-74
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   283   3e-74
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   282   8e-74
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   282   1e-73
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   281   2e-73
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   280   3e-73
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   280   5e-73
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A...   276   6e-72
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   266   6e-69
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   262   8e-68
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   261   1e-67
ref|XP_002312652.1| RNA recognition motif-containing family prot...   258   2e-66
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   253   7e-65
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   245   1e-62
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...
225 1e-56 emb|CAN66828.1| hypothetical protein VITISV_015886 [Vitis vinifera] 202 1e-49 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 317 bits (811), Expect = 4e-84 Identities = 154/256 (60%), Positives = 177/256 (69%), Gaps = 5/256 (1%) Frame = +3 Query: 3 IGAKNMIGS----GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGGYG 170 +GAKNM+G+ G +GG YGQG HPQ MM +GFDPTYMGRGG YG Sbjct: 384 VGAKNMVGNTAGVGASGGG-YGQGLAGPTFGGPAGGLMHPQGMMGSGFDPTYMGRGGAYG 442 Query: 171 AFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXXWT 350 F FPGM+PS+ AVNTMGL GVAPHVNPAFFGRG++A WT Sbjct: 443 GFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGATGMDGHHAGMWT 502 Query: 351 DTSMGEWGGDEHARM-KEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWSG 527 DTSMG WGG+EH R +E HE+ GRSN +SREK+RGSERDWSG Sbjct: 503 DTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASREKERGSERDWSG 562 Query: 528 NSERRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKSN 707 NSERRHRDEREQDWERSD+DHRY+EEKDGYRDHRQRER+++N DDWDRGQSSSRSR +S Sbjct: 563 NSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDRGQSSSRSRSRSR 622 Query: 708 MMQEDDHRSRSRDVDY 755 + ++DHRSRSRD DY Sbjct: 623 AVADEDHRSRSRDGDY 638 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 305 bits (781), Expect = 1e-80 Identities = 155/262 (59%), Positives = 172/262 (65%), Gaps = 11/262 (4%) Frame = +3 Query: 3 IGAKNM------IGSGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +GAKNM +G+G NGG YGQG +PQ MM AGFDPTYMGRGGG Sbjct: 362 MGAKNMAGNPAGVGTGANGG--YGQGLAGPGFGGPVGGMMNPQGMMGAGFDPTYMGRGGG 419 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+ S+ AVNTMGL GVAPHVNPAFFGRG++ Sbjct: 420 YGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMDGHHAGM 479 Query: 345 
WTDTSMGEWGGDEHA-RMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDW 521 W D SMG WGGDEH R +E HE+GGRSNA SRE++RGSERDW Sbjct: 480 WNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSRERERGSERDW 539 Query: 522 SGNSERRHRDEREQDWERSD----RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSR 689 SGNSERRHRDEREQDW+RS+ R+HRYKEEKD YRDHRQRER+ DDWDRGQSSSR Sbjct: 540 SGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYEDDWDRGQSSSR 599 Query: 690 SRVKSNMMQEDDHRSRSRDVDY 755 R +S M EDDHRSRSRDVDY Sbjct: 600 PRSRSKAMPEDDHRSRSRDVDY 621 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 286 bits (733), Expect = 4e-75 Identities = 143/260 (55%), Positives = 167/260 (64%), Gaps = 9/260 (3%) Frame = +3 Query: 3 IGAKNMIG------SGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM+G +G NG YGQG HPQ MM AGFDPTYM RGGG Sbjct: 385 VGVKNMVGISAGVGNGANGAGAYGQG-PGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGG 443 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVNTMGL GVAPHVNPAFFGRG++ Sbjct: 444 YGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGM 503 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 WTD SMG WGGDEH R + GRS+ +SREK+R SER+WS Sbjct: 504 WTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKGRSSGASREKERVSEREWS 563 Query: 525 GNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSR 695 GNS+RRHRDE+EQDW+RS+ R+HRY+EEKD YR+HR RER+ D DDWDRGQSSSRSR Sbjct: 564 GNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDDWDRGQSSSRSR 623 Query: 696 VKSNMMQEDDHRSRSRDVDY 755 +S+ M E++HRSRSRDVDY Sbjct: 624 RRSHAMPEEEHRSRSRDVDY 
643 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 284 bits (727), Expect = 2e-74 Identities = 138/255 (54%), Positives = 169/255 (66%), Gaps = 4/255 (1%) Frame = +3 Query: 3 IGAKNMI---GSGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGGYGA 173 +G+KNM+ G+G G +GQG HPQ MM GFDP++MGRG GYG Sbjct: 385 MGSKNMMVNPGAGNGAGGAFGQGLAGPAFGGPPAGLMHPQGMMGPGFDPSFMGRGAGYGG 444 Query: 174 FQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXXWTD 353 F P FPGM+P +QAVN MGLPGVAPHVNPAFFGRG++A WTD Sbjct: 445 FSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRGMAANGMGMMSAAGMDGPHPGMWTD 504 Query: 354 TSMGEWGGDEHARM-KEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWSGN 530 TS G WGG+EH R +E H++G RS+A SREK+RGSERDWSGN Sbjct: 505 TSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVSHDKGARSSAVSREKERGSERDWSGN 564 Query: 531 SERRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKSNM 710 S++RHRDERE D +R D++HRY+EE+DGYRD+RQ+ERE + +D+DRGQSSSRSR KS Sbjct: 565 SDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQKERESEYEEDYDRGQSSSRSRSKSRA 624 Query: 711 MQEDDHRSRSRDVDY 755 QE+DHRSRSRD +Y Sbjct: 625 AQEEDHRSRSRDTNY 639 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 284 bits (726), Expect = 3e-74 Identities = 142/260 (54%), Positives = 166/260 (63%), Gaps = 9/260 (3%) Frame = +3 Query: 3 IGAKNMIG------SGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM+G +G NG YGQG HPQ MM AGFDPTYM RGGG Sbjct: 385 VGVKNMVGISAGVGNGANGAGAYGQG-PGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGG 443 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVNTMGL GVAPHVNPAFFGRG++ Sbjct: 444 YGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGM 503 Query: 345 
WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 WTD SMG WGGDEH R + GRS+ +SREK+R SER+WS Sbjct: 504 WTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKGRSSGASREKERVSEREWS 563 Query: 525 GNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSR 695 GNS+RRHRDE+EQDW+RS+ R+HRY+EEKD YR+HR RER+ D DDWDRGQSSSRSR Sbjct: 564 GNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDDWDRGQSSSRSR 623 Query: 696 VKSNMMQEDDHRSRSRDVDY 755 +S+ M E++HRSRSRDV Y Sbjct: 624 RRSHAMPEEEHRSRSRDVGY 643 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 283 bits (725), Expect = 3e-74 Identities = 141/260 (54%), Positives = 166/260 (63%), Gaps = 9/260 (3%) Frame = +3 Query: 3 IGAKNMIGS------GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM+GS G NGG YGQG HPQ MM AGFDPTYMGRGG Sbjct: 385 VGVKNMVGSSAGVGNGANGGAAYGQGPAGPPFGGPAGGMMHPQGMMGAGFDPTYMGRGGS 444 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVNT+GL GVAPHVNPAFFGRG++ Sbjct: 445 YGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGGPGMDGPHVGM 504 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 WTDTSMG WGGDEH R + GRS+ +SREK+R S+R+WS Sbjct: 505 WTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKGRSSGASREKERVSDREWS 564 Query: 525 GNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSR 695 GNS+RRHRDE+E+DW+RS+ R+HRY+EEKD YR+HR RER+ D DD DRGQSSSRSR Sbjct: 565 GNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDDLDRGQSSSRSR 624 Query: 696 VKSNMMQEDDHRSRSRDVDY 755 +S+ M E+ RSRSRDVDY Sbjct: 625 RRSHAMPEEQRRSRSRDVDY 644 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 282 
bits (722), Expect = 8e-74 Identities = 142/260 (54%), Positives = 168/260 (64%), Gaps = 9/260 (3%) Frame = +3 Query: 3 IGAKNMIG------SGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +GAKN++G SG NGG YGQG PQSMM AGFDPTYMGRG G Sbjct: 377 MGAKNIVGGAGGVGSGANGGG-YGQGLAGPAFGGPAGAMLPPQSMMRAGFDPTYMGRGAG 435 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVN MGL GVAPHVNPAFFGRG++ Sbjct: 436 YGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGPSGMDGPNAGM 495 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 W+DTSMG WG + R +E HE+G RS+A+SREK+R SERDWS Sbjct: 496 WSDTSMGGWGEEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAASREKERASERDWS 555 Query: 525 GNSERRHRDEREQDWERSDR---DHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSR 695 GNS+RRHRD+RE DW+RS+R +HRY+EEK+ YRDHRQRER+ DDWDRGQSSSRSR Sbjct: 556 GNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYEDDWDRGQSSSRSR 615 Query: 696 VKSNMMQEDDHRSRSRDVDY 755 +S + E+D+RSRSRD DY Sbjct: 616 SRSRAVPEEDYRSRSRDADY 635 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 282 bits (721), Expect = 1e-73 Identities = 144/263 (54%), Positives = 170/263 (64%), Gaps = 12/263 (4%) Frame = +3 Query: 3 IGAKNMIGS--------GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRG 158 +GA+NMIGS G G YGQG HPQ+MM GFDPTYMGRG Sbjct: 385 MGARNMIGSSSGAGSGVGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRG 443 Query: 159 GGYGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXX 338 GGYG F P FPGM+PS+ AVN MGL GVAPHVNPAFF RG++A Sbjct: 444 GGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHP 503 Query: 339 XXWTDTSMGEWGGDEHA-RMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSER 515 WTD+SMG W G+EH R +E HE+G RS A+SREKDRGSER Sbjct: 504 GMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSER 563 Query: 516 DWSGNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSS 686 DWSGN++RRHR+EREQDW+RS+ RDHR++EEKD YRD RQR+R+ D+WDRGQSSS Sbjct: 564 
DWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGQSSS 623 Query: 687 RSRVKSNMMQEDDHRSRSRDVDY 755 RSR +S + ++DHRSRSRDVDY Sbjct: 624 RSRSRSGAIPDEDHRSRSRDVDY 646 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 281 bits (718), Expect = 2e-73 Identities = 143/263 (54%), Positives = 169/263 (64%), Gaps = 12/263 (4%) Frame = +3 Query: 3 IGAKNMIGS--------GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRG 158 +GA+NMIGS G G YGQG HPQ+MM GFDPTYMGRG Sbjct: 385 MGARNMIGSSSGAGSGAGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRG 443 Query: 159 GGYGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXX 338 GGYG F P FPGM+PS+ AVN MGL GVAPHVNPAFF RG++A Sbjct: 444 GGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHP 503 Query: 339 XXWTDTSMGEWGGDEHA-RMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSER 515 WTD+SMG W G+EH R +E HE+G RS +SREKDRGSER Sbjct: 504 GMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASREKDRGSER 563 Query: 516 DWSGNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSS 686 DWSGN++RRHR+EREQDW+RS+ RDHR++EEKD YRD RQR+R+ D+WDRGQSSS Sbjct: 564 DWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGQSSS 623 Query: 687 RSRVKSNMMQEDDHRSRSRDVDY 755 RSR +S + ++DHRSRSRDVDY Sbjct: 624 RSRSRSGAIPDEDHRSRSRDVDY 646 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 280 bits (717), Expect = 3e-73 Identities = 143/263 (54%), Positives = 169/263 (64%), Gaps = 12/263 (4%) Frame = +3 Query: 3 IGAKNMIGS--------GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRG 158 +GAKNM+GS G G YGQG HPQ+MM GFDPTYMGRG Sbjct: 388 
MGAKNMMGSSSGAGSGAGPAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRG 446 Query: 159 GGYGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXX 338 GGYG F P FPGM+PS+ AVN MGL GVAPHVNPAFF RG++A Sbjct: 447 GGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHP 506 Query: 339 XXWTDTSMGEWGGDEHA-RMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSER 515 WTD+SMG W G+EH R +E HE+G RS A+SREKDRGSER Sbjct: 507 GMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSER 566 Query: 516 DWSGNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSS 686 DWSGN++RRHR+EREQDW+RS+ RDHR++EEKD YRD RQR+R+ D+WDRG SSS Sbjct: 567 DWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGPSSS 626 Query: 687 RSRVKSNMMQEDDHRSRSRDVDY 755 RSR +S + ++DHRSRSRDVDY Sbjct: 627 RSRSRSRAIPDEDHRSRSRDVDY 649 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 280 bits (715), Expect = 5e-73 Identities = 143/263 (54%), Positives = 169/263 (64%), Gaps = 12/263 (4%) Frame = +3 Query: 3 IGAKNMIGS--------GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRG 158 +GA+NMIGS G G YGQG HPQ+MM GFDPTYMGRG Sbjct: 388 MGARNMIGSSSGAGSGVGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRG 446 Query: 159 GGYGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXX 338 GGYG F P FPGM+PS+ AVN MGL GVAPHVNPAFF RG++A Sbjct: 447 GGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHP 506 Query: 339 XXWTDTSMGEWGGDEHA-RMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSER 515 WTD+SMG W G+EH R +E HE+G RS A+SREKDRGSER Sbjct: 507 GMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSER 566 Query: 516 DWSGNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSS 686 DWSGN++RRHR+EREQDW+RS+ RDHR++EEKD YRD RQR+R+ D+WDRG SSS Sbjct: 567 DWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGPSSS 626 Query: 687 RSRVKSNMMQEDDHRSRSRDVDY 755 RSR +S + ++DHRSRSRDVDY Sbjct: 627 RSRSRSRAIPDEDHRSRSRDVDY 649 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] 
gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 276 bits (706), Expect = 6e-72 Identities = 139/254 (54%), Positives = 158/254 (62%), Gaps = 6/254 (2%) Frame = +3 Query: 12 KNMIGSGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTY---MGRGGGYGAFQN 182 K M+G G +G NPYGQ HPQ MM +GFDPTY +GRG GYG F Sbjct: 401 KAMVG-GPSGANPYGQALSAPPLGGPPGGLLHPQGMMGSGFDPTYGAHLGRGSGYGGFSG 459 Query: 183 PVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXXWTDTSM 362 P FPGM+PS+ + T+GLPGVAPHVNPAFFGRGVSA W D+SM Sbjct: 460 PHFPGMLPSFSPMGTVGLPGVAPHVNPAFFGRGVSANGMGMMGSGAMDGHHGGMWGDSSM 519 Query: 363 GE---WGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWSGNS 533 G WG +EH R HERGG + REKDRGSERDWS Sbjct: 520 GGGVGWGNEEHGRRTRESSYGDDGASDYGYGDGGHERGGGRSNPGREKDRGSERDWSSGP 579 Query: 534 ERRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKSNMM 713 ERRHRD+R+ DW DRD RYK+EKDGY DHRQRER+WDN DDWDRG++SSRSR KS MM Sbjct: 580 ERRHRDDRDSDW---DRDPRYKDEKDGYSDHRQRERDWDNEDDWDRGRTSSRSRSKSRMM 636 Query: 714 QEDDHRSRSRDVDY 755 QE+D RSRS+DVDY Sbjct: 637 QEEDQRSRSKDVDY 650 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 266 bits (680), Expect = 6e-69 Identities = 136/257 (52%), Positives = 160/257 (62%), Gaps = 6/257 (2%) Frame = +3 Query: 3 IGAKNM------IGSGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM +GSG NGG YGQG H Q MM AGFDP YMGRGGG Sbjct: 327 MGPKNMAGNVAGVGSGANGGG-YGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGG 385 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F FPGM+PS+ AVN+MGL GVAPHVNPAFF RG++ Sbjct: 386 YGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMASSGMEGPNPGK 445 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 W DTSMG WG + R +E HE+G RS+ +SREK+R SERDWS Sbjct: 446 WPDTSMGGWGEEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWS 505 Query: 525 
GNSERRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKS 704 GNS+RRHRDEREQDW+RS+R+ +Y+EEKD YR HRQRER+ DD DRG SSSR+R +S Sbjct: 506 GNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHSSSRARSRS 565 Query: 705 NMMQEDDHRSRSRDVDY 755 E+D+RSRSRDVDY Sbjct: 566 RAAPEEDYRSRSRDVDY 582 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 262 bits (670), Expect = 8e-68 Identities = 143/305 (46%), Positives = 165/305 (54%), Gaps = 54/305 (17%) Frame = +3 Query: 3 IGAKNMIG------SGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM+G +G NG YGQG HPQ MM AGFDPTYM RGGG Sbjct: 385 VGVKNMVGISAGVGNGANGAGAYGQG-PGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGG 443 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVNTMGL GVAPHVNPAFFGRG++ Sbjct: 444 YGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGM 503 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 WTD SMG WGGDEH R + GRS+ +SREK+R SER+WS Sbjct: 504 WTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKGRSSGASREKERVSEREWS 563 Query: 525 GNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHRQREREW---------------- 647 GNS+RRHRDE+EQDW+RS+ R+HRY+EEKD YR+HR REREW Sbjct: 564 GNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHREREWSGNSDRRHRDEKERDW 623 Query: 648 -----------------------------DNGDDWDRGQSSSRSRVKSNMMQEDDHRSRS 740 D DD DRGQSSSRSR +S+ M E+ RSRS Sbjct: 624 DRSEREHREHRYREEKDSYREHRHRERDLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRS 683 Query: 741 RDVDY 755 RDVDY Sbjct: 684 RDVDY 688 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 261 bits (668), Expect = 1e-67 Identities = 133/256 (51%), Positives = 158/256 (61%), Gaps = 5/256 (1%) Frame = +3 Query: 3 IGAKNMIGSGVN-GGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGGYGAFQ 179 +GAKNM+G+ GG YGQG +PQ MM GFDPTYMGRG GYG F Sbjct: 380 MGAKNMVGNNAGVGGGGYGQGLAGPPFGGPAGGMMNPQGMMGTGFDPTYMGRGVGYGGFA 439 
Query: 180 NPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXXWTDTS 359 P FPGM+PS+ AVNTMG VAPHVNPAFFGRG++ W D S Sbjct: 440 GPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGSSLMDGHQGGMWNDPS 499 Query: 360 MGEWGGDEHA-RMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWSGNSE 536 +G WGG+EH R +E HE+GGR +RGSERDWSGNSE Sbjct: 500 IGGWGGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR--------ERGSERDWSGNSE 551 Query: 537 RRHRDEREQDWERS---DRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKSN 707 RR+ +ER+QDW+RS ++HRY+E KDG RD+R +ERE D DDWDRGQSSSR R +S Sbjct: 552 RRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYRPKERELDYEDDWDRGQSSSRLRSRSR 611 Query: 708 MMQEDDHRSRSRDVDY 755 ++QED HRSRSRDVDY Sbjct: 612 VVQEDHHRSRSRDVDY 627 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 258 bits (658), Expect = 2e-66 Identities = 133/257 (51%), Positives = 157/257 (61%), Gaps = 6/257 (2%) Frame = +3 Query: 3 IGAKNM------IGSGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM +GSG NGG YGQG PQ MM AGFDP YMGRGGG Sbjct: 372 MGPKNMAGNVAGVGSGANGGG-YGQGLAGPAFGGPAGGMMPPQGMMGAGFDPLYMGRGGG 430 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVN+MGL GVAPHVNPAFF RG++ Sbjct: 431 YGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVSSGMDGPNPGM 490 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 W + G+ G E+ HE+G RS+ +SREK+RGSERDWS Sbjct: 491 WESSYDGDEGASEYG-----------------YGEGNHEKGARSSGASREKERGSERDWS 533 Query: 525 GNSERRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKS 704 GNS+RRHRDEREQDW+R +R+HRYKEEKD YR HRQRER+ DD DRG SSSR+R +S Sbjct: 534 GNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHSSSRARSRS 593 Query: 705 NMMQEDDHRSRSRDVDY 755 E+D+RSR+RDVDY Sbjct: 594 RAAPEEDYRSRTRDVDY 610 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. 
vesca] Length = 646 Score = 253 bits (645), Expect = 7e-65 Identities = 133/262 (50%), Positives = 157/262 (59%), Gaps = 11/262 (4%) Frame = +3 Query: 3 IGAKNMIGS------GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +GA+NM+G+ G NGG YGQG + MM GFDPTYMGRGGG Sbjct: 377 MGARNMVGNNAGVGTGANGGG-YGQGLGGPGFGGPVGGMMNAPGMMGPGFDPTYMGRGGG 435 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+P + VN MGL GVAPHVNPAFFGRG++ Sbjct: 436 YGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMEGHHAPM 495 Query: 345 WTDTSMGEWGGDEHAR--MKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERD 518 W D SM W G+E R + HE+ RS+A+ RE++R SER+ Sbjct: 496 WNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAAPRERERESERE 555 Query: 519 WSGNSERRHRDEREQDWERSDRDH---RYKEEKDGYRDHRQREREWDNGDDWDRGQSSSR 689 W+G SERRHRDEREQDW+RS+R+H RYKEEKD YRDHR+RER+ DD DRG SSSR Sbjct: 556 WTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDHRRRERDVAYEDDRDRGHSSSR 615 Query: 690 SRVKSNMMQEDDHRSRSRDVDY 755 R +S M EDDHRSRSRDVDY Sbjct: 616 PRSRSKAMPEDDHRSRSRDVDY 637 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 245 bits (625), Expect = 1e-62 Identities = 128/253 (50%), Positives = 156/253 (61%), Gaps = 2/253 (0%) Frame = +3 Query: 3 IGAKNMIGS--GVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGGYGAF 176 +G KNMIG+ G GG YGQG + HPQ MM GFD +MGRGGGYG F Sbjct: 385 MGNKNMIGNAPGAGGGGAYGQG-LNGPGFGGPPGMMHPQGMMGPGFDLAFMGRGGGYGGF 443 Query: 177 QNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXXWTDT 356 P F GM+P +Q VN+MGLPGVAPHVNPAFFGRG++ W D Sbjct: 444 SGPPFQGMLPPFQGVNSMGLPGVAPHVNPAFFGRGMNPNGMGMMGNPGMVGPHSGMWNDP 503 Query: 357 SMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWSGNSE 536 +MG WGG+EH R E H++ RS+A+ REK+R SER++ E Sbjct: 504 NMGGWGGEEHGR--ESSYGGEDNASEYGYGEGSHDKSVRSSAAPREKERTSEREY---PE 558 Query: 537 RRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVKSNMMQ 716 R+HR+ERE D ER+DRD +Y+EEKD YR+HR +ERE DDWDRGQ SSRSR +S +Q Sbjct: 559 
RKHREERENDGERNDRDSKYREEKDRYREHRHKERESGYDDDWDRGQ-SSRSRSRSGAVQ 617 Query: 717 EDDHRSRSRDVDY 755 E+DHRSRSRD DY Sbjct: 618 EEDHRSRSRDADY 630 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 225 bits (574), Expect = 1e-56 Identities = 113/218 (51%), Positives = 132/218 (60%), Gaps = 9/218 (4%) Frame = +3 Query: 3 IGAKNMIG------SGVNGGNPYGQGFVXXXXXXXXXXXXHPQSMMAAGFDPTYMGRGGG 164 +G KNM+G +G NG YGQG HPQ MM AGFDPTYM RGGG Sbjct: 385 VGVKNMVGISAGVGNGANGAGAYGQG-PGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGG 443 Query: 165 YGAFQNPVFPGMIPSYQAVNTMGLPGVAPHVNPAFFGRGVSAXXXXXXXXXXXXXXXXXX 344 YG F P FPGM+PS+ AVNTMGL GVAPHVNPAFFGRG++ Sbjct: 444 YGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGM 503 Query: 345 WTDTSMGEWGGDEHARMKEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDWS 524 WTD SMG WGGDEH R + GRS+ +SREK+R SER+WS Sbjct: 504 WTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKGRSSGASREKERVSEREWS 563 Query: 525 GNSERRHRDEREQDWERSD---RDHRYKEEKDGYRDHR 629 GNS+RRHRDE+EQDW+RS+ R+HRY+EEKD YR+HR Sbjct: 564 GNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHR 601 >emb|CAN66828.1| hypothetical protein VITISV_015886 [Vitis vinifera] Length = 168 Score = 202 bits (514), Expect = 1e-49 Identities = 94/138 (68%), Positives = 108/138 (78%), Gaps = 1/138 (0%) Frame = +3 Query: 345 WTDTSMGEWGGDEHARM-KEXXXXXXXXXXXXXXXXXXHERGGRSNASSREKDRGSERDW 521 WTDTSMG WGG+EH R +E HE+ GRSN +SREK+RGSERDW Sbjct: 22 WTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASREKERGSERDW 81 Query: 522 SGNSERRHRDEREQDWERSDRDHRYKEEKDGYRDHRQREREWDNGDDWDRGQSSSRSRVK 701 SGNSERRHRDEREQDWERSD+DHRY+EEKDGYRDHRQRER+++N DDWDRGQSSSRSR + Sbjct: 82 SGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDRGQSSSRSRSR 141 Query: 702 SNMMQEDDHRSRSRDVDY 755 S + ++DHRSRSRD DY Sbjct: 142 SRAVADEDHRSRSRDGDY 159