BLASTX nr result
ID: Atropa21_contig00030604
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00030604 (1164 letters)

Database: nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   237    8e-60
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   155    4e-35
gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus pe...   154    7e-35
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   149    2e-33
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   149    2e-33
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   148    5e-33
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   147    1e-32
ref|XP_002312652.1| RNA recognition motif-containing family prot...   145    2e-32
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   144    5e-32
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   142    2e-31
ref|XP_002315647.1| RNA recognition motif-containing family prot...   142    2e-31
gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma c...   140    1e-30
gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma c...   132    3e-28
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   130    1e-27
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   130    1e-27
gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Th...   127    1e-26
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A...   114    6e-23
gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma c...
113 2e-22 emb|CAN66828.1| hypothetical protein VITISV_015886 [Vitis vinifera] 105 3e-20 gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma c... 100 1e-18 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 237 bits (604), Expect = 8e-60 Identities = 150/313 (47%), Positives = 152/313 (48%), Gaps = 2/313 (0%) Frame = -3 Query: 1162 PMNEGVGRGGANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXXX 983 PMNEGVGRGG NYTPGDA GAMGSK Sbjct: 336 PMNEGVGRGGPNYTPGDAGRNFGRGSWGRGGPGMPNRGPGGGPVRGRGAMGSKNMMVNPG 395 Query: 982 XXXXXXXXXGQVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXX 803 GQ L GPAF GP AGLMHPQGMM PGFD Sbjct: 396 AGNGAGGAFGQGLAGPAFGGPPAGLMHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMP 455 Query: 802 XXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXX 623 AVNPM LPGVAPHVNPAFF GPHP Sbjct: 456 PFQAVNPMGLPGVAPHVNPAFFGRGMAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEH 515 Query: 622 XXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD--NSXXXXXXXXXXX 449 YGGEDNASEYGYGEVSHDKGARSSAVSREKE GSERD + Sbjct: 516 GRRTRESSYGGEDNASEYGYGEVSHDKGARSSAVSREKERGSERDWSGNSDKRHRDEREH 575 Query: 448 XXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSR 269 R+GYRDY QKE E EYEEDYDRGQ RAAQEEDHRSRSR Sbjct: 576 DRDRHDKEHRYREERDGYRDYRQKERESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSR 635 Query: 268 DTNYGKRRRAPSE 230 DTNYGKRRRAPSE Sbjct: 636 DTNYGKRRRAPSE 648 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 155 bits (391), Expect = 4e-35 Identities = 112/316 (35%), Positives = 130/316 (41%), Gaps = 5/316 (1%) Frame = -3 Query: 1162 PMNEGVGRGGA-NYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG--AMGSKXXXX 992 PMN+GVGRGG N 
GDA A+G+K Sbjct: 332 PMNDGVGRGGGMNMQGGDAGRNYGRGGWGRGGQGILNRGPGGGGPMRGRGGAVGAKNMVG 391 Query: 991 XXXXXXXXXXXXGQVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXX 812 GQ L GP F GP GLMHPQGMM GFD Sbjct: 392 NTAGVGASGGGYGQGLAGPTFGGPAGGLMHPQGMMGSGFDPTYMGRGGAYGGFSGSAFPG 451 Query: 811 XXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXX 632 AVN M L GVAPHVNPAFF G H Sbjct: 452 MVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGATGMDGHHAGMWTDTSMGGWGG 511 Query: 631 XXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD--NSXXXXXXXX 458 YGG+D AS+YGYGEV+H+K RS+ SREKE GSERD + Sbjct: 512 EEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASREKERGSERDWSGNSERRHRDE 571 Query: 457 XXXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRS 278 ++GYRD+ Q+E + E+D+DRGQ RA +EDHRS Sbjct: 572 REQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDRGQSSSRSRSRSRAVADEDHRS 631 Query: 277 RSRDTNYGKRRRAPSE 230 RSRD +YGKRRR PSE Sbjct: 632 RSRDGDYGKRRRLPSE 647 >gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 154 bits (389), Expect = 7e-35 Identities = 113/322 (35%), Positives = 131/322 (40%), Gaps = 11/322 (3%) Frame = -3 Query: 1162 PMNEGVGRGGA-NYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG---AMGSKXXX 995 PMNEGVGRGG NY GD G AMG+K Sbjct: 309 PMNEGVGRGGGVNYQTGDTGGRNFGRGGWGRGGQGVANRGPGGGGPMRGRGGAMGAKNMA 368 Query: 994 XXXXXXXXXXXXXG-QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXX 818 Q L GP F GP+ G+M+PQGMM GFD Sbjct: 369 GNPAGVGTGANGGYGQGLAGPGFGGPVGGMMNPQGMMGAGFDPTYMGRGGGYGGFPGPAF 428 Query: 817 XXXXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXX 638 AVN M L GVAPHVNPAFF G H Sbjct: 429 PGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMDGHHAGMWNDPSMGGW 488 Query: 637 XXXXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXX 458 YGG+D ASEYGYGE +H+KG RS+A SRE+E GSERD S Sbjct: 489 GGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSRERERGSERDWSGNSERRHR 548 Query: 457 XXXXXXXXXXXXXXXXXR------NGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQ 296 + YRD+ Q+E + YE+D+DRGQ +A Sbjct: 549 
DEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYEDDWDRGQSSSRPRSRSKAMP 608 Query: 295 EEDHRSRSRDTNYGKRRRAPSE 230 E+DHRSRSRD +YGKRRR PSE Sbjct: 609 EDDHRSRSRDVDYGKRRRLPSE 630 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 149 bits (377), Expect = 2e-33 Identities = 107/322 (33%), Positives = 129/322 (40%), Gaps = 11/322 (3%) Frame = -3 Query: 1162 PMNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXX 986 PMN+G GRGG NY GD G MG++ Sbjct: 335 PMNDGGGRGGNTNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGARNMIGSS 394 Query: 985 XXXXXXXXXXG-----QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXX 821 Q L GP F GP G+MHPQ MM GFD Sbjct: 395 SGAGSGVGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSGPG 453 Query: 820 XXXXXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXX 641 AVN M L GVAPHVNPAFF GPHP Sbjct: 454 FPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSMGG 513 Query: 640 XXXXXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD-----NSXX 476 YGG+D AS+YGYGE +H+KGARS+A SREK+ GSERD + Sbjct: 514 WLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSERDWSGNTDRRH 573 Query: 475 XXXXXXXXXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQ 296 ++ YRD Q++ + Y++++DRGQ A Sbjct: 574 REEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGQSSSRSRSRSGAIP 633 Query: 295 EEDHRSRSRDTNYGKRRRAPSE 230 +EDHRSRSRD +YGKRRR PSE Sbjct: 634 DEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 149 bits (376), Expect = 2e-33 Identities = 107/322 (33%), Positives = 128/322 (39%), Gaps = 11/322 (3%) Frame = -3 Query: 1162 PMNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXX 986 PMN+G 
GRGG NY GD G MG++ Sbjct: 335 PMNDGGGRGGNTNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGARNMIGSS 394 Query: 985 XXXXXXXXXXG-----QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXX 821 Q L GP F GP G+MHPQ MM GFD Sbjct: 395 SGAGSGAGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSGPG 453 Query: 820 XXXXXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXX 641 AVN M L GVAPHVNPAFF GPHP Sbjct: 454 FPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSMGG 513 Query: 640 XXXXXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD-----NSXX 476 YGG+D AS+YGYGE SH+KGARS+ SREK+ GSERD + Sbjct: 514 WVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASREKDRGSERDWSGNTDRRH 573 Query: 475 XXXXXXXXXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQ 296 ++ YRD Q++ + Y++++DRGQ A Sbjct: 574 REEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGQSSSRSRSRSGAIP 633 Query: 295 EEDHRSRSRDTNYGKRRRAPSE 230 +EDHRSRSRD +YGKRRR PSE Sbjct: 634 DEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 148 bits (373), Expect = 5e-33 Identities = 108/322 (33%), Positives = 129/322 (40%), Gaps = 11/322 (3%) Frame = -3 Query: 1162 PMNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXX 986 PMN+G GRGG NY GD G MG+K Sbjct: 338 PMNDGGGRGGNMNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGAKNMMGSS 397 Query: 985 XXXXXXXXXXG-----QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXX 821 Q L GP F GP G+MHPQ MM GFD Sbjct: 398 SGAGSGAGPAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSGPG 456 Query: 820 XXXXXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXX 641 AVN M L GVAPHVNPAFF GPHP Sbjct: 457 FPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSMGG 516 Query: 640 XXXXXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD-----NSXX 476 YGG+D AS+YGYGE +H+KGARS+A SREK+ GSERD + Sbjct: 517 WVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSERDWSGNTDRRH 576 Query: 475 
XXXXXXXXXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQ 296 ++ YRD Q++ + Y++++DRG RA Sbjct: 577 REEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGPSSSRSRSRSRAIP 636 Query: 295 EEDHRSRSRDTNYGKRRRAPSE 230 +EDHRSRSRD +YGKRRR PSE Sbjct: 637 DEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 147 bits (370), Expect = 1e-32 Identities = 107/322 (33%), Positives = 129/322 (40%), Gaps = 11/322 (3%) Frame = -3 Query: 1162 PMNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXX 986 PMN+G GRGG NY GD G MG++ Sbjct: 338 PMNDGGGRGGNMNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGARNMIGSS 397 Query: 985 XXXXXXXXXXG-----QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXX 821 Q L GP F GP G+MHPQ MM GFD Sbjct: 398 SGAGSGVGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSGPG 456 Query: 820 XXXXXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXX 641 AVN M L GVAPHVNPAFF GPHP Sbjct: 457 FPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSMGG 516 Query: 640 XXXXXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD-----NSXX 476 YGG+D AS+YGYGE +H+KGARS+A SREK+ GSERD + Sbjct: 517 WLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSERDWSGNTDRRH 576 Query: 475 XXXXXXXXXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQ 296 ++ YRD Q++ + Y++++DRG RA Sbjct: 577 REEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDNWDRGPSSSRSRSRSRAIP 636 Query: 295 EEDHRSRSRDTNYGKRRRAPSE 230 +EDHRSRSRD +YGKRRR PSE Sbjct: 637 DEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 145 bits (367), Expect = 2e-32 Identities = 111/315 (35%), Positives = 129/315 (40%), Gaps = 5/315 (1%) Frame = -3 Query: 1159 MNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXXX 983 MN+G GRGG AN+ GD GAMG K Sbjct: 323 
MNDGAGRGGNANFQSGDGGRNYGRGAWGRGGQGILNRGPGGGPMRGRGAMGPKNMAGNVA 382 Query: 982 XXXXXXXXXG--QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXX 809 G Q L GPAF GP G+M PQGMM GFD Sbjct: 383 GVGSGANGGGYGQGLAGPAFGGPAGGMMPPQGMMGAGFDPLYMGRGGGYGGFAGPGFPGM 442 Query: 808 XXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXX 629 AVN M L GVAPHVNPAFF GP+P Sbjct: 443 LPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVSSGMDGPNPGMWESS-------- 494 Query: 628 XXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD--NSXXXXXXXXX 455 Y G++ ASEYGYGE +H+KGARSS SREKE GSERD + Sbjct: 495 ----------YDGDEGASEYGYGEGNHEKGARSSGASREKERGSERDWSGNSDRRHRDER 544 Query: 454 XXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSR 275 ++ YR + Q+E + YE+D DRG RAA EED+RSR Sbjct: 545 EQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSR 604 Query: 274 SRDTNYGKRRRAPSE 230 +RD +YGKRRR PSE Sbjct: 605 TRDVDYGKRRRLPSE 619 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 144 bits (364), Expect = 5e-32 Identities = 114/321 (35%), Positives = 130/321 (40%), Gaps = 10/321 (3%) Frame = -3 Query: 1162 PMNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG--AMGSKXXXX 992 PMN+G GRGG NY GDA +MG+K Sbjct: 325 PMNDGAGRGGNMNYQGGDAGRNFGRGGWGRGGQGILNRGPGGGGRMGGRGGSMGAKNIVG 384 Query: 991 XXXXXXXXXXXXG--QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXX 818 G Q L GPAF GP ++ PQ MM GFD Sbjct: 385 GAGGVGSGANGGGYGQGLAGPAFGGPAGAMLPPQSMMRAGFDPTYMGRGAGYGGFAGPGF 444 Query: 817 XXXXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXX 638 AVN M L GVAPHVNPAFF GP+ Sbjct: 445 PGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGPSGMDGPNAGMWSDTSMGGW 504 Query: 637 XXXXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXX 458 YGG+D ASEYGYGEV+H+KGARSSA SREKE SERD S Sbjct: 505 GEEPGRRTRESS-YGGDDGASEYGYGEVNHEKGARSSAASREKERASERDWSGNSDRRHR 563 Query: 457 XXXXXXXXXXXXXXXXXR-----NGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQE 293 R YRD+ Q+E + YE+D+DRGQ RA E Sbjct: 564 
DDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYEDDWDRGQSSSRSRSRSRAVPE 623 Query: 292 EDHRSRSRDTNYGKRRRAPSE 230 ED+RSRSRD +YGKRRR PSE Sbjct: 624 EDYRSRSRDADYGKRRRLPSE 644 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 142 bits (359), Expect = 2e-31 Identities = 110/315 (34%), Positives = 128/315 (40%), Gaps = 5/315 (1%) Frame = -3 Query: 1159 MNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXXX 983 MN+G+GRGG ANY GD G MG K Sbjct: 278 MNDGMGRGGNANYQSGDGGRNYGRGGWGRGGQGVLNRGPGGGPMRGRGGMGPKNMAGNVA 337 Query: 982 XXXXXXXXXG--QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXX 809 G Q + GPAF GP G+MH QGMM GFD Sbjct: 338 GVGSGANGGGYGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGM 397 Query: 808 XXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXX 629 AVN M L GVAPHVNPAFF GP+P Sbjct: 398 LPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMASSGMEGPNP-GKWPDTSMGGWGE 456 Query: 628 XXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD--NSXXXXXXXXX 455 Y G++ ASEYGYGE +H+KGARSS SREKE SERD + Sbjct: 457 EPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWSGNSDRRHRDER 516 Query: 454 XXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSR 275 ++ YR + Q+E + YE+D DRG RAA EED+RSR Sbjct: 517 EQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSR 576 Query: 274 SRDTNYGKRRRAPSE 230 SRD +YGKRRR PSE Sbjct: 577 SRDVDYGKRRRPPSE 591 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 142 bits (359), Expect = 2e-31 Identities = 110/315 (34%), Positives = 128/315 (40%), Gaps = 5/315 (1%) Frame = -3 Query: 1159 MNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGAMGSKXXXXXXX 983 MN+G+GRGG ANY GD G MG K Sbjct: 278 MNDGMGRGGNANYQSGDGGRNYGRGGWGRGGQGVLNRGPGGGPMRGRGGMGPKNMAGNVA 337 Query: 982 
XXXXXXXXXG--QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXX 809 G Q + GPAF GP G+MH QGMM GFD Sbjct: 338 GVGSGANGGGYGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGGYGGFPGHGFPGM 397 Query: 808 XXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXX 629 AVN M L GVAPHVNPAFF GP+P Sbjct: 398 LPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMASSGMEGPNPGKESS--------- 448 Query: 628 XXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD--NSXXXXXXXXX 455 Y G++ ASEYGYGE +H+KGARSS SREKE SERD + Sbjct: 449 ----------YDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWSGNSDRRHRDER 498 Query: 454 XXXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSR 275 ++ YR + Q+E + YE+D DRG RAA EED+RSR Sbjct: 499 EQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHSSSRARSRSRAAPEEDYRSR 558 Query: 274 SRDTNYGKRRRAPSE 230 SRD +YGKRRR PSE Sbjct: 559 SRDVDYGKRRRPPSE 573 >gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 140 bits (353), Expect = 1e-30 Identities = 91/242 (37%), Positives = 108/242 (44%), Gaps = 5/242 (2%) Frame = -3 Query: 940 GPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMCLPGVA 761 GPAF GP G+MHPQGMM GFD AVN M L GVA Sbjct: 412 GPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVA 471 Query: 760 PHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXXGYGGEDN 581 PHVNPAFF GPH YGGED Sbjct: 472 PHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDG 531 Query: 580 ASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXXXXXXXXXXXXXXXXXXXR- 404 ASEYGYG+ +H+KG RSS SREKE SER+ S R Sbjct: 532 ASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRY 590 Query: 403 ----NGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKRRRAP 236 + YR++ +E + +Y++D+DRGQ A EE+HRSRSRD +YGK+RR P Sbjct: 591 REEKDSYREHRHRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLP 650 Query: 235 SE 230 SE Sbjct: 651 SE 652 >gb|EOY00734.1| RNA-binding family protein 
isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 132 bits (331), Expect = 3e-28 Identities = 88/242 (36%), Positives = 104/242 (42%), Gaps = 5/242 (2%) Frame = -3 Query: 940 GPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMCLPGVA 761 GP F GP G+MHPQGMM GFD AVN + L GVA Sbjct: 413 GPPFGGPAGGMMHPQGMMGAGFDPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVA 472 Query: 760 PHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXXGYGGEDN 581 PHVNPAFF GPH YGGED Sbjct: 473 PHVNPAFFGRGMAPNGMGMMGGPGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDG 532 Query: 580 ASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXXXXXXXXXXXXXXXXXXXR- 404 ASEYGYG+ +H+KG RSS SREKE S+R+ S R Sbjct: 533 ASEYGYGDANHEKG-RSSGASREKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRY 591 Query: 403 ----NGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKRRRAP 236 + YR++ +E + +Y++D DRGQ A EE RSRSRD +YGKRRR P Sbjct: 592 REEKDSYREHRHRERDLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLP 651 Query: 235 SE 230 SE Sbjct: 652 SE 653 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 130 bits (326), Expect = 1e-27 Identities = 100/314 (31%), Positives = 117/314 (37%), Gaps = 3/314 (0%) Frame = -3 Query: 1162 PMNEGVGRGG-ANYTPGDAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG--AMGSKXXXX 992 P+N+GVGRGG N+ GD AMG+K Sbjct: 328 PINDGVGRGGNPNFQSGDGGRNFGRGGWGRGGQGAPNRGPGSGGPMRGRGGAMGAKNMVG 387 Query: 991 XXXXXXXXXXXXGQVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXX 812 Q L GP F GP G+M+PQGMM GFD Sbjct: 388 NNAGVGGGGYG--QGLAGPPFGGPAGGMMNPQGMMGTGFDPTYMGRGVGYGGFAGPAFPG 445 Query: 811 XXXXXXAVNPMCLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXX 632 AVN M VAPHVNPAFF G Sbjct: 446 MLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGSSLMDGHQGGMWNDPSIGGWGG 505 Query: 631 XXXXXXXXXXGYGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXXXX 452 YGG+D ASEYGYG+ +H+KG R R+ SER N Sbjct: 506 EEHGRRTRESSYGGDDGASEYGYGDTNHEKGGRERGSERDWSGNSERRNHEERDQDWDRS 565 Query: 451 
XXXXXXXXXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRS 272 +G RDY KE E +YE+D+DRGQ R QE+ HRSRS Sbjct: 566 QKEQKEHRYREGK---DGSRDYRPKERELDYEDDWDRGQSSSRLRSRSRVVQEDHHRSRS 622 Query: 271 RDTNYGKRRRAPSE 230 RD +YGKRRR PSE Sbjct: 623 RDVDYGKRRRLPSE 636 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 130 bits (326), Expect = 1e-27 Identities = 86/247 (34%), Positives = 104/247 (42%), Gaps = 6/247 (2%) Frame = -3 Query: 952 QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMCL 773 Q LGGP F GP+ G+M+ GMM PGFD VN M L Sbjct: 400 QGLGGPGFGGPVGGMMNAPGMMGPGFDPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGL 459 Query: 772 PGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXXGYG 593 GVAPHVNPAFF G H YG Sbjct: 460 AGVAPHVNPAFFGRGMATNGMGMMGSSGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYG 519 Query: 592 GEDNASEYG-YGEVSHDKGARSSAVSREKEWGSERD-----NSXXXXXXXXXXXXXXXXX 431 G+D SEYG YGE +H+K RSSA RE+E SER+ Sbjct: 520 GDDGGSEYGNYGEANHEKPVRSSAAPRERERESEREWTGTSERRHRDEREQDWDRSEREH 579 Query: 430 XXXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGK 251 ++ YRD+ ++E + YE+D DRG +A E+DHRSRSRD +YGK Sbjct: 580 REPRYKEEKDSYRDHRRRERDVAYEDDRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGK 639 Query: 250 RRRAPSE 230 RRR PSE Sbjct: 640 RRRLPSE 646 >gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 127 bits (318), Expect = 1e-26 Identities = 86/245 (35%), Positives = 104/245 (42%), Gaps = 5/245 (2%) Frame = -3 Query: 940 GPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMCLPGVA 761 GPAF GP G+MHPQGMM GFD AVN M L GVA Sbjct: 412 GPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVA 471 Query: 760 PHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXXGYGGEDN 581 PHVNPAFF GPH YGGED Sbjct: 472 PHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDG 531 Query: 580 ASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXXXXXXXXXXXXXXXXXXXR- 404 ASEYGYG+ +H+KG RSS SREKE SER+ S R Sbjct: 532 
ASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRY 590 Query: 403 ----NGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKRRRAP 236 + YR++ +E + +Y++D+DRGQ A EE+HRSRSRD Y + + + Sbjct: 591 REEKDSYREHRHRERDLDYDDDWDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEKDSY 650 Query: 235 SE*SH 221 E H Sbjct: 651 REHRH 655 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 114 bits (286), Expect = 6e-23 Identities = 82/246 (33%), Positives = 101/246 (41%), Gaps = 5/246 (2%) Frame = -3 Query: 952 QVLGGPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMC- 776 Q L P GP GL+HPQGMM GFD + +PM Sbjct: 415 QALSAPPLGGPPGGLLHPQGMMGSGFDPTYGAHLGRGSGYGGFSGPHFPGMLPSFSPMGT 474 Query: 775 --LPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXX 602 LPGVAPHVNPAFF G H Sbjct: 475 VGLPGVAPHVNPAFFGRGVSANGMGMMGSGAMDGHHGGMWGDSSMGGGVGWGNEEHGRRT 534 Query: 601 GYG--GEDNASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXXXXXXXXXXXX 428 G+D AS+YGYG+ H++G S REK+ GSERD S Sbjct: 535 RESSYGDDGASDYGYGDGGHERGGGRSNPGREKDRGSERDWSSGPERRHRDDRDSDWDRD 594 Query: 427 XXXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKR 248 +GY D+ Q+E + + E+D+DRG+ R QEED RSRS+D +YGKR Sbjct: 595 PRYKDEK-DGYSDHRQRERDWDNEDDWDRGRTSSRSRSKSRMMQEEDQRSRSKDVDYGKR 653 Query: 247 RRAPSE 230 RR PSE Sbjct: 654 RRVPSE 659 >gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 113 bits (282), Expect = 2e-22 Identities = 91/287 (31%), Positives = 104/287 (36%), Gaps = 50/287 (17%) Frame = -3 Query: 940 GPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMCLPGVA 761 GPAF GP G+MHPQGMM GFD AVN M L GVA Sbjct: 412 GPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVA 471 Query: 760 PHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXXGYGGEDN 581 PHVNPAFF GPH YGGED Sbjct: 472 PHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDG 531 Query: 580 
ASEYGYGEVSHDKGARSSAVSREKEWGSERD-----NSXXXXXXXXXXXXXXXXXXXXXX 416 ASEYGYG+ +H+KG RSS SREKE SER+ + Sbjct: 532 ASEYGYGDANHEKG-RSSGASREKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRY 590 Query: 415 XXXRNGYRDYCQKE----------HEPEYEEDYD-------------------------- 344 ++ YR++ +E H E E D+D Sbjct: 591 REEKDSYREHRHREREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRER 650 Query: 343 ---------RGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKRRRAPSE 230 RGQ A EE RSRSRD +YGKRRR PSE Sbjct: 651 DLDYDDDLDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 697 >emb|CAN66828.1| hypothetical protein VITISV_015886 [Vitis vinifera] Length = 168 Score = 105 bits (263), Expect = 3e-20 Identities = 58/125 (46%), Positives = 73/125 (58%), Gaps = 2/125 (1%) Frame = -3 Query: 598 YGGEDNASEYGYGEVSHDKGARSSAVSREKEWGSERD--NSXXXXXXXXXXXXXXXXXXX 425 YGG+D AS+YGYGEV+H+K RS+ SREKE GSERD + Sbjct: 44 YGGDDGASDYGYGEVNHEKVGRSNTASREKERGSERDWSGNSERRHRDEREQDWERSDKD 103 Query: 424 XXXXXXRNGYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKRR 245 ++GYRD+ Q+E + E+D+DRGQ RA +EDHRSRSRD +YGKRR Sbjct: 104 HRYREEKDGYRDHRQRERDFNNEDDWDRGQSSSRSRSRSRAVADEDHRSRSRDGDYGKRR 163 Query: 244 RAPSE 230 R PSE Sbjct: 164 RLPSE 168 >gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 100 bits (249), Expect = 1e-18 Identities = 75/232 (32%), Positives = 89/232 (38%) Frame = -3 Query: 940 GPAFVGPLAGLMHPQGMMDPGFDXXXXXXXXXXXXXXXXXXXXXXXXXXAVNPMCLPGVA 761 GPAF GP G+MHPQGMM GFD AVN M L GVA Sbjct: 412 GPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVA 471 Query: 760 PHVNPAFFXXXXXXXXXXXXXXXXXXGPHPXXXXXXXXXXXXXXXXXXXXXXXGYGGEDN 581 PHVNPAFF GPH YGGED Sbjct: 472 PHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDG 531 Query: 580 ASEYGYGEVSHDKGARSSAVSREKEWGSERDNSXXXXXXXXXXXXXXXXXXXXXXXXXRN 401 ASEYGYG+ +H+KG RSS SREKE SER+ + Sbjct: 532 ASEYGYGDANHEKG-RSSGASREKERVSERE---------------------------WS 563 Query: 400 GYRDYCQKEHEPEYEEDYDRGQXXXXXXXXXRAAQEEDHRSRSRDTNYGKRR 245 G D + H E E+D+DR + + +HR R +Y + R Sbjct: 564 
GNSD---RRHRDEKEQDWDRSE-----------REHREHRYREEKDSYREHR 601