BLASTX nr result
ID: Mentha22_contig00021246
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00021246
         (753 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   273   4e-71
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   239   6e-61
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   214   3e-53
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   213   7e-53
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   211   2e-52
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   211   2e-52
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   208   1e-51
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   206   7e-51
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   204   2e-50
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   204   3e-50
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   204   3e-50
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   202   1e-49
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   202   1e-49
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   202   1e-49
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   202   1e-49
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   191   2e-46
ref|XP_002312652.1| RNA recognition motif-containing family prot...   189   9e-46
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   185   2e-44
ref|XP_002315647.1| RNA recognition motif-containing family prot...
183 5e-44 ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A... 168 2e-39 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 273 bits (699), Expect = 4e-71 Identities = 139/224 (62%), Positives = 145/224 (64%), Gaps = 6/224 (2%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQ--PPYKPGSGPVRGRGGMMNKNMTX 175 RNP+ND A RGNG NYPSGDA P PG+GP+RGRGGM NKNM Sbjct: 333 RNPMNDGAGRGNGTNYPSGDAGRNFGRGGGWGRGNQAPNRGPGAGPIRGRGGMGNKNMIG 392 Query: 176 XXXXXXXXXXXXXXX----FHAPPVMMPHQGMMGPGFDLAFMGRGAGYGNFSGPGFQGML 343 F PP MM QGMMGPGFDLAFMGRG GYG FSGP FQGML Sbjct: 393 NAPGAGGGGAYGQGLNGPGFGGPPGMMHPQGMMGPGFDLAFMGRGGGYGGFSGPPFQGML 452 Query: 344 PPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEE 523 PPF GVNSMGLPGVAPHVNPAFF PHSGMWND NMG WGGEE Sbjct: 453 PPFQGVNSMGLPGVAPHVNPAFFGRGMNPNGMGMMGNPGMVGPHSGMWNDPNMGGWGGEE 512 Query: 524 HGRESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDW 655 HGRESSYGGEDNASEYGYGE SHDK RSSAA REKE+ SER++ Sbjct: 513 HGRESSYGGEDNASEYGYGEGSHDKSVRSSAAPREKERTSEREY 556 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 239 bits (611), Expect = 6e-61 Identities = 129/232 (55%), Positives = 137/232 (59%), Gaps = 11/232 (4%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXX--QPPYKPGSGPVRGRGGMMNKNM-- 169 R P+N+ RG G NY GDA P PG GPVRGRG M +KNM Sbjct: 334 RRPMNEGVGRG-GPNYTPGDAGRNFGRGSWGRGGPGMPNRGPGGGPVRGRGAMGSKNMMV 392 Query: 170 ---TXXXXXXXXXXXXXXXXFHAPPVMMPH-QGMMGPGFDLAFMGRGAGYGNFSGPGFQG 337 F PP + H QGMMGPGFD +FMGRGAGYG FSGP F G Sbjct: 393 NPGAGNGAGGAFGQGLAGPAFGGPPAGLMHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPG 452 Query: 338 MLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGG 517 M+PPF VN MGLPGVAPHVNPAFF PH GMW DT+ G WGG Sbjct: 453 
MMPPFQAVNPMGLPGVAPHVNPAFFGRGMAANGMGMMSAAGMDGPHPGMWTDTSGGGWGG 512 Query: 518 EEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 EEHG RESSYGGEDNASEYGYGE SHDKGARSSA SREKE+ SERDWS N Sbjct: 513 EEHGRRTRESSYGGEDNASEYGYGEVSHDKGARSSAVSREKERGSERDWSGN 564 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 214 bits (545), Expect = 3e-53 Identities = 119/237 (50%), Positives = 132/237 (55%), Gaps = 16/237 (6%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQ--PPYKPGSGPVRGRGGMMNKNM-- 169 R P+ND RG NY SGD Q P PG G +RGRG M KNM Sbjct: 336 RRPMNDGGGRGGNMNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGAKNMMG 395 Query: 170 --------TXXXXXXXXXXXXXXXXFHAPPVMMPH-QGMMGPGFDLAFMGRGAGYGNFSG 322 F P M H Q MMG GFD +MGRG GYG FSG Sbjct: 396 SSSGAGSGAGPAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSG 454 Query: 323 PGFQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNM 502 PGF GMLP FP VN+MGL GVAPHVNPAFF PH GMW D++M Sbjct: 455 PGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSM 514 Query: 503 GAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 G W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASREK++ SERDWS N Sbjct: 515 GGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSERDWSGN 571 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 213 bits (541), Expect = 7e-53 Identities = 118/237 (49%), Positives = 132/237 (55%), Gaps = 16/237 (6%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQ--PPYKPGSGPVRGRGGMMNKNM-- 169 R P+ND RG NY SGD Q P PG G +RGRG M +NM Sbjct: 336 RRPMNDGGGRGGNMNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGARNMIG 395 Query: 170 --------TXXXXXXXXXXXXXXXXFHAPPVMMPH-QGMMGPGFDLAFMGRGAGYGNFSG 322 F P M H Q MMG GFD +MGRG GYG FSG Sbjct: 396 SSSGAGSGVGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSG 454 Query: 323 
PGFQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNM 502 PGF GMLP FP VN+MGL GVAPHVNPAFF PH GMW D++M Sbjct: 455 PGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSM 514 Query: 503 GAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 G W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASREK++ SERDWS N Sbjct: 515 GGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSERDWSGN 571 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 211 bits (537), Expect = 2e-52 Identities = 117/235 (49%), Positives = 131/235 (55%), Gaps = 16/235 (6%) Frame = +2 Query: 8 PINDAASRGNGANYPSGDAXXXXXXXXXXXXXQ--PPYKPGSGPVRGRGGMMNKNM---- 169 P+ND RG NY SGD Q P PG G +RGRG M +NM Sbjct: 335 PMNDGGGRGGNTNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGARNMIGSS 394 Query: 170 ------TXXXXXXXXXXXXXXXXFHAPPVMMPH-QGMMGPGFDLAFMGRGAGYGNFSGPG 328 F P M H Q MMG GFD +MGRG GYG FSGPG Sbjct: 395 SGAGSGVGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSGPG 453 Query: 329 FQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGA 508 F GMLP FP VN+MGL GVAPHVNPAFF PH GMW D++MG Sbjct: 454 FPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSMGG 513 Query: 509 WGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASREK++ SERDWS N Sbjct: 514 WLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASREKDRGSERDWSGN 568 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 211 bits (537), Expect = 2e-52 Identities = 117/235 (49%), Positives = 130/235 (55%), Gaps = 16/235 (6%) Frame = +2 Query: 8 PINDAASRGNGANYPSGDAXXXXXXXXXXXXXQ--PPYKPGSGPVRGRGGMMNKNM---- 169 P+ND RG NY SGD Q P PG G 
+RGRG M +NM Sbjct: 335 PMNDGGGRGGNTNYQSGDGGRNFGRGGWGRGGQGVPNRGPGGGAMRGRGPMGARNMIGSS 394 Query: 170 ------TXXXXXXXXXXXXXXXXFHAPPVMMPH-QGMMGPGFDLAFMGRGAGYGNFSGPG 328 F P M H Q MMG GFD +MGRG GYG FSGPG Sbjct: 395 SGAGSGAGHAAGGGYGQGLAGPGFGGPAGGMMHPQNMMG-GFDPTYMGRGGGYGGFSGPG 453 Query: 329 FQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGA 508 F GMLP FP VN+MGL GVAPHVNPAFF PH GMW D++MG Sbjct: 454 FPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGSSGMDGPHPGMWTDSSMGG 513 Query: 509 WGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 W GEEHG RESSYGG+D AS+YGYGEASH+KGARS+ ASREK++ SERDWS N Sbjct: 514 WVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASREKDRGSERDWSGN 568 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 208 bits (530), Expect = 1e-51 Identities = 114/236 (48%), Positives = 129/236 (54%), Gaps = 15/236 (6%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKP----GSGPVRGRGGMMN-KN 166 R P+N+ RG G NY +GD G GP+RGRGG M KN Sbjct: 307 RRPMNEGVGRGGGVNYQTGDTGGRNFGRGGWGRGGQGVANRGPGGGGPMRGRGGAMGAKN 366 Query: 167 MTXXXXXXXXXXXXXXXXFHAPPV-------MMPHQGMMGPGFDLAFMGRGAGYGNFSGP 325 M A P MM QGMMG GFD +MGRG GYG F GP Sbjct: 367 MAGNPAGVGTGANGGYGQGLAGPGFGGPVGGMMNPQGMMGAGFDPTYMGRGGGYGGFPGP 426 Query: 326 GFQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMG 505 F GML FP VN+MGL GVAPHVNPAFF H+GMWND +MG Sbjct: 427 AFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMDGHHAGMWNDPSMG 486 Query: 506 AWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 WGG+EHG RESSYGG+D ASEYGYGEA+H+KG RS+A SRE+E+ SERDWS N Sbjct: 487 GWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSRERERGSERDWSGN 542 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 206 bits (524), Expect = 7e-51 Identities = 116/233 (49%), Positives = 126/233 
(54%), Gaps = 12/233 (5%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYK--PGSGPVRGRGGMMNKNMTX 175 R +ND RG ANY SGD Q PG GP+RGRGGM KNM Sbjct: 275 RGSMNDGMGRGGNANYQSGDGGRNYGRGGWGRGGQGVLNRGPGGGPMRGRGGMGPKNMAG 334 Query: 176 XXXXXXXXXXXXXXX-------FHAPPV-MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGF 331 F P MM HQGMMG GFD +MGRG GYG F G GF Sbjct: 335 NVAGVGSGANGGGYGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGGYGGFPGHGF 394 Query: 332 QGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAW 511 GMLP FP VNSMGL GVAPHVNPAFF P+ G W DT+MG W Sbjct: 395 PGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMASSGMEGPNPGKWPDTSMGGW 454 Query: 512 GGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 G E RESSY G++ ASEYGYGE +H+KGARSS ASREKE+ SERDWS N Sbjct: 455 GEEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWSGN 507 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 204 bits (520), Expect = 2e-50 Identities = 113/231 (48%), Positives = 126/231 (54%), Gaps = 14/231 (6%) Frame = +2 Query: 14 NDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKPGS--GPVRGRGGMMNKNMTXXXXX 187 ND RG NY SGDA Q GP+RGRGG+ KNM Sbjct: 337 NDGLGRGGNMNYQSGDAGRNYGRGGWGRGGQGVVNRSGVGGPMRGRGGVGVKNMVGSSAG 396 Query: 188 XXXXXXXXXXXFHAPPV---------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGFQGM 340 P MM QGMMG GFD +MGRG YG F GPGF GM Sbjct: 397 VGNGANGGAAYGQGPAGPPFGGPAGGMMHPQGMMGAGFDPTYMGRGGSYGGFPGPGFPGM 456 Query: 341 LPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGE 520 LP FP VN++GL GVAPHVNPAFF PH GMW DT+MG WGG+ Sbjct: 457 LPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGGPGMDGPHVGMWTDTSMGGWGGD 516 Query: 521 EHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 EHG RESSYGGED ASEYGYG+A+H+KG RSS ASREKE+ S+R+WS N Sbjct: 517 
EHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSDREWSGN 566 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 204 bits (519), Expect = 3e-50 Identities = 111/234 (47%), Positives = 126/234 (53%), Gaps = 13/234 (5%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKPG---SGPVRGRGGMMN-KNM 169 R P+ND RG G N GDA Q G GP+RGRGG + KNM Sbjct: 330 RRPMNDGVGRGGGMNMQGGDAGRNYGRGGWGRGGQGILNRGPGGGGPMRGRGGAVGAKNM 389 Query: 170 TXXXXXXXXXXXXXXXXFHAPPV------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGF 331 P +M QGMMG GFD +MGRG YG FSG F Sbjct: 390 VGNTAGVGASGGGYGQGLAGPTFGGPAGGLMHPQGMMGSGFDPTYMGRGGAYGGFSGSAF 449 Query: 332 QGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAW 511 GM+P FP VN+MGL GVAPHVNPAFF H+GMW DT+MG W Sbjct: 450 PGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGATGMDGHHAGMWTDTSMGGW 509 Query: 512 GGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 GGEEHG RESSYGG+D AS+YGYGE +H+K RS+ ASREKE+ SERDWS N Sbjct: 510 GGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASREKERGSERDWSGN 563 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 204 bits (518), Expect = 3e-50 Identities = 115/235 (48%), Positives = 130/235 (55%), Gaps = 14/235 (5%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKPG---SGPVRGRGGMMN-KNM 169 R P+ND A RG NY GDA Q G G + GRGG M KN+ Sbjct: 323 RRPMNDGAGRGGNMNYQGGDAGRNFGRGGWGRGGQGILNRGPGGGGRMGGRGGSMGAKNI 382 Query: 170 TXXXXXXXXXXXXXXXX-------FHAPP-VMMPHQGMMGPGFDLAFMGRGAGYGNFSGP 325 F P M+P Q MM GFD +MGRGAGYG F+GP Sbjct: 383 VGGAGGVGSGANGGGYGQGLAGPAFGGPAGAMLPPQSMMRAGFDPTYMGRGAGYGGFAGP 442 Query: 326 GFQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMG 505 GF GMLP FP VN+MGL GVAPHVNPAFF P++GMW+DT+MG Sbjct: 443 GFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGPSGMDGPNAGMWSDTSMG 
502 Query: 506 AWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 WG E RESSYGG+D ASEYGYGE +H+KGARSSAASREKE+ SERDWS N Sbjct: 503 GWGEEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAASREKERASERDWSGN 557 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 202 bits (514), Expect = 1e-49 Identities = 113/230 (49%), Positives = 126/230 (54%), Gaps = 13/230 (5%) Frame = +2 Query: 14 NDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKP--GSGPVRGRGGMMNKNMTXXXXX 187 N+ RG NY SGDA Q G G +RGRGG+ KNM Sbjct: 337 NEGLGRGGNLNYQSGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAG 396 Query: 188 XXXXXXXXXXXFHAPPV--------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGFQGML 343 P MM QGMMG GFD +M RG GYG F GPGF GML Sbjct: 397 VGNGANGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGML 456 Query: 344 PPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEE 523 P FP VN+MGL GVAPHVNPAFF PH+GMW D +MG WGG+E Sbjct: 457 PSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDE 516 Query: 524 HG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 HG RESSYGGED ASEYGYG+A+H+KG RSS ASREKE+ SER+WS N Sbjct: 517 HGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGN 565 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 202 bits (514), Expect = 1e-49 Identities = 113/230 (49%), Positives = 126/230 (54%), Gaps = 13/230 (5%) Frame = +2 Query: 14 NDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKP--GSGPVRGRGGMMNKNMTXXXXX 187 N+ RG NY SGDA Q G G +RGRGG+ KNM Sbjct: 337 NEGLGRGGNLNYQSGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAG 396 Query: 188 XXXXXXXXXXXFHAPPV--------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGFQGML 343 P MM QGMMG GFD +M RG GYG F GPGF GML Sbjct: 397 VGNGANGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGML 456 Query: 344 PPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEE 
523 P FP VN+MGL GVAPHVNPAFF PH+GMW D +MG WGG+E Sbjct: 457 PSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDE 516 Query: 524 HG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 HG RESSYGGED ASEYGYG+A+H+KG RSS ASREKE+ SER+WS N Sbjct: 517 HGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGN 565 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 202 bits (514), Expect = 1e-49 Identities = 113/230 (49%), Positives = 126/230 (54%), Gaps = 13/230 (5%) Frame = +2 Query: 14 NDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKP--GSGPVRGRGGMMNKNMTXXXXX 187 N+ RG NY SGDA Q G G +RGRGG+ KNM Sbjct: 337 NEGLGRGGNLNYQSGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAG 396 Query: 188 XXXXXXXXXXXFHAPPV--------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGFQGML 343 P MM QGMMG GFD +M RG GYG F GPGF GML Sbjct: 397 VGNGANGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGML 456 Query: 344 PPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEE 523 P FP VN+MGL GVAPHVNPAFF PH+GMW D +MG WGG+E Sbjct: 457 PSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDE 516 Query: 524 HG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 HG RESSYGGED ASEYGYG+A+H+KG RSS ASREKE+ SER+WS N Sbjct: 517 HGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGN 565 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 202 bits (514), Expect = 1e-49 Identities = 113/230 (49%), Positives = 126/230 (54%), Gaps = 13/230 (5%) Frame = +2 Query: 14 
NDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKP--GSGPVRGRGGMMNKNMTXXXXX 187 N+ RG NY SGDA Q G G +RGRGG+ KNM Sbjct: 337 NEGLGRGGNLNYQSGDAGRNYGRGGWGRGGQGGVNRAGGGGLMRGRGGVGVKNMVGISAG 396 Query: 188 XXXXXXXXXXXFHAPPV--------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGFQGML 343 P MM QGMMG GFD +M RG GYG F GPGF GML Sbjct: 397 VGNGANGAGAYGQGPGPAFGGPAGGMMHPQGMMGAGFDPTYMVRGGGYGGFPGPGFPGML 456 Query: 344 PPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEE 523 P FP VN+MGL GVAPHVNPAFF PH+GMW D +MG WGG+E Sbjct: 457 PSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGASGMDGPHAGMWTDASMGGWGGDE 516 Query: 524 HG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 HG RESSYGGED ASEYGYG+A+H+KG RSS ASREKE+ SER+WS N Sbjct: 517 HGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASREKERVSEREWSGN 565 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 191 bits (486), Expect = 2e-46 Identities = 110/234 (47%), Positives = 122/234 (52%), Gaps = 13/234 (5%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQ--PPYKPGSG-PVRGRGGMMN-KNM 169 R PIND RG N+ SGD Q P PGSG P+RGRGG M KNM Sbjct: 326 RRPINDGVGRGGNPNFQSGDGGRNFGRGGWGRGGQGAPNRGPGSGGPMRGRGGAMGAKNM 385 Query: 170 TXXXXXXXXXXXXXXXXFHAPPV------MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGF 331 PP MM QGMMG GFD +MGRG GYG F+GP F Sbjct: 386 VGNNAGVGGGGYGQGLA--GPPFGGPAGGMMNPQGMMGTGFDPTYMGRGVGYGGFAGPAF 443 Query: 332 QGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAW 511 GMLP FP VN+MG VAPHVNPAFF GMWND ++G W Sbjct: 444 PGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGSSLMDGHQGGMWNDPSIGGW 503 Query: 512 GGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 GGEEHG RESSYGG+D ASEYGYG+ +H+KG R E+ SERDWS N Sbjct: 504 GGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR--------ERGSERDWSGN 549 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 189 bits (480), Expect = 9e-46 Identities = 109/231 (47%), Positives = 
120/231 (51%), Gaps = 10/231 (4%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYK--PGSGPVRGRGGMMNKNMTX 175 R +ND A RG AN+ SGD Q PG GP+RGRG M KNM Sbjct: 320 RGSMNDGAGRGGNANFQSGDGGRNYGRGAWGRGGQGILNRGPGGGPMRGRGAMGPKNMAG 379 Query: 176 XXXXXXXXXXXXXXX-------FHAPPV-MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGF 331 F P MMP QGMMG GFD +MGRG GYG F+GPGF Sbjct: 380 NVAGVGSGANGGGYGQGLAGPAFGGPAGGMMPPQGMMGAGFDPLYMGRGGGYGGFAGPGF 439 Query: 332 QGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAW 511 GMLP FP VNSMGL GVAPHVNPAFF P+ GMW Sbjct: 440 PGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVSSGMDGPNPGMW-------- 491 Query: 512 GGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 ESSY G++ ASEYGYGE +H+KGARSS ASREKE+ SERDWS N Sbjct: 492 -------ESSYDGDEGASEYGYGEGNHEKGARSSGASREKERGSERDWSGN 535 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 185 bits (469), Expect = 2e-44 Identities = 107/235 (45%), Positives = 121/235 (51%), Gaps = 16/235 (6%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPY----KPGSGPVRGRGGMMNKNM 169 R P+ND A RG N+ GD G GP RGRG M +NM Sbjct: 323 RRPMNDGAGRGGNMNFQGGDTGRNFGRGNNWGRGGQGVLNRGPGGGGPGRGRGAMGARNM 382 Query: 170 TXXXXXXXXXXXXXXXX-------FHAPPV-MMPHQGMMGPGFDLAFMGRGAGYGNFSGP 325 F P MM GMMGPGFD +MGRG GYG F GP Sbjct: 383 VGNNAGVGTGANGGGYGQGLGGPGFGGPVGGMMNAPGMMGPGFDPTYMGRGGGYGGFPGP 442 Query: 326 GFQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMG 505 GF GMLP FPGVN+MGL GVAPHVNPAFF H+ MWND +M Sbjct: 443 GFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMGSSGMEGHHAPMWNDPSMA 502 Query: 506 AWGGEEHG---RESSYGGEDNASEYG-YGEASHDKGARSSAASREKEKNSERDWS 658 W GEE RESSYGG+D SEYG YGEA+H+K RSSAA RE+E+ SER+W+ Sbjct: 503 GWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAAPRERERESEREWT 557 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 183 bits (465), 
Expect = 5e-44 Identities = 110/231 (47%), Positives = 120/231 (51%), Gaps = 10/231 (4%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYK--PGSGPVRGRGGMMNKNMTX 175 R +ND RG ANY SGD Q PG GP+RGRGGM KNM Sbjct: 275 RGSMNDGMGRGGNANYQSGDGGRNYGRGGWGRGGQGVLNRGPGGGPMRGRGGMGPKNMAG 334 Query: 176 XXXXXXXXXXXXXXX-------FHAPPV-MMPHQGMMGPGFDLAFMGRGAGYGNFSGPGF 331 F P MM HQGMMG GFD +MGRG GYG F G GF Sbjct: 335 NVAGVGSGANGGGYGQGIAGPAFGGPAGGMMHHQGMMGAGFDPLYMGRGGGYGGFPGHGF 394 Query: 332 QGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMGAW 511 GMLP FP VNSMGL GVAPHVNPAFF GM + M Sbjct: 395 PGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNG-------------MGMMASSGM--- 438 Query: 512 GGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSN 664 G G+ESSY G++ ASEYGYGE +H+KGARSS ASREKE+ SERDWS N Sbjct: 439 EGPNPGKESSYDGDEGASEYGYGEGNHEKGARSSGASREKERVSERDWSGN 489 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 168 bits (425), Expect = 2e-39 Identities = 101/240 (42%), Positives = 124/240 (51%), Gaps = 18/240 (7%) Frame = +2 Query: 2 RNPINDAASRGNGANYPSGDAXXXXXXXXXXXXXQPPYKPGSGP--VRGR-GGMMNKNMT 172 R P+ND R G +Y GD P + G GP +RGR GG+ K M Sbjct: 346 RRPMNDGGGRAGGPSYQGGDRNYGNKMGWGRGNQGVPNR-GQGPAGLRGRPGGLTGKAMV 404 Query: 173 XXXXXXXXXXXXXXXXFHAPPV------MMPHQGMMGPGFDLAF---MGRGAGYGNFSGP 325 APP+ ++ QGMMG GFD + +GRG+GYG FSGP Sbjct: 405 GGPSGANPYGQA----LSAPPLGGPPGGLLHPQGMMGSGFDPTYGAHLGRGSGYGGFSGP 460 Query: 326 GFQGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXXXXXPHSGMWNDTNMG 505 F GMLP F + ++GLPGVAPHVNPAFF H GMW D++MG Sbjct: 461 HFPGMLPSFSPMGTVGLPGVAPHVNPAFFGRGVSANGMGMMGSGAMDGHHGGMWGDSSMG 520 Query: 506 ---AWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREKEKNSERDWSSNP 667 WG EEHG RESSY G+D AS+YGYG+ H++G S REK++ SERDWSS P Sbjct: 521 GGVGWGNEEHGRRTRESSY-GDDGASDYGYGDGGHERGGGRSNPGREKDRGSERDWSSGP 579
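The per-hit statistics in the report above follow a fixed wording (`Score = ... bits (...), Expect = ... Identities = a/b (p%)`), so they can be pulled out with a regular expression. The following is a minimal sketch, not part of the BLAST output: it assumes the wording stays exactly as seen in this BLASTX 2.2.25 report, and the names `STATS` and `parse_stats` are hypothetical. Whitespace between fields is matched loosely because line breaks in this copy of the report fall in arbitrary places.

```python
import re

# Hypothetical pattern for one hit's statistics, based only on the wording
# seen in this report; other BLAST versions may format these lines differently.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>\d+(?:\.\d+)?)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\((?P<pct>\d+)%\)"
)


def parse_stats(text):
    """Return one dict per statistics block found in a BLAST text report."""
    hits = []
    for m in STATS.finditer(text):
        ident = int(m.group("ident"))
        length = int(m.group("length"))
        hits.append({
            "bits": float(m.group("bits")),
            "raw_score": int(m.group("raw")),
            "evalue": float(m.group("evalue")),
            "identities": ident,
            "align_len": length,
            # Recomputed from the counts; the report rounds this to an integer.
            "pct_identity": 100.0 * ident / length,
        })
    return hits


# Statistics of the top hit (gb|EYU30596.1|), copied from the report.
sample = (
    "Score = 273 bits (699), Expect = 4e-71 "
    "Identities = 139/224 (62%), Positives = 145/224 (64%), Gaps = 6/224 (2%)"
)
hits = parse_stats(sample)
print(hits[0]["bits"], hits[0]["evalue"], round(hits[0]["pct_identity"], 1))
```

Passing the full report text to `parse_stats` instead of `sample` should yield one entry per alignment, in the same order as the summary table.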