BLASTX nr result
ID: Cocculus22_contig00011162
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00011162
         (1139 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...  406  e-110
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...  403  e-110
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...  396  e-107
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...  395  e-107
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...  395  e-107
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...  393  e-107
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...  384  e-104
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...  384  e-104
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...  384  e-104
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...  384  e-104
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...  384  e-104
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...  361  4e-97
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...  354  3e-95
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...  353  6e-95
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...  351  3e-94
ref|XP_002312652.1| RNA recognition motif-containing family prot...  343  8e-92
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...  325  2e-86
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A... 
293 7e-77 ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu... 288 3e-75 ref|XP_002315647.1| RNA recognition motif-containing family prot... 288 3e-75 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 406 bits (1043), Expect = e-110 Identities = 222/382 (58%), Positives = 258/382 (67%), Gaps = 6/382 (1%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+DY +EEYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYNDVNVG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 955 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 785 + E GVGNG LQ ++T+ P + + G SQ +PGV V GK + G Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGT----- 114 Query: 784 HNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGV 605 H + + SGN ++VSQ G + E DA RN GF+G + PP+TGV Sbjct: 115 HFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGV 174 Query: 604 DPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATM 431 DPS + + A E + + + G AG +GA R+MVNEN RP LE+G TM Sbjct: 175 DPSNMPGRVANEPAPVLNPGAAGPQGALIPANQMGVNINVNRAMVNENQIRPPLENGGTM 234 Query: 430 LFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGM 251 LFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEFFD AAAACKDGM Sbjct: 235 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGM 294 Query: 250 NGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXX 71 NGHVFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQTQGRRPM Sbjct: 295 NGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSG 354 Query: 70 XXXRNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG+ NRG Sbjct: 355 DGGRNFGRGGWGRGGQGVPNRG 376 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 403 bits (1035), Expect = e-110 Identities = 221/382 (57%), Positives = 257/382 
(67%), Gaps = 6/382 (1%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+DY +EEYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYNDVNVG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 955 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 785 + E GVGNG LQ ++T+ P + + G SQ +PGV V GK + G Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGT----- 114 Query: 784 HNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGV 605 H + + SGN ++VSQ G + E DA RN GF+G + P +TGV Sbjct: 115 HFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGV 174 Query: 604 DPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATM 431 DPS + + A E + + + G AG +GA R+MVNEN RP LE+G TM Sbjct: 175 DPSNMPGRVANEPAPVLNPGAAGPQGALIPANQMGVNINVNRAMVNENQIRPPLENGGTM 234 Query: 430 LFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGM 251 LFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEFFD AAAACKDGM Sbjct: 235 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGM 294 Query: 250 NGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXX 71 NGHVFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQTQGRRPM Sbjct: 295 NGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSG 354 Query: 70 XXXRNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG+ NRG Sbjct: 355 DGGRNFGRGGWGRGGQGVPNRG 376 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 396 bits (1017), Expect = e-107 Identities = 218/379 (57%), Positives = 252/379 (66%), Gaps = 6/379 (1%) Frame = -2 Query: 1123 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 947 MAEEQ+DY ++EYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYNDVNVG+G LQ + Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 946 SET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 776 E GVGNG LQ ++T+ P R + GGSQ IPGV V GK + G H Sbjct: 61 
PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAG-----SHFP 114 Query: 775 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 596 + + SGN ++VSQ G + E DA RN GF+G + P +TGVDPS Sbjct: 115 AQNDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPS 174 Query: 595 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATMLFV 422 + + A E + + + G AG +GA R MVNEN RP LE+G TMLFV Sbjct: 175 NMPGRVANEPAPVLNPGAAGPQGALIPANQMGVNANVNRVMVNENQIRPPLENGGTMLFV 234 Query: 421 GELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGH 242 GELHWWTTDAELE+VLSQYGR KEIKF+DERASGKSKGYC VEFFD AAAACKDGMNGH Sbjct: 235 GELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 294 Query: 241 VFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXXX 62 VFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQ QG RPM Sbjct: 295 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGDGG 354 Query: 61 RNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG+ NRG Sbjct: 355 RNFGRGGWGRGGQGVPNRG 373 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 395 bits (1015), Expect = e-107 Identities = 218/379 (57%), Positives = 255/379 (67%), Gaps = 6/379 (1%) Frame = -2 Query: 1123 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 947 MAEEQ+DY ++EYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYND+NVG+G LQ + Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 946 SET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 776 E GVGNG LQ ++T+ P R + GGSQ IPGV V GK + G S + N+ Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAG-SDFPAQND 118 Query: 775 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 596 + + SGN ++VSQ G + E DA RN GF+G + P +TGVDPS Sbjct: 119 
VQ----VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPS 174 Query: 595 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATMLFV 422 + + A E + + + G AG +GA R MVNEN RP LE+G TMLFV Sbjct: 175 NMPGRAANEPAPVLNPGAAGPQGALIPANQMGVNANVNRVMVNENQIRPPLENGGTMLFV 234 Query: 421 GELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGH 242 GELHWWTTDAELE+VLSQYGR KEIKF+DERASGKSKGYC VEFFD AAAACKDGMNGH Sbjct: 235 GELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 294 Query: 241 VFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXXX 62 VFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQ QG RPM Sbjct: 295 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGDGG 354 Query: 61 RNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG+ NRG Sbjct: 355 RNFGRGGWGRGGQGVPNRG 373 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 395 bits (1014), Expect = e-107 Identities = 217/382 (56%), Positives = 260/382 (68%), Gaps = 7/382 (1%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 955 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 785 + RSE GG+G+ LQAQ+ E P R E GGSQ + IPGV V GK + Y E Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRG-EAGGSQGLNIPGVSVQGKHLNV-TARYPE 118 Query: 784 HNNNKGGYVIGKGLEGRSSGNSGYPS--AVSQGGRLPEMAPDAQARNEGFRGQATLPPKT 611 + + G+ YPS ++SQ GR+ E D Q +N GF+G ++ K Sbjct: 119 QDGQPA-------VSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKV 171 Query: 610 GVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRP-LESGAT 434 G+DPS + K A + + G G +GA N M++EN RP +E+G T Sbjct: 172 
GIDPSGVPQKIANVPAQSLNSGTGGPQGAPHVPPNQMGLNVNHPMISENQVRPPIENGPT 231 Query: 433 MLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDG 254 MLFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEF+DP +AAACK+G Sbjct: 232 MLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEG 291 Query: 253 MNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXX 74 M+G++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 292 MDGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQS 350 Query: 73 XXXXRNYNKVGWGRGGQGMSNR 8 RNY + GWGRGGQG+ NR Sbjct: 351 GDAGRNYGRGGWGRGGQGVVNR 372 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 393 bits (1009), Expect = e-107 Identities = 221/380 (58%), Positives = 257/380 (67%), Gaps = 7/380 (1%) Frame = -2 Query: 1123 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 947 MAEEQLDY DEEYGG QKM FQG GAI ALA++ELMGEDDEYDDLYNDVNVGEGFLQ+HR Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 946 SET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 776 SE G + G QA +T+ P + E G SQ + IPGV + GK S + Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKL-EAGTSQGLIIPGVSIEGKYSNP------HFHE 113 Query: 775 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 596 K G + KG E S+ + PS VSQ GR+ EM D Q RN GF+G +P KTG +PS Sbjct: 114 KKEGPMAVKGPEMGSTSHLDGPS-VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPS 172 Query: 595 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLF 425 + K A E + + + G G R N R MVNEN RP +++GATMLF Sbjct: 173 DVHGKIANESTPVLNSGTGGPRAVPQMLSNQMGMNVNVNRPMVNENQIRPAVDNGATMLF 232 Query: 424 VGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNG 245 VGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEF+D +AAAACK+GMNG Sbjct: 233 VGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNG 292 Query: 244 HVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXX 
65 ++FNGRACVVAFASPQT+KQMGA+YMNK Q QSQ+QGRRPM Sbjct: 293 YIFNGRACVVAFASPQTLKQMGASYMNK--TQAQSQSQGRRPMNDGVGRGGGMNMQGGDA 350 Query: 64 XRNYNKVGWGRGGQGMSNRG 5 RNY + GWGRGGQG+ NRG Sbjct: 351 GRNYGRGGWGRGGQGILNRG 370 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 384 bits (985), Expect = e-104 Identities = 212/383 (55%), Positives = 255/383 (66%), Gaps = 8/383 (2%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 955 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 794 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 793 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 614 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 613 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 437 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 436 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 257 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 256 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXX 77 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQ 349 Query: 76 XXXXXRNYNKVGWGRGGQGMSNR 8 RNY + GWGRGGQG NR Sbjct: 350 SGDAGRNYGRGGWGRGGQGGVNR 372 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 
384 bits (985), Expect = e-104 Identities = 212/383 (55%), Positives = 255/383 (66%), Gaps = 8/383 (2%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 955 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 794 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 793 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 614 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 613 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 437 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 436 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 257 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 256 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXX 77 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQ 349 Query: 76 XXXXXRNYNKVGWGRGGQGMSNR 8 RNY + GWGRGGQG NR Sbjct: 350 SGDAGRNYGRGGWGRGGQGGVNR 372 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 384 bits (985), Expect = e-104 Identities = 212/383 (55%), Positives = 255/383 (66%), Gaps = 8/383 (2%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 955 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 794 + RSE 
GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 793 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 614 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 613 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 437 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 436 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 257 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 256 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXX 77 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQ 349 Query: 76 XXXXXRNYNKVGWGRGGQGMSNR 8 RNY + GWGRGGQG NR Sbjct: 350 SGDAGRNYGRGGWGRGGQGGVNR 372 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 384 bits (985), Expect = e-104 Identities = 212/383 (55%), Positives = 255/383 (66%), Gaps = 8/383 (2%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 955 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 794 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 
LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 793 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 614 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 613 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 437 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 436 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 257 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 256 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXX 77 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQ 349 Query: 76 XXXXXRNYNKVGWGRGGQGMSNR 8 RNY + GWGRGGQG NR Sbjct: 350 SGDAGRNYGRGGWGRGGQGGVNR 372 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 384 bits (985), Expect = e-104 Identities = 217/380 (57%), Positives = 257/380 (67%), Gaps = 7/380 (1%) Frame = -2 Query: 1123 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 947 MA+EQ+DY DEEYGG QK+Q+QGSGAIPALAEEE MGEDDEYDDLYNDVN+GE FLQ+HR Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNIGENFLQMHR 59 Query: 946 SETLGG---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 776 SE VGNG Q + + E GGSQ + IPGV V K S TG + + E N Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN---DLRVESGGSQGLNIPGVAVESKYS-TG-THFPEQN- 113 Query: 775 NKGGYVIGKGLEGRSSGNSGYP--SAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVD 602 ++G G+ GYP S+++Q R+ EM D+QARN GF+G + P GVD Sbjct: 114 ----------VKGPEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVD 163 Query: 601 PSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRP-LESGATMLF 425 PS ++ K + + + +P+ G V NRS NEN RP LE+G+TML+ Sbjct: 164 
PSDMNNKISNDPTPVPNAGVPRVIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLY 223 Query: 424 VGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNG 245 VGELHWWTTDAELENVLSQYG VKEIKF+DERASGKSKGYC VEF+D AAAACK+GMNG Sbjct: 224 VGELHWWTTDAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNG 283 Query: 244 HVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXX 65 H+FNGRACVVAFAS QT+KQMGA+YMNKNQ Q QSQ QGRRPM Sbjct: 284 HLFNGRACVVAFASQQTLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGGDA 343 Query: 64 XRNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG+ NRG Sbjct: 344 GRNFGRGGWGRGGQGILNRG 363 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 361 bits (926), Expect = 4e-97 Identities = 214/385 (55%), Positives = 248/385 (64%), Gaps = 12/385 (3%) Frame = -2 Query: 1123 MAEEQLDYGDEEYGG-QKMQFQGSG-AIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVH 950 MAE+ +D+ DEEYGG QK Q+QGSG AI ALA+EELMG+DDEYDDLYNDVNVGEGFLQ+ Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 949 RSET-----LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 785 RSE GVGNG LQAQ+ P R E GGSQ+ IPGV G+ S G G+ Sbjct: 61 RSEAPSLPAAAGVGNG-LQAQKRNFPEPRE-EIGGSQQPNIPGVSAEGRFSSAGSQFPGQ 118 Query: 784 HNNNKGGYVIGKGLEGRSSGNSGYPSAVS--QGGRLPEMAPDAQARNEGFRGQATLPPKT 611 + G + K E +G+ YP S Q GR+ GF+G + Sbjct: 119 QD----GLKVDKKSE---AGSMVYPDGASGSQKGRIVA----------GFQGSKPMLHSV 161 Query: 610 GVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRS--MVNENLSRP-LESG 440 GVD S I K E P+ G AG RG N S +VNEN RP +E+G Sbjct: 162 GVDSSDIPGKMVNEPIQAPNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENG 221 Query: 439 ATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACK 260 +TMLFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VE++D AA ACK Sbjct: 222 STMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACK 281 Query: 259 DGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXX 80 +GM+GHVFNGRACVVAFASPQT+KQMGAAYM+KNQVQ QSQ QGRRP+ Sbjct: 282 EGMHGHVFNGRACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNF 341 Query: 79 
XXXXXXRNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG NRG Sbjct: 342 QSGDGGRNFGRGGWGRGGQGAPNRG 366 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 354 bits (909), Expect = 3e-95 Identities = 207/385 (53%), Positives = 244/385 (63%), Gaps = 9/385 (2%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD M EEQ+DY +EEYGG QK+Q+Q SGAIPALA+EE M EDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 955 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 785 +HR E GVGNG LQAQ+ P R + G SQEV PG V GK S S E Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRV-QGGASQEVKNPGFSVEGKYS-----SVPE 114 Query: 784 HNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGV 605 + V+ P SQ GR+ EM DAQ RN GF+G AT+ Sbjct: 115 QKDQPPVSVV--------------PEMASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVA 160 Query: 604 DPSQISVKFAG---EQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRP-LESGA 437 D S ++ K A + GP V+ NR MVNEN RP +E+G+ Sbjct: 161 DSSDLTGKIANGPIPSMNSGSNGPPAVQ-QMPANQMNMKINVNRPMVNENQIRPPVENGS 219 Query: 436 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 257 LFVGELHWWTTDAELE VLSQ+GR+KEIKF+DERASGKSKGYC V+F+DP AA+ACK+ Sbjct: 220 ATLFVGELHWWTTDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKE 279 Query: 256 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXX 77 GM+G+VFNGRACVVAFAS QT+KQMG +Y+NK+Q QVQ+Q QGRRPM Sbjct: 280 GMDGYVFNGRACVVAFASSQTLKQMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQ 339 Query: 76 XXXXXRNYNK-VGWGRGGQGMSNRG 5 RN+ + WGRGGQG+ NRG Sbjct: 340 GGDTGRNFGRGNNWGRGGQGVLNRG 364 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 353 bits (907), Expect = 6e-95 Identities = 205/386 (53%), Positives = 245/386 (63%), Gaps = 
10/386 (2%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGGQ-KMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD A+EQLDYGDEEYGG KMQ+ GSG IPALAE+E+MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 955 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 785 + RSE GNG QAQ+ P SRA G S+E IPG+ GK + T + Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLG-SEEAKIPGIATEGKYAGTEV----Q 115 Query: 784 HNNNKGGYVIGKGLEGRS-SGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTG 608 KG V+ + E + + PSA++ M ++QA N G++G +P K G Sbjct: 116 FPQQKGEPVVERETERPADAAQKARPSAIT-------MTLNSQAGNSGYQGSMPMPQKIG 168 Query: 607 VDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLS----RP-LES 443 DP + K A E + L + G R N +M N +S RP LE+ Sbjct: 169 ADPMAMPEKNASEATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLEN 228 Query: 442 GATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAAC 263 G TMLFVGELHWWTTDAELE+VL+QYG VKEIKF+DERASGKSKGYC VEFFDP +AAAC Sbjct: 229 GNTMLFVGELHWWTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAAC 288 Query: 262 KDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXX 83 K+GMNG+ FNGRACVVAFA+PQT+KQMG++Y NK Q QVQSQ QGRRPM Sbjct: 289 KEGMNGYNFNGRACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPM-NEGVGRGGPN 347 Query: 82 XXXXXXXRNYNKVGWGRGGQGMSNRG 5 RN+ + WGRGG GM NRG Sbjct: 348 YTPGDAGRNFGRGSWGRGGPGMPNRG 373 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 351 bits (901), Expect = 3e-94 Identities = 207/381 (54%), Positives = 244/381 (64%), Gaps = 8/381 (2%) Frame = -2 Query: 1123 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 947 MAEEQ+DY DEEYGG QK+Q+QGSGAI ALA+EE M EDDEYDDLYNDVNV EGFLQ+HR Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 946 SETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 776 SE GGVGNG LQAQ+T+ +R + G SQE IPGV V GK Sbjct: 61 
SEAPLPPGGVGNGGLQAQKTDVTETRV-QAGVSQESKIPGVSVQGK-------------- 105 Query: 775 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 596 SS + +P Q P +A + + + G+ G T+PP G D S Sbjct: 106 -------------YSSAVAQFPEQQGQ----PPVAKEPELGSTGY-GSTTMPPNVGGDSS 147 Query: 595 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLF 425 I+ K A E + G AG G N R M NEN RP +E+G+TMLF Sbjct: 148 DITGKTALESVPSMNSGTAGPTGVTQMPTNQISIKVNANRPMFNENQIRPPVENGSTMLF 207 Query: 424 VGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNG 245 VGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEF DP AA ACK+GM+G Sbjct: 208 VGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDG 267 Query: 244 HVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM-XXXXXXXXXXXXXXXX 68 ++FNGRACVVAFASPQT+KQMGA+Y++K+Q Q QSQ GRRPM Sbjct: 268 YLFNGRACVVAFASPQTLKQMGASYLSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTGDT 327 Query: 67 XXRNYNKVGWGRGGQGMSNRG 5 RN+ + GWGRGGQG++NRG Sbjct: 328 GGRNFGRGGWGRGGQGVANRG 348 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 343 bits (880), Expect = 8e-92 Identities = 202/377 (53%), Positives = 238/377 (63%), Gaps = 9/377 (2%) Frame = -2 Query: 1108 LDYGDEEYGGQKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHRSETLGG 929 +DY +EE KMQ+QGSGAIPALAEEE MGEDDEYDDLYNDVNVGE FLQ+H SE Sbjct: 1 MDYEEEE----KMQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 928 ---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNNNKGGYV 758 VGNG Q T E GGSQ +AI G GP+ G ++N K + Sbjct: 56 PATVGNGGFQ---TRNAHESRIETGGSQALAITG---------GGPAVEGIYSNAKAHFP 103 Query: 757 IGKGLEGRSSGNSGYP---SAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPSQIS 587 K + P S+V+Q GR+ EM+ D Q RN GF+ +PP GVDPS +S Sbjct: 104 EQKQVAVAVEAQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMS 163 Query: 586 VKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLFVGE 416 K A E LP G AG RGA + R +VNEN RP +E+G+T L+VGE Sbjct: 164 
RKNAIEPEPLPITGSAGPRGAPQMQVNQMHMSADVNRPVVNENQVRPPIENGSTTLYVGE 223 Query: 415 LHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGHVF 236 LHWWTTDAELE+ SQ+GRVKEIKF+DERASGKSKGYC V+F++ AAAACK+GMNGHVF Sbjct: 224 LHWWTTDAELESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVF 283 Query: 235 NGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXXXRN 56 NGR CVVAFASPQT+KQMGA+YMNK Q Q Q+Q+QGR M RN Sbjct: 284 NGRPCVVAFASPQTLKQMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSGDGGRN 343 Query: 55 YNKVGWGRGGQGMSNRG 5 Y + WGRGGQG+ NRG Sbjct: 344 YGRGAWGRGGQGILNRG 360 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 325 bits (834), Expect = 2e-86 Identities = 197/388 (50%), Positives = 235/388 (60%), Gaps = 12/388 (3%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD + +EQLDYGDEEYGG QKMQ+ GAIPALAE+E++G+DDEYDDLYNDVNVGEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 955 VHRSETL--GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEH 782 + RSE VGN + + PG+RA E SQEV VG G + G + Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRA-EAIASQEVNNGRVGNEGSYAPNGVQLSDQK 119 Query: 781 NNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVD 602 NN + G P SQ RLPE+A +QA + G++G + KT D Sbjct: 120 NNLT------------AVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATD 167 Query: 601 PSQISVKFAGEQSSL--PDQGPA-GVRGAXXXXXXXXXXXXN---RSMVNENLSRPL--E 446 S GE +SL P+ G + GV A RSM +E L RP E Sbjct: 168 RMNNSENIVGEPASLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGE 227 Query: 445 SGATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAA 266 +G M++VGELHWWTTDAE+E+VL QYGRVKEIKF+DERASGKSKGYC VEF+DP AA A Sbjct: 228 NGNPMIYVGELHWWTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATA 287 Query: 265 CKDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXX 86 CKDGM GH+FNGRACVV +A+PQT KQMGA+Y NKNQ Q QSQ QGR PM Sbjct: 288 CKDGMQGHIFNGRACVVTYANPQTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGT 346 Query: 85 XXXXXXXXRNYNK-VGWGRGGQGMSNRG 5 RN+ + 
GWGRG Q NRG Sbjct: 347 NYPSGDAGRNFGRGGGWGRGNQA-PNRG 373 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 293 bits (751), Expect = 7e-77 Identities = 197/409 (48%), Positives = 236/409 (57%), Gaps = 32/409 (7%) Frame = -2 Query: 1132 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 956 MD MAEEQLDY DE+YG QKM FQ GAI ALA+EELMGEDDEYDDLYNDVNVG+GF+Q Sbjct: 1 MDPMAEEQLDYEDEDYGANQKMPFQTGGAISALADEELMGEDDEYDDLYNDVNVGDGFMQ 60 Query: 955 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVG---VGGKDSKTGPSS 794 + + +GNG +QA + E P S P V IPGVG G KD+K S Sbjct: 61 SLQHQEPVQYESMGNG-VQAPKEE-PISTPP-------VNIPGVGHEEKGEKDAKL--SG 109 Query: 793 YGEHNNNKGGYVIGKG-LEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPP 617 + + + K L G SSG R+ E + Q + GFR A PP Sbjct: 110 FSDLDQKKAFQEQASNQLAGASSGLKI---------RVSEPVSEPQPQASGFRN-APAPP 159 Query: 616 KTGVDPSQISVKFAGEQ------SSLPDQGPAGVRG------AXXXXXXXXXXXXNRSMV 473 G + A +Q +++P GP G A +++ Sbjct: 160 AKGSGFNTAGAMDANKQLAQTSSNAVPRVGPGPGPGIGAGPNANMNRMMGPGPNQAGAVI 219 Query: 472 N-------ENLSRPL----ESGATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERA 326 + EN +R ESG TMLFVGEL WWTTDAELE+VLSQYGRVK++KF+DERA Sbjct: 220 DTSARFGSENSNRLSHGGGESGNTMLFVGELQWWTTDAELESVLSQYGRVKDLKFFDERA 279 Query: 325 SGKSKGYCHVEFFDPTAAAACKDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQV 146 SGKSKGYC VEF+DP AAAACK+ MNGHVFNGRACVVAFAS T+KQ+ Y+NK Q Q Sbjct: 280 SGKSKGYCQVEFYDPAAAAACKESMNGHVFNGRACVVAFASQHTLKQLTTNYLNKTQAQA 339 Query: 145 QSQTQGRRPMXXXXXXXXXXXXXXXXXXRNY-NKVGWGRGGQGMSNRGQ 2 Q+Q+QGRRPM RNY NK+GWGRG QG+ NRGQ Sbjct: 340 QAQSQGRRPM--NDGGGRAGGPSYQGGDRNYGNKMGWGRGNQGVPNRGQ 386 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 288 bits (737), Expect = 3e-75 Identities = 181/376 (48%), Positives = 211/376 (56%), Gaps = 8/376 
(2%) Frame = -2 Query: 1108 LDYGDEEYGGQKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHRSETLGG 929 +D+ +EE KMQ+QGSGAIPALAEEEL GEDDEYDDLYNDVNVGE FLQ+H SE Sbjct: 1 MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 928 ---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVG--GKDSKTGPSSYGEHNNNKGG 764 GNG Q T E GGSQ +A G GV GK S G + + E Sbjct: 56 PATAGNGGFQ---TRNAHESRVETGGSQVLATSGAGVAVEGKYSNAG-AHFPEQKQ---- 107 Query: 763 YVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPSQISV 584 G G+E G+ GY Sbjct: 108 --AGIGVEANDVGSIGY------------------------------------------- 122 Query: 583 KFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLFVGEL 413 G+ SS+ +G AG RG + R +VNEN RP +E+G T L+VGEL Sbjct: 123 ---GDGSSVAQKGSAGPRGVPQMQVNQMNMNADVNRPVVNENQVRPPIENGPTTLYVGEL 179 Query: 412 HWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGHVFN 233 HWWTTDAELE+V SQYGRVKEIKF+DERASGKSKGYC V+F++ AAAACK+GMN HVFN Sbjct: 180 HWWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFN 239 Query: 232 GRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXXXRNY 53 GR CVVAFAS QT+KQMGA+YM+K Q Q Q Q+QGR M RNY Sbjct: 240 GRPCVVAFASAQTLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGDGGRNY 299 Query: 52 NKVGWGRGGQGMSNRG 5 + GWGRGGQG+ NRG Sbjct: 300 GRGGWGRGGQGVLNRG 315 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 288 bits (737), Expect = 3e-75 Identities = 181/376 (48%), Positives = 211/376 (56%), Gaps = 8/376 (2%) Frame = -2 Query: 1108 LDYGDEEYGGQKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHRSETLGG 929 +D+ +EE KMQ+QGSGAIPALAEEEL GEDDEYDDLYNDVNVGE FLQ+H SE Sbjct: 1 MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 928 ---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVG--GKDSKTGPSSYGEHNNNKGG 764 GNG Q T E GGSQ +A G GV GK S G + + E Sbjct: 56 PATAGNGGFQ---TRNAHESRVETGGSQVLATSGAGVAVEGKYSNAG-AHFPEQKQ---- 107 Query: 763 
YVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPSQISV 584 G G+E G+ GY Sbjct: 108 --AGIGVEANDVGSIGY------------------------------------------- 122 Query: 583 KFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLFVGEL 413 G+ SS+ +G AG RG + R +VNEN RP +E+G T L+VGEL Sbjct: 123 ---GDGSSVAQKGSAGPRGVPQMQVNQMNMNADVNRPVVNENQVRPPIENGPTTLYVGEL 179 Query: 412 HWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGHVFN 233 HWWTTDAELE+V SQYGRVKEIKF+DERASGKSKGYC V+F++ AAAACK+GMN HVFN Sbjct: 180 HWWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFN 239 Query: 232 GRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPMXXXXXXXXXXXXXXXXXXRNY 53 GR CVVAFAS QT+KQMGA+YM+K Q Q Q Q+QGR M RNY Sbjct: 240 GRPCVVAFASAQTLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGDGGRNY 299 Query: 52 NKVGWGRGGQGMSNRG 5 + GWGRGGQG+ NRG Sbjct: 300 GRGGWGRGGQGVLNRG 315