BLASTX nr result
ID: Cocculus23_contig00000339
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00000339 (1161 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...  384  e-104
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...  381  e-103
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...  379  e-102
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...  374  e-101
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...  373  e-101
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...  370  e-100
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...  369  1e-99
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...  369  1e-99
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...  369  1e-99
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...  369  1e-99
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...  362  2e-97
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...  340  9e-91
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...  340  9e-91
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...  339  2e-90
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...  333  8e-89
ref|XP_002312652.1| RNA recognition motif-containing family prot...  322  1e-85
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...  318  4e-84
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A...
274 5e-71 gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise... 266 9e-69 ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu... 265 3e-68 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 384 bits (986), Expect = e-104 Identities = 209/345 (60%), Positives = 242/345 (70%), Gaps = 6/345 (1%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+DY +EEYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYNDVNVG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 983 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 813 + E GVGNG LQ ++T+ P + + G SQ +PGV V GK + G Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGT----- 114 Query: 812 HNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGV 633 H + + SGN ++VSQ G + E DA RN GF+G + PP+TGV Sbjct: 115 HFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGV 174 Query: 632 DPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATM 459 DPS + + A E + + + G AG +GA R+MVNEN RP LE+G TM Sbjct: 175 DPSNMPGRVANEPAPVLNPGAAGPQGALIPANQMGVNINVNRAMVNENQIRPPLENGGTM 234 Query: 458 LFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGM 279 LFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEFFD AAAACKDGM Sbjct: 235 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGM 294 Query: 278 NGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 NGHVFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQTQGRRPM Sbjct: 295 NGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPM 339 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 381 bits (978), Expect = e-103 Identities = 208/345 (60%), Positives = 241/345 (69%), Gaps = 6/345 (1%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+DY 
+EEYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYNDVNVG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 983 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 813 + E GVGNG LQ ++T+ P + + G SQ +PGV V GK + G Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGT----- 114 Query: 812 HNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGV 633 H + + SGN ++VSQ G + E DA RN GF+G + P +TGV Sbjct: 115 HFPAQNDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGV 174 Query: 632 DPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATM 459 DPS + + A E + + + G AG +GA R+MVNEN RP LE+G TM Sbjct: 175 DPSNMPGRVANEPAPVLNPGAAGPQGALIPANQMGVNINVNRAMVNENQIRPPLENGGTM 234 Query: 458 LFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGM 279 LFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEFFD AAAACKDGM Sbjct: 235 LFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGM 294 Query: 278 NGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 NGHVFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQTQGRRPM Sbjct: 295 NGHVFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPM 339 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 379 bits (972), Expect = e-102 Identities = 204/345 (59%), Positives = 245/345 (71%), Gaps = 7/345 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 983 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 813 + RSE GG+G+ LQAQ+ E P R E GGSQ + IPGV V GK + Y E Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRG-EAGGSQGLNIPGVSVQGKHLNV-TARYPE 118 Query: 812 
HNNNKGGYVIGKGLEGRSSGNSGYPS--AVSQGGRLPEMAPDAQARNEGFRGQATLPPKT 639 + + G+ YPS ++SQ GR+ E D Q +N GF+G ++ K Sbjct: 119 QDGQPA-------VSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKV 171 Query: 638 GVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRP-LESGAT 462 G+DPS + K A + + G G +GA N M++EN RP +E+G T Sbjct: 172 GIDPSGVPQKIANVPAQSLNSGTGGPQGAPHVPPNQMGLNVNHPMISENQVRPPIENGPT 231 Query: 461 MLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDG 282 MLFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEF+DP +AAACK+G Sbjct: 232 MLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEG 291 Query: 281 MNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRP 147 M+G++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 292 MDGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP 336 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 374 bits (960), Expect = e-101 Identities = 205/342 (59%), Positives = 236/342 (69%), Gaps = 6/342 (1%) Frame = -2 Query: 1151 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 975 MAEEQ+DY ++EYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYNDVNVG+G LQ + Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 974 SET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 804 E GVGNG LQ ++T+ P R + GGSQ IPGV V GK + G H Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAG-----SHFP 114 Query: 803 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 624 + + SGN ++VSQ G + E DA RN GF+G + P +TGVDPS Sbjct: 115 AQNDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPS 174 Query: 623 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATMLFV 450 + + A E + + + G AG +GA R MVNEN RP LE+G TMLFV Sbjct: 175 NMPGRVANEPAPVLNPGAAGPQGALIPANQMGVNANVNRVMVNENQIRPPLENGGTMLFV 234 Query: 449 GELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGH 270 GELHWWTTDAELE+VLSQYGR KEIKF+DERASGKSKGYC VEFFD AAAACKDGMNGH Sbjct: 235 
GELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 294 Query: 269 VFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 VFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQ QG RPM Sbjct: 295 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPM 336 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 373 bits (958), Expect = e-101 Identities = 205/342 (59%), Positives = 239/342 (69%), Gaps = 6/342 (1%) Frame = -2 Query: 1151 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 975 MAEEQ+DY ++EYGG QKMQ+QG GAIPALA+EELMGEDDEYDDLYND+NVG+G LQ + Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 974 SET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 804 E GVGNG LQ ++T+ P R + GGSQ IPGV V GK + G S + N+ Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAG-SDFPAQND 118 Query: 803 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 624 + + SGN ++VSQ G + E DA RN GF+G + P +TGVDPS Sbjct: 119 VQ----VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPS 174 Query: 623 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN-RSMVNENLSRP-LESGATMLFV 450 + + A E + + + G AG +GA R MVNEN RP LE+G TMLFV Sbjct: 175 NMPGRAANEPAPVLNPGAAGPQGALIPANQMGVNANVNRVMVNENQIRPPLENGGTMLFV 234 Query: 449 GELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGH 270 GELHWWTTDAELE+VLSQYGR KEIKF+DERASGKSKGYC VEFFD AAAACKDGMNGH Sbjct: 235 GELHWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 294 Query: 269 VFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 VFNGR CVVAFASPQT+KQMGA+YMNKNQ Q QSQ QG RPM Sbjct: 295 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPM 336 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] 
gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 370 bits (949), Expect = e-100 Identities = 207/343 (60%), Positives = 241/343 (70%), Gaps = 7/343 (2%) Frame = -2 Query: 1151 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 975 MAEEQLDY DEEYGG QKM FQG GAI ALA++ELMGEDDEYDDLYNDVNVGEGFLQ+HR Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 974 SET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 804 SE G + G QA +T+ P + E G SQ + IPGV + GK S + Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKL-EAGTSQGLIIPGVSIEGKYSNP------HFHE 113 Query: 803 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 624 K G + KG E S+ + PS VSQ GR+ EM D Q RN GF+G +P KTG +PS Sbjct: 114 KKEGPMAVKGPEMGSTSHLDGPS-VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPS 172 Query: 623 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLF 453 + K A E + + + G G R N R MVNEN RP +++GATMLF Sbjct: 173 DVHGKIANESTPVLNSGTGGPRAVPQMLSNQMGMNVNVNRPMVNENQIRPAVDNGATMLF 232 Query: 452 VGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNG 273 VGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEF+D +AAAACK+GMNG Sbjct: 233 VGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNG 292 Query: 272 HVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 ++FNGRACVVAFASPQT+KQMGA+YMNK Q QSQ+QGRRPM Sbjct: 293 YIFNGRACVVAFASPQTLKQMGASYMNK--TQAQSQSQGRRPM 333 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 369 bits (947), Expect = 1e-99 Identities = 199/346 (57%), Positives = 241/346 (69%), Gaps = 8/346 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 983 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 
822 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 821 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 642 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 641 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 465 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 464 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 285 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 284 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRP 147 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP 336 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 369 bits (947), Expect = 1e-99 Identities = 199/346 (57%), Positives = 241/346 (69%), Gaps = 8/346 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 983 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 822 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 821 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 642 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 641 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 465 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 
Query: 464 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 285 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 284 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRP 147 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP 336 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 369 bits (947), Expect = 1e-99 Identities = 199/346 (57%), Positives = 241/346 (69%), Gaps = 8/346 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 983 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 822 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 821 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 642 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 641 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 465 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 464 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 285 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 284 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRP 147 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP 336 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 
[Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 369 bits (947), Expect = 1e-99 Identities = 199/346 (57%), Positives = 241/346 (69%), Gaps = 8/346 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQ+D+GDEEYGG QKMQ+QGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 983 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTG---PSS 822 + RSE GG+G+ L+AQR E P R E GGSQ + IPGV V GK P Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 821 YGEHNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPK 642 + N+ V G G S++SQ G + E D Q +N GF+G + K Sbjct: 120 EEQPAVNRPEMVSGSYPSG---------SSISQKGSVTEGTHDKQVKNLGFQGLTSASNK 170 Query: 641 TGVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNEN-LSRPLESGA 465 G+DPS + K A + + + G G +G N ++NEN + P+E+G Sbjct: 171 VGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGP 230 Query: 464 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 285 TMLFVGELHWWTTDAELE+VLSQYGR+KEIKF+DE+ASGKSKGYC VEF+DP++AA CK+ Sbjct: 231 TMLFVGELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKE 290 Query: 284 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRP 147 GMNG++FNGRACVVAFASPQT+KQMGA+YMNKNQ Q Q+Q QGRRP Sbjct: 291 GMNGYMFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP 336 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 362 bits (929), Expect = 2e-97 Identities = 204/343 (59%), Positives = 241/343 (70%), Gaps = 7/343 (2%) Frame = -2 Query: 1151 
MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 975 MA+EQ+DY DEEYGG QK+Q+QGSGAIPALAEEE MGEDDEYDDLYNDVN+GE FLQ+HR Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNIGENFLQMHR 59 Query: 974 SETLGG---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 804 SE VGNG Q + + E GGSQ + IPGV V K S TG + + E N Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN---DLRVESGGSQGLNIPGVAVESKYS-TG-THFPEQN- 113 Query: 803 NKGGYVIGKGLEGRSSGNSGYP--SAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVD 630 ++G G+ GYP S+++Q R+ EM D+QARN GF+G + P GVD Sbjct: 114 ----------VKGPEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVD 163 Query: 629 PSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRP-LESGATMLF 453 PS ++ K + + + +P+ G V NRS NEN RP LE+G+TML+ Sbjct: 164 PSDMNNKISNDPTPVPNAGVPRVIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLY 223 Query: 452 VGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNG 273 VGELHWWTTDAELENVLSQYG VKEIKF+DERASGKSKGYC VEF+D AAAACK+GMNG Sbjct: 224 VGELHWWTTDAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNG 283 Query: 272 HVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 H+FNGRACVVAFAS QT+KQMGA+YMNKNQ Q QSQ QGRRPM Sbjct: 284 HLFNGRACVVAFASQQTLKQMGASYMNKNQGQPQSQNQGRRPM 326 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 340 bits (871), Expect = 9e-91 Identities = 201/348 (57%), Positives = 233/348 (66%), Gaps = 12/348 (3%) Frame = -2 Query: 1151 MAEEQLDYGDEEYGG-QKMQFQGSG-AIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVH 978 MAE+ +D+ DEEYGG QK Q+QGSG AI ALA+EELMG+DDEYDDLYNDVNVGEGFLQ+ Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 977 RSET-----LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 813 RSE GVGNG LQAQ+ P R E GGSQ+ IPGV G+ S G G+ Sbjct: 61 RSEAPSLPAAAGVGNG-LQAQKRNFPEPRE-EIGGSQQPNIPGVSAEGRFSSAGSQFPGQ 118 Query: 812 HNNNKGGYVIGKGLEGRSSGNSGYPSAVS--QGGRLPEMAPDAQARNEGFRGQATLPPKT 639 + G + K E +G+ YP S Q GR+ GF+G + Sbjct: 119 QD----GLKVDKKSE---AGSMVYPDGASGSQKGRIVA----------GFQGSKPMLHSV 161 Query: 638 
GVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRS--MVNENLSRP-LESG 468 GVD S I K E P+ G AG RG N S +VNEN RP +E+G Sbjct: 162 GVDSSDIPGKMVNEPIQAPNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENG 221 Query: 467 ATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACK 288 +TMLFVGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VE++D AA ACK Sbjct: 222 STMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACK 281 Query: 287 DGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 +GM+GHVFNGRACVVAFASPQT+KQMGAAYM+KNQVQ QSQ QGRRP+ Sbjct: 282 EGMHGHVFNGRACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPI 329 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 340 bits (871), Expect = 9e-91 Identities = 195/347 (56%), Positives = 229/347 (65%), Gaps = 8/347 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD M EEQ+DY +EEYGG QK+Q+Q SGAIPALA+EE M EDDEYDDLYNDVNVGEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 983 VHRSETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 813 +HR E GVGNG LQAQ+ P R + G SQEV PG V GK S S E Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRV-QGGASQEVKNPGFSVEGKYS-----SVPE 114 Query: 812 HNNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGV 633 + V+ P SQ GR+ EM DAQ RN GF+G AT+ Sbjct: 115 QKDQPPVSVV--------------PEMASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVA 160 Query: 632 DPSQISVKFAG---EQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRP-LESGA 465 D S ++ K A + GP V+ NR MVNEN RP +E+G+ Sbjct: 161 DSSDLTGKIANGPIPSMNSGSNGPPAVQ-QMPANQMNMKINVNRPMVNENQIRPPVENGS 219 Query: 464 TMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKD 285 LFVGELHWWTTDAELE VLSQ+GR+KEIKF+DERASGKSKGYC V+F+DP AA+ACK+ Sbjct: 220 ATLFVGELHWWTTDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKE 279 Query: 284 GMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 GM+G+VFNGRACVVAFAS QT+KQMG +Y+NK+Q QVQ+Q QGRRPM Sbjct: 280 GMDGYVFNGRACVVAFASSQTLKQMGDSYVNKSQGQVQTQPQGRRPM 326 >ref|XP_006341786.1| 
PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 339 bits (869), Expect = 2e-90 Identities = 193/349 (55%), Positives = 231/349 (66%), Gaps = 10/349 (2%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGGQ-KMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD A+EQLDYGDEEYGG KMQ+ GSG IPALAE+E+MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 983 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGE 813 + RSE GNG QAQ+ P SRA G S+E IPG+ GK + T + Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLG-SEEAKIPGIATEGKYAGTEV----Q 115 Query: 812 HNNNKGGYVIGKGLEGRS-SGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTG 636 KG V+ + E + + PSA++ M ++QA N G++G +P K G Sbjct: 116 FPQQKGEPVVERETERPADAAQKARPSAIT-------MTLNSQAGNSGYQGSMPMPQKIG 168 Query: 635 VDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLS----RP-LES 471 DP + K A E + L + G R N +M N +S RP LE+ Sbjct: 169 ADPMAMPEKNASEATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLEN 228 Query: 470 GATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAAC 291 G TMLFVGELHWWTTDAELE+VL+QYG VKEIKF+DERASGKSKGYC VEFFDP +AAAC Sbjct: 229 GNTMLFVGELHWWTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAAC 288 Query: 290 KDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 K+GMNG+ FNGRACVVAFA+PQT+KQMG++Y NK Q QVQSQ QGRRPM Sbjct: 289 KEGMNGYNFNGRACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPM 337 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 333 bits (854), Expect = 8e-89 Identities = 194/343 (56%), Positives = 227/343 (66%), Gaps = 7/343 (2%) Frame = -2 Query: 1151 MAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHR 975 MAEEQ+DY DEEYGG QK+Q+QGSGAI ALA+EE M EDDEYDDLYNDVNV 
EGFLQ+HR Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 974 SETL---GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNN 804 SE GGVGNG LQAQ+T+ +R + G SQE IPGV V GK Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRV-QAGVSQESKIPGVSVQGK-------------- 105 Query: 803 NKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPS 624 SS + +P Q P +A + + + G+ G T+PP G D S Sbjct: 106 -------------YSSAVAQFPEQQGQ----PPVAKEPELGSTGY-GSTTMPPNVGGDSS 147 Query: 623 QISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLF 453 I+ K A E + G AG G N R M NEN RP +E+G+TMLF Sbjct: 148 DITGKTALESVPSMNSGTAGPTGVTQMPTNQISIKVNANRPMFNENQIRPPVENGSTMLF 207 Query: 452 VGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNG 273 VGELHWWTTDAELE+VLSQYGRVKEIKF+DERASGKSKGYC VEF DP AA ACK+GM+G Sbjct: 208 VGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDG 267 Query: 272 HVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 ++FNGRACVVAFASPQT+KQMGA+Y++K+Q Q QSQ GRRPM Sbjct: 268 YLFNGRACVVAFASPQTLKQMGASYLSKSQGQTQSQQPGRRPM 310 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 322 bits (826), Expect = 1e-85 Identities = 189/340 (55%), Positives = 223/340 (65%), Gaps = 9/340 (2%) Frame = -2 Query: 1136 LDYGDEEYGGQKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHRSETLGG 957 +DY +EE KMQ+QGSGAIPALAEEE MGEDDEYDDLYNDVNVGE FLQ+H SE Sbjct: 1 MDYEEEE----KMQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 956 ---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHNNNKGGYV 786 VGNG Q T E GGSQ +AI G GP+ G ++N K + Sbjct: 56 PATVGNGGFQ---TRNAHESRIETGGSQALAITG---------GGPAVEGIYSNAKAHFP 103 Query: 785 IGKGLEGRSSGNSGYP---SAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPSQIS 615 K + P S+V+Q GR+ EM+ D Q RN GF+ +PP GVDPS +S Sbjct: 104 EQKQVAVAVEAQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMS 163 Query: 614 
VKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLFVGE 444 K A E LP G AG RGA + R +VNEN RP +E+G+T L+VGE Sbjct: 164 RKNAIEPEPLPITGSAGPRGAPQMQVNQMHMSADVNRPVVNENQVRPPIENGSTTLYVGE 223 Query: 443 LHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGHVF 264 LHWWTTDAELE+ SQ+GRVKEIKF+DERASGKSKGYC V+F++ AAAACK+GMNGHVF Sbjct: 224 LHWWTTDAELESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVF 283 Query: 263 NGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 NGR CVVAFASPQT+KQMGA+YMNK Q Q Q+Q+QGR M Sbjct: 284 NGRPCVVAFASPQTLKQMGASYMNKTQGQPQTQSQGRGSM 323 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 318 bits (814), Expect = 4e-84 Identities = 186/350 (53%), Positives = 222/350 (63%), Gaps = 11/350 (3%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD + +EQLDYGDEEYGG QKMQ+ GAIPALAE+E++G+DDEYDDLYNDVNVGEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 983 VHRSETL--GGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEH 810 + RSE VGN + + PG+RA E SQEV VG G + G + Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRA-EAIASQEVNNGRVGNEGSYAPNGVQLSDQK 119 Query: 809 NNNKGGYVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVD 630 NN + G P SQ RLPE+A +QA + G++G + KT D Sbjct: 120 NNLT------------AVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATD 167 Query: 629 PSQISVKFAGEQSSL--PDQGPA-GVRGAXXXXXXXXXXXXN---RSMVNENLSRPL--E 474 S GE +SL P+ G + GV A RSM +E L RP E Sbjct: 168 RMNNSENIVGEPASLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGE 227 Query: 473 SGATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAA 294 +G M++VGELHWWTTDAE+E+VL QYGRVKEIKF+DERASGKSKGYC VEF+DP AA A Sbjct: 228 NGNPMIYVGELHWWTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATA 287 Query: 293 CKDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 CKDGM GH+FNGRACVV +A+PQT KQMGA+Y NKNQ Q QSQ QGR PM Sbjct: 288 CKDGMQGHIFNGRACVVTYANPQTSKQMGASY-NKNQGQSQSQLQGRNPM 336 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 
[Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 274 bits (701), Expect = 5e-71 Identities = 181/370 (48%), Positives = 218/370 (58%), Gaps = 31/370 (8%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQ 984 MD MAEEQLDY DE+YG QKM FQ GAI ALA+EELMGEDDEYDDLYNDVNVG+GF+Q Sbjct: 1 MDPMAEEQLDYEDEDYGANQKMPFQTGGAISALADEELMGEDDEYDDLYNDVNVGDGFMQ 60 Query: 983 VHRSET---LGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVG---VGGKDSKTGPSS 822 + + +GNG +QA + E P S P V IPGVG G KD+K S Sbjct: 61 SLQHQEPVQYESMGNG-VQAPKEE-PISTPP-------VNIPGVGHEEKGEKDAKL--SG 109 Query: 821 YGEHNNNKGGYVIGKG-LEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPP 645 + + + K L G SSG R+ E + Q + GFR A PP Sbjct: 110 FSDLDQKKAFQEQASNQLAGASSGLKI---------RVSEPVSEPQPQASGFRN-APAPP 159 Query: 644 KTGVDPSQISVKFAGEQ------SSLPDQGPAGVRG------AXXXXXXXXXXXXNRSMV 501 G + A +Q +++P GP G A +++ Sbjct: 160 AKGSGFNTAGAMDANKQLAQTSSNAVPRVGPGPGPGIGAGPNANMNRMMGPGPNQAGAVI 219 Query: 500 N-------ENLSRPL----ESGATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERA 354 + EN +R ESG TMLFVGEL WWTTDAELE+VLSQYGRVK++KF+DERA Sbjct: 220 DTSARFGSENSNRLSHGGGESGNTMLFVGELQWWTTDAELESVLSQYGRVKDLKFFDERA 279 Query: 353 SGKSKGYCHVEFFDPTAAAACKDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQV 174 SGKSKGYC VEF+DP AAAACK+ MNGHVFNGRACVVAFAS T+KQ+ Y+NK Q Q Sbjct: 280 SGKSKGYCQVEFYDPAAAAACKESMNGHVFNGRACVVAFASQHTLKQLTTNYLNKTQAQA 339 Query: 173 QSQTQGRRPM 144 Q+Q+QGRRPM Sbjct: 340 QAQSQGRRPM 349 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 266 bits (681), Expect = 9e-69 Identities = 162/352 (46%), Positives = 203/352 (57%), Gaps = 16/352 (4%) Frame = -2 Query: 1160 MDQMAEEQLDYGDEEYGG-QKMQFQGSGAIPALAEEELMGE-DDEYDDLYNDVNVGEGFL 987 M+ M EQ D+G+EEYGG QKMQ+ GAIPALA+EE++GE DDEYDDLYNDVNVGE F+ Sbjct: 1 MEPMNGEQFDFGEEEYGGGQKMQYNQGGAIPALADEEMIGEEDDEYDDLYNDVNVGESFM 60 Query: 986 QVHRSETLGGVGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVGGKDSKTGPSSYGEHN 807 QV R P S+ P 
V G G ++ PS + Sbjct: 61 QVQR-------------------PDSQIPPFKAENRVNPSGTG-----DESIPSEEANAS 96 Query: 806 NNKGGYVIGKGL----EGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKT 639 G G G E ++ N+ ++V+ + ++Q G++G KT Sbjct: 97 KYAGNRAFGPGALQFPEQKAGLNTTEETSVTVDRS--QTVRNSQTDQSGYQGSVAPNNKT 154 Query: 638 GVDPSQISVKFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXNRSMVNENLSRPL------ 477 D + K G+ SS+ G +GA N N RP+ Sbjct: 155 E-DQVKNMDKTVGDPSSINPNVGVGSKGAVPFNFM-------NMAANANAIRPVDDEYSN 206 Query: 476 ----ESGATMLFVGELHWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDP 309 E+G TML+VGELHWWTTDAE+E+VL QYG+VKEIKF+DERASGKSKGYC VEFFDP Sbjct: 207 LGSSENGNTMLYVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFFDP 266 Query: 308 TAAAACKDGMNGHVFNGRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGR 153 AA ACK+GMNG+VFNGRACVVAFA+PQT+KQMGA+YMN+NQ Q Q+Q GR Sbjct: 267 AAAHACKEGMNGYVFNGRACVVAFATPQTIKQMGASYMNRNQGQPQAQFPGR 318 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 265 bits (677), Expect = 3e-68 Identities = 167/339 (49%), Positives = 195/339 (57%), Gaps = 8/339 (2%) Frame = -2 Query: 1136 LDYGDEEYGGQKMQFQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFLQVHRSETLGG 957 +D+ +EE KMQ+QGSGAIPALAEEEL GEDDEYDDLYNDVNVGE FLQ+H SE Sbjct: 1 MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 956 ---VGNGTLQAQRTEGPGSRAPEHGGSQEVAIPGVGVG--GKDSKTGPSSYGEHNNNKGG 792 GNG Q T E GGSQ +A G GV GK S G + + E Sbjct: 56 PATAGNGGFQ---TRNAHESRVETGGSQVLATSGAGVAVEGKYSNAG-AHFPEQKQ---- 107 Query: 791 YVIGKGLEGRSSGNSGYPSAVSQGGRLPEMAPDAQARNEGFRGQATLPPKTGVDPSQISV 612 G G+E G+ GY Sbjct: 108 --AGIGVEANDVGSIGY------------------------------------------- 122 Query: 611 KFAGEQSSLPDQGPAGVRGAXXXXXXXXXXXXN--RSMVNENLSRP-LESGATMLFVGEL 441 G+ SS+ +G AG RG + R +VNEN RP +E+G T L+VGEL Sbjct: 123 ---GDGSSVAQKGSAGPRGVPQMQVNQMNMNADVNRPVVNENQVRPPIENGPTTLYVGEL 179 Query: 440 HWWTTDAELENVLSQYGRVKEIKFYDERASGKSKGYCHVEFFDPTAAAACKDGMNGHVFN 261 HWWTTDAELE+V 
SQYGRVKEIKF+DERASGKSKGYC V+F++ AAAACK+GMN HVFN Sbjct: 180 HWWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFN 239 Query: 260 GRACVVAFASPQTVKQMGAAYMNKNQVQVQSQTQGRRPM 144 GR CVVAFAS QT+KQMGA+YM+K Q Q Q Q+QGR M Sbjct: 240 GRPCVVAFASAQTLKQMGASYMSKTQGQPQPQSQGRGSM 278