BLASTX nr result
ID: Akebia24_contig00006364
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00006364
         (1094 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   414   e-113
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   411   e-112
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   408   e-111
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   403   e-110
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   403   e-110
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   400   e-109
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   400   e-109
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   400   e-109
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   400   e-109
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   400   e-109
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   400   e-109
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   397   e-108
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   384   e-104
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   383   e-104
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   372   e-100
ref|XP_002312652.1| RNA recognition motif-containing family prot...   363   9e-98
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   343   6e-92
ref|XP_002315647.1| RNA recognition motif-containing family prot...
343 6e-92 gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus... 332 1e-88 ref|XP_002889992.1| RNA recognition motif-containing protein [Ar... 301 3e-79 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 414 bits (1065), Expect = e-113 Identities = 224/361 (62%), Positives = 260/361 (72%), Gaps = 27/361 (7%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+DY +EEYG +QK+QYQG GAIPALA+EELMGEDDEYDDLYNDVNVG+G +Q QQ E P Sbjct: 8 QIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQPEAP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQI------ 336 + GV N +Q + D ++ + GVS+ +PGV +E K +N G FP Q Sbjct: 68 PPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAV 126 Query: 337 ---TKGIGDYPD--EVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMS------ 483 G G+YPD VSQKGSV +A V N F+G + PP++GVDP+ M Sbjct: 127 NRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMPGRVANE 186 Query: 484 ----LGPGA--PRGVTQMPINQ--VNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTT 639 L PGA P+G +P NQ VN+N NR M+NEN IRP +ENG +MLFVGELHWWTT Sbjct: 187 PAPVLNPGAAGPQGAL-IPANQMGVNINVNRAMVNENQIRPPLENGGTMLFVGELHWWTT 245 Query: 640 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACV 819 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH FNGR CV Sbjct: 246 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCV 305 Query: 820 VTFASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVG 999 V FASPQTLKQMGASY+NK Q Q QSQ GRRPMNDG GRGG MNYQ G D GRNFG+ G Sbjct: 306 VAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSG-DGGRNFGRGG 364 Query: 1000 W 1002 W Sbjct: 365 W 365 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 411 bits (1057), Expect = e-112 Identities = 223/361 (61%), Positives = 259/361 (71%), Gaps = 27/361 (7%) Frame = 
+1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+DY +EEYG +QK+QYQG GAIPALA+EELMGEDDEYDDLYNDVNVG+G +Q QQ E P Sbjct: 8 QIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQPEAP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQI------ 336 + GV N +Q + D ++ + GVS+ +PGV +E K +N G FP Q Sbjct: 68 PPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAV 126 Query: 337 ---TKGIGDYPD--EVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMS------ 483 G G+YPD VSQKGSV +A V N F+G + P ++GVDP+ M Sbjct: 127 NRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANE 186 Query: 484 ----LGPGA--PRGVTQMPINQ--VNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTT 639 L PGA P+G +P NQ VN+N NR M+NEN IRP +ENG +MLFVGELHWWTT Sbjct: 187 PAPVLNPGAAGPQGAL-IPANQMGVNINVNRAMVNENQIRPPLENGGTMLFVGELHWWTT 245 Query: 640 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACV 819 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH FNGR CV Sbjct: 246 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCV 305 Query: 820 VTFASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVG 999 V FASPQTLKQMGASY+NK Q Q QSQ GRRPMNDG GRGG MNYQ G D GRNFG+ G Sbjct: 306 VAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSG-DGGRNFGRGG 364 Query: 1000 W 1002 W Sbjct: 365 W 365 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 408 bits (1049), Expect = e-111 Identities = 214/358 (59%), Positives = 262/358 (73%), Gaps = 24/358 (6%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+D+GDEEYG +QK+QYQGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGF+Q+Q+SE P Sbjct: 8 QIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAP 67 Query: 175 
VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITK---- 342 GG+ + G+QAQ N+ R + G S+ + IPGV ++ K NV A +P+Q + Sbjct: 68 PQPGGMGSTGLQAQKNEAPEPR-GEAGGSQGLNIPGVSVQGKHLNVTARYPEQDGQPAVS 126 Query: 343 ----GIGDYPD--EVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPN---------- 474 G G YP +SQKG V + QV N F+G S K G+DP+ Sbjct: 127 RPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQKIANVP 186 Query: 475 --QMSLGPGAPRGVTQMPINQVNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAE 648 ++ G G P+G +P NQ+ LN N PM++EN +RP +ENG +MLFVGELHWWTTDAE Sbjct: 187 AQSLNSGTGGPQGAPHVPPNQMGLNVNHPMISENQVRPPIENGPTMLFVGELHWWTTDAE 246 Query: 649 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 828 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEF++ +A+ACKEGM+G+ FNGRACVV F Sbjct: 247 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYMFNGRACVVAF 306 Query: 829 ASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 ASPQTLKQMGASY+NK Q Q+Q+Q GRRP NDG+GRGG MNYQ G D GRN+G+ GW Sbjct: 307 ASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQSG-DAGRNYGRGGW 362 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 403 bits (1036), Expect = e-110 Identities = 219/350 (62%), Positives = 256/350 (73%), Gaps = 16/350 (4%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+DY DEEYG +QKLQYQGSGAI ALA+EE M EDDEYDDLYNDVNV EGF+QM +SE P Sbjct: 5 QIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHRSEAP 64 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKGIGD 354 + GGV N G+QAQ D +R+ + GVS+E IPGV ++ K S+ A FP+Q G Sbjct: 65 LPPGGVGNGGLQAQKTDVTETRV-QAGVSQESKIPGVSVQGKYSSAVAQFPEQQ----GQ 119 Query: 355 YPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQ------------MSLGPGA 498 P + E ++G+T + G + MPP G D + M+ G Sbjct: 120 PP-----------VAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTAG 167 Query: 499 PRGVTQMPINQVNL--NANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAELESVLSQY 672 P GVTQMP NQ+++ NANRPM NEN 
IRP VENG++MLFVGELHWWTTDAELESVLSQY Sbjct: 168 PTGVTQMPTNQISIKVNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQY 227 Query: 673 GRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQ 852 GRVKEIKFFDERASGKSKGYCQVEF + AA+ACKEGM+G+ FNGRACVV FASPQTLKQ Sbjct: 228 GRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAFASPQTLKQ 287 Query: 853 MGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 MGASYL+K+Q Q QSQ GRRPMN+GVGRGGG+NYQ G GRNFG+ GW Sbjct: 288 MGASYLSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTGDTGGRNFGRGGW 337 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 403 bits (1035), Expect = e-110 Identities = 218/359 (60%), Positives = 258/359 (71%), Gaps = 25/359 (6%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSEPV 177 QLDY DEEYG +QK+ +QG GAI ALA++ELMGEDDEYDDLYNDVNVGEGF+QM +SE Sbjct: 5 QLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHRSEAP 64 Query: 178 SAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSN----------VGATFP 327 + GV+ G D+ + + G S+ + IPGV IE K SN + P Sbjct: 65 APSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAVKGP 124 Query: 328 DQITKGIGDYPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQ---------- 477 + + D P VSQKG V M + QV N F+G +P+P K+G +P+ Sbjct: 125 EMGSTSHLDGPS-VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANEST 183 Query: 478 --MSLGPGAPRGVTQMPINQV--NLNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDA 645 ++ G G PR V QM NQ+ N+N NRPM+NEN IRP V+NG +MLFVGELHWWTTDA Sbjct: 184 PVLNSGTGGPRAVPQMLSNQMGMNVNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDA 243 Query: 646 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVT 825 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF+++ AA+ACKEGMNG+ FNGRACVV Sbjct: 244 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRACVVA 303 Query: 826 FASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 FASPQTLKQMGASY+NKTQ Q+QSQ GRRPMNDGVGRGGGMN QGG D GRN+G+ GW Sbjct: 
304 FASPQTLKQMGASYMNKTQAQSQSQ--GRRPMNDGVGRGGGMNMQGG-DAGRNYGRGGW 359 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 400 bits (1029), Expect = e-109 Identities = 220/361 (60%), Positives = 254/361 (70%), Gaps = 27/361 (7%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+DY ++EYG +QK+QYQG GAIPALA+EELMGEDDEYDDLYNDVNVG+G +Q QQ E P Sbjct: 5 QIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQPEAP 64 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQI------ 336 + GV N +Q + D R+ G S+ IPGV +E K +N G+ FP Q Sbjct: 65 PPSAGVGNGRLQVKKTDVPEQRVQVGG-SQGSNIPGVSVEGKYTNAGSHFPAQNDVQVAV 123 Query: 337 ---TKGIGDYPD--EVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMS------ 483 G G+YPD VSQKGSV +A V N F+G + P ++GVDP+ M Sbjct: 124 NRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANE 183 Query: 484 ----LGPGA--PRGVTQMPINQ--VNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTT 639 L PGA P+G +P NQ VN N NR M+NEN IRP +ENG +MLFVGELHWWTT Sbjct: 184 PAPVLNPGAAGPQGAL-IPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGELHWWTT 242 Query: 640 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACV 819 DAELESVLSQYGR KEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH FNGR CV Sbjct: 243 DAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCV 302 Query: 820 VTFASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVG 999 V FASPQTLKQMGASY+NK Q Q QSQ G RPMNDG GRGG NYQ G D GRNFG+ G Sbjct: 303 VAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSG-DGGRNFGRGG 361 Query: 1000 W 1002 W Sbjct: 362 W 362 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 400 bits (1029), Expect = e-109 
Identities = 219/361 (60%), Positives = 254/361 (70%), Gaps = 27/361 (7%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+DY ++EYG +QK+QYQG GAIPALA+EELMGEDDEYDDLYND+NVG+G +Q QQ E P Sbjct: 5 QIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQPEAP 64 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQI------ 336 + GV N +Q + D R+ G S+ IPGV +E K +N G+ FP Q Sbjct: 65 PPSAGVGNGRLQVKKTDVPEQRVQVGG-SQGSNIPGVSVEGKYTNAGSDFPAQNDVQVAV 123 Query: 337 ---TKGIGDYPD--EVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMS------ 483 G G+YPD VSQKGSV +A V N F+G + P ++GVDP+ M Sbjct: 124 NRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRAANE 183 Query: 484 ----LGPGA--PRGVTQMPINQ--VNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTT 639 L PGA P+G +P NQ VN N NR M+NEN IRP +ENG +MLFVGELHWWTT Sbjct: 184 PAPVLNPGAAGPQGAL-IPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGELHWWTT 242 Query: 640 DAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACV 819 DAELESVLSQYGR KEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH FNGR CV Sbjct: 243 DAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCV 302 Query: 820 VTFASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVG 999 V FASPQTLKQMGASY+NK Q Q QSQ G RPMNDG GRGG NYQ G D GRNFG+ G Sbjct: 303 VAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSG-DGGRNFGRGG 361 Query: 1000 W 1002 W Sbjct: 362 W 362 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 400 bits (1029), Expect = e-109 Identities = 209/358 (58%), Positives = 262/358 (73%), Gaps = 24/358 (6%) Frame = +1 Query: 1 QLDYGDEEYGS-QKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+D+GDEEYG QK+QYQGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGF+Q+Q+SE P Sbjct: 8 QIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKGI-- 348 + GG+ + G++AQ N+ R+ G S+ + IPGV ++ K NV A +P++ + Sbjct: 68 
LQPGGLGSTGLKAQRNEAPEPRVEAGG-SQGLNIPGVSVQGKHPNVSARYPEKEEQPAVN 126 Query: 349 ------GDYPD--EVSQKGSVSAMGSEAQVGNTEFRG-----------PSPMPPKSGVDP 471 G YP +SQKGSV+ + QV N F+G PS +P K DP Sbjct: 127 RPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDP 186 Query: 472 NQ-MSLGPGAPRGVTQMPINQVNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAE 648 Q ++ G G P+G +P NQ+ N N P+MNEN ++P +ENG +MLFVGELHWWTTDAE Sbjct: 187 AQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAE 246 Query: 649 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 828 LESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++ +A+ CKEGMNG+ FNGRACVV F Sbjct: 247 LESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 Query: 829 ASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 ASPQTLKQMGASY+NK Q Q+Q+Q GRRP N+G+GRGG +NYQ G D GRN+G+ GW Sbjct: 307 ASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG-DAGRNYGRGGW 362 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 400 bits (1029), Expect = e-109 Identities = 209/358 (58%), Positives = 262/358 (73%), Gaps = 24/358 (6%) Frame = +1 Query: 1 QLDYGDEEYGS-QKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+D+GDEEYG QK+QYQGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGF+Q+Q+SE P Sbjct: 8 QIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKGI-- 348 + GG+ + G++AQ N+ R+ G S+ + IPGV ++ K NV A +P++ + Sbjct: 68 LQPGGLGSTGLKAQRNEAPEPRVEAGG-SQGLNIPGVSVQGKHPNVSARYPEKEEQPAVN 126 Query: 349 ------GDYPD--EVSQKGSVSAMGSEAQVGNTEFRG-----------PSPMPPKSGVDP 471 G YP +SQKGSV+ + QV N F+G PS +P K DP Sbjct: 127 RPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDP 186 Query: 472 NQ-MSLGPGAPRGVTQMPINQVNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAE 648 Q ++ G G P+G +P NQ+ N N P+MNEN ++P +ENG +MLFVGELHWWTTDAE Sbjct: 187 AQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAE 246 
Query: 649 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 828 LESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++ +A+ CKEGMNG+ FNGRACVV F Sbjct: 247 LESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 Query: 829 ASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 ASPQTLKQMGASY+NK Q Q+Q+Q GRRP N+G+GRGG +NYQ G D GRN+G+ GW Sbjct: 307 ASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG-DAGRNYGRGGW 362 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 400 bits (1029), Expect = e-109 Identities = 209/358 (58%), Positives = 262/358 (73%), Gaps = 24/358 (6%) Frame = +1 Query: 1 QLDYGDEEYGS-QKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+D+GDEEYG QK+QYQGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGF+Q+Q+SE P Sbjct: 8 QIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKGI-- 348 + GG+ + G++AQ N+ R+ G S+ + IPGV ++ K NV A +P++ + Sbjct: 68 LQPGGLGSTGLKAQRNEAPEPRVEAGG-SQGLNIPGVSVQGKHPNVSARYPEKEEQPAVN 126 Query: 349 ------GDYPD--EVSQKGSVSAMGSEAQVGNTEFRG-----------PSPMPPKSGVDP 471 G YP +SQKGSV+ + QV N F+G PS +P K DP Sbjct: 127 RPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDP 186 Query: 472 NQ-MSLGPGAPRGVTQMPINQVNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAE 648 Q ++ G G P+G +P NQ+ N N P+MNEN ++P +ENG +MLFVGELHWWTTDAE Sbjct: 187 AQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAE 246 Query: 649 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 828 LESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++ +A+ CKEGMNG+ FNGRACVV F Sbjct: 247 LESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 Query: 829 ASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 ASPQTLKQMGASY+NK Q Q+Q+Q GRRP N+G+GRGG +NYQ G D GRN+G+ GW Sbjct: 307 ASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG-DAGRNYGRGGW 362 >ref|XP_007044904.1| RNA-binding family protein isoform 1 
[Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 400 bits (1029), Expect = e-109 Identities = 209/358 (58%), Positives = 262/358 (73%), Gaps = 24/358 (6%) Frame = +1 Query: 1 QLDYGDEEYGS-QKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+D+GDEEYG QK+QYQGSGAIPALA+EE+MGEDDEYDDLYNDVNVGEGF+Q+Q+SE P Sbjct: 8 QIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQLQRSEAP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKGI-- 348 + GG+ + G++AQ N+ R+ G S+ + IPGV ++ K NV A +P++ + Sbjct: 68 LQPGGLGSTGLKAQRNEAPEPRVEAGG-SQGLNIPGVSVQGKHPNVSARYPEKEEQPAVN 126 Query: 349 ------GDYPD--EVSQKGSVSAMGSEAQVGNTEFRG-----------PSPMPPKSGVDP 471 G YP +SQKGSV+ + QV N F+G PS +P K DP Sbjct: 127 RPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDP 186 Query: 472 NQ-MSLGPGAPRGVTQMPINQVNLNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAE 648 Q ++ G G P+G +P NQ+ N N P+MNEN ++P +ENG +MLFVGELHWWTTDAE Sbjct: 187 AQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAE 246 Query: 649 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 828 LESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++ +A+ CKEGMNG+ FNGRACVV F Sbjct: 247 LESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 Query: 829 ASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 ASPQTLKQMGASY+NK Q Q+Q+Q GRRP N+G+GRGG +NYQ G D GRN+G+ GW Sbjct: 307 ASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG-DAGRNYGRGGW 362 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 397 bits (1021), Expect = 
e-108 Identities = 221/354 (62%), Positives = 260/354 (73%), Gaps = 20/354 (5%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 Q+DY DEEYG +QKLQYQGSGAIPALAEEE MGEDDEYDDLYNDVN+GE F+QM +SE P Sbjct: 5 QIDYEDEEYGGAQKLQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNIGENFLQMHRSEAP 63 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKG--I 348 + V N G Q + ++DL R+ G S+ + IPGV +E K S G FP+Q KG I Sbjct: 64 PAPPSVGNGGFQPRNSNDL--RVESGG-SQGLNIPGVAVESKYST-GTHFPEQNVKGPEI 119 Query: 349 GD--YPD--EVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDP----NQMSLGP---- 492 G YPD ++QK V M +++Q N F+G + P GVDP N++S P Sbjct: 120 GSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPTPVP 179 Query: 493 --GAPRGVTQMPINQVNLN--ANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAELESV 660 G PR + Q+P +Q+N+N NR NEN IRP +ENG++ML+VGELHWWTTDAELE+V Sbjct: 180 NAGVPRVIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDAELENV 239 Query: 661 LSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQ 840 LSQYG VKEIKFFDERASGKSKGYCQVEF+++ AA+ACKEGMNGH FNGRACVV FAS Q Sbjct: 240 LSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVAFASQQ 299 Query: 841 TLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 TLKQMGASY+NK Q Q QSQ GRRPMNDG GRGG MNYQGG D GRNFG+ GW Sbjct: 300 TLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGG-DAGRNFGRGGW 352 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 384 bits (987), Expect = e-104 Identities = 211/359 (58%), Positives = 250/359 (69%), Gaps = 25/359 (6%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-P 174 QLDYGDEEYG S K+QY GSG IPALAE+E+MGEDDEYDDLYNDVN+GEGF+Q+Q+SE P Sbjct: 8 QLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQLQRSEVP 67 Query: 175 
VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQ----ITK 342 V + N QAQ + SR G S+E IPG+ E K + FP Q + + Sbjct: 68 VPSVDAGNGNFQAQKDSFPASRAGGLG-SEEAKIPGIATEGKYAGTEVQFPQQKGEPVVE 126 Query: 343 GIGDYPDEVSQKGSVSA--MGSEAQVGNTEFRGPSPMPPKSGVDPNQM------------ 480 + P + +QK SA M +Q GN+ ++G PMP K G DP M Sbjct: 127 RETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNASEATPLM 186 Query: 481 -SLGPGAPRGVTQMPINQVN----LNANRPMMNENVIRPVVENGNSMLFVGELHWWTTDA 645 S+ PG PR V MP NQ+N +N N P+++E RP +ENGN+MLFVGELHWWTTDA Sbjct: 187 NSVVPG-PRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHWWTTDA 245 Query: 646 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVT 825 ELESVL+QYG VKEIKFFDERASGKSKGYCQVEFF+ +A+ACKEGMNG+NFNGRACVV Sbjct: 246 ELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGRACVVA 305 Query: 826 FASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 FA+PQT+KQMG+SY NKTQ Q QSQ GRRPMN+GVGR GG NY G D GRNFG+ W Sbjct: 306 FATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNEGVGR-GGPNYTPG-DAGRNFGRGSW 362 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. 
vesca] Length = 646 Score = 383 bits (984), Expect = e-104 Identities = 209/348 (60%), Positives = 249/348 (71%), Gaps = 17/348 (4%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSEP- 174 Q+DY +EEYG +QKLQYQ SGAIPALA+EE M EDDEYDDLYNDVNVGEGF+QM + EP Sbjct: 8 QIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQMHRPEPP 67 Query: 175 VSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKG-IG 351 + GV N G+QAQ N+ R+ + G S+EV PG +E K S+V P+Q + + Sbjct: 68 LPPAGVGNGGLQAQKNNVPEQRV-QGGASQEVKNPGFSVEGKYSSV----PEQKDQPPVS 122 Query: 352 DYPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQ------------MSLGPG 495 P+ SQKG V M +AQV N F+G + M D + M+ G Sbjct: 123 VVPEMASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSMNSGSN 182 Query: 496 APRGVTQMPINQVNL--NANRPMMNENVIRPVVENGNSMLFVGELHWWTTDAELESVLSQ 669 P V QMP NQ+N+ N NRPM+NEN IRP VENG++ LFVGELHWWTTDAELE VLSQ Sbjct: 183 GPPAVQQMPANQMNMKINVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELEGVLSQ 242 Query: 670 YGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLK 849 +GR+KEIKFFDERASGKSKGYCQV+F++ AASACKEGM+G+ FNGRACVV FAS QTLK Sbjct: 243 FGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAFASSQTLK 302 Query: 850 QMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGK 993 QMG SY+NK+Q Q Q+Q GRRPMNDG GRGG MN+QGG D GRNFG+ Sbjct: 303 QMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGG-DTGRNFGR 349 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 372 bits (955), Expect = e-100 Identities = 211/357 (59%), Positives = 253/357 (70%), Gaps = 24/357 (6%) Frame = +1 Query: 4 LDYGDEEYG-SQKLQYQGSG-AIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-- 171 +D+ DEEYG +QK QYQGSG AI ALA+EELMG+DDEYDDLYNDVNVGEGF+Q+Q+SE Sbjct: 6 IDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQRSEAP 65 Query: 172 --PVSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITKG 345 P +AG V G+QAQ + R + G S++ IPGV E + S+ G+ FP Q Sbjct: 66 SLPAAAG--VGNGLQAQKRNFPEPR-EEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQD-- 120 Query: 346 
IGDYPDEVSQKGSV----SAMGSEAQVGNTEFRGPSPMPPKSGVD----PNQM------- 480 G D+ S+ GS+ A GS+ F+G PM GVD P +M Sbjct: 121 -GLKVDKKSEAGSMVYPDGASGSQKGRIVAGFQGSKPMLHSVGVDSSDIPGKMVNEPIQA 179 Query: 481 -SLGPGAPRGVTQMPINQVNLNAN--RPMMNENVIRPVVENGNSMLFVGELHWWTTDAEL 651 + G PRG+ M NQ +NAN P++NEN IRP +ENG++MLFVGELHWWTTDAEL Sbjct: 180 PNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENGSTMLFVGELHWWTTDAEL 239 Query: 652 ESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFA 831 ESVLSQYGRVKEIKFFDERASGKSKGYCQVE++++ AA ACKEGM+GH FNGRACVV FA Sbjct: 240 ESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNGRACVVAFA 299 Query: 832 SPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 SPQTLKQMGA+Y++K QVQ QSQ GRRP+NDGVGRGG N+Q G D GRNFG+ GW Sbjct: 300 SPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNFQSG-DGGRNFGRGGW 355 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 363 bits (931), Expect = 9e-98 Identities = 201/357 (56%), Positives = 240/357 (67%), Gaps = 24/357 (6%) Frame = +1 Query: 4 LDYGDEEYGSQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE-PVS 180 +DY +EE K+QYQGSGAIPALAEEE MGEDDEYDDLYNDVNVGE F+QM SE P Sbjct: 1 MDYEEEE----KMQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 181 AGGVVNEGVQAQMNDDLGSRIPKHGVSK-EVTIPGVEIEKKDSNVGATFPDQITKGIGDY 357 V N G Q + + SRI G +T G +E SN A FP+Q + Sbjct: 56 PATVGNGGFQTRNAHE--SRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVE 113 Query: 358 PDEV--------SQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMS---------- 483 +V +QKG V M + QV N F+ +P+PP GVDP+ MS Sbjct: 114 AQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEPL 173 Query: 484 --LGPGAPRGVTQMPINQVNLNA--NRPMMNENVIRPVVENGNSMLFVGELHWWTTDAEL 651 G PRG QM +NQ++++A NRP++NEN +RP +ENG++ L+VGELHWWTTDAEL Sbjct: 174 PITGSAGPRGAPQMQVNQMHMSADVNRPVVNENQVRPPIENGSTTLYVGELHWWTTDAEL 233 Query: 652 ESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFA 831 ES 
SQ+GRVKEIKFFDERASGKSKGYCQV+F+E+ AA+ACKEGMNGH FNGR CVV FA Sbjct: 234 ESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAFA 293 Query: 832 SPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 SPQTLKQMGASY+NKTQ Q Q+Q GR MNDG GRGG N+Q G D GRN+G+ W Sbjct: 294 SPQTLKQMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSG-DGGRNYGRGAW 349 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 343 bits (881), Expect = 6e-92 Identities = 195/340 (57%), Positives = 227/340 (66%), Gaps = 7/340 (2%) Frame = +1 Query: 4 LDYGDEEYGSQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE---- 171 +D+ +EE K+QYQGSGAIPALAEEEL GEDDEYDDLYNDVNVGE F+QM SE Sbjct: 1 MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 172 PVSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTI-PGVEIEKKDSNVGATFPDQITKGI 348 P +AG N G Q + + SR+ G T GV +E K SN GA FP+Q GI Sbjct: 56 PATAG---NGGFQTRNAHE--SRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGI 110 Query: 349 GDYPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMSLGPGAPRGVTQMPIN 528 G ++V G G + V G PRGV QM +N Sbjct: 111 GVEANDVGSIG----YGDGSSVAQK---------------------GSAGPRGVPQMQVN 145 Query: 529 QVNLNA--NRPMMNENVIRPVVENGNSMLFVGELHWWTTDAELESVLSQYGRVKEIKFFD 702 Q+N+NA NRP++NEN +RP +ENG + L+VGELHWWTTDAELESV SQYGRVKEIKFFD Sbjct: 146 QMNMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFFD 205 Query: 703 ERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMGASYLNKTQ 882 ERASGKSKGYCQV+F+E+ AA+ACKEGMN H FNGR CVV FAS QTLKQMGASY++KTQ Sbjct: 206 ERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKTQ 265 Query: 883 VQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 Q Q Q GR MNDG+GRGG NYQ G D GRN+G+ GW Sbjct: 266 GQPQPQSQGRGSMNDGMGRGGNANYQSG-DGGRNYGRGGW 304 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score 
= 343 bits (881), Expect = 6e-92 Identities = 195/340 (57%), Positives = 227/340 (66%), Gaps = 7/340 (2%) Frame = +1 Query: 4 LDYGDEEYGSQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSE---- 171 +D+ +EE K+QYQGSGAIPALAEEEL GEDDEYDDLYNDVNVGE F+QM SE Sbjct: 1 MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 172 PVSAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTI-PGVEIEKKDSNVGATFPDQITKGI 348 P +AG N G Q + + SR+ G T GV +E K SN GA FP+Q GI Sbjct: 56 PATAG---NGGFQTRNAHE--SRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGI 110 Query: 349 GDYPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMSLGPGAPRGVTQMPIN 528 G ++V G G + V G PRGV QM +N Sbjct: 111 GVEANDVGSIG----YGDGSSVAQK---------------------GSAGPRGVPQMQVN 145 Query: 529 QVNLNA--NRPMMNENVIRPVVENGNSMLFVGELHWWTTDAELESVLSQYGRVKEIKFFD 702 Q+N+NA NRP++NEN +RP +ENG + L+VGELHWWTTDAELESV SQYGRVKEIKFFD Sbjct: 146 QMNMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFFD 205 Query: 703 ERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMGASYLNKTQ 882 ERASGKSKGYCQV+F+E+ AA+ACKEGMN H FNGR CVV FAS QTLKQMGASY++KTQ Sbjct: 206 ERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKTQ 265 Query: 883 VQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVGW 1002 Q Q Q GR MNDG+GRGG NYQ G D GRN+G+ GW Sbjct: 266 GQPQPQSQGRGSMNDGMGRGGNANYQSG-DGGRNYGRGGW 304 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 332 bits (852), Expect = 1e-88 Identities = 191/357 (53%), Positives = 228/357 (63%), Gaps = 24/357 (6%) Frame = +1 Query: 1 QLDYGDEEYG-SQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQMQQSEPV 177 QLDYGDEEYG +QK+QY GAIPALAE+E++G+DDEYDDLYNDVNVGEGFMQMQ+SE Sbjct: 8 QLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQMQRSEAP 67 Query: 178 SAGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQITK----G 345 V N N G+R S+EV V E + G DQ G Sbjct: 68 PPSAVGNNSFSISKNTAPGTRAEAIA-SQEVNNGRVGNEGSYAPNGVQLSDQKNNLTAVG 126 Query: 346 IGDYPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMSLG------------ 489 P + SQ+ + + + +Q + ++G M K+ D S Sbjct: 127 
GPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPASLVYPN 186 Query: 490 PGAPRGVTQMPIN------QVNLNANRPMMNENVIRPVV-ENGNSMLFVGELHWWTTDAE 648 G+ +GV Q P N VN+N NR M +E +IRP ENGN M++VGELHWWTTDAE Sbjct: 187 TGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHWWTTDAE 246 Query: 649 LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 828 +ESVL QYGRVKEIKFFDERASGKSKGYCQVEF++ AA+ACK+GM GH FNGRACVVT+ Sbjct: 247 VESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGRACVVTY 306 Query: 829 ASPQTLKQMGASYLNKTQVQAQSQVSGRRPMNDGVGRGGGMNYQGGADNGRNFGKVG 999 A+PQT KQMGASY NK Q Q+QSQ+ GR PMNDG GRG G NY G D GRNFG+ G Sbjct: 307 ANPQTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGTNYPSG-DAGRNFGRGG 361 >ref|XP_002889992.1| RNA recognition motif-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297335834|gb|EFH66251.1| RNA recognition motif-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 573 Score = 301 bits (771), Expect = 3e-79 Identities = 176/352 (50%), Positives = 230/352 (65%), Gaps = 19/352 (5%) Frame = +1 Query: 7 DYGDEEYGSQKLQYQGSGAIPALAEEELMGEDDEYDDLYNDVNVGEGFMQM--QQSEPVS 180 DYG G+QK+ +QGSG IPALA+EELMG+DDEYDDLY+DVNVGE F Q Q P Sbjct: 6 DYG----GNQKILHQGSGTIPALADEELMGDDDEYDDLYSDVNVGESFFQAHNQPQPPAQ 61 Query: 181 AGGVVNEGVQAQMNDDLGSRIPKHGVSKEVTIPGVEIEKKDSNVGATFPDQIT------- 339 GG N +QAQ + P+ G+ T+ G + + G + PD + Sbjct: 62 VGGTGNASLQAQTSHVAAE--PRMGIVSGGTVEG-KYRNDGGHNGISGPDTRSDVYPQAS 118 Query: 340 ----KG--IGDYPDEVSQKGSVSAMGSEAQVGNTEFRGPSPMPPKSGVDPNQMSLGPGAP 501 KG I +++ Q+GS S + + N F G + P+ P G P Sbjct: 119 SFGAKGLNIDIQSNKIGQQGSTSVV-----LNNHGFSGNAVNVPEL---PVHNPYG-APP 169 Query: 502 RGVTQMPINQVNLNANRPMMNENVIRP-VVENGNSMLFVGELHWWTTDAELESVLSQYGR 678 +G Q+P++Q+++N N MMN++ +P VV+NGN+MLFVGELHWWTTDAE+ESVLSQYGR Sbjct: 170 QGAQQIPVSQMSVNPN-VMMNKSPTQPFVVDNGNTMLFVGELHWWTTDAEIESVLSQYGR 228 Query: 679 VKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMG 858 VKEIKFFDER SGKSKGYCQVEF++S AA++CKEGMNG+ FNG+ACVV FASP+TLKQMG Sbjct: 229 
VKEIKFFDERVSGKSKGYCQVEFYDSAAAASCKEGMNGYIFNGKACVVAFASPETLKQMG 288 Query: 859 ASYLNKTQVQAQSQVSGRRPMNDGVGRG---GGMNYQGGADNGRNFGKVGWA 1005 A++ + Q Q+Q+ RRP+N+G+GRG MN Q G D GRN+G+ G+A Sbjct: 289 ANFTGRN--QGQNQIQNRRPLNEGMGRGNNNNNMNTQNG-DGGRNYGRGGFA 337