BLASTX nr result
ID: Akebia24_contig00006363
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00006363
         (844 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   253   5e-65
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   253   5e-65
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   245   2e-62
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   245   2e-62
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   245   2e-62
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   245   2e-62
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   238   3e-60
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   234   2e-59
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   234   2e-59
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   231   2e-58
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   229   1e-57
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   228   2e-57
emb|CBI16834.3| unnamed protein product [Vitis vinifera]              228   2e-57
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   215   2e-53
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   210   6e-52
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   208   2e-51
ref|XP_002312652.1| RNA recognition motif-containing family prot...   198   2e-48
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   176   1e-41
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A...
169 9e-40 ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu... 161 3e-37 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 253 bits (647), Expect = 5e-65 Identities = 136/257 (52%), Positives = 172/257 (66%), Gaps = 22/257 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+D+GDEEYG QKMQYQ GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 489 LQRSE P GG+ S G+Q Q+ + P + E G SQ +NIPGV V+ N+ A P+Q Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRG-EAGGSQGLNIPGVSVQGKHLNVTARYPEQ 119 Query: 490 -----------AKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 612 G + KG V+E ++ QV+ GF+ + K+G+DP+ + Sbjct: 120 DGQPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVP 179 Query: 613 GKFLSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMSRPVVENGATMLFVGELH 792 K + + ++GTG P+ A +P N+ GLN+N PM++EN RP +ENG TMLFVGELH Sbjct: 180 QKIANVPAQSLNSGTGGPQGAPHVPPNQMGLNVNHPMISENQVRPPIENGPTMLFVGELH 239 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 240 WWTTDAELESVLSQYGR 256 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 253 bits (647), Expect = 5e-65 Identities = 143/255 (56%), Positives = 170/255 (66%), Gaps = 23/255 (9%) Frame = +1 Query: 148 MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 324 MAEEQLDY DEEYG QKM +Q GGAISALAD++LMGEDDEYDDLYNDVN+GEGFLQ+ R Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 
325 SEPVSLGGVESGG-VQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSN----IRATVPD 486 SE + GV +GG Q +TD P K E G SQ + IPGV +E SN + P Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKL-EAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPM 119 Query: 487 QAKG--------------GFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFL 624 KG KG VLEM + QV GF+ S P+P K G +P+ + GK Sbjct: 120 AVKGPEMGSTSHLDGPSVSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIA 179 Query: 625 SGSSPLSDAGTGAPRVATQIPINRPGL--NINRPMVNENMSRPVVENGATMLFVGELHWW 798 + S+P+ ++GTG PR Q+ N+ G+ N+NRPMVNEN RP V+NGATMLFVGELHWW Sbjct: 180 NESTPVLNSGTGGPRAVPQMLSNQMGMNVNVNRPMVNENQIRPAVDNGATMLFVGELHWW 239 Query: 799 TTDAELESVLSQYGR 843 TTDAELESVLSQYGR Sbjct: 240 TTDAELESVLSQYGR 254 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 245 bits (625), Expect = 2e-62 Identities = 132/257 (51%), Positives = 169/257 (65%), Gaps = 22/257 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+D+GDEEYG QKMQYQ GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 489 LQRSE P+ GG+ S G++ Q + P + E G SQ +NIPGV V+ N+ A P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 490 AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 612 + G + KGSV E + QV+ GF+ K+G+DP+ + Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 613 GKFLSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMSRPVVENGATMLFVGELH 792 K + + ++GTG P+ +P N+ G N+N P++NEN +P +ENG TMLFVGELH Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 240 WWTTDAELESVLSQYGR 256 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, 
partial [Theobroma cacao] Length = 656 Score = 245 bits (625), Expect = 2e-62 Identities = 132/257 (51%), Positives = 169/257 (65%), Gaps = 22/257 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+D+GDEEYG QKMQYQ GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 489 LQRSE P+ GG+ S G++ Q + P + E G SQ +NIPGV V+ N+ A P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 490 AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 612 + G + KGSV E + QV+ GF+ K+G+DP+ + Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 613 GKFLSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMSRPVVENGATMLFVGELH 792 K + + ++GTG P+ +P N+ G N+N P++NEN +P +ENG TMLFVGELH Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 240 WWTTDAELESVLSQYGR 256 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 245 bits (625), Expect = 2e-62 Identities = 132/257 (51%), Positives = 169/257 (65%), Gaps = 22/257 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+D+GDEEYG QKMQYQ GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 489 LQRSE P+ GG+ S G++ Q + P + E G SQ +NIPGV V+ N+ A P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 490 AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 612 + G + KGSV E + QV+ GF+ K+G+DP+ + Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 613 
GKFLSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMSRPVVENGATMLFVGELH 792 K + + ++GTG P+ +P N+ G N+N P++NEN +P +ENG TMLFVGELH Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 240 WWTTDAELESVLSQYGR 256 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 245 bits (625), Expect = 2e-62 Identities = 132/257 (51%), Positives = 169/257 (65%), Gaps = 22/257 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+D+GDEEYG QKMQYQ GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 489 LQRSE P+ GG+ S G++ Q + P + E G SQ +NIPGV V+ N+ A P++ Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119 Query: 490 AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 612 + G + KGSV E + QV+ GF+ K+G+DP+ + Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 613 GKFLSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMSRPVVENGATMLFVGELH 792 K + + ++GTG P+ +P N+ G N+N P++NEN +P +ENG TMLFVGELH Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 240 WWTTDAELESVLSQYGR 256 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 238 bits 
(606), Expect = 3e-60 Identities = 135/260 (51%), Positives = 166/260 (63%), Gaps = 25/260 (9%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+DY +EEYG QKMQYQ GGAI ALADE+LMGEDDEYDDLYNDVN+G+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------R 453 Q+ E P GV +G +Q ++TD P + + G SQ N+PGV VE + Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGTHFPAQ 119 Query: 454 NDSNIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQI 609 ND + P+ G + KGSV E + V GF+ S PP+ GVDP+ + Sbjct: 120 NDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNM 179 Query: 610 SGKFLSGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMSRPVVENGATMLFVG 783 G+ + +P+ + G P+ A IP N+ G+NIN R MVNEN RP +ENG TMLFVG Sbjct: 180 PGRVANEPAPVLNPGAAGPQGAL-IPANQMGVNINVNRAMVNENQIRPPLENGGTMLFVG 238 Query: 784 ELHWWTTDAELESVLSQYGR 843 ELHWWTTDAELESVLSQYGR Sbjct: 239 ELHWWTTDAELESVLSQYGR 258 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 234 bits (598), Expect = 2e-59 Identities = 134/260 (51%), Positives = 165/260 (63%), Gaps = 25/260 (9%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MD MAEEQ+DY +EEYG QKMQYQ GGAI ALADE+LMGEDDEYDDLYNDVN+G+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------R 453 Q+ E P GV +G +Q ++TD P + + G SQ N+PGV VE + Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGTHFPAQ 119 Query: 454 NDSNIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQI 609 ND + P+ G + KGSV E + V GF+ S P + GVDP+ + Sbjct: 120 NDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNM 179 Query: 610 SGKFLSGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMSRPVVENGATMLFVG 783 G+ + +P+ + G P+ A IP N+ G+NIN R MVNEN RP +ENG TMLFVG Sbjct: 
180 PGRVANEPAPVLNPGAAGPQGAL-IPANQMGVNINVNRAMVNENQIRPPLENGGTMLFVG 238 Query: 784 ELHWWTTDAELESVLSQYGR 843 ELHWWTTDAELESVLSQYGR Sbjct: 239 ELHWWTTDAELESVLSQYGR 258 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 234 bits (598), Expect = 2e-59 Identities = 127/248 (51%), Positives = 162/248 (65%), Gaps = 13/248 (5%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MDPM EEQ+DY +EEYG QK+QYQ GAI ALADE+ M EDDEYDDLYNDVN+GEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 316 LQRSEP-VSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPG---------VVERNDSN 465 + R EP + GV +GG+Q Q+ + P + + GASQ+V PG V E+ D Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRV-QGGASQEVKNPGFSVEGKYSSVPEQKDQP 119 Query: 466 IRATVPDQAKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGSSPLS 645 + VP+ A KG V+EM + QV GF+ +A M + D + ++GK +G P Sbjct: 120 PVSVVPEMASQ--KGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSM 177 Query: 646 DAGTGAPRVATQIPINRPGL--NINRPMVNENMSRPVVENGATMLFVGELHWWTTDAELE 819 ++G+ P Q+P N+ + N+NRPMVNEN RP VENG+ LFVGELHWWTTDAELE Sbjct: 178 NSGSNGPPAVQQMPANQMNMKINVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELE 237 Query: 820 SVLSQYGR 843 VLSQ+GR Sbjct: 238 GVLSQFGR 245 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 231 bits (590), Expect = 2e-58 Identities = 132/237 (55%), Positives = 163/237 (68%), Gaps = 5/237 (2%) Frame = +1 Query: 148 MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 324 MAEEQ+DY DEEYG QK+QYQ GAISALADE+ M EDDEYDDLYNDVN+ EGFLQ+ R Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 325 SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQAKG 498 SE P+ GGV +GG+Q Q+TD ++ + G SQ+ IPGV V+ S+ A P+Q Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRV-QAGVSQESKIPGVSVQGKYSSAVAQFPEQQ-- 117 Query: 499 
GFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGSSPLSDAGTGAPRVAT 678 +A+E ++ +TG+ S MPP +G D + I+GK S P ++GT P T Sbjct: 118 ----GQPPVAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTAGPTGVT 172 Query: 679 QIPINRPGL--NINRPMVNENMSRPVVENGATMLFVGELHWWTTDAELESVLSQYGR 843 Q+P N+ + N NRPM NEN RP VENG+TMLFVGELHWWTTDAELESVLSQYGR Sbjct: 173 QMPTNQISIKVNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQYGR 229 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 229 bits (584), Expect = 1e-57 Identities = 131/257 (50%), Positives = 163/257 (63%), Gaps = 25/257 (9%) Frame = +1 Query: 148 MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 324 MAEEQ+DY ++EYG QKMQYQ GGAI ALADE+LMGEDDEYDDLYNDVN+G+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 325 SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------RNDS 462 E P GV +G +Q ++TD P + + G SQ NIPGV VE +ND Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDV 119 Query: 463 NIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGK 618 + P+ G + KGSV E + V GF+ S P + GVDP+ + G+ Sbjct: 120 QVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGR 179 Query: 619 FLSGSSPLSDAGTGAPRVATQIPINRPGL--NINRPMVNENMSRPVVENGATMLFVGELH 792 + +P+ + G P+ A IP N+ G+ N+NR MVNEN RP +ENG TMLFVGELH Sbjct: 180 VANEPAPVLNPGAAGPQGAL-IPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGELH 238 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 239 WWTTDAELESVLSQYGR 255 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 228 bits (582), Expect = 2e-57 Identities = 130/257 (50%), Positives = 163/257 (63%), Gaps = 
25/257 (9%) Frame = +1 Query: 148 MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 324 MAEEQ+DY ++EYG QKMQYQ GGAI ALADE+LMGEDDEYDDLYND+N+G+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 325 SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------RNDS 462 E P GV +G +Q ++TD P + + G SQ NIPGV VE +ND Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDV 119 Query: 463 NIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGK 618 + P+ G + KGSV E + V GF+ S P + GVDP+ + G+ Sbjct: 120 QVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGR 179 Query: 619 FLSGSSPLSDAGTGAPRVATQIPINRPGL--NINRPMVNENMSRPVVENGATMLFVGELH 792 + +P+ + G P+ A IP N+ G+ N+NR MVNEN RP +ENG TMLFVGELH Sbjct: 180 AANEPAPVLNPGAAGPQGAL-IPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGELH 238 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAELESVLSQYGR Sbjct: 239 WWTTDAELESVLSQYGR 255 >emb|CBI16834.3| unnamed protein product [Vitis vinifera] Length = 491 Score = 228 bits (581), Expect = 2e-57 Identities = 124/237 (52%), Positives = 153/237 (64%), Gaps = 22/237 (9%) Frame = +1 Query: 199 MQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQRSEPVSLGGVESGG-VQTQ 375 M +Q GGAISALAD++LMGEDDEYDDLYNDVN+GEGFLQ+ RSE + GV +GG Q Sbjct: 1 MPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHRSEAPAPSGVMAGGPFQAH 60 Query: 376 ETDGPGSKAPEHGASQDVNIPGVV-----------ERNDSNIRATVPDQAKGGF------ 504 +TD P K E G SQ + IPGV E+ + + P+ Sbjct: 61 KTDVPPQKL-EAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAVKGPEMGSTSHLDGPSV 119 Query: 505 --KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGSSPLSDAGTGAPRVAT 678 KG VLEM + QV GF+ S P+P K G +P+ + GK + S+P+ ++GTG PR Sbjct: 120 SQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANESTPVLNSGTGGPRAVP 179 Query: 679 QIPINRPGLNIN--RPMVNENMSRPVVENGATMLFVGELHWWTTDAELESVLSQYGR 843 Q+ N+ G+N+N RPMVNEN RP V+NGATMLFVGELHWWTTDAELESVLSQYGR Sbjct: 180 QMLSNQMGMNVNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVLSQYGR 236 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like 
isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 215 bits (547), Expect = 2e-53 Identities = 123/256 (48%), Positives = 155/256 (60%), Gaps = 22/256 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MDP A+EQLDYGDEEYG + KMQY G I ALA++++MGEDDEYDDLYNDVNIGEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 316 LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 489 LQRSE PV +G Q Q+ P S+A G S++ IPG+ E + P Q Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLG-SEEAKIPGIATEGKYAGTEVQFPQQ 119 Query: 490 ---------------AKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFL 624 A + S + M Q +G++ S PMP KIG DP + K Sbjct: 120 KGEPVVERETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNA 179 Query: 625 SGSSPLSDAGTGAPRVATQIPINR----PGLNINRPMVNENMSRPVVENGATMLFVGELH 792 S ++PL ++ PRV +P N+ +N+N P+++E RP +ENG TMLFVGELH Sbjct: 180 SEATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELH 239 Query: 793 WWTTDAELESVLSQYG 840 WWTTDAELESVL+QYG Sbjct: 240 WWTTDAELESVLTQYG 255 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 210 bits (534), Expect = 6e-52 Identities = 123/250 (49%), Positives = 157/250 (62%), Gaps = 19/250 (7%) Frame = +1 Query: 148 MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 324 MA+EQ+DY DEEYG QK+QYQ GAI ALA+E+ MGEDDEYDDLYNDVNIGE FLQ+ R Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNIGENFLQMHR 59 Query: 325 SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGVVERNDSNIRATVPDQ-AKG 498 SE P + V +GG Q + ++ E G SQ +NIPGV + + P+Q KG Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN---DLRVESGGSQGLNIPGVAVESKYSTGTHFPEQNVKG 116 Query: 499 GFKGS--------------VLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGSS 636 GS V+EM + Q GF+ S P IGVDP+ ++ K + + Sbjct: 117 
PEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPT 176 Query: 637 PLSDAGTGAPRVATQIPINRPGLNI--NRPMVNENMSRPVVENGATMLFVGELHWWTTDA 810 P+ +A G PRV Q+P ++ +N+ NR NEN RP +ENG+TML+VGELHWWTTDA Sbjct: 177 PVPNA--GVPRVIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDA 234 Query: 811 ELESVLSQYG 840 ELE+VLSQYG Sbjct: 235 ELENVLSQYG 244 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 208 bits (529), Expect = 2e-51 Identities = 126/259 (48%), Positives = 153/259 (59%), Gaps = 27/259 (10%) Frame = +1 Query: 148 MAEEQLDYGDEEYG-TQKMQYQ-SGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQ 321 MAE+ +D+ DEEYG QK QYQ SGGAISALADE+LMG+DDEYDDLYNDVN+GEGFLQLQ Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 322 RSEPVSLGGVES--GGVQTQETDGPGSKAPEHGASQDVNIPGV----------------- 444 RSE SL G+Q Q+ + P + E G SQ NIPGV Sbjct: 61 RSEAPSLPAAAGVGNGLQAQKRNFPEPRE-EIGGSQQPNIPGVSAEGRFSSAGSQFPGQQ 119 Query: 445 ----VERNDSNIRATVPDQAKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 612 V++ PD A G KG ++ GF+ S PM +GVD + I Sbjct: 120 DGLKVDKKSEAGSMVYPDGASGSQKGRIV----------AGFQGSKPMLHSVGVDSSDIP 169 Query: 613 GKFLSGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMSRPVVENGATMLFVGE 786 GK ++ ++G PR + N+ +N N P+VNEN RP +ENG+TMLFVGE Sbjct: 170 GKMVNEPIQAPNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENGSTMLFVGE 229 Query: 787 LHWWTTDAELESVLSQYGR 843 LHWWTTDAELESVLSQYGR Sbjct: 230 LHWWTTDAELESVLSQYGR 248 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 198 bits (503), Expect = 2e-48 Identities = 118/250 (47%), Positives = 150/250 (60%), Gaps = 23/250 (9%) Frame = +1 Query: 163 LDYGDEEYGTQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQRSE-PVS 339 +DY +EE KMQYQ GAI ALA+E+ MGEDDEYDDLYNDVN+GE FLQ+ SE P Sbjct: 1 MDYEEEE----KMQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 340 
LGGVESGGVQTQETDGPGSKAPEHGASQDVNIPG---VVERNDSNIRATVPDQAKGGF-- 504 V +GG QT+ E G SQ + I G VE SN +A P+Q + Sbjct: 56 PATVGNGGFQTRNAH---ESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAV 112 Query: 505 ---------------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGSSP 639 KG V+EM+ ++QV GF+ S P+PP IGVDP+ +S K P Sbjct: 113 EAQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEP 172 Query: 640 LSDAGTGAPRVATQIPINRPGLN--INRPMVNENMSRPVVENGATMLFVGELHWWTTDAE 813 L G+ PR A Q+ +N+ ++ +NRP+VNEN RP +ENG+T L+VGELHWWTTDAE Sbjct: 173 LPITGSAGPRGAPQMQVNQMHMSADVNRPVVNENQVRPPIENGSTTLYVGELHWWTTDAE 232 Query: 814 LESVLSQYGR 843 LES SQ+GR Sbjct: 233 LESFASQFGR 242 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 176 bits (446), Expect = 1e-41 Identities = 110/257 (42%), Positives = 144/257 (56%), Gaps = 22/257 (8%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MDP+ +EQLDYGDEEYG QKMQY GGAI ALA+++++G+DDEYDDLYNDVN+GEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 316 LQRSEPVSLGGVESGGVQTQETDGPGSKAPEHGASQDVN----------IPGVVERNDSN 465 +QRSE V + + PG++A E ASQ+VN P V+ +D Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRA-EAIASQEVNNGRVGNEGSYAPNGVQLSDQK 119 Query: 466 IRATV---PDQ-AKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGS 633 T P Q + + E+A Q G++ S M K D S + Sbjct: 120 NNLTAVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEP 179 Query: 634 SPLSDAGTGAPRVATQIPIN------RPGLNINRPMVNENMSRPV-VENGATMLFVGELH 792 + L TG+ + Q P N +N+NR M +E + RP ENG M++VGELH Sbjct: 180 ASLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELH 239 Query: 793 WWTTDAELESVLSQYGR 843 WWTTDAE+ESVL QYGR Sbjct: 240 WWTTDAEVESVLIQYGR 256 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 169 bits (429), Expect = 9e-40 Identities = 123/280 (43%), Positives = 149/280 (53%), 
Gaps = 45/280 (16%) Frame = +1 Query: 139 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 315 MDPMAEEQLDY DE+YG QKM +Q+GGAISALADE+LMGEDDEYDDLYNDVN+G+GF+Q Sbjct: 1 MDPMAEEQLDYEDEDYGANQKMPFQTGGAISALADEELMGEDDEYDDLYNDVNVGDGFMQ 60 Query: 316 -LQRSEPVSLGGVESGGVQTQETDGPGSKAP--EHGASQDVNIPGVVER----------- 453 LQ EPV E+ G G +AP E ++ VNIPGV Sbjct: 61 SLQHQEPVQ-----------YESMGNGVQAPKEEPISTPPVNIPGVGHEEKGEKDAKLSG 109 Query: 454 -NDSNIRATVPDQ-------AKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIG------ 591 +D + + +Q A G K V E E Q + +GFR +AP PP G Sbjct: 110 FSDLDQKKAFQEQASNQLAGASSGLKIRVSEPVSEPQPQASGFR-NAPAPPAKGSGFNTA 168 Query: 592 --VDPN----QISGKFLSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMV--------- 726 +D N Q S + P G GA A + PG N ++ Sbjct: 169 GAMDANKQLAQTSSNAVPRVGPGPGPGIGAGPNANMNRMMGPGPNQAGAVIDTSARFGSE 228 Query: 727 -NENMSRPVVENGATMLFVGELHWWTTDAELESVLSQYGR 843 + +S E+G TMLFVGEL WWTTDAELESVLSQYGR Sbjct: 229 NSNRLSHGGGESGNTMLFVGELQWWTTDAELESVLSQYGR 268 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 161 bits (407), Expect = 3e-37 Identities = 101/233 (43%), Positives = 126/233 (54%), Gaps = 6/233 (2%) Frame = +1 Query: 163 LDYGDEEYGTQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQRSE-PVS 339 +D+ +EE KMQYQ GAI ALA+E+L GEDDEYDDLYNDVN+GE FLQ+ SE P Sbjct: 1 MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55 Query: 340 LGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV---VERNDSNIRATVPDQAKGGFKG 510 +GG QT+ E G SQ + G VE SN A P+Q + G Sbjct: 56 PATAGNGGFQTRNAH---ESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAG--- 109 Query: 511 SVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFLSGSSPLSDAGTGAPRVATQIPI 690 IGV+ N + S ++ G+ PR Q+ + Sbjct: 110 -------------------------IGVEANDVGSIGYGDGSSVAQKGSAGPRGVPQMQV 144 Query: 691 NRPGLN--INRPMVNENMSRPVVENGATMLFVGELHWWTTDAELESVLSQYGR 843 N+ +N +NRP+VNEN RP +ENG T L+VGELHWWTTDAELESV SQYGR Sbjct: 145 
NQMNMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGR 197