BLASTX nr result

ID: Akebia26_contig00014273 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00014273
         (1220 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   422   e-115
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   409   e-111
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   404   e-110
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   400   e-109
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   396   e-107
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   395   e-107
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   395   e-107
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   395   e-107
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   395   e-107
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   392   e-106
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   388   e-105
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   387   e-105
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   376   e-102
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   370   e-100
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   368   3e-99
ref|XP_002312652.1| RNA recognition motif-containing family prot...   355   2e-95
gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   328   3e-87
ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A...   319   1e-84
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...   311   5e-82
ref|XP_002315647.1| RNA recognition motif-containing family prot...   311   5e-82

>ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis
            vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED:
            uncharacterized protein LOC100268141 isoform 2 [Vitis
            vinifera]
          Length = 647

 Score =  422 bits (1084), Expect = e-115
 Identities = 222/351 (63%), Positives = 258/351 (73%), Gaps = 23/351 (6%)
 Frame = -1

Query: 995  MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 819
            MAEEQLDY DEEYG  QKM +Q GGAISALAD++LMGEDDEYDDLYNDVN+GEGFLQ+ R
Sbjct: 1    MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60

Query: 818  SEPVSLGGVESGG-VQTQETDGPGSKAPEHGASQDVNIPGVV-----------ERNDSNI 675
            SE  +  GV +GG  Q  +TD P  K  E G SQ + IPGV            E+ +  +
Sbjct: 61   SEAPAPSGVMAGGPFQAHKTDVPPQKL-EAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPM 119

Query: 674  RATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFV 519
                P+              KG VLEM  + QV   GF+ S P+P K G +P+ + GK  
Sbjct: 120  AVKGPEMGSTSHLDGPSVSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIA 179

Query: 518  SGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMIRPVVENGATMLFVGELHWW 345
            + S+P+ ++GTG PR   Q+  N+ G+N+N  RPMVNEN IRP V+NGATMLFVGELHWW
Sbjct: 180  NESTPVLNSGTGGPRAVPQMLSNQMGMNVNVNRPMVNENQIRPAVDNGATMLFVGELHWW 239

Query: 344  TTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRA 165
            TTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF+++ AA+ACKEGMNG+ FNGRA
Sbjct: 240  TTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRA 299

Query: 164  CVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            CVV FASPQTLKQMGASY+NKTQ Q+QSQ  GRRPMNDGVGRGGGMN QGG
Sbjct: 300  CVVAFASPQTLKQMGASYMNKTQAQSQSQ--GRRPMNDGVGRGGGMNMQGG 348


>ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|590695488|ref|XP_007044903.1| RNA-binding family
            protein isoform 1 [Theobroma cacao]
            gi|508708837|gb|EOY00734.1| RNA-binding family protein
            isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1|
            RNA-binding family protein isoform 1 [Theobroma cacao]
          Length = 653

 Score =  409 bits (1051), Expect = e-111
 Identities = 211/353 (59%), Positives = 258/353 (73%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+D+GDEEYG  QKMQYQ  GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 654
            LQRSE P   GG+ S G+Q Q+ + P  +  E G SQ +NIPGV V+    N+ A  P+Q
Sbjct: 61   LQRSEAPPQPGGMGSTGLQAQKNEAPEPRG-EAGGSQGLNIPGVSVQGKHLNVTARYPEQ 119

Query: 653  -----------AKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 531
                         G +        KG V+E  ++ QV+  GF+  +    K+G+DP+ + 
Sbjct: 120  DGQPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVP 179

Query: 530  GKFVSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMIRPVVENGATMLFVGELH 351
             K  +  +   ++GTG P+ A  +P N+ GLN+N PM++EN +RP +ENG TMLFVGELH
Sbjct: 180  QKIANVPAQSLNSGTGGPQGAPHVPPNQMGLNVNHPMISENQVRPPIENGPTMLFVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEF++  +A+ACKEGM+G+ FNG
Sbjct: 240  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYMFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVV FASPQTLKQMGASY+NK Q Q+Q+Q  GRRP NDG+GRGG MNYQ G
Sbjct: 300  RACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQSG 351


>ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina]
            gi|557540375|gb|ESR51419.1| hypothetical protein
            CICLE_v10030915mg [Citrus clementina]
          Length = 658

 Score =  404 bits (1037), Expect = e-110
 Identities = 215/356 (60%), Positives = 251/356 (70%), Gaps = 25/356 (7%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+DY +EEYG  QKMQYQ GGAI ALADE+LMGEDDEYDDLYNDVN+G+G LQ
Sbjct: 1    MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------R 690
             Q+ E P    GV +G +Q ++TD P  +  + G SQ  N+PGV VE            +
Sbjct: 61   FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGTHFPAQ 119

Query: 689  NDSNIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQI 534
            ND  +    P+   G +        KGSV E   +  V   GF+ S   PP+ GVDP+ +
Sbjct: 120  NDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNM 179

Query: 533  SGKFVSGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMIRPVVENGATMLFVG 360
             G+  +  +P+ + G   P+ A  IP N+ G+NIN  R MVNEN IRP +ENG TMLFVG
Sbjct: 180  PGRVANEPAPVLNPGAAGPQGAL-IPANQMGVNINVNRAMVNENQIRPPLENGGTMLFVG 238

Query: 359  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHN 180
            ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH 
Sbjct: 239  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHV 298

Query: 179  FNGRACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            FNGR CVV FASPQTLKQMGASY+NK Q Q QSQ  GRRPMNDG GRGG MNYQ G
Sbjct: 299  FNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSG 354


>ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Citrus sinensis]
          Length = 658

 Score =  400 bits (1029), Expect = e-109
 Identities = 214/356 (60%), Positives = 250/356 (70%), Gaps = 25/356 (7%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+DY +EEYG  QKMQYQ GGAI ALADE+LMGEDDEYDDLYNDVN+G+G LQ
Sbjct: 1    MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------R 690
             Q+ E P    GV +G +Q ++TD P  +  + G SQ  N+PGV VE            +
Sbjct: 61   FQQPEAPPPSAGVGNGRLQVKKTDVPEQQV-QAGVSQGSNVPGVSVEGKYTNAGTHFPAQ 119

Query: 689  NDSNIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQI 534
            ND  +    P+   G +        KGSV E   +  V   GF+ S   P + GVDP+ +
Sbjct: 120  NDVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNM 179

Query: 533  SGKFVSGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMIRPVVENGATMLFVG 360
             G+  +  +P+ + G   P+ A  IP N+ G+NIN  R MVNEN IRP +ENG TMLFVG
Sbjct: 180  PGRVANEPAPVLNPGAAGPQGAL-IPANQMGVNINVNRAMVNENQIRPPLENGGTMLFVG 238

Query: 359  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHN 180
            ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH 
Sbjct: 239  ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHV 298

Query: 179  FNGRACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            FNGR CVV FASPQTLKQMGASY+NK Q Q QSQ  GRRPMNDG GRGG MNYQ G
Sbjct: 299  FNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSG 354


>ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica]
           gi|462422613|gb|EMJ26876.1| hypothetical protein
           PRUPE_ppa002814mg [Prunus persica]
          Length = 630

 Score =  396 bits (1017), Expect = e-107
 Identities = 212/333 (63%), Positives = 251/333 (75%), Gaps = 5/333 (1%)
 Frame = -1

Query: 995 MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 819
           MAEEQ+DY DEEYG  QK+QYQ  GAISALADE+ M EDDEYDDLYNDVN+ EGFLQ+ R
Sbjct: 1   MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60

Query: 818 SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQAKG 645
           SE P+  GGV +GG+Q Q+TD   ++  + G SQ+  IPGV V+   S+  A  P+Q   
Sbjct: 61  SEAPLPPGGVGNGGLQAQKTDVTETRV-QAGVSQESKIPGVSVQGKYSSAVAQFPEQQ-- 117

Query: 644 GFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGSSPLSDAGTGAPRVAT 465
                   +A+E ++ +TG+  S  MPP +G D + I+GK    S P  ++GT  P   T
Sbjct: 118 ----GQPPVAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTAGPTGVT 172

Query: 464 QIPINRPGL--NINRPMVNENMIRPVVENGATMLFVGELHWWTTDAELESVLSQYGRVKE 291
           Q+P N+  +  N NRPM NEN IRP VENG+TMLFVGELHWWTTDAELESVLSQYGRVKE
Sbjct: 173 QMPTNQISIKVNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQYGRVKE 232

Query: 290 IKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMGASY 111
           IKFFDERASGKSKGYCQVEF +  AA+ACKEGM+G+ FNGRACVV FASPQTLKQMGASY
Sbjct: 233 IKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAFASPQTLKQMGASY 292

Query: 110 LNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
           L+K+Q Q QSQ PGRRPMN+GVGRGGG+NYQ G
Sbjct: 293 LSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTG 325


>ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao]
            gi|508708844|gb|EOY00741.1| RNA-binding family protein
            isoform 6 [Theobroma cacao]
          Length = 602

 Score =  395 bits (1015), Expect = e-107
 Identities = 203/353 (57%), Positives = 254/353 (71%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+D+GDEEYG  QKMQYQ  GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 654
            LQRSE P+  GG+ S G++ Q  + P  +  E G SQ +NIPGV V+    N+ A  P++
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119

Query: 653  AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 531
             +           G +        KGSV E   + QV+  GF+       K+G+DP+ + 
Sbjct: 120  EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179

Query: 530  GKFVSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMIRPVVENGATMLFVGELH 351
             K  +  +   ++GTG P+    +P N+ G N+N P++NEN ++P +ENG TMLFVGELH
Sbjct: 180  QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++  +A+ CKEGMNG+ FNG
Sbjct: 240  WWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVV FASPQTLKQMGASY+NK Q Q+Q+Q  GRRP N+G+GRGG +NYQ G
Sbjct: 300  RACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG 351


>ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao]
            gi|508708843|gb|EOY00740.1| RNA-binding family protein
            isoform 5, partial [Theobroma cacao]
          Length = 656

 Score =  395 bits (1015), Expect = e-107
 Identities = 203/353 (57%), Positives = 254/353 (71%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+D+GDEEYG  QKMQYQ  GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 654
            LQRSE P+  GG+ S G++ Q  + P  +  E G SQ +NIPGV V+    N+ A  P++
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119

Query: 653  AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 531
             +           G +        KGSV E   + QV+  GF+       K+G+DP+ + 
Sbjct: 120  EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179

Query: 530  GKFVSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMIRPVVENGATMLFVGELH 351
             K  +  +   ++GTG P+    +P N+ G N+N P++NEN ++P +ENG TMLFVGELH
Sbjct: 180  QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++  +A+ CKEGMNG+ FNG
Sbjct: 240  WWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVV FASPQTLKQMGASY+NK Q Q+Q+Q  GRRP N+G+GRGG +NYQ G
Sbjct: 300  RACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG 351


>ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao]
            gi|508708842|gb|EOY00739.1| RNA-binding family protein
            isoform 4 [Theobroma cacao]
          Length = 697

 Score =  395 bits (1015), Expect = e-107
 Identities = 203/353 (57%), Positives = 254/353 (71%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+D+GDEEYG  QKMQYQ  GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 654
            LQRSE P+  GG+ S G++ Q  + P  +  E G SQ +NIPGV V+    N+ A  P++
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119

Query: 653  AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 531
             +           G +        KGSV E   + QV+  GF+       K+G+DP+ + 
Sbjct: 120  EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179

Query: 530  GKFVSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMIRPVVENGATMLFVGELH 351
             K  +  +   ++GTG P+    +P N+ G N+N P++NEN ++P +ENG TMLFVGELH
Sbjct: 180  QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++  +A+ CKEGMNG+ FNG
Sbjct: 240  WWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVV FASPQTLKQMGASY+NK Q Q+Q+Q  GRRP N+G+GRGG +NYQ G
Sbjct: 300  RACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG 351


>ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|590695496|ref|XP_007044905.1| RNA-binding family
            protein isoform 1 [Theobroma cacao]
            gi|590695500|ref|XP_007044906.1| RNA-binding family
            protein isoform 1 [Theobroma cacao]
            gi|508708839|gb|EOY00736.1| RNA-binding family protein
            isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1|
            RNA-binding family protein isoform 1 [Theobroma cacao]
            gi|508708841|gb|EOY00738.1| RNA-binding family protein
            isoform 1 [Theobroma cacao]
          Length = 652

 Score =  395 bits (1015), Expect = e-107
 Identities = 203/353 (57%), Positives = 254/353 (71%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MD MAEEQ+D+GDEEYG  QKMQYQ  GAI ALADE++MGEDDEYDDLYNDVN+GEGFLQ
Sbjct: 1    MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VERNDSNIRATVPDQ 654
            LQRSE P+  GG+ S G++ Q  + P  +  E G SQ +NIPGV V+    N+ A  P++
Sbjct: 61   LQRSEAPLQPGGLGSTGLKAQRNEAPEPRV-EAGGSQGLNIPGVSVQGKHPNVSARYPEK 119

Query: 653  AK-----------GGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 531
             +           G +        KGSV E   + QV+  GF+       K+G+DP+ + 
Sbjct: 120  EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179

Query: 530  GKFVSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVNENMIRPVVENGATMLFVGELH 351
             K  +  +   ++GTG P+    +P N+ G N+N P++NEN ++P +ENG TMLFVGELH
Sbjct: 180  QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVNHPVMNENQVQPPIENGPTMLFVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGR+KEIKFFDE+ASGKSKGYCQVEF++  +A+ CKEGMNG+ FNG
Sbjct: 240  WWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVV FASPQTLKQMGASY+NK Q Q+Q+Q  GRRP N+G+GRGG +NYQ G
Sbjct: 300  RACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSG 351


>ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca
            subsp. vesca]
          Length = 646

 Score =  392 bits (1007), Expect = e-106
 Identities = 202/344 (58%), Positives = 247/344 (71%), Gaps = 13/344 (3%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MDPM EEQ+DY +EEYG  QK+QYQ  GAI ALADE+ M EDDEYDDLYNDVN+GEGFLQ
Sbjct: 1    MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60

Query: 827  LQRSEP-VSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPG---------VVERNDSN 678
            + R EP +   GV +GG+Q Q+ + P  +  + GASQ+V  PG         V E+ D  
Sbjct: 61   MHRPEPPLPPAGVGNGGLQAQKNNVPEQRV-QGGASQEVKNPGFSVEGKYSSVPEQKDQP 119

Query: 677  IRATVPDQAKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGSSPLS 498
              + VP+ A    KG V+EM  + QV   GF+ +A M   +  D + ++GK  +G  P  
Sbjct: 120  PVSVVPEMASQ--KGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSM 177

Query: 497  DAGTGAPRVATQIPINRPGL--NINRPMVNENMIRPVVENGATMLFVGELHWWTTDAELE 324
            ++G+  P    Q+P N+  +  N+NRPMVNEN IRP VENG+  LFVGELHWWTTDAELE
Sbjct: 178  NSGSNGPPAVQQMPANQMNMKINVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELE 237

Query: 323  SVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFAS 144
             VLSQ+GR+KEIKFFDERASGKSKGYCQV+F++  AASACKEGM+G+ FNGRACVV FAS
Sbjct: 238  GVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAFAS 297

Query: 143  PQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
             QTLKQMG SY+NK+Q Q Q+Q  GRRPMNDG GRGG MN+QGG
Sbjct: 298  SQTLKQMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGG 341


>ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Citrus sinensis]
          Length = 655

 Score =  388 bits (996), Expect = e-105
 Identities = 208/353 (58%), Positives = 245/353 (69%), Gaps = 25/353 (7%)
 Frame = -1

Query: 995  MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 819
            MAEEQ+DY ++EYG  QKMQYQ GGAI ALADE+LMGEDDEYDDLYNDVN+G+G LQ Q+
Sbjct: 1    MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60

Query: 818  SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------RNDS 681
             E P    GV +G +Q ++TD P  +  + G SQ  NIPGV VE            +ND 
Sbjct: 61   PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDV 119

Query: 680  NIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGK 525
             +    P+   G +        KGSV E   +  V   GF+ S   P + GVDP+ + G+
Sbjct: 120  QVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGR 179

Query: 524  FVSGSSPLSDAGTGAPRVATQIPINRPGLN--INRPMVNENMIRPVVENGATMLFVGELH 351
              +  +P+ + G   P+ A  IP N+ G+N  +NR MVNEN IRP +ENG TMLFVGELH
Sbjct: 180  VANEPAPVLNPGAAGPQGAL-IPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGELH 238

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGR KEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH FNG
Sbjct: 239  WWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNG 298

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            R CVV FASPQTLKQMGASY+NK Q Q QSQ  G RPMNDG GRGG  NYQ G
Sbjct: 299  RPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSG 351


>ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina]
            gi|567891321|ref|XP_006438181.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
            gi|557540376|gb|ESR51420.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
            gi|557540377|gb|ESR51421.1| hypothetical protein
            CICLE_v10030917mg [Citrus clementina]
          Length = 655

 Score =  387 bits (994), Expect = e-105
 Identities = 207/353 (58%), Positives = 245/353 (69%), Gaps = 25/353 (7%)
 Frame = -1

Query: 995  MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 819
            MAEEQ+DY ++EYG  QKMQYQ GGAI ALADE+LMGEDDEYDDLYND+N+G+G LQ Q+
Sbjct: 1    MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60

Query: 818  SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV-VE------------RNDS 681
             E P    GV +G +Q ++TD P  +  + G SQ  NIPGV VE            +ND 
Sbjct: 61   PEAPPPSAGVGNGRLQVKKTDVPEQRV-QVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDV 119

Query: 680  NIRATVPDQAKGGF--------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGK 525
             +    P+   G +        KGSV E   +  V   GF+ S   P + GVDP+ + G+
Sbjct: 120  QVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGR 179

Query: 524  FVSGSSPLSDAGTGAPRVATQIPINRPGLN--INRPMVNENMIRPVVENGATMLFVGELH 351
              +  +P+ + G   P+ A  IP N+ G+N  +NR MVNEN IRP +ENG TMLFVGELH
Sbjct: 180  AANEPAPVLNPGAAGPQGAL-IPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGELH 238

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVLSQYGR KEIKFFDERASGKSKGYCQVEFF++ AA+ACK+GMNGH FNG
Sbjct: 239  WWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNG 298

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            R CVV FASPQTLKQMGASY+NK Q Q QSQ  G RPMNDG GRGG  NYQ G
Sbjct: 299  RPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSG 351


>ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis]
            gi|223546091|gb|EEF47594.1| RNA binding protein, putative
            [Ricinus communis]
          Length = 644

 Score =  376 bits (966), Expect = e-102
 Identities = 204/347 (58%), Positives = 243/347 (70%), Gaps = 19/347 (5%)
 Frame = -1

Query: 995  MAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQR 819
            MA+EQ+DY DEEYG  QK+QYQ  GAI ALA+E+ MGEDDEYDDLYNDVNIGE FLQ+ R
Sbjct: 1    MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNIGENFLQMHR 59

Query: 818  SE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGVVERNDSNIRATVPDQ-AKG 645
            SE P +   V +GG Q + ++       E G SQ +NIPGV   +  +     P+Q  KG
Sbjct: 60   SEAPPAPPSVGNGGFQPRNSN---DLRVESGGSQGLNIPGVAVESKYSTGTHFPEQNVKG 116

Query: 644  GFKGSV--------------LEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGSS 507
               GSV              +EM  + Q    GF+ S   P  IGVDP+ ++ K  +  +
Sbjct: 117  PEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPT 176

Query: 506  PLSDAGTGAPRVATQIPINRPGLNI--NRPMVNENMIRPVVENGATMLFVGELHWWTTDA 333
            P+ +AG   PRV  Q+P ++  +N+  NR   NEN IRP +ENG+TML+VGELHWWTTDA
Sbjct: 177  PVPNAGV--PRVIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDA 234

Query: 332  ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVT 153
            ELE+VLSQYG VKEIKFFDERASGKSKGYCQVEF+++ AA+ACKEGMNGH FNGRACVV 
Sbjct: 235  ELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVA 294

Query: 152  FASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            FAS QTLKQMGASY+NK Q Q QSQ  GRRPMNDG GRGG MNYQGG
Sbjct: 295  FASQQTLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGG 341


>ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            CG7185-like isoform X1 [Solanum tuberosum]
            gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and
            polyadenylation specificity factor subunit CG7185-like
            isoform X2 [Solanum tuberosum]
          Length = 648

 Score =  370 bits (949), Expect = e-100
 Identities = 199/353 (56%), Positives = 239/353 (67%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MDP A+EQLDYGDEEYG + KMQY   G I ALA++++MGEDDEYDDLYNDVNIGEGFLQ
Sbjct: 1    MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60

Query: 827  LQRSE-PVSLGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGVV-ERNDSNIRATVPDQ 654
            LQRSE PV      +G  Q Q+   P S+A   G S++  IPG+  E   +      P Q
Sbjct: 61   LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLG-SEEAKIPGIATEGKYAGTEVQFPQQ 119

Query: 653  ---------------AKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFV 519
                           A    + S + M    Q   +G++ S PMP KIG DP  +  K  
Sbjct: 120  KGEPVVERETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNA 179

Query: 518  SGSSPLSDAGTGAPRVATQIPINR----PGLNINRPMVNENMIRPVVENGATMLFVGELH 351
            S ++PL ++    PRV   +P N+      +N+N P+++E   RP +ENG TMLFVGELH
Sbjct: 180  SEATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAELESVL+QYG VKEIKFFDERASGKSKGYCQVEFF+  +A+ACKEGMNG+NFNG
Sbjct: 240  WWTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVV FA+PQT+KQMG+SY NKTQ Q QSQ  GRRPMN+GVGR GG NY  G
Sbjct: 300  RACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNEGVGR-GGPNYTPG 351


>gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus
            notabilis]
          Length = 636

 Score =  368 bits (944), Expect = 3e-99
 Identities = 203/355 (57%), Positives = 239/355 (67%), Gaps = 27/355 (7%)
 Frame = -1

Query: 995  MAEEQLDYGDEEYG-TQKMQYQ-SGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQ 822
            MAE+ +D+ DEEYG  QK QYQ SGGAISALADE+LMG+DDEYDDLYNDVN+GEGFLQLQ
Sbjct: 1    MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60

Query: 821  RSEPVSLGGVES--GGVQTQETDGPGSKAPEHGASQDVNIPGV----------------- 699
            RSE  SL        G+Q Q+ + P  +  E G SQ  NIPGV                 
Sbjct: 61   RSEAPSLPAAAGVGNGLQAQKRNFPEPRE-EIGGSQQPNIPGVSAEGRFSSAGSQFPGQQ 119

Query: 698  ----VERNDSNIRATVPDQAKGGFKGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQIS 531
                V++         PD A G  KG ++           GF+ S PM   +GVD + I 
Sbjct: 120  DGLKVDKKSEAGSMVYPDGASGSQKGRIV----------AGFQGSKPMLHSVGVDSSDIP 169

Query: 530  GKFVSGSSPLSDAGTGAPRVATQIPINRPGLNIN--RPMVNENMIRPVVENGATMLFVGE 357
            GK V+      ++G   PR    +  N+  +N N   P+VNEN IRP +ENG+TMLFVGE
Sbjct: 170  GKMVNEPIQAPNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENGSTMLFVGE 229

Query: 356  LHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNF 177
            LHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVE++++ AA ACKEGM+GH F
Sbjct: 230  LHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVF 289

Query: 176  NGRACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            NGRACVV FASPQTLKQMGA+Y++K QVQ QSQ  GRRP+NDGVGRGG  N+Q G
Sbjct: 290  NGRACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNFQSG 344


>ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa]
            gi|222852472|gb|EEE90019.1| RNA recognition
            motif-containing family protein [Populus trichocarpa]
          Length = 619

 Score =  355 bits (911), Expect = 2e-95
 Identities = 193/346 (55%), Positives = 233/346 (67%), Gaps = 23/346 (6%)
 Frame = -1

Query: 980  LDYGDEEYGTQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQRSE-PVS 804
            +DY +EE    KMQYQ  GAI ALA+E+ MGEDDEYDDLYNDVN+GE FLQ+  SE P  
Sbjct: 1    MDYEEEE----KMQYQGSGAIPALAEEE-MGEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55

Query: 803  LGGVESGGVQTQETDGPGSKAPEHGASQDVNIPG---VVERNDSNIRATVPDQAKGGF-- 639
               V +GG QT+          E G SQ + I G    VE   SN +A  P+Q +     
Sbjct: 56   PATVGNGGFQTRNAH---ESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAV 112

Query: 638  ---------------KGSVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGSSP 504
                           KG V+EM+ ++QV   GF+ S P+PP IGVDP+ +S K      P
Sbjct: 113  EAQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEP 172

Query: 503  LSDAGTGAPRVATQIPINRPGLN--INRPMVNENMIRPVVENGATMLFVGELHWWTTDAE 330
            L   G+  PR A Q+ +N+  ++  +NRP+VNEN +RP +ENG+T L+VGELHWWTTDAE
Sbjct: 173  LPITGSAGPRGAPQMQVNQMHMSADVNRPVVNENQVRPPIENGSTTLYVGELHWWTTDAE 232

Query: 329  LESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTF 150
            LES  SQ+GRVKEIKFFDERASGKSKGYCQV+F+E+ AA+ACKEGMNGH FNGR CVV F
Sbjct: 233  LESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAF 292

Query: 149  ASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            ASPQTLKQMGASY+NKTQ Q Q+Q  GR  MNDG GRGG  N+Q G
Sbjct: 293  ASPQTLKQMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSG 338


>gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus]
          Length = 639

 Score =  328 bits (841), Expect = 3e-87
 Identities = 186/353 (52%), Positives = 226/353 (64%), Gaps = 22/353 (6%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYG-TQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MDP+ +EQLDYGDEEYG  QKMQY  GGAI ALA+++++G+DDEYDDLYNDVN+GEGF+Q
Sbjct: 1    MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60

Query: 827  LQRSEPVSLGGVESGGVQTQETDGPGSKAPEHGASQDVN----------IPGVVERNDSN 678
            +QRSE      V +      +   PG++A E  ASQ+VN           P  V+ +D  
Sbjct: 61   MQRSEAPPPSAVGNNSFSISKNTAPGTRA-EAIASQEVNNGRVGNEGSYAPNGVQLSDQK 119

Query: 677  IRATV---PDQAKGGFKGSVL-EMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGS 510
               T    P Q     +   L E+A   Q    G++ S  M  K   D    S   V   
Sbjct: 120  NNLTAVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEP 179

Query: 509  SPLSDAGTGAPRVATQIPIN------RPGLNINRPMVNENMIRPVV-ENGATMLFVGELH 351
            + L    TG+ +   Q P N         +N+NR M +E +IRP   ENG  M++VGELH
Sbjct: 180  ASLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELH 239

Query: 350  WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFESVAASACKEGMNGHNFNG 171
            WWTTDAE+ESVL QYGRVKEIKFFDERASGKSKGYCQVEF++  AA+ACK+GM GH FNG
Sbjct: 240  WWTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNG 299

Query: 170  RACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
            RACVVT+A+PQT KQMGASY NK Q Q+QSQ+ GR PMNDG GRG G NY  G
Sbjct: 300  RACVVTYANPQTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGTNYPSG 351


>ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda]
            gi|548855834|gb|ERN13697.1| hypothetical protein
            AMTR_s00049p00146760 [Amborella trichopoda]
          Length = 659

 Score =  319 bits (818), Expect = 1e-84
 Identities = 198/379 (52%), Positives = 232/379 (61%), Gaps = 45/379 (11%)
 Frame = -1

Query: 1004 MDPMAEEQLDYGDEEYGT-QKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQ 828
            MDPMAEEQLDY DE+YG  QKM +Q+GGAISALADE+LMGEDDEYDDLYNDVN+G+GF+Q
Sbjct: 1    MDPMAEEQLDYEDEDYGANQKMPFQTGGAISALADEELMGEDDEYDDLYNDVNVGDGFMQ 60

Query: 827  -LQRSEPVSLGGVESGGVQTQETDGPGSKAPEHG--ASQDVNIPGVVER----------- 690
             LQ  EPV             E+ G G +AP+    ++  VNIPGV              
Sbjct: 61   SLQHQEPVQY-----------ESMGNGVQAPKEEPISTPPVNIPGVGHEEKGEKDAKLSG 109

Query: 689  -NDSNIRATVPDQAKG-------GFKGSVLEMAREIQVETTGFRDSAPMPPKIG------ 552
             +D + +    +QA         G K  V E   E Q + +GFR+ AP PP  G      
Sbjct: 110  FSDLDQKKAFQEQASNQLAGASSGLKIRVSEPVSEPQPQASGFRN-APAPPAKGSGFNTA 168

Query: 551  --VDPN----QISGKFVSGSSPLSDAGTGAPRVATQIPINRPGLNINRPMVN-------E 411
              +D N    Q S   V    P    G GA   A    +  PG N    +++       E
Sbjct: 169  GAMDANKQLAQTSSNAVPRVGPGPGPGIGAGPNANMNRMMGPGPNQAGAVIDTSARFGSE 228

Query: 410  NMIRPVV---ENGATMLFVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQ 240
            N  R      E+G TMLFVGEL WWTTDAELESVLSQYGRVK++KFFDERASGKSKGYCQ
Sbjct: 229  NSNRLSHGGGESGNTMLFVGELQWWTTDAELESVLSQYGRVKDLKFFDERASGKSKGYCQ 288

Query: 239  VEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMGASYLNKTQVQAQSQVPGRRP 60
            VEF++  AA+ACKE MNGH FNGRACVV FAS  TLKQ+  +YLNKTQ QAQ+Q  GRRP
Sbjct: 289  VEFYDPAAAAACKESMNGHVFNGRACVVAFASQHTLKQLTTNYLNKTQAQAQAQSQGRRP 348

Query: 59   MNDGVGRGGGMNYQGGADN 3
            MNDG GR GG +YQGG  N
Sbjct: 349  MNDGGGRAGGPSYQGGDRN 367


>ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa]
           gi|550329195|gb|ERP56065.1| hypothetical protein
           POPTR_0010s06150g [Populus trichocarpa]
          Length = 591

 Score =  311 bits (796), Expect = 5e-82
 Identities = 174/329 (52%), Positives = 207/329 (62%), Gaps = 6/329 (1%)
 Frame = -1

Query: 980 LDYGDEEYGTQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQRSE-PVS 804
           +D+ +EE    KMQYQ  GAI ALA+E+L GEDDEYDDLYNDVN+GE FLQ+  SE P  
Sbjct: 1   MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55

Query: 803 LGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV---VERNDSNIRATVPDQAKGGFKG 633
                +GG QT+          E G SQ +   G    VE   SN  A  P+Q + G   
Sbjct: 56  PATAGNGGFQTRNAH---ESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAG--- 109

Query: 632 SVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGSSPLSDAGTGAPRVATQIPI 453
                                    IGV+ N +        S ++  G+  PR   Q+ +
Sbjct: 110 -------------------------IGVEANDVGSIGYGDGSSVAQKGSAGPRGVPQMQV 144

Query: 452 NRPGLN--INRPMVNENMIRPVVENGATMLFVGELHWWTTDAELESVLSQYGRVKEIKFF 279
           N+  +N  +NRP+VNEN +RP +ENG T L+VGELHWWTTDAELESV SQYGRVKEIKFF
Sbjct: 145 NQMNMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFF 204

Query: 278 DERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMGASYLNKT 99
           DERASGKSKGYCQV+F+E+ AA+ACKEGMN H FNGR CVV FAS QTLKQMGASY++KT
Sbjct: 205 DERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKT 264

Query: 98  QVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
           Q Q Q Q  GR  MNDG+GRGG  NYQ G
Sbjct: 265 QGQPQPQSQGRGSMNDGMGRGGNANYQSG 293


>ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus
           trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition
           motif-containing family protein [Populus trichocarpa]
          Length = 573

 Score =  311 bits (796), Expect = 5e-82
 Identities = 174/329 (52%), Positives = 207/329 (62%), Gaps = 6/329 (1%)
 Frame = -1

Query: 980 LDYGDEEYGTQKMQYQSGGAISALADEDLMGEDDEYDDLYNDVNIGEGFLQLQRSE-PVS 804
           +D+ +EE    KMQYQ  GAI ALA+E+L GEDDEYDDLYNDVN+GE FLQ+  SE P  
Sbjct: 1   MDFEEEE----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAP 55

Query: 803 LGGVESGGVQTQETDGPGSKAPEHGASQDVNIPGV---VERNDSNIRATVPDQAKGGFKG 633
                +GG QT+          E G SQ +   G    VE   SN  A  P+Q + G   
Sbjct: 56  PATAGNGGFQTRNAH---ESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAG--- 109

Query: 632 SVLEMAREIQVETTGFRDSAPMPPKIGVDPNQISGKFVSGSSPLSDAGTGAPRVATQIPI 453
                                    IGV+ N +        S ++  G+  PR   Q+ +
Sbjct: 110 -------------------------IGVEANDVGSIGYGDGSSVAQKGSAGPRGVPQMQV 144

Query: 452 NRPGLN--INRPMVNENMIRPVVENGATMLFVGELHWWTTDAELESVLSQYGRVKEIKFF 279
           N+  +N  +NRP+VNEN +RP +ENG T L+VGELHWWTTDAELESV SQYGRVKEIKFF
Sbjct: 145 NQMNMNADVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIKFF 204

Query: 278 DERASGKSKGYCQVEFFESVAASACKEGMNGHNFNGRACVVTFASPQTLKQMGASYLNKT 99
           DERASGKSKGYCQV+F+E+ AA+ACKEGMN H FNGR CVV FAS QTLKQMGASY++KT
Sbjct: 205 DERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMSKT 264

Query: 98  QVQAQSQVPGRRPMNDGVGRGGGMNYQGG 12
           Q Q Q Q  GR  MNDG+GRGG  NYQ G
Sbjct: 265 QGQPQPQSQGRGSMNDGMGRGGNANYQSG 293

