BLASTX nr result

ID: Mentha22_contig00036409 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00036409
         (678 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   314   1e-83
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   243   5e-62
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   229   6e-58
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...   227   3e-57
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   225   9e-57
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   225   1e-56
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   223   3e-56
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   223   4e-56
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   221   2e-55
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   220   4e-55
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   219   5e-55
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   213   3e-53
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   213   3e-53
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   213   3e-53
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   213   3e-53
ref|XP_002312652.1| RNA recognition motif-containing family prot...   207   2e-51
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   205   1e-50
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   197   2e-48
emb|CBI16834.3| unnamed protein product [Vitis vinifera]              187   2e-45
ref|XP_006417146.1| hypothetical protein EUTSA_v10007191mg [Eutr...   179   7e-43

>gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus]
          Length = 639

 Score =  314 bits (805), Expect = 1e-83
 Identities = 152/227 (66%), Positives = 177/227 (77%), Gaps = 3/227 (1%)
 Frame = +3

Query: 6   SKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGR 185
           SK   PGT  E +A QEVNN +V  EG+YA        QK +L   GGP Q +DASQR R
Sbjct: 80  SKNTAPGTRAEAIASQEVNNGRVGNEGSYAPNGVQLSDQKNNLTAVGGPAQPVDASQRVR 139

Query: 186 LPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPN 365
           LPE+A++SQA H GYQGS  M HK A D+MNNSE ++GEPA L+Y N G++KG PQ P N
Sbjct: 140 LPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPASLVYPNTGSSKGVPQAPSN 199

Query: 366 QMNLNPNVNIN--RSMDDEYMVRPSV-ENGNTMLFVGELHWWTTDAEIESVLIQYGKVKE 536
            MN N NVN+N  RSMDDEY++RPS  ENGN M++VGELHWWTTDAE+ESVLIQYG+VKE
Sbjct: 200 LMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHWWTTDAEVESVLIQYGRVKE 259

Query: 537 IKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           IKFFDERASGKSKGYCQVEFYDP+AA+ACK+GM GH FNGRACVV +
Sbjct: 260 IKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGRACVVTY 306


>ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           CG7185-like isoform X1 [Solanum tuberosum]
           gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and
           polyadenylation specificity factor subunit CG7185-like
           isoform X2 [Solanum tuberosum]
          Length = 648

 Score =  243 bits (619), Expect = 5e-62
 Identities = 126/225 (56%), Positives = 151/225 (67%), Gaps = 2/225 (0%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188
           K + P +   G+  +E     +A EG YA T   FP QK          +  DA+Q+ R 
Sbjct: 82  KDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQKGEPVVERETERPADAAQKARP 141

Query: 189 PE--MAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPP 362
               M  NSQAG+SGYQGS  MP K  AD M   EK   E  PLM + +   +  P +P 
Sbjct: 142 SAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNASEATPLMNSVVPGPRVVPHMPT 201

Query: 363 NQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIK 542
           NQ+N + NVN+N  +  E   RPS+ENGNTMLFVGELHWWTTDAE+ESVL QYG VKEIK
Sbjct: 202 NQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHWWTTDAELESVLTQYGNVKEIK 261

Query: 543 FFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           FFDERASGKSKGYCQVEF+DP++A+ACKEGMNG++FNGRACVVAF
Sbjct: 262 FFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGRACVVAF 306


>ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca
           subsp. vesca]
          Length = 646

 Score =  229 bits (584), Expect = 6e-58
 Identities = 118/223 (52%), Positives = 150/223 (67%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188
           K NVP   ++G A QEV N   + EG Y++     P QK   P    P     ASQ+GR+
Sbjct: 82  KNNVPEQRVQGGASQEVKNPGFSVEGKYSSV----PEQKDQPPVSVVPEM---ASQKGRV 134

Query: 189 PEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQ 368
            EM H++Q  + G+QG+A+M     AD  + + K+   P P M +         Q+P NQ
Sbjct: 135 MEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSMNSGSNGPPAVQQMPANQ 194

Query: 369 MNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFF 548
           MN+   +N+NR M +E  +RP VENG+  LFVGELHWWTTDAE+E VL Q+G++KEIKFF
Sbjct: 195 MNMK--INVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELEGVLSQFGRIKEIKFF 252

Query: 549 DERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           DERASGKSKGYCQV+FYDP+AASACKEGM+G+ FNGRACVVAF
Sbjct: 253 DERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAF 295


>gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea]
          Length = 508

 Score =  227 bits (578), Expect = 3e-57
 Identities = 125/224 (55%), Positives = 146/224 (65%)
 Frame = +3

Query: 6   SKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGR 185
           ++ N  GT  E +  +E N  K A    +   A  FP QK  L        T+D SQ  R
Sbjct: 76  NRVNPSGTGDESIPSEEANASKYAGNRAFGPGALQFPEQKAGLNTTEETSVTVDRSQTVR 135

Query: 186 LPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPN 365
                 NSQ   SGYQGS + P+    DQ+ N +K +G+P+ +       +KGA  VP N
Sbjct: 136 ------NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDPSSINPNVGVGSKGA--VPFN 186

Query: 366 QMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKF 545
            MN+  N N  R +DDEY    S ENGNTML+VGELHWWTTDAEIESVLIQYGKVKEIKF
Sbjct: 187 FMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWWTTDAEIESVLIQYGKVKEIKF 246

Query: 546 FDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           FDERASGKSKGYCQVEF+DP+AA ACKEGMNG+ FNGRACVVAF
Sbjct: 247 FDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRACVVAF 290


>ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           CG7185-like isoform X1 [Citrus sinensis]
          Length = 658

 Score =  225 bits (574), Expect = 9e-57
 Identities = 117/230 (50%), Positives = 147/230 (63%), Gaps = 7/230 (3%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167
           K +VP   ++    Q  N   V+ EG Y      FP Q       + P  G G  P    
Sbjct: 82  KTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAVNRPNMGSGNYPDGAS 141

Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347
            SQ+G + E  H++   + G+QGS S P +   D  N   +V  EPAP++       +GA
Sbjct: 142 VSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA 201

Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527
             +P NQM +N  +N+NR+M +E  +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+
Sbjct: 202 -LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 258

Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF
Sbjct: 259 VKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 308


>ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina]
           gi|557540375|gb|ESR51419.1| hypothetical protein
           CICLE_v10030915mg [Citrus clementina]
          Length = 658

 Score =  225 bits (573), Expect = 1e-56
 Identities = 117/230 (50%), Positives = 147/230 (63%), Gaps = 7/230 (3%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167
           K +VP   ++    Q  N   V+ EG Y      FP Q       + P  G G  P    
Sbjct: 82  KTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAVNRPNMGSGNYPDGAS 141

Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347
            SQ+G + E  H++   + G+QGS S P +   D  N   +V  EPAP++       +GA
Sbjct: 142 VSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA 201

Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527
             +P NQM +N  +N+NR+M +E  +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+
Sbjct: 202 -LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 258

Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF
Sbjct: 259 VKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 308


>ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis
           vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED:
           uncharacterized protein LOC100268141 isoform 2 [Vitis
           vinifera]
          Length = 647

 Score =  223 bits (569), Expect = 3e-56
 Identities = 120/235 (51%), Positives = 150/235 (63%), Gaps = 12/235 (5%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYA------------ATAAPFPVQKTSLPGPGGP 152
           K +VP   LE    Q +    V+ EG Y+            A   P     + L GP   
Sbjct: 79  KTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAVKGPEMGSTSHLDGPS-- 136

Query: 153 PQTMDASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMG 332
                 SQ+GR+ EM H++Q  + G+QGS  +P K  A+  +   K+  E  P++ +  G
Sbjct: 137 -----VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANESTPVLNSGTG 191

Query: 333 NTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVL 512
             +  PQ+  NQM +N  VN+NR M +E  +RP+V+NG TMLFVGELHWWTTDAE+ESVL
Sbjct: 192 GPRAVPQMLSNQMGMN--VNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVL 249

Query: 513 IQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
            QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+ACKEGMNG+ FNGRACVVAF
Sbjct: 250 SQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRACVVAF 304


>ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao]
           gi|590695488|ref|XP_007044903.1| RNA-binding family
           protein isoform 1 [Theobroma cacao]
           gi|508708837|gb|EOY00734.1| RNA-binding family protein
           isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1|
           RNA-binding family protein isoform 1 [Theobroma cacao]
          Length = 653

 Score =  223 bits (568), Expect = 4e-56
 Identities = 119/229 (51%), Positives = 145/229 (63%), Gaps = 6/229 (2%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK----TSLP--GPGGPPQTMDA 170
           K   P    E    Q +N   V+ +G +    A +P Q      S P  G G  P     
Sbjct: 82  KNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQDGQPAVSRPEMGSGSYPSGTSI 141

Query: 171 SQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 350
           SQ+GR+ E   ++Q  + G+QG +S  HK   D     +K+   PA  + +  G  +GAP
Sbjct: 142 SQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQKIANVPAQSLNSGTGGPQGAP 201

Query: 351 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 530
            VPPNQM LN    +N  M  E  VRP +ENG TMLFVGELHWWTTDAE+ESVL QYG+V
Sbjct: 202 HVPPNQMGLN----VNHPMISENQVRPPIENGPTMLFVGELHWWTTDAELESVLSQYGRV 257

Query: 531 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           KEIKFFDERASGKSKGYCQVEFYDP++A+ACKEGM+G+ FNGRACVVAF
Sbjct: 258 KEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYMFNGRACVVAF 306


>ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           CG7185-like isoform X1 [Citrus sinensis]
          Length = 655

 Score =  221 bits (562), Expect = 2e-55
 Identities = 117/230 (50%), Positives = 145/230 (63%), Gaps = 7/230 (3%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167
           K +VP   ++    Q  N   V+ EG Y    + FP Q       + P  G G  P    
Sbjct: 79  KTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQVAVNRPNMGSGNYPDGAS 138

Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347
            SQ+G + E  H++   + G+QGS S P +   D  N   +V  EPAP++       +GA
Sbjct: 139 VSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA 198

Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527
             +P NQM +N NVN  R M +E  +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+
Sbjct: 199 -LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 255

Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
            KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF
Sbjct: 256 AKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 305


>ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis]
           gi|223546091|gb|EEF47594.1| RNA binding protein,
           putative [Ricinus communis]
          Length = 644

 Score =  220 bits (560), Expect = 4e-55
 Identities = 117/217 (53%), Positives = 147/217 (67%), Gaps = 2/217 (0%)
 Frame = +3

Query: 33  LEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLP--GPGGPPQTMDASQRGRLPEMAHN 206
           +E    Q +N   VA E  Y+ T   FP Q    P  G  G P     +Q+ R+ EM ++
Sbjct: 84  VESGGSQGLNIPGVAVESKYS-TGTHFPEQNVKGPEIGSVGYPDGSSIAQKTRVMEMTND 142

Query: 207 SQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQMNLNPN 386
           SQA + G+QGS S P     D  + + K+  +P P+   N G  +  PQ+P +QMN+N  
Sbjct: 143 SQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPTPV--PNAGVPRVIPQLPASQMNMN-- 198

Query: 387 VNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASG 566
           ++ NRS  +E  +RP +ENG+TML+VGELHWWTTDAE+E+VL QYG VKEIKFFDERASG
Sbjct: 199 MDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDAELENVLSQYGMVKEIKFFDERASG 258

Query: 567 KSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           KSKGYCQVEFYD +AA+ACKEGMNGH FNGRACVVAF
Sbjct: 259 KSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVAF 295


>ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina]
           gi|567891321|ref|XP_006438181.1| hypothetical protein
           CICLE_v10030917mg [Citrus clementina]
           gi|557540376|gb|ESR51420.1| hypothetical protein
           CICLE_v10030917mg [Citrus clementina]
           gi|557540377|gb|ESR51421.1| hypothetical protein
           CICLE_v10030917mg [Citrus clementina]
          Length = 655

 Score =  219 bits (559), Expect = 5e-55
 Identities = 116/230 (50%), Positives = 144/230 (62%), Gaps = 7/230 (3%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167
           K +VP   ++    Q  N   V+ EG Y    + FP Q       + P  G G  P    
Sbjct: 79  KTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQVAVNRPNMGSGNYPDGAS 138

Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347
            SQ+G + E  H++   + G+QGS S P +   D  N   +   EPAP++       +GA
Sbjct: 139 VSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRAANEPAPVLNPGAAGPQGA 198

Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527
             +P NQM +N NVN  R M +E  +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+
Sbjct: 199 -LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 255

Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
            KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF
Sbjct: 256 AKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 305


>ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao]
           gi|508708844|gb|EOY00741.1| RNA-binding family protein
           isoform 6 [Theobroma cacao]
          Length = 602

 Score =  213 bits (543), Expect = 3e-53
 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%)
 Frame = +3

Query: 21  PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179
           P   +E    Q +N   V+ +G +   +A +P +K   P    P       P     SQ+
Sbjct: 86  PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144

Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359
           G + E  H+ Q  + G+QG  S  +K   D     +K+  +PA  + +  G  +G P VP
Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204

Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539
           PNQM      N+N  + +E  V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI
Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260

Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF
Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306


>ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao]
           gi|508708843|gb|EOY00740.1| RNA-binding family protein
           isoform 5, partial [Theobroma cacao]
          Length = 656

 Score =  213 bits (543), Expect = 3e-53
 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%)
 Frame = +3

Query: 21  PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179
           P   +E    Q +N   V+ +G +   +A +P +K   P    P       P     SQ+
Sbjct: 86  PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144

Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359
           G + E  H+ Q  + G+QG  S  +K   D     +K+  +PA  + +  G  +G P VP
Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204

Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539
           PNQM      N+N  + +E  V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI
Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260

Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF
Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306


>ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao]
           gi|508708842|gb|EOY00739.1| RNA-binding family protein
           isoform 4 [Theobroma cacao]
          Length = 697

 Score =  213 bits (543), Expect = 3e-53
 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%)
 Frame = +3

Query: 21  PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179
           P   +E    Q +N   V+ +G +   +A +P +K   P    P       P     SQ+
Sbjct: 86  PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144

Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359
           G + E  H+ Q  + G+QG  S  +K   D     +K+  +PA  + +  G  +G P VP
Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204

Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539
           PNQM      N+N  + +E  V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI
Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260

Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF
Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306


>ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao]
           gi|590695496|ref|XP_007044905.1| RNA-binding family
           protein isoform 1 [Theobroma cacao]
           gi|590695500|ref|XP_007044906.1| RNA-binding family
           protein isoform 1 [Theobroma cacao]
           gi|508708839|gb|EOY00736.1| RNA-binding family protein
           isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1|
           RNA-binding family protein isoform 1 [Theobroma cacao]
           gi|508708841|gb|EOY00738.1| RNA-binding family protein
           isoform 1 [Theobroma cacao]
          Length = 652

 Score =  213 bits (543), Expect = 3e-53
 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%)
 Frame = +3

Query: 21  PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179
           P   +E    Q +N   V+ +G +   +A +P +K   P    P       P     SQ+
Sbjct: 86  PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144

Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359
           G + E  H+ Q  + G+QG  S  +K   D     +K+  +PA  + +  G  +G P VP
Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204

Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539
           PNQM      N+N  + +E  V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI
Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260

Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF
Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306


>ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus
           trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition
           motif-containing family protein [Populus trichocarpa]
          Length = 619

 Score =  207 bits (527), Expect = 2e-51
 Identities = 108/205 (52%), Positives = 136/205 (66%), Gaps = 4/205 (1%)
 Frame = +3

Query: 75  AEEGNYAATAAPFPVQKTSLPGPG----GPPQTMDASQRGRLPEMAHNSQAGHSGYQGSA 242
           A EG Y+   A FP QK           GP      +Q+GR+ EM+H+ Q  + G+Q S 
Sbjct: 90  AVEGIYSNAKAHFPEQKQVAVAVEAQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKST 149

Query: 243 SMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYM 422
            +P     D  + S K   EP PL  T     +GAPQ+  NQM+++ +VN  R + +E  
Sbjct: 150 PVPPGIGVDPSDMSRKNAIEPEPLPITGSAGPRGAPQMQVNQMHMSADVN--RPVVNENQ 207

Query: 423 VRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYD 602
           VRP +ENG+T L+VGELHWWTTDAE+ES   Q+G+VKEIKFFDERASGKSKGYCQV+FY+
Sbjct: 208 VRPPIENGSTTLYVGELHWWTTDAELESFASQFGRVKEIKFFDERASGKSKGYCQVDFYE 267

Query: 603 PSAASACKEGMNGHSFNGRACVVAF 677
            +AA+ACKEGMNGH FNGR CVVAF
Sbjct: 268 AAAAAACKEGMNGHVFNGRPCVVAF 292


>ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica]
           gi|462422613|gb|EMJ26876.1| hypothetical protein
           PRUPE_ppa002814mg [Prunus persica]
          Length = 630

 Score =  205 bits (521), Expect = 1e-50
 Identities = 112/223 (50%), Positives = 141/223 (63%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188
           K +V  T ++    QE     V+ +G Y++  A FP Q+      G PP           
Sbjct: 79  KTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ------GQPP----------- 121

Query: 189 PEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQ 368
             +A   + G +GY GS +MP     D  + + K   E  P M +      G  Q+P NQ
Sbjct: 122 --VAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTAGPTGVTQMPTNQ 178

Query: 369 MNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFF 548
           +++   VN NR M +E  +RP VENG+TMLFVGELHWWTTDAE+ESVL QYG+VKEIKFF
Sbjct: 179 ISIK--VNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQYGRVKEIKFF 236

Query: 549 DERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           DERASGKSKGYCQVEF+DP+AA+ACKEGM+G+ FNGRACVVAF
Sbjct: 237 DERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAF 279


>gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus
           notabilis]
          Length = 636

 Score =  197 bits (502), Expect = 2e-48
 Identities = 110/227 (48%), Positives = 141/227 (62%), Gaps = 4/227 (1%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188
           K N P    E    Q+ N   V+ EG +++  + FP Q+  L       +    S+ G +
Sbjct: 81  KRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQDGL-------KVDKKSEAGSM 133

Query: 189 --PEMAHNSQAGH--SGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQV 356
             P+ A  SQ G   +G+QGS  M H    D  +   K++ EP     +     +G   +
Sbjct: 134 VYPDGASGSQKGRIVAGFQGSKPMLHSVGVDSSDIPGKMVNEPIQAPNSGGAGPRGILPM 193

Query: 357 PPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKE 536
             NQ  +N NV+    + +E  +RPS+ENG+TMLFVGELHWWTTDAE+ESVL QYG+VKE
Sbjct: 194 QGNQTTVNANVS--HPIVNENQIRPSIENGSTMLFVGELHWWTTDAELESVLSQYGRVKE 251

Query: 537 IKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677
           IKFFDERASGKSKGYCQVE+YD +AA ACKEGM+GH FNGRACVVAF
Sbjct: 252 IKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNGRACVVAF 298


>emb|CBI16834.3| unnamed protein product [Vitis vinifera]
          Length = 491

 Score =  187 bits (476), Expect = 2e-45
 Identities = 108/232 (46%), Positives = 138/232 (59%), Gaps = 14/232 (6%)
 Frame = +3

Query: 9   KANVPGTHLEGVALQEVNNVKVAEEGNYA------------ATAAPFPVQKTSLPGPGGP 152
           K +VP   LE    Q +    V+ EG Y+            A   P     + L GP   
Sbjct: 61  KTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAVKGPEMGSTSHLDGPS-- 118

Query: 153 PQTMDASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMG 332
                 SQ+GR+ EM H++Q  + G+QGS  +P K  A+  +   K+  E  P++ +  G
Sbjct: 119 -----VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANESTPVLNSGTG 173

Query: 333 NTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVL 512
             +  PQ+  NQM +N  VN+NR M +E  +RP+V+NG TMLFVGELHWWTTDAE+ESVL
Sbjct: 174 GPRAVPQMLSNQMGMN--VNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVL 231

Query: 513 IQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASAC--KEGMNGHSFNGRA 662
            QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+A   KEG+      G A
Sbjct: 232 SQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAAFSGKEGILNRGPGGLA 283


>ref|XP_006417146.1| hypothetical protein EUTSA_v10007191mg [Eutrema salsugineum]
           gi|557094917|gb|ESQ35499.1| hypothetical protein
           EUTSA_v10007191mg [Eutrema salsugineum]
          Length = 578

 Score =  179 bits (454), Expect = 7e-43
 Identities = 88/133 (66%), Positives = 102/133 (76%), Gaps = 1/133 (0%)
 Frame = +3

Query: 282 SEKVIGEPAPLMYTNMGNT-KGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTML 458
           S   +  P P ++   G   +GA Q+P +QMN NPN  +NRS    ++V    +NGNTML
Sbjct: 153 SGNAVNVPEPPVHNPYGAVPQGAQQIPVSQMNANPNAMVNRSPTQPFVV----DNGNTML 208

Query: 459 FVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMN 638
           FVGELHWWTTDAEIESVL QYG+VKEIKFFDER SGKSKGYCQVEFYD +AA+ACKEGMN
Sbjct: 209 FVGELHWWTTDAEIESVLSQYGRVKEIKFFDERVSGKSKGYCQVEFYDSAAAAACKEGMN 268

Query: 639 GHSFNGRACVVAF 677
           G  FNG+ACVVAF
Sbjct: 269 GFVFNGKACVVAF 281

