BLASTX nr result
ID: Mentha25_contig00004557
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00004557
         (1512 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   552   e-154
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   461   e-127
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   261   6e-67
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   259   2e-66
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   259   3e-66
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   258   5e-66
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   253   1e-64
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   253   1e-64
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   252   4e-64
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   251   5e-64
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...   251   6e-64
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   248   4e-63
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   248   4e-63
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   248   4e-63
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   248   4e-63
ref|XP_002312652.1| RNA recognition motif-containing family prot...   244   1e-61
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   243   1e-61
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...
231 6e-58 ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu... 211 5e-52 ref|XP_002315647.1| RNA recognition motif-containing family prot... 211 5e-52 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 552 bits (1423), Expect = e-154 Identities = 299/510 (58%), Positives = 329/510 (64%), Gaps = 6/510 (1%) Frame = -1 Query: 1512 QTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGN 1333 Q +DASQR RLPEVA++SQA H GYQGS M HK A D+MNNSE ++GEPA L+Y N G+ Sbjct: 130 QPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPASLVYPNTGS 189 Query: 1332 TKGAPQVPPNQMNLNPNVNIN--RSMDDEYMVRPSV-ENGNTMLFVGELHWWTTDAEIES 1162 +KG PQ P N MN N NVN+N RSMDDEY++RPS ENGN M++VGELHWWTTDAE+ES Sbjct: 190 SKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHWWTTDAEVES 249 Query: 1161 VLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATP 982 VLIQYG+VKEIKFFDERASGKSKGYCQVEFYDP+AA+ACK+GM GH FNGRACVV +A P Sbjct: 250 VLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGRACVVTYANP 309 Query: 981 HTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA-XXXXXXXXXXXXNQ 805 T KQMGASY NK GRNP+ND AGRGNG NYPSGDA NQ Sbjct: 310 QTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGTNYPSGDAGRNFGRGGGWGRGNQ 368 Query: 804 PPNKXXXXXXXXXXXMI-NKNMI-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 631 PN+ + NKNMI Sbjct: 369 APNRGPGAGPIRGRGGMGNKNMIGNAPGAGGGGAYGQGLNGPGFGGPPGMMHPQGMMGPG 428 Query: 630 FDLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXX 451 FDLAFMGRG GYG FSGP F GMLPPF GVNSMGLPGVAPHVNPAFF Sbjct: 429 FDLAFMGRGGGYGGFSGPPFQGMLPPFQGVNSMGLPGVAPHVNPAFFGRGMNPNGMGMMG 488 Query: 450 XXXXXGPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREK 271 GPHSGMWND NMG WGGEEHGRESSYGGEDNASEYGYGE SHDK RSSAA REK Sbjct: 489 NPGMVGPHSGMWNDPNMGGWGGEEHGRESSYGGEDNASEYGYGEGSHDKSVRSSAAPREK 548 Query: 270 EKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKGQ 91 E+ SER++ P R K+ +SGYDDDWD+GQ Sbjct: 549 ERTSEREY---PERKHREERENDGERNDRDSKYREEKDRYREHRHKERESGYDDDWDRGQ 605 Query: 90 
XXXXXXXSGAVPEDDHRSRSRDADYGKRRR 1 SGAV E+DHRSRSRDADYGKRRR Sbjct: 606 SSRSRSRSGAVQEEDHRSRSRDADYGKRRR 635 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 461 bits (1187), Expect = e-127 Identities = 256/511 (50%), Positives = 295/511 (57%), Gaps = 10/511 (1%) Frame = -1 Query: 1503 DASQRGRLPEVAH--NSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNT 1330 DA+Q+ R + NSQAG+SGYQGS MP K AD M EK E PLM + + Sbjct: 134 DAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNASEATPLMNSVVPGP 193 Query: 1329 KGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQ 1150 + P +P NQ+N + NVN+N + E RPS+ENGNTMLFVGELHWWTTDAE+ESVL Q Sbjct: 194 RVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHWWTTDAELESVLTQ 253 Query: 1149 YGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIK 970 YG VKEIKFFDERASGKSKGYCQVEF+DP++A+ACKEGMNG++FNGRACVVAFATP TIK Sbjct: 254 YGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGRACVVAFATPQTIK 313 Query: 969 QMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDAXXXXXXXXXXXXNQPPNKX 790 QMG+SY NK GR P+N+ GRG P PN+ Sbjct: 314 QMGSSYANKTQNQVQSQPQGRRPMNEGVGRGGPNYTPGDAGRNFGRGSWGRGGPGMPNRG 373 Query: 789 XXXXXXXXXXMI-NKNMIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXF---DL 622 + +KNM+ D Sbjct: 374 PGGGPVRGRGAMGSKNMMVNPGAGNGAGGAFGQGLAGPAFGGPPAGLMHPQGMMGPGFDP 433 Query: 621 AFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXX 442 +FMGRGAGYG FSGPAFPGM+PPF VN MGLPGVAPHVNPAFF Sbjct: 434 SFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRGMAANGMGMMSAAG 493 Query: 441 XXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASREK 271 GPH GMW DT+ G WGGEEHG RESSYGGEDNASEYGYGE SHDKGARSSA SREK Sbjct: 494 MDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVSHDKGARSSAVSREK 553 Query: 270 EKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKGQ 91 E+ SERDWS N R K+ +S Y++D+D+GQ Sbjct: 
554 ERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQKERESEYEEDYDRGQ 613 Query: 90 XXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 A E+DHRSRSRD +YGKRRR Sbjct: 614 SSSRSRSKSRAAQEEDHRSRSRDTNYGKRRR 644 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 261 bits (667), Expect = 6e-67 Identities = 129/217 (59%), Positives = 158/217 (72%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+GR+ E+ H++Q + G+QGS +P K A+ + K+ E P++ + G + P Sbjct: 138 SQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANESTPVLNSGTGGPRAVP 197 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 Q+ NQM +N VN+NR M +E +RP+V+NG TMLFVGELHWWTTDAE+ESVL QYG+V Sbjct: 198 QMLSNQMGMN--VNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVLSQYGRV 255 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEFYD SAA+ACKEGMNG+ FNGRACVVAFA+P T+KQMGA Sbjct: 256 KEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRACVVAFASPQTLKQMGA 315 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P+ND GRG G N GDA Sbjct: 316 SYMNK--TQAQSQSQGRRPMNDGVGRGGGMNMQGGDA 350 Score = 207 bits (526), Expect = 1e-50 Identities = 107/213 (50%), Positives = 124/213 (58%), Gaps = 4/213 (1%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG YG FSG AFPGM+P FP VN+MGL GVAPHVNPAFF Sbjct: 431 DPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGA 490 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 G H+GMW DT+MG WGGEEHG RESSYGG+D AS+YGYGE +H+K RS+ ASR Sbjct: 491 TGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASR 550 Query: 276 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDK 97 EKE+ SERDWS N R ++ D +DDWD+ Sbjct: 551 EKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDR 610 Query: 96 
GQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 GQ AV ++DHRSRSRD DYGKRRR Sbjct: 611 GQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRR 643 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 259 bits (662), Expect = 2e-66 Identities = 126/216 (58%), Positives = 154/216 (71%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H++ + G+QGS S P + D N +V EPAP++ +GA Sbjct: 143 SQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA- 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 +P NQM +N +N+NR+M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+V Sbjct: 202 LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRV 259 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAFA+P T+KQMGA Sbjct: 260 KEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGA 319 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 SY NK GR P+ND GRG NY SGD Sbjct: 320 SYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 217 bits (553), Expect = 1e-53 Identities = 113/216 (52%), Positives = 130/216 (60%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 276 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 106 EK++ SERDWS N R +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 105 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 WD+G A+P++DHRSRSRD DYGKRRR Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRR 654 >ref|XP_006438179.1| hypothetical protein 
CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 259 bits (661), Expect = 3e-66 Identities = 126/216 (58%), Positives = 154/216 (71%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H++ + G+QGS S P + D N +V EPAP++ +GA Sbjct: 143 SQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA- 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 +P NQM +N +N+NR+M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+V Sbjct: 202 LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRV 259 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAFA+P T+KQMGA Sbjct: 260 KEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGA 319 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 SY NK GR P+ND GRG NY SGD Sbjct: 320 SYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 218 bits (554), Expect = 7e-54 Identities = 113/216 (52%), Positives = 130/216 (60%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 276 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 106 EK++ SERDWS N R +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 105 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 WD+G A+P++DHRSRSRD DYGKRRR Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRR 654 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] 
gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 258 bits (659), Expect = 5e-66 Identities = 130/217 (59%), Positives = 153/217 (70%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+GR+ E ++Q + G+QG +S HK D +K+ PA + + G +GAP Sbjct: 142 SQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQKIANVPAQSLNSGTGGPQGAP 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 VPPNQM LN +N M E VRP +ENG TMLFVGELHWWTTDAE+ESVL QYG+V Sbjct: 202 HVPPNQMGLN----VNHPMISENQVRPPIENGPTMLFVGELHWWTTDAELESVLSQYGRV 257 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEFYDP++A+ACKEGM+G+ FNGRACVVAFA+P T+KQMGA Sbjct: 258 KEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYMFNGRACVVAFASPQTLKQMGA 317 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P ND GRG NY SGDA Sbjct: 318 SYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQSGDA 353 Score = 199 bits (505), Expect = 4e-48 Identities = 109/216 (50%), Positives = 123/216 (56%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG YG F GP FPGMLP FP VN++GL GVAPHVNPAFF Sbjct: 435 DPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGG 494 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH GMW DT+MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 495 PGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 553 Query: 276 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDH---DSGYDDD 106 EKE+ S+R+WS N R H D YDDD Sbjct: 554 EKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 613 Query: 105 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 D+GQ A+PE+ RSRSRD DYGKRRR Sbjct: 614 LDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRR 649 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] 
Length = 655 Score = 253 bits (647), Expect = 1e-64 Identities = 125/216 (57%), Positives = 150/216 (69%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H++ + G+QGS S P + D N +V EPAP++ +GA Sbjct: 140 SQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA- 198 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 +P NQM +N NVN R M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+ Sbjct: 199 LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRA 256 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAFA+P T+KQMGA Sbjct: 257 KEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGA 316 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 SY NK G P+ND GRG NY SGD Sbjct: 317 SYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 222 bits (565), Expect = 4e-55 Identities = 116/216 (53%), Positives = 133/216 (61%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 555 Query: 276 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 106 EK++ SERDWS N R +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 105 WDKGQ-XXXXXXXSGAVPEDDHRSRSRDADYGKRRR 1 WD+GQ SGA+P++DHRSRSRD DYGKRRR Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRR 651 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 253 bits (647), Expect = 1e-64 Identities = 126/217 (58%), Positives = 157/217 (72%) Frame = -1 Query: 1497 
SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 +Q+ R+ E+ ++SQA + G+QGS S P D + + K+ +P P+ N G + P Sbjct: 131 AQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPTPV--PNAGVPRVIP 188 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 Q+P +QMN+N ++ NRS +E +RP +ENG+TML+VGELHWWTTDAE+E+VL QYG V Sbjct: 189 QLPASQMNMN--MDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDAELENVLSQYGMV 246 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEFYD +AA+ACKEGMNGH FNGRACVVAFA+ T+KQMGA Sbjct: 247 KEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVAFASQQTLKQMGA 306 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P+ND AGRG NY GDA Sbjct: 307 SYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGGDA 343 Score = 217 bits (552), Expect = 1e-53 Identities = 115/215 (53%), Positives = 132/215 (61%), Gaps = 6/215 (2%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRGAGYG F+GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGP 485 Query: 447 XXXXGPHSGMWNDTNMGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 274 GP++GMW+DT+MG WG E RESSYGG+D ASEYGYGE +H+KGARSSAASRE Sbjct: 486 SGMDGPNAGMWSDTSMGGWGEEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAASRE 545 Query: 273 KEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDW 103 KE+ SERDWS N R ++ DSGY+DDW Sbjct: 546 KERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYEDDW 605 Query: 102 DKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 D+GQ AVPE+D+RSRSRDADYGKRRR Sbjct: 606 DRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRR 640 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 252 bits (643), Expect = 4e-64 Identities = 
124/216 (57%), Positives = 149/216 (68%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H++ + G+QGS S P + D N + EPAP++ +GA Sbjct: 140 SQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRAANEPAPVLNPGAAGPQGA- 198 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 +P NQM +N NVN R M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+ Sbjct: 199 LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGRA 256 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAFA+P T+KQMGA Sbjct: 257 KEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAFASPQTLKQMGA 316 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 SY NK G P+ND GRG NY SGD Sbjct: 317 SYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 222 bits (565), Expect = 4e-55 Identities = 116/216 (53%), Positives = 132/216 (61%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEASH+KGARS+ ASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASR 555 Query: 276 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 106 EK++ SERDWS N R +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 105 WDKGQ-XXXXXXXSGAVPEDDHRSRSRDADYGKRRR 1 WD+GQ SGA+P++DHRSRSRD DYGKRRR Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRR 651 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. 
vesca] Length = 646 Score = 251 bits (642), Expect = 5e-64 Identities = 121/217 (55%), Positives = 153/217 (70%) Frame = -1 Query: 1500 ASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 1321 ASQ+GR+ E+ H++Q + G+QG+A+M AD + + K+ P P M + Sbjct: 128 ASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSMNSGSNGPPAV 187 Query: 1320 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 1141 Q+P NQMN+ +N+NR M +E +RP VENG+ LFVGELHWWTTDAE+E VL Q+G+ Sbjct: 188 QQMPANQMNMK--INVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELEGVLSQFGR 245 Query: 1140 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMG 961 +KEIKFFDERASGKSKGYCQV+FYDP+AASACKEGM+G+ FNGRACVVAFA+ T+KQMG Sbjct: 246 IKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAFASSQTLKQMG 305 Query: 960 ASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 SY NK GR P+ND AGRG N+ GD Sbjct: 306 DSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGGD 342 Score = 186 bits (471), Expect = 3e-44 Identities = 104/217 (47%), Positives = 119/217 (54%), Gaps = 8/217 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG F GP FPGMLP FPGVN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMGS 485 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYG-YGEASHDKGARSSAAS 280 G H+ MWND +M W GEE RESSYGG+D SEYG YGEA+H+K RSSAA Sbjct: 486 SGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAAP 545 Query: 279 REKEKNSERDW---SSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDD 109 RE+E+ SER+W S R ++ D Y+D Sbjct: 546 RERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDHRRRERDVAYED 605 Query: 108 DWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 D D+G A+PEDDHRSRSRD DYGKRRR Sbjct: 606 DRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRR 642 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 251 bits (641), Expect = 6e-64 Identities = 138/223 (61%), Positives = 157/223 (70%), Gaps = 2/223 (0%) Frame = -1 Query: 1509 TMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNT 1330 T+D 
SQ R NSQ SGYQGS + P+ DQ+ N +K +G+P+ + + Sbjct: 127 TVDRSQTVR------NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDPSSINPNVGVGS 179 Query: 1329 KGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQ 1150 KGA VP N MN+ N N R +DDEY S ENGNTML+VGELHWWTTDAEIESVLIQ Sbjct: 180 KGA--VPFNFMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWWTTDAEIESVLIQ 237 Query: 1149 YGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIK 970 YGKVKEIKFFDERASGKSKGYCQVEF+DP+AA ACKEGMNG+ FNGRACVVAFATP TIK Sbjct: 238 YGKVKEIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRACVVAFATPQTIK 297 Query: 969 QMGASYTNKXXXXXXXXXXGRN-PVND-AAGRGNGANYPSGDA 847 QMGASY N+ GRN +ND AGRG G N+ GDA Sbjct: 298 QMGASYMNRNQGQPQAQFPGRNAAMNDGGAGRGVGTNFSGGDA 340 Score = 123 bits (308), Expect = 2e-25 Identities = 62/96 (64%), Positives = 69/96 (71%), Gaps = 4/96 (4%) Frame = -1 Query: 627 DLAFMGRGAGYGN-FSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXX 451 DLAFMGRGAGYG F+GPAFPGMLPPFP VN++GLPGVAPHVNPAFF Sbjct: 413 DLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFGRGMAPNGMGMMG 472 Query: 450 XXXXXGPHSGMWNDTNM-GAWGGEEHGR--ESSYGG 352 GP+SG+WND ++ G WGGEE GR ESSYGG Sbjct: 473 PSGMGGPYSGLWNDASVGGGWGGEEQGRGPESSYGG 508 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 248 bits (634), Expect = 4e-63 Identities = 123/217 (56%), Positives = 150/217 (69%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P Sbjct: 142 SQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPP 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 VPPNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++ Sbjct: 202 HVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRL 257 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAFA+P T+KQMGA Sbjct: 258 
KEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGA 317 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P N+ GRG NY SGDA Sbjct: 318 SYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 172 bits (437), Expect = 3e-40 Identities = 84/133 (63%), Positives = 94/133 (70%), Gaps = 3/133 (2%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 276 EKEKNSERDWSSN 238 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 248 bits (634), Expect = 4e-63 Identities = 123/217 (56%), Positives = 150/217 (69%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P Sbjct: 142 SQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPP 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 VPPNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++ Sbjct: 202 HVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRL 257 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAFA+P T+KQMGA Sbjct: 258 KEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGA 317 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P N+ GRG NY SGDA Sbjct: 318 SYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 196 bits (497), Expect = 3e-47 Identities = 106/215 (49%), Positives = 122/215 (56%), Gaps = 7/215 (3%) Frame = -1 Query: 627 
DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 276 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDH---DSGYDDD 106 EKE+ SER+WS N R H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 105 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRR 4 WD+GQ A+PE++HRSRSRD Y + + Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 248 bits (634), Expect = 4e-63 Identities = 123/217 (56%), Positives = 150/217 (69%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P Sbjct: 142 SQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPP 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 VPPNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++ Sbjct: 202 HVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRL 257 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAFA+P T+KQMGA Sbjct: 258 KEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGA 317 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P N+ GRG NY SGDA Sbjct: 318 SYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 172 bits (437), Expect = 3e-40 Identities = 84/133 (63%), Positives = 94/133 (70%), Gaps = 3/133 (2%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 
DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 276 EKEKNSERDWSSN 238 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 248 bits (634), Expect = 4e-63 Identities = 123/217 (56%), Positives = 150/217 (69%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 SQ+G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P Sbjct: 142 SQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPP 201 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 VPPNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++ Sbjct: 202 HVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRL 257 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAFA+P T+KQMGA Sbjct: 258 KEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAFASPQTLKQMGA 317 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 847 SY NK GR P N+ GRG NY SGDA Sbjct: 318 SYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 207 bits (526), Expect = 1e-50 Identities = 111/216 (51%), Positives = 126/216 (58%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 
DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 276 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDH---DSGYDDD 106 EKE+ SER+WS N R H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 105 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 WD+GQ A+PE++HRSRSRD DYGK+RR Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRR 648 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 244 bits (622), Expect = 1e-61 Identities = 121/216 (56%), Positives = 153/216 (70%) Frame = -1 Query: 1497 SQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 1318 +Q+GR+ E++H+ Q + G+Q S +P D + S K EP PL T +GAP Sbjct: 126 AQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEPLPITGSAGPRGAP 185 Query: 1317 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 1138 Q+ NQM+++ +VN R + +E VRP +ENG+T L+VGELHWWTTDAE+ES Q+G+V Sbjct: 186 QMQVNQMHMSADVN--RPVVNENQVRPPIENGSTTLYVGELHWWTTDAELESFASQFGRV 243 Query: 1137 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGA 958 KEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMNGH FNGR CVVAFA+P T+KQMGA Sbjct: 244 KEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAFASPQTLKQMGA 303 Query: 957 SYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 SY NK GR +ND AGRG AN+ SGD Sbjct: 304 SYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSGD 339 Score = 181 bits (459), Expect = 8e-43 Identities = 99/210 (47%), Positives = 114/210 (54%), Gaps = 1/210 (0%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG F+GP FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 421 DPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVS 480 Query: 447 
XXXXGPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 268 GP+ GMW ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 481 SGMDGPNPGMW---------------ESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 525 Query: 267 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKGQX 88 + SERDWS N R ++ DSGY+DD D+G Sbjct: 526 RGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHS 585 Query: 87 XXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 A PE+D+RSR+RD DYGKRRR Sbjct: 586 SSRARSRSRAAPEEDYRSRTRDVDYGKRRR 615 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 243 bits (621), Expect = 1e-61 Identities = 124/215 (57%), Positives = 152/215 (70%) Frame = -1 Query: 1494 QRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQ 1315 Q+G+ P VA + G +GY GS +MP D + + K E P M + G Q Sbjct: 116 QQGQ-PPVAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTAGPTGVTQ 173 Query: 1314 VPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVK 1135 +P NQ+++ VN NR M +E +RP VENG+TMLFVGELHWWTTDAE+ESVL QYG+VK Sbjct: 174 MPTNQISIK--VNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQYGRVK 231 Query: 1134 EIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGAS 955 EIKFFDERASGKSKGYCQVEF+DP+AA+ACKEGM+G+ FNGRACVVAFA+P T+KQMGAS Sbjct: 232 EIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAFASPQTLKQMGAS 291 Query: 954 YTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 Y +K GR P+N+ GRG G NY +GD Sbjct: 292 YLSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTGD 326 Score = 216 bits (551), Expect = 2e-53 Identities = 113/217 (52%), Positives = 129/217 (59%), Gaps = 8/217 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG F GPAFPGML FP VN+MGL GVAPHVNPAFF Sbjct: 410 DPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGS 469 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 G H+GMWND +MG WGG+EHG RESSYGG+D ASEYGYGEA+H+KG RS+A SR Sbjct: 470 
SGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSR 529 Query: 276 EKEKNSERDWSSNP----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDD 109 E+E+ SERDWS N R ++ D GY+D Sbjct: 530 ERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYED 589 Query: 108 DWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 DWD+GQ A+PEDDHRSRSRD DYGKRRR Sbjct: 590 DWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRR 626 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 231 bits (589), Expect = 6e-58 Identities = 117/212 (55%), Positives = 144/212 (67%), Gaps = 2/212 (0%) Frame = -1 Query: 1479 PEVAHNSQAGH--SGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPP 1306 P+ A SQ G +G+QGS M H D + K++ EP + +G + Sbjct: 136 PDGASGSQKGRIVAGFQGSKPMLHSVGVDSSDIPGKMVNEPIQAPNSGGAGPRGILPMQG 195 Query: 1305 NQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIK 1126 NQ +N NV+ + +E +RPS+ENG+TMLFVGELHWWTTDAE+ESVL QYG+VKEIK Sbjct: 196 NQTTVNANVS--HPIVNENQIRPSIENGSTMLFVGELHWWTTDAELESVLSQYGRVKEIK 253 Query: 1125 FFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGASYTN 946 FFDERASGKSKGYCQVE+YD +AA ACKEGM+GH FNGRACVVAFA+P T+KQMGA+Y + Sbjct: 254 FFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNGRACVVAFASPQTLKQMGAAYMS 313 Query: 945 KXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 K GR P+ND GRG N+ SGD Sbjct: 314 KNQVQNQSQPQGRRPINDGVGRGGNPNFQSGD 345 Score = 184 bits (468), Expect = 7e-44 Identities = 101/216 (46%), Positives = 115/216 (53%), Gaps = 7/216 (3%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG F+GPAFPGMLP FP VN+MG VAPHVNPAFF Sbjct: 425 DPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGS 484 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 277 G GMWND ++G WGGEEHG RESSYGG+D ASEYGYG+ +H+KG R Sbjct: 485 SLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR------ 538 Query: 276 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 106 E+ SERDWS N R K+ + Y+DD Sbjct: 539 
--ERGSERDWSGNSERRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYRPKERELDYEDD 596 Query: 105 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 WD+GQ V ED HRSRSRD DYGKRRR Sbjct: 597 WDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRRR 632 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 211 bits (538), Expect = 5e-52 Identities = 104/174 (59%), Positives = 125/174 (71%) Frame = -1 Query: 1371 GEPAPLMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELH 1192 G+ + + +G PQ+ NQMN+N +VN R + +E VRP +ENG T L+VGELH Sbjct: 123 GDGSSVAQKGSAGPRGVPQMQVNQMNMNADVN--RPVVNENQVRPPIENGPTTLYVGELH 180 Query: 1191 WWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNG 1012 WWTTDAE+ESV QYG+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNG Sbjct: 181 WWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNG 240 Query: 1011 RACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 R CVVAFA+ T+KQMGASY +K GR +ND GRG ANY SGD Sbjct: 241 RPCVVAFASAQTLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 196 bits (499), Expect = 2e-47 Identities = 106/212 (50%), Positives = 120/212 (56%), Gaps = 3/212 (1%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMAS 435 Query: 447 XXXXGPHSGMWNDTNMGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 274 GP+ G W DT+MG WG E RESSY G++ ASEYGYGE +H+KGARSS ASRE Sbjct: 436 SGMEGPNPGKWPDTSMGGWGEEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASRE 495 Query: 273 KEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKG 94 KE+ SERDWS N R ++ DSGY+DD D+G Sbjct: 496 KERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRG 555 Query: 93 QXXXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 A PE+D+RSRSRD DYGKRRR Sbjct: 556 HSSSRARSRSRAAPEEDYRSRSRDVDYGKRRR 587 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] 
gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 211 bits (538), Expect = 5e-52 Identities = 104/174 (59%), Positives = 125/174 (71%) Frame = -1 Query: 1371 GEPAPLMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELH 1192 G+ + + +G PQ+ NQMN+N +VN R + +E VRP +ENG T L+VGELH Sbjct: 123 GDGSSVAQKGSAGPRGVPQMQVNQMNMNADVN--RPVVNENQVRPPIENGPTTLYVGELH 180 Query: 1191 WWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNG 1012 WWTTDAE+ESV QYG+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNG Sbjct: 181 WWTTDAELESVASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNG 240 Query: 1011 RACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 850 R CVVAFA+ T+KQMGASY +K GR +ND GRG ANY SGD Sbjct: 241 RPCVVAFASAQTLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 174 bits (440), Expect = 1e-40 Identities = 99/210 (47%), Positives = 113/210 (53%), Gaps = 1/210 (0%) Frame = -1 Query: 627 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 448 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNG------ 429 Query: 447 XXXXGPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 268 GM + M G G+ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 430 -------MGMMASSGM---EGPNPGKESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 479 Query: 267 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKGQX 88 + SERDWS N R ++ DSGY+DD D+G Sbjct: 480 RVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHS 539 Query: 87 XXXXXXSG-AVPEDDHRSRSRDADYGKRRR 1 A PE+D+RSRSRD DYGKRRR Sbjct: 540 SSRARSRSRAAPEEDYRSRSRDVDYGKRRR 569