BLASTX nr result
ID: Mentha28_contig00014324
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00014324
         (2262 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits)  Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...     701   0.0
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...     572   e-160
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...     343   2e-91
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...     341   7e-91
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...     341   1e-90
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...     340   2e-90
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...     335   4e-89
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...     332   3e-88
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...     332   3e-88
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...     332   3e-88
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...     332   3e-88
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...     330   2e-87
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...     328   5e-87
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...     327   1e-86
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...     325   4e-86
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...     312   4e-82
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...     295   5e-77
ref|XP_002312652.1| RNA recognition motif-containing family prot...     294   1e-76
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...
256 4e-65 ref|XP_002315647.1| RNA recognition motif-containing family prot... 256 4e-65 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 701 bits (1810), Expect = 0.0 Identities = 376/643 (58%), Positives = 416/643 (64%), Gaps = 6/643 (0%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MDPVTDEQLDYGDEEY GNQKMQYH GGAIPALAE+EMIG+ VGEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 1871 MQRSDTQVPSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKT 1692 MQRS+ PS VGN+ SK PGT E +A QEVNN +V EG+YA QK Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRAEAIASQEVNNGRVGNEGSYAPNGVQLSDQKN 120 Query: 1691 SLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPA 1512 +L GGP Q +DASQR RLPEVA++SQA H GYQGS M HK A D+MNNSE ++GEPA Sbjct: 121 NLTAVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPA 180 Query: 1511 PFMYTNMGNTKGAPQVPPNQM--NLNPNVNINRSMDDEYMVRPS-VENGNTMLFVGELHW 1341 +Y N G++KG PQ P N M N N NVN+NRSMDDEY++RPS ENGN M++VGELHW Sbjct: 181 SLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHW 240 Query: 1340 WTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGR 1161 WTTDAE+ESVLIQYG+VKEIKFFDERASGKSKGYCQVEFYDP+AA+ACK+GM GH FNGR Sbjct: 241 WTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGR 300 Query: 1160 ACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA-XXXXX 984 ACVV +A P T KQMGASY NK GRNP+ND AGRGNG NYPSGDA Sbjct: 301 ACVVTYANPQTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGTNYPSGDAGRNFGR 359 Query: 983 XXXXXXXNQPPNKXXXXXXXXXXXMI-NKNMI-XXXXXXXXXXXXXXXXXXXXXXXXXXX 810 NQ PN+ + NKNMI Sbjct: 360 GGGWGRGNQAPNRGPGAGPIRGRGGMGNKNMIGNAPGAGGGGAYGQGLNGPGFGGPPGMM 419 Query: 809 XXXXXXXXXFDLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXX 630 FDLAFMGRG GYG FSGP F GMLPPF GVNSMGLPGVAPHVNPAFF Sbjct: 420 HPQGMMGPGFDLAFMGRGGGYGGFSGPPFQGMLPPFQGVNSMGLPGVAPHVNPAFFGRGM 479 Query: 629 XXXXXXXXXXXXXXGPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGA 450 GPHSGMWND NMG 
WGGEEHGRESSYGGEDNASEYGYGE SHDK Sbjct: 480 NPNGMGMMGNPGMVGPHSGMWNDPNMGGWGGEEHGRESSYGGEDNASEYGYGEGSHDKSV 539 Query: 449 RSSAASREKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSG 270 RSSAA REKE+ SER++ P R K+ +SG Sbjct: 540 RSSAAPREKERTSEREY---PERKHREERENDGERNDRDSKYREEKDRYREHRHKERESG 596 Query: 269 YDDDWDKGQXXXXXXXSGAVPEDDHRSRSRDADYGKRRRLPSE 141 YDDDWD+GQ SGAV E+DHRSRSRDADYGKRRR+PSE Sbjct: 597 YDDDWDRGQSSRSRSRSGAVQEEDHRSRSRDADYGKRRRMPSE 639 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 572 bits (1475), Expect = e-160 Identities = 319/648 (49%), Positives = 370/648 (57%), Gaps = 11/648 (1%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MDP DEQLDYGDEEY G+ KMQYH G IPALAE+EM+GE +GEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 1871 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 +QRS+ VPSV GN Q K + P + G+ +E +A EG YA T FP QK Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQK 120 Query: 1694 TSLPGPGGPPQTMDASQRGRLPEVAH--NSQAGHSGYQGSASMPHKNAADQMNNSEKVIG 1521 + DA+Q+ R + NSQAG+SGYQGS MP K AD M EK Sbjct: 121 GEPVVERETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNAS 180 Query: 1520 EPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHW 1341 E P M + + + P +P NQ+N + NVN+N + E RPS+ENGNTMLFVGELHW Sbjct: 181 EATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHW 240 Query: 1340 WTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGR 1161 WTTDAE+ESVL QYG VKEIKFFDERASGKSKGYCQVEF+DP++A+ACKEGMNG++FNGR Sbjct: 241 WTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGR 300 Query: 1160 ACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDAXXXXXX 981 ACVVAFATP TIKQMG+SY NK GR P+N+ GRG P Sbjct: 301 
ACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNEGVGRGGPNYTPGDAGRNFGRG 360 Query: 980 XXXXXXNQPPNKXXXXXXXXXXXMI-NKNMIXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 804 PN+ + +KNM+ Sbjct: 361 SWGRGGPGMPNRGPGGGPVRGRGAMGSKNMMVNPGAGNGAGGAFGQGLAGPAFGGPPAGL 420 Query: 803 XXXXXXXF---DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXX 633 D +FMGRGAGYG FSGPAFPGM+PPF VN MGLPGVAPHVNPAFF Sbjct: 421 MHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRG 480 Query: 632 XXXXXXXXXXXXXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASH 462 GPH GMW DT+ G WGGEEHG RESSYGGEDNASEYGYGE SH Sbjct: 481 MAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVSH 540 Query: 461 DKGARSSAASREKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKD 282 DKGARSSA SREKE+ SERDWS N R K+ Sbjct: 541 DKGARSSAVSREKERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQKE 600 Query: 281 HDSGYDDDWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 +S Y++D+D+GQ A E+DHRSRSRD +YGKRRR PSE Sbjct: 601 RESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSRDTNYGKRRRAPSE 648 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 343 bits (880), Expect = 2e-91 Identities = 187/358 (52%), Positives = 227/358 (63%), Gaps = 7/358 (1%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1871 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 +QRS+ P +G++G+Q K P E Q +N V+ +G + A +P Q Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQD 120 Query: 1694 ----TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSE 1533 S P G G P SQ+GR+ E ++Q + G+QG +S HK D + Sbjct: 121 
GQPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQ 180 Query: 1532 KVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVG 1353 K+ PA + + G +GAP VPPNQM LN +N M E VRP +ENG TMLFVG Sbjct: 181 KIANVPAQSLNSGTGGPQGAPHVPPNQMGLN----VNHPMISENQVRPPIENGPTMLFVG 236 Query: 1352 ELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHS 1173 ELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEFYDP++A+ACKEGM+G+ Sbjct: 237 ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYM 296 Query: 1172 FNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 999 FNGRACVVAFA+P T+KQMGASY NK GR P ND GRG NY SGDA Sbjct: 297 FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQSGDA 353 Score = 206 bits (525), Expect = 3e-50 Identities = 113/220 (51%), Positives = 127/220 (57%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG YG F GP FPGMLP FP VN++GL GVAPHVNPAFF Sbjct: 435 DPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGG 494 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH GMW DT+MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 495 PGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 553 Query: 428 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDH---DSGYDDD 258 EKE+ S+R+WS N R H D YDDD Sbjct: 554 EKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 613 Query: 257 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 D+GQ A+PE+ RSRSRD DYGKRRRLPSE Sbjct: 614 LDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 653 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 341 bits (875), Expect = 7e-91 Identities = 181/358 (50%), Positives = 224/358 (62%), Gaps = 8/358 (2%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+DY +EEY G QKMQY GGAIPALA+EE++GE VG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 
1871 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 Q+ + PS VGN +Q K +VP ++ Q N V+ EG Y FP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 1694 -----TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 1536 + P G G P SQ+G + E H++ + G+QGS S P + D N Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMP 180 Query: 1535 EKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFV 1356 +V EPAP + +GA +P NQM +N +N+NR+M +E +RP +ENG TMLFV Sbjct: 181 GRVANEPAPVLNPGAAGPQGA-LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFV 237 Query: 1355 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1176 GELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH Sbjct: 238 GELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 297 Query: 1175 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 FNGR CVVAFA+P T+KQMGASY NK GR P+ND GRG NY SGD Sbjct: 298 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 225 bits (573), Expect = 8e-56 Identities = 117/220 (53%), Positives = 134/220 (60%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 428 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 258 EK++ SERDWS N R +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 257 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 WD+G A+P++DHRSRSRD DYGKRRRLPSE Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg 
[Citrus clementina] Length = 658 Score = 341 bits (874), Expect = 1e-90 Identities = 181/358 (50%), Positives = 224/358 (62%), Gaps = 8/358 (2%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+DY +EEY G QKMQY GGAIPALA+EE++GE VG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 1871 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 Q+ + PS VGN +Q K +VP ++ Q N V+ EG Y FP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 1694 -----TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 1536 + P G G P SQ+G + E H++ + G+QGS S P + D N Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMP 180 Query: 1535 EKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFV 1356 +V EPAP + +GA +P NQM +N +N+NR+M +E +RP +ENG TMLFV Sbjct: 181 GRVANEPAPVLNPGAAGPQGA-LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFV 237 Query: 1355 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1176 GELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH Sbjct: 238 GELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 297 Query: 1175 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 FNGR CVVAFA+P T+KQMGASY NK GR P+ND GRG NY SGD Sbjct: 298 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 225 bits (574), Expect = 6e-56 Identities = 117/220 (53%), Positives = 134/220 (60%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 428 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 258 EK++ SERDWS N R +D DS YDD+ Sbjct: 
559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 257 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 WD+G A+P++DHRSRSRD DYGKRRRLPSE Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 340 bits (872), Expect = 2e-90 Identities = 179/351 (50%), Positives = 227/351 (64%), Gaps = 1/351 (0%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MDP+ +EQ+DY +EEY G QK+QY + GAIPALA+EE + E VGEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 1871 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 M R + +P VGN G+Q K NVP ++G A QEV N + EG Y++ P QK Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRVQGGASQEVKNPGFSVEGKYSSV----PEQK 116 Query: 1694 TSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEP 1515 P P ASQ+GR+ E+ H++Q + G+QG+A+M AD + + K+ P Sbjct: 117 DQPPVSVVPEM---ASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGP 173 Query: 1514 APFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWT 1335 P M + Q+P NQMN+ +N+NR M +E +RP VENG+ LFVGELHWWT Sbjct: 174 IPSMNSGSNGPPAVQQMPANQMNMK--INVNRPMVNENQIRPPVENGSATLFVGELHWWT 231 Query: 1334 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRAC 1155 TDAE+E VL Q+G++KEIKFFDERASGKSKGYCQV+FYDP+AASACKEGM+G+ FNGRAC Sbjct: 232 TDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRAC 291 Query: 1154 VVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 VVAFA+ T+KQMG SY NK GR P+ND AGRG N+ GD Sbjct: 292 VVAFASSQTLKQMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGGD 342 Score = 193 bits (491), Expect = 2e-46 Identities = 108/221 (48%), Positives = 123/221 (55%), Gaps = 8/221 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG F GP FPGMLP FPGVN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMGS 485 Query: 599 
XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYG-YGEASHDKGARSSAAS 432 G H+ MWND +M W GEE RESSYGG+D SEYG YGEA+H+K RSSAA Sbjct: 486 SGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAAP 545 Query: 431 REKEKNSERDW---SSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDD 261 RE+E+ SER+W S R ++ D Y+D Sbjct: 546 RERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDHRRRERDVAYED 605 Query: 260 DWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 D D+G A+PEDDHRSRSRD DYGKRRRLPSE Sbjct: 606 DRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 646 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 335 bits (860), Expect = 4e-89 Identities = 193/355 (54%), Positives = 228/355 (64%), Gaps = 4/355 (1%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXV-GEGFL 1875 M+P+ EQ D+G+EEY G QKMQY+QGGAIPALA+EEMIGE GE F+ Sbjct: 1 MEPMNGEQFDFGEEEYGGGQKMQYNQGGAIPALADEEMIGEEDDEYDDLYNDVNVGESFM 60 Query: 1874 QMQRSDTQVPSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 Q+QR D+Q+P + + N GT E + +E N K A + A FP QK Sbjct: 61 QVQRPDSQIPPFKAEN-----RVNPSGTGDESIPSEEANASKYAGNRAFGPGALQFPEQK 115 Query: 1694 TSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEP 1515 L T+D SQ R NSQ SGYQGS + P+ DQ+ N +K +G+P Sbjct: 116 AGLNTTEETSVTVDRSQTVR------NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDP 168 Query: 1514 APFMYTNMG-NTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWW 1338 + + N+G +KGA VP N MN+ N N R +DDEY S ENGNTML+VGELHWW Sbjct: 169 SS-INPNVGVGSKGA--VPFNFMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWW 225 Query: 1337 TTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRA 1158 TTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEF+DP+AA ACKEGMNG+ FNGRA Sbjct: 226 TTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRA 285 Query: 1157 CVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRN-PVND-AAGRGNGANYPSGDA 999 CVVAFATP TIKQMGASY N+ GRN +ND AGRG G N+ GDA Sbjct: 286 CVVAFATPQTIKQMGASYMNRNQGQPQAQFPGRNAAMNDGGAGRGVGTNFSGGDA 340 Score = 123 bits (308), Expect = 4e-25 Identities = 62/96 (64%), Positives = 
69/96 (71%), Gaps = 4/96 (4%) Frame = -2 Query: 779 DLAFMGRGAGYGN-FSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXX 603 DLAFMGRGAGYG F+GPAFPGMLPPFP VN++GLPGVAPHVNPAFF Sbjct: 413 DLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFGRGMAPNGMGMMG 472 Query: 602 XXXXXGPHSGMWNDTNM-GAWGGEEHGR--ESSYGG 504 GP+SG+WND ++ G WGGEE GR ESSYGG Sbjct: 473 PSGMGGPYSGLWNDASVGGGWGGEEQGRGPESSYGG 508 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 332 bits (852), Expect = 3e-88 Identities = 176/359 (49%), Positives = 226/359 (62%), Gaps = 8/359 (2%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1871 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 1694 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 1536 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1535 EKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFV 1356 +K+ +PA + + G +G P VPPNQM N+N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFV 235 Query: 1355 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1176 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1175 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 999 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 172 bits (437), Expect = 5e-40 Identities = 84/133 (63%), Positives = 94/133 (70%), Gaps = 3/133 (2%) Frame = -2 
Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 428 EKEKNSERDWSSN 390 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 332 bits (852), Expect = 3e-88 Identities = 176/359 (49%), Positives = 226/359 (62%), Gaps = 8/359 (2%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1871 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 1694 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 1536 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1535 EKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFV 1356 +K+ +PA + + G +G P VPPNQM N+N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFV 235 Query: 1355 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1176 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1175 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 999 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score 
= 196 bits (497), Expect = 5e-47 Identities = 106/215 (49%), Positives = 122/215 (56%), Gaps = 7/215 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 428 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDH---DSGYDDD 258 EKE+ SER+WS N R H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 257 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRR 156 WD+GQ A+PE++HRSRSRD Y + + Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 332 bits (852), Expect = 3e-88 Identities = 176/359 (49%), Positives = 226/359 (62%), Gaps = 8/359 (2%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1871 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 1694 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 1536 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1535 EKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFV 1356 +K+ +PA + + G +G P VPPNQM N+N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFV 235 Query: 1355 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1176 
GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1175 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 999 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 172 bits (437), Expect = 5e-40 Identities = 84/133 (63%), Positives = 94/133 (70%), Gaps = 3/133 (2%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 428 EKEKNSERDWSSN 390 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 332 bits (852), Expect = 3e-88 Identities = 176/359 (49%), Positives = 226/359 (62%), Gaps = 8/359 (2%) Frame = -2 Query: 2051 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 1872 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 1871 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK 1695 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 1694 
TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 1536 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 1535 EKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFV 1356 +K+ +PA + + G +G P VPPNQM N+N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMG----TNVNHPVMNENQVQPPIENGPTMLFV 235 Query: 1355 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1176 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1175 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 999 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 214 bits (546), Expect = 1e-52 Identities = 115/220 (52%), Positives = 130/220 (59%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 428 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDH---DSGYDDD 258 EKE+ SER+WS N R H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 257 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 WD+GQ A+PE++HRSRSRD DYGK+RRLPSE Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLPSE 652 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 330 bits (845), Expect = 2e-87 Identities = 177/355 (49%), Positives = 219/355 (61%), Gaps = 8/355 (2%) Frame = -2 Query: 2042 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 1863 + +EQ+DY ++EY 
G QKMQY GGAIPALA+EE++GE VG+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 1862 SDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK--- 1695 + PS VGN +Q K +VP ++ Q N V+ EG Y + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQ 120 Query: 1694 --TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKV 1527 + P G G P SQ+G + E H++ + G+QGS S P + D N +V Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRV 180 Query: 1526 IGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGEL 1347 EPAP + +GA +P NQM +N NVN R M +E +RP +ENG TMLFVGEL Sbjct: 181 ANEPAPVLNPGAAGPQGA-LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGEL 237 Query: 1346 HWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFN 1167 HWWTTDAE+ESVL QYG+ KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FN Sbjct: 238 HWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFN 297 Query: 1166 GRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 GR CVVAFA+P T+KQMGASY NK G P+ND GRG NY SGD Sbjct: 298 GRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 229 bits (585), Expect = 3e-57 Identities = 120/220 (54%), Positives = 137/220 (62%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 555 Query: 428 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 258 EK++ SERDWS N R +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 257 WDKGQ-XXXXXXXSGAVPEDDHRSRSRDADYGKRRRLPSE 141 WD+GQ SGA+P++DHRSRSRD DYGKRRRLPSE Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_006438180.1| hypothetical protein 
CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 328 bits (842), Expect = 5e-87 Identities = 176/355 (49%), Positives = 218/355 (61%), Gaps = 8/355 (2%) Frame = -2 Query: 2042 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 1863 + +EQ+DY ++EY G QKMQY GGAIPALA+EE++GE VG+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 1862 SDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQK--- 1695 + PS VGN +Q K +VP ++ Q N V+ EG Y + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQ 120 Query: 1694 --TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKV 1527 + P G G P SQ+G + E H++ + G+QGS S P + D N + Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRA 180 Query: 1526 IGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGEL 1347 EPAP + +GA +P NQM +N NVN R M +E +RP +ENG TMLFVGEL Sbjct: 181 ANEPAPVLNPGAAGPQGA-LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGEL 237 Query: 1346 HWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFN 1167 HWWTTDAE+ESVL QYG+ KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FN Sbjct: 238 HWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFN 297 Query: 1166 GRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 GR CVVAFA+P T+KQMGASY NK G P+ND GRG NY SGD Sbjct: 298 GRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 229 bits (585), Expect = 3e-57 Identities = 120/220 (54%), Positives = 136/220 (61%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 599 
XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 GPH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEASH+KGARS+ ASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASR 555 Query: 428 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 258 EK++ SERDWS N R +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 257 WDKGQ-XXXXXXXSGAVPEDDHRSRSRDADYGKRRRLPSE 141 WD+GQ SGA+P++DHRSRSRD DYGKRRRLPSE Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 327 bits (839), Expect = 1e-86 Identities = 183/351 (52%), Positives = 222/351 (63%), Gaps = 3/351 (0%) Frame = -2 Query: 2042 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 1863 + DEQ+DY DEEY G QK+QY GAIPALAEEEM GE +GE FLQM R Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNIGENFLQMHR 59 Query: 1862 SDTQ-VPSVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKTSL 1686 S+ P VGN G Q +N +E Q +N VA E Y+ T FP Q Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN--DLRVESGGSQGLNIPGVAVESKYS-TGTHFPEQNVKG 116 Query: 1685 P--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPA 1512 P G G P +Q+ R+ E+ ++SQA + G+QGS S P D + + K+ +P Sbjct: 117 PEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPT 176 Query: 1511 PFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTT 1332 P N G + PQ+P +QMN+N ++ NRS +E +RP +ENG+TML+VGELHWWTT Sbjct: 177 PV--PNAGVPRVIPQLPASQMNMN--MDTNRSATNENQIRPPLENGSTMLYVGELHWWTT 232 Query: 1331 DAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACV 1152 DAE+E+VL QYG VKEIKFFDERASGKSKGYCQVEFYD +AA+ACKEGMNGH FNGRACV Sbjct: 233 DAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACV 292 Query: 1151 VAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 999 VAFA+ T+KQMGASY NK GR P+ND AGRG NY GDA Sbjct: 293 VAFASQQTLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGGDA 343 Score 
= 224 bits (572), Expect = 1e-55 Identities = 119/219 (54%), Positives = 136/219 (62%), Gaps = 6/219 (2%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRGAGYG F+GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGP 485 Query: 599 XXXXGPHSGMWNDTNMGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 426 GP++GMW+DT+MG WG E RESSYGG+D ASEYGYGE +H+KGARSSAASRE Sbjct: 486 SGMDGPNAGMWSDTSMGGWGEEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAASRE 545 Query: 425 KEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDW 255 KE+ SERDWS N R ++ DSGY+DDW Sbjct: 546 KERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYEDDW 605 Query: 254 DKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 D+GQ AVPE+D+RSRSRDADYGKRRRLPSE Sbjct: 606 DRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRRLPSE 644 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 325 bits (834), Expect = 4e-86 Identities = 181/361 (50%), Positives = 223/361 (61%), Gaps = 13/361 (3%) Frame = -2 Query: 2042 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 1863 + +EQLDY DEEY G QKM + GGAI ALA++E++GE VGEGFLQM R Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 1862 SDTQVPS-VVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYA------------A 1722 S+ PS V+ Q K +VP LE Q + V+ EG Y+ A Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMA 120 Query: 1721 TAAPFPVQKTSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMN 1542 P + L GP SQ+GR+ E+ H++Q + G+QGS +P K A+ + Sbjct: 121 VKGPEMGSTSHLDGPS-------VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSD 173 Query: 1541 NSEKVIGEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTML 1362 K+ E P + + G + PQ+ NQM +N VN+NR M +E +RP+V+NG TML Sbjct: 174 VHGKIANESTPVLNSGTGGPRAVPQMLSNQMGMN--VNVNRPMVNENQIRPAVDNGATML 231 Query: 1361 
FVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMN 1182 FVGELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+ACKEGMN Sbjct: 232 FVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMN 291 Query: 1181 GHSFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 G+ FNGRACVVAFA+P T+KQMGASY NK GR P+ND GRG G N GD Sbjct: 292 GYIFNGRACVVAFASPQTLKQMGASYMNK--TQAQSQSQGRRPMNDGVGRGGGMNMQGGD 349 Query: 1001 A 999 A Sbjct: 350 A 350 Score = 214 bits (546), Expect = 1e-52 Identities = 111/217 (51%), Positives = 128/217 (58%), Gaps = 4/217 (1%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG YG FSG AFPGM+P FP VN+MGL GVAPHVNPAFF Sbjct: 431 DPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGA 490 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 G H+GMW DT+MG WGGEEHG RESSYGG+D AS+YGYGE +H+K RS+ ASR Sbjct: 491 TGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASR 550 Query: 428 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDK 249 EKE+ SERDWS N R ++ D +DDWD+ Sbjct: 551 EKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDR 610 Query: 248 GQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 GQ AV ++DHRSRSRD DYGKRRRLPSE Sbjct: 611 GQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 647 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 312 bits (800), Expect = 4e-82 Identities = 173/348 (49%), Positives = 216/348 (62%), Gaps = 1/348 (0%) Frame = -2 Query: 2042 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 1863 + +EQ+DY DEEY G QK+QY GAI ALA+EE + E V EGFLQM R Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 1862 SDTQVP-SVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKTSL 1686 S+ +P VGN G+Q K +V T ++ QE V+ +G Y++ A FP Q+ Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ--- 117 Query: 1685 
PGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPF 1506 G PP VA + G +GY GS +MP D + + K E P Sbjct: 118 ---GQPP-------------VAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPS 160 Query: 1505 MYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDA 1326 M + G Q+P NQ+++ VN NR M +E +RP VENG+TMLFVGELHWWTTDA Sbjct: 161 MNSGTAGPTGVTQMPTNQISIK--VNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDA 218 Query: 1325 EIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVA 1146 E+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+DP+AA+ACKEGM+G+ FNGRACVVA Sbjct: 219 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVA 278 Query: 1145 FATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 FA+P T+KQMGASY +K GR P+N+ GRG G NY +GD Sbjct: 279 FASPQTLKQMGASYLSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTGD 326 Score = 224 bits (571), Expect = 1e-55 Identities = 117/221 (52%), Positives = 133/221 (60%), Gaps = 8/221 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG F GPAFPGML FP VN+MGL GVAPHVNPAFF Sbjct: 410 DPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGS 469 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 G H+GMWND +MG WGG+EHG RESSYGG+D ASEYGYGEA+H+KG RS+A SR Sbjct: 470 SGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSR 529 Query: 428 EKEKNSERDWSSNP----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDD 261 E+E+ SERDWS N R ++ D GY+D Sbjct: 530 ERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYED 589 Query: 260 DWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 DWD+GQ A+PEDDHRSRSRD DYGKRRRLPSE Sbjct: 590 DWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 630 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 295 bits (756), Expect = 5e-77 Identities = 166/354 (46%), Positives = 218/354 (61%), Gaps = 7/354 (1%) Frame = -2 Query: 2042 VTDEQLDYGDEEYAGNQKMQYH-QGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQ 1866 + ++ +D+ DEEY G QK QY GGAI ALA+EE++G+ VGEGFLQ+Q Sbjct: 1 
MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 1865 RSDT-QVPSVVG-NSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKT 1692 RS+ +P+ G +G+Q K N P E Q+ N V+ EG +++ + FP Q+ Sbjct: 61 RSEAPSLPAAAGVGNGLQAQKRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQD 120 Query: 1691 SLPGPGGPPQTMDASQRGRL--PEVAHNSQAGH--SGYQGSASMPHKNAADQMNNSEKVI 1524 L + S+ G + P+ A SQ G +G+QGS M H D + K++ Sbjct: 121 GL-------KVDKKSEAGSMVYPDGASGSQKGRIVAGFQGSKPMLHSVGVDSSDIPGKMV 173 Query: 1523 GEPAPFMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELH 1344 EP + +G + NQ +N NV+ + +E +RPS+ENG+TMLFVGELH Sbjct: 174 NEPIQAPNSGGAGPRGILPMQGNQTTVNANVS--HPIVNENQIRPSIENGSTMLFVGELH 231 Query: 1343 WWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNG 1164 WWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVE+YD +AA ACKEGM+GH FNG Sbjct: 232 WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNG 291 Query: 1163 RACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 RACVVAFA+P T+KQMGA+Y +K GR P+ND GRG N+ SGD Sbjct: 292 RACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNFQSGD 345 Score = 192 bits (488), Expect = 6e-46 Identities = 105/220 (47%), Positives = 119/220 (54%), Gaps = 7/220 (3%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG F+GPAFPGMLP FP VN+MG VAPHVNPAFF Sbjct: 425 DPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGS 484 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 429 G GMWND ++G WGGEEHG RESSYGG+D ASEYGYG+ +H+KG R Sbjct: 485 SLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR------ 538 Query: 428 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDD 258 E+ SERDWS N R K+ + Y+DD Sbjct: 539 --ERGSERDWSGNSERRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYRPKERELDYEDD 596 Query: 257 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 WD+GQ V ED HRSRSRD DYGKRRRLPSE Sbjct: 597 WDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRRRLPSE 636 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] 
gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 294 bits (753), Expect = 1e-76 Identities = 168/347 (48%), Positives = 207/347 (59%), Gaps = 5/347 (1%) Frame = -2 Query: 2027 LDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQRSDTQV 1848 +DY +EE KMQY GAIPALAEEEM GE VGE FLQM S+ Sbjct: 1 MDYEEEE-----KMQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54 Query: 1847 P-SVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKTSLPGPG- 1674 P + VGN G Q A+ G + A EG Y+ A FP QK Sbjct: 55 PPATVGNGGFQTRNAHESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVEA 114 Query: 1673 ---GPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPFM 1503 GP +Q+GR+ E++H+ Q + G+Q S +P D + S K EP P Sbjct: 115 QDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEPLP 174 Query: 1502 YTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAE 1323 T +GAPQ+ NQM+++ +VN R + +E VRP +ENG+T L+VGELHWWTTDAE Sbjct: 175 ITGSAGPRGAPQMQVNQMHMSADVN--RPVVNENQVRPPIENGSTTLYVGELHWWTTDAE 232 Query: 1322 IESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 1143 +ES Q+G+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMNGH FNGR CVVAF Sbjct: 233 LESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAF 292 Query: 1142 ATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 A+P T+KQMGASY NK GR +ND AGRG AN+ SGD Sbjct: 293 ASPQTLKQMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSGD 339 Score = 189 bits (479), Expect = 6e-45 Identities = 103/214 (48%), Positives = 118/214 (55%), Gaps = 1/214 (0%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG F+GP FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 421 DPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVS 480 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 420 GP+ GMW ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 481 SGMDGPNPGMW---------------ESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 525 Query: 419 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKGQX 240 + SERDWS N R 
++ DSGY+DD D+G Sbjct: 526 RGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHS 585 Query: 239 XXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 A PE+D+RSR+RD DYGKRRRLPSE Sbjct: 586 SSRARSRSRAAPEEDYRSRTRDVDYGKRRRLPSE 619 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 256 bits (653), Expect = 4e-65 Identities = 154/343 (44%), Positives = 187/343 (54%), Gaps = 1/343 (0%) Frame = -2 Query: 2027 LDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQRSDTQV 1848 +D+ +EE KMQY GAIPALAEEE+ GE VGE FLQM S+ Sbjct: 1 MDFEEEE-----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54 Query: 1847 P-SVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKTSLPGPGG 1671 P + GN G Q A+ G + + VA EG Y+ A FP QK + G Sbjct: 55 PPATAGNGGFQTRNAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGIG--- 111 Query: 1670 PPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPFMYTNM 1491 + G GY +S+ K +A Sbjct: 112 ----------------VEANDVGSIGYGDGSSVAQKGSA--------------------- 134 Query: 1490 GNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESV 1311 +G PQ+ NQMN+N +VN R + +E VRP +ENG T L+VGELHWWTTDAE+ESV Sbjct: 135 -GPRGVPQMQVNQMNMNADVN--RPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESV 191 Query: 1310 LIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPH 1131 QYG+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNGR CVVAFA+ Sbjct: 192 ASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQ 251 Query: 1130 TIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 T+KQMGASY +K GR +ND GRG ANY SGD Sbjct: 252 TLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 201 bits (512), Expect = 9e-49 Identities = 109/216 (50%), Positives = 123/216 (56%), Gaps = 3/216 (1%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMAS 435 Query: 599 
XXXXGPHSGMWNDTNMGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 426 GP+ G W DT+MG WG E RESSY G++ ASEYGYGE +H+KGARSS ASRE Sbjct: 436 SGMEGPNPGKWPDTSMGGWGEEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASRE 495 Query: 425 KEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKG 246 KE+ SERDWS N R ++ DSGY+DD D+G Sbjct: 496 KERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRG 555 Query: 245 QXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 A PE+D+RSRSRD DYGKRRR PSE Sbjct: 556 HSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 591 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 256 bits (653), Expect = 4e-65 Identities = 154/343 (44%), Positives = 187/343 (54%), Gaps = 1/343 (0%) Frame = -2 Query: 2027 LDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQRSDTQV 1848 +D+ +EE KMQY GAIPALAEEE+ GE VGE FLQM S+ Sbjct: 1 MDFEEEE-----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54 Query: 1847 P-SVVGNSGIQNSKANVPGTHLEGVALQEVNNIKVAEEGNYAATAAPFPVQKTSLPGPGG 1671 P + GN G Q A+ G + + VA EG Y+ A FP QK + G Sbjct: 55 PPATAGNGGFQTRNAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGIG--- 111 Query: 1670 PPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPFMYTNM 1491 + G GY +S+ K +A Sbjct: 112 ----------------VEANDVGSIGYGDGSSVAQKGSA--------------------- 134 Query: 1490 GNTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESV 1311 +G PQ+ NQMN+N +VN R + +E VRP +ENG T L+VGELHWWTTDAE+ESV Sbjct: 135 -GPRGVPQMQVNQMNMNADVN--RPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESV 191 Query: 1310 LIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPH 1131 QYG+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNGR CVVAFA+ Sbjct: 192 ASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQ 251 Query: 1130 TIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1002 T+KQMGASY +K GR +ND GRG ANY SGD Sbjct: 252 TLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 179 bits (453), Expect = 6e-42 
Identities = 102/214 (47%), Positives = 116/214 (54%), Gaps = 1/214 (0%) Frame = -2 Query: 779 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 600 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNG------ 429 Query: 599 XXXXGPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 420 GM + M G G+ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 430 -------MGMMASSGM---EGPNPGKESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 479 Query: 419 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRAKDHDSGYDDDWDKGQX 240 + SERDWS N R ++ DSGY+DD D+G Sbjct: 480 RVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHS 539 Query: 239 XXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 141 A PE+D+RSRSRD DYGKRRR PSE Sbjct: 540 SSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 573