BLASTX nr result
ID: Mentha27_contig00005761
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00005761
         (2262 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   674   0.0
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   548   e-153
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   320   2e-84
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   320   2e-84
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...   317   2e-83
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   316   3e-83
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   315   6e-83
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   310   2e-81
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   308   5e-81
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   306   2e-80
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   306   2e-80
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   306   2e-80
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   306   2e-80
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   303   2e-79
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   301   8e-79
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   289   3e-75
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   281   7e-73
ref|XP_002312652.1| RNA recognition motif-containing family prot...   277   2e-71
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...
237 2e-59 ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [A... 237 2e-59 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 674 bits (1738), Expect = 0.0 Identities = 359/643 (55%), Positives = 398/643 (61%), Gaps = 6/643 (0%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MDPVTDEQLDYGDE Y GNQKMQYH GGAIPALAE+EMIG+ GEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 392 MQRSDTQVPSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKT 571 MQRS+ PS VGN+ SK PGT E +A QEVNN +V EG+YA QK Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRAEAIASQEVNNGRVGNEGSYAPNGVQLSDQKN 120 Query: 572 SLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPA 751 +L GGP Q +DASQR RLPEVA++SQA H GYQGS M HK A D+MNNSE ++GEPA Sbjct: 121 NLTAVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPA 180 Query: 752 PLMYTNMGNTKGAP--XXXXXXXXXXXXXXXXRSMDDEYMVRPS-VENGNTMLFVGELHW 922 L+Y N G++KG P RSMDDEY++RPS ENGN M++VGELHW Sbjct: 181 SLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHW 240 Query: 923 WTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGR 1102 WTTDAE+ESVLIQYG+VKEIKFFDERASGKSKGYCQVEFYDP+AA+ACK+GM GH FNGR Sbjct: 241 WTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGR 300 Query: 1103 ACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA-XXXXX 1279 ACVV +A P T KQMGASY NK RNP+ND AGRGNG NYPSGDA Sbjct: 301 ACVVTYANPQTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGTNYPSGDAGRNFGR 359 Query: 1280 XXXXXXXXQPPNKXXXXXXXXXXXXI-NKNMI-XXXXXXXXXXXXXXXXXXXXXXXXXXX 1453 Q PN+ + NKNMI Sbjct: 360 GGGWGRGNQAPNRGPGAGPIRGRGGMGNKNMIGNAPGAGGGGAYGQGLNGPGFGGPPGMM 419 Query: 1454 XXXXXXXXXXDLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXX 1633 DLAFMGRG GYG FSGP F GMLPPF GVNSMGLPGVAPHVNPAFF Sbjct: 420 HPQGMMGPGFDLAFMGRGGGYGGFSGPPFQGMLPPFQGVNSMGLPGVAPHVNPAFFGRGM 479 Query: 1634 XXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGA 1813 PHSGMWND NMG WGGEEHGRESSYGGEDNASEYGYGE SHDK 
Sbjct: 480 NPNGMGMMGNPGMVGPHSGMWNDPNMGGWGGEEHGRESSYGGEDNASEYGYGEGSHDKSV 539 Query: 1814 RSSAASREKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSG 1993 RSSAA REKE+ SER++ P K+ +SG Sbjct: 540 RSSAAPREKERTSEREY---PERKHREERENDGERNDRDSKYREEKDRYREHRHKERESG 596 Query: 1994 YDDDWDKGQXXXXXXXXGAVPEDDHRSRSRDADYGKRRRLPSE 2122 YDDDWD+GQ GAV E+DHRSRSRDADYGKRRR+PSE Sbjct: 597 YDDDWDRGQSSRSRSRSGAVQEEDHRSRSRDADYGKRRRMPSE 639 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 548 bits (1411), Expect = e-153 Identities = 308/648 (47%), Positives = 354/648 (54%), Gaps = 11/648 (1%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MDP DEQLDYGDE Y G+ KMQYH G IPALAE+EM+GE GEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 392 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 +QRS+ VPSV GN Q K + P + G+ +E +A EG YA T FP QK Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQK 120 Query: 569 TSLPGPGGPPQTMDASQRGRLPEVAH--NSQAGHSGYQGSASMPHKNAADQMNNSEKVIG 742 + DA+Q+ R + NSQAG+SGYQGS MP K AD M EK Sbjct: 121 GEPVVERETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNAS 180 Query: 743 EPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHW 922 E PLM + + + P + E RPS+ENGNTMLFVGELHW Sbjct: 181 EATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHW 240 Query: 923 WTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGR 1102 WTTDAE+ESVL QYG VKEIKFFDERASGKSKGYCQVEF+DP++A+ACKEGMNG++FNGR Sbjct: 241 WTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGR 300 Query: 1103 ACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDAXXXXXX 1282 ACVVAFATP TIKQMG+SY NK R P+N+ GRG P Sbjct: 301 ACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNEGVGRGGPNYTPGDAGRNFGRG 
360 Query: 1283 XXXXXXXQPPNKXXXXXXXXXXXXI-NKNMIXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1459 PN+ + +KNM+ Sbjct: 361 SWGRGGPGMPNRGPGGGPVRGRGAMGSKNMMVNPGAGNGAGGAFGQGLAGPAFGGPPAGL 420 Query: 1460 XXXXXXXX---DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXX 1630 D +FMGRGAGYG FSGPAFPGM+PPF VN MGLPGVAPHVNPAFF Sbjct: 421 MHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRG 480 Query: 1631 XXXXXXXXXXXXXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASH 1801 PH GMW DT+ G WGGEEHG RESSYGGEDNASEYGYGE SH Sbjct: 481 MAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVSH 540 Query: 1802 DKGARSSAASREKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKD 1981 DKGARSSA SREKE+ SERDWS N K+ Sbjct: 541 DKGARSSAVSREKERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQKE 600 Query: 1982 HDSGYDDDWDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 +S Y++D+D+GQ A E+DHRSRSRD +YGKRRR PSE Sbjct: 601 RESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSRDTNYGKRRRAPSE 648 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 320 bits (820), Expect = 2e-84 Identities = 171/358 (47%), Positives = 211/358 (58%), Gaps = 8/358 (2%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+DY +E Y G QKMQY GGAIPALA+EE++GE G+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 392 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 Q+ + PS VGN +Q K +VP ++ Q N V+ EG Y FP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 569 -----TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 727 + P G G P SQ+G + E H++ + G+QGS S P + D N Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMP 180 Query: 728 EKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFV 907 +V EPAP++ +GA R+M +E +RP +ENG TMLFV Sbjct: 181 GRVANEPAPVLNPGAAGPQGA---LIPANQMGVNINVNRAMVNENQIRPPLENGGTMLFV 237 Query: 908 
GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1087 GELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH Sbjct: 238 GELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 297 Query: 1088 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 FNGR CVVAFA+P T+KQMGASY NK R P+ND GRG NY SGD Sbjct: 298 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 225 bits (573), Expect = 8e-56 Identities = 115/220 (52%), Positives = 132/220 (60%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 1835 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDD 2005 EK++ SERDWS N +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 2006 WDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 WD+G A+P++DHRSRSRD DYGKRRRLPSE Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 320 bits (819), Expect = 2e-84 Identities = 171/358 (47%), Positives = 211/358 (58%), Gaps = 8/358 (2%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+DY +E Y G QKMQY GGAIPALA+EE++GE G+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 392 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 Q+ + PS VGN +Q K +VP ++ Q N V+ EG Y FP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 569 
-----TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 727 + P G G P SQ+G + E H++ + G+QGS S P + D N Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMP 180 Query: 728 EKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFV 907 +V EPAP++ +GA R+M +E +RP +ENG TMLFV Sbjct: 181 GRVANEPAPVLNPGAAGPQGA---LIPANQMGVNINVNRAMVNENQIRPPLENGGTMLFV 237 Query: 908 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1087 GELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH Sbjct: 238 GELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 297 Query: 1088 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 FNGR CVVAFA+P T+KQMGASY NK R P+ND GRG NY SGD Sbjct: 298 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 225 bits (574), Expect = 6e-56 Identities = 115/220 (52%), Positives = 132/220 (60%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 1835 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDD 2005 EK++ SERDWS N +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 2006 WDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 WD+G A+P++DHRSRSRD DYGKRRRLPSE Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 317 bits (811), Expect = 2e-83 Identities = 182/354 (51%), Positives = 215/354 (60%), Gaps = 3/354 (0%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXX-GEGFL 388 M+P+ EQ D+G+E Y G QKMQY+QGGAIPALA+EEMIGE GE F+ Sbjct: 1 
MEPMNGEQFDFGEEEYGGGQKMQYNQGGAIPALADEEMIGEEDDEYDDLYNDVNVGESFM 60 Query: 389 QMQRSDTQVPSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 Q+QR D+Q+P + + N GT E + +E N K A + A FP QK Sbjct: 61 QVQRPDSQIPPFKAEN-----RVNPSGTGDESIPSEEANASKYAGNRAFGPGALQFPEQK 115 Query: 569 TSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEP 748 L T+D SQ R NSQ SGYQGS + P+ DQ+ N +K +G+P Sbjct: 116 AGLNTTEETSVTVDRSQTVR------NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDP 168 Query: 749 APLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHWWT 928 + + +KGA R +DDEY S ENGNTML+VGELHWWT Sbjct: 169 SSINPNVGVGSKGA--VPFNFMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWWT 226 Query: 929 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRAC 1108 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEF+DP+AA ACKEGMNG+ FNGRAC Sbjct: 227 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRAC 286 Query: 1109 VVAFATPHTIKQMGASYTNKXXXXXXXXXXXRN-PVND-AAGRGNGANYPSGDA 1264 VVAFATP TIKQMGASY N+ RN +ND AGRG G N+ GDA Sbjct: 287 VVAFATPQTIKQMGASYMNRNQGQPQAQFPGRNAAMNDGGAGRGVGTNFSGGDA 340 Score = 123 bits (308), Expect = 4e-25 Identities = 61/96 (63%), Positives = 68/96 (70%), Gaps = 4/96 (4%) Frame = +2 Query: 1484 DLAFMGRGAGYGN-FSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXX 1660 DLAFMGRGAGYG F+GPAFPGMLPPFP VN++GLPGVAPHVNPAFF Sbjct: 413 DLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFGRGMAPNGMGMMG 472 Query: 1661 XXXXXXPHSGMWNDTNM-GAWGGEEHGR--ESSYGG 1759 P+SG+WND ++ G WGGEE GR ESSYGG Sbjct: 473 PSGMGGPYSGLWNDASVGGGWGGEEQGRGPESSYGG 508 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 316 bits (809), Expect = 3e-83 Identities = 175/358 (48%), Positives = 214/358 (59%), Gaps = 7/358 (1%) Frame = +2 Query: 212 
MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+D+GDE Y G QKMQY GAIPALA+EEM+GE GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 392 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 +QRS+ P +G++G+Q K P E Q +N V+ +G + A +P Q Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQD 120 Query: 569 ----TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSE 730 S P G G P SQ+GR+ E ++Q + G+QG +S HK D + Sbjct: 121 GQPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQ 180 Query: 731 KVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVG 910 K+ PA + + G +GAP M E VRP +ENG TMLFVG Sbjct: 181 KIANVPAQSLNSGTGGPQGAPHVPPNQMGLNVN----HPMISENQVRPPIENGPTMLFVG 236 Query: 911 ELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHS 1090 ELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEFYDP++A+ACKEGM+G+ Sbjct: 237 ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYM 296 Query: 1091 FNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA 1264 FNGRACVVAFA+P T+KQMGASY NK R P ND GRG NY SGDA Sbjct: 297 FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQSGDA 353 Score = 206 bits (525), Expect = 3e-50 Identities = 111/220 (50%), Positives = 125/220 (56%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG YG F GP FPGMLP FP VN++GL GVAPHVNPAFF Sbjct: 435 DPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGG 494 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH GMW DT+MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 495 PGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 553 Query: 1835 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDH---DSGYDDD 2005 EKE+ S+R+WS N H D YDDD Sbjct: 554 EKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 613 Query: 2006 WDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 D+GQ A+PE+ RSRSRD DYGKRRRLPSE Sbjct: 614 
LDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 653 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 315 bits (807), Expect = 6e-83 Identities = 170/351 (48%), Positives = 214/351 (60%), Gaps = 1/351 (0%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MDP+ +EQ+DY +E Y G QK+QY + GAIPALA+EE + E GEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 392 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 M R + +P VGN G+Q K NVP ++G A QEV N + EG Y++ P QK Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRVQGGASQEVKNPGFSVEGKYSSV----PEQK 116 Query: 569 TSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEP 748 P P ASQ+GR+ E+ H++Q + G+QG+A+M AD + + K+ P Sbjct: 117 DQPPVSVVPEM---ASQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGP 173 Query: 749 APLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHWWT 928 P M N G+ R M +E +RP VENG+ LFVGELHWWT Sbjct: 174 IPSM--NSGSNGPPAVQQMPANQMNMKINVNRPMVNENQIRPPVENGSATLFVGELHWWT 231 Query: 929 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRAC 1108 TDAE+E VL Q+G++KEIKFFDERASGKSKGYCQV+FYDP+AASACKEGM+G+ FNGRAC Sbjct: 232 TDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRAC 291 Query: 1109 VVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 VVAFA+ T+KQMG SY NK R P+ND AGRG N+ GD Sbjct: 292 VVAFASSQTLKQMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGGD 342 Score = 193 bits (491), Expect = 2e-46 Identities = 106/221 (47%), Positives = 121/221 (54%), Gaps = 8/221 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG F GP FPGMLP FPGVN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMGS 485 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYG-YGEASHDKGARSSAAS 1831 H+ MWND +M W GEE RESSYGG+D SEYG YGEA+H+K RSSAA Sbjct: 486 SGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAAP 545 Query: 1832 
REKEKNSERDW---SSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDD 2002 RE+E+ SER+W S ++ D Y+D Sbjct: 546 RERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDHRRRERDVAYED 605 Query: 2003 DWDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 D D+G A+PEDDHRSRSRD DYGKRRRLPSE Sbjct: 606 DRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 646 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 310 bits (793), Expect = 2e-81 Identities = 166/355 (46%), Positives = 207/355 (58%), Gaps = 8/355 (2%) Frame = +2 Query: 221 VTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQR 400 + +EQ+DY ++ Y G QKMQY GGAIPALA+EE++GE G+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 401 SDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK--- 568 + PS VGN +Q K +VP ++ Q N V+ EG Y + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQ 120 Query: 569 --TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKV 736 + P G G P SQ+G + E H++ + G+QGS S P + D N +V Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRV 180 Query: 737 IGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGEL 916 EPAP++ +GA R M +E +RP +ENG TMLFVGEL Sbjct: 181 ANEPAPVLNPGAAGPQGA---LIPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGEL 237 Query: 917 HWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFN 1096 HWWTTDAE+ESVL QYG+ KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FN Sbjct: 238 HWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFN 297 Query: 1097 GRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 GR CVVAFA+P T+KQMGASY NK P+ND GRG NY SGD Sbjct: 298 GRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 229 bits (585), Expect = 3e-57 Identities = 117/220 (53%), Positives = 134/220 (60%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF 
Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 555 Query: 1835 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDD 2005 EK++ SERDWS N +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 2006 WDKGQ-XXXXXXXXGAVPEDDHRSRSRDADYGKRRRLPSE 2122 WD+GQ GA+P++DHRSRSRD DYGKRRRLPSE Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 308 bits (790), Expect = 5e-81 Identities = 165/355 (46%), Positives = 206/355 (58%), Gaps = 8/355 (2%) Frame = +2 Query: 221 VTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQR 400 + +EQ+DY ++ Y G QKMQY GGAIPALA+EE++GE G+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 401 SDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK--- 568 + PS VGN +Q K +VP ++ Q N V+ EG Y + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQ 120 Query: 569 --TSLP--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKV 736 + P G G P SQ+G + E H++ + G+QGS S P + D N + Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRA 180 Query: 737 IGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGEL 916 EPAP++ +GA R M +E +RP +ENG TMLFVGEL Sbjct: 181 ANEPAPVLNPGAAGPQGA---LIPANQMGVNANVNRVMVNENQIRPPLENGGTMLFVGEL 237 Query: 917 HWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFN 1096 HWWTTDAE+ESVL QYG+ KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH 
FN Sbjct: 238 HWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFN 297 Query: 1097 GRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 GR CVVAFA+P T+KQMGASY NK P+ND GRG NY SGD Sbjct: 298 GRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 229 bits (585), Expect = 3e-57 Identities = 117/220 (53%), Positives = 133/220 (60%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH GMW D++MG W GEEHG RESSYGG+D AS+YGYGEASH+KGARS+ ASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASR 555 Query: 1835 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDD 2005 EK++ SERDWS N +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 2006 WDKGQ-XXXXXXXXGAVPEDDHRSRSRDADYGKRRRLPSE 2122 WD+GQ GA+P++DHRSRSRD DYGKRRRLPSE Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 306 bits (785), Expect = 2e-80 Identities = 165/359 (45%), Positives = 214/359 (59%), Gaps = 8/359 (2%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+D+GDE Y G QKMQY GAIPALA+EEM+GE GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 392 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 569 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 727 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 
Query: 728 EKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFV 907 +K+ +PA + + G +G P + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFV 235 Query: 908 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1087 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1088 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA 1264 FNGRACVVAFA+P T+KQMGASY NK R P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 172 bits (437), Expect = 5e-40 Identities = 83/133 (62%), Positives = 93/133 (69%), Gaps = 3/133 (2%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1835 EKEKNSERDWSSN 1873 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 306 bits (785), Expect = 2e-80 Identities = 165/359 (45%), Positives = 214/359 (59%), Gaps = 8/359 (2%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+D+GDE Y G QKMQY GAIPALA+EEM+GE GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 392 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 569 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 727 P P P SQ+G 
+ E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 728 EKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFV 907 +K+ +PA + + G +G P + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFV 235 Query: 908 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1087 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1088 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA 1264 FNGRACVVAFA+P T+KQMGASY NK R P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 196 bits (497), Expect = 5e-47 Identities = 104/215 (48%), Positives = 120/215 (55%), Gaps = 7/215 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1835 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDH---DSGYDDD 2005 EKE+ SER+WS N H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 2006 WDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRR 2107 WD+GQ A+PE++HRSRSRD Y + + Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 306 bits (785), Expect = 2e-80 Identities = 165/359 (45%), Positives = 214/359 (59%), Gaps = 8/359 (2%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+D+GDE Y G QKMQY GAIPALA+EEM+GE GEGFLQ Sbjct: 1 
MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 392 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 569 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 727 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 728 EKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFV 907 +K+ +PA + + G +G P + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFV 235 Query: 908 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1087 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1088 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA 1264 FNGRACVVAFA+P T+KQMGASY NK R P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 172 bits (437), Expect = 5e-40 Identities = 83/133 (62%), Positives = 93/133 (69%), Gaps = 3/133 (2%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1835 EKEKNSERDWSSN 1873 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] 
gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 306 bits (785), Expect = 2e-80 Identities = 165/359 (45%), Positives = 214/359 (59%), Gaps = 8/359 (2%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MD + +EQ+D+GDE Y G QKMQY GAIPALA+EEM+GE GEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 392 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 568 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 569 TSLPGPGGP-------PQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNS 727 P P P SQ+G + E H+ Q + G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 728 EKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFV 907 +K+ +PA + + G +G P + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTNVN----HPVMNENQVQPPIENGPTMLFV 235 Query: 908 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1087 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1088 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA 1264 FNGRACVVAFA+P T+KQMGASY NK R P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 214 bits (546), Expect = 1e-52 Identities = 113/220 (51%), Positives = 128/220 (58%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 PH+GMW D +MG WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 
SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1835 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDH---DSGYDDD 2005 EKE+ SER+WS N H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 2006 WDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 WD+GQ A+PE++HRSRSRD DYGK+RRLPSE Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLPSE 652 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 303 bits (777), Expect = 2e-79 Identities = 174/351 (49%), Positives = 208/351 (59%), Gaps = 3/351 (0%) Frame = +2 Query: 221 VTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQR 400 + DEQ+DY DE Y G QK+QY GAIPALAEEEM GE GE FLQM R Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNIGENFLQMHR 59 Query: 401 SDTQ-VPSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSL 577 S+ P VGN G Q +N +E Q +N VA E Y+ T FP Q Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN--DLRVESGGSQGLNIPGVAVESKYS-TGTHFPEQNVKG 116 Query: 578 P--GPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPA 751 P G G P +Q+ R+ E+ ++SQA + G+QGS S P D + + K+ +P Sbjct: 117 PEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPT 176 Query: 752 PLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHWWTT 931 P+ N G + P RS +E +RP +ENG+TML+VGELHWWTT Sbjct: 177 PV--PNAGVPRVIPQLPASQMNMNMDTN--RSATNENQIRPPLENGSTMLYVGELHWWTT 232 Query: 932 DAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACV 1111 DAE+E+VL QYG VKEIKFFDERASGKSKGYCQVEFYD +AA+ACKEGMNGH FNGRACV Sbjct: 233 DAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACV 292 Query: 1112 VAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGDA 1264 VAFA+ T+KQMGASY NK R P+ND AGRG NY GDA Sbjct: 293 VAFASQQTLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGGDA 343 Score = 224 bits (572), Expect = 1e-55 Identities = 117/219 (53%), Positives = 134/219 (61%), Gaps = 6/219 (2%) Frame = +2 Query: 1484 
DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRGAGYG F+GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGP 485 Query: 1664 XXXXXPHSGMWNDTNMGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 1837 P++GMW+DT+MG WG E RESSYGG+D ASEYGYGE +H+KGARSSAASRE Sbjct: 486 SGMDGPNAGMWSDTSMGGWGEEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAASRE 545 Query: 1838 KEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDDW 2008 KE+ SERDWS N ++ DSGY+DDW Sbjct: 546 KERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYEDDW 605 Query: 2009 DKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 D+GQ AVPE+D+RSRSRDADYGKRRRLPSE Sbjct: 606 DRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRRLPSE 644 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 301 bits (771), Expect = 8e-79 Identities = 170/361 (47%), Positives = 210/361 (58%), Gaps = 13/361 (3%) Frame = +2 Query: 221 VTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQR 400 + +EQLDY DE Y G QKM + GGAI ALA++E++GE GEGFLQM R Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 401 SDTQVPS-VVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYA------------A 541 S+ PS V+ Q K +VP LE Q + V+ EG Y+ A Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMA 120 Query: 542 TAAPFPVQKTSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMN 721 P + L GP SQ+GR+ E+ H++Q + G+QGS +P K A+ + Sbjct: 121 VKGPEMGSTSHLDGPS-------VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSD 173 Query: 722 NSEKVIGEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTML 901 K+ E P++ + G + P R M +E +RP+V+NG TML Sbjct: 174 VHGKIANESTPVLNSGTGGPRAVPQMLSNQMGMNVNVN--RPMVNENQIRPAVDNGATML 231 Query: 902 FVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMN 1081 FVGELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+ACKEGMN Sbjct: 232 
FVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMN 291 Query: 1082 GHSFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 G+ FNGRACVVAFA+P T+KQMGASY NK R P+ND GRG G N GD Sbjct: 292 GYIFNGRACVVAFASPQTLKQMGASYMNK--TQAQSQSQGRRPMNDGVGRGGGMNMQGGD 349 Query: 1262 A 1264 A Sbjct: 350 A 350 Score = 214 bits (546), Expect = 1e-52 Identities = 109/217 (50%), Positives = 126/217 (58%), Gaps = 4/217 (1%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG YG FSG AFPGM+P FP VN+MGL GVAPHVNPAFF Sbjct: 431 DPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGA 490 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 H+GMW DT+MG WGGEEHG RESSYGG+D AS+YGYGE +H+K RS+ ASR Sbjct: 491 TGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASR 550 Query: 1835 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDDWDK 2014 EKE+ SERDWS N ++ D +DDWD+ Sbjct: 551 EKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDR 610 Query: 2015 GQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 GQ AV ++DHRSRSRD DYGKRRRLPSE Sbjct: 611 GQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 647 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 289 bits (740), Expect = 3e-75 Identities = 164/348 (47%), Positives = 202/348 (58%), Gaps = 1/348 (0%) Frame = +2 Query: 221 VTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQR 400 + +EQ+DY DE Y G QK+QY GAI ALA+EE + E EGFLQM R Sbjct: 1 MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 401 SDTQVP-SVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSL 577 S+ +P VGN G+Q K +V T ++ QE V+ +G Y++ A FP Q+ Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ--- 117 Query: 578 PGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPL 757 G PP VA + G +GY GS +MP D + + K E P Sbjct: 118 
---GQPP-------------VAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPS 160 Query: 758 MYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHWWTTDA 937 M N G R M +E +RP VENG+TMLFVGELHWWTTDA Sbjct: 161 M--NSGTAGPTGVTQMPTNQISIKVNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDA 218 Query: 938 EIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVA 1117 E+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+DP+AA+ACKEGM+G+ FNGRACVVA Sbjct: 219 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVA 278 Query: 1118 FATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 FA+P T+KQMGASY +K R P+N+ GRG G NY +GD Sbjct: 279 FASPQTLKQMGASYLSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTGD 326 Score = 224 bits (571), Expect = 1e-55 Identities = 115/221 (52%), Positives = 131/221 (59%), Gaps = 8/221 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG F GPAFPGML FP VN+MGL GVAPHVNPAFF Sbjct: 410 DPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGS 469 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 H+GMWND +MG WGG+EHG RESSYGG+D ASEYGYGEA+H+KG RS+A SR Sbjct: 470 SGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSR 529 Query: 1835 EKEKNSERDWSSNP----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDD 2002 E+E+ SERDWS N ++ D GY+D Sbjct: 530 ERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYED 589 Query: 2003 DWDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 DWD+GQ A+PEDDHRSRSRD DYGKRRRLPSE Sbjct: 590 DWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 630 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 281 bits (720), Expect = 7e-73 Identities = 159/354 (44%), Positives = 207/354 (58%), Gaps = 7/354 (1%) Frame = +2 Query: 221 VTDEQLDYGDEGYAGNQKMQYH-QGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQ 397 + ++ +D+ DE Y G QK QY GGAI ALA+EE++G+ GEGFLQ+Q Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 398 RSDT-QVPSVVG-NSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKT 571 RS+ +P+ 
G +G+Q K N P E Q+ N V+ EG +++ + FP Q+ Sbjct: 61 RSEAPSLPAAAGVGNGLQAQKRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQD 120 Query: 572 SLPGPGGPPQTMDASQRGRL--PEVAHNSQAGH--SGYQGSASMPHKNAADQMNNSEKVI 739 L + S+ G + P+ A SQ G +G+QGS M H D + K++ Sbjct: 121 GL-------KVDKKSEAGSMVYPDGASGSQKGRIVAGFQGSKPMLHSVGVDSSDIPGKMV 173 Query: 740 GEPAPLMYTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELH 919 EP + N G + +E +RPS+ENG+TMLFVGELH Sbjct: 174 NEP--IQAPNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENGSTMLFVGELH 231 Query: 920 WWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNG 1099 WWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVE+YD +AA ACKEGM+GH FNG Sbjct: 232 WWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNG 291 Query: 1100 RACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 RACVVAFA+P T+KQMGA+Y +K R P+ND GRG N+ SGD Sbjct: 292 RACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNFQSGD 345 Score = 192 bits (488), Expect = 6e-46 Identities = 103/220 (46%), Positives = 117/220 (53%), Gaps = 7/220 (3%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG F+GPAFPGMLP FP VN+MG VAPHVNPAFF Sbjct: 425 DPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGS 484 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1834 GMWND ++G WGGEEHG RESSYGG+D ASEYGYG+ +H+KG R Sbjct: 485 SLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR------ 538 Query: 1835 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDD 2005 E+ SERDWS N K+ + Y+DD Sbjct: 539 --ERGSERDWSGNSERRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYRPKERELDYEDD 596 Query: 2006 WDKGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 WD+GQ V ED HRSRSRD DYGKRRRLPSE Sbjct: 597 WDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRRRLPSE 636 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 277 bits (708), Expect = 2e-71 Identities = 158/340 (46%), 
Positives = 191/340 (56%), Gaps = 5/340 (1%) Frame = +2 Query: 257 YAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQRSDTQVP-SVVGN 433 Y +KMQY GAIPALAEEEM GE GE FLQM S+ P + VGN Sbjct: 3 YEEEEKMQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATVGN 61 Query: 434 SGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPG----GPPQ 601 G Q A+ G + A EG Y+ A FP QK GP Sbjct: 62 GGFQTRNAHESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVEAQDVGPVD 121 Query: 602 TMDASQRGRLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNT 781 +Q+GR+ E++H+ Q + G+Q S +P D + S K EP PL T Sbjct: 122 GSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEPLPITGSAGP 181 Query: 782 KGAPXXXXXXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQ 961 +GAP R + +E VRP +ENG+T L+VGELHWWTTDAE+ES Q Sbjct: 182 RGAPQMQVNQMHMSADVN--RPVVNENQVRPPIENGSTTLYVGELHWWTTDAELESFASQ 239 Query: 962 YGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIK 1141 +G+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMNGH FNGR CVVAFA+P T+K Sbjct: 240 FGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAFASPQTLK 299 Query: 1142 QMGASYTNKXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 QMGASY NK R +ND AGRG AN+ SGD Sbjct: 300 QMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSGD 339 Score = 189 bits (479), Expect = 6e-45 Identities = 101/214 (47%), Positives = 116/214 (54%), Gaps = 1/214 (0%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG F+GP FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 421 DPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVS 480 Query: 1664 XXXXXPHSGMWNDTNMGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 1843 P+ GMW ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 481 SGMDGPNPGMW---------------ESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 525 Query: 1844 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDDWDKGQX 2023 + SERDWS N ++ DSGY+DD D+G Sbjct: 526 RGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHS 585 Query: 2024 XXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 A PE+D+RSR+RD DYGKRRRLPSE Sbjct: 586 SSRARSRSRAAPEEDYRSRTRDVDYGKRRRLPSE 
619 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 237 bits (605), Expect = 2e-59 Identities = 143/332 (43%), Positives = 170/332 (51%), Gaps = 1/332 (0%) Frame = +2 Query: 269 QKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQMQRSDTQVP-SVVGNSGIQ 445 +KMQY GAIPALAEEE+ GE GE FLQM S+ P + GN G Q Sbjct: 7 EKMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPAPPATAGNGGFQ 65 Query: 446 NSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRG 625 A+ G + + VA EG Y+ A FP QK + G Sbjct: 66 TRNAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGIG-------------- 111 Query: 626 RLPEVAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPXXXX 805 + G GY +S+ K +A P M N N Sbjct: 112 -----VEANDVGSIGYGDGSSVAQKGSAGPRG---------VPQMQVNQMNMNA------ 151 Query: 806 XXXXXXXXXXXXRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIK 985 R + +E VRP +ENG T L+VGELHWWTTDAE+ESV QYG+VKEIK Sbjct: 152 ---------DVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESVASQYGRVKEIK 202 Query: 986 FFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGASYTN 1165 FFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNGR CVVAFA+ T+KQMGASY + Sbjct: 203 FFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQTLKQMGASYMS 262 Query: 1166 KXXXXXXXXXXXRNPVNDAAGRGNGANYPSGD 1261 K R +ND GRG ANY SGD Sbjct: 263 KTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 201 bits (512), Expect = 9e-49 Identities = 107/216 (49%), Positives = 121/216 (56%), Gaps = 3/216 (1%) Frame = +2 Query: 1484 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1663 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMAS 435 Query: 1664 XXXXXPHSGMWNDTNMGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 1837 P+ G W DT+MG WG E RESSY G++ ASEYGYGE +H+KGARSS ASRE Sbjct: 436 SGMEGPNPGKWPDTSMGGWGEEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASRE 495 Query: 1838 KEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDDWDKG 2017 KE+ 
SERDWS N ++ DSGY+DD D+G Sbjct: 496 KERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRG 555 Query: 2018 QXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 A PE+D+RSRSRD DYGKRRR PSE Sbjct: 556 HSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 591 >ref|XP_006852230.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] gi|548855834|gb|ERN13697.1| hypothetical protein AMTR_s00049p00146760 [Amborella trichopoda] Length = 659 Score = 237 bits (605), Expect = 2e-59 Identities = 148/372 (39%), Positives = 191/372 (51%), Gaps = 22/372 (5%) Frame = +2 Query: 212 MDPVTDEQLDYGDEGYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXXGEGFLQ 391 MDP+ +EQLDY DE Y NQKM + GGAI ALA+EE++GE G+GF+Q Sbjct: 1 MDPMAEEQLDYEDEDYGANQKMPFQTGGAISALADEELMGEDDEYDDLYNDVNVGDGFMQ 60 Query: 392 MQRSDTQVPSVVGNSGIQNSKA---NVPGTHLEGVALQE-------------VNNVKVAE 523 + V +G+Q K + P ++ GV +E ++ K + Sbjct: 61 SLQHQEPVQYESMGNGVQAPKEEPISTPPVNIPGVGHEEKGEKDAKLSGFSDLDQKKAFQ 120 Query: 524 EGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRLPEVAHNSQAGHSGYQGSASMP-HK 700 E A K + P PQ + R A A SG+ + +M +K Sbjct: 121 EQASNQLAGASSGLKIRVSEPVSEPQPQASGFRN-----APAPPAKGSGFNTAGAMDANK 175 Query: 701 NAADQMNNSEKVIGE-PAPLM----YTNMGNTKGAPXXXXXXXXXXXXXXXXRSMDDEYM 865 A +N+ +G P P + NM G S + + Sbjct: 176 QLAQTSSNAVPRVGPGPGPGIGAGPNANMNRMMGPGPNQAGAVIDTSARFG--SENSNRL 233 Query: 866 VRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYD 1045 E+GNTMLFVGEL WWTTDAE+ESVL QYG+VK++KFFDERASGKSKGYCQVEFYD Sbjct: 234 SHGGGESGNTMLFVGELQWWTTDAELESVLSQYGRVKDLKFFDERASGKSKGYCQVEFYD 293 Query: 1046 PSAASACKEGMNGHSFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXXRNPVNDAA 1225 P+AA+ACKE MNGH FNGRACVVAFA+ HT+KQ+ +Y NK R P+ND Sbjct: 294 PAAAAACKESMNGHVFNGRACVVAFASQHTLKQLTTNYLNKTQAQAQAQSQGRRPMNDGG 353 Query: 1226 GRGNGANYPSGD 1261 GR G +Y GD Sbjct: 354 GRAGGPSYQGGD 365 Score = 175 bits (443), Expect = 9e-41 Identities = 95/218 (43%), Positives = 118/218 (54%), Gaps = 7/218 (3%) Frame = +2 Query: 1490 AFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXXXX 1669 A +GRG+GYG FSGP FPGMLP F + 
++GLPGVAPHVNPAFF Sbjct: 446 AHLGRGSGYGGFSGPHFPGMLPSFSPMGTVGLPGVAPHVNPAFFGRGVSANGMGMMGSGA 505 Query: 1670 XXXPHSGMWNDTNMG---AWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAAS 1831 H GMW D++MG WG EEHG RESSY G+D AS+YGYG+ H++G S Sbjct: 506 MDGHHGGMWGDSSMGGGVGWGNEEHGRRTRESSY-GDDGASDYGYGDGGHERGGGRSNPG 564 Query: 1832 REKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDHDSGYDDDWD 2011 REK++ SERDWSS P ++ D +DDWD Sbjct: 565 REKDRGSERDWSSGP---ERRHRDDRDSDWDRDPRYKDEKDGYSDHRQRERDWDNEDDWD 621 Query: 2012 KGQXXXXXXXXG-AVPEDDHRSRSRDADYGKRRRLPSE 2122 +G+ + E+D RSRS+D DYGKRRR+PSE Sbjct: 622 RGRTSSRSRSKSRMMQEEDQRSRSKDVDYGKRRRVPSE 659