BLASTX nr result
ID: Mentha29_contig00004748
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00004748 (2977 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)   Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   674   0.0
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   550   e-153
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   327   3e-86
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   326   3e-86
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   321   1e-84
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   319   4e-84
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...   317   2e-83
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   316   4e-83
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   315   8e-83
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   310   3e-81
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   310   3e-81
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   310   3e-81
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   310   3e-81
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   306   4e-80
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   304   2e-79
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   295   1e-76
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...   287   2e-74
ref|XP_002312652.1| RNA recognition motif-containing family prot...   279   6e-72
ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Popu...
242 6e-61 ref|XP_002315647.1| RNA recognition motif-containing family prot... 242 6e-61 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 674 bits (1739), Expect = 0.0 Identities = 366/643 (56%), Positives = 408/643 (63%), Gaps = 6/643 (0%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MDPVTDEQLDYGDEEY GNQKMQYH GGAIPALAE+EMIG+ VGEGF+Q Sbjct: 1 MDPVTDEQLDYGDEEYGGNQKMQYHHGGAIPALAEDEMIGDDDEYDDLYNDVNVGEGFMQ 60 Query: 2619 MQRSDTQVPSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKT 2440 MQRS+ PS VGN+ SK PGT E +A QEVNN +V EG+YA QK Sbjct: 61 MQRSEAPPPSAVGNNSFSISKNTAPGTRAEAIASQEVNNGRVGNEGSYAPNGVQLSDQKN 120 Query: 2439 SLPGPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEPA 2260 +L GGP Q +DASQR RLPEVA++SQA + GYQGS M HK A D+MNNSE ++GEPA Sbjct: 121 NLTAVGGPAQPVDASQRVRLPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPA 180 Query: 2259 PLMYTNMGNTKGA--XXXXXXXXXXXXXXNINRSMDDEYMVRPS-VENGNTMLFVGELHW 2089 L+Y N G++KG N+NRSMDDEY++RPS ENGN M++VGELHW Sbjct: 181 SLVYPNTGSSKGVPQAPSNLMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHW 240 Query: 2088 WTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGR 1909 WTTDAE+ESVLIQYG+VKEIKFFDERASGKSKGYCQVEFYDP+AA+ACK+GM GH FNGR Sbjct: 241 WTTDAEVESVLIQYGRVKEIKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGR 300 Query: 1908 ACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA-XXXXX 1732 ACVV +A P T KQMGASY NK GRNP+ND AGRGNG NYPSGDA Sbjct: 301 ACVVTYANPQTSKQMGASY-NKNQGQSQSQLQGRNPMNDGAGRGNGTNYPSGDAGRNFGR 359 Query: 1731 XXXXXXXNQPPNKXXXXXXXXXXXMI-NKNMI-XXXXXXXXXXXXXXXXXXXXXXXXXXX 1558 NQ PN+ + NKNMI Sbjct: 360 GGGWGRGNQAPNRGPGAGPIRGRGGMGNKNMIGNAPGAGGGGAYGQGLNGPGFGGPPGMM 419 Query: 1557 XXXXXXXXXFDLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXX 1378 FDLAFMGRG GYG FSGP F GMLPPF GVNSMGLPGVAPHVNPAFF Sbjct: 420 HPQGMMGPGFDLAFMGRGGGYGGFSGPPFQGMLPPFQGVNSMGLPGVAPHVNPAFFGRGM 479 Query: 1377 XXXXXXXXXXXXXXGPHSGMWNDTNIGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGA 1198 GPHSGMWND N+G 
WGGEEHGRESSYGGEDNASEYGYGE SHDK Sbjct: 480 NPNGMGMMGNPGMVGPHSGMWNDPNMGGWGGEEHGRESSYGGEDNASEYGYGEGSHDKSV 539 Query: 1197 RSSAASREKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSG 1018 RSSAA REKE+ SER++ P R K+ +SG Sbjct: 540 RSSAAPREKERTSEREY---PERKHREERENDGERNDRDSKYREEKDRYREHRHKERESG 596 Query: 1017 YDDDWDKGQXXXXXXXSGAVPEDDHRSRSRDADYGKRRRLPSE 889 YDDDWD+GQ SGAV E+DHRSRSRDADYGKRRR+PSE Sbjct: 597 YDDDWDRGQSSRSRSRSGAVQEEDHRSRSRDADYGKRRRMPSE 639 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 550 bits (1416), Expect = e-153 Identities = 313/648 (48%), Positives = 360/648 (55%), Gaps = 11/648 (1%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MDP DEQLDYGDEEY G+ KMQYH G IPALAE+EM+GE +GEGFLQ Sbjct: 1 MDPTADEQLDYGDEEYGGSHKMQYHGSGTIPALAEDEMMGEDDEYDDLYNDVNIGEGFLQ 60 Query: 2619 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 +QRS+ VPSV GN Q K + P + G+ +E +A EG YA T FP QK Sbjct: 61 LQRSEVPVPSVDAGNGNFQAQKDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQK 120 Query: 2442 TSLPGPGGPPQAMDASQRGRLPEVAH--NSQAGYSGYQGSASMPHKNAADQMNNSEKVIG 2269 + DA+Q+ R + NSQAG SGYQGS MP K AD M EK Sbjct: 121 GEPVVERETERPADAAQKARPSAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNAS 180 Query: 2268 EPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHW 2089 E PLM + + + N+N + E RPS+ENGNTMLFVGELHW Sbjct: 181 EATPLMNSVVPGPRVVPHMPTNQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHW 240 Query: 2088 WTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGR 1909 WTTDAE+ESVL QYG VKEIKFFDERASGKSKGYCQVEF+DP++A+ACKEGMNG++FNGR Sbjct: 241 WTTDAELESVLTQYGNVKEIKFFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGR 300 Query: 1908 ACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDAXXXXXX 1729 ACVVAFATP TIKQMG+SY NK GR P+N+ GRG P Sbjct: 301 
ACVVAFATPQTIKQMGSSYANKTQNQVQSQPQGRRPMNEGVGRGGPNYTPGDAGRNFGRG 360 Query: 1728 XXXXXXNQPPNKXXXXXXXXXXXMI-NKNMIXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1552 PN+ + +KNM+ Sbjct: 361 SWGRGGPGMPNRGPGGGPVRGRGAMGSKNMMVNPGAGNGAGGAFGQGLAGPAFGGPPAGL 420 Query: 1551 XXXXXXXF---DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXX 1381 D +FMGRGAGYG FSGPAFPGM+PPF VN MGLPGVAPHVNPAFF Sbjct: 421 MHPQGMMGPGFDPSFMGRGAGYGGFSGPAFPGMMPPFQAVNPMGLPGVAPHVNPAFFGRG 480 Query: 1380 XXXXXXXXXXXXXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASH 1210 GPH GMW DT+ G WGGEEHG RESSYGGEDNASEYGYGE SH Sbjct: 481 MAANGMGMMSAAGMDGPHPGMWTDTSGGGWGGEEHGRRTRESSYGGEDNASEYGYGEVSH 540 Query: 1209 DKGARSSAASREKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKD 1030 DKGARSSA SREKE+ SERDWS N R K+ Sbjct: 541 DKGARSSAVSREKERGSERDWSGNSDKRHRDEREHDRDRHDKEHRYREERDGYRDYRQKE 600 Query: 1029 HDSGYDDDWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 +S Y++D+D+GQ A E+DHRSRSRD +YGKRRR PSE Sbjct: 601 RESEYEEDYDRGQSSSRSRSKSRAAQEEDHRSRSRDTNYGKRRRAPSE 648 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 327 bits (837), Expect = 3e-86 Identities = 175/358 (48%), Positives = 215/358 (60%), Gaps = 8/358 (2%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+DY +EEY G QKMQY GGAIPALA+EE++GE VG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 2619 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 Q+ + PS VGN +Q K +VP ++ Q N V+ EG Y FP Q Sbjct: 61 FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 2442 -----TSLP--GPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNS 2284 + P G G P SQ+G + E H++ G+QGS S P + D N Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMP 180 Query: 2283 EKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFV 2104 +V EPAP++ +GA +NR+M +E +RP +ENG TMLFV Sbjct: 181 
GRVANEPAPVLNPGAAGPQGALIPANQMGVNIN---VNRAMVNENQIRPPLENGGTMLFV 237 Query: 2103 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1924 GELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH Sbjct: 238 GELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 297 Query: 1923 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 FNGR CVVAFA+P T+KQMGASY NK GR P+ND GRG NY SGD Sbjct: 298 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 224 bits (570), Expect = 2e-55 Identities = 116/220 (52%), Positives = 134/220 (60%), Gaps = 7/220 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH GMW D+++G W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 1176 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDD 1006 EK++ SERDWS N R +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 1005 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 WD+G A+P++DHRSRSRD DYGKRRRLPSE Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 326 bits (836), Expect = 3e-86 Identities = 175/358 (48%), Positives = 215/358 (60%), Gaps = 8/358 (2%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+DY +EEY G QKMQY GGAIPALA+EE++GE VG+G LQ Sbjct: 1 MDSMAEEQIDYEEEEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQ 60 Query: 2619 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 Q+ + PS VGN +Q K +VP ++ Q N V+ EG Y FP Q Sbjct: 61 
FQQPEAPPPSAGVGNGRLQVKKTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQN 120 Query: 2442 -----TSLP--GPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNS 2284 + P G G P SQ+G + E H++ G+QGS S P + D N Sbjct: 121 DVQVAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMP 180 Query: 2283 EKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFV 2104 +V EPAP++ +GA +NR+M +E +RP +ENG TMLFV Sbjct: 181 GRVANEPAPVLNPGAAGPQGALIPANQMGVNIN---VNRAMVNENQIRPPLENGGTMLFV 237 Query: 2103 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1924 GELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH Sbjct: 238 GELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGH 297 Query: 1923 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 FNGR CVVAFA+P T+KQMGASY NK GR P+ND GRG NY SGD Sbjct: 298 VFNGRPCVVAFASPQTLKQMGASYMNKNQGQPQSQTQGRRPMNDGGGRGGNMNYQSGD 355 Score = 224 bits (571), Expect = 2e-55 Identities = 116/220 (52%), Positives = 134/220 (60%), Gaps = 7/220 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 439 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 498 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH GMW D+++G W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 499 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 558 Query: 1176 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDD 1006 EK++ SERDWS N R +D DS YDD+ Sbjct: 559 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 618 Query: 1005 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 WD+G A+P++DHRSRSRD DYGKRRRLPSE Sbjct: 619 WDRGPSSSRSRSRSRAIPDEDHRSRSRDVDYGKRRRLPSE 658 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. 
vesca] Length = 646 Score = 321 bits (822), Expect = 1e-84 Identities = 175/351 (49%), Positives = 219/351 (62%), Gaps = 1/351 (0%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MDP+ +EQ+DY +EEY G QK+QY + GAIPALA+EE + E VGEGFLQ Sbjct: 1 MDPMGEEQIDYEEEEYGGAQKLQYQESGAIPALADEEPMVEDDEYDDLYNDVNVGEGFLQ 60 Query: 2619 MQRSDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 M R + +P VGN G+Q K NVP ++G A QEV N + EG Y++ P QK Sbjct: 61 MHRPEPPLPPAGVGNGGLQAQKNNVPEQRVQGGASQEVKNPGFSVEGKYSSV----PEQK 116 Query: 2442 TSLPGPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEP 2263 P P A SQ+GR+ E+ H++Q G+QG+A+M AD + + K+ P Sbjct: 117 DQPPVSVVPEMA---SQKGRVMEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGP 173 Query: 2262 APLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWT 2083 P M N G+ N+NR M +E +RP VENG+ LFVGELHWWT Sbjct: 174 IPSM--NSGSNGPPAVQQMPANQMNMKINVNRPMVNENQIRPPVENGSATLFVGELHWWT 231 Query: 2082 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRAC 1903 TDAE+E VL Q+G++KEIKFFDERASGKSKGYCQV+FYDP+AASACKEGM+G+ FNGRAC Sbjct: 232 TDAELEGVLSQFGRIKEIKFFDERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRAC 291 Query: 1902 VVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 VVAFA+ T+KQMG SY NK GR P+ND AGRG N+ GD Sbjct: 292 VVAFASSQTLKQMGDSYVNKSQGQVQTQPQGRRPMNDGAGRGGNMNFQGGD 342 Score = 192 bits (487), Expect = 1e-45 Identities = 107/221 (48%), Positives = 123/221 (55%), Gaps = 8/221 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG F GP FPGMLP FPGVN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGGGYGGFPGPGFPGMLPQFPGVNAMGLAGVAPHVNPAFFGRGMATNGMGMMGS 485 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYG-YGEASHDKGARSSAAS 1180 G H+ MWND ++ W GEE RESSYGG+D SEYG YGEA+H+K RSSAA Sbjct: 486 SGMEGHHAPMWNDPSMAGWTGEEQDRRTRESSYGGDDGGSEYGNYGEANHEKPVRSSAAP 545 Query: 1179 REKEKNSERDW---SSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDD 1009 RE+E+ SER+W S R ++ D Y+D Sbjct: 546 
RERERESEREWTGTSERRHRDEREQDWDRSEREHREPRYKEEKDSYRDHRRRERDVAYED 605 Query: 1008 DWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 D D+G A+PEDDHRSRSRD DYGKRRRLPSE Sbjct: 606 DRDRGHSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 646 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 319 bits (818), Expect = 4e-84 Identities = 178/358 (49%), Positives = 217/358 (60%), Gaps = 7/358 (1%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGAQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 2619 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 +QRS+ P +G++G+Q K P E Q +N V+ +G + A +P Q Sbjct: 61 LQRSEAPPQPGGMGSTGLQAQKNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQD 120 Query: 2442 ----TSLP--GPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSE 2281 S P G G P SQ+GR+ E ++Q G+QG +S HK D + Sbjct: 121 GQPAVSRPEMGSGSYPSGTSISQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQ 180 Query: 2280 KVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVG 2101 K+ PA + + G +GA +N M E VRP +ENG TMLFVG Sbjct: 181 KIANVPAQSLNSGTGGPQGAPHVPPNQMGLN----VNHPMISENQVRPPIENGPTMLFVG 236 Query: 2100 ELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHS 1921 ELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEFYDP++A+ACKEGM+G+ Sbjct: 237 ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYM 296 Query: 1920 FNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 1747 FNGRACVVAFA+P T+KQMGASY NK GR P ND GRG NY SGDA Sbjct: 297 FNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NDGLGRGGNMNYQSGDA 353 Score = 205 bits (522), Expect = 9e-50 Identities = 112/220 (50%), Positives = 127/220 (57%), Gaps = 7/220 (3%) Frame = -2 Query: 
1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG YG F GP FPGMLP FP VN++GL GVAPHVNPAFF Sbjct: 435 DPTYMGRGGSYGGFPGPGFPGMLPSFPAVNTLGLAGVAPHVNPAFFGRGMAPNGMGMMGG 494 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH GMW DT++G WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 495 PGMDGPHVGMWTDTSMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 553 Query: 1176 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDH---DSGYDDD 1006 EKE+ S+R+WS N R H D YDDD Sbjct: 554 EKERVSDREWSGNSDRRHRDEKERDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 613 Query: 1005 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 D+GQ A+PE+ RSRSRD DYGKRRRLPSE Sbjct: 614 LDRGQSSSRSRRRSHAMPEEQRRSRSRDVDYGKRRRLPSE 653 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 317 bits (812), Expect = 2e-83 Identities = 183/354 (51%), Positives = 216/354 (61%), Gaps = 3/354 (0%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXV-GEGFL 2623 M+P+ EQ D+G+EEY G QKMQY+QGGAIPALA+EEMIGE GE F+ Sbjct: 1 MEPMNGEQFDFGEEEYGGGQKMQYNQGGAIPALADEEMIGEEDDEYDDLYNDVNVGESFM 60 Query: 2622 QMQRSDTQVPSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 Q+QR D+Q+P + + N GT E + +E N K A + A FP QK Sbjct: 61 QVQRPDSQIPPFKAEN-----RVNPSGTGDESIPSEEANASKYAGNRAFGPGALQFPEQK 115 Query: 2442 TSLPGPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEP 2263 L +D SQ R NSQ SGYQGS + P+ DQ+ N +K +G+P Sbjct: 116 AGLNTTEETSVTVDRSQTVR------NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDP 168 Query: 2262 APLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWT 2083 + + +KGA R +DDEY S ENGNTML+VGELHWWT Sbjct: 169 SSINPNVGVGSKGAVPFNFMNMAANANAI--RPVDDEYSNLGSSENGNTMLYVGELHWWT 226 Query: 2082 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRAC 1903 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEF+DP+AA ACKEGMNG+ FNGRAC Sbjct: 227 TDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRAC 286 Query: 1902 VVAFATPHTIKQMGASYTNKXXXXXXXXXXGRN-PVND-AAGRGNGANYPSGDA 1747 
VVAFATP TIKQMGASY N+ GRN +ND AGRG G N+ GDA Sbjct: 287 VVAFATPQTIKQMGASYMNRNQGQPQAQFPGRNAAMNDGGAGRGVGTNFSGGDA 340 Score = 124 bits (310), Expect = 3e-25 Identities = 62/96 (64%), Positives = 69/96 (71%), Gaps = 4/96 (4%) Frame = -2 Query: 1527 DLAFMGRGAGYGN-FSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXX 1351 DLAFMGRGAGYG F+GPAFPGMLPPFP VN++GLPGVAPHVNPAFF Sbjct: 413 DLAFMGRGAGYGGGFTGPAFPGMLPPFPAVNTLGLPGVAPHVNPAFFGRGMAPNGMGMMG 472 Query: 1350 XXXXXGPHSGMWNDTNI-GAWGGEEHGR--ESSYGG 1252 GP+SG+WND ++ G WGGEE GR ESSYGG Sbjct: 473 PSGMGGPYSGLWNDASVGGGWGGEEQGRGPESSYGG 508 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 316 bits (810), Expect = 4e-83 Identities = 170/355 (47%), Positives = 211/355 (59%), Gaps = 8/355 (2%) Frame = -2 Query: 2790 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 2611 + +EQ+DY ++EY G QKMQY GGAIPALA+EE++GE VG+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDVNVGDGLLQFQQ 60 Query: 2610 SDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK--- 2443 + PS VGN +Q K +VP ++ Q N V+ EG Y + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQ 120 Query: 2442 --TSLP--GPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKV 2275 + P G G P SQ+G + E H++ G+QGS S P + D N +V Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRV 180 Query: 2274 IGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGEL 2095 EPAP++ +GA +NR M +E +RP +ENG TMLFVGEL Sbjct: 181 ANEPAPVLNPGAAGPQGALIPANQMGVNAN---VNRVMVNENQIRPPLENGGTMLFVGEL 237 Query: 2094 HWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFN 1915 HWWTTDAE+ESVL QYG+ KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FN Sbjct: 238 HWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFN 297 Query: 1914 GRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 GR CVVAFA+P T+KQMGASY NK G P+ND GRG NY SGD Sbjct: 298 
GRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 228 bits (582), Expect = 1e-56 Identities = 119/220 (54%), Positives = 137/220 (62%), Gaps = 7/220 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH GMW D+++G W GEEHG RESSYGG+D AS+YGYGEA+H+KGARS+AASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWLGEEHGRRTRESSYGGDDGASDYGYGEANHEKGARSTAASR 555 Query: 1176 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDD 1006 EK++ SERDWS N R +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 1005 WDKGQ-XXXXXXXSGAVPEDDHRSRSRDADYGKRRRLPSE 889 WD+GQ SGA+P++DHRSRSRD DYGKRRRLPSE Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 315 bits (807), Expect = 8e-83 Identities = 169/355 (47%), Positives = 210/355 (59%), Gaps = 8/355 (2%) Frame = -2 Query: 2790 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 2611 + +EQ+DY ++EY G QKMQY GGAIPALA+EE++GE VG+G LQ Q+ Sbjct: 1 MAEEQIDYEEDEYGGAQKMQYQGGGAIPALADEELMGEDDEYDDLYNDINVGDGLLQFQQ 60 Query: 2610 SDTQVPSV-VGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK--- 2443 + PS VGN +Q K +VP ++ Q N V+ EG Y + FP Q Sbjct: 61 PEAPPPSAGVGNGRLQVKKTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQ 120 Query: 2442 --TSLP--GPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKV 2275 + P G G P SQ+G + E H++ G+QGS S P + D N + Sbjct: 121 VAVNRPNMGSGNYPDGASVSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRA 180 Query: 
2274 IGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGEL 2095 EPAP++ +GA +NR M +E +RP +ENG TMLFVGEL Sbjct: 181 ANEPAPVLNPGAAGPQGALIPANQMGVNAN---VNRVMVNENQIRPPLENGGTMLFVGEL 237 Query: 2094 HWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFN 1915 HWWTTDAE+ESVL QYG+ KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FN Sbjct: 238 HWWTTDAELESVLSQYGRAKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFN 297 Query: 1914 GRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 GR CVVAFA+P T+KQMGASY NK G P+ND GRG NY SGD Sbjct: 298 GRPCVVAFASPQTLKQMGASYMNKNQGQPQSQNQGSRPMNDGGGRGGNTNYQSGD 352 Score = 228 bits (582), Expect = 1e-56 Identities = 119/220 (54%), Positives = 136/220 (61%), Gaps = 7/220 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG FSGP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 436 DPTYMGRGGGYGGFSGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFNRGMAANGMGMMGS 495 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH GMW D+++G W GEEHG RESSYGG+D AS+YGYGEASH+KGARS+ ASR Sbjct: 496 SGMDGPHPGMWTDSSMGGWVGEEHGRRTRESSYGGDDGASDYGYGEASHEKGARSTTASR 555 Query: 1176 EKEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDD 1006 EK++ SERDWS N R +D DS YDD+ Sbjct: 556 EKDRGSERDWSGNTDRRHREEREQDWDRSERDHRDHRHREEKDSYRDRRQRDRDSTYDDN 615 Query: 1005 WDKGQ-XXXXXXXSGAVPEDDHRSRSRDADYGKRRRLPSE 889 WD+GQ SGA+P++DHRSRSRD DYGKRRRLPSE Sbjct: 616 WDRGQSSSRSRSRSGAIPDEDHRSRSRDVDYGKRRRLPSE 655 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 310 bits (794), Expect = 3e-81 Identities = 168/359 (46%), Positives = 217/359 (60%), Gaps = 8/359 (2%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 2619 
MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 2442 TSLPGPGGP-------PQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNS 2284 P P P SQ+G + E H+ Q G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 2283 EKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFV 2104 +K+ +PA + + G +G +N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTN----VNHPVMNENQVQPPIENGPTMLFV 235 Query: 2103 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1924 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1923 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 1747 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 171 bits (433), Expect = 2e-39 Identities = 83/133 (62%), Positives = 94/133 (70%), Gaps = 3/133 (2%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH+GMW D ++G WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1176 EKEKNSERDWSSN 1138 EKE+ SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 310 bits (794), Expect = 3e-81 Identities = 168/359 (46%), Positives = 217/359 (60%), Gaps = 8/359 (2%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+D+GDEEY G 
QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 2619 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 2442 TSLPGPGGP-------PQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNS 2284 P P P SQ+G + E H+ Q G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 2283 EKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFV 2104 +K+ +PA + + G +G +N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTN----VNHPVMNENQVQPPIENGPTMLFV 235 Query: 2103 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1924 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1923 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 1747 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 194 bits (494), Expect = 2e-46 Identities = 105/215 (48%), Positives = 122/215 (56%), Gaps = 7/215 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH+GMW D ++G WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1176 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDH---DSGYDDD 1006 EKE+ SER+WS N R H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 1005 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRR 904 WD+GQ A+PE++HRSRSRD Y + + Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVGYREEK 647 >ref|XP_007044907.1| RNA-binding family protein isoform 4 
[Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 310 bits (794), Expect = 3e-81 Identities = 168/359 (46%), Positives = 217/359 (60%), Gaps = 8/359 (2%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 2619 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 2442 TSLPGPGGP-------PQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNS 2284 P P P SQ+G + E H+ Q G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 2283 EKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFV 2104 +K+ +PA + + G +G +N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTN----VNHPVMNENQVQPPIENGPTMLFV 235 Query: 2103 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1924 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1923 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 1747 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 171 bits (433), Expect = 2e-39 Identities = 83/133 (62%), Positives = 94/133 (70%), Gaps = 3/133 (2%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH+GMW D ++G WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1176 EKEKNSERDWSSN 1138 EKE+ 
SER+WS N Sbjct: 553 EKERVSEREWSGN 565 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 310 bits (794), Expect = 3e-81 Identities = 168/359 (46%), Positives = 217/359 (60%), Gaps = 8/359 (2%) Frame = -2 Query: 2799 MDPVTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQ 2620 MD + +EQ+D+GDEEY G QKMQY GAIPALA+EEM+GE VGEGFLQ Sbjct: 1 MDAMAEEQIDFGDEEYGGGQKMQYQGSGAIPALADEEMMGEDDEYDDLYNDVNVGEGFLQ 60 Query: 2619 MQRSDTQV-PSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK 2443 +QRS+ + P +G++G++ + P +E Q +N V+ +G + +A +P +K Sbjct: 61 LQRSEAPLQPGGLGSTGLKAQRNEAPEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EK 119 Query: 2442 TSLPGPGGP-------PQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNS 2284 P P P SQ+G + E H+ Q G+QG S +K D Sbjct: 120 EEQPAVNRPEMVSGSYPSGSSISQKGSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVP 179 Query: 2283 EKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFV 2104 +K+ +PA + + G +G +N + +E V+P +ENG TMLFV Sbjct: 180 QKIANDPAQSLNSGTGGPQGPPHVPPNQMGTN----VNHPVMNENQVQPPIENGPTMLFV 235 Query: 2103 GELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGH 1924 GELHWWTTDAE+ESVL QYG++KEIKFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ Sbjct: 236 GELHWWTTDAELESVLSQYGRLKEIKFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGY 295 Query: 1923 SFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 1747 FNGRACVVAFA+P T+KQMGASY NK GR P N+ GRG NY SGDA Sbjct: 296 MFNGRACVVAFASPQTLKQMGASYMNKNQGQSQAQPQGRRP-NEGLGRGGNLNYQSGDA 353 Score = 213 bits (543), Expect = 3e-52 Identities = 114/220 (51%), Positives = 130/220 (59%), Gaps = 7/220 (3%) Frame = -2 Query: 
1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +M RG GYG F GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 434 DPTYMVRGGGYGGFPGPGFPGMLPSFPAVNTMGLAGVAPHVNPAFFGRGMAPNGMGMMGA 493 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 GPH+GMW D ++G WGG+EHG RESSYGGED ASEYGYG+A+H+KG RSS ASR Sbjct: 494 SGMDGPHAGMWTDASMGGWGGDEHGRRTRESSYGGEDGASEYGYGDANHEKG-RSSGASR 552 Query: 1176 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDH---DSGYDDD 1006 EKE+ SER+WS N R H D YDDD Sbjct: 553 EKERVSEREWSGNSDRRHRDEKEQDWDRSEREHREHRYREEKDSYREHRHRERDLDYDDD 612 Query: 1005 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 WD+GQ A+PE++HRSRSRD DYGK+RRLPSE Sbjct: 613 WDRGQSSSRSRRRSHAMPEEEHRSRSRDVDYGKKRRLPSE 652 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative [Ricinus communis] Length = 644 Score = 306 bits (784), Expect = 4e-80 Identities = 176/351 (50%), Positives = 211/351 (60%), Gaps = 3/351 (0%) Frame = -2 Query: 2790 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 2611 + DEQ+DY DEEY G QK+QY GAIPALAEEEM GE +GE FLQM R Sbjct: 1 MADEQIDYEDEEYGGAQKLQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNIGENFLQMHR 59 Query: 2610 SDTQ-VPSVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSL 2434 S+ P VGN G Q +N +E Q +N VA E Y+ T FP Q Sbjct: 60 SEAPPAPPSVGNGGFQPRNSN--DLRVESGGSQGLNIPGVAVESKYS-TGTHFPEQNVKG 116 Query: 2433 P--GPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEPA 2260 P G G P +Q+ R+ E+ ++SQA G+QGS S P D + + K+ +P Sbjct: 117 PEIGSVGYPDGSSIAQKTRVMEMTNDSQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPT 176 Query: 2259 PLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWTT 2080 P+ N G + + NRS +E +RP +ENG+TML+VGELHWWTT Sbjct: 177 PV--PNAGVPR--VIPQLPASQMNMNMDTNRSATNENQIRPPLENGSTMLYVGELHWWTT 232 Query: 2079 DAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACV 1900 DAE+E+VL QYG VKEIKFFDERASGKSKGYCQVEFYD +AA+ACKEGMNGH FNGRACV Sbjct: 233 
DAELENVLSQYGMVKEIKFFDERASGKSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACV 292 Query: 1899 VAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGDA 1747 VAFA+ T+KQMGASY NK GR P+ND AGRG NY GDA Sbjct: 293 VAFASQQTLKQMGASYMNKNQGQPQSQNQGRRPMNDGAGRGGNMNYQGGDA 343 Score = 223 bits (569), Expect = 3e-55 Identities = 118/219 (53%), Positives = 136/219 (62%), Gaps = 6/219 (2%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRGAGYG F+GP FPGMLP FP VN+MGL GVAPHVNPAFF Sbjct: 426 DPTYMGRGAGYGGFAGPGFPGMLPSFPAVNAMGLAGVAPHVNPAFFGRGMAPNGMGMMGP 485 Query: 1347 XXXXGPHSGMWNDTNIGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 1174 GP++GMW+DT++G WG E RESSYGG+D ASEYGYGE +H+KGARSSAASRE Sbjct: 486 SGMDGPNAGMWSDTSMGGWGEEPGRRTRESSYGGDDGASEYGYGEVNHEKGARSSAASRE 545 Query: 1173 KEKNSERDWSSNP---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDDW 1003 KE+ SERDWS N R ++ DSGY+DDW Sbjct: 546 KERASERDWSGNSDRRHRDDREHDWDRSEREHKEHRYREEKESYRDHRQRERDSGYEDDW 605 Query: 1002 DKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 D+GQ AVPE+D+RSRSRDADYGKRRRLPSE Sbjct: 606 DRGQSSSRSRSRSRAVPEEDYRSRSRDADYGKRRRLPSE 644 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 304 bits (778), Expect = 2e-79 Identities = 173/361 (47%), Positives = 213/361 (59%), Gaps = 13/361 (3%) Frame = -2 Query: 2790 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 2611 + +EQLDY DEEY G QKM + GGAI ALA++E++GE VGEGFLQM R Sbjct: 1 MAEEQLDYEDEEYGGAQKMPFQGGGAISALADDELMGEDDEYDDLYNDVNVGEGFLQMHR 60 Query: 2610 SDTQVPS-VVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYA------------A 2470 S+ PS V+ Q K +VP LE Q + V+ EG Y+ A Sbjct: 61 SEAPAPSGVMAGGPFQAHKTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMA 120 Query: 2469 TAAPFPVQKTSLPGPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMN 2290 P + L GP SQ+GR+ E+ H++Q G+QGS +P K A+ + Sbjct: 121 
VKGPEMGSTSHLDGPS-------VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSD 173 Query: 2289 NSEKVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTML 2110 K+ E P++ + G + +NR M +E +RP+V+NG TML Sbjct: 174 VHGKIANESTPVLNSGTGGPRAVPQMLSNQMGMNVN--VNRPMVNENQIRPAVDNGATML 231 Query: 2109 FVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMN 1930 FVGELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+ACKEGMN Sbjct: 232 FVGELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMN 291 Query: 1929 GHSFNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 G+ FNGRACVVAFA+P T+KQMGASY NK GR P+ND GRG G N GD Sbjct: 292 GYIFNGRACVVAFASPQTLKQMGASYMNK--TQAQSQSQGRRPMNDGVGRGGGMNMQGGD 349 Query: 1749 A 1747 A Sbjct: 350 A 350 Score = 213 bits (543), Expect = 3e-52 Identities = 110/217 (50%), Positives = 128/217 (58%), Gaps = 4/217 (1%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG YG FSG AFPGM+P FP VN+MGL GVAPHVNPAFF Sbjct: 431 DPTYMGRGGAYGGFSGSAFPGMVPSFPAVNTMGLAGVAPHVNPAFFGRGMAANGMGMMGA 490 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 G H+GMW DT++G WGGEEHG RESSYGG+D AS+YGYGE +H+K RS+ ASR Sbjct: 491 TGMDGHHAGMWTDTSMGGWGGEEHGRRTRESSYGGDDGASDYGYGEVNHEKVGRSNTASR 550 Query: 1176 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDDWDK 997 EKE+ SERDWS N R ++ D +DDWD+ Sbjct: 551 EKERGSERDWSGNSERRHRDEREQDWERSDKDHRYREEKDGYRDHRQRERDFNNEDDWDR 610 Query: 996 GQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 GQ AV ++DHRSRSRD DYGKRRRLPSE Sbjct: 611 GQSSSRSRSRSRAVADEDHRSRSRDGDYGKRRRLPSE 647 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 295 bits (754), Expect = 1e-76 Identities = 169/348 (48%), Positives = 207/348 (59%), Gaps = 1/348 (0%) Frame = -2 Query: 2790 VTDEQLDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQR 2611 + +EQ+DY DEEY G QK+QY GAI ALA+EE + E V EGFLQM R Sbjct: 1 
MAEEQIDYEDEEYGGAQKLQYQGSGAISALADEEPMVEDDEYDDLYNDVNVREGFLQMHR 60 Query: 2610 SDTQVP-SVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSL 2434 S+ +P VGN G+Q K +V T ++ QE V+ +G Y++ A FP Q+ Sbjct: 61 SEAPLPPGGVGNGGLQAQKTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ--- 117 Query: 2433 PGPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEPAPL 2254 G PP VA + G +GY GS +MP D + + K E P Sbjct: 118 ---GQPP-------------VAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPS 160 Query: 2253 MYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDA 2074 M N G N NR M +E +RP VENG+TMLFVGELHWWTTDA Sbjct: 161 M--NSGTAGPTGVTQMPTNQISIKVNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDA 218 Query: 2073 EIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVA 1894 E+ESVL QYG+VKEIKFFDERASGKSKGYCQVEF+DP+AA+ACKEGM+G+ FNGRACVVA Sbjct: 219 ELESVLSQYGRVKEIKFFDERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVA 278 Query: 1893 FATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 FA+P T+KQMGASY +K GR P+N+ GRG G NY +GD Sbjct: 279 FASPQTLKQMGASYLSKSQGQTQSQQPGRRPMNEGVGRGGGVNYQTGD 326 Score = 223 bits (568), Expect = 4e-55 Identities = 116/221 (52%), Positives = 133/221 (60%), Gaps = 8/221 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG F GPAFPGML FP VN+MGL GVAPHVNPAFF Sbjct: 410 DPTYMGRGGGYGGFPGPAFPGMLSSFPAVNTMGLAGVAPHVNPAFFGRGMATNGMGMMGS 469 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 G H+GMWND ++G WGG+EHG RESSYGG+D ASEYGYGEA+H+KG RS+A SR Sbjct: 470 SGMDGHHAGMWNDPSMGGWGGDEHGRRTRESSYGGDDGASEYGYGEANHEKGGRSNAPSR 529 Query: 1176 EKEKNSERDWSSNP----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDD 1009 E+E+ SERDWS N R ++ D GY+D Sbjct: 530 ERERGSERDWSGNSERRHRDEREQDWDRSERGEHREHRYKEEKDSYRDHRQRERDVGYED 589 Query: 1008 DWDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 DWD+GQ A+PEDDHRSRSRD DYGKRRRLPSE Sbjct: 590 DWDRGQSSSRPRSRSKAMPEDDHRSRSRDVDYGKRRRLPSE 630 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 
287 bits (735), Expect = 2e-74 Identities = 163/357 (45%), Positives = 210/357 (58%), Gaps = 10/357 (2%) Frame = -2 Query: 2790 VTDEQLDYGDEEYAGNQKMQYH-QGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQ 2614 + ++ +D+ DEEY G QK QY GGAI ALA+EE++G+ VGEGFLQ+Q Sbjct: 1 MAEDHIDFEDEEYGGAQKHQYQGSGGAISALADEELMGDDDEYDDLYNDVNVGEGFLQLQ 60 Query: 2613 RSDT-QVPSVVG-NSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFP---- 2452 RS+ +P+ G +G+Q K N P E Q+ N V+ EG +++ + FP Sbjct: 61 RSEAPSLPAAAGVGNGLQAQKRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQD 120 Query: 2451 ---VQKTSLPGPGGPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSE 2281 V K S G P SQ+GR+ +G+QGS M H D + Sbjct: 121 GLKVDKKSEAGSMVYPDGASGSQKGRI----------VAGFQGSKPMLHSVGVDSSDIPG 170 Query: 2280 KVIGEPAPLMYTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVG 2101 K++ EP + N G N++ + +E +RPS+ENG+TMLFVG Sbjct: 171 KMVNEP--IQAPNSGGAGPRGILPMQGNQTTVNANVSHPIVNENQIRPSIENGSTMLFVG 228 Query: 2100 ELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHS 1921 ELHWWTTDAE+ESVL QYG+VKEIKFFDERASGKSKGYCQVE+YD +AA ACKEGM+GH Sbjct: 229 ELHWWTTDAELESVLSQYGRVKEIKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHV 288 Query: 1920 FNGRACVVAFATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 FNGRACVVAFA+P T+KQMGA+Y +K GR P+ND GRG N+ SGD Sbjct: 289 FNGRACVVAFASPQTLKQMGAAYMSKNQVQNQSQPQGRRPINDGVGRGGNPNFQSGD 345 Score = 194 bits (492), Expect = 3e-46 Identities = 106/220 (48%), Positives = 119/220 (54%), Gaps = 7/220 (3%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG F+GPAFPGMLP FP VN+MG VAPHVNPAFF Sbjct: 425 DPTYMGRGVGYGGFAGPAFPGMLPSFPAVNTMGFAAVAPHVNPAFFGRGMTNNGMGMVGS 484 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHG---RESSYGGEDNASEYGYGEASHDKGARSSAASR 1177 G GMWND +IG WGGEEHG RESSYGG+D ASEYGYG+ +H+KG R Sbjct: 485 SLMDGHQGGMWNDPSIGGWGGEEHGRRTRESSYGGDDGASEYGYGDTNHEKGGR------ 538 Query: 1176 EKEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRS---KDHDSGYDDD 1006 E+ SERDWS N R K+ + Y+DD Sbjct: 539 --ERGSERDWSGNSERRNHEERDQDWDRSQKEQKEHRYREGKDGSRDYRPKERELDYEDD 596 
Query: 1005 WDKGQXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 WD+GQ V ED HRSRSRD DYGKRRRLPSE Sbjct: 597 WDRGQSSSRLRSRSRVVQEDHHRSRSRDVDYGKRRRLPSE 636 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 279 bits (713), Expect = 6e-72 Identities = 163/347 (46%), Positives = 197/347 (56%), Gaps = 5/347 (1%) Frame = -2 Query: 2775 LDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQRSDTQV 2596 +DY +EE KMQY GAIPALAEEEM GE VGE FLQM S+ Sbjct: 1 MDYEEEE-----KMQYQGSGAIPALAEEEM-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54 Query: 2595 P-SVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPG- 2422 P + VGN G Q A+ G + A EG Y+ A FP QK Sbjct: 55 PPATVGNGGFQTRNAHESRIETGGSQALAITGGGPAVEGIYSNAKAHFPEQKQVAVAVEA 114 Query: 2421 ---GPPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEPAPLM 2251 GP +Q+GR+ E++H+ Q G+Q S +P D + S K EP PL Sbjct: 115 QDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKSTPVPPGIGVDPSDMSRKNAIEPEPLP 174 Query: 2250 YTNMGNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAE 2071 T +GA +NR + +E VRP +ENG+T L+VGELHWWTTDAE Sbjct: 175 ITGSAGPRGAPQMQVNQMHMSAD--VNRPVVNENQVRPPIENGSTTLYVGELHWWTTDAE 232 Query: 2070 IESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 1891 +ES Q+G+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMNGH FNGR CVVAF Sbjct: 233 LESFASQFGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNGHVFNGRPCVVAF 292 Query: 1890 ATPHTIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 A+P T+KQMGASY NK GR +ND AGRG AN+ SGD Sbjct: 293 ASPQTLKQMGASYMNKTQGQPQTQSQGRGSMNDGAGRGGNANFQSGD 339 Score = 189 bits (480), Expect = 6e-45 Identities = 103/214 (48%), Positives = 118/214 (55%), Gaps = 1/214 (0%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG F+GP FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 421 DPLYMGRGGGYGGFAGPGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMVS 480 Query: 1347 
XXXXGPHSGMWNDTNIGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 1168 GP+ GMW ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 481 SGMDGPNPGMW---------------ESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 525 Query: 1167 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDDWDKGQX 988 + SERDWS N R ++ DSGY+DD D+G Sbjct: 526 RGSERDWSGNSDRRHRDEREQDWDRPEREHRYKEEKDSYRGHRQRERDSGYEDDRDRGHS 585 Query: 987 XXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 A PE+D+RSR+RD DYGKRRRLPSE Sbjct: 586 SSRARSRSRAAPEEDYRSRTRDVDYGKRRRLPSE 619 >ref|XP_006378268.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] gi|550329195|gb|ERP56065.1| hypothetical protein POPTR_0010s06150g [Populus trichocarpa] Length = 591 Score = 242 bits (618), Expect = 6e-61 Identities = 149/343 (43%), Positives = 180/343 (52%), Gaps = 1/343 (0%) Frame = -2 Query: 2775 LDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQRSDTQV 2596 +D+ +EE KMQY GAIPALAEEE+ GE VGE FLQM S+ Sbjct: 1 MDFEEEE-----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54 Query: 2595 P-SVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGG 2419 P + GN G Q A+ G + + VA EG Y+ A FP QK + G Sbjct: 55 PPATAGNGGFQTRNAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGIG--- 111 Query: 2418 PPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNM 2239 + G GY +S+ K +A P M N Sbjct: 112 ----------------VEANDVGSIGYGDGSSVAQKGSAGPRG---------VPQMQVNQ 146 Query: 2238 GNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESV 2059 N ++NR + +E VRP +ENG T L+VGELHWWTTDAE+ESV Sbjct: 147 MNMNA---------------DVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESV 191 Query: 2058 LIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPH 1879 QYG+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNGR CVVAFA+ Sbjct: 192 ASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQ 251 Query: 1878 TIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 T+KQMGASY +K GR +ND GRG ANY SGD Sbjct: 252 TLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 200 bits (509), Expect = 3e-48 Identities = 108/216 (50%), Positives = 
123/216 (56%), Gaps = 3/216 (1%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAPNGMGMMAS 435 Query: 1347 XXXXGPHSGMWNDTNIGAWGGE--EHGRESSYGGEDNASEYGYGEASHDKGARSSAASRE 1174 GP+ G W DT++G WG E RESSY G++ ASEYGYGE +H+KGARSS ASRE Sbjct: 436 SGMEGPNPGKWPDTSMGGWGEEPGRRTRESSYDGDEGASEYGYGEGNHEKGARSSGASRE 495 Query: 1173 KEKNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDDWDKG 994 KE+ SERDWS N R ++ DSGY+DD D+G Sbjct: 496 KERVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRG 555 Query: 993 QXXXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 A PE+D+RSRSRD DYGKRRR PSE Sbjct: 556 HSSSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 591 >ref|XP_002315647.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222864687|gb|EEF01818.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 573 Score = 242 bits (618), Expect = 6e-61 Identities = 149/343 (43%), Positives = 180/343 (52%), Gaps = 1/343 (0%) Frame = -2 Query: 2775 LDYGDEEYAGNQKMQYHQGGAIPALAEEEMIGEXXXXXXXXXXXXVGEGFLQMQRSDTQV 2596 +D+ +EE KMQY GAIPALAEEE+ GE VGE FLQM S+ Sbjct: 1 MDFEEEE-----KMQYQGSGAIPALAEEEL-GEDDEYDDLYNDVNVGENFLQMHGSEAPA 54 Query: 2595 P-SVVGNSGIQNSKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGG 2419 P + GN G Q A+ G + + VA EG Y+ A FP QK + G Sbjct: 55 PPATAGNGGFQTRNAHESRVETGGSQVLATSGAGVAVEGKYSNAGAHFPEQKQAGIG--- 111 Query: 2418 PPQAMDASQRGRLPEVAHNSQAGYSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNM 2239 + G GY +S+ K +A P M N Sbjct: 112 ----------------VEANDVGSIGYGDGSSVAQKGSAGPRG---------VPQMQVNQ 146 Query: 2238 GNTKGAXXXXXXXXXXXXXXNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESV 2059 N ++NR + +E VRP +ENG T L+VGELHWWTTDAE+ESV Sbjct: 147 MNMNA---------------DVNRPVVNENQVRPPIENGPTTLYVGELHWWTTDAELESV 191 Query: 2058 LIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAFATPH 1879 QYG+VKEIKFFDERASGKSKGYCQV+FY+ +AA+ACKEGMN H FNGR CVVAFA+ Sbjct: 192 
ASQYGRVKEIKFFDERASGKSKGYCQVDFYEAAAAAACKEGMNEHVFNGRPCVVAFASAQ 251 Query: 1878 TIKQMGASYTNKXXXXXXXXXXGRNPVNDAAGRGNGANYPSGD 1750 T+KQMGASY +K GR +ND GRG ANY SGD Sbjct: 252 TLKQMGASYMSKTQGQPQPQSQGRGSMNDGMGRGGNANYQSGD 294 Score = 178 bits (452), Expect = 1e-41 Identities = 102/214 (47%), Positives = 117/214 (54%), Gaps = 1/214 (0%) Frame = -2 Query: 1527 DLAFMGRGAGYGNFSGPAFPGMLPPFPGVNSMGLPGVAPHVNPAFFXXXXXXXXXXXXXX 1348 D +MGRG GYG F G FPGMLP FP VNSMGL GVAPHVNPAFF Sbjct: 376 DPLYMGRGGGYGGFPGHGFPGMLPSFPAVNSMGLAGVAPHVNPAFFARGMAP-------- 427 Query: 1347 XXXXGPHSGMWNDTNIGAWGGEEHGRESSYGGEDNASEYGYGEASHDKGARSSAASREKE 1168 +GM + G G G+ESSY G++ ASEYGYGE +H+KGARSS ASREKE Sbjct: 428 -------NGMGMMASSG-MEGPNPGKESSYDGDEGASEYGYGEGNHEKGARSSGASREKE 479 Query: 1167 KNSERDWSSNPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRSKDHDSGYDDDWDKGQX 988 + SERDWS N R ++ DSGY+DD D+G Sbjct: 480 RVSERDWSGNSDRRHRDEREQDWDRSEREPKYREEKDTYRGHRQRERDSGYEDDRDRGHS 539 Query: 987 XXXXXXSG-AVPEDDHRSRSRDADYGKRRRLPSE 889 A PE+D+RSRSRD DYGKRRR PSE Sbjct: 540 SSRARSRSRAAPEEDYRSRSRDVDYGKRRRPPSE 573