BLASTX nr result
ID: Mentha22_contig00036409
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00036409
         (678 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus...   314   1e-83
ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation spec...   243   5e-62
ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309...   229   6e-58
gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlise...   227   3e-57
ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation spec...   225   9e-57
ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citr...   225   1e-56
ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268...   223   3e-56
ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobr...   223   4e-56
ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation spec...   221   2e-55
ref|XP_002515040.1| RNA binding protein, putative [Ricinus commu...   220   4e-55
ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citr...   219   5e-55
ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobr...   213   3e-53
ref|XP_007044908.1| RNA-binding family protein isoform 5, partia...   213   3e-53
ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobr...   213   3e-53
ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobr...   213   3e-53
ref|XP_002312652.1| RNA recognition motif-containing family prot...   207   2e-51
ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prun...   205   1e-50
gb|EXB82464.1| Cleavage and polyadenylation specificity factor s...
197 2e-48 emb|CBI16834.3| unnamed protein product [Vitis vinifera] 187 2e-45 ref|XP_006417146.1| hypothetical protein EUTSA_v10007191mg [Eutr... 179 7e-43 >gb|EYU30596.1| hypothetical protein MIMGU_mgv1a002773mg [Mimulus guttatus] Length = 639 Score = 314 bits (805), Expect = 1e-83 Identities = 152/227 (66%), Positives = 177/227 (77%), Gaps = 3/227 (1%) Frame = +3 Query: 6 SKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGR 185 SK PGT E +A QEVNN +V EG+YA QK +L GGP Q +DASQR R Sbjct: 80 SKNTAPGTRAEAIASQEVNNGRVGNEGSYAPNGVQLSDQKNNLTAVGGPAQPVDASQRVR 139 Query: 186 LPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPN 365 LPE+A++SQA H GYQGS M HK A D+MNNSE ++GEPA L+Y N G++KG PQ P N Sbjct: 140 LPEVANSSQAAHLGYQGSEIMLHKTATDRMNNSENIVGEPASLVYPNTGSSKGVPQAPSN 199 Query: 366 QMNLNPNVNIN--RSMDDEYMVRPSV-ENGNTMLFVGELHWWTTDAEIESVLIQYGKVKE 536 MN N NVN+N RSMDDEY++RPS ENGN M++VGELHWWTTDAE+ESVLIQYG+VKE Sbjct: 200 LMNSNANVNVNVNRSMDDEYLIRPSGGENGNPMIYVGELHWWTTDAEVESVLIQYGRVKE 259 Query: 537 IKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 IKFFDERASGKSKGYCQVEFYDP+AA+ACK+GM GH FNGRACVV + Sbjct: 260 IKFFDERASGKSKGYCQVEFYDPAAATACKDGMQGHIFNGRACVVTY 306 >ref|XP_006341786.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Solanum tuberosum] gi|565349616|ref|XP_006341787.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X2 [Solanum tuberosum] Length = 648 Score = 243 bits (619), Expect = 5e-62 Identities = 126/225 (56%), Positives = 151/225 (67%), Gaps = 2/225 (0%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188 K + P + G+ +E +A EG YA T FP QK + DA+Q+ R Sbjct: 82 KDSFPASRAGGLGSEEAKIPGIATEGKYAGTEVQFPQQKGEPVVERETERPADAAQKARP 141 Query: 189 PE--MAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPP 362 M NSQAG+SGYQGS MP K AD M EK E PLM + + + P +P Sbjct: 142 SAITMTLNSQAGNSGYQGSMPMPQKIGADPMAMPEKNASEATPLMNSVVPGPRVVPHMPT 201 Query: 363 
NQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIK 542 NQ+N + NVN+N + E RPS+ENGNTMLFVGELHWWTTDAE+ESVL QYG VKEIK Sbjct: 202 NQLNSSGNVNMNNPVISETPFRPSLENGNTMLFVGELHWWTTDAELESVLTQYGNVKEIK 261 Query: 543 FFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 FFDERASGKSKGYCQVEF+DP++A+ACKEGMNG++FNGRACVVAF Sbjct: 262 FFDERASGKSKGYCQVEFFDPASAAACKEGMNGYNFNGRACVVAF 306 >ref|XP_004310003.1| PREDICTED: uncharacterized protein LOC101309507 [Fragaria vesca subsp. vesca] Length = 646 Score = 229 bits (584), Expect = 6e-58 Identities = 118/223 (52%), Positives = 150/223 (67%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188 K NVP ++G A QEV N + EG Y++ P QK P P ASQ+GR+ Sbjct: 82 KNNVPEQRVQGGASQEVKNPGFSVEGKYSSV----PEQKDQPPVSVVPEM---ASQKGRV 134 Query: 189 PEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQ 368 EM H++Q + G+QG+A+M AD + + K+ P P M + Q+P NQ Sbjct: 135 MEMTHDAQVRNMGFQGAATMQSNVVADSSDLTGKIANGPIPSMNSGSNGPPAVQQMPANQ 194 Query: 369 MNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFF 548 MN+ +N+NR M +E +RP VENG+ LFVGELHWWTTDAE+E VL Q+G++KEIKFF Sbjct: 195 MNMK--INVNRPMVNENQIRPPVENGSATLFVGELHWWTTDAELEGVLSQFGRIKEIKFF 252 Query: 549 DERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 DERASGKSKGYCQV+FYDP+AASACKEGM+G+ FNGRACVVAF Sbjct: 253 DERASGKSKGYCQVDFYDPAAASACKEGMDGYVFNGRACVVAF 295 >gb|EPS60955.1| hypothetical protein M569_13847, partial [Genlisea aurea] Length = 508 Score = 227 bits (578), Expect = 3e-57 Identities = 125/224 (55%), Positives = 146/224 (65%) Frame = +3 Query: 6 SKANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGR 185 ++ N GT E + +E N K A + A FP QK L T+D SQ R Sbjct: 76 NRVNPSGTGDESIPSEEANASKYAGNRAFGPGALQFPEQKAGLNTTEETSVTVDRSQTVR 135 Query: 186 LPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPN 365 NSQ SGYQGS + P+ DQ+ N +K +G+P+ + +KGA VP N Sbjct: 136 ------NSQTDQSGYQGSVA-PNNKTEDQVKNMDKTVGDPSSINPNVGVGSKGA--VPFN 186 Query: 366 QMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKF 545 MN+ 
N N R +DDEY S ENGNTML+VGELHWWTTDAEIESVLIQYGKVKEIKF Sbjct: 187 FMNMAANANAIRPVDDEYSNLGSSENGNTMLYVGELHWWTTDAEIESVLIQYGKVKEIKF 246 Query: 546 FDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 FDERASGKSKGYCQVEF+DP+AA ACKEGMNG+ FNGRACVVAF Sbjct: 247 FDERASGKSKGYCQVEFFDPAAAHACKEGMNGYVFNGRACVVAF 290 >ref|XP_006483997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 658 Score = 225 bits (574), Expect = 9e-57 Identities = 117/230 (50%), Positives = 147/230 (63%), Gaps = 7/230 (3%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167 K +VP ++ Q N V+ EG Y FP Q + P G G P Sbjct: 82 KTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAVNRPNMGSGNYPDGAS 141 Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347 SQ+G + E H++ + G+QGS S P + D N +V EPAP++ +GA Sbjct: 142 VSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA 201 Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527 +P NQM +N +N+NR+M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+ Sbjct: 202 -LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 258 Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF Sbjct: 259 VKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 308 >ref|XP_006438179.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] gi|557540375|gb|ESR51419.1| hypothetical protein CICLE_v10030915mg [Citrus clementina] Length = 658 Score = 225 bits (573), Expect = 1e-56 Identities = 117/230 (50%), Positives = 147/230 (63%), Gaps = 7/230 (3%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167 K +VP ++ Q N V+ EG Y FP Q + P G G P Sbjct: 82 KTDVPEQQVQAGVSQGSNVPGVSVEGKYTNAGTHFPAQNDVQVAVNRPNMGSGNYPDGAS 141 Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347 SQ+G + E H++ + G+QGS S P + D N +V EPAP++ +GA Sbjct: 142 
VSQKGSVQETTHDAHVRNMGFQGSTSGPPRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA 201 Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527 +P NQM +N +N+NR+M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+ Sbjct: 202 -LIPANQMGVN--INVNRAMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 258 Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 VKEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF Sbjct: 259 VKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 308 >ref|XP_002282072.2| PREDICTED: uncharacterized protein LOC100268141 isoform 1 [Vitis vinifera] gi|359473133|ref|XP_003631251.1| PREDICTED: uncharacterized protein LOC100268141 isoform 2 [Vitis vinifera] Length = 647 Score = 223 bits (569), Expect = 3e-56 Identities = 120/235 (51%), Positives = 150/235 (63%), Gaps = 12/235 (5%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYA------------ATAAPFPVQKTSLPGPGGP 152 K +VP LE Q + V+ EG Y+ A P + L GP Sbjct: 79 KTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAVKGPEMGSTSHLDGPS-- 136 Query: 153 PQTMDASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMG 332 SQ+GR+ EM H++Q + G+QGS +P K A+ + K+ E P++ + G Sbjct: 137 -----VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANESTPVLNSGTG 191 Query: 333 NTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVL 512 + PQ+ NQM +N VN+NR M +E +RP+V+NG TMLFVGELHWWTTDAE+ESVL Sbjct: 192 GPRAVPQMLSNQMGMN--VNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVL 249 Query: 513 IQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+ACKEGMNG+ FNGRACVVAF Sbjct: 250 SQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAACKEGMNGYIFNGRACVVAF 304 >ref|XP_007044902.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695488|ref|XP_007044903.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708837|gb|EOY00734.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708838|gb|EOY00735.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 653 Score = 223 bits (568), Expect = 4e-56 Identities = 119/229 (51%), 
Positives = 145/229 (63%), Gaps = 6/229 (2%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK----TSLP--GPGGPPQTMDA 170 K P E Q +N V+ +G + A +P Q S P G G P Sbjct: 82 KNEAPEPRGEAGGSQGLNIPGVSVQGKHLNVTARYPEQDGQPAVSRPEMGSGSYPSGTSI 141 Query: 171 SQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAP 350 SQ+GR+ E ++Q + G+QG +S HK D +K+ PA + + G +GAP Sbjct: 142 SQKGRVMEGTQDTQVKNMGFQGLSSASHKVGIDPSGVPQKIANVPAQSLNSGTGGPQGAP 201 Query: 351 QVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKV 530 VPPNQM LN +N M E VRP +ENG TMLFVGELHWWTTDAE+ESVL QYG+V Sbjct: 202 HVPPNQMGLN----VNHPMISENQVRPPIENGPTMLFVGELHWWTTDAELESVLSQYGRV 257 Query: 531 KEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KEIKFFDERASGKSKGYCQVEFYDP++A+ACKEGM+G+ FNGRACVVAF Sbjct: 258 KEIKFFDERASGKSKGYCQVEFYDPASAAACKEGMDGYMFNGRACVVAF 306 >ref|XP_006483998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit CG7185-like isoform X1 [Citrus sinensis] Length = 655 Score = 221 bits (562), Expect = 2e-55 Identities = 117/230 (50%), Positives = 145/230 (63%), Gaps = 7/230 (3%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167 K +VP ++ Q N V+ EG Y + FP Q + P G G P Sbjct: 79 KTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSHFPAQNDVQVAVNRPNMGSGNYPDGAS 138 Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347 SQ+G + E H++ + G+QGS S P + D N +V EPAP++ +GA Sbjct: 139 VSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRVANEPAPVLNPGAAGPQGA 198 Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527 +P NQM +N NVN R M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+ Sbjct: 199 -LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 255 Query: 528 VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF Sbjct: 256 AKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 305 >ref|XP_002515040.1| RNA binding protein, putative [Ricinus communis] gi|223546091|gb|EEF47594.1| RNA binding protein, putative 
[Ricinus communis] Length = 644 Score = 220 bits (560), Expect = 4e-55 Identities = 117/217 (53%), Positives = 147/217 (67%), Gaps = 2/217 (0%) Frame = +3 Query: 33 LEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLP--GPGGPPQTMDASQRGRLPEMAHN 206 +E Q +N VA E Y+ T FP Q P G G P +Q+ R+ EM ++ Sbjct: 84 VESGGSQGLNIPGVAVESKYS-TGTHFPEQNVKGPEIGSVGYPDGSSIAQKTRVMEMTND 142 Query: 207 SQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQMNLNPN 386 SQA + G+QGS S P D + + K+ +P P+ N G + PQ+P +QMN+N Sbjct: 143 SQARNMGFQGSTSGPSNIGVDPSDMNNKISNDPTPV--PNAGVPRVIPQLPASQMNMN-- 198 Query: 387 VNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASG 566 ++ NRS +E +RP +ENG+TML+VGELHWWTTDAE+E+VL QYG VKEIKFFDERASG Sbjct: 199 MDTNRSATNENQIRPPLENGSTMLYVGELHWWTTDAELENVLSQYGMVKEIKFFDERASG 258 Query: 567 KSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KSKGYCQVEFYD +AA+ACKEGMNGH FNGRACVVAF Sbjct: 259 KSKGYCQVEFYDAAAAAACKEGMNGHLFNGRACVVAF 295 >ref|XP_006438180.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|567891321|ref|XP_006438181.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540376|gb|ESR51420.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] gi|557540377|gb|ESR51421.1| hypothetical protein CICLE_v10030917mg [Citrus clementina] Length = 655 Score = 219 bits (559), Expect = 5e-55 Identities = 116/230 (50%), Positives = 144/230 (62%), Gaps = 7/230 (3%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQK-----TSLP--GPGGPPQTMD 167 K +VP ++ Q N V+ EG Y + FP Q + P G G P Sbjct: 79 KTDVPEQRVQVGGSQGSNIPGVSVEGKYTNAGSDFPAQNDVQVAVNRPNMGSGNYPDGAS 138 Query: 168 ASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGA 347 SQ+G + E H++ + G+QGS S P + D N + EPAP++ +GA Sbjct: 139 VSQKGSVQETTHDAHVRNMGFQGSTSGPSRTGVDPSNMPGRAANEPAPVLNPGAAGPQGA 198 Query: 348 PQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGK 527 +P NQM +N NVN R M +E +RP +ENG TMLFVGELHWWTTDAE+ESVL QYG+ Sbjct: 199 -LIPANQMGVNANVN--RVMVNENQIRPPLENGGTMLFVGELHWWTTDAELESVLSQYGR 255 Query: 528 
VKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KEIKFFDERASGKSKGYCQVEF+D +AA+ACK+GMNGH FNGR CVVAF Sbjct: 256 AKEIKFFDERASGKSKGYCQVEFFDAAAAAACKDGMNGHVFNGRPCVVAF 305 >ref|XP_007044909.1| RNA-binding family protein isoform 6 [Theobroma cacao] gi|508708844|gb|EOY00741.1| RNA-binding family protein isoform 6 [Theobroma cacao] Length = 602 Score = 213 bits (543), Expect = 3e-53 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%) Frame = +3 Query: 21 PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179 P +E Q +N V+ +G + +A +P +K P P P SQ+ Sbjct: 86 PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144 Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359 G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P VP Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204 Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539 PNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260 Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 >ref|XP_007044908.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] gi|508708843|gb|EOY00740.1| RNA-binding family protein isoform 5, partial [Theobroma cacao] Length = 656 Score = 213 bits (543), Expect = 3e-53 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%) Frame = +3 Query: 21 PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179 P +E Q +N V+ +G + +A +P +K P P P SQ+ Sbjct: 86 PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144 Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359 G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P VP Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204 Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 
539 PNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260 Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 >ref|XP_007044907.1| RNA-binding family protein isoform 4 [Theobroma cacao] gi|508708842|gb|EOY00739.1| RNA-binding family protein isoform 4 [Theobroma cacao] Length = 697 Score = 213 bits (543), Expect = 3e-53 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%) Frame = +3 Query: 21 PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179 P +E Q +N V+ +G + +A +P +K P P P SQ+ Sbjct: 86 PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144 Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359 G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P VP Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204 Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539 PNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260 Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 >ref|XP_007044904.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695496|ref|XP_007044905.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|590695500|ref|XP_007044906.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708839|gb|EOY00736.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708840|gb|EOY00737.1| RNA-binding family protein isoform 1 [Theobroma cacao] gi|508708841|gb|EOY00738.1| RNA-binding family protein isoform 1 [Theobroma cacao] Length = 652 Score = 213 bits (543), Expect = 3e-53 Identities = 110/226 (48%), Positives = 142/226 (62%), Gaps = 7/226 (3%) Frame = +3 
Query: 21 PGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGP-------PQTMDASQR 179 P +E Q +N V+ +G + +A +P +K P P P SQ+ Sbjct: 86 PEPRVEAGGSQGLNIPGVSVQGKHPNVSARYP-EKEEQPAVNRPEMVSGSYPSGSSISQK 144 Query: 180 GRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVP 359 G + E H+ Q + G+QG S +K D +K+ +PA + + G +G P VP Sbjct: 145 GSVTEGTHDKQVKNLGFQGLTSASNKVGIDPSGVPQKIANDPAQSLNSGTGGPQGPPHVP 204 Query: 360 PNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEI 539 PNQM N+N + +E V+P +ENG TMLFVGELHWWTTDAE+ESVL QYG++KEI Sbjct: 205 PNQMG----TNVNHPVMNENQVQPPIENGPTMLFVGELHWWTTDAELESVLSQYGRLKEI 260 Query: 540 KFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 KFFDE+ASGKSKGYCQVEFYDPS+A+ CKEGMNG+ FNGRACVVAF Sbjct: 261 KFFDEKASGKSKGYCQVEFYDPSSAAVCKEGMNGYMFNGRACVVAF 306 >ref|XP_002312652.1| RNA recognition motif-containing family protein [Populus trichocarpa] gi|222852472|gb|EEE90019.1| RNA recognition motif-containing family protein [Populus trichocarpa] Length = 619 Score = 207 bits (527), Expect = 2e-51 Identities = 108/205 (52%), Positives = 136/205 (66%), Gaps = 4/205 (1%) Frame = +3 Query: 75 AEEGNYAATAAPFPVQKTSLPGPG----GPPQTMDASQRGRLPEMAHNSQAGHSGYQGSA 242 A EG Y+ A FP QK GP +Q+GR+ EM+H+ Q + G+Q S Sbjct: 90 AVEGIYSNAKAHFPEQKQVAVAVEAQDVGPVDGSSVAQKGRVIEMSHDVQVRNMGFQKST 149 Query: 243 SMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQMNLNPNVNINRSMDDEYM 422 +P D + S K EP PL T +GAPQ+ NQM+++ +VN R + +E Sbjct: 150 PVPPGIGVDPSDMSRKNAIEPEPLPITGSAGPRGAPQMQVNQMHMSADVN--RPVVNENQ 207 Query: 423 VRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYD 602 VRP +ENG+T L+VGELHWWTTDAE+ES Q+G+VKEIKFFDERASGKSKGYCQV+FY+ Sbjct: 208 VRPPIENGSTTLYVGELHWWTTDAELESFASQFGRVKEIKFFDERASGKSKGYCQVDFYE 267 Query: 603 PSAASACKEGMNGHSFNGRACVVAF 677 +AA+ACKEGMNGH FNGR CVVAF Sbjct: 268 AAAAAACKEGMNGHVFNGRPCVVAF 292 >ref|XP_007225677.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] gi|462422613|gb|EMJ26876.1| hypothetical protein PRUPE_ppa002814mg [Prunus persica] Length = 630 Score = 205 bits (521), Expect 
= 1e-50 Identities = 112/223 (50%), Positives = 141/223 (63%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188 K +V T ++ QE V+ +G Y++ A FP Q+ G PP Sbjct: 79 KTDVTETRVQAGVSQESKIPGVSVQGKYSSAVAQFPEQQ------GQPP----------- 121 Query: 189 PEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQVPPNQ 368 +A + G +GY GS +MP D + + K E P M + G Q+P NQ Sbjct: 122 --VAKEPELGSTGY-GSTTMPPNVGGDSSDITGKTALESVPSMNSGTAGPTGVTQMPTNQ 178 Query: 369 MNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKEIKFF 548 +++ VN NR M +E +RP VENG+TMLFVGELHWWTTDAE+ESVL QYG+VKEIKFF Sbjct: 179 ISIK--VNANRPMFNENQIRPPVENGSTMLFVGELHWWTTDAELESVLSQYGRVKEIKFF 236 Query: 549 DERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 DERASGKSKGYCQVEF+DP+AA+ACKEGM+G+ FNGRACVVAF Sbjct: 237 DERASGKSKGYCQVEFHDPAAATACKEGMDGYLFNGRACVVAF 279 >gb|EXB82464.1| Cleavage and polyadenylation specificity factor subunit [Morus notabilis] Length = 636 Score = 197 bits (502), Expect = 2e-48 Identities = 110/227 (48%), Positives = 141/227 (62%), Gaps = 4/227 (1%) Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYAATAAPFPVQKTSLPGPGGPPQTMDASQRGRL 188 K N P E Q+ N V+ EG +++ + FP Q+ L + S+ G + Sbjct: 81 KRNFPEPREEIGGSQQPNIPGVSAEGRFSSAGSQFPGQQDGL-------KVDKKSEAGSM 133 Query: 189 --PEMAHNSQAGH--SGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMGNTKGAPQV 356 P+ A SQ G +G+QGS M H D + K++ EP + +G + Sbjct: 134 VYPDGASGSQKGRIVAGFQGSKPMLHSVGVDSSDIPGKMVNEPIQAPNSGGAGPRGILPM 193 Query: 357 PPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVLIQYGKVKE 536 NQ +N NV+ + +E +RPS+ENG+TMLFVGELHWWTTDAE+ESVL QYG+VKE Sbjct: 194 QGNQTTVNANVS--HPIVNENQIRPSIENGSTMLFVGELHWWTTDAELESVLSQYGRVKE 251 Query: 537 IKFFDERASGKSKGYCQVEFYDPSAASACKEGMNGHSFNGRACVVAF 677 IKFFDERASGKSKGYCQVE+YD +AA ACKEGM+GH FNGRACVVAF Sbjct: 252 IKFFDERASGKSKGYCQVEYYDAAAAVACKEGMHGHVFNGRACVVAF 298 >emb|CBI16834.3| unnamed protein product [Vitis vinifera] Length = 491 Score = 187 bits (476), Expect = 2e-45 Identities = 108/232 (46%), Positives = 138/232 (59%), Gaps = 14/232 (6%) 
Frame = +3 Query: 9 KANVPGTHLEGVALQEVNNVKVAEEGNYA------------ATAAPFPVQKTSLPGPGGP 152 K +VP LE Q + V+ EG Y+ A P + L GP Sbjct: 61 KTDVPPQKLEAGTSQGLIIPGVSIEGKYSNPHFHEKKEGPMAVKGPEMGSTSHLDGPS-- 118 Query: 153 PQTMDASQRGRLPEMAHNSQAGHSGYQGSASMPHKNAADQMNNSEKVIGEPAPLMYTNMG 332 SQ+GR+ EM H++Q + G+QGS +P K A+ + K+ E P++ + G Sbjct: 119 -----VSQKGRVLEMTHDTQVRNLGFQGSTPIPQKTGAEPSDVHGKIANESTPVLNSGTG 173 Query: 333 NTKGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTMLFVGELHWWTTDAEIESVL 512 + PQ+ NQM +N VN+NR M +E +RP+V+NG TMLFVGELHWWTTDAE+ESVL Sbjct: 174 GPRAVPQMLSNQMGMN--VNVNRPMVNENQIRPAVDNGATMLFVGELHWWTTDAELESVL 231 Query: 513 IQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASAC--KEGMNGHSFNGRA 662 QYG+VKEIKFFDERASGKSKGYCQVEFYD SAA+A KEG+ G A Sbjct: 232 SQYGRVKEIKFFDERASGKSKGYCQVEFYDASAAAAFSGKEGILNRGPGGLA 283 >ref|XP_006417146.1| hypothetical protein EUTSA_v10007191mg [Eutrema salsugineum] gi|557094917|gb|ESQ35499.1| hypothetical protein EUTSA_v10007191mg [Eutrema salsugineum] Length = 578 Score = 179 bits (454), Expect = 7e-43 Identities = 88/133 (66%), Positives = 102/133 (76%), Gaps = 1/133 (0%) Frame = +3 Query: 282 SEKVIGEPAPLMYTNMGNT-KGAPQVPPNQMNLNPNVNINRSMDDEYMVRPSVENGNTML 458 S + P P ++ G +GA Q+P +QMN NPN +NRS ++V +NGNTML Sbjct: 153 SGNAVNVPEPPVHNPYGAVPQGAQQIPVSQMNANPNAMVNRSPTQPFVV----DNGNTML 208 Query: 459 FVGELHWWTTDAEIESVLIQYGKVKEIKFFDERASGKSKGYCQVEFYDPSAASACKEGMN 638 FVGELHWWTTDAEIESVL QYG+VKEIKFFDER SGKSKGYCQVEFYD +AA+ACKEGMN Sbjct: 209 FVGELHWWTTDAEIESVLSQYGRVKEIKFFDERVSGKSKGYCQVEFYDSAAAAACKEGMN 268 Query: 639 GHSFNGRACVVAF 677 G FNG+ACVVAF Sbjct: 269 GFVFNGKACVVAF 281