BLASTX nr result
ID: Mentha22_contig00032056
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00032056
         (776 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU32028.1| hypothetical protein MIMGU_mgv1a001944mg [Mimulus...   155   1e-35
ref|XP_007051995.1| Nucleotidyltransferase family protein isofor...   111   3e-22
ref|XP_007051994.1| Nucleotidyltransferase family protein isofor...   111   3e-22
ref|XP_007051993.1| Nucleotidyltransferase family protein isofor...   111   3e-22
ref|XP_007051992.1| Nucleotidyltransferase family protein isofor...   111   3e-22
ref|XP_007051991.1| Nucleotidyltransferase family protein isofor...   111   3e-22
gb|EXC11712.1| Poly(A) RNA polymerase cid11 [Morus notabilis]         106   1e-20
ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603...   105   1e-20
ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus co...   100   1e-18
ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244...    97   5e-18
ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Popu...    97   6e-18
ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611...    95   3e-17
ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, part...    95   3e-17
dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]                          93   1e-16
ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arab...    89   2e-15
ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidop...    81   5e-13
ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812...    79   2e-12
ref|XP_006295859.1| hypothetical protein CARUB_v10024989mg [Caps...    79   2e-12
ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, part...    76   1e-11
ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313...    76   1e-11

>gb|EYU32028.1| hypothetical protein MIMGU_mgv1a001944mg [Mimulus guttatus]
          Length = 735

 Score = 155 bits (393), Expect = 1e-35
 Identities = 114/312 (36%), Positives = 158/312 (50%), Gaps = 59/312 (18%)
 Frame = +2

Query: 14  HSPPQFDNQSRRILPSGDDARNLRFYGDNSKPSLA------------------------- 118
           ++P QF+ QS RI P G+DAR L YGDNS+PS A
Sbjct: 116 YAPHQFNLQSNRISP-GEDARKLAPYGDNSRPSAAAHQQLQSNRIPLGEDARRLGVFGEI 174

Query: 119 -------NQPGQN-LMFGSLSREV----AANALDQSLYRMND----------------NR 214
           +Q QN L+FGSL+R++ A + L QSL+ M+ NR
Sbjct: 175 ATPSVAQHQREQNHLIFGSLNRDILQTDAGDVLHQSLHPMDKLGNSYLEEVLGMDRRMNR 234

Query: 215 FPQMRISEDILRTQSSTSNDRVKLGDGGSNKTVAPPPDFLSNSKDVRHREPGYGRSASDV 394
           FP ++ + +S+ N+R GD GS++ +APP +N K+V +RE GY D
Sbjct: 235 FPVNEVNGN--SRGNSSGNERRNQGDNGSHRALAPPGFSSNNMKNVGNREHGYVTRNPDN 292

Query: 395 NGDKGKASSGQLHKNDKLSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKR-- 568
           DKGK +SG +KN +SN ++ PG SM +H E+G K
Sbjct: 293 YVDKGKGNSGGSYKNGGVSNPINSPG-----------------SMMGIHVEDGGKGKELR 335

Query: 569 --GVEKKVNNDG--SEMDDLENQVDSLGVEDESGEKNNKNKLHRDKDYRSDDRGKWIMGQ 736
           G K D S+M+ +E+Q+ SLG+E+ESGE ++K K DK+YRSD RG+WIMGQ
Sbjct: 336 FGGQNNKNQGDRAQSKMNGIEDQMGSLGIEEESGETSDKKKNPHDKEYRSDQRGQWIMGQ 395

Query: 737 RMRIMKRQTTCR 772
           RMR +K QT CR
Sbjct: 396 RMRHVKMQTACR 407

>ref|XP_007051995.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao]
 gi|508704256|gb|EOX96152.1| Nucleotidyltransferase family protein isoform 5 [Theobroma cacao]
          Length = 635

 Score = 111 bits (278), Expect = 3e-22
 Identities = 87/268 (32%), Positives = 132/268 (49%), Gaps = 31/268 (11%)
 Frame = +2

Query: 65  DDARNLRFYG-DNSKPSLANQ------PGQNLMFGSLSREVAA----------NALDQSL 193
           DD R L G DN+K + Q L+FGS ++ N L+ S
Sbjct: 128 DDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSK 187

Query: 194 YRMNDNRFPQMRISEDILRT---QSSTSNDRVKLGDGGSNKTVAP-------PPDFLSNS 343
           +++ + S Q S DR K G + P PP FL
Sbjct: 188 LNLSNQQLDSRLNSNPNTSPYVFQHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKP 247

Query: 344 KDVR-HREPGYGRSASDVNGDKGKASSGQLHKNDK--LSNQLDFPGLPAGSSIHSASTFE 514
           + +R+ G R + N DK KA Q +++ LS QLD PG PAGS++ S S +
Sbjct: 248 RGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATD 307

Query: 515 IGESMKQLHAENGQDSKRGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLHRD 691
           I ES+ +LH++ G+D +K DG E+D++ E ++SL +EDES +KN+K + R+
Sbjct: 308 IEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRRE 367

Query: 692 KDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           K+ R D+RG+ ++ QRMR++KRQ CR+
Sbjct: 368 KESRIDNRGQRLLSQRMRMLKRQMECRS 395

>ref|XP_007051994.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma cacao]
 gi|508704255|gb|EOX96151.1| Nucleotidyltransferase family protein isoform 4, partial [Theobroma cacao]
          Length = 585

 Score = 111 bits (278), Expect = 3e-22
 Identities = 87/268 (32%), Positives = 132/268 (49%), Gaps = 31/268 (11%)
 Frame = +2

Query: 65  DDARNLRFYG-DNSKPSLANQ------PGQNLMFGSLSREVAA----------NALDQSL 193
           DD R L G DN+K + Q L+FGS ++ N L+ S
Sbjct: 128 DDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSK 187

Query: 194 YRMNDNRFPQMRISEDILRT---QSSTSNDRVKLGDGGSNKTVAP-------PPDFLSNS 343
           +++ + S Q S DR K G + P PP FL
Sbjct: 188 LNLSNQQLDSRLNSNPNTSPYVFQHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKP 247

Query: 344 KDVR-HREPGYGRSASDVNGDKGKASSGQLHKNDK--LSNQLDFPGLPAGSSIHSASTFE 514
           + +R+ G R + N DK KA Q +++ LS QLD PG PAGS++ S S +
Sbjct: 248 RGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATD 307

Query: 515 IGESMKQLHAENGQDSKRGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLHRD 691
           I ES+ +LH++ G+D +K DG E+D++ E ++SL +EDES +KN+K + R+
Sbjct: 308 IEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRRE 367

Query: 692 KDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           K+ R D+RG+ ++ QRMR++KRQ CR+
Sbjct: 368 KESRIDNRGQRLLSQRMRMLKRQMECRS 395

>ref|XP_007051993.1| Nucleotidyltransferase family protein isoform 3, partial [Theobroma cacao]
 gi|508704254|gb|EOX96150.1| Nucleotidyltransferase family protein isoform 3, partial
[Theobroma cacao]
          Length = 584

 Score = 111 bits (278), Expect = 3e-22
 Identities = 87/268 (32%), Positives = 132/268 (49%), Gaps = 31/268 (11%)
 Frame = +2

Query: 65  DDARNLRFYG-DNSKPSLANQ------PGQNLMFGSLSREVAA----------NALDQSL 193
           DD R L G DN+K + Q L+FGS ++ N L+ S
Sbjct: 128 DDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSK 187

Query: 194 YRMNDNRFPQMRISEDILRT---QSSTSNDRVKLGDGGSNKTVAP-------PPDFLSNS 343
           +++ + S Q S DR K G + P PP FL
Sbjct: 188 LNLSNQQLDSRLNSNPNTSPYVFQHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKP 247

Query: 344 KDVR-HREPGYGRSASDVNGDKGKASSGQLHKNDK--LSNQLDFPGLPAGSSIHSASTFE 514
           + +R+ G R + N DK KA Q +++ LS QLD PG PAGS++ S S +
Sbjct: 248 RGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATD 307

Query: 515 IGESMKQLHAENGQDSKRGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLHRD 691
           I ES+ +LH++ G+D +K DG E+D++ E ++SL +EDES +KN+K + R+
Sbjct: 308 IEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRRE 367

Query: 692 KDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           K+ R D+RG+ ++ QRMR++KRQ CR+
Sbjct: 368 KESRIDNRGQRLLSQRMRMLKRQMECRS 395

>ref|XP_007051992.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao]
 gi|508704253|gb|EOX96149.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao]
          Length = 621

 Score = 111 bits (278), Expect = 3e-22
 Identities = 87/268 (32%), Positives = 132/268 (49%), Gaps = 31/268 (11%)
 Frame = +2

Query: 65  DDARNLRFYG-DNSKPSLANQ------PGQNLMFGSLSREVAA----------NALDQSL 193
           DD R L G DN+K + Q L+FGS ++ N L+ S
Sbjct: 128 DDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSK 187

Query: 194 YRMNDNRFPQMRISEDILRT---QSSTSNDRVKLGDGGSNKTVAP-------PPDFLSNS 343
           +++ + S Q S DR K G + P PP FL
Sbjct: 188 LNLSNQQLDSRLNSNPNTSPYVFQHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKP 247

Query: 344 KDVR-HREPGYGRSASDVNGDKGKASSGQLHKNDK--LSNQLDFPGLPAGSSIHSASTFE 514
           + +R+ G R + N DK KA Q +++ LS QLD PG PAGS++ S S +
Sbjct: 248 RGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATD 307

Query: 515 IGESMKQLHAENGQDSKRGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLHRD 691
           I ES+ +LH++ G+D +K DG E+D++ E ++SL +EDES +KN+K + R+
Sbjct: 308 IEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRRE 367

Query: 692 KDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           K+ R D+RG+ ++ QRMR++KRQ CR+
Sbjct: 368 KESRIDNRGQRLLSQRMRMLKRQMECRS 395

>ref|XP_007051991.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao]
 gi|508704252|gb|EOX96148.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao]
          Length = 722

 Score = 111 bits (278), Expect = 3e-22
 Identities = 87/268 (32%), Positives = 132/268 (49%), Gaps = 31/268 (11%)
 Frame = +2

Query: 65  DDARNLRFYG-DNSKPSLANQ------PGQNLMFGSLSREVAA----------NALDQSL 193
           DD R L G DN+K + Q L+FGS ++ N L+ S
Sbjct: 128 DDLRRLGLSGIDNNKNHVIQNRVQQKHQDQKLVFGSFPSDIQTLKTPEGSPNGNLLENSK 187

Query: 194 YRMNDNRFPQMRISEDILRT---QSSTSNDRVKLGDGGSNKTVAP-------PPDFLSNS 343
           +++ + S Q S DR K G + P PP FL
Sbjct: 188 LNLSNQQLDSRLNSNPNTSPYVFQHRNSGDRGKQQQHGGSYRPTPSPEARRSPPGFLGKP 247

Query: 344 KDVR-HREPGYGRSASDVNGDKGKASSGQLHKNDK--LSNQLDFPGLPAGSSIHSASTFE 514
           + +R+ G R + N DK KA Q +++ LS QLD PG PAGS++ S S +
Sbjct: 248 RGGGGNRDFGNRRRHFEHNVDKAKAEYSQPSSDNEVGLSGQLDRPGPPAGSNLQSVSATD 307

Query: 515 IGESMKQLHAENGQDSKRGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLHRD 691
           I ES+ +LH++ G+D +K DG E+D++ E ++SL +EDES +KN+K + R+
Sbjct: 308 IEESLLELHSDGGRDRFSRRDKFRREDGGEVDEVGEQLLESLLIEDESDDKNDKKQHRRE 367

Query: 692 KDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           K+ R D+RG+ ++ QRMR++KRQ CR+
Sbjct: 368 KESRIDNRGQRLLSQRMRMLKRQMECRS 395

>gb|EXC11712.1| Poly(A) RNA polymerase cid11 [Morus notabilis]
          Length = 703

 Score = 106 bits (264), Expect = 1e-20
 Identities = 98/287 (34%), Positives = 145/287 (50%), Gaps = 30/287 (10%)
 Frame = +2

Query: 5   GFAHS--PPQFDNQSRRILPS-GDDARNLRFYGD-NSKPSL-----------ANQPGQNL 139
           GF HS P QF Q +++ + G+D R L F G NS P+L NQ L
Sbjct: 112 GFPHSFFPNQF--QGKQVSGNVGEDLRRLGFSGGVNSNPNLNLNPIHGIVQQKNQLEHKL 169

Query: 140 MFGSLSREVAANALDQSLYRMNDNRFPQMRISEDILRTQSSTSNDRVKLGDGGSNKTVAP 319
           FGSL E+ + ++L +++ + F + + R S++S++ V+ G+ +T P
Sbjct: 170 KFGSLPSEIVI--IPEALPKVDASNFNNL--VDRSRRLSSNSSSNAVRQGNYEHQRT-NP 224

Query: 320 PPDFLSNSK--DVRHREPGYGRSASDVN----------GDKGKASSGQLHKNDKLSNQLD 463
           PP F S K + H G + D+ G +G S G +LS QLD
Sbjct: 225 PPGFRSKPKRTGLNHSIGGENSVSGDLMRTRDVLAEDIGIRGDGSRGL-----ELSAQLD 279

Query: 464 FPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKRGVEKKVNNDGSEMDDL-ENQVDSLG 640
           PG P+GS++ S ++ ESM +L ++ VE G E+DD+ + VDSL
Sbjct: 280 RPGPPSGSNLRSVLASDVEESMMKLESD-------AVEV---GGGHEIDDIGQRLVDSLL 329

Query: 641 VEDESGEKNN--KNKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           +EDES +KN K+K RDKD RSD RG+ ++ QRMR+ KRQ CR+
Sbjct: 330 IEDESDDKNETKKHKNSRDKDSRSDSRGQRLLSQRMRVYKRQMRCRS 376

>ref|XP_006339776.1| PREDICTED: uncharacterized protein LOC102603223 [Solanum tuberosum]
          Length = 775

 Score = 105 bits (263), Expect = 1e-20
 Identities = 88/310 (28%), Positives = 143/310 (46%), Gaps = 72/310 (23%)
 Frame = +2

Query: 62  GDDARNLRFYGDNSKPSLANQP-GQNLMFGSLSREVAANA--------------LDQSLY 196
           G++ NL +G N+K S +N NL+FGSL R++ N +
Sbjct: 139 GENMGNLGIFGANAKASNSNNEFDHNLIFGSLRRDIQGNVSMLNDRFSDDLACKVGNFEQ 198

Query: 197 RMNDNRFPQMRISEDIL-RTQSSTSNDRVKLGD--------------------------- 292
           + ++R +R+ + + ++ + R +LG+
Sbjct: 199 KNQESRLTNVRMLNGVEGKRENVIGSGRKQLGNLRGLEQQNRGGGGGESESGGLGRGRQF 258

Query: 293 -GGSNKTVAPPPDFLSN--SKDVRHREPGYGRSASDVN------GDKGKASSGQLHKNDK 445
           G+ + PPP F S S+D H + ++N K + S L +N K
Sbjct: 259 HSGTVRGAVPPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGLNHKYERESKHLTRNGK 318

Query: 446 ----------LSNQLDFPGLPAGSSIHSASTFEIGESMKQLH---AENGQDSKRGVEKKV 586
           + QLD P PAGS +HS ++ +S +LH AE+G+++ G+ +
Sbjct: 319 NYAIGSDDQRVFRQLDSPVPPAGSKLHSVLGSDVEDSTLELHGEDAESGEETVSGMRNVL 378

Query: 587 NNDG----SEMDDL-ENQVDSLGVEDESGEKNNKNKLH--RDKDYRSDDRGKWIMGQRMR 745
           S++D+L E+ + SLG+EDE E+++K K H RDKDYRSD RG +I+GQRMR
Sbjct: 379 GRSSAQGQSDLDELGEHVISSLGLEDEPDERSDKKKHHASRDKDYRSDKRGAYILGQRMR 438

Query: 746 IMKRQTTCRN 775
           ++KRQ CR+
Sbjct: 439 MLKRQIACRS 448

>ref|XP_002511755.1| poly(A) polymerase cid, putative [Ricinus communis]
 gi|223548935|gb|EEF50424.1| poly(A) polymerase cid, putative [Ricinus
communis]
          Length = 696

 Score = 99.8 bits (247), Expect = 1e-18
 Identities = 96/311 (30%), Positives = 136/311 (43%), Gaps = 55/311 (17%)
 Frame = +2

Query: 5   GFAHSPP----QFDNQSRRILPSGDDARNLRFYGDNSK----PSLANQPGQNLMFGSLSR 160
           GF + P QF +R GDD + L N++ Q Q L FGS
Sbjct: 108 GFPQNHPWQGSQFQGSDQRGF-LGDDLQRLGLSSGNTRIRNLVQQKQQLEQKLQFGSFRS 166

Query: 161 EV--------------AANALDQSLYRMNDNRFPQMRISEDILRTQSSTSNDRVKLGDGG 298
           ++ AA L L N N + E L + TS+ R + GG
Sbjct: 167 DIQPPEGLLNLNSKLNAAKELGVDLGIRNLNGMERNLHFEPQLMSNLRTSDLREQDQRGG 226

Query: 299 -----------SNKTVAPPPDFLSNSKDVRHREPGYGRSASDVNGDKGKASSGQLHKNDK 445
           S +T PPP F + + + + R D N +K K + +L K +
Sbjct: 227 WGKQPHGSNYRSQETRMPPPGFSNKPRGGGNMDHVSRRRELDHNVNKEKGNHSELSKRNA 286

Query: 446 -------------------LSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKR 568
           L+ QLD PG PAGS++HS S +I ES+ +AE +D K
Sbjct: 287 FLSSESKSLRDGNGSRDLGLTRQLDHPGPPAGSNLHSVSALDIEESLLNFNAEMVEDGK- 345

Query: 569 GVEKKVNNDGSEMDDL-ENQVDSLGVEDES-GEKNNKNKLH-RDKDYRSDDRGKWIMGQR 739
           NDG ++DD+ E D+L +E ES G+ +NK H RDK+ RSD+RG+ I+ QR
Sbjct: 346 -------NDGHDLDDVGEELADTLLLEGESEGKNDNKQNRHSRDKESRSDNRGQQILSQR 398

Query: 740 MRIMKRQTTCR 772
           MR++KRQ CR
Sbjct: 399 MRMLKRQMECR 409

>ref|XP_004229872.1| PREDICTED: uncharacterized protein LOC101244121 [Solanum lycopersicum]
          Length = 775

 Score = 97.4 bits (241), Expect = 5e-18
 Identities = 84/312 (26%), Positives = 137/312 (43%), Gaps = 74/312 (23%)
 Frame = +2

Query: 62  GDDARNLRFYGDNSKPSLANQP-GQNLMFGSLSREVAANA--------------LDQSLY 196
           G++ NL +G N+K S +N NL+FGSL + N +
Sbjct: 137 GENMGNLGIFGANAKASNSNNEFDHNLIFGSLRSHIQGNVSMMNDRFSDDLASKVGNFEQ 196

Query: 197 RMNDNRFPQMRISEDIL-RTQSSTSNDRVKLGD--------------------------- 292
           + +++R +R+ + + ++ + R +LG+
Sbjct: 197 KNHESRLANVRMLNGVEGKLENVIGSGRKQLGNLRGLEQQNSGGGGGESESESGGLGWGR 256

Query: 293 ---GGSNKTVAPPPDFLSN--SKDVRHREPGYGRSASDVN------GDKGKASSGQLHKN 439
           G+ + V PPP F S S+D H + ++N K + S L +N
Sbjct: 257 QFHSGTVRGVVPPPGFSSKPRSRDFEHNVDNEKNNFVELNHRGIGLNHKYERESKHLSRN 316

Query: 440 DK----------LSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKRGVE---- 577
           K + +LD P PAGS +HS ++ +S +L E+ + + V
Sbjct: 317 GKNYAIGSDDQRVFRRLDSPVPPAGSKLHSVLASDVEDSTLELRGEDAESGEETVSVMRD 376

Query: 578 ---KKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLH--RDKDYRSDDRGKWIMGQR 739
           + SE+D+L E+ + SLG+EDE E+++K H RDKDYRSD RG +I+GQR
Sbjct: 377 VLGRSSAQGQSELDELGEHVISSLGLEDEPNERSDKKNHHASRDKDYRSDKRGAYILGQR 436

Query: 740 MRIMKRQTTCRN 775
           MR++KRQ CR+
Sbjct: 437 MRMLKRQIACRS 448

>ref|XP_002301312.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa]
 gi|550345065|gb|EEE80585.2| hypothetical protein POPTR_0002s15230g [Populus trichocarpa]
          Length = 728

 Score = 97.1 bits (240), Expect = 6e-18
 Identities = 58/129 (44%), Positives = 81/129 (62%), Gaps = 1/129 (0%)
 Frame = +2

Query: 386 SDVNGDKGKASSGQLHKNDKLSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSK 565
           S++N +K + S G + L+ QLD PG PAGS++HS EIGES+ L ENG+D K
Sbjct: 283 SEMNNEKVRRSEGSVELG--LTRQLDRPGPPAGSNLHSVLGSEIGESLINLDGENGEDGK 340

Query: 566 RGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKNNKNKLHRDKDYRSDDRGKWIMGQRM 742
           +DG E+DDL E VDSL + +S + K+K +K+ RSD+RGK I+ QRM
Sbjct: 341 --------DDGGELDDLGEELVDSLLLNGQS--EGKKDKKQSNKESRSDNRGKKILSQRM 390

Query: 743 RIMKRQTTC 769
           R++K+QT C
Sbjct: 391 RMLKKQTQC 399

>ref|XP_006490961.1| PREDICTED: uncharacterized protein LOC102611932 [Citrus sinensis]
          Length = 699

 Score = 94.7 bits (234), Expect = 3e-17
 Identities = 93/292 (31%), Positives = 139/292 (47%), Gaps = 36/292 (12%)
 Frame = +2

Query: 5   GFAHSP---PQFDNQSRRILPSGDDARNLRFYGDNSKP--SLANQPG----QNLMFGSLS 157
           GF +P +NQ +R+L +D L F N +L QP QNL FGS
Sbjct: 87  GFPQNPWASSSTENQQQRLLC--EDFGRLGFSNANYAAIHNLIQQPNHQQQQNLRFGSFQ 144

Query: 158 RE----VAANALDQSLYRMNDN-RFPQMRISED------ILRTQSSTSNDRVKLGDGGSN 304
           + + N L+ Y ++ N +F Q R S + R ++ ++LG
Sbjct: 145 VQPDSLLNLNHLENLKYNLDRNSQFDQPRASSISNPNSFLHRNLENSREHDLRLGKQHYG 204

Query: 305 KTVAPPPDFLSNSK--DVRHREPGYGRSASDVNGDKGKASSGQLHKNDKLSNQLDFPGLP 478
           T PPP F + ++ + G+ + +N A G L+ QLD PG P
Sbjct: 205 ST--PPPGFSNKARVGGSGNSRRGFEHNVDMINRFTSSAVEGG--NGVGLTRQLDRPGPP 260

Query: 479 AGSSIHSASTFEIGESMKQLHAENGQDSKRGVEKKVNND------GSEMDDL-ENQVDSL 637
           +GS++HS S +I ES+ L E G++ G++K+ N G +MDD E+ VDSL
Sbjct: 261 SGSNLHSVSALDIEESLLDLRRE-GRERHLGLDKRRENGPGYSQGGDDMDDFGEDLVDSL 319

Query: 638 GVEDESGEKNN-------KNKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTCR 772
           +DES KN+ K++ RDK+ RSD+RGK ++ QRMR +K Q CR
Sbjct: 320 LPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIECR 371

>ref|XP_006445207.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina]
 gi|557547469|gb|ESR58447.1| hypothetical protein CICLE_v10023615mg, partial [Citrus clementina]
          Length = 1046

 Score = 94.7 bits (234), Expect = 3e-17
 Identities = 93/292 (31%), Positives = 139/292 (47%), Gaps = 36/292 (12%)
 Frame = +2

Query: 5   GFAHSP---PQFDNQSRRILPSGDDARNLRFYGDNSKP--SLANQPG----QNLMFGSLS 157
           GF +P +NQ +R+L +D L F N +L QP QNL FGS
Sbjct: 118 GFPQNPWASSSTENQQQRLLC--EDFGRLGFSNANYAAIHNLIQQPNHQQQQNLRFGSFQ 175

Query: 158 RE----VAANALDQSLYRMNDN-RFPQMRISED------ILRTQSSTSNDRVKLGDGGSN 304
           + + N L+ Y ++ N +F Q R S + R ++ ++LG
Sbjct: 176 VQPDSLLNLNHLENLKYNLDRNSQFDQPRASSISNPNSFLHRNLENSREHDLRLGKQHYG 235

Query: 305 KTVAPPPDFLSNSK--DVRHREPGYGRSASDVNGDKGKASSGQLHKNDKLSNQLDFPGLP 478
           T PPP F + ++ + G+ + +N A G L+ QLD PG P
Sbjct: 236 ST--PPPGFSNKARVGGSGNSRRGFEHNVDMINRFTSSAVEGG--NGVGLTRQLDRPGPP 291

Query: 479 AGSSIHSASTFEIGESMKQLHAENGQDSKRGVEKKVNND------GSEMDDL-ENQVDSL 637
           +GS++HS S +I ES+ L E G++ G++K+ N G +MDD E+ VDSL
Sbjct: 292 SGSNLHSVSALDIEESLLDLRRE-GRERHLGLDKRRENGPGYSQGGDDMDDFGEDLVDSL 350

Query: 638 GVEDESGEKNN-------KNKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTCR 772
           +DES KN+ K++ RDK+ RSD+RGK ++ QRMR +K Q CR
Sbjct: 351 LPDDESELKNDTHERNDKKHRNSRDKEIRSDNRGKRLLSQRMRNLKWQIECR 402

>dbj|BAJ53142.1| JHL05D22.13 [Jatropha curcas]
          Length = 748

 Score = 92.8 bits (229), Expect = 1e-16
 Identities = 90/301 (29%), Positives = 133/301 (44%), Gaps = 52/301 (17%)
 Frame = +2

Query: 26  QFDNQSRRILPSGDDARNLRFYGDN--------SKPSLANQPGQNLMFGSLSREV----- 166
           QF + +L GDD + L F G + ++ Q Q L FGS ++
Sbjct: 130 QFQGNQQGVL--GDDLQILGFSGADVRANNTIHNRVQQKQQLEQKLQFGSFRSDIQNVEA 187

Query: 167 ---------AANALDQSLYRMNDNRFPQMRISEDILRTQSSTSNDRV-----KLGDGGS- 301
           AA L+ L N N + + LRT DR K GG+
Sbjct: 188 LLNVNSKLNAAKELEVRLATRNLNGLESDQKFDSQLRTFDLREQDRSGGGWRKQPHGGNY 247

Query: 302 --NKTVAPPPDFLSNSKDVRHREPGYGRSASDVNGDKGKASSGQLHKNDKL--------- 448
           +T PPP F + + + + R D N +K K + G+L + L
Sbjct: 248 RPQETRMPPPGFSNKPRGGGNWDYVSRRRELDYNVNKEKGNQGELSNRNALFSSEDKIPR 307

Query: 449 ----------SNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKRGVEKKVNNDG 598
           + QLD PG PAGS+++S S ++ SM + AE +D K ++G
Sbjct: 308 DGDRSRDLGLTGQLDRPGPPAGSNLYSVSAADVELSMLNVEAEVVEDGK--------DEG 359

Query: 599 SEMDDL-ENQVDSLGVEDESGEKNNK--NKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTC 769
           E+D+ E VDSL +E ES KN+K N+ R+K+ RSD+RG+ + QRMR++KRQ C
Sbjct: 360 RELDEAGEELVDSLLLEGESDGKNDKKQNRHSREKESRSDNRGQRTLSQRMRMLKRQMEC 419

Query: 770 R 772
           R
Sbjct: 420 R 420

>ref|XP_002880188.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp. lyrata]
 gi|297326027|gb|EFH56447.1| hypothetical protein ARALYDRAFT_483698 [Arabidopsis lyrata subsp.
lyrata]
          Length = 757

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 97/321 (30%), Positives = 142/321 (44%), Gaps = 71/321 (22%)
 Frame = +2

Query: 26  QFDNQSRRILPSGDDARNLRFYG--DNSKPSLANQPGQNL----------MFGSLSREV- 166
           QFD R S +DA L F G +++ S+ Q Q L +FGS S +
Sbjct: 105 QFDGNQR---VSPEDAFRLGFPGTANHAIQSMVQQQQQQLPPPQSENRKLVFGSFSGDAT 161

Query: 167 -AANALDQSLYRMNDNRFPQ-MRISEDILRTQSSTSN---------DRVKLGDGGSN--- 304
           + N L + + N+ Q MR + +L + N R G G+N
Sbjct: 162 QSLNGLHNGNLKYDSNQHEQLMRHPQSVLSNSNMDPNLHEPRGSHSGRGNWGHIGNNGRG 221

Query: 305 -KTVAPPPDFLSN---------SKDVRHREPGYGRSASDVNGDKGK---------ASSGQ 427
           K+ PPP F SN SKD + R+ G+ K A + +
Sbjct: 222 FKSTPPPPGFSSNQRGRDMNLTSKDDDRGMGSFHRNHDQAMGEHSKFWDQSVNFSAEADR 281

Query: 428 LH----KNDK---LSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKRGVE--- 577
           L +ND LS Q+D PGLP G+S+HS S + +S L+ E S+R E
Sbjct: 282 LRGLSIQNDSKFNLSQQIDHPGLPKGTSLHSVSAADAADSFSMLNKEARGGSERKEELGR 341

Query: 578 ----KKVNNDGS-----EMDDL-ENQVDSLGVEDESGEKNNK-----NKLHRDKDYRSDD 712
           K+ N S E++D E+ V SL +EDE+GEK+ K +K R+KD R D+
Sbjct: 342 LSKGKREGNANSGPVDDEIEDFGEDIVKSLLLEDETGEKDAKDGKKDSKTSREKDSRMDN 401

Query: 713 RGKWIMGQRMRIMKRQTTCRN 775
           RG+ ++GQ+ R++K CRN
Sbjct: 402 RGQRLLGQKARMVKMYMACRN 422

>ref|NP_566048.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
 gi|13430538|gb|AAK25891.1|AF360181_1 unknown protein [Arabidopsis thaliana]
 gi|14532746|gb|AAK64074.1| unknown protein [Arabidopsis thaliana]
 gi|20197056|gb|AAC06161.2| expressed protein [Arabidopsis thaliana]
 gi|330255483|gb|AEC10577.1| Nucleotidyltransferase family protein [Arabidopsis thaliana]
          Length = 764

 Score = 80.9 bits (198), Expect = 5e-13
 Identities = 96/340 (28%), Positives = 135/340 (39%), Gaps = 83/340 (24%)
 Frame = +2

Query: 5   GFAHSPP------QFDNQSRRILPSGDDARNLRFYGDNSKP--SLANQPGQN-------- 136
           GF PP QFD R S +DA L F G + S+ Q Q
Sbjct: 95  GFPQFPPSPFTTNQFDGNQR---VSPEDAYRLGFPGTTNPAIQSMVQQQQQQQLPPPQSE 151

Query: 137 ---LMFGSLSREV--AANALDQSLYRMNDN------RFPQMRISEDI------------L 247
           L+FGS S + + N L + + N R PQ +S L
Sbjct: 152 TRKLVFGSFSGDATQSLNGLHNGNLKYDSNQHEQLMRHPQSTLSNSNMDPNLSHHRNHDL 211

Query: 248 RTQSSTSNDRVKLGDGGSN------KTVAPPPDFLSNSKD------VRHREPGYGRSASD 391
           Q + R G G+N PPP F SN + + + G GR+
Sbjct: 212 HEQRGGHSGRGNWGHIGNNGRGLKSTPPPPPPGFSSNQRGWDMSLGSKDDDRGMGRNHDQ 271

Query: 392 VNGDKGKA----------------SSGQLHKNDKLSNQLDFPGLPAGSSIHSASTFEIGE 523
           G+ K S Q LS Q+D PG P G+S+HS S + +
Sbjct: 272 AMGEHSKVWNQSVDFSAEANRLRGLSIQNESKFNLSQQIDHPGPPKGASLHSVSAADAAD 331

Query: 524 SMKQLHAEN----------GQDSKRGVEKKVNNDGSEMDDL-ENQVDSLGVEDESGEKN- 667
           S L+ E GQ SK E N+D E++D E+ V SL +EDE+GEK+
Sbjct: 332 SFSMLNKEARRGGERREELGQLSKAKREGNANSD--EIEDFGEDIVKSLLLEDETGEKDA 389

Query: 668 ----NKNKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTCRN 775
           +K R+K+ R D+RG+ ++GQ+ R++K CRN
Sbjct: 390 NDGKKDSKTSREKESRVDNRGQRLLGQKARMVKMYMACRN 429

>ref|XP_003529982.1| PREDICTED: uncharacterized protein LOC100812787 [Glycine max]
          Length = 732

 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 78/238 (32%), Positives = 113/238 (47%), Gaps = 26/238 (10%)
 Frame = +2

Query: 137 LMFGSL------SREVAANALDQSLYRMNDNRF--PQMRISEDILRTQSSTSNDRVKLGD 292
           L FGSL + EV++N D SL + NR P S +++ + + +R + G
Sbjct: 170 LQFGSLPTVAYSAAEVSSNGGD-SLLNLKFNRVDHPTSNSSGNVVVQGNHDAVERERRGL 228

Query: 293 GGSNKTVAPPPDFLSNSKDVRHREPGYGRSASD------------VNGDKGKASSGQLHK 436
           GG + PP+ +R G G + V+G++ HK
Sbjct: 229 GGYRAGGSLPPETSRVPPGFGNRTRGKGLEGRNENLYDRREGGRMVSGERSNVRGNVGHK 288

Query: 437 NDKLSNQLDFPGLPAGSSIHSASTFE--IGE--SMKQLHAENGQDSKRGVEKKVNNDGSE 604
           L +QLD PG PAGS +HS S + IGE H E G+ GV + G++
Sbjct: 289 MG-LVDQLDRPGPPAGSHLHSGSGNDAGIGEVGGRDGKHKEIGRLRMEGVPES-GGGGAD 346

Query: 605 MDDLENQV-DSLGVEDESGEKNNKNKLHRDKDYR-SDDRGKWIMGQRMRIMKRQTTCR 772
           +D L Q+ DSL V+DES ++ N + R+KD R SD RG+ IM QR R+ +RQ CR
Sbjct: 347 VDVLGEQLADSLLVKDESDDRTNLRQRRREKDVRLSDSRGQQIMSQRGRMYRRQMMCR 404

>ref|XP_006295859.1| hypothetical protein CARUB_v10024989mg [Capsella rubella]
 gi|482564567|gb|EOA28757.1| hypothetical protein CARUB_v10024989mg [Capsella rubella]
          Length = 764

 Score = 78.6 bits (192), Expect = 2e-12
 Identities = 62/197 (31%), Positives = 91/197 (46%), Gaps = 36/197 (18%)
 Frame = +2

Query: 293 GGSNKTVAPPPDFLSN---------SKDV---------RHREPGYGRSASDVNGDKGKAS 418
           G + PPP F SN SKD H + S + D+ +
Sbjct: 233 GFKSTPTPPPPGFSSNQRGWDMNLGSKDDDRGIGSFQRNHDRAMWEHSNLNAEADRLRGL 292

Query: 419 SGQLHKNDKLSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKR----GVEKKV 586
           S Q LS Q+D PG P G+S+HS ST + S L+ E S+R G K+
Sbjct: 293 SLQNESKFNLSQQIDHPGPPKGTSLHSVSTADAANSFSMLNKEARGGSERKDELGQLSKM 352

Query: 587 NNDGS--------EMDDL-ENQVDSLGVEDESGEKNNK-----NKLHRDKDYRSDDRGKW 724
           +G+ E+DD E+ VDSL +E ++ +K+ K +K R+K+ R D+RG+W
Sbjct: 353 KREGNEKSGPGDDEIDDFGEDIVDSLLLEVDTDDKDAKDGKKNSKTSREKESRVDNRGRW 412

Query: 725 IMGQRMRIMKRQTTCRN 775
           ++ QR+R K CRN
Sbjct: 413 LLSQRLRERKMYMACRN 429

>ref|XP_006375316.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa]
 gi|550323667|gb|ERP53113.1| hypothetical protein POPTR_0014s06910g, partial [Populus trichocarpa]
          Length = 497

 Score = 76.3 bits (186), Expect = 1e-11
 Identities = 70/239 (29%), Positives = 105/239 (43%), Gaps = 23/239 (9%)
 Frame = +2

Query: 125 PGQNLMFGSLSREVAANALDQSLYRMNDNRFPQMRISEDI----LRTQSSTSNDRVKLGD 292
           P L+ +L REV + ++ + NR + + + +R ++S R L
Sbjct: 186 PADGLVNANLMREVGPGS--RNFNGLERNRHLEKQANSHSTNFEVRQPGASSGGRGNLHK 243

Query: 293 GGSNKTVAPPPDFLSNSKD------------------VRHREPGYGRSASDVNGDKGKAS 418
           +PPP F + + +RE G S++N +K + +
Sbjct: 244 EQHQNYKSPPPGFSNKPRGGGGGGNWDHGGRRRELEHTMYREKG---DYSELNNEKARRN 300

Query: 419 SGQLHKNDKLSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKRGVEKKVNNDG 598
           G + + + QLD PG P GS++HS EI ES+ L E DG
Sbjct: 301 EGSVEV--RFTRQLDRPGPPPGSNLHSVLGSEIKESLINLDGE---------------DG 343

Query: 599 SEMDDL-ENQVDSLGVEDESGEKNNKNKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTCR 772
           +DDL E +DSL +E ES K K+K K+ RSD RG I+ QRMR++KRQ CR
Sbjct: 344 GLLDDLGEELMDSLLLEGESDGK--KDKKQSSKESRSDSRGHNILSQRMRMLKRQMQCR 400

>ref|XP_004308428.1| PREDICTED: uncharacterized protein LOC101313262 [Fragaria vesca subsp.
vesca]
          Length = 699

 Score = 76.3 bits (186), Expect = 1e-11
 Identities = 69/239 (28%), Positives = 106/239 (44%), Gaps = 33/239 (13%)
 Frame = +2

Query: 155 SREVA--ANALDQSLYRMNDNRFPQMRISEDILRTQSSTSNDRVKLGDGGSN----KTVA 316
           S E+A +N LD++L+ + N S + R + ++ G GG
Sbjct: 152 SSEIAKLSNGLDRNLHLNSSNS----SASNEFRRANYGSGEGELRGGGGGERGKQVHRTM 207

Query: 317 PPPDFLSNSKDVRHREPGYGRSASDVNGDKGKASSGQLHKNDK----------------- 445
           PPP F + + + + G R + N D+ + SS +N +
Sbjct: 208 PPPGFGNKPRGGGNWDSGGRRGGMEYNVDRERQSSSGFARNREGSFDNERVRRLAGEDGG 267

Query: 446 ----------LSNQLDFPGLPAGSSIHSASTFEIGESMKQLHAENGQDSKRGVEKKVNND 595
           LS QLD PG PAG+++HS S EI ESM ++ + G+ +++ ++D
Sbjct: 268 MRGNGDGRKGLSAQLDRPGPPAGTNLHSVSASEIEESM--MNFDGGERARK------DSD 319

Query: 596 GSEMDDLENQVDSLGVEDESGEKNNKNKLHRDKDYRSDDRGKWIMGQRMRIMKRQTTCR 772
           G E V +E+E +K + H KD RSDDRG+ + QRMR KRQT CR
Sbjct: 320 GVE------DVGQHSLEEERDDKIEGKQHH--KDSRSDDRGQHQLSQRMRSYKRQTLCR 370