BLASTX nr result
ID: Sinomenium21_contig00030039
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00030039 (1146 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002270552.1| PREDICTED: uncharacterized protein LOC100261...   152   3e-34
emb|CAN61787.1| hypothetical protein VITISV_006025 [Vitis vinifera]   152   3e-34
ref|XP_002301615.2| dentin sialophosphoprotein [Populus trichoca...   144   5e-32
ref|XP_002529332.1| hypothetical protein RCOM_1016710 [Ricinus c...   142   3e-31
gb|EXB37857.1| hypothetical protein L484_011917 [Morus notabilis]     137   1e-29
ref|XP_007215644.1| hypothetical protein PRUPE_ppa003889mg [Prun...   133   1e-28
ref|XP_003553208.1| PREDICTED: dentin sialophosphoprotein-like [...   128   5e-27
ref|XP_003628559.1| GD3A [Medicago truncatula] gi|355522581|gb|A...   128   5e-27
ref|XP_004509926.1| PREDICTED: dentin sialophosphoprotein-like [...   125   4e-26
ref|XP_006585997.1| PREDICTED: dentin sialophosphoprotein-like i...   115   3e-23
ref|XP_003530674.1| PREDICTED: dentin sialophosphoprotein-like i...   115   3e-23
ref|XP_007153864.1| hypothetical protein PHAVU_003G071200g [Phas...   115   5e-23
gb|EXB63812.1| hypothetical protein L484_021084 [Morus notabilis]     107   7e-21
ref|XP_006447598.1| hypothetical protein CICLE_v10014304mg [Citr...   103   1e-19
ref|XP_004287962.1| PREDICTED: uncharacterized protein LOC101297...   103   1e-19
ref|XP_004243489.1| PREDICTED: uncharacterized protein LOC101260...    99   3e-18
gb|EYU43120.1| hypothetical protein MIMGU_mgv1a003739mg [Mimulus...    91   7e-16
ref|XP_006364172.1| PREDICTED: dentin sialophosphoprotein-like [...    91   7e-16
ref|XP_007049104.1| Uncharacterized protein isoform 2, partial [...
89 5e-15 ref|XP_007049103.1| Uncharacterized protein isoform 1 [Theobroma... 89 5e-15 >ref|XP_002270552.1| PREDICTED: uncharacterized protein LOC100261856 [Vitis vinifera] gi|297739184|emb|CBI28835.3| unnamed protein product [Vitis vinifera] Length = 514 Score = 152 bits (384), Expect = 3e-34 Identities = 110/341 (32%), Positives = 168/341 (49%), Gaps = 22/341 (6%) Frame = +2 Query: 23 ATEGLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLE 202 A+ G + +D LSDL D E+KW S+ EK + +++A +SKS LN+ GVDLD+F E Sbjct: 121 ASRGRSALKDEGSLSDLLDLEIKWTSESEKLGAGVSNEASVRSKSPLNLAGVDLDNFLSE 180 Query: 203 A-KETSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESF 379 A ++T A ++Q K I E++A G E+LS FEN + VR +D +F Sbjct: 181 ARRDTVIKASEEQFAATKEIRSKESNALQGHENLSLFENVHPSETVVR--PAEDKNSAAF 238 Query: 380 SGWESEFQSASSQ---------------TLEISSPLDAASGSYTDIKLINSTDNDAKSKC 514 SGWE+EFQ+A+S+ T+++SS +DA GS DI + +D+ Sbjct: 239 SGWEAEFQNANSESVHEGSKEFDPFVGSTVDLSSHMDAVFGSGKDINSAHVSDDTTP--- 295 Query: 515 QSVPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNL---SSINDD 685 AS ++ WI D + N+ GQ +A D+ NL SS +D Sbjct: 296 -----ASRTNDWIQDDLYKNLNSKVPAHVGQVDSTIQAE-------DAQNLAGPSSTRND 343 Query: 686 WIPEELWTTGITKAS-NSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTN-STTSFSD 859 W ++ W K++ N + ++D+ FD W WK N S+ + S Sbjct: 344 WFQDDQWKNSSAKSTDNKIALGKNDNLFDAWNDFPSSSTSQDPFRSSWKHNNGSSLTPSV 403 Query: 860 ELTPKMNLANSTKSFEEIEFGSV-VQPDLFTGASKLHNGST 979 E T + NL +ST + +E+EFG+ Q DL +GA N S+ Sbjct: 404 EQTSEPNLLSSTSNLQEMEFGNFSQQEDLSSGADNNQNDSS 444 >emb|CAN61787.1| hypothetical protein VITISV_006025 [Vitis vinifera] Length = 633 Score = 152 bits (384), Expect = 3e-34 Identities = 110/341 (32%), Positives = 168/341 (49%), Gaps = 22/341 (6%) Frame = +2 Query: 23 ATEGLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLE 202 A+ G + +D LSDL D E+KW S+ EK + +++A +SKS LN+ GVDLD+F E Sbjct: 163 ASRGRSALKDEGSLSDLLDLEIKWTSESEKLGAGVSNEASVRSKSPLNLAGVDLDNFLSE 222 Query: 203 A-KETSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESF 379 A ++T A ++Q K I 
E++A G E+LS FEN + VR +D +F Sbjct: 223 ARRDTVIKASEEQFAATKEIRSKESNALQGHENLSLFENVHPSETVVR--PAEDKNSAAF 280 Query: 380 SGWESEFQSASSQ---------------TLEISSPLDAASGSYTDIKLINSTDNDAKSKC 514 SGWE+EFQ+A+S+ T+++SS +DA GS DI + +D+ Sbjct: 281 SGWEAEFQNANSESVHEGSKEFDPFVGSTVDLSSHMDAVFGSGKDINSAHVSDDTTP--- 337 Query: 515 QSVPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNL---SSINDD 685 AS ++ WI D + N+ GQ +A D+ NL SS +D Sbjct: 338 -----ASRTNDWIQDDLYKNLNSKVPAHVGQVDSTIQAE-------DAQNLAGPSSTRND 385 Query: 686 WIPEELWTTGITKAS-NSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTN-STTSFSD 859 W ++ W K++ N + ++D+ FD W WK N S+ + S Sbjct: 386 WFQDDQWKNSSAKSTDNKIALGKNDNLFDAWNDFPSSSTSQDPFRSSWKHNNGSSLTPSV 445 Query: 860 ELTPKMNLANSTKSFEEIEFGSV-VQPDLFTGASKLHNGST 979 E T + NL +ST + +E+EFG+ Q DL +GA N S+ Sbjct: 446 EQTSEPNLLSSTSNLQEMEFGNFSQQEDLSSGADNNQNDSS 486 >ref|XP_002301615.2| dentin sialophosphoprotein [Populus trichocarpa] gi|550345520|gb|EEE80888.2| dentin sialophosphoprotein [Populus trichocarpa] Length = 518 Score = 144 bits (364), Expect = 5e-32 Identities = 118/380 (31%), Positives = 180/380 (47%), Gaps = 23/380 (6%) Frame = +2 Query: 2 EAVSGTVATEGLN----VSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNV 169 E V + + LN S++ LSDL D E++WPS+ E++E+ DK Q+ S+LN+ Sbjct: 110 EIVESIIEAKELNRRQSTSQNNFPLSDLLDLEIRWPSESERSETSVTDKTPAQNLSTLNL 169 Query: 170 TGVDLDSFFLEAKETSASAFDDQSIP-KKRILGTETSAFPGQESLSFFENAQAPDFAVRS 346 GVDL++FF E K S A + + K + T +A G +LS FEN Q P + Sbjct: 170 GGVDLNNFFGEPKVDSVPALSQEQLTLNKDMDATGGNAVQGHGNLSLFENVQ-PSETIGG 228 Query: 347 VSTDDAVDESFSGWESEFQSASSQT----------------LEISSPLDAASGSYTDIKL 478 D + D S SGWE+EFQSASS T +++S+ +D+ G DI Sbjct: 229 SDKDVSGDWS-SGWEAEFQSASSGTQHRESKTSDPFVSSSSVDLSAHMDSVFGPAKDI-F 286 Query: 479 INSTDNDAKSKCQSVPLASMSDSW-IPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVD 655 T+ +A S S A D W IP T G + + IN+E + Sbjct: 287 EGKTNENATSSASS---AFKDDLWSIP-----GTGVTGQDELFKLDINDEGGG---KRGT 335 Query: 656 SNNLSSINDDWIPEELW-TTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQ 832 +NN S +N DWI + W TT +K+ ++ I+E+DDSFD W +Q Sbjct: 
336 TNNSSMMNVDWIEDNQWQTTTTSKSDENKTIDENDDSFDAWNDFRGSTSAQVPSNNSLEQ 395 Query: 833 TNSTTSFSDELTPKMNLANSTKSFEEIEFGSVVQPDLFTGASKLHNGSTDSWKQTSSTTT 1012 + S + ++NL + ++++FGS QPD F+G NGS++ + T+ Sbjct: 396 DANHILPSVDQESEINLFGGSSISQDVDFGSFSQPDFFSGTLNNQNGSSEV-NVMQTETS 454 Query: 1013 FSDEHPSKMNLVNSTNSFEE 1072 SD ++N VN + E Sbjct: 455 VSD----RINSVNQDDGNTE 470 >ref|XP_002529332.1| hypothetical protein RCOM_1016710 [Ricinus communis] gi|223531203|gb|EEF33049.1| hypothetical protein RCOM_1016710 [Ricinus communis] Length = 467 Score = 142 bits (357), Expect = 3e-31 Identities = 102/340 (30%), Positives = 172/340 (50%), Gaps = 20/340 (5%) Frame = +2 Query: 23 ATEGLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLE 202 + G N + LSDL + E++WPS+PE+ E+ +K Q LN +G+D+D++F E Sbjct: 119 SNRGQNTPEIHIPLSDLLNLEIRWPSEPEEFETSALEKKPIQ---MLNFSGIDIDNYFTE 175 Query: 203 AKETSAS-AFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESF 379 +K S S + + Q K+ E +AF G E+LS FE+ + + A R S D +SF Sbjct: 176 SKLDSVSTSAEGQFTLKQHEDAAENNAFQGHENLSLFESVEPSETAAR--SKKDESGDSF 233 Query: 380 SGWESEFQSASSQT----------------LEISSPLDAASGSYTDIKLINSTDNDAKSK 511 SGWE++FQS+ ++T +++SS +DA G +++ ++ K+K Sbjct: 234 SGWEADFQSSGAKTQHQKSNFPDPFVGSSSVDLSSHMDALFGPGSNL-------SNEKTK 286 Query: 512 CQSVPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWI 691 ++V AS + W D S + N G ++ QF++ ++N + ++ N SS+N DW+ Sbjct: 287 -ENVTSASNMNDWFERDTSSNANAGVAFQNDQFEV-PVSDNRDGTVGNTGNSSSMNVDWV 344 Query: 692 PEELWTTGITKASNSQKI---NEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDE 862 + W T +S+S+K +E+DDSFDTW K T S E Sbjct: 345 QDNQWQT----SSSSRKATDNDENDDSFDTWNDFTSSSNVQVPSNNSLKGDIHTVP-SVE 399 Query: 863 LTPKMNLANSTKSFEEIEFGSVVQPDLFTGASKLHNGSTD 982 +++ + + ++I+FGS QPD F+ NGS + Sbjct: 400 QGSEISFFSGADNSKDIDFGSFSQPDFFSATFSNQNGSAE 439 >gb|EXB37857.1| hypothetical protein L484_011917 [Morus notabilis] Length = 547 Score = 137 bits (344), Expect = 1e-29 Identities = 111/366 (30%), Positives = 174/366 (47%), Gaps = 38/366 (10%) Frame = +2 Query: 32 
GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLE-AK 208 G +D L LSDL + E+KWPS+ EK S +++ QSKSSLN++GV++++FF + K Sbjct: 120 GRTAPKDELPLSDLLNLEIKWPSELEKFGSRLSNETPVQSKSSLNLSGVNIENFFAQKEK 179 Query: 209 ETSASAFDDQSIPKKRILGTE---------------------------TSAFP---GQES 298 S++ + S+ +I G E +A P G+E+ Sbjct: 180 GASSNVSAEPSMSSNQIDGGEIRQGHEIDDLFESAKLSGTVHENLSLFENAKPSETGREN 239 Query: 299 LSFFENAQAPDFAVRSVSTDDAVDESFSGWESEFQSASSQT-LEISSPLDAASGS----- 460 L+ FENAQ +V S + ++ + S SGW ++FQSA+S T + S+ D GS Sbjct: 240 LTLFENAQPSKTSVSSTES-ESKNLSDSGWGTDFQSAASATPHKDSTSFDPFMGSTDLST 298 Query: 461 YTDIKLINSTDNDAKSKCQSVPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYE 640 + D + D+ K ++V ASM+ W D ++N+G + FK NN E Sbjct: 299 HMDEVFGPAKDSIGKKDEETVGSASMASDWFVDDAQKNSNSGLNSPLEDFKTTANVNN-E 357 Query: 641 EAKVDSNNLSSINDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXX 820 + N SS + DW+ + W + +K K +E++DSFD W Sbjct: 358 NIVGNVNYSSSTDVDWVEDNRWQSN-SKNEPGSKADEENDSFDDWNDFASSTVAQDPSNT 416 Query: 821 XWKQTNSTTSFSDELTPKMNLANSTKSFEEIEFG-SVVQPDLFTGASKLHNGSTDSWKQT 997 WKQ TT S++ T ++NL +S ++I F S +QPDLF+ N ST+ K+ Sbjct: 417 TWKQ---TTMPSNDKTSEINLFSSDDHSQDINFSDSFLQPDLFSRVFSSSNASTEGNKRL 473 Query: 998 SSTTTF 1015 F Sbjct: 474 PEAIVF 479 >ref|XP_007215644.1| hypothetical protein PRUPE_ppa003889mg [Prunus persica] gi|462411794|gb|EMJ16843.1| hypothetical protein PRUPE_ppa003889mg [Prunus persica] Length = 542 Score = 133 bits (335), Expect = 1e-28 Identities = 102/347 (29%), Positives = 157/347 (45%), Gaps = 31/347 (8%) Frame = +2 Query: 32 GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLEAKE 211 G +D L LSDL + E+KW SKPEK E+ +++ Q KS ++ GV+LD+FF E K+ Sbjct: 122 GQTARKDDLSLSDLLNLEIKWTSKPEKVETDFSNETPTQPKSLPDLAGVNLDNFFSEGKK 181 Query: 212 TSASAFDDQSI--PKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSG 385 +A ++ + + G E +AF +E+LS FEN Q + V ST+ +SFSG Sbjct: 182 DAAVNISEEQLFESSTQTTGEEINAFEVRETLSLFENVQPFETVVE--STEGESGDSFSG 239 Query: 386 WESEFQSASSQTL----------------------EISSPLDAASGSYTDIKLINST--- 490 W + FQSA+S+TL + S +D GS D+ T 
Sbjct: 240 WAANFQSAASETLPHASETLPHASENLHQASENIPQESKVIDPFVGSTVDLSAHIDTVFG 299 Query: 491 ----DNDAKSKCQSVPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDS 658 D KS A ++ W D +N+G + QF+ E E + Sbjct: 300 SAVHSTDEKSNHSMTGSAPLTTDWFRGDLLGVSNSGFAGGPEQFETLAEVKGITE---NV 356 Query: 659 NNLSSINDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTN 838 NN + D + + T A +++ +ED+DSFD W KQ+ Sbjct: 357 NNSFPADVDRVQDNQLQTTSNNAPDNKTTDEDEDSFDAWNDFATSNSAPNLVDSSLKQST 416 Query: 839 STTSFSDELTPKMNLANSTKSFEEIEFGSVVQPDLFTGASKLHNGST 979 + T+ D+ T ++L + + ++ FGS+ QPD GA NGST Sbjct: 417 NQTTPVDQ-TSVVDLFGTASNSGDLNFGSLSQPDFSAGAFNSSNGST 462 >ref|XP_003553208.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max] Length = 729 Score = 128 bits (321), Expect = 5e-27 Identities = 105/379 (27%), Positives = 170/379 (44%), Gaps = 15/379 (3%) Frame = +2 Query: 32 GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLEAKE 211 G + S+D + LS+L D E++WPS+ E+ ++ T+D A Q KSSLN+ GVDLDSFF + +E Sbjct: 125 GRSESKDEIPLSELLDLEIRWPSEAERVQTSTSDPAVFQGKSSLNLAGVDLDSFF-DRRE 183 Query: 212 TSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWE 391 + + F+ ++++ G + E+LS F+N QA + V + S ++ SFS WE Sbjct: 184 SDSDMFEQNLASERQVGGASDKSLQASENLSLFQNVQASE--VDAGSVENQSGNSFSSWE 241 Query: 392 SEFQSASSQTL-EISS-------PLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDS 547 + F SASS + E+S LD ASG +N D+ S AS D Sbjct: 242 ASFTSASSGPVHEVSKSVDHSKVELDMASGFSKHSVGVNKNDDFNLS-------ASTEDD 294 Query: 548 WIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKA 727 + D ++N+ G+ + + + + A +S N SS N DW+ ++LW K Sbjct: 295 YFQGDGWSTSNSEVHCQTGKSESTMDISGTKTA--ESANGSSRNLDWMQDDLWQGSDNKT 352 Query: 728 SNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFE 907 +++ ED DSFD W N T P ++NS + + Sbjct: 353 TDTVATAEDKDSFDEW--------------------NDFTGSGSTQDPSSTISNSKTNAQ 392 Query: 908 EIEFGSVVQ-------PDLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSF 1066 G V D + ++K + D W+ ++ TT + + ++ N F Sbjct: 393 TGNVGYSVDFNVTKTLKDANSSSNKDFDWMQDQWQDNNNKTTNTISANEAADSFDAWNDF 452 Query: 1067 
EEIEFGSVVQPDLFTGASN 1123 GS + G SN Sbjct: 453 T----GSANTQHSYFGLSN 467 >ref|XP_003628559.1| GD3A [Medicago truncatula] gi|355522581|gb|AET03035.1| GD3A [Medicago truncatula] Length = 742 Score = 128 bits (321), Expect = 5e-27 Identities = 100/377 (26%), Positives = 171/377 (45%), Gaps = 18/377 (4%) Frame = +2 Query: 2 EAVSGTVATEGLNVSR----DPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNV 169 E V+ TV + +R D + LS+L D E++WPS+ E+ S +D +SSL++ Sbjct: 111 EMVAPTVEEHASSRTRSELNDEIPLSELLDLEIRWPSEAERALSSNSDSEAFPGESSLSL 170 Query: 170 TGVDLDSFFLEAKETSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSV 349 +GVDLDSFF + +E+ + + S ++ + G + F E+LS F+N QA + + SV Sbjct: 171 SGVDLDSFF-DRRESDFNVSEQNSAFERNVGGASDNTFQANENLSLFQNFQASEASGGSV 229 Query: 350 STDDAVDESFSGWESEFQSASSQTL-EISSPLDAASGSYTDIKLINSTDNDAKSKCQSVP 526 +D SFSGWE+ F+SASS + + S+ +D + + + K P Sbjct: 230 --EDQSGGSFSGWEANFKSASSAPVHKESNSVDHSKVELDTVSGYGKDSDGVKENDDFNP 287 Query: 527 LASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELW 706 AS D W D+ ++N+ G+ + + + E+ ++ + + S+ N DW+ ++ W Sbjct: 288 SASGEDDWFQGDEFQTSNSKIDGQPGKSETTTDLYHMEKEEIATGS-STRNLDWMQDDQW 346 Query: 707 TTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLA 886 TK N +E+DDSFD W Q + + + E + +N A Sbjct: 347 QESETKIPNIGATDEEDDSFDAWNDFTGSAGTQDPSGIISSQNMTAQTGNFEFSADLNDA 406 Query: 887 --------NSTKSFEEIEFGSVVQPD---LFTGASKLHNGSTDSWKQ--TSSTTTFSDEH 1027 +S + F+ +E D + + + S DSW S+TT + Sbjct: 407 KTAEDANSSSNRDFDWMENDQRQDNDNRTIDNVGTNEGSYSFDSWNDFTGSATTQYPSHS 466 Query: 1028 PSKMNLVNSTNSFEEIE 1078 S + T FE E Sbjct: 467 VSNSEITGQTGKFEMTE 483 >ref|XP_004509926.1| PREDICTED: dentin sialophosphoprotein-like [Cicer arietinum] Length = 738 Score = 125 bits (313), Expect = 4e-26 Identities = 95/355 (26%), Positives = 168/355 (47%), Gaps = 11/355 (3%) Frame = +2 Query: 47 RDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLEAKETSASA 226 +D + LS+L D E++WPS+ E+ S +D A +SSL++ GVDLD FF + +E+ ++ Sbjct: 130 KDEIPLSELLDLEIRWPSEAERILSSNSDSAALLGESSLDLAGVDLDCFF-DRRESDSNV 188 Query: 227 
FDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWESEFQS 406 ++ S +K + + F E+LS F+N QA + + SV +D +SFSGWE+ F+S Sbjct: 189 SEENSTAEKHVGAASENNFQANENLSLFQNVQASEASGWSV--EDQSGDSFSGWEANFKS 246 Query: 407 ASSQTLEI-SSPLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDSWIPSDQSHSTNT 583 ASS + + S +D + + + K P AS D W D+ ++++ Sbjct: 247 ASSGPVHVESKSVDHSKVELDTVSIYGKDSVGVKKNDDFNPSASSEDDWFQGDRFRTSDS 306 Query: 584 GGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKASNSQKINEDDDS 763 G+ + + +N + A++D N S+ N DW+ ++ W TK + +E+DDS Sbjct: 307 KIDGHSGKSETTMDFHNTKTAEID-NGSSNRNLDWMQDDQWQMSDTKIPDIVAADEEDDS 365 Query: 764 FD--TWRXXXXXXXXXXXXXXXWKQTNS---TTSFSDELTPKMNLANSTKSFEEIEFGSV 928 ++ T QT + + SD T + ++S + F+ +E Sbjct: 366 WNDFTGSVRTQDPSGIISSSKITAQTGNLEFSADLSDMKTAEGANSSSNRDFDWMEDDQQ 425 Query: 929 VQPDLFTGASKLHNGSTDS---WKQ--TSSTTTFSDEHPSKMNLVNSTNSFEEIE 1078 + T + + + DS W S+TT +S S + + T FE+ E Sbjct: 426 QDNNNKTTDNVSTDEAADSFDAWNDFTGSATTQYSSHSVSNSEITDQTGKFEKNE 480 >ref|XP_006585997.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max] gi|571473681|ref|XP_006585998.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max] Length = 615 Score = 115 bits (289), Expect = 3e-23 Identities = 93/354 (26%), Positives = 156/354 (44%), Gaps = 9/354 (2%) Frame = +2 Query: 32 GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLEAKE 211 G + +D + LS+L D E++WPS+ E+ ++ T+D A Q KSSLN+ GVDLDSFF + +E Sbjct: 14 GRSELKDEIPLSELLDLEIRWPSEAERAQTSTSDLAA-QGKSSLNLAGVDLDSFF-DRRE 71 Query: 212 TSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWE 391 + + ++ K++ G +F E+LS F+N QA + A S +SFS WE Sbjct: 72 SDSEVYEQNLASGKQVGGASDKSFQANENLSLFQNVQALEAAAGSAENQSG--DSFSSWE 129 Query: 392 SEFQSASS--------QTLEISSPLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDS 547 + F SASS LD SG D + D+ P AS D Sbjct: 130 TSFMSASSGPVHEMPKSVYHSKVELDMTSGFLKDSVGVKKNDDFN-------PSASTEDD 182 Query: 548 WIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKA 727 + + N+ G+ + + + + A ++ N SS N DW+ ++LW K Sbjct: 183 
YFQGGW-RTFNSEVHDQTGKSESTMDPSGIKTA--ENANGSSRNLDWMQDDLWQGSDNKT 239 Query: 728 SNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFE 907 +++ ED DSFD W + + ST S ++ A + Sbjct: 240 TDTVPTAEDKDSFDEWN--------------DFTGSGSTQDPSSTISNSKTTAQTGNVGY 285 Query: 908 EIEFGSV-VQPDLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSF 1066 ++F D + ++K + D W+ ++ TT + + ++ N+F Sbjct: 286 SVDFNDTKTSQDANSSSNKDFDWMQDQWQDNNNKTTNAISGNEAADAFDAWNNF 339 >ref|XP_003530674.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max] Length = 726 Score = 115 bits (289), Expect = 3e-23 Identities = 93/354 (26%), Positives = 156/354 (44%), Gaps = 9/354 (2%) Frame = +2 Query: 32 GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLEAKE 211 G + +D + LS+L D E++WPS+ E+ ++ T+D A Q KSSLN+ GVDLDSFF + +E Sbjct: 125 GRSELKDEIPLSELLDLEIRWPSEAERAQTSTSDLAA-QGKSSLNLAGVDLDSFF-DRRE 182 Query: 212 TSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWE 391 + + ++ K++ G +F E+LS F+N QA + A S +SFS WE Sbjct: 183 SDSEVYEQNLASGKQVGGASDKSFQANENLSLFQNVQALEAAAGSAENQSG--DSFSSWE 240 Query: 392 SEFQSASS--------QTLEISSPLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDS 547 + F SASS LD SG D + D+ P AS D Sbjct: 241 TSFMSASSGPVHEMPKSVYHSKVELDMTSGFLKDSVGVKKNDDFN-------PSASTEDD 293 Query: 548 WIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKA 727 + + N+ G+ + + + + A ++ N SS N DW+ ++LW K Sbjct: 294 YFQGGW-RTFNSEVHDQTGKSESTMDPSGIKTA--ENANGSSRNLDWMQDDLWQGSDNKT 350 Query: 728 SNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFE 907 +++ ED DSFD W + + ST S ++ A + Sbjct: 351 TDTVPTAEDKDSFDEWN--------------DFTGSGSTQDPSSTISNSKTTAQTGNVGY 396 Query: 908 EIEFGSV-VQPDLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSF 1066 ++F D + ++K + D W+ ++ TT + + ++ N+F Sbjct: 397 SVDFNDTKTSQDANSSSNKDFDWMQDQWQDNNNKTTNAISGNEAADAFDAWNNF 450 >ref|XP_007153864.1| hypothetical protein PHAVU_003G071200g [Phaseolus vulgaris] gi|561027218|gb|ESW25858.1| hypothetical protein PHAVU_003G071200g [Phaseolus vulgaris] Length = 731 Score = 115 bits (287), Expect = 
5e-23 Identities = 82/256 (32%), Positives = 125/256 (48%), Gaps = 8/256 (3%) Frame = +2 Query: 32 GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLEAKE 211 G NVS+D + LS+L D E++WPS+ E + T+D A Q KSSL++ GVDLDS+F KE Sbjct: 125 GRNVSKDEIPLSELLDLEIRWPSESEIAQLSTSDSAAFQGKSSLSLAGVDLDSYF-NQKE 183 Query: 212 TSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWE 391 + + FD ++++ + F E+LS F+N QA + A + S ++ +S S WE Sbjct: 184 SDSDVFDKNLASERQVGTALDNTFKANENLSLFQNVQASELA--AASAENPSVDSLSSWE 241 Query: 392 SEFQSASSQTLEISS--------PLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDS 547 + F SASS + S LD ASG D + DN P AS Sbjct: 242 ASFTSASSGPVHEMSKSVDYSNVDLDTASGFGKDSVGVKENDN-------FNPSASTEHD 294 Query: 548 WIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKA 727 + D ++N+ + G K + + DS N SS N +W+ ++ K Sbjct: 295 YFQGDGWRTSNSISHAEAG--KSESTMDLIGTKTTDSANGSSRNLEWMQDDQLQGSDNKT 352 Query: 728 SNSQKINEDDDSFDTW 775 +++ +ED SFD W Sbjct: 353 TDTVVTSEDRYSFDEW 368 >gb|EXB63812.1| hypothetical protein L484_021084 [Morus notabilis] Length = 424 Score = 107 bits (268), Expect = 7e-21 Identities = 87/283 (30%), Positives = 139/283 (49%), Gaps = 37/283 (13%) Frame = +2 Query: 32 GLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLE-AK 208 G +D L LSDL + E+KWPS+ EK S +++ QSKSSLN++GV++++FF + K Sbjct: 120 GRTAPKDELPLSDLLNLEIKWPSELEKFGSRLSNETPVQSKSSLNLSGVNIENFFAQKEK 179 Query: 209 ETSASAFDDQSIPKKRILGTE---------------------------TSAFP---GQES 298 S++ + S+ +I G E +A P G+E+ Sbjct: 180 GASSNVSAEPSMSSNQIDGGEIRQGHEIDDLFESAKLSGTVHENLSLFENAKPSETGREN 239 Query: 299 LSFFENAQAPDFAVRSVSTDDAVDESFSGWESEFQSASSQT-LEISSPLDAASGS----- 460 L+ FENAQ +V S + ++ + S SGW ++FQSA+S T + S+ D GS Sbjct: 240 LTLFENAQPSKTSVSSTES-ESKNLSDSGWGTDFQSAASATPHKDSTSFDPFMGSTDLST 298 Query: 461 YTDIKLINSTDNDAKSKCQSVPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYE 640 + D + D+ K ++V ASM+ W D ++N+G + FK NN E Sbjct: 299 HMDEVFGPAKDSIGKKDEETVGSASMASDWFVDDAQKNSNSGLNSPLEDFKTTANVNN-E 357 Query: 641 EAKVDSNNLSSINDDWIPEELWTTGITKASNSQKINEDDDSFD 769 + N SS + DW+ + 
W + +K K +E++DSFD Sbjct: 358 NIVGNVNYSSSTDVDWVEDNRWQSN-SKNEPGSKADEENDSFD 399 >ref|XP_006447598.1| hypothetical protein CICLE_v10014304mg [Citrus clementina] gi|568830757|ref|XP_006469654.1| PREDICTED: uncharacterized protein DDB_G0290685-like [Citrus sinensis] gi|557550209|gb|ESR60838.1| hypothetical protein CICLE_v10014304mg [Citrus clementina] Length = 810 Score = 103 bits (258), Expect = 1e-19 Identities = 80/267 (29%), Positives = 126/267 (47%), Gaps = 16/267 (5%) Frame = +2 Query: 23 ATEGLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFLE 202 + G N ++ LSDL D E+ W S+ +K + + S+SS N GV+ D+F E Sbjct: 120 SNRGQNATQKEFPLSDLLDLEITWNSEFDKLGTES-------SQSSFNFAGVNPDNFLAE 172 Query: 203 AKETSASAFD-DQSIPKKRILGTETSAFPGQESLSFFENA-------QAPDFAVRSVSTD 358 K ASA +QS G+ F ++++ FEN Q+ + AVR++ + Sbjct: 173 RKRAGASAVSVEQSQLINNDNGSGNDDFQVRDNVHLFENVVHLFENVQSSETAVRTIEVE 232 Query: 359 DAVDESFSGWESEFQSASSQTL-EISSPLDAASGSYTDI-----KLINSTDNDAKSKCQS 520 ES GWE+ FQSA + T E S +D GS D+ +++ N K K + Sbjct: 233 SGT-ESLGGWEANFQSAGTGTSHEESKSVDPVVGSSVDLSGQMDEVLGYGKNFGKDKEEI 291 Query: 521 VPLASMSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKV--DSNNLSSINDDWIP 694 + S S+ W DQ + + S GQ K N ++ + ++NN SS+ D + Sbjct: 292 ISSGSRSNDWFQDDQFSGSRSSTS---GQSKQVEVTGNEKDGRPMQNANNSSSMGIDGVQ 348 Query: 695 EELWTTGITKASNSQKINEDDDSFDTW 775 + W T K ++ ++E DDSFDTW Sbjct: 349 DGQWNTESKKTQENKTVHELDDSFDTW 375 Score = 57.8 bits (138), Expect = 9e-06 Identities = 44/185 (23%), Positives = 82/185 (44%) Frame = +2 Query: 557 SDQSHSTNTGGSYDYGQFKINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKASNS 736 + S + +TG + D F++N ++ V +++ DW+ + T KA + Sbjct: 612 AQDSTNKHTGRAKD---FEVNTNVKDHGIMDVSNSSF-----DWLQGDQLQTSSNKAPDG 663 Query: 737 QKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFEEIE 916 + +ED DSFD W + N TS S E T ++ +++TK+ + ++ Sbjct: 664 KITDEDPDSFDAWNDFTSSISAQDPSNN--QPVNHVTS-SAEQTSEIK-SSATKNLQNVD 719 Query: 917 FGSVVQPDLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSFEEIEFGSVVQ 1096 FGS ++PD+F GAS NGS + PS N ++ + + + G + Sbjct: 720 
FGSFLEPDIFLGASHNQNGSFE--------VNIMKSEPSVSNRISDVKAEDGVNAGDSAK 771 Query: 1097 PDLFT 1111 D+ + Sbjct: 772 GDILS 776 >ref|XP_004287962.1| PREDICTED: uncharacterized protein LOC101297479 [Fragaria vesca subsp. vesca] Length = 647 Score = 103 bits (258), Expect = 1e-19 Identities = 92/324 (28%), Positives = 150/324 (46%), Gaps = 6/324 (1%) Frame = +2 Query: 188 SFFLEAKETSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAV 367 S F ++ S A + K+I E +AF G E+LS FEN ++ + V+S + Sbjct: 257 SAFHGSENLSLFASEKPFESSKQITTAEGTAFQGNETLSMFENVESSETDVKSTQGESG- 315 Query: 368 DESFSGWESEFQSASSQTL-EISSPLDAASGSYTDIKLINSTDNDA---KSKCQSVPLAS 535 S S W + QSA+S+ L + S LD GS D+ T + +K +S AS Sbjct: 316 -HSISSWPASLQSAASENLPQESKSLDPLVGSIVDLSAHIDTVFGSVGDSTKVKSNHSAS 374 Query: 536 MSDSWIPSDQSHSTNTGGSYDYGQFKINNEANNYEEAKV--DSNNLSSINDDWIPEELWT 709 S+ W D +N+G + GQ + ++ + + NNL S DW+ + W Sbjct: 375 TSNDWFSDDLLSISNSGLA---GQPQPLESLATVKDGIIAENENNLHSTGIDWVEDTQWQ 431 Query: 710 TGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLAN 889 T A +++ +EDDDSF W KQ T+ +DE + + Sbjct: 432 TTSKDARDNKIADEDDDSFGAWNDFTSLSSAQNPSSSS-KQIVDQTTLTDETSMTDLFSI 490 Query: 890 STKSFEEIEFGSVVQPDLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSFE 1069 ++ S + FG+ + FT + N S+ S+KQT +DE S +L ++ + Sbjct: 491 ASNSQADDSFGAW---NDFTSFNSAQNASS-SFKQTVDQMRPADE-TSVTDLFSTATDSQ 545 Query: 1070 EIEFGSVVQPDLFTGASNLHNGST 1141 +++FGS +QPDL GA++ +GST Sbjct: 546 DLDFGSFLQPDLSAGATSSSHGST 569 >ref|XP_004243489.1| PREDICTED: uncharacterized protein LOC101260063 [Solanum lycopersicum] Length = 586 Score = 99.4 bits (246), Expect = 3e-18 Identities = 107/420 (25%), Positives = 165/420 (39%), Gaps = 57/420 (13%) Frame = +2 Query: 29 EGLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFF-LEA 205 +G + D ++LSD D +++WP++ E + + K + SKSS + TG DLD+F Sbjct: 123 KGPSSPHDEVLLSDFLDLKIRWPTELETDNTIMTKKLE-LSKSSYDPTGFDLDNFLSFPK 181 Query: 206 KETSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSG 385 +E +SA +Q++ I E LS FEN ++ + AV S + + D FSG Sbjct: 182 
RENISSAHKEQTVTSDNIGSAANKTVGSHEDLSLFENLRSAEPAVTSSTIQTSDD--FSG 239 Query: 386 WESEFQSASSQTLEIS----SPLDAASGS-----------YT------------------ 466 W+++FQ+A S +S SPL +A GS YT Sbjct: 240 WQADFQAAGSGEQNVSNESISPLSSAIGSGVQHSFAAFDTYTSSTVSSGNHEGSKSTDAL 299 Query: 467 ---DIKLINSTD---------NDAKSKCQSVPLASMSDSWIPSDQSHSTNTGGSYDYGQF 610 DI L D D K K V + ++ W D S N S G+ Sbjct: 300 VGADIDLSAQLDTVFGTTEGPTDGKLK-DVVDVPPAANDWPAVDLWDSANLEASQKAGEI 358 Query: 611 KINNEANNYEEAKVDSNNLSSINDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXXX 790 + N E + +SI DW ++ W T A N D DSFD W Sbjct: 359 LPISRPKNAELQNSSEDPSTSI--DWFQDDTWQTHNAPAPKHDSTNGDLDSFDEW----- 411 Query: 791 XXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFEEIEFGSVVQPDLFTGASKLHN 970 N+ TS + P F +V P+L T + + Sbjct: 412 ---------------NTLTSSAPTKDP---------------FENVPAPELDT--TNGDH 439 Query: 971 GSTDSWKQTSSTTTFSD-----------EHPSKMNLVNSTNSFEEIEFGSVVQPDLFTGA 1117 S D W +++T D ++ + L N +++ E+++FGS Q D F+GA Sbjct: 440 DSFDEWNTFATSTPSKDPFENMLAQSNSDNNNDAELTNFSSNLEDMDFGSFSQSDPFSGA 499 >gb|EYU43120.1| hypothetical protein MIMGU_mgv1a003739mg [Mimulus guttatus] Length = 567 Score = 91.3 bits (225), Expect = 7e-16 Identities = 98/396 (24%), Positives = 173/396 (43%), Gaps = 35/396 (8%) Frame = +2 Query: 62 LSDLFDFELKWPSKPEKNESHTADKAQHQSK-SSLNVTGVDLDSFFLEAKETSASAFDDQ 238 LSD +F+++WP + +K+E +DK ++K SSL++ G D FFL +K + D Sbjct: 126 LSDFLNFKIQWPIESDKDEISFSDKHSEETKRSSLSLPGFAPDKFFLNSK---GNVLSDT 182 Query: 239 SIPKKRILGT-------ETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWESE 397 SI K I + +AF Q+ F+N Q+ + A +S++ + E+FS W+++ Sbjct: 183 SIEKSFIHNQFHTADRKDVAAFEDQD---LFQNVQSSNPA--EISSEHKISEAFSEWDAD 237 Query: 398 FQSASSQTLEI---SSPLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDSWIP---- 556 FQSA + I SS A S + + IK + D + + V L+ DS Sbjct: 238 FQSADFENQHIGDKSSEQFADSSASSGIKFQDPLSLDPSTGLK-VDLSDEIDSVFGPGKD 296 Query: 557 -SDQSHSTNTGGSYDYGQFKINNEANNYEEAK--VD-----------------SNNLSSI 676 +D + N S + + NN ++N + +D S NLSS Sbjct: 297 LNDGELNDNPAVSPAFDDWDWNNLSSNKSDFSGVIDATVSTKNGMEDDYGLEYSKNLSSG 356 Query: 677 
NDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFS 856 +D + + T A N + E + ++ S T++ Sbjct: 357 DDLFQDFQSPTNYSEMAENKNNVEERGLEYSK---------DLSIGDDLFQDFQSPTNYR 407 Query: 857 DELTPKMNLANSTKSFEEIEFGSVVQPDLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSK 1036 D K+++ K+ +E F + D FTG++ S +W + S++ S+ Sbjct: 408 DMAENKIDVEEHNKTMDEDLFE---EWDDFTGSTSSQVASQSAWTGGDYQVSTSEQKSSE 464 Query: 1037 MNLVNSTNSFEEIEFGSVVQPDLFTGASNLHNGSTD 1144 M+ +S N F E++FG QP+LF+ +++ N T+ Sbjct: 465 MDFFSSNNHFVEVDFGGFSQPNLFSASTSNSNVLTE 500 >ref|XP_006364172.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum] Length = 587 Score = 91.3 bits (225), Expect = 7e-16 Identities = 99/411 (24%), Positives = 168/411 (40%), Gaps = 48/411 (11%) Frame = +2 Query: 29 EGLNVSRDPLILSDLFDFELKWPSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFF-LEA 205 +G + D ++LSD D +++WP++ E + + K + SKSS + TG DLD+F Sbjct: 123 KGPSSPHDEVLLSDFLDLKIRWPTELETDNTLMTKKLE-LSKSSYDPTGFDLDNFLSFPK 181 Query: 206 KETSASAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSG 385 +E + A +Q+ I E+LS FEN ++ + AV S + + D FSG Sbjct: 182 RENISIAHKEQTAISDNIGSAANKTVGSHENLSLFENLRSAEPAVTSSTVQTSDD--FSG 239 Query: 386 WESEFQSASS----QTLEISSPLDAASGS-----------YT------------------ 466 W+++FQ+A S + E SSP+ +A GS YT Sbjct: 240 WQADFQAAGSGEQNVSNESSSPISSAVGSGGQHAFAAFDTYTSSTVSSGNQHEGSKSTDA 299 Query: 467 ----DIKL-------INSTDNDAKSKCQSVPLAS-MSDSWIPSDQSHSTNTGGSYDYGQF 610 DI L +T+ + K + V S ++ W D S N S G+ Sbjct: 300 FVGSDIDLSAQLDTVFGTTEGPTEGKLKDVVAVSPAANDWPAVDLWDSANLEASQKAGEI 359 Query: 611 KINNEANNYEEAKVDSN-NLSSINDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXX 787 + ++A++ +N N S + DW ++ W T N D DSFD W Sbjct: 360 L---PISRPKDAELQNNSNDPSTSIDWYQDDTWQTHNAPVPKHDTTNGDHDSFDEWNTLT 416 Query: 788 XXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANST-KSFEEIEFGSVVQPDLFTGASKL 964 + F + PK++ N SF+E + P Sbjct: 417 -------------SSAPTKDPFENVPAPKLDTTNGDHDSFDEWNTFATSAP--------- 454 Query: 965 HNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSFEEIEFGSVVQPDLFTGA 1117 S D ++ + ++++ + L N +++ E+++FGS Q + F+GA Sbjct: 455 
---SKDPFE--NMLVQSNNDNNNNAELTNFSSNLEDMDFGSFSQSNPFSGA 500 >ref|XP_007049104.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508701365|gb|EOX93261.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 826 Score = 88.6 bits (218), Expect = 5e-15 Identities = 98/370 (26%), Positives = 148/370 (40%), Gaps = 4/370 (1%) Frame = +2 Query: 47 RDPLILSDLFDFELKW-PSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFL-EAKETSA 220 R LSDL D E++W ++PE ES + K H LN+ G+DLD FL E K S Sbjct: 95 RQEFRLSDLLDLEIRWNDAEPESFES-SLGKNNH-----LNLAGLDLDDDFLAERKGDSV 148 Query: 221 SAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWESEF 400 S ++P K + + S F +++LS FEN VS S SGW+++F Sbjct: 149 SIPTQGTLPLKEEIDSTGSEFQSRQNLSLFEN---------QVSKSSG---SVSGWQADF 196 Query: 401 QSASSQTLEISSPLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDSWIPSDQSHSTN 580 QSA S+T +A S +D + +S D A S + ++ D +QS S Sbjct: 197 QSADSRT-----DHNAISSQSSDPFVGSSKDLSAHVDMVSGQVNNLFDGKEDDNQSSSK- 250 Query: 581 TGGSYDYGQFKINNEANNYEEAKVDSNNL-SSINDDWIPEELWTTGITKASNSQKINEDD 757 S F+ + ++N+ ++D N+ SS N DW+ + N + ++DD Sbjct: 251 ---SQTNNSFRDDMQSNSTSGVRIDQANISSSANVDWVQGDQGQIIGNNTPNKRTPDDDD 307 Query: 758 DSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFEEIEFGSVVQP 937 DSFD W W QT + KS E Sbjct: 308 DSFDAWNDFKGSASAPDAAKTYWDQT----------------TDGMKSMNE--------- 342 Query: 938 DLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSFEEIEFGSVVQPDLF-TG 1114 K+H+ S W S +T F +H + N S ++ S +F TG Sbjct: 343 -------KVHD-SFSGWGPGSESTAFETQHEVSKSFDNFAGSSADL---STHTDSVFGTG 391 Query: 1115 ASNLHNGSTD 1144 + H + D Sbjct: 392 KDSFHGKAVD 401 Score = 75.1 bits (183), Expect = 5e-11 Identities = 74/287 (25%), Positives = 126/287 (43%), Gaps = 20/287 (6%) Frame = +2 Query: 344 SVSTDDAVDESFSGWESEFQSASSQTLEISSP-LDAASGSYTDIK-----LINSTDNDAK 505 S ST++ + FSGW+++FQSASS SS D GS D+ + S + Sbjct: 505 SSSTEEKSSDPFSGWDTDFQSASSTNHNDSSKSFDPLVGSSIDLSDHMDTVFASGKDFVD 564 Query: 506 SKCQSVPLASMSDSWIPSDQ-SHSTNTGGSYDYGQFKINNEANNYEE--------AKVDS 658 K + S +++W D S+ST+ K+ +A N++ A Sbjct: 565 
GKAKDGSNVSSTNNWFQDDLWSNSTS----------KVTCQAENFDATIDVMDSGAAQSM 614 Query: 659 NNLSSINDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTN 838 +N S+N DW P++ W TG KA + + +++ D+SF W T Sbjct: 615 HNSPSMNVDWFPDDQWLTGNNKAPDRKNVDKSDNSFREWNDFK-------------SSTT 661 Query: 839 STTSFSDELTPKMNLANSTK-SFEEIEFGSVVQPDLFTGASKLHNGSTDSWKQTSSTTTF 1015 +FSD P A K + ++ + S D FT + ++ S+ S+K T Sbjct: 662 MQDAFSD---PSKQAARPDKITIDDNDDLSAAWND-FTSSISANDPSSISFKH-----TV 712 Query: 1016 SDEHP----SKMNLVNSTNSFEEIEFGSVVQPDLFTGASNLHNGSTD 1144 + E P S+++ + ++ + G++ QPDLF + + NGST+ Sbjct: 713 NHEKPSIGTSEIHFFSMDSNSHDNNSGNLSQPDLFPRSFSNQNGSTE 759 >ref|XP_007049103.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508701364|gb|EOX93260.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 864 Score = 88.6 bits (218), Expect = 5e-15 Identities = 98/370 (26%), Positives = 148/370 (40%), Gaps = 4/370 (1%) Frame = +2 Query: 47 RDPLILSDLFDFELKW-PSKPEKNESHTADKAQHQSKSSLNVTGVDLDSFFL-EAKETSA 220 R LSDL D E++W ++PE ES + K H LN+ G+DLD FL E K S Sbjct: 134 RQEFRLSDLLDLEIRWNDAEPESFES-SLGKNNH-----LNLAGLDLDDDFLAERKGDSV 187 Query: 221 SAFDDQSIPKKRILGTETSAFPGQESLSFFENAQAPDFAVRSVSTDDAVDESFSGWESEF 400 S ++P K + + S F +++LS FEN VS S SGW+++F Sbjct: 188 SIPTQGTLPLKEEIDSTGSEFQSRQNLSLFEN---------QVSKSSG---SVSGWQADF 235 Query: 401 QSASSQTLEISSPLDAASGSYTDIKLINSTDNDAKSKCQSVPLASMSDSWIPSDQSHSTN 580 QSA S+T +A S +D + +S D A S + ++ D +QS S Sbjct: 236 QSADSRT-----DHNAISSQSSDPFVGSSKDLSAHVDMVSGQVNNLFDGKEDDNQSSSK- 289 Query: 581 TGGSYDYGQFKINNEANNYEEAKVDSNNL-SSINDDWIPEELWTTGITKASNSQKINEDD 757 S F+ + ++N+ ++D N+ SS N DW+ + N + ++DD Sbjct: 290 ---SQTNNSFRDDMQSNSTSGVRIDQANISSSANVDWVQGDQGQIIGNNTPNKRTPDDDD 346 Query: 758 DSFDTWRXXXXXXXXXXXXXXXWKQTNSTTSFSDELTPKMNLANSTKSFEEIEFGSVVQP 937 DSFD W W QT + KS E Sbjct: 347 DSFDAWNDFKGSASAPDAAKTYWDQT----------------TDGMKSMNE--------- 381 Query: 938 DLFTGASKLHNGSTDSWKQTSSTTTFSDEHPSKMNLVNSTNSFEEIEFGSVVQPDLF-TG 1114 K+H+ S W S +T F +H + N S ++ S +F TG Sbjct: 382 
-------KVHD-SFSGWGPGSESTAFETQHEVSKSFDNFAGSSADL---STHTDSVFGTG 430 Query: 1115 ASNLHNGSTD 1144 + H + D Sbjct: 431 KDSFHGKAVD 440 Score = 75.1 bits (183), Expect = 5e-11 Identities = 74/287 (25%), Positives = 126/287 (43%), Gaps = 20/287 (6%) Frame = +2 Query: 344 SVSTDDAVDESFSGWESEFQSASSQTLEISSP-LDAASGSYTDIK-----LINSTDNDAK 505 S ST++ + FSGW+++FQSASS SS D GS D+ + S + Sbjct: 544 SSSTEEKSSDPFSGWDTDFQSASSTNHNDSSKSFDPLVGSSIDLSDHMDTVFASGKDFVD 603 Query: 506 SKCQSVPLASMSDSWIPSDQ-SHSTNTGGSYDYGQFKINNEANNYEE--------AKVDS 658 K + S +++W D S+ST+ K+ +A N++ A Sbjct: 604 GKAKDGSNVSSTNNWFQDDLWSNSTS----------KVTCQAENFDATIDVMDSGAAQSM 653 Query: 659 NNLSSINDDWIPEELWTTGITKASNSQKINEDDDSFDTWRXXXXXXXXXXXXXXXWKQTN 838 +N S+N DW P++ W TG KA + + +++ D+SF W T Sbjct: 654 HNSPSMNVDWFPDDQWLTGNNKAPDRKNVDKSDNSFREWNDFK-------------SSTT 700 Query: 839 STTSFSDELTPKMNLANSTK-SFEEIEFGSVVQPDLFTGASKLHNGSTDSWKQTSSTTTF 1015 +FSD P A K + ++ + S D FT + ++ S+ S+K T Sbjct: 701 MQDAFSD---PSKQAARPDKITIDDNDDLSAAWND-FTSSISANDPSSISFKH-----TV 751 Query: 1016 SDEHP----SKMNLVNSTNSFEEIEFGSVVQPDLFTGASNLHNGSTD 1144 + E P S+++ + ++ + G++ QPDLF + + NGST+ Sbjct: 752 NHEKPSIGTSEIHFFSMDSNSHDNNSGNLSQPDLFPRSFSNQNGSTE 798