BLASTX nr result
ID: Akebia25_contig00008532
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00008532
         (1893 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_007209905.1| hypothetical protein PRUPE_ppa004205mg [Prun...   332   3e-88
ref|XP_006438810.1| hypothetical protein CICLE_v10031172mg [Citr...   304   8e-80
ref|XP_006483051.1| PREDICTED: uncharacterized protein LOC102614...   304   1e-79
ref|XP_004299321.1| PREDICTED: uncharacterized protein LOC101293...   299   3e-78
ref|XP_006354761.1| PREDICTED: uncharacterized protein LOC102579...   293   2e-76
ref|XP_004241596.1| PREDICTED: uncharacterized protein LOC101258...   290   2e-75
ref|XP_002281450.1| PREDICTED: uncharacterized protein LOC100263...   289   3e-75
ref|XP_007045913.1| Uncharacterized protein isoform 2 [Theobroma...   286   2e-74
ref|XP_007045912.1| Uncharacterized protein isoform 1 [Theobroma...   286   2e-74
ref|XP_007037501.1| Uncharacterized protein isoform 1 [Theobroma...   283   2e-73
ref|XP_002274465.2| PREDICTED: uncharacterized protein LOC100250...   283   2e-73
emb|CAN60165.1| hypothetical protein VITISV_040087 [Vitis vinifera]   283   2e-73
ref|XP_007037503.1| Uncharacterized protein isoform 3 [Theobroma...   280   1e-72
ref|XP_004138186.1| PREDICTED: uncharacterized protein LOC101205...   263   2e-67
ref|XP_003522999.1| PREDICTED: uncharacterized protein LOC100793...   261   8e-67
gb|EXC11036.1| hypothetical protein L484_015256 [Morus notabilis]     258   7e-66
ref|XP_007032692.1| Uncharacterized protein TCM_018715 [Theobrom...   254   7e-65
ref|XP_007138262.1| hypothetical protein PHAVU_009G193800g [Phas...   254   1e-64
ref|XP_004499488.1| PREDICTED: uncharacterized protein LOC101494...
249 3e-63 ref|XP_007224589.1| hypothetical protein PRUPE_ppa1027132mg [Pru... 246 3e-62 >ref|XP_007209905.1| hypothetical protein PRUPE_ppa004205mg [Prunus persica] gi|462405640|gb|EMJ11104.1| hypothetical protein PRUPE_ppa004205mg [Prunus persica] Length = 523 Score = 332 bits (852), Expect = 3e-88 Identities = 205/497 (41%), Positives = 267/497 (53%), Gaps = 51/497 (10%) Frame = +2 Query: 20 QGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVF 199 +GEEK +D +L G + D ETS PG + SW T++ +EEDS E SF + F Sbjct: 57 KGEEKLSDDIFYDLPKGGE-DVETSGPGSFTISSWTTSSTSEEDSLLEAPFHGSFP-DCF 114 Query: 200 EPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNASVDEDNGD--- 370 P+ +T+ Q E IYS LLD+ PRK V +GP+HQA+VP WG +G N S + D + Sbjct: 115 NPERPIRTLAQSEDIYSFLLDHPPRKSVSIGPEHQAEVPLWGAQGNNNNSNNLDTSEAVS 174 Query: 371 --------KLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREML 526 +LMGTC+IPMP+S++ A G G DCSC DE S+ CVRQHI+EARE L Sbjct: 175 NSDLEDEKRLMGTCVIPMPDSDLSADTGCIAGIGRTDCSCEDEDSVRCVRQHILEAREKL 234 Query: 527 RKTLG---------------------QEXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRT 643 KT+G +E +VVFSN AS G+NFWD+LS VFPSRT Sbjct: 235 IKTIGPKRFEELGFSDMGEQVAQRWSEEEEQLFHQVVFSNPASLGKNFWDNLSTVFPSRT 294 Query: 644 KKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSVVESP- 820 KK+I+SYYFNVFML RA QNR DP+N+DSDNDEW+ S SVVESP Sbjct: 295 KKEIVSYYFNVFMLVKRAGQNRYDPINVDSDNDEWQGSNDYGDNQLAVTEDEDSVVESPI 354 Query: 821 -ANQSGFYIEECFHADETYXXXXXXXXXXXXEEYNERI------DNVSD----------- 946 N G+Y +EY+E + DNV+ Sbjct: 355 CQNVPGYY----------------QSWKDNLQEYDEEVVDDTCDDNVNVDMFGGGTKQIL 398 Query: 947 RRVRKFLGDCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQ 1126 R + +CS QL+ K S + D ++QDDSC S+ DA A +Q Sbjct: 399 DRCYGLVDNCSTCPIAQLQDKISWDEKGDQEVQDDSCTSF-------------DAAAASQ 445 Query: 1127 ESHVESDHCQHFQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEE 1306 E+ ++S+ H+ + S DH+YVLE D ++WD GY CPE +DFL TCNMIEE Sbjct: 446 ENQLKSEEGNHWSGGFNGSSNRGDHEYVLEPCDTKIWDAGYMTCPENKVDFLPTCNMIEE 505 Query: 1307 VFGEEAWNWNNKGRDEK 1357 VFG+E+WN+ K RD K Sbjct: 
506 VFGKESWNY--KARDGK 520 >ref|XP_006438810.1| hypothetical protein CICLE_v10031172mg [Citrus clementina] gi|557541006|gb|ESR52050.1| hypothetical protein CICLE_v10031172mg [Citrus clementina] Length = 541 Score = 304 bits (779), Expect = 8e-80 Identities = 194/483 (40%), Positives = 252/483 (52%), Gaps = 48/483 (9%) Frame = +2 Query: 26 EEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEP 205 E + +E+T L DKDFETS P LSWVT+++ EED+ S A S E Sbjct: 65 ENGTVANELTNL---VDKDFETSAP-----LSWVTSSSCEEDAGSGSTTHAPLSLEHIEY 116 Query: 206 DCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGL---KGIVNAS--------- 349 D +T V E YS LLD PRK VP+GP+HQA +P W K I++ Sbjct: 117 DYPRRTFVPFEDSYSSLLDRSPRKQVPLGPNHQAILPSWDRSMGKNILDGKATLRGNNSL 176 Query: 350 --------VDEDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHI 505 VD DN +K MGTCIIPMP+SN FA+N D VG G DC CLDEGSI CV+QH+ Sbjct: 177 DHLGSHNVVDNDNEEKWMGTCIIPMPDSNSFAHNIDQVGRGIMDCDCLDEGSIRCVQQHV 236 Query: 506 VEAREMLRKTLG---------------------QEXXXXXXEVVFSNLASSGRNFWDHLS 622 +EARE L K+LG +E EVV+SN S GRNFW LS Sbjct: 237 MEAREKLLKSLGHEKFVKLGLCDMGEEVSCKWSEEEEQVFHEVVYSNPFSLGRNFWKQLS 296 Query: 623 VVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXX 802 VFPSRTKK+I+SYYFNVF+LR RA QNR D + IDSD+DEW Sbjct: 297 AVFPSRTKKEIVSYYFNVFVLRRRAVQNRSDLLEIDSDDDEWHGGYGGSDEIRISEEDED 356 Query: 803 SVVESPANQSGFYI-EECFHADETYXXXXXXXXXXXXEEYNER---IDNVSDRRVRKFLG 970 S +ESP +Q E+ D+ E D+VSD + K Sbjct: 357 SAIESPVDQENADCGEDSSDEDDDDGGDSDGDVGDGGGEVTGETCGTDHVSDTNIAKSFD 416 Query: 971 DCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDH 1150 + FD V K + G+D +++D+SC S+E QP +DS G +DA Q S V + Sbjct: 417 EGGFDAVVPHMDKIPGDAGDDFNVEDESCTSFEFQPDMSDSCGAIDAEHALQLSGVRT-- 474 Query: 1151 CQHFQCSYGELSGGID---HDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGEE 1321 +H + +G L G D H +L+ DA++WD Y + P K ++ L TCN+IEE+FG+ Sbjct: 475 -EHGKALHGRLDGYNDLVGHMNLLDSCDAKVWDARYLS-PIKGVELLPTCNIIEEIFGQG 532 Query: 1322 AWN 1330 W+ Sbjct: 533 TWD 535 >ref|XP_006483051.1| PREDICTED: uncharacterized protein LOC102614272 
[Citrus sinensis] Length = 541 Score = 304 bits (778), Expect = 1e-79 Identities = 192/473 (40%), Positives = 247/473 (52%), Gaps = 48/473 (10%) Frame = +2 Query: 56 ELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEPDCSTKTIVQP 235 EL DKDFETS P LSWVT+++ EED+ S A S E D +T V Sbjct: 72 ELMNLVDKDFETSAP-----LSWVTSSSCEEDAGSGSTTHAPLSLEHIEYDYPRRTFVPF 126 Query: 236 EGIYSPLLDYHPRKLVPVGPDHQADVPPW----------------GLKGIVNAS----VD 355 E YS LLD PRK VP+GP+HQA +P W G +V+ VD Sbjct: 127 EDSYSSLLDRSPRKQVPLGPNHQAILPSWDRSMGKNILDGKATLRGNNSLVHLGSHNVVD 186 Query: 356 EDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLRKT 535 DN +K MGTCIIPMP+SN FA+N D VG G DC CLDEGSI CV+QH++EARE L K+ Sbjct: 187 NDNEEKWMGTCIIPMPDSNSFAHNIDQVGRGIMDCDCLDEGSIRCVQQHVMEAREKLLKS 246 Query: 536 LG---------------------QEXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRTKKD 652 LG +E EVV+SN S GRNFW LS VFPSRTKK+ Sbjct: 247 LGHEKFVKLGLCDMGEEVSCKWSEEEEQVFHEVVYSNPFSLGRNFWKQLSAVFPSRTKKE 306 Query: 653 IISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSVVESPANQS 832 I+SYYFNVF+LR RA QNR D + IDSD+DEW S +ESP +Q Sbjct: 307 IVSYYFNVFVLRRRAVQNRSDLLEIDSDDDEWHGGYGGSDEIRISEEDEDSAIESPVDQE 366 Query: 833 GFYI-EECFHADETYXXXXXXXXXXXXEEYNER---IDNVSDRRVRKFLGDCSFDTEVQL 1000 E+ D+ E D+VSD + K + FD V Sbjct: 367 NADCGEDSSDEDDDDGGDSDGDVGDGGGEVTGETCGTDHVSDTNIAKSFDEGGFDAVVPH 426 Query: 1001 ESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQCSYGE 1180 K + G+D +++D+SC S+E QP +DS G +DA Q S V + +H + +G Sbjct: 427 MDKIPGDAGDDFNVEDESCTSFEFQPDMSDSCGAIDAAHALQLSGVRT---EHGKALHGR 483 Query: 1181 LSGGID---HDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGEEAWN 1330 L G D H +L+ DA++WD Y + P K ++ L TCN+IEE+FG+ W+ Sbjct: 484 LDGYNDLVGHMNLLDSCDAKVWDARYLS-PIKGVELLPTCNIIEEIFGQGTWD 535 >ref|XP_004299321.1| PREDICTED: uncharacterized protein LOC101293785 [Fragaria vesca subsp. 
vesca] Length = 533 Score = 299 bits (765), Expect = 3e-78 Identities = 186/484 (38%), Positives = 245/484 (50%), Gaps = 39/484 (8%) Frame = +2 Query: 23 GEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFE 202 GEEK D L +D + S PG ++ SW + EDS + + F Sbjct: 64 GEEKHSGDVYASLPT-VGEDIKASAPGSFTNSSWTASTTRGEDSFPQAPCHGFYFPEYFN 122 Query: 203 PDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNAS----------- 349 P+ +T+ E IYS LLD+ PRK +GP+HQA +PPWG G+ N S Sbjct: 123 PERPIRTLAS-EDIYSFLLDHSPRKSASIGPEHQAVIPPWGAHGVNNTSSSSHLDTSQSV 181 Query: 350 VDED--NGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREM 523 VD D N ++MGTC+IPMPNS + VG G DCSC D SI CVRQHI+EARE Sbjct: 182 VDSDLENEKRMMGTCVIPMPNSELSTDCESIVGRGRTDCSCEDRASIRCVRQHILEAREK 241 Query: 524 LRKTLGQEXXXXXX---------------------EVVFSNLASSGRNFWDHLSVVFPSR 640 L K +G E +VVFSN AS +NFWD LS VFP R Sbjct: 242 LIKNIGPERFAELGFCDMGEQVAEKWSDYEEKLFHQVVFSNPASLDKNFWDSLSAVFPLR 301 Query: 641 TKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSVVESP 820 TK +I+SYYFNVFMLR RA QNR DP+N+DSDNDEW S SVV+SP Sbjct: 302 TKMEIVSYYFNVFMLRKRARQNRYDPVNVDSDNDEWEGSTVHGDNEPGVTDDDDSVVDSP 361 Query: 821 ANQSGFYIEECFHAD-ETY--XXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCSFDTE 991 Q+ + + D + Y + Y +SDR + + Sbjct: 362 GYQNDPGFIKSWGGDMQEYDEDVVDDACDNVNVDIYGGSGKQISDRCPGNLVSNGGSSPI 421 Query: 992 VQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQ-- 1165 VQ + + + D ++QDDSC S+E AG +Q++ + S++ H++ Sbjct: 422 VQFQKNIAWDEKGDQEVQDDSCTSFE-------------AGVASQDNQLRSENGDHWEVG 468 Query: 1166 CSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGEEAWNWNNKG 1345 C G G DH+YVLE DA++WD GY C + +DFL TCNMIEEVFG+++WN + K Sbjct: 469 CFNGTSKLG-DHEYVLEPCDAKVWDAGYSTCRKNKVDFLPTCNMIEEVFGKDSWN-SYKA 526 Query: 1346 RDEK 1357 RD K Sbjct: 527 RDGK 530 >ref|XP_006354761.1| PREDICTED: uncharacterized protein LOC102579656 isoform X1 [Solanum tuberosum] gi|565376542|ref|XP_006354762.1| PREDICTED: uncharacterized protein LOC102579656 isoform X2 [Solanum tuberosum] Length = 545 Score = 293 bits (749), Expect = 
2e-76 Identities = 185/495 (37%), Positives = 259/495 (52%), Gaps = 52/495 (10%) Frame = +2 Query: 14 KSQGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHN 193 K+ E++ + ++ + V ++K ETS+ G S+ SW + + ++ED RSE+ + Sbjct: 58 KAFSEKRPDSCDVAAVPVSSEKAIETSIHGSASNSSWTSGSTSKEDIRSEVPFHVLTASK 117 Query: 194 VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNASVDE----- 358 + D S + ++ P +YSPLL+ PRK VP+GPD QA++P WG N S+ E Sbjct: 118 YYNTDPSFRVVIHPMEVYSPLLNNPPRKSVPIGPDFQAELPEWGAYDCKNISMKESTQES 177 Query: 359 ----------------DNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGC 490 D +KL GTCIIPMP + A + + VG G CSC D GS GC Sbjct: 178 PNLPSQALESGFVDHHDEENKLAGTCIIPMPKLELPADHEENVGAGKIGCSCEDAGSFGC 237 Query: 491 VRQHIVEAREMLRKTLGQEXXXXXX---------------------EVVFSNLASSGRNF 607 VR HI+EARE L+ LG+E EVVFSN A+ G+NF Sbjct: 238 VRLHIMEAREKLKAALGEETFVRLGVYDMGEIVAAKWSDEEEELFHEVVFSNPAALGKNF 297 Query: 608 WDHLSVVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWR--SSXXXXXXXX 781 W+HL+V FPSR+K+D++SYYFNVF+LR RA+QNR DP NIDSDNDEW+ Sbjct: 298 WEHLAVEFPSRSKRDLVSYYFNVFILRKRAKQNRFDPSNIDSDNDEWQEIDDDVVATGAQ 357 Query: 782 XXXXXXXSVVESPANQ-----SGFYIEECFHADETYXXXXXXXXXXXXEEYNERIDNVSD 946 S+VESP Q + Y+ E DE E+Y R N Sbjct: 358 MTDEDEDSMVESPIYQNYPGHNEIYVTEKQAYDE-------EAGVATFEDY--RTINFCR 408 Query: 947 RRVRKFLGDCSFDTEVQLESKNSQNGGEDHDIQD-DSCMSYEC-QPHKADSNGPVD-AGA 1117 R+V L D S +L NS G H+IQ D S E P D++ D AGA Sbjct: 409 RKV---LSDASKACPDELIDNNSSCG---HNIQPLDRHHSNEVGNPDVEDNSCTTDAAGA 462 Query: 1118 ETQESHVESDHCQHFQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNM 1297 ++ V++D C+H+ + + G HD+V+E S+ + WD GY +C + ++D L TC+M Sbjct: 463 SSETPQVKTDDCKHWASHFAGVGIGSVHDFVMEPSNGKEWDTGYLSCAKNEVDLLPTCSM 522 Query: 1298 IEEVFGEEAWNWNNK 1342 IEEVFG+EAW+ N+ Sbjct: 523 IEEVFGDEAWSSKNR 537 >ref|XP_004241596.1| PREDICTED: uncharacterized protein LOC101258762 isoform 1 [Solanum lycopersicum] gi|460391983|ref|XP_004241597.1| PREDICTED: uncharacterized protein LOC101258762 isoform 2 [Solanum lycopersicum] Length = 546 Score = 290 bits (742), Expect = 2e-75 
Identities = 186/488 (38%), Positives = 252/488 (51%), Gaps = 53/488 (10%) Frame = +2 Query: 47 EITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEPDCSTKTI 226 ++T + V ++K ETS+ G S+ SW +++ +EED RSE+ + + D + + Sbjct: 69 DVTAVPVSSEKAIETSIHGSASNSSWTSSSTSEEDIRSEVPFHVLTASKYYSSDPPFRVV 128 Query: 227 VQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNASVDE---------------- 358 + P +YSPL + PRK VP+GPD QA++P WG N SV E Sbjct: 129 IHPMEVYSPLFNNPPRKSVPIGPDFQAELPEWGAYDSKNISVKESTQESSNLPSQALESD 188 Query: 359 -----DNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREM 523 D +KL GTCIIPMP A + + VG G CSC D GS GCVR HI+EARE Sbjct: 189 FVDHHDEENKLAGTCIIPMPKLESPADHEENVGAGRIGCSCGDAGSFGCVRLHIMEAREK 248 Query: 524 LRKTLGQEXXXXXX---------------------EVVFSNLASSGRNFWDHLSVVFPSR 640 L+ LG+E EVVFSN A+ G+NFWDHL+V FPSR Sbjct: 249 LKAALGEETFVRLGVYDMGEIVAEKWSEEEEELFHEVVFSNPAALGKNFWDHLAVEFPSR 308 Query: 641 TKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWR--SSXXXXXXXXXXXXXXXSVVE 814 +K+D++SYYFNVF+LR RA+QNR DP NIDSDNDEW+ SVVE Sbjct: 309 SKRDLVSYYFNVFILRKRAKQNRFDPSNIDSDNDEWQEIDDDVVATGAQMTDDDEDSVVE 368 Query: 815 SPANQ-----SGFYIEECFHADETYXXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCS 979 SP Q + Y+ E DE E+Y ++ R RK L D S Sbjct: 369 SPIYQNYPGHNEIYVTEKQAYDE-------EAGVATLEDY----QTINFCR-RKVLSDVS 416 Query: 980 FDTEVQLESKNSQNGGEDHDIQD-DSCMSYECQPHKADSNGPVD--AGAETQESHVESDH 1150 +L NS G H+IQ D S E H + N AGA + V++D Sbjct: 417 KACPDELIDNNSSCG---HNIQPLDRHHSNEVGNHDVEDNSCTTDAAGASSDTPQVKTDD 473 Query: 1151 CQHFQCSYGELSGGIDHDYVLELSDARLWDV-GYQNCPEKDLDFLSTCNMIEEVFGEEAW 1327 C+H+ + + HD+V+E S+ + WD+ GY +CP+ ++D L TC+MIEEVFG+EA Sbjct: 474 CKHWASHFAGVGIDSGHDFVMEPSNGKEWDMGGYLSCPKNEVDLLPTCSMIEEVFGDEA- 532 Query: 1328 NWNNKGRD 1351 W++K RD Sbjct: 533 -WSSKHRD 539 >ref|XP_002281450.1| PREDICTED: uncharacterized protein LOC100263964 [Vitis vinifera] Length = 521 Score = 289 bits (740), Expect = 3e-75 Identities = 192/497 (38%), Positives = 250/497 (50%), Gaps = 48/497 (9%) Frame = +2 Query: 14 KSQGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHN 193 
K++G+EK ++ T+ + A KD ET + GC+S+ SW T++ +E+D+RSE + S Sbjct: 57 KTEGDEKLLSGFCTDFPISA-KDTETFMRGCISTSSWATSSTSEDDARSEAPIDVSLFPE 115 Query: 194 VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNA--------- 346 F D + + Y LLDY PRK VP+G DHQ DVP W +GI+++ Sbjct: 116 YFSSDSPVRASNDSDDYYLSLLDYPPRKSVPIGSDHQVDVPAWS-QGIMDSLDYLETSEQ 174 Query: 347 ------------SVDEDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGC 490 SV + +L+GTC++PMP S F N VG+G DCSC D GS C Sbjct: 175 VIFSPQASGLELSVGNIDEKRLIGTCVMPMPKSEPFC-NDAVVGNGRTDCSCHDRGSYRC 233 Query: 491 VRQHIVEAREMLRKTLGQEXXXXXX---------------------EVVFSNLASSGRNF 607 VRQHI EARE LR TLG+E EVVFSN S G+NF Sbjct: 234 VRQHIAEAREKLRGTLGEERFVKLGFHDMGEEVAEKWNEEEEQLFHEVVFSNPVSLGKNF 293 Query: 608 WDHLSVVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEW-RSSXXXXXXXXX 784 WD+LS+VFPSRT ++I+SYYFNVFMLR RAEQNR DP NIDSDNDEW + Sbjct: 294 WDNLSLVFPSRTTREIVSYYFNVFMLRKRAEQNRYDPENIDSDNDEWPETDDYCNDEHEM 353 Query: 785 XXXXXXSVVESPANQSGFYIEECFHADETYXXXXXXXXXXXXEE---YNERID--NVSDR 949 SVVESP Q C HAD+ E Y +D ++S+ Sbjct: 354 TEEDEDSVVESPIYQEDPSYNPC-HADDKRKYEDIGDGTHGDNENVNYGSGMDILDISES 412 Query: 950 RVRKFLGDCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQE 1129 K L + D+ QL S +G DH I+D SC S + GA++Q Sbjct: 413 CTDKLLNNSGSDSICQL-SDVPWDGKGDHGIKDGSCTS-------------SNTGADSQR 458 Query: 1130 SHVESDHCQHFQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEV 1309 + + +G DH Y LE DA++WD GY C + +D LSTC+MIEEV Sbjct: 459 TQAK--------------AGNGDHWYALEPCDAKVWDAGYVTCSKTKVDLLSTCSMIEEV 504 Query: 1310 FGEEAWNWNNKGRDEKG 1360 FG A KG D +G Sbjct: 505 FG--AGTGTYKGADGQG 519 >ref|XP_007045913.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508709848|gb|EOY01745.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 527 Score = 286 bits (732), Expect = 2e-74 Identities = 177/484 (36%), Positives = 253/484 (52%), Gaps = 37/484 (7%) Frame = +2 Query: 14 KSQGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHN 193 K Q +E F D + +++ DKDFETS P LS VT+ ++EED+ + A S Sbjct: 57 
KYQWDEVFETDALNDVTHFVDKDFETSAP-----LSLVTSPSSEEDTGTGAAAILPVSPE 111 Query: 194 VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWG--------LKGIVNAS 349 F+ D +T E YS LD PR+ V +GP+HQA+VP WG + + S Sbjct: 112 YFDFDLPRRTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVPSWGRHVKKYEFAQSDASDS 171 Query: 350 VDEDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLR 529 D D + +MGTC+IPMP S + A N VG G DCSCLD GS+ CV+QH++EARE LR Sbjct: 172 TDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCLDRGSLRCVQQHVMEARERLR 231 Query: 530 KTLG---------------------QEXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRTK 646 K+LG +E EVV+SN +S G+ FW LSVVFPSR+K Sbjct: 232 KSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNPSSLGKKFWKDLSVVFPSRSK 291 Query: 647 KDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSV-----V 811 ++++SYYFNVF+L+ RA QNR ++IDSD+DEW S ++ Sbjct: 292 RELVSYYFNVFILQRRAVQNRSSMLDIDSDDDEWHGSQQAYEVQDSDEDEDSAIESLADQ 351 Query: 812 ESPANQSGFYIEECFHADETYXXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCSFDTE 991 E AN+ G +++ D+ + ++++ + V K + FD Sbjct: 352 EDLANREGECLQDDDDDDDDDDESDVGDGSCALTREDYGVNHLLEGHVAKSFDESRFDPC 411 Query: 992 VQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQCS 1171 Q +K S GED ++QDDSCMS+E QP+ DS +D A + + V++D+C Sbjct: 412 FQQTNKVS-GIGEDFNVQDDSCMSFEFQPNMVDSLSVIDTKANSHVNGVKTDNCLR---- 466 Query: 1172 YGELSGGID---HDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGEEAWNWNNK 1342 G L G D H Y+ + D ++WD Y P K +D TCN+IEE+FG++ +NK Sbjct: 467 -GRLDGSSDLAHHVYLFDSCDTKIWDTRYPTAPTKGIDLQPTCNIIEEIFGQD--TRDNK 523 Query: 1343 GRDE 1354 R E Sbjct: 524 TRIE 527 >ref|XP_007045912.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508709847|gb|EOY01744.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 526 Score = 286 bits (732), Expect = 2e-74 Identities = 177/484 (36%), Positives = 253/484 (52%), Gaps = 37/484 (7%) Frame = +2 Query: 14 KSQGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHN 193 K Q +E F D + +++ DKDFETS P LS VT+ ++EED+ + A S Sbjct: 56 KYQWDEVFETDALNDVTHFVDKDFETSAP-----LSLVTSPSSEEDTGTGAAAILPVSPE 110 Query: 194 
VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWG--------LKGIVNAS 349 F+ D +T E YS LD PR+ V +GP+HQA+VP WG + + S Sbjct: 111 YFDFDLPRRTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVPSWGRHVKKYEFAQSDASDS 170 Query: 350 VDEDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLR 529 D D + +MGTC+IPMP S + A N VG G DCSCLD GS+ CV+QH++EARE LR Sbjct: 171 TDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCLDRGSLRCVQQHVMEARERLR 230 Query: 530 KTLG---------------------QEXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRTK 646 K+LG +E EVV+SN +S G+ FW LSVVFPSR+K Sbjct: 231 KSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNPSSLGKKFWKDLSVVFPSRSK 290 Query: 647 KDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSV-----V 811 ++++SYYFNVF+L+ RA QNR ++IDSD+DEW S ++ Sbjct: 291 RELVSYYFNVFILQRRAVQNRSSMLDIDSDDDEWHGSQQAYEVQDSDEDEDSAIESLADQ 350 Query: 812 ESPANQSGFYIEECFHADETYXXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCSFDTE 991 E AN+ G +++ D+ + ++++ + V K + FD Sbjct: 351 EDLANREGECLQDDDDDDDDDDESDVGDGSCALTREDYGVNHLLEGHVAKSFDESRFDPC 410 Query: 992 VQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQCS 1171 Q +K S GED ++QDDSCMS+E QP+ DS +D A + + V++D+C Sbjct: 411 FQQTNKVS-GIGEDFNVQDDSCMSFEFQPNMVDSLSVIDTKANSHVNGVKTDNCLR---- 465 Query: 1172 YGELSGGID---HDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGEEAWNWNNK 1342 G L G D H Y+ + D ++WD Y P K +D TCN+IEE+FG++ +NK Sbjct: 466 -GRLDGSSDLAHHVYLFDSCDTKIWDTRYPTAPTKGIDLQPTCNIIEEIFGQD--TRDNK 522 Query: 1343 GRDE 1354 R E Sbjct: 523 TRIE 526 >ref|XP_007037501.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|590668470|ref|XP_007037502.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508774746|gb|EOY22002.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508774747|gb|EOY22003.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 515 Score = 283 bits (724), Expect = 2e-73 Identities = 186/492 (37%), Positives = 248/492 (50%), Gaps = 55/492 (11%) Frame = +2 Query: 8 VGKSQGEEKFIN----------DEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSR 157 + + GE++FIN + I G +DFE +VP C++ S T EEDS Sbjct: 44 
ISNASGEDRFINANTECDEKLANAIDTKHPGNAEDFEANVPSCIAISSLGTCCTGEEDSW 103 Query: 158 SEIAVWASFSHNVFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGI 337 E + F P+ +T + + IYS LL+ PRK V GP++QAD+P W + Sbjct: 104 PEEPLHIPSFAECFHPERQVRTSARWDDIYSILLECPPRKQVLAGPNYQADIPEWDSQVA 163 Query: 338 VNASVDEDNGD--------KLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCV 493 N S D D + KLMGTCIIPMP AY+ D VG G DCSC D+ S+ CV Sbjct: 164 RNTSNDTDASETAADRYENKLMGTCIIPMPAFECSAYD-DKVGSGRSDCSCEDKDSVRCV 222 Query: 494 RQHIVEAREMLRKTLG---------------------QEXXXXXXEVVFSNLASSGRNFW 610 RQHI+EARE LRK+LG +E +VVFSN AS GRNFW Sbjct: 223 RQHIMEAREELRKSLGHEKFVELGFCDMGELVTMKWSEEEEQLFHKVVFSNPASLGRNFW 282 Query: 611 DHLSVVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWR-SSXXXXXXXXXX 787 D L V+P RTK+DI+SYYFNVFMLR R+EQNR + M+IDSDNDEW+ + Sbjct: 283 DSLVSVYPYRTKEDIVSYYFNVFMLRKRSEQNRCESMSIDSDNDEWQGTDDSGNNEVGFS 342 Query: 788 XXXXXSVVESP----------ANQSGFYIEECFHADETYXXXXXXXXXXXXEEYNERID- 934 SV+ESP + ++G + + ADET ++ R D Sbjct: 343 DEDEDSVIESPICQEDFDNHRSQEAGLCVFDEDIADETCDNHSI--------DFGSRGDA 394 Query: 935 -NVSDRRVRKFLGDCSFDTEVQLES---KNSQNGGEDHDIQDDSCMSYECQPHKADSNGP 1102 VS+ K C D QL K++Q E+ ++QD SC S Sbjct: 395 TKVSETYSEKLFSSCGSDPTAQLHGKTLKDTQGEQEEREVQDYSCTS------------- 441 Query: 1103 VDAGAETQESHVESDHCQHFQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFL 1282 D GA + E+ V +D+ +Q + L+ G H YVLE D ++WD GY C + +DFL Sbjct: 442 SDTGAASHETPVNADNADQWQGNLNGLNNGGSHGYVLEPCDTKVWDAGYPTCQKNKIDFL 501 Query: 1283 STCNMIEEVFGE 1318 TC+MIEEVFG+ Sbjct: 502 PTCSMIEEVFGD 513 >ref|XP_002274465.2| PREDICTED: uncharacterized protein LOC100250913 [Vitis vinifera] Length = 550 Score = 283 bits (724), Expect = 2e-73 Identities = 186/494 (37%), Positives = 243/494 (49%), Gaps = 61/494 (12%) Frame = +2 Query: 59 LSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEPDCSTKTIVQPE 238 +SV DK FE S P LS N ++EED RS A ++S S FE +T+ Q E Sbjct: 67 VSVLDDKGFEISAP-----LS--CNGSSEEDGRSVAAAYSSLSPEYFESYLPRRTVAQFE 119 Query: 239 GIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNA---------------------SVD 355 IYS LLD PR+ 
VPVGPDHQA+VP W L+ + N +VD Sbjct: 120 DIYSSLLDCSPRRQVPVGPDHQANVPVWSLQKVKNRLDKLETSNRYISSSQSMVSDQTVD 179 Query: 356 EDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLRKT 535 +N ++ MGTC+IPMP N+ A N G G DC CLD SI CVRQH++EARE LRKT Sbjct: 180 GENEERWMGTCVIPMPEENLSAENGVKTGDGRTDCGCLDNDSIRCVRQHVMEAREKLRKT 239 Query: 536 LGQ---------------------EXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRTKKD 652 LGQ E EVVFS+ AS G+NFW+HLS F R K++ Sbjct: 240 LGQEKFMELGFCDMGEEVALKWHEEEEQAFHEVVFSHPASLGQNFWEHLSATFSYRAKQE 299 Query: 653 IISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSVVESPANQS 832 ++SYYFNVFMLR RA QNR + + IDSD+DEW + S +ES ++Q Sbjct: 300 LVSYYFNVFMLRQRAAQNRSNFLYIDSDDDEWHGNNRSLNEVGTAEEEDDSGIESLSDQH 359 Query: 833 GFYIEECFHADETYXXXXXXXXXXXXEEYNERIDN-----------------VSDRRVR- 958 +H +E + +E ++ D+ D V Sbjct: 360 ----NHAYHEEEPHEEDDDDDDDDDDDEEDDDKDDSDFDGDGGFGDDKQGATKEDGMVHN 415 Query: 959 -KFLGDCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESH 1135 K L FD + K + GED +QDDSCMS+ECQP+ A+ P D A QES Sbjct: 416 GKLLDYNMFDPVARNMDKVPDSNGEDFSVQDDSCMSFECQPNVANPCAPSDPEASVQESG 475 Query: 1136 VESDHCQHFQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFG 1315 + F S +D Y+LE S+ ++WD Y +D L TCNMIEE+FG Sbjct: 476 ARITQQKSFHGDDDGSSTRVDPGYLLEPSETKVWDGRYWTGSINGVDLLPTCNMIEEIFG 535 Query: 1316 EEAWNWNNKGRDEK 1357 N+K +D+K Sbjct: 536 --LGTPNSKTKDDK 547 >emb|CAN60165.1| hypothetical protein VITISV_040087 [Vitis vinifera] Length = 605 Score = 283 bits (724), Expect = 2e-73 Identities = 186/494 (37%), Positives = 243/494 (49%), Gaps = 61/494 (12%) Frame = +2 Query: 59 LSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEPDCSTKTIVQPE 238 +SV DK FE S P LS N ++EED RS A ++S S FE +T+ Q E Sbjct: 122 VSVLDDKGFEISAP-----LS--CNGSSEEDGRSVAAAYSSLSPEYFESYLPRRTVAQFE 174 Query: 239 GIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNA---------------------SVD 355 IYS LLD PR+ VPVGPDHQA+VP W L+ + N +VD Sbjct: 175 DIYSSLLDCSPRRQVPVGPDHQANVPVWSLQKVKNRLDKLETSNRYISSSQSMVSDQTVD 234 Query: 356 EDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLRKT 535 
+N ++ MGTC+IPMP N+ A N G G DC CLD SI CVRQH++EARE LRKT Sbjct: 235 GENEERWMGTCVIPMPEENLSAENGVKTGDGRTDCGCLDNDSIRCVRQHVMEAREKLRKT 294 Query: 536 LGQEXXXXXX---------------------EVVFSNLASSGRNFWDHLSVVFPSRTKKD 652 LGQE EVVFS+ AS G+NFW+HLS F R K++ Sbjct: 295 LGQEKFMELGFCDMGEEVALKWHEEEEQAFHEVVFSHPASLGQNFWEHLSATFSYRAKQE 354 Query: 653 IISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSVVESPANQS 832 ++SYYFNVFMLR RA QNR + + IDSD+DEW + S +ES ++Q Sbjct: 355 LVSYYFNVFMLRQRAAQNRSNFLYIDSDDDEWHGNNRSLNEVGTAEEEDDSGIESLSDQH 414 Query: 833 GFYIEECFHADETYXXXXXXXXXXXXEEYNERIDNVS-----------------DRRVR- 958 +H +E + +E ++ D+ D V Sbjct: 415 N----HAYHEEEPHEEDDDDDDDDDDDEEDDDKDDSDFDGDGGFGDDKLGATKEDGMVHN 470 Query: 959 -KFLGDCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESH 1135 K L FD + K + GED +QDDSCMS+ECQP+ A+ P D A QES Sbjct: 471 GKLLDYNMFDPVARNMDKVPDSNGEDFSVQDDSCMSFECQPNVANPCAPSDPEASVQESG 530 Query: 1136 VESDHCQHFQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFG 1315 + F S +D Y+LE S+ ++WD Y +D L TCNMIEE+FG Sbjct: 531 ARITQQKSFHGDDDGSSTRVDPGYLLEPSETKVWDGRYWTGSINGVDLLPTCNMIEEIFG 590 Query: 1316 EEAWNWNNKGRDEK 1357 N+K +D+K Sbjct: 591 --LGTPNSKTKDDK 602 >ref|XP_007037503.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508774748|gb|EOY22004.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 490 Score = 280 bits (717), Expect = 1e-72 Identities = 181/471 (38%), Positives = 241/471 (51%), Gaps = 45/471 (9%) Frame = +2 Query: 41 NDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEPDCSTK 220 +D + + G +DFE +VP C++ S T EEDS E + F P+ + Sbjct: 40 DDSLISNASGNAEDFEANVPSCIAISSLGTCCTGEEDSWPEEPLHIPSFAECFHPERQVR 99 Query: 221 TIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNASVDEDNGD--------KL 376 T + + IYS LL+ PRK V GP++QAD+P W + N S D D + KL Sbjct: 100 TSARWDDIYSILLECPPRKQVLAGPNYQADIPEWDSQVARNTSNDTDASETAADRYENKL 159 Query: 377 MGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLRKTLG----- 541 MGTCIIPMP AY+ D VG G DCSC D+ S+ CVRQHI+EARE LRK+LG Sbjct: 160 
MGTCIIPMPAFECSAYD-DKVGSGRSDCSCEDKDSVRCVRQHIMEAREELRKSLGHEKFV 218 Query: 542 ----------------QEXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRTKKDIISYYFN 673 +E +VVFSN AS GRNFWD L V+P RTK+DI+SYYFN Sbjct: 219 ELGFCDMGELVTMKWSEEEEQLFHKVVFSNPASLGRNFWDSLVSVYPYRTKEDIVSYYFN 278 Query: 674 VFMLRIRAEQNRLDPMNIDSDNDEWR-SSXXXXXXXXXXXXXXXSVVESP---------- 820 VFMLR R+EQNR + M+IDSDNDEW+ + SV+ESP Sbjct: 279 VFMLRKRSEQNRCESMSIDSDNDEWQGTDDSGNNEVGFSDEDEDSVIESPICQEDFDNHR 338 Query: 821 ANQSGFYIEECFHADETYXXXXXXXXXXXXEEYNERID--NVSDRRVRKFLGDCSFDTEV 994 + ++G + + ADET ++ R D VS+ K C D Sbjct: 339 SQEAGLCVFDEDIADETCDNHSI--------DFGSRGDATKVSETYSEKLFSSCGSDPTA 390 Query: 995 QLES---KNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQ 1165 QL K++Q E+ ++QD SC S D GA + E+ V +D+ +Q Sbjct: 391 QLHGKTLKDTQGEQEEREVQDYSCTS-------------SDTGAASHETPVNADNADQWQ 437 Query: 1166 CSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGE 1318 + L+ G H YVLE D ++WD GY C + +DFL TC+MIEEVFG+ Sbjct: 438 GNLNGLNNGGSHGYVLEPCDTKVWDAGYPTCQKNKIDFLPTCSMIEEVFGD 488 >ref|XP_004138186.1| PREDICTED: uncharacterized protein LOC101205795 [Cucumis sativus] gi|449477160|ref|XP_004154947.1| PREDICTED: uncharacterized LOC101205795 [Cucumis sativus] Length = 520 Score = 263 bits (673), Expect = 2e-67 Identities = 179/474 (37%), Positives = 242/474 (51%), Gaps = 44/474 (9%) Frame = +2 Query: 68 GADKDFETSVPGCLSSLSWVTNNAN-EEDSRSEIAVWASFSHNVFEP-DCSTKTIVQPEG 241 G+ DF+TSVP CLS S NN EE S S+ S S + F P + + + E Sbjct: 74 GSSDDFDTSVPHCLSFSSGTNNNKTLEEGSPSKSPPHYSISSDFFNPVNHQRRILTYCEE 133 Query: 242 IYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIV-------NASVDEDNGD----KLMGTC 388 IYS LLD+ P+K V +GP+HQA VPPW + + + S GD +L GTC Sbjct: 134 IYSLLLDHAPQKSVSIGPEHQAIVPPWRPREVDVILHAPGSDSKSNFTGDEYEKRLTGTC 193 Query: 389 IIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREMLRKTLG--------- 541 +IPMP+ + + VG G CSC D GS+GCV HI EARE L+ ++G Sbjct: 194 VIPMPDVDSSISSGQEVGSGRAACSCEDCGSVGCVSTHIAEAREQLKSSIGPDRFADLGF 253 Query: 542 ------------QEXXXXXXEVVFSNLASSGRNFWDHLSVVFPSRTKKDIISYYFNVFML 685 +E EVVFSN S 
G+NFW LSVVF S++K++I+SYYFNVFML Sbjct: 254 SEMGEQLAQKWSEEEERLFYEVVFSNPVSMGKNFWSDLSVVFASKSKREIVSYYFNVFML 313 Query: 686 RIRAEQNRLDPMNIDSDNDEW-RSSXXXXXXXXXXXXXXXSVVESPANQSGFYIEECFHA 862 R RAEQNR D +NIDSDNDEW + SVVESP + G CF Sbjct: 314 RRRAEQNRCDSLNIDSDNDEWPGTDDYGDNEPGMTEEDDDSVVESPLHDIG----SCFDR 369 Query: 863 DETYXXXXXXXXXXXXEEY-----NERIDNVSDRRVRKFLGDCSFDTEVQLESKNSQNGG 1027 +EY +ER D+ + +C +Q + + + GG Sbjct: 370 SR----------EDELQEYDEDIADERFDDDESGGIGNCFNNCGSSPTLQEKIPHDERGG 419 Query: 1028 EDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQCSY-GELSG-GIDH 1201 DH++QDDSC S + P TQ +++HC + S+ G +G G+ H Sbjct: 420 -DHEVQDDSCTSSDTCP-------------ATQVLPAKTEHCDQWLSSFTGPNNGVGLGH 465 Query: 1202 D--YVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGEEAWNWNNKGRDEK 1357 + V E DA++WDVGY C + ++DFL T +MIEEVFG+++ N+ K RD K Sbjct: 466 EPSSVQEHCDAKVWDVGYLTCSKSEVDFLPTSSMIEEVFGDDSSNY--KARDGK 517 >ref|XP_003522999.1| PREDICTED: uncharacterized protein LOC100793553 [Glycine max] Length = 522 Score = 261 bits (667), Expect = 8e-67 Identities = 176/473 (37%), Positives = 234/473 (49%), Gaps = 39/473 (8%) Frame = +2 Query: 17 SQGEEKFINDEITELSVGADKDFETSVPGC-LSSLSWVTNNANEEDSRSEIAVWASFSHN 193 S+G EK + EL GA ETS P + + SW T+ E D E + S Sbjct: 59 SEGIEKLGGESFGELPTGAGNS-ETSFPVIDIPASSWATSGTIE-DLHLEPPLHLSLFPE 116 Query: 194 VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVN---------- 343 F P+ +T+ + E IYS LL++ PRK V VG DHQADVP W + G N Sbjct: 117 YFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGSDHQADVPAWDILGATNRPNASDAVSV 176 Query: 344 -----ASVDEDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIV 508 +DE +LMGTC+IPMP + + N D VG DCSC D+GS+ CVRQHI Sbjct: 177 SDFTVGHIDETE-KRLMGTCVIPMPQMELSS-NDDEVGKASTDCSCEDQGSMRCVRQHIA 234 Query: 509 EAREMLRKTLG---------------------QEXXXXXXEVVFSNLASSGRNFWDHLSV 625 E RE KT G E EVVF+N S +NFW++LS+ Sbjct: 235 EEREKHIKTFGVEKFTELGFTNMGEQVAENWSAEDEQLFHEVVFNNPVSLDKNFWNYLSI 294 Query: 626 VFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXS 805 FPSRTKK+I+SYYFNVFML+ RAEQNR D ++IDSDNDEW+ S S Sbjct: 295 
AFPSRTKKEIVSYYFNVFMLQRRAEQNRNDLLSIDSDNDEWQGS--EGNDIATREEDEDS 352 Query: 806 VVESPANQSGFYIEECFHAD-ETYXXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCSF 982 V ESP + +C + D + Y E + N+ D D + Sbjct: 353 VAESPVCHDETCMADCHNNDLQAYNEYAADETCAANETVDFTNKNIDD--------DSQY 404 Query: 983 D-TEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQH 1159 D E+ S + +D I DSC K DS D G +QE+ V +++ H Sbjct: 405 DPIEMHHSSGSPLIQPQDQPIWQDSCDG----KVKEDSCTSSDVGVASQETKVNTENGDH 460 Query: 1160 FQCSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGE 1318 + +Y ++ G YVLE DA++WD G+ +C + +DF+ TCNMIEEVFG+ Sbjct: 461 WCGNYNGVNNGYSQGYVLEHCDAKVWDSGFVSCSKNKIDFVPTCNMIEEVFGD 513 >gb|EXC11036.1| hypothetical protein L484_015256 [Morus notabilis] Length = 608 Score = 258 bits (659), Expect = 7e-66 Identities = 177/492 (35%), Positives = 235/492 (47%), Gaps = 57/492 (11%) Frame = +2 Query: 26 EEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHNVFEP 205 ++K D +T+ G + D ++S PG S SW T++ EEDS SE S + Sbjct: 140 DKKLSGDNLTDPPKGGE-DIDSSAPGSFSFSSWPTSSTGEEDSLSEPPFLMSVFPEYYSL 198 Query: 206 DCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNAS------------ 349 + +T+ E IYS LL++ P K +P+GP+HQADVP W + N S Sbjct: 199 EHPVRTLAHCEDIYSLLLNHPPHKTIPIGPNHQADVPSWDQQCARNISSLSCPSEEVSKS 258 Query: 350 -VDEDNGDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVEAREML 526 V+E+ +LMGTCI+P+P+ + AY VG G DC C ++GS GCV +HIV+ARE L Sbjct: 259 EVEEEK--RLMGTCILPLPDLDSPAYPDLKVGKGRTDCDCEEKGSFGCVGKHIVKAREEL 316 Query: 527 RKTLGQEXXXXXX---------------------EVVFSNLASSGRNFWDHLSVVFPSRT 643 KT G E ++VF + AS G NFWD LS F SRT Sbjct: 317 LKTFGAEKFMELGFGDMGEQVAQSWSVEEEQTFHQIVFCHPASLGWNFWDKLSAAFLSRT 376 Query: 644 KKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSVVESPA 823 KK+I+SYYFNVFMLR RAEQNR +P NIDSDNDEW SV++SP Sbjct: 377 KKEIVSYYFNVFMLRKRAEQNRHNPTNIDSDNDEWEGG------------DDDSVIDSPV 424 Query: 824 NQSGFYIEECFHADETYXXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCSFDTEVQLE 1003 ++ D+T E D++ D V L ++ Sbjct: 425 SE-----------DDTGNMQSRGANLLECNEDVAVTDDICDDNVNLDLNAPKSPEISEMR 473 Query: 1004 
SKNSQNGGE----------------DHDIQDDSCMSYECQPHKADSNGPVDAGAETQES- 1132 +N N G D ++QDDSC S E + + G S Sbjct: 474 PENLSNYGSSPKFLPQDTTADDEKGDQEVQDDSCTSSETGIAALRNQMESENGIYCPSSF 533 Query: 1133 -----HVESDHCQHFQCSY-GELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCN 1294 HV+S++ H C LSGG D YVLE +A+ DVGY C + +DFL TCN Sbjct: 534 IGLSNHVKSENGNHCLCRLSSSLSGGGDIGYVLEHCEAKDCDVGYMTCSKDKVDFLPTCN 593 Query: 1295 MIEEVFGEEAWN 1330 MIEEVFG+E N Sbjct: 594 MIEEVFGQEIRN 605 >ref|XP_007032692.1| Uncharacterized protein TCM_018715 [Theobroma cacao] gi|508711721|gb|EOY03618.1| Uncharacterized protein TCM_018715 [Theobroma cacao] Length = 481 Score = 254 bits (650), Expect = 7e-65 Identities = 157/420 (37%), Positives = 215/420 (51%), Gaps = 58/420 (13%) Frame = +2 Query: 14 KSQGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWASFSHN 193 K Q E +F D ++ GA+K++ETS + WV +N + D+ SE+AV Sbjct: 57 KCQDEGRFDEDPCNKVLSGANKEYETSASCSVPHFWWVNSNGIDADTESEVAVHLPLFPE 116 Query: 194 VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNASV------- 352 F + + + IYS +L PRKLV +GP+HQA++P W +G+ ++S Sbjct: 117 YFASGHQIRAFLHADEIYSSILS--PRKLVSIGPEHQANIPEWRQQGLKSSSDCPDTSDP 174 Query: 353 --------------DEDNGDKLMGTCIIPMPNSNVFA-YNRDGVGHGFCDCSCLDEGSIG 487 D+D+ K+MGTC+IPMP+S A + + VGH DC CLD+GSI Sbjct: 175 QVPLKSSCASLMVDDDDDQKKMMGTCVIPMPDSETTAKFCCEDVGHRI-DCECLDQGSIR 233 Query: 488 CVRQHIVEAREMLRKTLG---------------------QEXXXXXXEVVFSNLASSGRN 604 C+RQH+ EARE LRK LG +E VV +N S G+N Sbjct: 234 CIRQHVTEARENLRKNLGPELFGELGFCDTGEELAKRWPEEEELAFQNVVLTNPVSLGKN 293 Query: 605 FWDHLSVVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXX 784 FWDHL VFPS +K+D++SYYFNVFMLR RAEQNR+DP+NIDSD+DEW+++ Sbjct: 294 FWDHLPAVFPSHSKRDLVSYYFNVFMLRKRAEQNRVDPVNIDSDDDEWQTA-----ECGI 348 Query: 785 XXXXXXSVVESPANQSGF------YIEECFH---------ADETYXXXXXXXXXXXXEEY 919 SVVESP++Q ++E+C D + EE Sbjct: 349 PAEDDDSVVESPSDQGTSAHFEHNHVEDCHEYIEDDDEDGVDSSGNVVADICRAATDEED 408 Query: 920 NERIDNVSDRRVRKFLGDCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNG 1099 ID +S V F+G+ + QL SK N +D+DIQDDSC SYE Q K D G 
Sbjct: 409 EGDIDEISGPHVENFIGNYD-SCDFQLSSKVQGNNEDDYDIQDDSCTSYEYQREKVDCCG 467 >ref|XP_007138262.1| hypothetical protein PHAVU_009G193800g [Phaseolus vulgaris] gi|561011349|gb|ESW10256.1| hypothetical protein PHAVU_009G193800g [Phaseolus vulgaris] Length = 522 Score = 254 bits (648), Expect = 1e-64 Identities = 173/471 (36%), Positives = 235/471 (49%), Gaps = 37/471 (7%) Frame = +2 Query: 17 SQGEEKFINDEITELSVGADKDFETSVPGC-LSSLSWVTNNANEEDSRSEIAVWASFSHN 193 S+G EK ++ + + A ETS P + + SW T + E D E + S Sbjct: 59 SEGIEKLESESFGDPPIEAGNS-ETSFPVIDIPASSWATCSTTE-DLHLEPPLHLSLFPE 116 Query: 194 VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNAS-------- 349 F P+ +T+ + E IYS LL++ PRK V VG +HQADVP G N S Sbjct: 117 YFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGANHQADVPALDCLGATNKSNVSASDSD 176 Query: 350 VDEDNGD------KLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQHIVE 511 D GD KL+GTC+IP+P + + + D VG G +C+C D+GS+ CVRQHI E Sbjct: 177 TDFTVGDRDETEKKLLGTCVIPLPQMELSSCD-DEVGKGRTECNCEDQGSMRCVRQHIAE 235 Query: 512 AREMLRKTLGQEXXXXXX---------------------EVVFSNLASSGRNFWDHLSVV 628 R+ L KT G E EVVF+N AS +NFW++LS+ Sbjct: 236 ERDKLLKTFGPEKFTELGFTNMGEQVAEKWSVEDEQLFHEVVFNNPASLDKNFWNYLSIA 295 Query: 629 FPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXXXXSV 808 FPSRTKK+I+SYYFNVFMLR RAEQNR D +NIDSDNDEW+ S SV Sbjct: 296 FPSRTKKEIVSYYFNVFMLRRRAEQNRNDLLNIDSDNDEWQGS--DSNDIATREEDEDSV 353 Query: 809 VESPANQSGFYIEECFHAD-ETYXXXXXXXXXXXXEEYNERIDNVSDRRVRKFLGDCSFD 985 ESP Q + +C D +TY E + N+ D G Sbjct: 354 AESPVCQDESCMADCHDNDLQTYDEYAADETCAANETVDFTSRNIDD-------GSKYDP 406 Query: 986 TEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAGAETQESHVESDHCQHFQ 1165 E+ + D + DSC + K DS D G +Q++ V +++ H+ Sbjct: 407 VELHHSGRCPLIQPPDQPVWQDSC----DEKVKDDSCTSSDTGVASQQTKVNTENGDHWC 462 Query: 1166 CSYGELSGGIDHDYVLELSDARLWDVGYQNCPEKDLDFLSTCNMIEEVFGE 1318 +Y +S G + YVLE DA++WD G+ +C + +DFL TCNMIEEVFG+ Sbjct: 463 GNYNGVSNGYNQGYVLEPCDAKVWDSGFVSCSKNKMDFLPTCNMIEEVFGD 513 >ref|XP_004499488.1| PREDICTED: uncharacterized protein LOC101494171 isoform X1 [Cicer 
arietinum] gi|502126914|ref|XP_004499489.1| PREDICTED: uncharacterized protein LOC101494171 isoform X2 [Cicer arietinum] Length = 533 Score = 249 bits (636), Expect = 3e-63 Identities = 185/495 (37%), Positives = 243/495 (49%), Gaps = 59/495 (11%) Frame = +2 Query: 11 GKSQGEEKFINDEITELSVGADKDFETSVPGCLSSLSWVTNNANEEDSRSEIAVWAS--- 181 G + EK + EL GA D E S P W T++A E D RSE + S Sbjct: 57 GSCECNEKLAGEICDELPKGAG-DSEASFPVVGIPAPWATSSATE-DLRSEQPIHLSLFP 114 Query: 182 --FSHN---VFEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLK----- 331 FS F P+ +T+ + E IYS LL++ PRK V VG +HQADVPPWG Sbjct: 115 EYFSPERPIYFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGANHQADVPPWGFSRASYV 174 Query: 332 ----GIVNASV----DEDNGDK-LMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSI 484 G V+ S + D +K LMGTCIIPMP + + ++ VG G DCSC+D S+ Sbjct: 175 PHASGTVSDSNFTAWNRDEAEKRLMGTCIIPMPEMELTSIDQK-VGKGRTDCSCVDRESM 233 Query: 485 GCVRQHIVEAREMLRKTLG---------------------QEXXXXXXEVVFSNLASSGR 601 CVRQHI+E RE L K++G E +VVF+N AS R Sbjct: 234 RCVRQHIMEEREKLLKSIGFEKFTELGFADMGEQVAEKWSAEDEHLFHKVVFNNPASLNR 293 Query: 602 NFWDHLSVVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXX 781 NFW++LS+VFPSRTKK+I+SYYFNVFMLR RAEQNR +N DSDNDEW+ + Sbjct: 294 NFWNYLSIVFPSRTKKEIVSYYFNVFMLRKRAEQNRNHLLNADSDNDEWQGN--DENEIS 351 Query: 782 XXXXXXXSVVESPA-------NQSGFYIEEC---FHADETYXXXXXXXXXXXXEEYNERI 931 SV E P N + ++EE F ADET+ + Sbjct: 352 THDEDDDSVTEYPICQDDCNNNCNDNHLEEYDDEFAADETF-----------------TV 394 Query: 932 DNVSDRRVRKFLGDCSFDTEVQLESKNSQN--GGEDHDIQDDSCMSYECQPHKADSNGPV 1105 D R D +D V + + N +D + DSC + K DS Sbjct: 395 KGTMDCTKRNIGDDSKYD-HVGMHNSNGSPLIQPQDQHVWQDSC----DEKVKGDSYTSH 449 Query: 1106 DAGAETQESHVESDHCQHFQCSYGELSGGIDH----DYVLELSDARLWDVGYQNCPEKDL 1273 D G ++E V+S H+ +Y +S G H YVLE DA +WD G+ +C + + Sbjct: 450 DIGVASREIKVKSGSGDHWSSNYNGVSNGYSHGYSQGYVLEPCDAPVWDSGFVSCSKNKI 509 Query: 1274 DFLSTCNMIEEVFGE 1318 DFL TC+MIEEVFG+ Sbjct: 510 DFLPTCSMIEEVFGD 524 >ref|XP_007224589.1| hypothetical protein PRUPE_ppa1027132mg [Prunus persica] gi|462421525|gb|EMJ25788.1| hypothetical 
protein PRUPE_ppa1027132mg [Prunus persica] Length = 511 Score = 246 bits (628), Expect = 3e-62 Identities = 164/446 (36%), Positives = 219/446 (49%), Gaps = 57/446 (12%) Frame = +2 Query: 197 FEPDCSTKTIVQPEGIYSPLLDYHPRKLVPVGPDHQADVPPWGLKGIVNASVDEDN---- 364 FE D +T V + +YS L D PRK VPVGPDHQA +P W + DE N Sbjct: 65 FELDFPRRTFVPFKDVYSSLADRFPRKPVPVGPDHQARIPTWTGRVKCLDQTDESNLNRF 124 Query: 365 ---------------GDKLMGTCIIPMPNSNVFAYNRDGVGHGFCDCSCLDEGSIGCVRQ 499 + L+GT +IPMP+SN+ A D VG G DCSCLD G++ CV++ Sbjct: 125 SLHSLESEKVVNNASEENLLGTSVIPMPDSNLSALKCDKVGLGRTDCSCLDPGTVRCVQK 184 Query: 500 HIVEAREMLRKTLG---------------------QEXXXXXXEVVFSNLASSGRNFWDH 616 H+++ARE LR+TLG +E EVV+SN AS GRNFW Sbjct: 185 HVMDAREELRRTLGNEKFVKLGFCDMGEEVARRWSEEEEETFLEVVYSNPASVGRNFWKQ 244 Query: 617 LSVVFPSRTKKDIISYYFNVFMLRIRAEQNRLDPMNIDSDNDEWRSSXXXXXXXXXXXXX 796 LSVVFPSR++++++SYYFNVFMLR RA QNR + + IDSD+DEW Sbjct: 245 LSVVFPSRSRRELVSYYFNVFMLRRRAVQNRSNILEIDSDDDEWHGDNGGSIDRRVAEYD 304 Query: 797 XXSVVESPANQSGFYIEECFHADET--------------YXXXXXXXXXXXXEEYNERID 934 SV+ES Q E ++DE E + ID Sbjct: 305 EDSVIESRVYQDDHVDHEEDYSDEDDSDDDDVDDDGSDGDGDGDGGHVKGDSSEEDGGID 364 Query: 935 NVSDRRVRKFLGDCSFDTEVQLESKNSQNGGEDHDIQDDSCMSYECQPHKADSNGPVDAG 1114 N+ + + K + D FDT Q K S E+ D QDDSC+S+E Q + DS +DAG Sbjct: 365 NM-ESYMLKTVDDGKFDTVGQHGEKTSGCTREEFDFQDDSCVSFEFQSNMHDSCDRIDAG 423 Query: 1115 AETQESHVESDHCQHFQCSYGELSGGID---HDYVLELSDARLWDVGYQNCPEKDLDFLS 1285 A V +C +G+ D H Y+LE DA++WD + K +D L Sbjct: 424 AAGSALQVTGFRNDRSKCLHGQPDASSDVVGHAYLLEPCDAKVWDARFPLDAMKGVDVLP 483 Query: 1286 TCNMIEEVFGEEAWNWNNKGRDEKGG 1363 T +MIEE+F E ++ K RDE+ G Sbjct: 484 TWSMIEEIFDEGMGDY--KTRDEQKG 507