BLASTX nr result
ID: Cinnamomum23_contig00035550
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00035550
         (664 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_010245479.1| PREDICTED: sacsin [Nelumbo nucifera]             289   8e-76
ref|XP_010935539.1| PREDICTED: sacsin [Elaeis guineensis]            280   5e-73
ref|XP_006847865.2| PREDICTED: uncharacterized protein LOC184375...  278   2e-72
gb|ERN09446.1| hypothetical protein AMTR_s00029p00083380 [Ambore...  278   2e-72
ref|XP_008803352.1| PREDICTED: sacsin [Phoenix dactylifera]          277   4e-72
ref|XP_012438099.1| PREDICTED: sacsin isoform X3 [Gossypium raim...  275   2e-71
ref|XP_012438098.1| PREDICTED: sacsin isoform X2 [Gossypium raim...  275   2e-71
gb|KJB49996.1| hypothetical protein B456_008G149000 [Gossypium r...  275   2e-71
gb|KJB49995.1| hypothetical protein B456_008G149000 [Gossypium r...  275   2e-71
gb|KJB49994.1| hypothetical protein B456_008G149000 [Gossypium r...  275   2e-71
ref|XP_012438097.1| PREDICTED: uncharacterized protein LOC105764...  275   2e-71
gb|KHG13033.1| Sacsin [Gossypium arboreum]                           273   6e-71
ref|XP_007043304.1| Binding protein, putative isoform 2 [Theobro...  273   6e-71
ref|XP_007043303.1| Binding protein, putative isoform 1 [Theobro...  273   6e-71
ref|XP_011463440.1| PREDICTED: sacsin [Fragaria vesca subsp. vesca]  272   1e-70
ref|XP_008221054.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Prun...  272   1e-70
ref|XP_007221931.1| hypothetical protein PRUPE_ppa000003mg [Prun...  271   2e-70
ref|XP_002307173.2| hypothetical protein POPTR_0005s09590g [Popu...  271   3e-70
ref|XP_002527141.1| protein binding protein, putative [Ricinus c...
270 4e-70 gb|KDO52761.1| hypothetical protein CISIN_1g0000071mg, partial [... 270 7e-70 >ref|XP_010245479.1| PREDICTED: sacsin [Nelumbo nucifera] Length = 4779 Score = 289 bits (740), Expect = 8e-76 Identities = 143/221 (64%), Positives = 175/221 (79%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I EL+P+VR+ VM+SIL DLPQL E+++ ++ L+KLEFVPTL G LKCP ALYDPRNEE Sbjct: 900 ICELQPEVRDRVMLSILQDLPQLCAEETSLRDSLRKLEFVPTLSGILKCPDALYDPRNEE 959 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD +P G+FQES R+ VSPET+IQSARQIELMMH+D KA+ + Sbjct: 960 LYALLEDSDSYPYGLFQESGALDMLIGLGLRTFVSPETIIQSARQIELMMHKDQQKAHVK 1019 Query: 363 GRVLLSYLEVNATKWFCNSLN-GERIMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G+ LLSYLEVNA KW N LN G+R MN++FS+V + RN E +LEKFWNDLR+IC Sbjct: 1020 GKALLSYLEVNAVKWSFNLLNDGKRRMNRLFSQVATSFKPRNS--EIDLEKFWNDLRMIC 1077 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV +P+ +LPWPS++SMVAPPK VRLPAD+WLVSAS+ Sbjct: 1078 WCPVLVAAPYPSLPWPSISSMVAPPKLVRLPADMWLVSASL 1118 Score = 80.5 bits (197), Expect = 7e-13 Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 28/234 (11%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 SI H++ L ED++ K + FV G+ + P LYDPR L +L + FPS Sbjct: 2321 SIFHEIKLLIEEDTSIKSVFSQTAFVLAANGSWQHPSRLYDPRVPGLRKVLHNEAYFPSD 2380 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F + + + ++ AR ++++ + L++ + G LL+ L+ +K Sbjct: 2381 KFLDDEALELLVCLGLKRMLGFTGLLDCARSVKMLHDSEDLESLNYGSRLLACLDALGSK 2440 Query: 405 W---------------FC---NSLNGERIMNKMFSK--------VTAAIISRNGSL--EA 500 C + L + ++ F K + I+S G + + Sbjct: 2441 LSHLEKDSCDDTSHFSLCEIQSDLGDDGEVSVDFPKKDMENGCKLDLDIVSCLGDMIYDK 2500 Query: 501 NLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 E+FW++++ I WCP+ + P LPW + VAPP VR + +W+VS++M Sbjct: 2501 PEEEFWSEMKTIAWCPIYTDPPIQGLPWFTSKQKVAPPGIVRPKSQMWMVSSAM 2554 >ref|XP_010935539.1| PREDICTED: sacsin [Elaeis 
guineensis] Length = 4766 Score = 280 bits (716), Expect = 5e-73 Identities = 141/220 (64%), Positives = 170/220 (77%), Gaps = 1/220 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 + +LEP+VR+TVM+SIL DLPQL LEDS+FKE LK+L FVPT+ G+LK PQ+LYDPR +E Sbjct: 901 VVKLEPEVRDTVMLSILQDLPQLCLEDSSFKELLKRLTFVPTIHGSLKSPQSLYDPRVDE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 L ALLE+SD FP G FQE R+SVS +T+IQSARQ+EL+MH+D LKA SR Sbjct: 961 LLALLEESDCFPCGSFQEQGVLDMLLLLGLRTSVSADTIIQSARQVELLMHKDQLKAYSR 1020 Query: 363 GRVLLSYLEVNATKWFCNSLN-GERIMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G+VLLSYLEVNA KW N N + +N MFSKV A+ R +EA+LEKFWNDLR+IC Sbjct: 1021 GKVLLSYLEVNAVKWLYNMPNDSQSRVNVMFSKVATALRPREMPMEADLEKFWNDLRMIC 1080 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSAS 659 WCPVLV +P ALPWPSV+SMVAPPK VRL D+W+VSAS Sbjct: 1081 WCPVLVTAPHPALPWPSVSSMVAPPKLVRLQVDMWIVSAS 1120 Score = 93.6 bits (231), Expect = 8e-17 Identities = 63/232 (27%), Positives = 100/232 (43%), Gaps = 22/232 (9%) Frame = +3 Query: 33 TVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDR 212 +++ SIL D+ L+ ED+ FK L + FV G+ P LYDPR L LL Sbjct: 2318 SILTSILLDVKFLNEEDAAFKSALSETHFVLAADGSWHHPSRLYDPRVPGLQNLLHKEVF 2377 Query: 213 FPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEV 392 FPS FQ++ R ++ +I AR + ++ + A G+ LL YL Sbjct: 2378 FPSDKFQDAEILESLASLGLRKTLGFTALIDCARSVSMLHDSGSINAPIYGKRLLVYLNA 2437 Query: 393 NATKWFCNSLNGERI---MNKMFSKVTAAI-------------------ISRNGSLEANL 506 K N N E + ++ + S + + N + + Sbjct: 2438 VGLK-LSNVSNIEEVNHGVDNIMSSIDGGLHDGDSQSKTPEECDQDVFSFLSNFDYDQSE 2496 Query: 507 EKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 ++FW+ ++ I WCPV V +P LPW +APP R + +W+VS+ M Sbjct: 2497 DEFWSQIKAIAWCPVYVTAPHKELPWSISGDCIAPPNITRPKSQMWIVSSKM 2548 >ref|XP_006847865.2| PREDICTED: uncharacterized protein LOC18437599 [Amborella trichopoda] Length = 4710 Score = 278 bits (710), Expect = 2e-72 Identities = 140/221 (63%), Positives = 171/221 (77%), 
Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I EL+P+VR+TV+++I+ LPQL E+++FK+ LKKL+FVPTLGG LK PQ LYDPRNEE Sbjct: 858 IGELQPEVRDTVLLAIVQGLPQLCAEEASFKDTLKKLDFVPTLGGCLKSPQMLYDPRNEE 917 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FP G F+E R+ VSP+TVI SARQIE +M+ DP KA SR Sbjct: 918 LYALLEDSDDFPCGRFREPEVLDMLQGLGLRTLVSPDTVIHSARQIEQIMYTDPQKAYSR 977 Query: 363 GRVLLSYLEVNATKWFCNSL-NGERIMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 RVLL +LEVNATKW+ +S+ + +I+N+MFSKV A SR EA+L KFWND+R+IC Sbjct: 978 SRVLLLFLEVNATKWYTDSISDSHKIINQMFSKVAMAFKSRETLQEADLVKFWNDMRMIC 1037 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV P+ ALPWPSV+SMVAPPK VRL +DLWLVSASM Sbjct: 1038 WCPVLVKPPYHALPWPSVSSMVAPPKLVRLQSDLWLVSASM 1078 Score = 93.6 bits (231), Expect = 8e-17 Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 27/237 (11%) Frame = +3 Query: 33 TVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDR 212 +V+ S+L DL L EDS+FK + + FV T G+ +CP LYDPR L LL Sbjct: 2269 SVLSSMLEDLKLLIEEDSSFKSDVSQTPFVLTANGSRQCPCRLYDPRIPGLQQLLYKDAF 2328 Query: 213 FPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEV 392 FP G F + ++++ ++ SAR + ++ +A + GR LL L+ Sbjct: 2329 FPCGEFLKCDILEILLSLGMKNTLGFSGLLDSARSVSMLYDSGSKEAMNFGRRLLDCLDA 2388 Query: 393 NATKW--FCNSLNGERIMNKMFSKVTAAI---------------ISRNGSLEAN------ 503 K + + F K A + +S G L+ Sbjct: 2389 VGFKLADMIEYKTSDDYGSSNFDKKEAGMPSSRARSMLLGELNDVSSEGDLDMQWCINFT 2448 Query: 504 ----LEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 + FW +LR I WCPVLV+ P LPW VA P VR + +W+VS++M Sbjct: 2449 HDEPKDDFWLELRDIAWCPVLVDPPIEGLPWAVSEIQVASPGYVRPMSQMWMVSSTM 2505 >gb|ERN09446.1| hypothetical protein AMTR_s00029p00083380 [Amborella trichopoda] Length = 4752 Score = 278 bits (710), Expect = 2e-72 Identities = 140/221 (63%), Positives = 171/221 (77%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I 
EL+P+VR+TV+++I+ LPQL E+++FK+ LKKL+FVPTLGG LK PQ LYDPRNEE Sbjct: 900 IGELQPEVRDTVLLAIVQGLPQLCAEEASFKDTLKKLDFVPTLGGCLKSPQMLYDPRNEE 959 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FP G F+E R+ VSP+TVI SARQIE +M+ DP KA SR Sbjct: 960 LYALLEDSDDFPCGRFREPEVLDMLQGLGLRTLVSPDTVIHSARQIEQIMYTDPQKAYSR 1019 Query: 363 GRVLLSYLEVNATKWFCNSL-NGERIMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 RVLL +LEVNATKW+ +S+ + +I+N+MFSKV A SR EA+L KFWND+R+IC Sbjct: 1020 SRVLLLFLEVNATKWYTDSISDSHKIINQMFSKVAMAFKSRETLQEADLVKFWNDMRMIC 1079 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV P+ ALPWPSV+SMVAPPK VRL +DLWLVSASM Sbjct: 1080 WCPVLVKPPYHALPWPSVSSMVAPPKLVRLQSDLWLVSASM 1120 Score = 93.6 bits (231), Expect = 8e-17 Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 27/237 (11%) Frame = +3 Query: 33 TVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDR 212 +V+ S+L DL L EDS+FK + + FV T G+ +CP LYDPR L LL Sbjct: 2311 SVLSSMLEDLKLLIEEDSSFKSDVSQTPFVLTANGSRQCPCRLYDPRIPGLQQLLYKDAF 2370 Query: 213 FPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEV 392 FP G F + ++++ ++ SAR + ++ +A + GR LL L+ Sbjct: 2371 FPCGEFLKCDILEILLSLGMKNTLGFSGLLDSARSVSMLYDSGSKEAMNFGRRLLDCLDA 2430 Query: 393 NATKW--FCNSLNGERIMNKMFSKVTAAI---------------ISRNGSLEAN------ 503 K + + F K A + +S G L+ Sbjct: 2431 VGFKLADMIEYKTSDDYGSSNFDKKEAGMPSSRARSMLLGELNDVSSEGDLDMQWCINFT 2490 Query: 504 ----LEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 + FW +LR I WCPVLV+ P LPW VA P VR + +W+VS++M Sbjct: 2491 HDEPKDDFWLELRDIAWCPVLVDPPIEGLPWAVSEIQVASPGYVRPMSQMWMVSSTM 2547 >ref|XP_008803352.1| PREDICTED: sacsin [Phoenix dactylifera] Length = 4767 Score = 277 bits (708), Expect = 4e-72 Identities = 140/220 (63%), Positives = 170/220 (77%), Gaps = 1/220 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 + ELEP+VR+ VM+SIL DLPQL LEDS+FKE LK+L FVPT+ G+LK PQ+LYDPR +E Sbjct: 901 VVELEPEVRDAVMLSILQDLPQLCLEDSSFKELLKRLTFVPTIHGSLKSPQSLYDPRVDE 960 
Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 L ALLE+SD FPSG+FQE R+SVS +T+IQSARQ+E +MH+D LKA SR Sbjct: 961 LLALLEESDCFPSGLFQEPGVLDMLLLLGLRTSVSTDTIIQSARQVESLMHKDQLKAYSR 1020 Query: 363 GRVLLSYLEVNATKWFCNSLN-GERIMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G+VLLSYLEVN KW N N + +N MFSKV A+ R+ +EA+LEKFW+DLR+IC Sbjct: 1021 GKVLLSYLEVNPVKWLHNMPNDSQSRVNGMFSKVATALRPRDMPIEADLEKFWSDLRMIC 1080 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSAS 659 WCPVLV +P ALPWPSV+SMVAPPK VRL D+WLVSAS Sbjct: 1081 WCPVLVTAPHPALPWPSVSSMVAPPKLVRLQVDMWLVSAS 1120 Score = 97.8 bits (242), Expect = 4e-18 Identities = 64/232 (27%), Positives = 106/232 (45%), Gaps = 22/232 (9%) Frame = +3 Query: 33 TVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDR 212 +++ SIL D+ L+ D+ FK L + FV G+ + P LYDPR L+ LL Sbjct: 2318 SILSSILLDVKFLNEVDTAFKTALSETHFVLAANGSWRHPSRLYDPRVPSLHNLLHKEVF 2377 Query: 213 FPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEV 392 FPS FQ++ R ++S ++ SAR + ++ + A G+ LL YL Sbjct: 2378 FPSEKFQDAAILESLASLGLRKTLSFTALLDSARSVSMLHDSGSINALIYGKRLLVYL-- 2435 Query: 393 NATKWFCNSLNGERI--------------------MNKMFSKVTAAIIS--RNGSLEANL 506 NA + ++ N E + +K + + S N + + Sbjct: 2436 NALGFKLSNANIEEVNHGVDNIMSSIDGGSHDGDPQSKTHEECDQEVFSFLSNFDHDQSE 2495 Query: 507 EKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 ++FW+ +++I WCPV V +P LPW +APP R + +W+VS+ M Sbjct: 2496 DEFWSQIKVIAWCPVYVTAPHKELPWSKSGDCIAPPNVTRPKSQMWIVSSKM 2547 >ref|XP_012438099.1| PREDICTED: sacsin isoform X3 [Gossypium raimondii] Length = 4192 Score = 275 bits (702), Expect = 2e-71 Identities = 136/221 (61%), Positives = 169/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR+ VM+SIL +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 901 IKEMHNEVRDNVMLSILENLPQLSIEDASLRDYLRNLEFVPTFTGALKCPSVLYDPRNEE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI SA+QIE MMH D KA+SR Sbjct: 961 
LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIGSAQQIEQMMHEDQHKAHSR 1020 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N ++ ++ +N++FS+ A RN + ++LEKFWNDLR+IC Sbjct: 1021 GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWNDLRMIC 1078 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 1079 WCPVLVSSPFQALPWPVVSSKVAPPKLVRLQTDLWLISASM 1119 Score = 80.9 bits (198), Expect = 6e-13 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 46/252 (18%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL LL FPS Sbjct: 2317 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPRVPELQKLLCKEVFFPSE 2376 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F R ++ + AR I + +A + GR LL YL+ A K Sbjct: 2377 KFSGPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2436 Query: 405 WFCNSLNGE---RIMNKM----------FSKVTAAIISRNGSL----------------- 494 + G+ I NK+ S++ +I N L Sbjct: 2437 -LSSVREGDVQRAISNKLPENYPASEGNGSEMPGDLIDLNSDLVCGDAVAVDFPKREETI 2495 Query: 495 ----------------EANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVR 626 + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2496 CKDDIDIDNVIGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVR 2555 Query: 627 LPADLWLVSASM 662 + +W+VS++M Sbjct: 2556 PKSQMWMVSSTM 2567 >ref|XP_012438098.1| PREDICTED: sacsin isoform X2 [Gossypium raimondii] Length = 4265 Score = 275 bits (702), Expect = 2e-71 Identities = 136/221 (61%), Positives = 169/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR+ VM+SIL +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 901 IKEMHNEVRDNVMLSILENLPQLSIEDASLRDYLRNLEFVPTFTGALKCPSVLYDPRNEE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI SA+QIE MMH D KA+SR Sbjct: 961 LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIGSAQQIEQMMHEDQHKAHSR 1020 
Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N ++ ++ +N++FS+ A RN + ++LEKFWNDLR+IC Sbjct: 1021 GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWNDLRMIC 1078 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 1079 WCPVLVSSPFQALPWPVVSSKVAPPKLVRLQTDLWLISASM 1119 Score = 80.9 bits (198), Expect = 6e-13 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 46/252 (18%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL LL FPS Sbjct: 2317 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPRVPELQKLLCKEVFFPSE 2376 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F R ++ + AR I + +A + GR LL YL+ A K Sbjct: 2377 KFSGPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2436 Query: 405 WFCNSLNGE---RIMNKM----------FSKVTAAIISRNGSL----------------- 494 + G+ I NK+ S++ +I N L Sbjct: 2437 -LSSVREGDVQRAISNKLPENYPASEGNGSEMPGDLIDLNSDLVCGDAVAVDFPKREETI 2495 Query: 495 ----------------EANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVR 626 + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2496 CKDDIDIDNVIGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVR 2555 Query: 627 LPADLWLVSASM 662 + +W+VS++M Sbjct: 2556 PKSQMWMVSSTM 2567 >gb|KJB49996.1| hypothetical protein B456_008G149000 [Gossypium raimondii] Length = 4409 Score = 275 bits (702), Expect = 2e-71 Identities = 136/221 (61%), Positives = 169/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR+ VM+SIL +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 686 IKEMHNEVRDNVMLSILENLPQLSIEDASLRDYLRNLEFVPTFTGALKCPSVLYDPRNEE 745 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI SA+QIE MMH D KA+SR Sbjct: 746 LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIGSAQQIEQMMHEDQHKAHSR 805 Query: 363 
GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N ++ ++ +N++FS+ A RN + ++LEKFWNDLR+IC Sbjct: 806 GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWNDLRMIC 863 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 864 WCPVLVSSPFQALPWPVVSSKVAPPKLVRLQTDLWLISASM 904 Score = 80.9 bits (198), Expect = 6e-13 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 46/252 (18%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL LL FPS Sbjct: 2102 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPRVPELQKLLCKEVFFPSE 2161 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F R ++ + AR I + +A + GR LL YL+ A K Sbjct: 2162 KFSGPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2221 Query: 405 WFCNSLNGE---RIMNKM----------FSKVTAAIISRNGSL----------------- 494 + G+ I NK+ S++ +I N L Sbjct: 2222 -LSSVREGDVQRAISNKLPENYPASEGNGSEMPGDLIDLNSDLVCGDAVAVDFPKREETI 2280 Query: 495 ----------------EANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVR 626 + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2281 CKDDIDIDNVIGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVR 2340 Query: 627 LPADLWLVSASM 662 + +W+VS++M Sbjct: 2341 PKSQMWMVSSTM 2352 >gb|KJB49995.1| hypothetical protein B456_008G149000 [Gossypium raimondii] Length = 4223 Score = 275 bits (702), Expect = 2e-71 Identities = 136/221 (61%), Positives = 169/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR+ VM+SIL +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 500 IKEMHNEVRDNVMLSILENLPQLSIEDASLRDYLRNLEFVPTFTGALKCPSVLYDPRNEE 559 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI SA+QIE MMH D KA+SR Sbjct: 560 LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIGSAQQIEQMMHEDQHKAHSR 619 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 
G++LLSYLEVNA KW N ++ ++ +N++FS+ A RN + ++LEKFWNDLR+IC Sbjct: 620 GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWNDLRMIC 677 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 678 WCPVLVSSPFQALPWPVVSSKVAPPKLVRLQTDLWLISASM 718 Score = 80.9 bits (198), Expect = 6e-13 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 46/252 (18%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL LL FPS Sbjct: 1916 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPRVPELQKLLCKEVFFPSE 1975 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F R ++ + AR I + +A + GR LL YL+ A K Sbjct: 1976 KFSGPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2035 Query: 405 WFCNSLNGE---RIMNKM----------FSKVTAAIISRNGSL----------------- 494 + G+ I NK+ S++ +I N L Sbjct: 2036 -LSSVREGDVQRAISNKLPENYPASEGNGSEMPGDLIDLNSDLVCGDAVAVDFPKREETI 2094 Query: 495 ----------------EANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVR 626 + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2095 CKDDIDIDNVIGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVR 2154 Query: 627 LPADLWLVSASM 662 + +W+VS++M Sbjct: 2155 PKSQMWMVSSTM 2166 >gb|KJB49994.1| hypothetical protein B456_008G149000 [Gossypium raimondii] Length = 4506 Score = 275 bits (702), Expect = 2e-71 Identities = 136/221 (61%), Positives = 169/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR+ VM+SIL +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 901 IKEMHNEVRDNVMLSILENLPQLSIEDASLRDYLRNLEFVPTFTGALKCPSVLYDPRNEE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI SA+QIE MMH D KA+SR Sbjct: 961 LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIGSAQQIEQMMHEDQHKAHSR 1020 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N ++ ++ +N++FS+ A RN + ++LEKFWNDLR+IC Sbjct: 1021 
GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWNDLRMIC 1078 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 1079 WCPVLVSSPFQALPWPVVSSKVAPPKLVRLQTDLWLISASM 1119 Score = 80.9 bits (198), Expect = 6e-13 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 46/252 (18%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL LL FPS Sbjct: 2317 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPRVPELQKLLCKEVFFPSE 2376 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F R ++ + AR I + +A + GR LL YL+ A K Sbjct: 2377 KFSGPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2436 Query: 405 WFCNSLNGE---RIMNKM----------FSKVTAAIISRNGSL----------------- 494 + G+ I NK+ S++ +I N L Sbjct: 2437 -LSSVREGDVQRAISNKLPENYPASEGNGSEMPGDLIDLNSDLVCGDAVAVDFPKREETI 2495 Query: 495 ----------------EANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVR 626 + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2496 CKDDIDIDNVIGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVR 2555 Query: 627 LPADLWLVSASM 662 + +W+VS++M Sbjct: 2556 PKSQMWMVSSTM 2567 >ref|XP_012438097.1| PREDICTED: uncharacterized protein LOC105764150 isoform X1 [Gossypium raimondii] gi|763782922|gb|KJB49993.1| hypothetical protein B456_008G149000 [Gossypium raimondii] Length = 4789 Score = 275 bits (702), Expect = 2e-71 Identities = 136/221 (61%), Positives = 169/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR+ VM+SIL +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 901 IKEMHNEVRDNVMLSILENLPQLSIEDASLRDYLRNLEFVPTFTGALKCPSVLYDPRNEE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI SA+QIE MMH D KA+SR Sbjct: 961 LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIGSAQQIEQMMHEDQHKAHSR 1020 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N 
++ ++ +N++FS+ A RN + ++LEKFWNDLR+IC Sbjct: 1021 GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWNDLRMIC 1078 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 1079 WCPVLVSSPFQALPWPVVSSKVAPPKLVRLQTDLWLISASM 1119 Score = 80.9 bits (198), Expect = 6e-13 Identities = 69/252 (27%), Positives = 99/252 (39%), Gaps = 46/252 (18%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL LL FPS Sbjct: 2317 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPRVPELQKLLCKEVFFPSE 2376 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F R ++ + AR I + +A + GR LL YL+ A K Sbjct: 2377 KFSGPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2436 Query: 405 WFCNSLNGE---RIMNKM----------FSKVTAAIISRNGSL----------------- 494 + G+ I NK+ S++ +I N L Sbjct: 2437 -LSSVREGDVQRAISNKLPENYPASEGNGSEMPGDLIDLNSDLVCGDAVAVDFPKREETI 2495 Query: 495 ----------------EANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVR 626 + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2496 CKDDIDIDNVIGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVR 2555 Query: 627 LPADLWLVSASM 662 + +W+VS++M Sbjct: 2556 PKSQMWMVSSTM 2567 >gb|KHG13033.1| Sacsin [Gossypium arboreum] Length = 4398 Score = 273 bits (698), Expect = 6e-71 Identities = 134/221 (60%), Positives = 171/221 (77%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR++VM+S+L +LPQLS+ED++ +++L+ LEFVPT G LKCP LYDPRNEE Sbjct: 548 IKEMHNEVRDSVMLSVLENLPQLSIEDTSLRDYLRNLEFVPTSTGALKCPSVLYDPRNEE 607 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FPSG FQES R+SV+PETVI+SA+QIE MMH D KA+SR Sbjct: 608 LYALLEDSDSFPSGPFQESGILDMLQGLGLRTSVTPETVIESAQQIERMMHEDQHKAHSR 667 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N ++ ++ +N++FS+ A RN + ++LEKFW+DLR+IC Sbjct: 668 
GKILLSYLEVNAMKWLPNQVSDDQGAVNRIFSRAATAFRPRN--MRSDLEKFWSDLRMIC 725 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF ALPWP V+S VAPPK VRL DLWL+SASM Sbjct: 726 WCPVLVSSPFQALPWPVVSSKVAPPKIVRLQTDLWLISASM 766 Score = 80.5 bits (197), Expect = 7e-13 Identities = 68/251 (27%), Positives = 98/251 (39%), Gaps = 45/251 (17%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDP+ EL LL FPS Sbjct: 1964 AILHDVKMLVEEDISIRSALSTTPFVLAANGSWQPPSRLYDPQVPELQKLLCKDVFFPSE 2023 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F + R ++ + AR I + +A + GR LL YL+ A K Sbjct: 2024 KFSDPETLDTLVSLGLRRTLGFIGFLDCARSISTLHESGDPEAATYGRKLLLYLDALACK 2083 Query: 405 W-----------FCNSL---------NGERIMNKM----------------FSKVTAAII 476 N L NG + + F K I Sbjct: 2084 LSSVREGDVQKAISNKLPENYPASEGNGSEMPGDLIDLNSDVVCRDAVSVDFPKREETIC 2143 Query: 477 SRNGSLEANL---------EKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRL 629 + +E + E FW++++ I WCPV VN PF LPW TS + VR Sbjct: 2144 KDDIDIENVMGNSMDDMPEEDFWSEMKTIAWCPVCVNPPFQGLPWLKPTSHLVSSSTVRP 2203 Query: 630 PADLWLVSASM 662 + +W+VS++M Sbjct: 2204 KSQMWMVSSTM 2214 >ref|XP_007043304.1| Binding protein, putative isoform 2 [Theobroma cacao] gi|508707239|gb|EOX99135.1| Binding protein, putative isoform 2 [Theobroma cacao] Length = 3525 Score = 273 bits (698), Expect = 6e-71 Identities = 133/221 (60%), Positives = 170/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR++VM+S+L +LPQLS+ED++ +++L+ LEFVPT+ G +KCP LYDPRNEE Sbjct: 291 IKEMHAEVRDSVMLSVLENLPQLSVEDTSLRDYLRNLEFVPTVSGAIKCPSVLYDPRNEE 350 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FP G FQES R+SV+PETVI+SARQ+E +MH D KA+SR Sbjct: 351 LYALLEDSDSFPFGPFQESGILDMLQGLGLRTSVTPETVIESARQVERIMHEDQDKAHSR 410 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G+VLLSYLEVNA KW N L ++ +N++FS+ A RN L++++EKFWNDLR+IC 
Sbjct: 411 GKVLLSYLEVNAMKWLPNQLGDDQGTVNRLFSRAATAFKPRN--LKSDMEKFWNDLRLIC 468 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF +PWP V+S VAPPK VRL DLWLVSASM Sbjct: 469 WCPVLVSSPFQDIPWPVVSSKVAPPKLVRLQTDLWLVSASM 509 Score = 87.8 bits (216), Expect = 5e-15 Identities = 71/255 (27%), Positives = 108/255 (42%), Gaps = 49/255 (19%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL +L FPS Sbjct: 1707 AILHDVKLLLEEDISIRSALAATPFVLAANGSWQQPSRLYDPRVPELQKVLHKEVFFPSE 1766 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F + R S+ ++ AR + ++ +A + GR LL YL+ A K Sbjct: 1767 KFSDPETLDTLVILGLRRSLGFIGLLDCARSVSILHESGDPQAATCGRKLLLYLDALACK 1826 Query: 405 WFCNSLNGER-------IMNKM----------FSKVTAAIISRN---------------- 485 L+ ER I NK+ +++ +A+ RN Sbjct: 1827 -----LSSEREGDVEQIISNKLPKNDPASEGNDNEMPSALFCRNSDIIDGDAVDVDSSNR 1881 Query: 486 --------------GSLEANL--EKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPK 617 G+L N+ E FW++++ I WCP+ VN P LPW S +A P Sbjct: 1882 ENTCKDDIDIDNVIGNLIDNMPEEDFWSEMKTIAWCPICVNPPLQGLPWLKSPSHLASPS 1941 Query: 618 QVRLPADLWLVSASM 662 VR + +W+VS++M Sbjct: 1942 IVRPKSQMWVVSSTM 1956 >ref|XP_007043303.1| Binding protein, putative isoform 1 [Theobroma cacao] gi|508707238|gb|EOX99134.1| Binding protein, putative isoform 1 [Theobroma cacao] Length = 4780 Score = 273 bits (698), Expect = 6e-71 Identities = 133/221 (60%), Positives = 170/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 I E+ +VR++VM+S+L +LPQLS+ED++ +++L+ LEFVPT+ G +KCP LYDPRNEE Sbjct: 901 IKEMHAEVRDSVMLSVLENLPQLSVEDTSLRDYLRNLEFVPTVSGAIKCPSVLYDPRNEE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FP G FQES R+SV+PETVI+SARQ+E +MH D KA+SR Sbjct: 961 LYALLEDSDSFPFGPFQESGILDMLQGLGLRTSVTPETVIESARQVERIMHEDQDKAHSR 1020 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G+VLLSYLEVNA KW N L ++ 
+N++FS+ A RN L++++EKFWNDLR+IC Sbjct: 1021 GKVLLSYLEVNAMKWLPNQLGDDQGTVNRLFSRAATAFKPRN--LKSDMEKFWNDLRLIC 1078 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV+SPF +PWP V+S VAPPK VRL DLWLVSASM Sbjct: 1079 WCPVLVSSPFQDIPWPVVSSKVAPPKLVRLQTDLWLVSASM 1119 Score = 87.8 bits (216), Expect = 5e-15 Identities = 71/255 (27%), Positives = 108/255 (42%), Gaps = 49/255 (19%) Frame = +3 Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224 +ILHD+ L ED + + L FV G+ + P LYDPR EL +L FPS Sbjct: 2317 AILHDVKLLLEEDISIRSALAATPFVLAANGSWQQPSRLYDPRVPELQKVLHKEVFFPSE 2376 Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404 F + R S+ ++ AR + ++ +A + GR LL YL+ A K Sbjct: 2377 KFSDPETLDTLVILGLRRSLGFIGLLDCARSVSILHESGDPQAATCGRKLLLYLDALACK 2436 Query: 405 WFCNSLNGER-------IMNKM----------FSKVTAAIISRN---------------- 485 L+ ER I NK+ +++ +A+ RN Sbjct: 2437 -----LSSEREGDVEQIISNKLPKNDPASEGNDNEMPSALFCRNSDIIDGDAVDVDSSNR 2491 Query: 486 --------------GSLEANL--EKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPK 617 G+L N+ E FW++++ I WCP+ VN P LPW S +A P Sbjct: 2492 ENTCKDDIDIDNVIGNLIDNMPEEDFWSEMKTIAWCPICVNPPLQGLPWLKSPSHLASPS 2551 Query: 618 QVRLPADLWLVSASM 662 VR + +W+VS++M Sbjct: 2552 IVRPKSQMWVVSSTM 2566 >ref|XP_011463440.1| PREDICTED: sacsin [Fragaria vesca subsp. 
vesca] Length = 4772 Score = 272 bits (695), Expect = 1e-70 Identities = 130/221 (58%), Positives = 170/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 + EL+P+VRN +++SI+ +LPQL +ED++F+E+L+ LEF+PTL G L+CP ALYDPRNEE Sbjct: 903 VGELQPEVRNNIVLSIIQNLPQLCIEDTSFREYLRNLEFLPTLSGALRCPTALYDPRNEE 962 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALL+DSD FP G FQE R+SV+PET+IQSA+Q+E +MH D KA+ R Sbjct: 963 LYALLDDSDSFPYGPFQEPGILDMLQGLGLRTSVTPETIIQSAQQVERLMHEDQQKAHLR 1022 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G++LLSYLEVNA KW N +G++ +N+M S+ A RN L++NLEKFWNDLR++ Sbjct: 1023 GKILLSYLEVNAMKWIPNLASGDQGTVNRMLSRAGTAFRPRN--LKSNLEKFWNDLRLVS 1080 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPVLV++PF LPWP V+S VAPPK VRL AD+WLVSASM Sbjct: 1081 WCPVLVSAPFLTLPWPVVSSTVAPPKLVRLQADMWLVSASM 1121 >ref|XP_008221054.1| PREDICTED: LOW QUALITY PROTEIN: sacsin [Prunus mume] Length = 4734 Score = 272 bits (695), Expect = 1e-70 Identities = 132/221 (59%), Positives = 170/221 (76%), Gaps = 1/221 (0%) Frame = +3 Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182 + EL+P+VR+++++SIL +LPQL +ED +F+++LK LEF+PT GG L+ P ALYDPRNEE Sbjct: 901 VGELQPEVRDSIVLSILQNLPQLCVEDLSFRDYLKNLEFIPTFGGALRSPTALYDPRNEE 960 Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362 LYALLEDSD FP G FQE ++SV+PETVIQSARQ+E +MH D K+ + Sbjct: 961 LYALLEDSDSFPCGPFQEPGILDMLHGLGLKTSVTPETVIQSARQVERLMHEDQQKSQLK 1020 Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539 G+VLLSYLEVNA +W N+LN ++ MN+M S+ A RN L++ LEKFWNDLR+I Sbjct: 1021 GKVLLSYLEVNAMRWIPNALNDDQGTMNRMLSRAATAFRPRN--LKSELEKFWNDLRLIS 1078 Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662 WCPV+V++PF LPWP+V+SMVAPPK VRL ADLWLVSASM Sbjct: 1079 WCPVVVSAPFQTLPWPAVSSMVAPPKLVRLQADLWLVSASM 1119 >ref|XP_007221931.1| hypothetical protein PRUPE_ppa000003mg [Prunus persica] 
gi|462418867|gb|EMJ23130.1| hypothetical protein PRUPE_ppa000003mg [Prunus persica]
Length = 4774

Score = 271 bits (693), Expect = 2e-70
Identities = 132/221 (59%), Positives = 170/221 (76%), Gaps = 1/221 (0%)
Frame = +3

Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182
+ EL+P+VR+++++SIL +LPQL +ED +F+++LK LEF+PT GG L+ P ALYDPRNEE
Sbjct: 902 VGELQPEVRDSIVLSILQNLPQLCVEDLSFRDYLKNLEFIPTFGGALRSPTALYDPRNEE 961

Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362
LYALLEDSD FP G FQE ++SV+PETVIQSARQ+E +MH D K+ +
Sbjct: 962 LYALLEDSDSFPCGPFQEPGILDMLHGLGLKTSVTPETVIQSARQVERLMHEDQQKSQLK 1021

Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539
G+VLLSYLEVNA +W N+LN ++ MN+M S+ A RN L+++LEKFWNDLR+I
Sbjct: 1022 GKVLLSYLEVNAMRWIPNALNDDQGTMNRMLSRAATAFRPRN--LKSDLEKFWNDLRLIS 1079

Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662
WCPV+V++PF LPWP V+SMVAPPK VRL ADLWLVSASM
Sbjct: 1080 WCPVVVSAPFQTLPWPVVSSMVAPPKLVRLQADLWLVSASM 1120

>ref|XP_002307173.2| hypothetical protein POPTR_0005s09590g [Populus trichocarpa]
gi|550338481|gb|EEE94169.2| hypothetical protein POPTR_0005s09590g [Populus trichocarpa]
Length = 4775

Score = 271 bits (692), Expect = 3e-70
Identities = 135/218 (61%), Positives = 166/218 (76%), Gaps = 1/218 (0%)
Frame = +3

Query: 12 LEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYA 191
L+P+VR+ M+S+L +LPQL +ED++F+E L+ LEFVPT GTLK P LYDPRNEEL+A
Sbjct: 913 LQPEVRDRTMLSVLQNLPQLCVEDASFRECLRNLEFVPTFSGTLKHPSVLYDPRNEELWA 972

Query: 192 LLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRV 371
LLE+SD FP G FQE +++ SPETVI+SARQ+E +MH D KA+SRG+V
Sbjct: 973 LLEESDSFPCGAFQEPNILDMLHGLGLKTTASPETVIESARQVERLMHEDQQKAHSRGKV 1032

Query: 372 LLSYLEVNATKWFCNSLN-GERIMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIICWCP 548
LLSYLEVNA KW N LN ER +N++FS+ AA R L+++LEKFWNDLR+ICWCP
Sbjct: 1033 LLSYLEVNAMKWLPNQLNDDERTVNRIFSR--AATAFRPRGLKSDLEKFWNDLRMICWCP 1090

Query: 549 VLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662
V+V +PF LPWP VTSMVAPPK VRL ADLWLVSASM
Sbjct: 1091 VMVTAPFKTLPWPIVTSMVAPPKLVRLQADLWLVSASM 1128

Score = 85.1 bits (209), Expect = 3e-14
Identities = 59/235 (25%), Positives = 99/235 (42%), Gaps = 27/235 (11%)
Frame = +3

Query: 39 MISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFP 218
+ +ILHD+ L +D + K L FV G+ + P LYDPR +L +L FP
Sbjct: 2323 LTAILHDVKLLIEDDISIKSALSMTPFVLAANGSWQQPSRLYDPRIPQLRKVLHREAFFP 2382

Query: 219 SGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLE--- 389
S F + + ++ + AR + ++ + S GR L++ L+
Sbjct: 2383 SNEFSDPETLETLVKLGLKKNLGFTGFLDCARSVSMLHESRDSETVSYGRKLVALLDALA 2442

Query: 390 --VNATKWFCNSL----------------------NGERIMNKMFSKVTAAIISRNGSLE 497
++A + CN + ER ++ + N +
Sbjct: 2443 YKLSAEEGECNRNELQKTVLCQNSSDWNSDLAYLDSSERDKDQFIDDLEIDYFLANLIDD 2502

Query: 498 ANLEKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662
E+FW++++ I WCPV V+ P LPW + S VA P VR + +W+VS +M
Sbjct: 2503 KTEEEFWSEMKAISWCPVCVHPPLQGLPWLNSNSQVASPSSVRPKSQMWVVSCTM 2557

>ref|XP_002527141.1| protein binding protein, putative [Ricinus communis]
gi|223533501|gb|EEF35243.1| protein binding protein, putative [Ricinus communis]
Length = 4704

Score = 270 bits (691), Expect = 4e-70
Identities = 132/221 (59%), Positives = 167/221 (75%), Gaps = 1/221 (0%)
Frame = +3

Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182
I EL+P+VR+ +M+S+L +LPQL +ED TF+E +K LEFVPT G++K P LYDPRNEE
Sbjct: 901 IKELQPEVRDNIMLSVLQNLPQLCVEDVTFREIVKNLEFVPTFSGSIKSPAVLYDPRNEE 960

Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362
L ALL+D D FPSGVFQE R+SVSPETVI+SARQ+E +MH D KA+SR
Sbjct: 961 LCALLDDFDGFPSGVFQEPDILDMLHALGLRTSVSPETVIESARQVEKLMHEDQQKAHSR 1020

Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539
G+VL+SYLEVNA KW N +N ++ +N++FS+ A RN L+++LE FWNDLR+IC
Sbjct: 1021 GKVLISYLEVNAMKWLSNQINDDQGTVNRIFSRAATAFRPRN--LKSDLENFWNDLRMIC 1078

Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662
WCPV+V++PF LPWP V+S VAPPK VRL DLWLVSASM
Sbjct: 1079 WCPVMVSAPFQTLPWPVVSSTVAPPKLVRLQTDLWLVSASM 1119

>gb|KDO52761.1| hypothetical protein CISIN_1g0000071mg, partial [Citrus sinensis]
Length = 3749

Score = 270 bits (689), Expect = 7e-70
Identities = 131/221 (59%), Positives = 170/221 (76%), Gaps = 1/221 (0%)
Frame = +3

Query: 3 ISELEPDVRNTVMISILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEE 182
I +L+P++R+ VM+S+L LPQL +ED++F+E +KKLEFVPT G +K PQ LYDPRNEE
Sbjct: 899 IRDLQPEIRDRVMLSVLQSLPQLCVEDTSFRECVKKLEFVPTTSGVVKSPQVLYDPRNEE 958

Query: 183 LYALLEDSDRFPSGVFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSR 362
LYAL+E+SD FP G FQES ++SVSPETVI+SAR++E ++H DP +A+SR
Sbjct: 959 LYALMEESDSFPCGAFQESGILDMLQGLGLKTSVSPETVIESARKVERLLHEDPERAHSR 1018

Query: 363 GRVLLSYLEVNATKWFCNSLNGER-IMNKMFSKVTAAIISRNGSLEANLEKFWNDLRIIC 539
G+VLLSYLEVNA KW + LN ++ +N+MFS+ A RN L+++LEKFW DLR+IC
Sbjct: 1019 GKVLLSYLEVNAMKWLPDQLNDDQGTVNRMFSRAATAFRPRN--LKSDLEKFWIDLRMIC 1076

Query: 540 WCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662
WCPVLV++PF LPWP V+S VAPPK VRL DLW+VSASM
Sbjct: 1077 WCPVLVSAPFECLPWPVVSSTVAPPKLVRLQEDLWIVSASM 1117

Score = 90.5 bits (223), Expect = 7e-16
Identities = 69/232 (29%), Positives = 96/232 (41%), Gaps = 26/232 (11%)
Frame = +3

Query: 45 SILHDLPQLSLEDSTFKEFLKKLEFVPTLGGTLKCPQALYDPRNEELYALLEDSDRFPSG 224
+ILHD+ L ED + K L FV G+ + P LYDPR EL LL FPS
Sbjct: 2313 AILHDVKLLIEEDISIKSTLSMASFVLAANGSWQAPSRLYDPRVPELRKLLHGEMFFPSD 2372

Query: 225 VFQESXXXXXXXXXXXRSSVSPETVIQSARQIELMMHRDPLKANSRGRVLLSYLEVNATK 404
F + ++ ++ AR + + +A G L L+ A K
Sbjct: 2373 QFSDPETLDTLVSLGLNRTLGFTGLLDCARSVSMFHDSRDSQAIDYGWRLFKCLDTLAPK 2432

Query: 405 WFCNS--LNGERIMNKMFSK---------VTAAIISRNGSLEANL--------------- 506
NG ++N MF + V ++ N S E +L
Sbjct: 2433 LSTEKGESNGAEVLNPMFIQNNEVADVQCVDTSVGEENHS-EGDLDFAYVVDNLIDDKPG 2491

Query: 507 EKFWNDLRIICWCPVLVNSPFAALPWPSVTSMVAPPKQVRLPADLWLVSASM 662
E FW+++R I WCPV PF LPW ++ VA P VR + +WLVS SM
Sbjct: 2492 ENFWSEMRAIPWCPVCAEPPFLGLPWLKSSNQVASPCYVRPKSQMWLVSFSM 2543