BLASTX nr result
ID: Magnolia22_contig00015680
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00015680
         (1380 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

XP_010267380.1 PREDICTED: uncharacterized protein LOC104604644 i...   244   2e-73
XP_010267379.1 PREDICTED: uncharacterized protein LOC104604644 i...   244   3e-73
XP_010662932.1 PREDICTED: uncharacterized protein LOC100853133 [...   210   1e-60
XP_010922572.1 PREDICTED: uncharacterized protein LOC105045848 i...   198   1e-56
XP_010922571.1 PREDICTED: uncharacterized protein LOC105045848 i...   198   5e-56
XP_008805185.1 PREDICTED: uncharacterized protein LOC103718240 [...   196   2e-55
XP_010907408.1 PREDICTED: uncharacterized protein LOC105034083 [...   194   2e-54
XP_017700949.1 PREDICTED: uncharacterized protein LOC103718115 [...   183   3e-51
XP_017974077.1 PREDICTED: uncharacterized protein LOC18605858 is...   182   9e-50
EOY23701.1 Uncharacterized protein TCM_015509 isoform 1 [Theobro...   182   1e-49
XP_017974076.1 PREDICTED: uncharacterized protein LOC18605858 is...   181   3e-49
EOY23702.1 Uncharacterized protein TCM_015509 isoform 2 [Theobro...   181   4e-49
OMO95080.1 hypothetical protein CCACVL1_05581 [Corchorus capsula...   179   2e-48
XP_009420008.1 PREDICTED: neural Wiskott-Aldrich syndrome protei...   177   3e-48
XP_009420007.1 PREDICTED: neural Wiskott-Aldrich syndrome protei...   177   6e-48
XP_017612899.1 PREDICTED: uncharacterized protein LOC108458135 i...   176   2e-47
XP_020100428.1 circumsporozoite protein isoform X3 [Ananas comosus]   175   2e-47
XP_020100426.1 circumsporozoite protein isoform X2 [Ananas comosus]   175   2e-47
XP_017612898.1 PREDICTED: uncharacterized protein LOC108458135 i...   176   2e-47
KJB74655.1 hypothetical protein B456_012G000700 [Gossypium raimo...   174   2e-47

>XP_010267380.1 PREDICTED: uncharacterized protein LOC104604644 isoform X2 [Nelumbo nucifera]
          Length = 358

 Score = 244 bits (622), Expect = 2e-73
 Identities = 150/306 (49%), Positives = 190/306 (62%), Gaps = 24/306 (7%)
 Frame = +1

Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNP-----TSKVPTNPIPEENPT 288
           MLCSIS+GKS   WLDRLR+SKGFPV  GLDLEHFLNPNP     +S+   +   +E
Sbjct: 1   MLCSISSGKSAPNWLDRLRSSKGFPVADGLDLEHFLNPNPNQTTLSSETNASYATQEIGY 60

Query: 289 GSNHDATLDGEKSTVDRKKKTSSNPTPPMMIGEKNN---EDWFDIMSSVLSELFNMGDSS 459
              H  +   ++  V  +KK+ + P      G++ N   EDWF IM +VL+ELFNMGDS
Sbjct: 61  SKPHPESTSLDEKPVADRKKSMAGP------GDRKNQGKEDWFGIMGNVLAELFNMGDSG 114

Query: 460 ---RIRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAG------VPAMSPSSVDNSVA 612
              +IR  D   K+S RKQ NP+IC  S SASV+DS LA       VP+MSP S DNSV
Sbjct: 115 EFQKIRGFD--EKRSCRKQPNPKICVFSASASVNDSFLAAAPRLESVPSMSPPSGDNSVT 172

Query: 613 EVKES------RKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLI 774
           E+KE+      +KQG   SI  +++ LQ     +DLS +SR EVTIIDTS P WKSEKL+
Sbjct: 173 EMKETVNSLKPKKQGKVVSIAHDEDKLQ-----TDLSTYSRVEVTIIDTSCPVWKSEKLL 227

Query: 775 YRKGSVWKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSSFLP-SNSFKEADSEEH 951
           +RKGSVWK+RDKK  S NA + RKKRK    D + G  K++   FLP  N  +EA  EE+
Sbjct: 228 FRKGSVWKVRDKKWKSRNASSFRKKRKANHSDKEAGGGKKKGKFFLPLVNITREAGPEEN 287

Query: 952 LISSNE 969
            +  +E
Sbjct: 288 KVPLDE 293

>XP_010267379.1 PREDICTED: uncharacterized protein LOC104604644 isoform X1 [Nelumbo nucifera]
          Length = 364

 Score = 244 bits (622), Expect = 3e-73
 Identities = 150/306 (49%), Positives = 190/306 (62%), Gaps = 24/306 (7%)
 Frame = +1

Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNP-----TSKVPTNPIPEENPT 288
           MLCSIS+GKS   WLDRLR+SKGFPV  GLDLEHFLNPNP     +S+   +   +E
Sbjct: 1
MLCSISSGKSAPNWLDRLRSSKGFPVADGLDLEHFLNPNPNQTTLSSETNASYATQEIGY 60 Query: 289 GSNHDATLDGEKSTVDRKKKTSSNPTPPMMIGEKNN---EDWFDIMSSVLSELFNMGDSS 459 H + ++ V +KK+ + P G++ N EDWF IM +VL+ELFNMGDS Sbjct: 61 SKPHPESTSLDEKPVADRKKSMAGP------GDRKNQGKEDWFGIMGNVLAELFNMGDSG 114 Query: 460 ---RIRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAG------VPAMSPSSVDNSVA 612 +IR D K+S RKQ NP+IC S SASV+DS LA VP+MSP S DNSV Sbjct: 115 EFQKIRGFD--EKRSCRKQPNPKICVFSASASVNDSFLAAAPRLESVPSMSPPSGDNSVT 172 Query: 613 EVKES------RKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLI 774 E+KE+ +KQG SI +++ LQ +DLS +SR EVTIIDTS P WKSEKL+ Sbjct: 173 EMKETVNSLKPKKQGKVVSIAHDEDKLQ-----TDLSTYSRVEVTIIDTSCPVWKSEKLL 227 Query: 775 YRKGSVWKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSSFLP-SNSFKEADSEEH 951 +RKGSVWK+RDKK S NA + RKKRK D + G K++ FLP N +EA EE+ Sbjct: 228 FRKGSVWKVRDKKWKSRNASSFRKKRKANHSDKEAGGGKKKGKFFLPLVNITREAGPEEN 287 Query: 952 LISSNE 969 + +E Sbjct: 288 KVPLDE 293 >XP_010662932.1 PREDICTED: uncharacterized protein LOC100853133 [Vitis vinifera] Length = 329 Score = 210 bits (534), Expect = 1e-60 Identities = 133/289 (46%), Positives = 174/289 (60%), Gaps = 6/289 (2%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNPTSKVPTNPIPEENPTGSNHD 303 MLCSISTGKSGS WLDRLR++KGFP DLEHFL + + +PI + + S D Sbjct: 1 MLCSISTGKSGSKWLDRLRSAKGFPTGNDDDLEHFLTHRDPN-LSNSPITKPSDPKSISD 59 Query: 304 ATLDGEKSTVDRKKKTSSNPTPPMMIGEKNNEDWFDIMSSVLSELFNMGDSSRIRALDAR 483 +T EK DR + PP E ++WF IMS+VL+ELFNMGDS++I L Sbjct: 60 STCSDEKPVQDRSQ-------PP----ETGEKEWFGIMSNVLAELFNMGDSNQIPKLSG- 107 Query: 484 RKKSSRKQSNPRICPLSTSASVDDSCLAGVPAMSPSSVDNSVAEVKESR------KQGIA 645 KKSSRKQ+NP+IC LS+ D+ VPA +PSS DNS+ E+K+S QG Sbjct: 108 -KKSSRKQTNPKICLLSSVRQEDE-----VPATAPSSGDNSLTEMKDSNGEVKTVNQGKV 161 Query: 646 ESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRDKKRNSW 825 + ++ E+E + DLSA+SR+EVT+IDTS WK EKL++RK +VWK+RDKK S Sbjct: 162 DCLDAEEE-----KCNQDLSAYSRSEVTVIDTSCAVWKFEKLLFRKKNVWKVRDKKGKSR 216 Query: 826 NALACRKKRKLGQRDGDGGEEKQQKSSFLPSNSFKEADSEEHLISSNED 972 + RKKRK + D K+ 
K L SFKE + EE + SNE+ Sbjct: 217 S--IGRKKRKASECDEQLEARKKMK---LSVESFKERNEEESAMPSNEE 260 >XP_010922572.1 PREDICTED: uncharacterized protein LOC105045848 isoform X2 [Elaeis guineensis] Length = 285 Score = 198 bits (503), Expect = 1e-56 Identities = 135/304 (44%), Positives = 169/304 (55%), Gaps = 22/304 (7%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNPTSKVPTN------PIPEENP 285 MLCSIST +S S WLDRL TSKGF + LDL+HFL+ NP TN P PE P Sbjct: 1 MLCSISTSRSSSNWLDRLHTSKGFSIPADLDLDHFLSSNPNPDPNTNSCFPPLPPPETRP 60 Query: 286 TGSNHDATLDGEKSTVDRKKKTSSNPTPPMMI-GEKN---NEDWFDIMSSVLSELFNMGD 453 + + + +P+PP+ G K E FD+M S L+ELF MGD Sbjct: 61 SDA----------------WRRQPHPSPPVSAAGNKTAGGKEQIFDLMGSALAELFIMGD 104 Query: 454 SSRIRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGVPAM-------SPSSVDNSVA 612 S L A KKS+RKQ NP+ C S SAS+ + LAG PA SPSS +NSVA Sbjct: 105 GSAPATLRAS-KKSARKQPNPKACVPSISASIGGNFLAGAPAACRVTPATSPSSAENSVA 163 Query: 613 EVKESR-----KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIY 777 E K+SR K+G A S V+SDLS +S+ EVT+IDTS+P WKSEKLI+ Sbjct: 164 EAKKSRTKARRKRGTAGS-----------PVESDLSTYSKTEVTVIDTSSPGWKSEKLIF 212 Query: 778 RKGSVWKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSSFLPSNSFKEADSEEHLI 957 RKG VWK+RDKK WN CRKKRKLG + E+++++ P K +EH Sbjct: 213 RKGIVWKVRDKK--LWN--VCRKKRKLGLVERLISEKEKEQ----PLIDMKVPAGKEHSR 264 Query: 958 SSNE 969 S +E Sbjct: 265 SVDE 268 >XP_010922571.1 PREDICTED: uncharacterized protein LOC105045848 isoform X1 [Elaeis guineensis] Length = 332 Score = 198 bits (503), Expect = 5e-56 Identities = 135/304 (44%), Positives = 169/304 (55%), Gaps = 22/304 (7%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNPTSKVPTN------PIPEENP 285 MLCSIST +S S WLDRL TSKGF + LDL+HFL+ NP TN P PE P Sbjct: 1 MLCSISTSRSSSNWLDRLHTSKGFSIPADLDLDHFLSSNPNPDPNTNSCFPPLPPPETRP 60 Query: 286 TGSNHDATLDGEKSTVDRKKKTSSNPTPPMMI-GEKN---NEDWFDIMSSVLSELFNMGD 453 + + + +P+PP+ G K E FD+M S L+ELF MGD Sbjct: 61 SDA----------------WRRQPHPSPPVSAAGNKTAGGKEQIFDLMGSALAELFIMGD 104 Query: 454 
SSRIRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGVPAM-------SPSSVDNSVA 612 S L A KKS+RKQ NP+ C S SAS+ + LAG PA SPSS +NSVA Sbjct: 105 GSAPATLRAS-KKSARKQPNPKACVPSISASIGGNFLAGAPAACRVTPATSPSSAENSVA 163 Query: 613 EVKESR-----KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIY 777 E K+SR K+G A S V+SDLS +S+ EVT+IDTS+P WKSEKLI+ Sbjct: 164 EAKKSRTKARRKRGTAGS-----------PVESDLSTYSKTEVTVIDTSSPGWKSEKLIF 212 Query: 778 RKGSVWKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSSFLPSNSFKEADSEEHLI 957 RKG VWK+RDKK WN CRKKRKLG + E+++++ P K +EH Sbjct: 213 RKGIVWKVRDKK--LWN--VCRKKRKLGLVERLISEKEKEQ----PLIDMKVPAGKEHSR 264 Query: 958 SSNE 969 S +E Sbjct: 265 SVDE 268 >XP_008805185.1 PREDICTED: uncharacterized protein LOC103718240 [Phoenix dactylifera] Length = 330 Score = 196 bits (499), Expect = 2e-55 Identities = 131/264 (49%), Positives = 155/264 (58%), Gaps = 18/264 (6%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNPTSKVPTNPIPEENPTGSNHD 303 MLCSIST +S S WLDRL TSKGF + LDL+HFL+ NP NP P N Sbjct: 1 MLCSISTSRSSSNWLDRLHTSKGFSIPADLDLDHFLSSNP------NPDPNSNSCFPPPP 54 Query: 304 ATLDGEKSTVDRKKKTSSNPTPPMMI------GEKNNEDWFDIMSSVLSELFNMGDSSRI 465 + + S RK+ +P PP+ GEK E FD+MSS L+ELF MGD Sbjct: 55 PS-ETRPSCARRKQ----HPPPPVSASGSKTAGEK--EQIFDLMSSALAELFVMGDRPAP 107 Query: 466 RALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGVPAM-------SPSSVDNSVAEVKE 624 L A KKSSRKQ NP+ C S SAS+ + LAG PA SPSS +NSVAE K+ Sbjct: 108 GTLRAS-KKSSRKQPNPKACVPSVSASIGGNFLAGAPAACHVTPATSPSSAENSVAEAKK 166 Query: 625 SR-----KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGS 789 SR K+G A S V+SDLS +S+ EVT+IDTS+P WKSEKLI+RKG Sbjct: 167 SRTKARRKRGTAGS-----------PVESDLSTYSKTEVTVIDTSSPGWKSEKLIFRKGI 215 Query: 790 VWKIRDKKRNSWNALACRKKRKLG 861 VWK+RDKK WN CRKKRKLG Sbjct: 216 VWKVRDKK--LWN--VCRKKRKLG 235 >XP_010907408.1 PREDICTED: uncharacterized protein LOC105034083 [Elaeis guineensis] Length = 342 Score = 194 bits (493), Expect = 2e-54 Identities = 136/313 (43%), Positives = 171/313 (54%), Gaps = 24/313 (7%) Frame = +1 Query: 124 
MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL----NPNPTSKVPT-NPIPEENPT 288 MLCS +S S WLDRL TSKG + LDL+ FL NPNP S + +P P E Sbjct: 1 MLCS----RSSSNWLDRLHTSKGLSIPADLDLDQFLSSIPNPNPNSNPKSCSPRPPEARP 56 Query: 289 GSNHDATLDGEKSTVDRKKKTSSNPTPP-------MMIGEKNNEDWFDIMSSVLSELFNM 447 + G+K R++ P P + +GEK E FD+MSS L+ELF M Sbjct: 57 SDAPLSQPTGDKPAASRRRWKQQPPPPEEVAAGNKIFVGEK--EQLFDLMSSALAELFIM 114 Query: 448 GDSSRIRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGV-------PAMSPSSVDNS 606 D S L KKS+RKQ NP+ C S SAS+D S LAG PA SPSS DNS Sbjct: 115 RDHSATGILGPS-KKSARKQPNPKACVPSASASIDGSFLAGAAAACHVPPATSPSSADNS 173 Query: 607 VAEVKESR-----KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKL 771 VAE K+SR K+G S V+SDLS +S+ EVT+IDTS+P WKSEKL Sbjct: 174 VAEAKKSRTKARRKRGTTGS-----------PVESDLSTYSKTEVTVIDTSSPGWKSEKL 222 Query: 772 IYRKGSVWKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSSFLPSNSFKEADSEEH 951 I+RKG VWK+RDKK WN CRKKRK+G + GE+++++ P KE +EH Sbjct: 223 IFRKGMVWKVRDKK--LWN--VCRKKRKVGLVERLIGEKEKEQ----PLIDMKEPSPKEH 274 Query: 952 LISSNEDAELVDN 990 S +E +N Sbjct: 275 SGSVDEGGAHAEN 287 >XP_017700949.1 PREDICTED: uncharacterized protein LOC103718115 [Phoenix dactylifera] XP_008805010.2 PREDICTED: uncharacterized protein LOC103718115 [Phoenix dactylifera] Length = 258 Score = 183 bits (464), Expect = 3e-51 Identities = 126/271 (46%), Positives = 148/271 (54%), Gaps = 25/271 (9%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTG-LDLEHFLNPNPTSKVPTNP--IPEENPTGS 294 MLCS +S S WLDRL TSKGF + DL+HFL+ P TNP P Sbjct: 1 MLCS----RSSSNWLDRLHTSKGFCIPAADHDLDHFLSSIPNPNPNTNPKSCSPPRPETW 56 Query: 295 NHDATLD---GEKSTVDRKKKTSSNPTPPM-------MIGEKNNEDWFDIMSSVLSELFN 444 DA L EK R+++ P GEK E FD+MSS L+ELF Sbjct: 57 TSDAPLSQPPAEKPAAPRRRRKQQQQQQPQYAAGNKTFAGEK--EQLFDLMSSALAELFI 114 Query: 445 MGDSSRIRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGV-------PAMSPSSVDN 603 MGD S L A KKS+RKQ+NP+ C S SAS+D S LAG P SPSS DN Sbjct: 115 MGDRSATGILRAS-KKSARKQANPKACVPSASASIDGSFLAGAAAACHVPPVTSPSSADN 173 Query: 604 SVAEVKESR-----KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEK 768 SVAE K SR 
K+G S V+SDLS +S+ + T+IDTS+P WKSEK Sbjct: 174 SVAEAKNSRTKARWKRGTTGS-----------PVESDLSTYSKTDATVIDTSSPGWKSEK 222 Query: 769 LIYRKGSVWKIRDKKRNSWNALACRKKRKLG 861 LI+RKG VWK+RDK N WN CRKKRKLG Sbjct: 223 LIFRKGMVWKVRDK--NLWN--VCRKKRKLG 249 >XP_017974077.1 PREDICTED: uncharacterized protein LOC18605858 isoform X2 [Theobroma cacao] Length = 353 Score = 182 bits (462), Expect = 9e-50 Identities = 120/304 (39%), Positives = 174/304 (57%), Gaps = 20/304 (6%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSISTGKSGS WLDRLR+SKGFP LDL+HFL NPNP+ T+ N SN Sbjct: 1 MLCSISTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITD---ASNSPNSNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE-KNNEDWFDIMSSVLSELFNMGDSSRIRALD 477 ++T +K +RK P P ++ E +++WF IMS+VLSELFNMGD ++ Sbjct: 58 ESTHSNDKELQNRKA-----PPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSRFS 112 Query: 478 ARRKKSSRKQSNPRICPLSTS--------ASVDDSCL--AGVPAMSPSSVDNSVAEVKES 627 RKK+SRKQ+NP+IC + TS S DS +PA S +S+++ +E Sbjct: 113 --RKKTSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPA-STTSLNSKEEAKREW 169 Query: 628 RKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRD 807 +++G ++ E++ + + +L +SR+EVT+IDTS WK +KLI+R+ ++WK++D Sbjct: 170 KEEGDDYNVEEEEQEEEKGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRRKNIWKVKD 229 Query: 808 KKRNSWNALACRKKRKL-------GQRDGDGGE-EKQQKSSFLPSNSFKEADSEEHLISS 963 KK S + RKKRK D +GG K++K S S K+ +E + Sbjct: 230 KKGKS--RIVGRKKRKAPPPPPPPSYDDNNGGVWNKKRKISSSELRSLKDTSGKESGSPT 287 Query: 964 NEDA 975 N +A Sbjct: 288 NHNA 291 >EOY23701.1 Uncharacterized protein TCM_015509 isoform 1 [Theobroma cacao] Length = 353 Score = 182 bits (461), Expect = 1e-49 Identities = 120/304 (39%), Positives = 174/304 (57%), Gaps = 20/304 (6%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSISTGKSGS WLDRLR+SKGFP LDL+HFL NPNP+ T+ N SN Sbjct: 1 MLCSISTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITD---ASNSPNSNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE-KNNEDWFDIMSSVLSELFNMGDSSRIRALD 477 ++T +K +RK P P ++ E +++WF 
IMS+VLSELFNMGD ++ Sbjct: 58 ESTHSNDKELQNRKA-----PPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSRFS 112 Query: 478 ARRKKSSRKQSNPRICPLSTS--------ASVDDSCL--AGVPAMSPSSVDNSVAEVKES 627 RKK+SRKQ+NP+IC + TS S DS +PA S +S+++ +E Sbjct: 113 --RKKTSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPA-STTSLNSKEEAKREW 169 Query: 628 RKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRD 807 +++G ++ E++ + + +L +SR+EVT+IDTS WK +KLI+R+ ++WK++D Sbjct: 170 KEEGDDYNVEEEEQEEENGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRRKNIWKVKD 229 Query: 808 KKRNSWNALACRKKRKL-------GQRDGDGGE-EKQQKSSFLPSNSFKEADSEEHLISS 963 KK S + RKKRK D +GG K++K S S K+ +E + Sbjct: 230 KKGKS--RIVGRKKRKAPPPPPPPSYDDNNGGVWNKKRKISSSELRSLKDTSGKESGSPT 287 Query: 964 NEDA 975 N +A Sbjct: 288 NHNA 291 >XP_017974076.1 PREDICTED: uncharacterized protein LOC18605858 isoform X1 [Theobroma cacao] Length = 355 Score = 181 bits (459), Expect = 3e-49 Identities = 124/316 (39%), Positives = 177/316 (56%), Gaps = 27/316 (8%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSISTGKSGS WLDRLR+SKGFP LDL+HFL NPNP+ T+ N SN Sbjct: 1 MLCSISTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITD---ASNSPNSNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE-KNNEDWFDIMSSVLSELFNMGDSSRIRALD 477 ++T +K +RK P P ++ E +++WF IMS+VLSELFNMGD ++ Sbjct: 58 ESTHSNDKELQNRKA-----PPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSRFS 112 Query: 478 ARRKKSSRKQSNPRICPLSTS--------ASVDDSCL--AGVPAMSPSSVDNSVAEVKES 627 RKK+SRKQ+NP+IC + TS S DS +PA S +S+++ +E Sbjct: 113 --RKKTSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPA-STTSLNSKEEAKREW 169 Query: 628 RKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRD 807 +++G ++ E++ + + +L +SR+EVT+IDTS WK +KLI+R+ ++WK++D Sbjct: 170 KEEGDDYNVEEEEQEEEKGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRRKNIWKVKD 229 Query: 808 KKRNSWNALACRKKRKL-------GQRDGDGGE-EKQQKSSFLPSNSFKEADSEEHLISS 963 KK S + RKKRK D +GG K++K S S K+ +E + Sbjct: 230 KKGKS--RIVGRKKRKAPPPPPPPSYDDNNGGVWNKKRKISSSELRSLKDTSGKESGSPT 287 Query: 964 N-------EDAELVDN 990 N E ELV N Sbjct: 288 NHGQNAPGEKGELVCN 303 
>EOY23702.1 Uncharacterized protein TCM_015509 isoform 2 [Theobroma cacao] Length = 355 Score = 181 bits (458), Expect = 4e-49 Identities = 124/316 (39%), Positives = 177/316 (56%), Gaps = 27/316 (8%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSISTGKSGS WLDRLR+SKGFP LDL+HFL NPNP+ T+ N SN Sbjct: 1 MLCSISTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNPNPSDSPITD---ASNSPNSNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE-KNNEDWFDIMSSVLSELFNMGDSSRIRALD 477 ++T +K +RK P P ++ E +++WF IMS+VLSELFNMGD ++ Sbjct: 58 ESTHSNDKELQNRKA-----PPPEVVSSEPAGDKEWFGIMSNVLSELFNMGDQAQTSRFS 112 Query: 478 ARRKKSSRKQSNPRICPLSTS--------ASVDDSCL--AGVPAMSPSSVDNSVAEVKES 627 RKK+SRKQ+NP+IC + TS S DS +PA S +S+++ +E Sbjct: 113 --RKKTSRKQTNPKICIIKTSNVNTSEEQKSSSDSVRKDENIPA-STTSLNSKEEAKREW 169 Query: 628 RKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRD 807 +++G ++ E++ + + +L +SR+EVT+IDTS WK +KLI+R+ ++WK++D Sbjct: 170 KEEGDDYNVEEEEQEEENGKGERELLGYSRSEVTVIDTSCEVWKVDKLIFRRKNIWKVKD 229 Query: 808 KKRNSWNALACRKKRKL-------GQRDGDGGE-EKQQKSSFLPSNSFKEADSEEHLISS 963 KK S + RKKRK D +GG K++K S S K+ +E + Sbjct: 230 KKGKS--RIVGRKKRKAPPPPPPPSYDDNNGGVWNKKRKISSSELRSLKDTSGKESGSPT 287 Query: 964 N-------EDAELVDN 990 N E ELV N Sbjct: 288 NHGQNAPGEKGELVCN 303 >OMO95080.1 hypothetical protein CCACVL1_05581 [Corchorus capsularis] Length = 354 Score = 179 bits (453), Expect = 2e-48 Identities = 119/315 (37%), Positives = 176/315 (55%), Gaps = 30/315 (9%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSI TGKSGS WLDRLR+SKGFP LDL+HFL N NP+ TN N SN Sbjct: 1 MLCSIPTGKSGSNWLDRLRSSKGFPTGDNLDLDHFLTNSNPSDSPLTN---ASNSPNSNA 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE-KNNEDWFDIMSSVLSELFNMGDSSRIRALD 477 ++T + D++ + P P ++ GE +++WF IMS+VLSELFNMGD ++ Sbjct: 58 EST-----HSNDKQLQNPEPPPPEVISGEPAGDKEWFGIMSNVLSELFNMGDGAQSSRFS 112 Query: 478 ARRKKSSRKQSNPRICPLST---SASVDDSCLAG-------VPAMSPSSVDNSVAEVKES 627 +KK+SRKQ+NPRIC + T ++S + +G VPA S +S+++S +ES Sbjct: 113 
--KKKTSRKQTNPRICIIKTPTANSSEEQRSSSGSVRRDKNVPA-STTSLNSSQEAKRES 169 Query: 628 RKQGIAESI-NPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIR 804 +++G ++ EDE + +L FSR+EVT+IDTS WK++KLI+R+ ++WK++ Sbjct: 170 KEEGDNSNVAEDEDEEEGKEKGEKELLGFSRSEVTVIDTSCQVWKADKLIFRRKNIWKVK 229 Query: 805 DKKRNSW-----------------NALACRKKRKLGQRDGDGGEEKQQKSSFLPSNSFKE 933 DKK S N C KK+K+ + E + + P N ++ Sbjct: 230 DKKGKSRSFGRKKRKVPPPTSDDNNGGFCNKKQKISSSELRSLTEPRGRECGSPMNHGQK 289 Query: 934 ADSEEHLISSNEDAE 978 A ++ + NE AE Sbjct: 290 APGDKEEQACNETAE 304 >XP_009420008.1 PREDICTED: neural Wiskott-Aldrich syndrome protein isoform X2 [Musa acuminata subsp. malaccensis] Length = 303 Score = 177 bits (448), Expect = 3e-48 Identities = 118/259 (45%), Positives = 146/259 (56%), Gaps = 13/259 (5%) Frame = +1 Query: 124 MLCSISTG----KSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNPTSKVPTNPIPEENPTG 291 ML SIST KS S WL+RL +S+GF V L L+HFL+P+ S N P P Sbjct: 1 MLFSISTSTSNTKSTSNWLERLHSSRGFSVPAHLHLDHFLSPDSASNPSPNSPPPPPPPP 60 Query: 292 SNHDATLDG---EKSTVDRKKKTSSNPTPPMMIGEKNNEDWFDIMSSVLSELFNMGDSSR 462 + D E R++K P PP + FD++ VL+ELF MG Sbjct: 61 PPEEVLSDPPPPEPLANPRRRKKHLQPPPPPGASTDGKQRLFDLVGGVLAELFVMGGPPV 120 Query: 463 IRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGVPAMSP-SSVDNSVAEVKESR--- 630 +RAL A KKSSRKQ NP++C S SAS+D +PA SP SS DNSVAE K+SR Sbjct: 121 VRALKA--KKSSRKQPNPKVCVPSASASIDGC--RSLPATSPPSSADNSVAEAKKSRSKL 176 Query: 631 --KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIR 804 K+G A S VD DLSA+SR +VT+IDTS P WKSEK+I+RKG +WK+R Sbjct: 177 RRKRGTAGS-----------PVDLDLSAYSRTDVTVIDTSCPGWKSEKVIFRKGIMWKVR 225 Query: 805 DKKRNSWNALACRKKRKLG 861 DKK W RKKRK+G Sbjct: 226 DKK--VWT--LSRKKRKMG 240 >XP_009420007.1 PREDICTED: neural Wiskott-Aldrich syndrome protein isoform X1 [Musa acuminata subsp. 
malaccensis] Length = 335 Score = 177 bits (448), Expect = 6e-48 Identities = 118/259 (45%), Positives = 146/259 (56%), Gaps = 13/259 (5%) Frame = +1 Query: 124 MLCSISTG----KSGSTWLDRLRTSKGFPVQTGLDLEHFLNPNPTSKVPTNPIPEENPTG 291 ML SIST KS S WL+RL +S+GF V L L+HFL+P+ S N P P Sbjct: 1 MLFSISTSTSNTKSTSNWLERLHSSRGFSVPAHLHLDHFLSPDSASNPSPNSPPPPPPPP 60 Query: 292 SNHDATLDG---EKSTVDRKKKTSSNPTPPMMIGEKNNEDWFDIMSSVLSELFNMGDSSR 462 + D E R++K P PP + FD++ VL+ELF MG Sbjct: 61 PPEEVLSDPPPPEPLANPRRRKKHLQPPPPPGASTDGKQRLFDLVGGVLAELFVMGGPPV 120 Query: 463 IRALDARRKKSSRKQSNPRICPLSTSASVDDSCLAGVPAMSP-SSVDNSVAEVKESR--- 630 +RAL A KKSSRKQ NP++C S SAS+D +PA SP SS DNSVAE K+SR Sbjct: 121 VRALKA--KKSSRKQPNPKVCVPSASASIDGC--RSLPATSPPSSADNSVAEAKKSRSKL 176 Query: 631 --KQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIR 804 K+G A S VD DLSA+SR +VT+IDTS P WKSEK+I+RKG +WK+R Sbjct: 177 RRKRGTAGS-----------PVDLDLSAYSRTDVTVIDTSCPGWKSEKVIFRKGIMWKVR 225 Query: 805 DKKRNSWNALACRKKRKLG 861 DKK W RKKRK+G Sbjct: 226 DKK--VWT--LSRKKRKMG 240 >XP_017612899.1 PREDICTED: uncharacterized protein LOC108458135 isoform X2 [Gossypium arboreum] Length = 341 Score = 176 bits (445), Expect = 2e-47 Identities = 117/283 (41%), Positives = 152/283 (53%), Gaps = 8/283 (2%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSI T KSGS WLDRLR+SKGFP LDL+HFL NPNP+ TN N SN Sbjct: 1 MLCSIPTAKSGSNWLDRLRSSKGFPTGDNLDLDHFLANPNPSDSPITNASDSPN---SNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE--KNNEDWFDIMSSVLSELFNMGDSSRIRAL 474 ++T + + K P PP I + +++WF IMS+VLSELFNMG+ ++ Sbjct: 58 ESTHSNDGQLQNPK------PPPPEAISSDPEGDKEWFGIMSNVLSELFNMGEQAQTSRF 111 Query: 475 DARRKKSSRKQSNPRICPLST---SASVDDSCLAGVPAMSPSSVDNSVAEVKESRKQGIA 645 RKK+SRKQ+NPRIC T S D+ + + NS E KE+ + Sbjct: 112 S--RKKASRKQTNPRICTFKTPEEQKSSSDNVRNDKDILVSTRSSNSREESKEAGENNNV 169 Query: 646 ESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRDKKRNSW 825 E E E + +L +SR+EVT+IDTS P WK +KLI+R+ ++WK++DKK S Sbjct: 170 
EEDKEEGE----GEGERELKGYSRSEVTVIDTSCPVWKVDKLIFRRKNIWKVKDKKGKS- 224 Query: 826 NALACRKKRKLGQRD--GDGGEEKQQKSSFLPSNSFKEADSEE 948 RKKRK D G K+ K S L S KE E Sbjct: 225 -RTIGRKKRKPPPSDINNVGISNKKPKISSLELRSLKETSGRE 266 >XP_020100428.1 circumsporozoite protein isoform X3 [Ananas comosus] Length = 317 Score = 175 bits (443), Expect = 2e-47 Identities = 115/278 (41%), Positives = 153/278 (55%), Gaps = 17/278 (6%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL---------NPNPTSKVPTNPIPE 276 M CS+S KS S WLDRL SKGF + LDL+ FL NPNP NP P Sbjct: 1 MQCSLSPPKSSSNWLDRLHASKGFSISADLDLDRFLASSSSDPDPNPNPNPNPNPNPNPN 60 Query: 277 ENPTGSNHDATLDGEKSTVDRKKKTSSNPTPPMMIGEKNNEDWFDIMSSVLSELFNMGDS 456 NP S +ATL + R+++ + PP+ FD+MSSVL+ELF M Sbjct: 61 PNPP-SPRNATLPDPPTKRRRRRRPAPAANPPL----------FDLMSSVLAELFVMAGP 109 Query: 457 SRIRALDA------RRKKSSRKQSNPRICPLSTSASV--DDSCLAGVPAMSPSSVDNSVA 612 S +A+ ++KKSSRKQ+NP+ CP S SAS D + G +PSS DNSVA Sbjct: 110 SPSQAIGTPGERRKKKKKSSRKQANPKACPPSASASAAADGAACGG----APSSADNSVA 165 Query: 613 EVKESRKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSV 792 E E+ K + + + + DSDL+ + R +VT+IDTS+P WKS KLIYRKG Sbjct: 166 E--EATK-----GLKKKRAAAEGPSKDSDLAGYRRTDVTVIDTSSPGWKSVKLIYRKGKE 218 Query: 793 WKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSS 906 WK+R KK WN AC+KK++ G+ G+E+ + S Sbjct: 219 WKVRVKKH--WN--ACQKKKRTVGLVGEKGKEQSKLGS 252 >XP_020100426.1 circumsporozoite protein isoform X2 [Ananas comosus] Length = 319 Score = 175 bits (443), Expect = 2e-47 Identities = 115/278 (41%), Positives = 153/278 (55%), Gaps = 17/278 (6%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL---------NPNPTSKVPTNPIPE 276 M CS+S KS S WLDRL SKGF + LDL+ FL NPNP NP P Sbjct: 1 MQCSLSPPKSSSNWLDRLHASKGFSISADLDLDRFLASSSSDPDPNPNPNPNPNPNPNPN 60 Query: 277 ENPTGSNHDATLDGEKSTVDRKKKTSSNPTPPMMIGEKNNEDWFDIMSSVLSELFNMGDS 456 NP S +ATL + R+++ + PP+ FD+MSSVL+ELF M Sbjct: 61 PNPP-SPRNATLPDPPTKRRRRRRPAPAANPPL----------FDLMSSVLAELFVMAGP 109 Query: 457 SRIRALDA------RRKKSSRKQSNPRICPLSTSASV--DDSCLAGVPAMSPSSVDNSVA 612 S 
+A+ ++KKSSRKQ+NP+ CP S SAS D + G +PSS DNSVA Sbjct: 110 SPSQAIGTPGERRKKKKKSSRKQANPKACPPSASASAAADGAACGG----APSSADNSVA 165 Query: 613 EVKESRKQGIAESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSV 792 E E+ K + + + + DSDL+ + R +VT+IDTS+P WKS KLIYRKG Sbjct: 166 E--EATK-----GLKKKRAAAEGPSKDSDLAGYRRTDVTVIDTSSPGWKSVKLIYRKGKE 218 Query: 793 WKIRDKKRNSWNALACRKKRKLGQRDGDGGEEKQQKSS 906 WK+R KK WN AC+KK++ G+ G+E+ + S Sbjct: 219 WKVRVKKH--WN--ACQKKKRTVGLVGEKGKEQSKLGS 252 >XP_017612898.1 PREDICTED: uncharacterized protein LOC108458135 isoform X1 [Gossypium arboreum] Length = 346 Score = 176 bits (445), Expect = 2e-47 Identities = 117/283 (41%), Positives = 152/283 (53%), Gaps = 8/283 (2%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSI T KSGS WLDRLR+SKGFP LDL+HFL NPNP+ TN N SN Sbjct: 1 MLCSIPTAKSGSNWLDRLRSSKGFPTGDNLDLDHFLANPNPSDSPITNASDSPN---SNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE--KNNEDWFDIMSSVLSELFNMGDSSRIRAL 474 ++T + + K P PP I + +++WF IMS+VLSELFNMG+ ++ Sbjct: 58 ESTHSNDGQLQNPK------PPPPEAISSDPEGDKEWFGIMSNVLSELFNMGEQAQTSRF 111 Query: 475 DARRKKSSRKQSNPRICPLST---SASVDDSCLAGVPAMSPSSVDNSVAEVKESRKQGIA 645 RKK+SRKQ+NPRIC T S D+ + + NS E KE+ + Sbjct: 112 S--RKKASRKQTNPRICTFKTPEEQKSSSDNVRNDKDILVSTRSSNSREESKEAGENNNV 169 Query: 646 ESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRDKKRNSW 825 E E E + +L +SR+EVT+IDTS P WK +KLI+R+ ++WK++DKK S Sbjct: 170 EEDKEEGE----GEGERELKGYSRSEVTVIDTSCPVWKVDKLIFRRKNIWKVKDKKGKS- 224 Query: 826 NALACRKKRKLGQRD--GDGGEEKQQKSSFLPSNSFKEADSEE 948 RKKRK D G K+ K S L S KE E Sbjct: 225 -RTIGRKKRKPPPSDINNVGISNKKPKISSLELRSLKETSGRE 266 >KJB74655.1 hypothetical protein B456_012G000700 [Gossypium raimondii] Length = 308 Score = 174 bits (442), Expect = 2e-47 Identities = 116/283 (40%), Positives = 150/283 (53%), Gaps = 8/283 (2%) Frame = +1 Query: 124 MLCSISTGKSGSTWLDRLRTSKGFPVQTGLDLEHFL-NPNPTSKVPTNPIPEENPTGSNH 300 MLCSI T KSGS WLDRLR+SKGFP LDL+HFL NPNP+ TN N SN Sbjct: 1 
MLCSIPTAKSGSNWLDRLRSSKGFPTGDNLDLDHFLANPNPSDSPITNASDSPN---SNS 57 Query: 301 DATLDGEKSTVDRKKKTSSNPTPPMMIGE--KNNEDWFDIMSSVLSELFNMGDSSRIRAL 474 ++T + + K P PP I + +++WF IM +VLSELFNMG+ ++ Sbjct: 58 ESTHSNDGQLQNPK------PPPPEAISSDPEGDKEWFGIMRNVLSELFNMGEQAQTSRF 111 Query: 475 DARRKKSSRKQSNPRICPLST---SASVDDSCLAGVPAMSPSSVDNSVAEVKESRKQGIA 645 RKK+SRKQ+NPRIC T S D+ + + NS E KE + Sbjct: 112 S--RKKASRKQTNPRICTFKTPEEQKSSSDNVRNDKDTLVSTRSSNSREESKEEGENNNV 169 Query: 646 ESINPEDEILQAVTVDSDLSAFSRAEVTIIDTSAPAWKSEKLIYRKGSVWKIRDKKRNSW 825 E E E + +L +SR+EVT+IDTS P WK +KLI+R+ ++WK++DKK S Sbjct: 170 EEDKEEGE----GEGERELKGYSRSEVTVIDTSCPVWKVDKLIFRRKNIWKVKDKKGKS- 224 Query: 826 NALACRKKRKLGQRD--GDGGEEKQQKSSFLPSNSFKEADSEE 948 RKKRK D G K+ K S L S KE E Sbjct: 225 -RTIGRKKRKTPPSDLNNVGISNKKPKISSLELRSLKETSGRE 266