BLASTX nr result
ID: Chrysanthemum22_contig00017935
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum22_contig00017935
         (1629 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|KVI05297.1| hypothetical protein Ccrd_016381 [Cynara carduncu...   425   e-140
gb|OTG17750.1| hypothetical protein HannXRQ_Chr08g0215591 [Helia...   316   3e-97
ref|XP_021976689.1| uncharacterized protein LOC110872210 [Helian...   314   9e-97
gb|KVH97515.1| hypothetical protein Ccrd_000374 [Cynara carduncu...   248   2e-71
ref|XP_023748348.1| protein ecdysoneless homolog isoform X2 [Lac...   238   5e-70
ref|XP_023748345.1| protein ecdysoneless homolog isoform X1 [Lac...   238   9e-70
ref|XP_021997453.1| uncharacterized protein LOC110894540 [Helian...   195   5e-54
ref|XP_022733309.1| kinesin-related protein 8-like [Durio zibeth...   182   2e-47
gb|OMO72168.1| hypothetical protein COLO4_27800 [Corchorus olito...   177   2e-45
ref|XP_021292808.1| uncharacterized protein LOC110423037 [Herran...   169   2e-42
ref|XP_007045750.2| PREDICTED: uncharacterized protein LOC186101...   165   2e-41
gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, ...   165   2e-41
ref|XP_016724637.1| PREDICTED: uncharacterized protein LOC107936...   164   3e-41
ref|XP_017971961.1| PREDICTED: uncharacterized protein LOC186101...   165   6e-41
gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, ...   165   6e-41
ref|XP_017971960.1| PREDICTED: uncharacterized protein LOC186101...   165   7e-41
ref|XP_007045751.2| PREDICTED: uncharacterized protein LOC186101...
165 7e-41 ref|XP_012478809.1| PREDICTED: uncharacterized protein LOC105794... 162 2e-40 ref|XP_016667087.1| PREDICTED: uncharacterized protein LOC107887... 161 4e-40 ref|XP_016667085.1| PREDICTED: uncharacterized protein LOC107887... 161 9e-40 >gb|KVI05297.1| hypothetical protein Ccrd_016381 [Cynara cardunculus var. scolymus] Length = 547 Score = 425 bits (1093), Expect = e-140 Identities = 262/568 (46%), Positives = 318/568 (55%), Gaps = 97/568 (17%) Frame = +2 Query: 17 MKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEF 196 MKESP+RYTNPFLSDEEN KVNFWN+QELE SNIE+DLTN C D+ FR SL N PSEF Sbjct: 1 MKESPKRYTNPFLSDEENEKVNFWNNQELECSNIEDDLTNNCDDEKFR-SLAPCNLPSEF 59 Query: 197 CGNETKFYTDKNVMECELPELLVCYKENAFPVKDICVDEGIPHGERVLFDENNNEM---- 364 ET FYTDKNVMECELPELLVCYKE+AF VKDICVDEGIPHGER+LFDENN+E+ Sbjct: 60 FEKETDFYTDKNVMECELPELLVCYKESAFHVKDICVDEGIPHGERILFDENNHEIHCIS 119 Query: 365 --------------------LKHDQLKISPTEDYYMESKLYSETNGKIDTDLPVVEPTSD 484 LK + L+ S EDYYMES DLPV+EPT D Sbjct: 120 SPANEGKQDEIIEDSLHTQYLKPEGLRFSHMEDYYMES------------DLPVLEPTDD 167 Query: 485 YTDIGDSHDEVTDR----------------------KSDASSGIQEVDASFPVNGPIDDH 598 + D+GD+ DEV +R KSDASS QE+D + PV+ P++D+ Sbjct: 168 HMDVGDNRDEVIERNLDIQLVMGEKIRPSSSKDSCMKSDASSETQEIDTNLPVSEPVNDY 227 Query: 599 RSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXX---EEVIDSVAPNELKNISKD-- 763 I T+I V+ S+P EE +DS+ PNELKN SKD Sbjct: 228 -------IDTEIAVRCFHSSVPDKDSKDCEDDTAKECGPKEEALDSIVPNELKNTSKDDN 280 Query: 764 ---DGDDE---DE-----HSECAPEKL-----KVSQES-----DTVSKAIDNNGPD---- 868 DG E DE S+ P+ ++S E+ ++ + +DN + Sbjct: 281 GDDDGPSECSLDELKISAESDTVPKATDNYGPEISTETGEKQINSSANLLDNASTEQVVS 340 Query: 869 -----------INSSDGLLDNVSTEEVVYSXXXXXXXXXXXXXXXXXXXXXIP------- 994 + S LLD+V+ ++ P Sbjct: 341 ISVPSLQQDEPLPSLQFLLDSVNRARDIHQQPCQSAVEEVSERPVVENEAEEPGRSIQTT 400 Query: 995 ---NENMIDENDTLNLDNGKPPTTNGVYKVETPESAHEPSIDTQTEHSHQDVAPDNIATI 1165 NE+M++ N+ LNL+NGKP T+ G++ V+ PE+ HE I+ Q +H DVA DN+A + Sbjct: 401 DISNESMMEGNNALNLNNGKPATSGGLHGVQNPENVHELPIEAQGAPNHLDVASDNVAMV 460 
Query: 1166 NPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXXXXXXXXXXXFAFPILQNEWNS 1345 NPVQRGEG ESSFSVAGPVSG ITYSGPIAF FAFPILQ EWNS Sbjct: 461 NPVQRGEG-ESSFSVAGPVSGRITYSGPIAFSGSVSIRSDSSTTSTRSFAFPILQTEWNS 519 Query: 1346 SPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 SPVRMAKADRRRLQKHRGWRHG+LCCRF Sbjct: 520 SPVRMAKADRRRLQKHRGWRHGILCCRF 547 >gb|OTG17750.1| hypothetical protein HannXRQ_Chr08g0215591 [Helianthus annuus] Length = 598 Score = 316 bits (810), Expect = 3e-97 Identities = 230/586 (39%), Positives = 287/586 (48%), Gaps = 110/586 (18%) Frame = +2 Query: 2 EQKETMKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSN 181 E K MKESPRRYTNPFLSDEEN K+N W QELEHSNIE+D T Sbjct: 32 ELKGAMKESPRRYTNPFLSDEENEKLNLWT-QELEHSNIEHDTLTTLG------------ 78 Query: 182 HPSEFCGNETKFYTDKNVMECELPELLVCYKENAFPVKDICVDEGIPHGERVLFDENNNE 361 PS+FC ET+ YTDKNV ECE PELL+CYKE AF VKDICVDEGIPHGER LFDENNNE Sbjct: 79 -PSDFCEKETELYTDKNVTECEFPELLICYKEGAFHVKDICVDEGIPHGERFLFDENNNE 137 Query: 362 MLKHDQLKISPTEDYYMESKLYSETNGK-------------------------------- 445 ML ++L+ + EDYYMES L S T+ K Sbjct: 138 MLHPEKLRFTTMEDYYMESNLCSGTDVKVDTPLPVLEPSNDHMNIGDNRDEVDAALALNG 197 Query: 446 ------------IDTDLPV---VEPTSDYTDIGD----------------SHDEVTDRK- 529 ID ++PV +P DY + G S D++ D Sbjct: 198 PISDHRNMGKISIDMEIPVHDLQDPVPDYKECGYKEEVSDFVGPNESKEISKDDINDGNG 257 Query: 530 ------------SDASSGIQEVDASFP-VNGPIDDHR--SMDNI--GIGTQIPVQDSQVS 658 SD+ + + D P ++ +D++ S DN+ + T+ V +S +S Sbjct: 258 ISKSFADGFMVSSDSVTASKATDNDGPDISVQLDENMIYSSDNLLDNVSTEKVVSNSGIS 317 Query: 659 IPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDDG-DDEDEHSECAPEKLKVSQE-SD 832 +++ SV P+E N S + D+ + + + + Q+ S Sbjct: 318 SEQDQSVTVSKATENDGKDI--SVQPDEKMNFSSHNLLDNVSTNKVVSNSGISLEQDRSV 375 Query: 833 TVSKAIDNNGPDIN---------SSDGLLDNVSTEEVVYSXXXXXXXXXXXXXXXXXXXX 985 TVSKA + + P I SSD LL++VSTE+VV + Sbjct: 376 TVSKAAEIDDPGIAVQPDEKITYSSDNLLEDVSTEKVVLNSGPSLEQDRILPSLKSLLES 435 Query: 986 XI--------------PNE--NMIDENDTLNLDNGKPPTTNGVYKVETPESAHEP--SID 1111 P+E N ++ NDTL+L++ KP T G V E+ P SI Sbjct: 436 
IDQQPCQSPIEEISKRPDESGNAVEGNDTLHLNSIKPAT--GSEHVHNMENLEHPELSIV 493 Query: 1112 TQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXXXXX 1291 QD DN+A +N + RGEG ESSFSVA PV ITYSGPIAF Sbjct: 494 PNGAPKLQDSGSDNVAMVNQLHRGEG-ESSFSVAAPVPEHITYSGPIAFSGSTSLRSDSS 552 Query: 1292 XXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQNEWNSSPVRMAKADRRRLQKHRGW+HGLLCCRF Sbjct: 553 TTSTRSFAFPILQNEWNSSPVRMAKADRRRLQKHRGWKHGLLCCRF 598 >ref|XP_021976689.1| uncharacterized protein LOC110872210 [Helianthus annuus] Length = 562 Score = 314 bits (804), Expect = 9e-97 Identities = 228/581 (39%), Positives = 285/581 (49%), Gaps = 110/581 (18%) Frame = +2 Query: 17 MKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEF 196 MKESPRRYTNPFLSDEEN K+N W QELEHSNIE+D T PS+F Sbjct: 1 MKESPRRYTNPFLSDEENEKLNLWT-QELEHSNIEHDTLTTLG-------------PSDF 46 Query: 197 CGNETKFYTDKNVMECELPELLVCYKENAFPVKDICVDEGIPHGERVLFDENNNEMLKHD 376 C ET+ YTDKNV ECE PELL+CYKE AF VKDICVDEGIPHGER LFDENNNEML + Sbjct: 47 CEKETELYTDKNVTECEFPELLICYKEGAFHVKDICVDEGIPHGERFLFDENNNEMLHPE 106 Query: 377 QLKISPTEDYYMESKLYSETNGK------------------------------------- 445 +L+ + EDYYMES L S T+ K Sbjct: 107 KLRFTTMEDYYMESNLCSGTDVKVDTPLPVLEPSNDHMNIGDNRDEVDAALALNGPISDH 166 Query: 446 -------IDTDLPV---VEPTSDYTDIGD----------------SHDEVTDRK------ 529 ID ++PV +P DY + G S D++ D Sbjct: 167 RNMGKISIDMEIPVHDLQDPVPDYKECGYKEEVSDFVGPNESKEISKDDINDGNGISKSF 226 Query: 530 -------SDASSGIQEVDASFP-VNGPIDDHR--SMDNI--GIGTQIPVQDSQVSIPXXX 673 SD+ + + D P ++ +D++ S DN+ + T+ V +S +S Sbjct: 227 ADGFMVSSDSVTASKATDNDGPDISVQLDENMIYSSDNLLDNVSTEKVVSNSGISSEQDQ 286 Query: 674 XXXXXXXXXXXXEEVIDSVAPNELKNISKDDG-DDEDEHSECAPEKLKVSQE-SDTVSKA 847 +++ SV P+E N S + D+ + + + + Q+ S TVSKA Sbjct: 287 SVTVSKATENDGKDI--SVQPDEKMNFSSHNLLDNVSTNKVVSNSGISLEQDRSVTVSKA 344 Query: 848 IDNNGPDIN---------SSDGLLDNVSTEEVVYSXXXXXXXXXXXXXXXXXXXXXI--- 991 + + P I SSD LL++VSTE+VV + Sbjct: 345 AEIDDPGIAVQPDEKITYSSDNLLEDVSTEKVVLNSGPSLEQDRILPSLKSLLESIDQQP 404 Query: 992 
-----------PNE--NMIDENDTLNLDNGKPPTTNGVYKVETPESAHEP--SIDTQTEH 1126 P+E N ++ NDTL+L++ KP T G V E+ P SI Sbjct: 405 CQSPIEEISKRPDESGNAVEGNDTLHLNSIKPAT--GSEHVHNMENLEHPELSIVPNGAP 462 Query: 1127 SHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXXXXXXXXXX 1306 QD DN+A +N + RGEG ESSFSVA PV ITYSGPIAF Sbjct: 463 KLQDSGSDNVAMVNQLHRGEG-ESSFSVAAPVPEHITYSGPIAFSGSTSLRSDSSTTSTR 521 Query: 1307 XFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQNEWNSSPVRMAKADRRRLQKHRGW+HGLLCCRF Sbjct: 522 SFAFPILQNEWNSSPVRMAKADRRRLQKHRGWKHGLLCCRF 562 >gb|KVH97515.1| hypothetical protein Ccrd_000374 [Cynara cardunculus var. scolymus] Length = 555 Score = 248 bits (632), Expect = 2e-71 Identities = 190/577 (32%), Positives = 257/577 (44%), Gaps = 101/577 (17%) Frame = +2 Query: 2 EQKETMKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSN 181 E K TMKESP R+TNPFLSDEEN KVNFWN++ELE S +E D T + L Sbjct: 31 ELKATMKESPIRHTNPFLSDEENDKVNFWNNRELELSIVEEDFTTNLES--------LEK 82 Query: 182 HPSEFCGNETKFYTDKNVMECELPELLVCYKENAFPV-KDICVDEGIPHGERV------- 337 PS + Y DKNV ECELPEL+ CY E+ F V KDICVDEG+ HGE++ Sbjct: 83 APSYSLEKAKELYIDKNV-ECELPELIACYHESGFHVVKDICVDEGVSHGEKIGIHKVHR 141 Query: 338 ----------------LFDEN-NNEMLKHDQLKISPTEDYYMESKLYSETNGKIDTDLPV 466 + +E + LKH Q SP E+ + L+S T K+ D P+ Sbjct: 142 GLSCHPVTVNEDKHDDMIEEGLGTQFLKHQQSISSPVEECGKNTDLFSVTKEKLHADFPI 201 Query: 467 VEPTSDYTDIGDSHDEVTDRK----------------------SDASSGIQE-VDASFPV 577 + + +T+IG +HD++ R SD+S G +E D + + Sbjct: 202 PKHSISHTNIGYNHDDMIGRNLETQFLKHEKTRSSSGEDDYTSSDSSFGTKEKTDTNVFI 261 Query: 578 NGPIDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNIS 757 P DDHR M N T+I Q+S + SV +EL+N+S Sbjct: 262 TEPTDDHRHMGNC-YDTKIHGQNSSQG---------KDCKEDATKVANHSVITDELENVS 311 Query: 758 KDDG--------------DDEDEHS--ECAPEKLKVSQES--DTVSKAIDNN-------- 859 +D + HS C+P+KL + E D+ ++DN+ Sbjct: 312 EDSNGPYNCASDKLPLFVESSTAHSTENCSPDKLMQTGEENIDSSFNSLDNSSREQFVSC 371 Query: 860 ---------------------------GPDINSSDGLLDNVSTEEVVYSXXXXXXXXXXX 958 G ++ + G ++ + + S Sbjct: 372 
STLSLNQDQLPTSIKNWESSNNGVNDVGQQLSEAQGPVEEILKRHLAGSEAEELVRNSHA 431 Query: 959 XXXXXXXXXXIPNENMIDENDTLNLDNGKPPTTNGVYKVETPESAHEPSIDTQTEHSHQD 1138 E ++EN T N DN KP T + +PE HE D Q+ +HQ+ Sbjct: 432 INTS--------TEIKMEENITSNSDNVKPATFS------SPECIHELPPDMQSAANHQE 477 Query: 1139 VAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXXXXXXXXXXXFAF 1318 DN+ N +Q G GGESSFSVAG +SGLITYSGPIA Sbjct: 478 ETSDNVTESNQLQHG-GGESSFSVAGTISGLITYSGPIASSGS----------------- 519 Query: 1319 PILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 ILQ EWNSSPVRMAK D+RR +KHRGW+ L+CC+F Sbjct: 520 -ILQTEWNSSPVRMAKVDQRRSKKHRGWKQTLMCCKF 555 >ref|XP_023748348.1| protein ecdysoneless homolog isoform X2 [Lactuca sativa] Length = 373 Score = 238 bits (608), Expect = 5e-70 Identities = 190/502 (37%), Positives = 224/502 (44%), Gaps = 26/502 (5%) Frame = +2 Query: 2 EQKETMKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSN 181 ++K TMKESP+RYTNPFLSDEEN KVN N QE+ Sbjct: 10 KEKNTMKESPKRYTNPFLSDEENEKVNNNNHQEV-------------------------- 43 Query: 182 HPSEFCGNETKFYTDKNVME----CELPELLVCYKENAFPVKDICVDEGIPHGERVLFDE 349 YTDKNVME CELPELLVCY + F VKDICVD+GIPH Sbjct: 44 ------------YTDKNVMELELECELPELLVCYNDGGFHVKDICVDDGIPH-------- 83 Query: 350 NNNEMLKHDQLKISPTEDYYMESKLYSETNGKIDTDLPVVEPTSDYTDIGDSHD-EVTDR 526 + ++ + + L SP EDYYME ++ DIGD+ D V D Sbjct: 84 DKHDDIINQSLIFSPMEDYYMEE-------------------SNHVVDIGDNLDISVEDL 124 Query: 527 KSDASSGIQEVDASFPVNGPIDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXX 706 ++ E D DD D GT+ Sbjct: 125 QNSTPDKDCEDD---------DDDDDDDAKECGTK------------------------- 150 Query: 707 XEEVIDSVAPNELKNISKDDGDDEDEHSECAPEKLKVSQESDTVSKAIDNNGP-----DI 871 EE I+S+ PNE+KNISKDD SECAPE+LK + ESDTV K IDN P D+ Sbjct: 151 EEEDIESIDPNEIKNISKDDNH---VISECAPEELKCA-ESDTVPKGIDNYVPENQQMDL 206 Query: 872 NS---------------SDGLLDNV-STEEVVYSXXXXXXXXXXXXXXXXXXXXXIPNEN 1003 NS D L ++ S E + + Sbjct: 207 NSVFLDDKDKDKDKDKDEDRPLPSLKSLLESINGVDDKDQHPSQSCVEGNEGEEEVSTS- 265 Query: 1004 MIDENDTLNLDNGKPPTTNGVYKVETPESAHEPSIDTQTEHSHQDVAPDNIATINPVQRG 1183 I+ N+TLNL NGK +NG++ V 
+RG Sbjct: 266 -IEGNNTLNLTNGKTVISNGLHDVH--------------------------------ERG 292 Query: 1184 EGGESSFSVAGPVSGLITYSGPIAFXXXXXXXXXXXXXXXXXFAFPILQNEWNSSPVRMA 1363 EG ESSFSVAGPVSG I YSG IAF FAFPILQ EWNSSPVRMA Sbjct: 293 EG-ESSFSVAGPVSGRINYSGQIAFSGSISLRSDSSTTSTRSFAFPILQTEWNSSPVRMA 351 Query: 1364 KADRRRLQKHRGWRHGLLCCRF 1429 KADRRRLQKHRGWR+GLLCCRF Sbjct: 352 KADRRRLQKHRGWRNGLLCCRF 373 >ref|XP_023748345.1| protein ecdysoneless homolog isoform X1 [Lactuca sativa] ref|XP_023748346.1| protein ecdysoneless homolog isoform X1 [Lactuca sativa] ref|XP_023748347.1| protein ecdysoneless homolog isoform X1 [Lactuca sativa] gb|PLY62731.1| hypothetical protein LSAT_8X37061 [Lactuca sativa] Length = 393 Score = 238 bits (608), Expect = 9e-70 Identities = 190/502 (37%), Positives = 224/502 (44%), Gaps = 26/502 (5%) Frame = +2 Query: 2 EQKETMKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSN 181 ++K TMKESP+RYTNPFLSDEEN KVN N QE+ Sbjct: 30 KEKNTMKESPKRYTNPFLSDEENEKVNNNNHQEV-------------------------- 63 Query: 182 HPSEFCGNETKFYTDKNVME----CELPELLVCYKENAFPVKDICVDEGIPHGERVLFDE 349 YTDKNVME CELPELLVCY + F VKDICVD+GIPH Sbjct: 64 ------------YTDKNVMELELECELPELLVCYNDGGFHVKDICVDDGIPH-------- 103 Query: 350 NNNEMLKHDQLKISPTEDYYMESKLYSETNGKIDTDLPVVEPTSDYTDIGDSHD-EVTDR 526 + ++ + + L SP EDYYME ++ DIGD+ D V D Sbjct: 104 DKHDDIINQSLIFSPMEDYYMEE-------------------SNHVVDIGDNLDISVEDL 144 Query: 527 KSDASSGIQEVDASFPVNGPIDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXX 706 ++ E D DD D GT+ Sbjct: 145 QNSTPDKDCEDD---------DDDDDDDAKECGTK------------------------- 170 Query: 707 XEEVIDSVAPNELKNISKDDGDDEDEHSECAPEKLKVSQESDTVSKAIDNNGP-----DI 871 EE I+S+ PNE+KNISKDD SECAPE+LK + ESDTV K IDN P D+ Sbjct: 171 EEEDIESIDPNEIKNISKDDNH---VISECAPEELKCA-ESDTVPKGIDNYVPENQQMDL 226 Query: 872 NS---------------SDGLLDNV-STEEVVYSXXXXXXXXXXXXXXXXXXXXXIPNEN 1003 NS D L ++ S E + + Sbjct: 227 NSVFLDDKDKDKDKDKDEDRPLPSLKSLLESINGVDDKDQHPSQSCVEGNEGEEEVSTS- 285 Query: 1004 MIDENDTLNLDNGKPPTTNGVYKVETPESAHEPSIDTQTEHSHQDVAPDNIATINPVQRG 1183 I+ 
N+TLNL NGK +NG++ V +RG Sbjct: 286 -IEGNNTLNLTNGKTVISNGLHDVH--------------------------------ERG 312 Query: 1184 EGGESSFSVAGPVSGLITYSGPIAFXXXXXXXXXXXXXXXXXFAFPILQNEWNSSPVRMA 1363 EG ESSFSVAGPVSG I YSG IAF FAFPILQ EWNSSPVRMA Sbjct: 313 EG-ESSFSVAGPVSGRINYSGQIAFSGSISLRSDSSTTSTRSFAFPILQTEWNSSPVRMA 371 Query: 1364 KADRRRLQKHRGWRHGLLCCRF 1429 KADRRRLQKHRGWR+GLLCCRF Sbjct: 372 KADRRRLQKHRGWRNGLLCCRF 393 >ref|XP_021997453.1| uncharacterized protein LOC110894540 [Helianthus annuus] ref|XP_021997454.1| uncharacterized protein LOC110894540 [Helianthus annuus] gb|OTG04681.1| hypothetical protein HannXRQ_Chr12g0365071 [Helianthus annuus] Length = 327 Score = 195 bits (495), Expect = 5e-54 Identities = 105/169 (62%), Positives = 119/169 (70%) Frame = +2 Query: 17 MKESPRRYTNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEF 196 MKE+P+ YTNPFLSDEEN KVN W QELEHSNIE DD SL SEF Sbjct: 1 MKETPKIYTNPFLSDEENEKVNLWT-QELEHSNIE--------DDKLIDSLA----SSEF 47 Query: 197 CGNETKFYTDKNVMECELPELLVCYKENAFPVKDICVDEGIPHGERVLFDENNNEMLKHD 376 ET+ YTDKNVMECELPE LVCYKE AF VKDICVDEGIP ER++FDENNNE +KH+ Sbjct: 48 FKKETEVYTDKNVMECELPESLVCYKEGAFHVKDICVDEGIPCEERIVFDENNNETMKHE 107 Query: 377 QLKISPTEDYYMESKLYSETNGKIDTDLPVVEPTSDYTDIGDSHDEVTD 523 L S EDYYMES + SE NGKIDT L V+EP+ D+ +++EV D Sbjct: 108 PLTFSTVEDYYMESNVCSEVNGKIDTGLTVLEPSDDH----GTNEEVID 152 >ref|XP_022733309.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733310.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733312.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733313.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733314.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733315.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733316.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733317.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733318.1| kinesin-related protein 8-like [Durio zibethinus] ref|XP_022733319.1| kinesin-related protein 
8-like [Durio zibethinus] Length = 515 Score = 182 bits (462), Expect = 2e-47 Identities = 145/474 (30%), Positives = 221/474 (46%), Gaps = 19/474 (4%) Frame = +2 Query: 65 ENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETKFYTDKNVM 238 EN + N W +L++S ND N + FR + + H S+ + E+ FY DK+VM Sbjct: 63 ENTR-NGWPASKLDYSMSVNDFVNG-NEKEFRDFVTSNTHSSKNMDSFQESVFYLDKSVM 120 Query: 239 ECELPELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYME 415 EC+LPEL+VCYKE+ + V KDIC+DEG+P ++ LFD + ++ P + + Sbjct: 121 ECQLPELVVCYKESTYNVVKDICIDEGVPTQDKFLFDSGVDVKSDYN----FPPSEKDQD 176 Query: 416 SKLYSETNGKIDTDLPVVEPTSDYTDIGDSHDEV--TDRKSDASSGIQEVDASFPVNGPI 589 SKL E +ID L VV + + G D+ +++K DA + ++++ S N Sbjct: 177 SKLMKEKL-EIDMSLQVVYVSPEENQYGKDIDDECGSNKKLDADTRMRDISFSLEENE-- 233 Query: 590 DDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXE----------EVIDSVAPN 739 N GI Q +D ++ E + +V Sbjct: 234 ------SNKGIPNQYDSKDLMLTREMKDDAMKVVTDDVSKELFTLGELLSMPELSAVKSK 287 Query: 740 ELKNISKDDGDDEDEHSECAPEKLKVSQESDTVSKAIDNNGPDINSSDGLLDNVSTEEVV 919 + + K DG ++ + +++ V+ + ++ N+ + S L + + E Sbjct: 288 AMSSDCKSDGVEQQSFQNSSEKEVMVTPPLVSAAEESYNSSEEAILSAPALVSAAEESDS 347 Query: 920 YSXXXXXXXXXXXXXXXXXXXXXIPNENMID---ENDTLNLD-NGKPPTTNGVYKVETPE 1087 + NE D E ++ +D + PT++ K E P Sbjct: 348 GKGEATLISPAQASASEESTSCSLVNEVSSDSKLETGSITVDYDSSAPTSS---KDECPH 404 Query: 1088 SAHEPSIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXX 1267 + ++T + +D A + N +QRG G ESSFS +GPV+GLI+YSGPIA+ Sbjct: 405 NLDHGPLETGSTPKLEDTADQPFS--NNLQRGNG-ESSFSASGPVTGLISYSGPIAYSGS 461 Query: 1268 XXXXXXXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWNSSPVRMAKADRR QKHR WR GLLCCRF Sbjct: 462 LSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHYQKHRCWRQGLLCCRF 515 >gb|OMO72168.1| hypothetical protein COLO4_27800 [Corchorus olitorius] Length = 503 Score = 177 bits (448), Expect = 2e-45 Identities = 137/469 (29%), Positives = 209/469 (44%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETKFYTDKNVMECELPEL 259 W +L+ S N+ N + FR + +H S+ + + 
FY DK+VMEC+LPEL Sbjct: 80 WPASKLDSSMHVNEFGNG-NEKEFRDFVTSDSHSSKKMDSLQGSVFYLDKSVMECDLPEL 138 Query: 260 LVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLYSET 436 +VCYKEN + V KDIC+DEG+P ++ LF+ + NE ++ KL E Sbjct: 139 VVCYKENTYHVVKDICIDEGVPTQDKFLFESDMNE---------KNNCNFLPSCKLVEEK 189 Query: 437 NGKIDTDLPVVEPTSDY-------TDIGDSHDEVTDRKSDASSGIQEVDASFPVNGPIDD 595 D+P+ P D + D R+ +++ G Q F + + D Sbjct: 190 Q-----DIPISSPEDQSGKNIDNGCDFNEKLDADACRQDESNKGNQCDFEDFMMKRKVKD 244 Query: 596 HRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDDGDD 775 + +IP + +V + + K DG + Sbjct: 245 ----------------EEMKTIPDDLSKELFTLGELLSMTELSTVTSKAMSSECKSDGIE 288 Query: 776 EDEHSECAPEKLKVSQESDTVSKAIDNN------GPDINSSDGLLDNVSTEEVVYSXXXX 937 + + +++ V+ S V++ +NN P + S+ G DN + + S Sbjct: 289 QQSIQSSSEKEVNVNPPSVFVAEESNNNTEAMLDAPGLISAAGESDNGKEDAIPISTSQV 348 Query: 938 XXXXXXXXXXXXXXXXXIPNENMID-ENDTLNLDNGKPPTTNGVYK----VETPESAHEP 1102 + ++N ++ E+ T N + P + + E PE+ P Sbjct: 349 SVSEESTNNTLSNE---VSDDNRLETESITFNFGSSAPTNSKDECRPNLNCELPETGTTP 405 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 1282 ++ D A I+ I +QRG G E+SFS +GPV+GLI+YSGPIA+ Sbjct: 406 KLE--------DTADQPISNI--LQRGTG-ETSFSASGPVTGLISYSGPIAYSGSLSLRS 454 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFP+LQ+EWNSSPVRMAKADRR +KHRGWRHGL CCRF Sbjct: 455 DSSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHYRKHRGWRHGLFCCRF 503 >ref|XP_021292808.1| uncharacterized protein LOC110423037 [Herrania umbratica] ref|XP_021292810.1| uncharacterized protein LOC110423037 [Herrania umbratica] ref|XP_021292811.1| uncharacterized protein LOC110423037 [Herrania umbratica] ref|XP_021292812.1| uncharacterized protein LOC110423037 [Herrania umbratica] Length = 527 Score = 169 bits (427), Expect = 2e-42 Identities = 144/474 (30%), Positives = 219/474 (46%), Gaps = 26/474 (5%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETKFYTDKNVMECELPEL 259 W +L+ S ND N + R + ++H + + + FY DK+VMECELPEL Sbjct: 81 
WPASKLDCSISVNDFANG-NEKEVRHFMTSNSHSLKNMDSFQNSVFYLDKSVMECELPEL 139 Query: 260 LVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLYSET 436 +VCYKE+ + V KDIC+DEG+P ++ LF+ +E + + L +D SKL E Sbjct: 140 VVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKDQD----SKLMKE- 194 Query: 437 NGKIDTDLPV--VEPTSDYTDIGDSHDEV--TDRKSDASSGIQEVDASFPVN----GPID 592 K++TD+ + V + + G D +++K D + +Q+V S N G ++ Sbjct: 195 --KLETDMCMQDVSMSPEENQSGKDIDSECGSNKKLDTDTCMQDVSLSLEKNESNKGILN 252 Query: 593 DHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDDGD 772 S D + T+ D+ + + V P + + K DG Sbjct: 253 QCDSKDLML--TREVKDDAMKMVTDDVSKELFTLGELLSMPELSKVNPEAMSSDCKSDGI 310 Query: 773 DEDEHSEC-------------APEKLKVSQESDTVS-KAIDNNGPDINSSDGLLDNVSTE 910 ++ A E+ K S E VS A+ + +++S G +S Sbjct: 311 EQQSFQSSSEKEVMVLPPLVSAVEESKNSNEEAIVSVPALVSTTEELDSGKGEASLISPA 370 Query: 911 EVVYSXXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPE 1087 +V S + +N ++ T N D+ P ++ K E Sbjct: 371 QVSTSEESTGSSLVNE----------VSCDNKLETGSITFNFDSSAPTSS----KDECHH 416 Query: 1088 SAHEPSIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXX 1267 + + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 417 NLDSEPLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGS 473 Query: 1268 XXXXXXXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWNSSPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 474 LSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKHKGWRHGLLCCRF 527 >ref|XP_007045750.2| PREDICTED: uncharacterized protein LOC18610175 isoform X4 [Theobroma cacao] ref|XP_007045752.2| PREDICTED: uncharacterized protein LOC18610175 isoform X4 [Theobroma cacao] Length = 470 Score = 165 bits (417), Expect = 2e-41 Identities = 139/469 (29%), Positives = 220/469 (46%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPS-----EFCGNETKFYTDKNVMECEL 250 W +L+ S ND N ++ + SN PS F + FY DK+VMECEL Sbjct: 24 WPALKLDCSISVNDFANG--NEKEVRDFVTSNSPSLKNMDSF--QNSVFYLDKSVMECEL 79 Query: 251 PELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLY 427 PEL+VCYKE+ + 
V KDIC+DEG+P ++ LF+ +E + + L +D S+L Sbjct: 80 PELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQD----SQLM 135 Query: 428 SETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASFPVNGP--- 586 +E K++TD+ + + P + + ++ +++K D + +Q+V S N Sbjct: 136 TE---KLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKG 192 Query: 587 IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDD 766 I + ++ + T++ D+ + + V + + K D Sbjct: 193 IPNQCDSKDLML-TRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSD 251 Query: 767 GDDEDEHSECAPEKLKVSQES-DTVSKAIDNNGPDINSSDGL------LDNVSTEEVVYS 925 G ++ + +++ V V ++ D+N I S L LD+ E ++ S Sbjct: 252 GIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILIS 311 Query: 926 XXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPESAHEP 1102 + +N ++ T NLD+ P ++ K E + Sbjct: 312 PAQVSTPEESTSSSLVNE---VSYDNKLETGSITFNLDSSAPTSS----KDECHHNLDSE 364 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 1282 + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 365 PLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGSLSLRS 421 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWN SPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 422 DSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470 >gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gb|EOY01583.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gb|EOY01584.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] Length = 470 Score = 165 bits (417), Expect = 2e-41 Identities = 139/469 (29%), Positives = 220/469 (46%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPS-----EFCGNETKFYTDKNVMECEL 250 W +L+ S ND N ++ + SN PS F + FY DK+VMECEL Sbjct: 24 WPALKLDCSISVNDFANG--NEKEVRDFVTSNSPSLKNMDSF--QNSVFYLDKSVMECEL 79 Query: 251 PELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLY 427 PEL+VCYKE+ + V KDIC+DEG+P ++ LF+ +E + + L +D S+L Sbjct: 80 
PELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQD----SQLM 135 Query: 428 SETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASFPVNGP--- 586 +E K++TD+ + + P + + ++ +++K D + +Q+V S N Sbjct: 136 TE---KLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKG 192 Query: 587 IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDD 766 I + ++ + T++ D+ + + V + + K D Sbjct: 193 IPNQCDSKDLML-TRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSD 251 Query: 767 GDDEDEHSECAPEKLKVSQES-DTVSKAIDNNGPDINSSDGL------LDNVSTEEVVYS 925 G ++ + +++ V V ++ D+N I S L LD+ E ++ S Sbjct: 252 GIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILIS 311 Query: 926 XXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPESAHEP 1102 + +N ++ T NLD+ P ++ K E + Sbjct: 312 PAQVSTSEESTSSSLVNE---VSYDNKLETGSITFNLDSSAPTSS----KDECHHNLDSE 364 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 1282 + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 365 PLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGSLSLRS 421 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWN SPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 422 DSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 470 >ref|XP_016724637.1| PREDICTED: uncharacterized protein LOC107936401 isoform X2 [Gossypium hirsutum] ref|XP_016724645.1| PREDICTED: uncharacterized protein LOC107936401 isoform X2 [Gossypium hirsutum] Length = 462 Score = 164 bits (416), Expect = 3e-41 Identities = 140/480 (29%), Positives = 226/480 (47%), Gaps = 17/480 (3%) Frame = +2 Query: 41 TNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETK 214 T+P L E+ + W +L+ S ND +N + R + ++H + G+ ++ Sbjct: 11 TDPMLYLEKTG--DGWPASKLDCSMSVNDFSNG-NEKEARDFVPPNSHSLKNRGSFQDSV 67 Query: 215 FYTDKNVMECELPELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKIS 391 FY DK+VMEC LPEL+VCYKE+A+ V KDIC+DEG+P ++ LFD + + K Sbjct: 68 FYLDKSVMECALPELVVCYKESAYHVVKDICIDEGVPTQDKFLFD---SVVDKKSDCNFL 124 Query: 392 PTEDYYMESKLYSETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEV 559 P+E+ +SKL 
E K+++D+ + + P + D ++ +++K+ + Q++ Sbjct: 125 PSEED-QDSKLLKE---KLESDISMQAGSMYPEENQMDKDIDNERDSNKKTISDKCTQDI 180 Query: 560 DASFPVNGP---IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSV 730 S N P I +++ + ++ +++ E + +V Sbjct: 181 SLSLEENEPKNRIPSQCDTEDLILSRKMTDDTMKMARDDVSKELFTLGELLSMPE-LSTV 239 Query: 731 APNELKNISKDDG-------DDEDEHSECAPEKLKVSQESDTVSKAIDNNGPDINSSDGL 889 P + + K DG + +++ P + +ESD K + S Sbjct: 240 KPKAMSSNCKSDGIKQQCFQNSKEKEVMVMPPLVSADKESDNSCKETILSASAPVSVAEE 299 Query: 890 LDNVSTEEVVYSXXXXXXXXXXXXXXXXXXXXXIPNENMIDENDTLNLDNGKPPTTNGVY 1069 +D+ E ++S + ++ D+ L + K + + Sbjct: 300 MDSRKEEATMFSPVTSSSLVNEVSDDSK-----LAARSIAFGFDSSALTSSKDEGCHNLD 354 Query: 1070 KVETPESAHEPSIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGP 1249 + E E+ H P ++ D+A + N +Q G G ESSFS AG V+GLI+YSGP Sbjct: 355 R-EALETGHTPKLE--------DIADQ--PSSNNLQCGNG-ESSFSAAGLVTGLISYSGP 402 Query: 1250 IAFXXXXXXXXXXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 IA+ FAFPILQ+EWNSSPVRMAKADRR +KHRGWR GLLCCRF Sbjct: 403 IAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKHRGWRQGLLCCRF 462 >ref|XP_017971961.1| PREDICTED: uncharacterized protein LOC18610175 isoform X3 [Theobroma cacao] Length = 527 Score = 165 bits (417), Expect = 6e-41 Identities = 139/469 (29%), Positives = 220/469 (46%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPS-----EFCGNETKFYTDKNVMECEL 250 W +L+ S ND N ++ + SN PS F + FY DK+VMECEL Sbjct: 81 WPALKLDCSISVNDFANG--NEKEVRDFVTSNSPSLKNMDSF--QNSVFYLDKSVMECEL 136 Query: 251 PELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLY 427 PEL+VCYKE+ + V KDIC+DEG+P ++ LF+ +E + + L +D S+L Sbjct: 137 PELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQD----SQLM 192 Query: 428 SETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASFPVNGP--- 586 +E K++TD+ + + P + + ++ +++K D + +Q+V S N Sbjct: 193 TE---KLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKG 249 Query: 587 IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDD 766 I + ++ + T++ D+ + + V + + K D Sbjct: 250 
IPNQCDSKDLML-TRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSD 308 Query: 767 GDDEDEHSECAPEKLKVSQES-DTVSKAIDNNGPDINSSDGL------LDNVSTEEVVYS 925 G ++ + +++ V V ++ D+N I S L LD+ E ++ S Sbjct: 309 GIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILIS 368 Query: 926 XXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPESAHEP 1102 + +N ++ T NLD+ P ++ K E + Sbjct: 369 PAQVSTPEESTSSSLVNE---VSYDNKLETGSITFNLDSSAPTSS----KDECHHNLDSE 421 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 1282 + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 422 PLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGSLSLRS 478 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWN SPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 479 DSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527 >gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1 [Theobroma cacao] Length = 527 Score = 165 bits (417), Expect = 6e-41 Identities = 139/469 (29%), Positives = 220/469 (46%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPS-----EFCGNETKFYTDKNVMECEL 250 W +L+ S ND N ++ + SN PS F + FY DK+VMECEL Sbjct: 81 WPALKLDCSISVNDFANG--NEKEVRDFVTSNSPSLKNMDSF--QNSVFYLDKSVMECEL 136 Query: 251 PELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLY 427 PEL+VCYKE+ + V KDIC+DEG+P ++ LF+ +E + + L +D S+L Sbjct: 137 PELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQD----SQLM 192 Query: 428 SETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASFPVNGP--- 586 +E K++TD+ + + P + + ++ +++K D + +Q+V S N Sbjct: 193 TE---KLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKG 249 Query: 587 IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDD 766 I + ++ + T++ D+ + + V + + K D Sbjct: 250 IPNQCDSKDLML-TRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSD 308 Query: 767 GDDEDEHSECAPEKLKVSQES-DTVSKAIDNNGPDINSSDGL------LDNVSTEEVVYS 925 G ++ + +++ V V ++ D+N I S L LD+ E ++ S Sbjct: 309 GIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILIS 
368 Query: 926 XXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPESAHEP 1102 + +N ++ T NLD+ P ++ K E + Sbjct: 369 PAQVSTSEESTSSSLVNE---VSYDNKLETGSITFNLDSSAPTSS----KDECHHNLDSE 421 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 1282 + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 422 PLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGSLSLRS 478 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWN SPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 479 DSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 527 >ref|XP_017971960.1| PREDICTED: uncharacterized protein LOC18610175 isoform X2 [Theobroma cacao] Length = 538 Score = 165 bits (417), Expect = 7e-41 Identities = 139/469 (29%), Positives = 220/469 (46%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPS-----EFCGNETKFYTDKNVMECEL 250 W +L+ S ND N ++ + SN PS F + FY DK+VMECEL Sbjct: 92 WPALKLDCSISVNDFANG--NEKEVRDFVTSNSPSLKNMDSF--QNSVFYLDKSVMECEL 147 Query: 251 PELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLY 427 PEL+VCYKE+ + V KDIC+DEG+P ++ LF+ +E + + L +D S+L Sbjct: 148 PELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQD----SQLM 203 Query: 428 SETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASFPVNGP--- 586 +E K++TD+ + + P + + ++ +++K D + +Q+V S N Sbjct: 204 TE---KLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKG 260 Query: 587 IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDD 766 I + ++ + T++ D+ + + V + + K D Sbjct: 261 IPNQCDSKDLML-TRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSD 319 Query: 767 GDDEDEHSECAPEKLKVSQES-DTVSKAIDNNGPDINSSDGL------LDNVSTEEVVYS 925 G ++ + +++ V V ++ D+N I S L LD+ E ++ S Sbjct: 320 GIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILIS 379 Query: 926 XXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPESAHEP 1102 + +N ++ T NLD+ P ++ K E + Sbjct: 380 PAQVSTPEESTSSSLVNE---VSYDNKLETGSITFNLDSSAPTSS----KDECHHNLDSE 432 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 
1282 + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 433 PLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGSLSLRS 489 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWN SPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 490 DSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 538 >ref|XP_007045751.2| PREDICTED: uncharacterized protein LOC18610175 isoform X1 [Theobroma cacao] Length = 543 Score = 165 bits (417), Expect = 7e-41 Identities = 139/469 (29%), Positives = 220/469 (46%), Gaps = 21/469 (4%) Frame = +2 Query: 86 WNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPS-----EFCGNETKFYTDKNVMECEL 250 W +L+ S ND N ++ + SN PS F + FY DK+VMECEL Sbjct: 97 WPALKLDCSISVNDFANG--NEKEVRDFVTSNSPSLKNMDSF--QNSVFYLDKSVMECEL 152 Query: 251 PELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKISPTEDYYMESKLY 427 PEL+VCYKE+ + V KDIC+DEG+P ++ LF+ +E + + L +D S+L Sbjct: 153 PELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQD----SQLM 208 Query: 428 SETNGKIDTDLPV----VEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASFPVNGP--- 586 +E K++TD+ + + P + + ++ +++K D + +Q+V S N Sbjct: 209 TE---KLETDMCMQDVSMSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKG 265 Query: 587 IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNELKNISKDD 766 I + ++ + T++ D+ + + V + + K D Sbjct: 266 IPNQCDSKDLML-TRVVKGDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSD 324 Query: 767 GDDEDEHSECAPEKLKVSQES-DTVSKAIDNNGPDINSSDGL------LDNVSTEEVVYS 925 G ++ + +++ V V ++ D+N I S L LD+ E ++ S Sbjct: 325 GIEQQSFQSSSKKEVMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILIS 384 Query: 926 XXXXXXXXXXXXXXXXXXXXXIPNENMIDEND-TLNLDNGKPPTTNGVYKVETPESAHEP 1102 + +N ++ T NLD+ P ++ K E + Sbjct: 385 PAQVSTPEESTSSSLVNE---VSYDNKLETGSITFNLDSSAPTSS----KDECHHNLDSE 437 Query: 1103 SIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFXXXXXXXX 1282 + T + + A +I+ N +Q+G G ESSFS AG V+GLI+YSGP+A+ Sbjct: 438 PLGTGSTPKLEVAADQSIS--NNLQQGIG-ESSFSAAGLVTGLISYSGPVAYSGSLSLRS 494 Query: 1283 XXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWN SPVRMAKADRR +KH+GWRHGLLCCRF Sbjct: 495 
DSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHGLLCCRF 543 >ref|XP_012478809.1| PREDICTED: uncharacterized protein LOC105794265 isoform X2 [Gossypium raimondii] ref|XP_012478810.1| PREDICTED: uncharacterized protein LOC105794265 isoform X2 [Gossypium raimondii] gb|KJB30519.1| hypothetical protein B456_005G147700 [Gossypium raimondii] gb|KJB30522.1| hypothetical protein B456_005G147700 [Gossypium raimondii] Length = 466 Score = 162 bits (410), Expect = 2e-40 Identities = 136/476 (28%), Positives = 218/476 (45%), Gaps = 13/476 (2%) Frame = +2 Query: 41 TNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETK 214 T+P L E+ + W +L+ S ND +N + R + ++H + G+ ++ Sbjct: 15 TDPMLYLEKTG--DGWPASKLDCSMSVNDFSNG-NEKEARDFVPPNSHSLKNMGSFQDSV 71 Query: 215 FYTDKNVMECELPELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKIS 391 FY DK+VME LPEL+VCYKE+A+ V KDIC+DEG+P ++ LFD + + K Sbjct: 72 FYLDKSVMEYALPELVVCYKESAYHVVKDICIDEGVPTQDKFLFD---SVVDKKSDCNFL 128 Query: 392 PTEDYYMESKLYSETNGKIDTDLPVVEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASF 571 P+E+ L ++ I + P + D ++ +++K+ + Q++ S Sbjct: 129 PSEEDQDSKLLKEKSESDISMQAGSMYPEENQMDKDIDNERDSNKKTISDKCTQDISLSL 188 Query: 572 PVNGP---IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXXXXXXXEEVIDSVAPNE 742 N P I +++ + ++ +++ E + +V P Sbjct: 189 EENEPKNRIPSQCDTEDLILSRKMTDDTMKMARDDVSKELFTLGELLSMPE-LSTVKPKA 247 Query: 743 LKNISKDDG-------DDEDEHSECAPEKLKVSQESDTVSKAIDNNGPDINSSDGLLDNV 901 + + K DG + +++ P + +ESD SK + S +D+ Sbjct: 248 MSSNCKSDGIKQQCFQNSKEKEVMVMPPLVSADKESDNSSKETILSASAPVSVAEEMDSR 307 Query: 902 STEEVVYSXXXXXXXXXXXXXXXXXXXXXIPNENMIDENDTLNLDNGKPPTTNGVYKVET 1081 E ++S + ++ D+ L + K + + + E Sbjct: 308 KEEATMFSPVTSSSLVNEVSDDSK-----LAARSIAFGFDSSALTSSKNEGCHNLDR-EA 361 Query: 1082 PESAHEPSIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVAGPVSGLITYSGPIAFX 1261 E+ H P ++ D+A + N +Q G G ESSFS AG V+GLI+YSGPIA+ Sbjct: 362 LETGHTPKLE--------DIADQ--PSSNNLQCGNG-ESSFSAAGLVTGLISYSGPIAYS 410 Query: 1262 XXXXXXXXXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKHRGWRHGLLCCRF 1429 FAFPILQ+EWNSSPVRMAKADRR +KHRGWR GLLCCRF Sbjct: 411 
GSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKHRGWRQGLLCCRF 466 >ref|XP_016667087.1| PREDICTED: uncharacterized protein LOC107887384 isoform X2 [Gossypium hirsutum] ref|XP_016667088.1| PREDICTED: uncharacterized protein LOC107887384 isoform X2 [Gossypium hirsutum] Length = 464 Score = 161 bits (408), Expect = 4e-40 Identities = 140/492 (28%), Positives = 214/492 (43%), Gaps = 29/492 (5%) Frame = +2 Query: 41 TNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETK 214 T+P L E+ + W +L S ND +N + R + ++H + G+ ++ Sbjct: 15 TDPMLYLEKTG--DGWPASKLNCSMSVNDFSNG-NEKEARDFVPPNSHSLKNMGSFQDSV 71 Query: 215 FYTDKNVMECELPELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKIS 391 FY DK+VMEC LPEL+VCYKE+A+ V KDIC+DEG+P ++ LFD + K Sbjct: 72 FYLDKSVMECALPELVVCYKESAYHVVKDICIDEGVPTQDKFLFDSGVD---KKSDCNFL 128 Query: 392 PTEDYYMESKLYSETNGKIDTDLPVVEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASF 571 P+E+ L + I + P + D + D +++K+ + Q++ S Sbjct: 129 PSEEDQDSKLLKEKPESDISMQAGSMYPEENQMDKDNERD--SNKKTISDKYTQDISLSL 186 Query: 572 PVNGP-------------------IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXX 694 N P +DD M + ++ +S+P Sbjct: 187 EENEPKNRIPSQCDTEDLILSRKMMDDTMKMARDDVSKELFTLGELLSMPE--------- 237 Query: 695 XXXXXEEVIDSVAPNELKNISKDDG-------DDEDEHSECAPEKLKVSQESDTVSKAID 853 +V P L + DG + +++ P + +ES+ K Sbjct: 238 --------FSTVKPEALSSHCTSDGIKQQCFQNSKEKEVMVMPPLVSADKESNNSCKETI 289 Query: 854 NNGPDINSSDGLLDNVSTEEVVYSXXXXXXXXXXXXXXXXXXXXXIPNENMIDENDTLNL 1033 + S +D+V E ++S + ++ D+ L Sbjct: 290 LSASAPVSVAEEMDSVKGEATMFSPATSSSLVNEVSDDSK-----LAARSIAFGFDSSAL 344 Query: 1034 DNGKPPTTNGVYKVETPESAHEPSIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVA 1213 + K + + + E E+ H P ++ D+A + N +Q G G ESSFS A Sbjct: 345 TSSKDEGCHNLDR-EALETGHTPKLE--------DIADQ--PSSNNLQCGNG-ESSFSAA 392 Query: 1214 GPVSGLITYSGPIAFXXXXXXXXXXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKH 1393 G V+GLI+YSGPIA+ FAFPILQ+EWNSSPVRMAKADRR +KH Sbjct: 393 GLVTGLISYSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKH 452 Query: 1394 RGWRHGLLCCRF 1429 RGWR GLLCCRF Sbjct: 453 RGWRQGLLCCRF 464 >ref|XP_016667085.1| 
PREDICTED: uncharacterized protein LOC107887384 isoform X1 [Gossypium hirsutum] ref|XP_016667086.1| PREDICTED: uncharacterized protein LOC107887384 isoform X1 [Gossypium hirsutum] Length = 516 Score = 161 bits (408), Expect = 9e-40 Identities = 140/492 (28%), Positives = 214/492 (43%), Gaps = 29/492 (5%) Frame = +2 Query: 41 TNPFLSDEENAKVNFWNDQELEHSNIENDLTNTCKDDNFRKSLELSNHPSEFCGN--ETK 214 T+P L E+ + W +L S ND +N + R + ++H + G+ ++ Sbjct: 67 TDPMLYLEKTG--DGWPASKLNCSMSVNDFSNG-NEKEARDFVPPNSHSLKNMGSFQDSV 123 Query: 215 FYTDKNVMECELPELLVCYKENAFPV-KDICVDEGIPHGERVLFDENNNEMLKHDQLKIS 391 FY DK+VMEC LPEL+VCYKE+A+ V KDIC+DEG+P ++ LFD + K Sbjct: 124 FYLDKSVMECALPELVVCYKESAYHVVKDICIDEGVPTQDKFLFDSGVD---KKSDCNFL 180 Query: 392 PTEDYYMESKLYSETNGKIDTDLPVVEPTSDYTDIGDSHDEVTDRKSDASSGIQEVDASF 571 P+E+ L + I + P + D + D +++K+ + Q++ S Sbjct: 181 PSEEDQDSKLLKEKPESDISMQAGSMYPEENQMDKDNERD--SNKKTISDKYTQDISLSL 238 Query: 572 PVNGP-------------------IDDHRSMDNIGIGTQIPVQDSQVSIPXXXXXXXXXX 694 N P +DD M + ++ +S+P Sbjct: 239 EENEPKNRIPSQCDTEDLILSRKMMDDTMKMARDDVSKELFTLGELLSMPE--------- 289 Query: 695 XXXXXEEVIDSVAPNELKNISKDDG-------DDEDEHSECAPEKLKVSQESDTVSKAID 853 +V P L + DG + +++ P + +ES+ K Sbjct: 290 --------FSTVKPEALSSHCTSDGIKQQCFQNSKEKEVMVMPPLVSADKESNNSCKETI 341 Query: 854 NNGPDINSSDGLLDNVSTEEVVYSXXXXXXXXXXXXXXXXXXXXXIPNENMIDENDTLNL 1033 + S +D+V E ++S + ++ D+ L Sbjct: 342 LSASAPVSVAEEMDSVKGEATMFSPATSSSLVNEVSDDSK-----LAARSIAFGFDSSAL 396 Query: 1034 DNGKPPTTNGVYKVETPESAHEPSIDTQTEHSHQDVAPDNIATINPVQRGEGGESSFSVA 1213 + K + + + E E+ H P ++ D+A + N +Q G G ESSFS A Sbjct: 397 TSSKDEGCHNLDR-EALETGHTPKLE--------DIADQ--PSSNNLQCGNG-ESSFSAA 444 Query: 1214 GPVSGLITYSGPIAFXXXXXXXXXXXXXXXXXFAFPILQNEWNSSPVRMAKADRRRLQKH 1393 G V+GLI+YSGPIA+ FAFPILQ+EWNSSPVRMAKADRR +KH Sbjct: 445 GLVTGLISYSGPIAYSGSLSHRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHYRKH 504 Query: 1394 RGWRHGLLCCRF 1429 RGWR GLLCCRF Sbjct: 505 RGWRQGLLCCRF 516