BLASTX nr result
ID: Rehmannia30_contig00020957
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00020957
         (794 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_011088327.1| uncharacterized protein LOC105169600 [Sesamu...  430   e-146
gb|EYU37748.1| hypothetical protein MIMGU_mgv1a022304mg, partial...  372   e-125
gb|KZV52983.1| hypothetical protein F511_31948 [Dorcoceras hygro...  375   e-125
ref|XP_012836808.1| PREDICTED: uncharacterized protein LOC105957...  372   e-124
gb|AAO59425.1| putative ternary complex factor MIP1 [Antirrhinum...  372   e-124
gb|PIN12372.1| hypothetical protein CDL12_15025 [Handroanthus im...  369   e-123
ref|XP_017969768.1| PREDICTED: uncharacterized protein LOC186110...  316   e-101
ref|XP_021273633.1| uncharacterized protein LOC110408847 isoform...  317   e-101
ref|XP_021300265.1| uncharacterized protein LOC110428688 isoform...  316   e-101
ref|XP_021273632.1| uncharacterized protein LOC110408847 isoform...  317   e-101
ref|XP_021273631.1| uncharacterized protein LOC110408847 isoform...  317   e-101
ref|XP_017969766.1| PREDICTED: uncharacterized protein LOC186110...  316   e-101
ref|XP_017969765.1| PREDICTED: uncharacterized protein LOC186110...  316   e-101
ref|XP_007047079.2| PREDICTED: uncharacterized protein LOC186110...  316   e-101
ref|XP_021300264.1| uncharacterized protein LOC110428688 isoform...  316   e-101
ref|XP_021300263.1| uncharacterized protein LOC110428688 isoform...  316   e-101
ref|XP_007041397.2| PREDICTED: uncharacterized protein LOC186072...  314   e-101
gb|EOX97227.1| Uncharacterized protein TCM_006317 isoform 1 [The...  314   e-101
gb|OMO84200.1| hypothetical protein COLO4_22164 [Corchorus olito...  315   e-100
ref|XP_018629833.1| PREDICTED: uncharacterized protein LOC104106...  313   e-100

>ref|XP_011088327.1| uncharacterized protein LOC105169600 [Sesamum indicum]
 ref|XP_011088328.1| uncharacterized protein LOC105169600 [Sesamum indicum]
 ref|XP_020551916.1| uncharacterized protein LOC105169600 [Sesamum indicum]
          Length = 569

 Score = 430 bits (1106), Expect = e-146
 Identities = 213/281 (75%), Positives = 230/281 (81%), Gaps = 19/281 (6%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLF-------------------EHHDIY 129
           SSFP RV ET NWLSEEMIKCIST+YCHLSDPPLF HHDI
Sbjct: 228 SSFPDRVRETPNWLSEEMIKCISTIYCHLSDPPLFARGLNSTSHSSPPSKFSPRRHHDIP 287

Query: 130 XXXXXXXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLI 309
           NPFH++AS EFSG+ FTMVE+ G+ RDS RL+ V+DLLQ+YRSLI
Sbjct: 288 GLSSEENLSFYSWLNNPFHVDASKEFSGTFFTMVEVQGICRDSTRLHAVQDLLQSYRSLI 347

Query: 310 SKLAQVDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTV 489
           S+LAQVDPGKL+HEEKLAFWIN+HNALVMHAF+VYGIPRGNLKRISL LKAAYNIGGHTV
Sbjct: 348 SRLAQVDPGKLKHEEKLAFWINVHNALVMHAFLVYGIPRGNLKRISLALKAAYNIGGHTV 407

Query: 490 SVDTIQSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSD 669
           +VDTIQSSILGCRLPRPT WLHSLFFPK KFKAGDPRKA+AVKH EPRLHFALCSGCQSD
Sbjct: 408 NVDTIQSSILGCRLPRPTQWLHSLFFPKTKFKAGDPRKAYAVKHPEPRLHFALCSGCQSD 467

Query: 670 PAVRLYTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           PAVR YTPKKVFQELEMAKE+YIQMNIKL KEQRLL+PKNV
Sbjct: 468 PAVRSYTPKKVFQELEMAKEEYIQMNIKLHKEQRLLIPKNV 508

>gb|EYU37748.1| hypothetical protein MIMGU_mgv1a022304mg, partial [Erythranthe guttata]
          Length = 433

 Score = 372 bits (955), Expect = e-125
 Identities = 188/262 (71%), Positives = 211/262 (80%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHHDIYXXXXXXXXXXXXXXXNPFH 186
           SSF RV ET NWLSEEMIKCISTVYCHLSDPPLF N
Sbjct: 135 SSFADRVSETPNWLSEEMIKCISTVYCHLSDPPLF---------------------NRGR 173

Query: 187 LEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVDPGKLRHEEKLAF 366
           + FSG+LF MVEI G+FRDS+RLN V++ LQNYRSLIS+L ++DPGKL++EEKLAF
Sbjct: 174 DDEEYSFSGTLFAMVEIQGIFRDSERLNAVQEPLQNYRSLISRLGKIDPGKLKNEEKLAF 233

Query: 367 WINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQSSILGCRLPRPTP 546
           WIN+HNALVMHAF+VYGIPRG++KRISLVLKAAYNIGG T+S+DTIQSSILGCRLPRP
Sbjct: 234 WINVHNALVMHAFIVYGIPRGSVKRISLVLKAAYNIGGQTISIDTIQSSILGCRLPRPAL 293

Query: 547 WLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYTPKKVFQELEMAK 726
           WL+SLF KKKFKAGD RKA+ VK +EPRL FALCSGC SDP VR+YT KKVFQELEMAK
Sbjct: 294 WLNSLFSTKKKFKAGDVRKAYTVKQAEPRLRFALCSGCLSDPLVRVYTAKKVFQELEMAK 353

Query: 727 EDYIQMNIKLRKEQRLLVPKNV 792
           E+YIQMNIK+ KEQRLLVPKNV
Sbjct: 354 EEYIQMNIKIYKEQRLLVPKNV 375

>gb|KZV52983.1| hypothetical protein F511_31948 [Dorcoceras hygrometricum]
          Length = 555

 Score = 375 bits (963), Expect = e-125
 Identities = 182/271 (67%), Positives = 213/271 (78%), Gaps = 9/271 (3%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEH---------HDIYXXXXXXXXXX 159
           S FP RV E+ NW+SEEMIKCIST+YCHLSDPPLF H HD+
Sbjct: 227 SRFPNRVRESPNWISEEMIKCISTIYCHLSDPPLFNHGSNSSPQGQHDLSSLSCDENSSS 286

Query: 160 XXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVDPGK 339
           NPF+ E S +FSGSL +++E+ GL + QRLN VE LLQ +RSLIS+LA VDPGK
Sbjct: 287 NSWMNNPFNGETSKDFSGSLRSIIEVQGLVPEPQRLNVVEVLLQKFRSLISRLATVDPGK 346

Query: 340 LRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQSSIL 519
           L+HEEKLAFWIN+HNA+VMHAF+V+GIPRGN KRISL LKAAYNIGGH +SV TIQSSIL
Sbjct: 347 LKHEEKLAFWINVHNAIVMHAFLVHGIPRGNQKRISLALKAAYNIGGHIISVVTIQSSIL 406

Query: 520 GCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYTPKK 699
           GCRLPRP+ WLHS FPK K K+G+P K+FA+KH EPRL FALCSGCQSDP +RLYTPKK
Sbjct: 407 GCRLPRPSQWLHSWLFPKTKLKSGEPGKSFAIKHPEPRLRFALCSGCQSDPLIRLYTPKK 466

Query: 700 VFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           +FQELE+AKE+YIQMNI+ K+QRL +PKNV
Sbjct: 467 IFQELEIAKEEYIQMNIRKHKDQRLHIPKNV 497

>ref|XP_012836808.1| PREDICTED: uncharacterized protein LOC105957431 [Erythranthe guttata]
          Length = 497

 Score = 372 bits (955), Expect = e-124
 Identities = 188/262 (71%), Positives = 211/262 (80%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHHDIYXXXXXXXXXXXXXXXNPFH 186
           SSF RV ET NWLSEEMIKCISTVYCHLSDPPLF N
Sbjct: 199 SSFADRVSETPNWLSEEMIKCISTVYCHLSDPPLF---------------------NRGR 237

Query: 187 LEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVDPGKLRHEEKLAF 366
           + FSG+LF MVEI G+FRDS+RLN V++ LQNYRSLIS+L ++DPGKL++EEKLAF
Sbjct: 238 DDEEYSFSGTLFAMVEIQGIFRDSERLNAVQEPLQNYRSLISRLGKIDPGKLKNEEKLAF 297

Query: 367 WINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQSSILGCRLPRPTP 546
           WIN+HNALVMHAF+VYGIPRG++KRISLVLKAAYNIGG T+S+DTIQSSILGCRLPRP
Sbjct: 298 WINVHNALVMHAFIVYGIPRGSVKRISLVLKAAYNIGGQTISIDTIQSSILGCRLPRPAL 357

Query: 547 WLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYTPKKVFQELEMAK 726
           WL+SLF KKKFKAGD RKA+ VK +EPRL FALCSGC SDP VR+YT KKVFQELEMAK
Sbjct: 358 WLNSLFSTKKKFKAGDVRKAYTVKQAEPRLRFALCSGCLSDPLVRVYTAKKVFQELEMAK 417

Query: 727 EDYIQMNIKLRKEQRLLVPKNV 792
           E+YIQMNIK+ KEQRLLVPKNV
Sbjct: 418 EEYIQMNIKIYKEQRLLVPKNV 439

>gb|AAO59425.1| putative ternary complex factor MIP1 [Antirrhinum majus]
          Length = 555

 Score = 372 bits (956), Expect = e-124
 Identities = 181/270 (67%), Positives = 212/270 (78%), Gaps = 16/270 (5%)
 Frame = +1

Query: 31  ETLNWLSEEMIKCISTVYCHLSDPPLFEH----------------HDIYXXXXXXXXXXX 162
           E N+LSEEMIKCIST+YCHLSDPPLF H +
Sbjct: 225 EAPNYLSEEMIKCISTIYCHLSDPPLFNHGFNSVSLLSPPTTFSPQAQHGKCSEENTSFG 284

Query: 163 XXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVDPGKL 342
           NPF++E S EF+GSL++MVE+ GL RDSQ L+ VE+LLQNYR LISKL +VDPGKL
Sbjct: 285 SWMNNPFNVEESKEFNGSLYSMVEVQGLLRDSQSLDSVEELLQNYRFLISKLGEVDPGKL 344

Query: 343 RHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQSSILG 522
           +H+EKLAFWIN+HN+LVMHAF+VYGIP+GN+KRISL LKAAYN+GGHT+SVDTIQSSIL
Sbjct: 345 KHDEKLAFWINVHNSLVMHAFLVYGIPQGNMKRISLALKAAYNVGGHTISVDTIQSSILR 404

Query: 523 CRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYTPKKV 702
           CRLPRP+ WL SLFFPK+KFKA DPRK +A++HSEPRL FALCSGC SD VR+YT KKV
Sbjct: 405 CRLPRPSQWLQSLFFPKQKFKACDPRKVYAIRHSEPRLRFALCSGCNSDAPVRIYTSKKV 464

Query: 703 FQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           FQELE+AKE+YIQMN+ + KEQRLLVPKNV
Sbjct: 465 FQELEIAKEEYIQMNVSVHKEQRLLVPKNV 494

>gb|PIN12372.1| hypothetical protein CDL12_15025 [Handroanthus impetiginosus]
          Length = 527

 Score = 369 bits (947), Expect = e-123
 Identities = 182/255 (71%), Positives = 209/255 (81%), Gaps = 1/255 (0%)
 Frame = +1

Query: 31  ETLNWLSEEMIKCISTVYCHLSD-PPLFEHHDIYXXXXXXXXXXXXXXXNPFHLEASNEF 207
           E+ NWLSEEMIKCIS++YCHLS+ PP E + NPFH E S E
Sbjct: 230 ESPNWLSEEMIKCISSIYCHLSESPPSCEENS--------------WLNNPFHYEPSKET 275

Query: 208 SGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVDPGKLRHEEKLAFWINIHNA 387
           SG+ F E++ + D +RLN V+DLLQ+YRSLI +LAQVDPGKL+H+EKLAFWIN+HNA
Sbjct: 276 SGTFF---EVNVILTDCERLNSVQDLLQDYRSLICRLAQVDPGKLKHDEKLAFWINVHNA 332

Query: 388 LVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQSSILGCRLPRPTPWLHSLFF 567
           LVMHAF+VYGIPRG+LKR+SLVLKAAYNIGGHT+SVD IQSSIL CRLPRPT WLHSLFF
Sbjct: 333 LVMHAFLVYGIPRGSLKRVSLVLKAAYNIGGHTISVDKIQSSILRCRLPRPTQWLHSLFF 392

Query: 568 PKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYTPKKVFQELEMAKEDYIQMN 747
           PK KFKAGDPRK++A+KH EPRLHFALCSGCQSDP VRLYT KKVFQELE+AKE+YIQMN
Sbjct: 393 PKTKFKAGDPRKSYALKHPEPRLHFALCSGCQSDPPVRLYTHKKVFQELEIAKEEYIQMN 452

Query: 748 IKLRKEQRLLVPKNV 792
           KL KEQ+LL+PKNV
Sbjct: 453 TKLHKEQKLLIPKNV 467

>ref|XP_017969768.1| PREDICTED: uncharacterized protein LOC18611014 isoform X4 [Theobroma cacao]
 ref|XP_007047081.2| PREDICTED: uncharacterized protein LOC18611014 isoform X4 [Theobroma cacao]
          Length = 569

 Score = 316 bits (810), Expect = e-101
 Identities = 156/274 (56%), Positives = 192/274 (70%), Gaps = 20/274 (7%)
 Frame = +1

Query: 31  ETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXXXX 150
           ET N LSE+MIKC+S +YC L+DPPL ++ D++
Sbjct: 235 ETPNKLSEDMIKCMSAIYCKLADPPLIQNDFSSPNSSVSSASAFSPQDQQDMWSPGFRNN 294

Query: 151 XXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVD 330
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +VD
Sbjct: 295 SSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEEVD 354

Query: 331 PGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQS 510
           P KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTIQ
Sbjct: 355 PSKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFFLLLRAAYNIGGHTISADTIQG 414

Query: 511 SILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYT 690
           SILGCR+ RP WL L + KFK GD R+A+A++H EP LHFALCSG SDPAVR YT
Sbjct: 415 SILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIEHPEPLLHFALCSGNHSDPAVRAYT 474

Query: 691 PKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           PK+VFQELE AKE+YI+ +RKEQ++L+PK V
Sbjct: 475 PKRVFQELETAKEEYIRATFGIRKEQKILLPKIV 508

>ref|XP_021273633.1| uncharacterized protein LOC110408847 isoform X3 [Herrania umbratica]
          Length = 611

 Score = 317 bits (813), Expect = e-101
 Identities = 157/276 (56%), Positives = 193/276 (69%), Gaps = 20/276 (7%)
 Frame = +1

Query: 25  VCETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXX 144
           V ET N LSE+MIKC+S +YC L+DPPL ++ HD++
Sbjct: 275 VPETPNKLSEDMIKCMSAIYCKLADPPLIQNGFSSPNSSMSSASAFSPQDQHDVWSPGFR 334

Query: 145 XXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQ 324
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +
Sbjct: 335 NNSSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEE 394

Query: 325 VDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTI 504
           VDP KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTI
Sbjct: 395 VDPRKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFLLLLRAAYNIGGHTISADTI 454

Query: 505 QSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRL 684
           Q SILGCR+ RP WL L + KFK GD R+A+A+ H EP LHFALCSG SDPAVR
Sbjct: 455 QGSILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIGHPEPLLHFALCSGNHSDPAVRA 514

Query: 685 YTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           YTPK+VFQELE AKE+YI+ +RK+Q++L+PK V
Sbjct: 515 YTPKRVFQELETAKEEYIRATFGIRKDQKILLPKIV 550

>ref|XP_021300265.1| uncharacterized protein LOC110428688 isoform X3 [Herrania umbratica]
          Length = 566

 Score = 316 bits (809), Expect = e-101
 Identities = 157/275 (57%), Positives = 194/275 (70%), Gaps = 15/275 (5%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHH---------------DIYXXXX 141
           SS V ET NWLSEEMIK IS +YC L+DPPL H D +
Sbjct: 227 SSISHHVPETPNWLSEEMIKTISAIYCELADPPLINHESLSSPVSNSSSQGQGDTWSPQC 286

Query: 142 XXXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLA 321
           NPF + S EFSG +MV++ + RDS++L D+E LQ YRSL+ +L
Sbjct: 287 GKFSSFNSHFDNPFSIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVYQLE 346

Query: 322 QVDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDT 501
           +VD +++HEEKLAFWIN+HNALVMHAF+VYGIP+ NLKR+SL+LKAAYN+GG T+S+DT
Sbjct: 347 EVDVRRMKHEEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDT 406

Query: 502 IQSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVR 681
           IQSSILGCRLPRP WL LF K KFK D R+A+A++ EP LHFALCSG SDPAVR
Sbjct: 407 IQSSILGCRLPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVR 466

Query: 682 LYTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPK 786
           +YTPKKVFQELE+AKE+YIQ N+ + KEQ++L+PK
Sbjct: 467 IYTPKKVFQELEIAKEEYIQSNLSVNKEQKILLPK 501

>ref|XP_021273632.1| uncharacterized protein LOC110408847 isoform X2 [Herrania umbratica]
          Length = 624

 Score = 317 bits (813), Expect = e-101
 Identities = 157/276 (56%), Positives = 193/276 (69%), Gaps = 20/276 (7%)
 Frame = +1

Query: 25  VCETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXX 144
           V ET N LSE+MIKC+S +YC L+DPPL ++ HD++
Sbjct: 288 VPETPNKLSEDMIKCMSAIYCKLADPPLIQNGFSSPNSSMSSASAFSPQDQHDVWSPGFR 347

Query: 145 XXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQ 324
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +
Sbjct: 348 NNSSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEE 407

Query: 325 VDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTI 504
           VDP KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTI
Sbjct: 408 VDPRKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFLLLLRAAYNIGGHTISADTI 467

Query: 505 QSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRL 684
           Q SILGCR+ RP WL L + KFK GD R+A+A+ H EP LHFALCSG SDPAVR
Sbjct: 468 QGSILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIGHPEPLLHFALCSGNHSDPAVRA 527

Query: 685 YTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           YTPK+VFQELE AKE+YI+ +RK+Q++L+PK V
Sbjct: 528 YTPKRVFQELETAKEEYIRATFGIRKDQKILLPKIV 563

>ref|XP_021273631.1| uncharacterized protein LOC110408847 isoform X1 [Herrania umbratica]
          Length = 625

 Score = 317 bits (813), Expect = e-101
 Identities = 157/276 (56%), Positives = 193/276 (69%), Gaps = 20/276 (7%)
 Frame = +1

Query: 25  VCETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXX 144
           V ET N LSE+MIKC+S +YC L+DPPL ++ HD++
Sbjct: 289 VPETPNKLSEDMIKCMSAIYCKLADPPLIQNGFSSPNSSMSSASAFSPQDQHDVWSPGFR 348

Query: 145 XXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQ 324
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +
Sbjct: 349 NNSSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEE 408

Query: 325 VDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTI 504
           VDP KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTI
Sbjct: 409 VDPRKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFLLLLRAAYNIGGHTISADTI 468

Query: 505 QSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRL 684
           Q SILGCR+ RP WL L + KFK GD R+A+A+ H EP LHFALCSG SDPAVR
Sbjct: 469 QGSILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIGHPEPLLHFALCSGNHSDPAVRA 528

Query: 685 YTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           YTPK+VFQELE AKE+YI+ +RK+Q++L+PK V
Sbjct: 529 YTPKRVFQELETAKEEYIRATFGIRKDQKILLPKIV 564

>ref|XP_017969766.1| PREDICTED: uncharacterized protein LOC18611014 isoform X3 [Theobroma cacao]
          Length = 610

 Score = 316 bits (810), Expect = e-101
 Identities = 156/274 (56%), Positives = 192/274 (70%), Gaps = 20/274 (7%)
 Frame = +1

Query: 31  ETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXXXX 150
           ET N LSE+MIKC+S +YC L+DPPL ++ D++
Sbjct: 276 ETPNKLSEDMIKCMSAIYCKLADPPLIQNDFSSPNSSVSSASAFSPQDQQDMWSPGFRNN 335

Query: 151 XXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVD 330
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +VD
Sbjct: 336 SSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEEVD 395

Query: 331 PGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQS 510
           P KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTIQ
Sbjct: 396 PSKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFFLLLRAAYNIGGHTISADTIQG 455

Query: 511 SILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYT 690
           SILGCR+ RP WL L + KFK GD R+A+A++H EP LHFALCSG SDPAVR YT
Sbjct: 456 SILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIEHPEPLLHFALCSGNHSDPAVRAYT 515

Query: 691 PKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           PK+VFQELE AKE+YI+ +RKEQ++L+PK V
Sbjct: 516 PKRVFQELETAKEEYIRATFGIRKEQKILLPKIV 549

>ref|XP_017969765.1| PREDICTED: uncharacterized protein LOC18611014 isoform X2 [Theobroma cacao]
          Length = 611

 Score = 316 bits (810), Expect = e-101
 Identities = 156/274 (56%), Positives = 192/274 (70%), Gaps = 20/274 (7%)
 Frame = +1

Query: 31  ETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXXXX 150
           ET N LSE+MIKC+S +YC L+DPPL ++ D++
Sbjct: 277 ETPNKLSEDMIKCMSAIYCKLADPPLIQNDFSSPNSSVSSASAFSPQDQQDMWSPGFRNN 336

Query: 151 XXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVD 330
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +VD
Sbjct: 337 SSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEEVD 396

Query: 331 PGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQS 510
           P KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTIQ
Sbjct: 397 PSKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFFLLLRAAYNIGGHTISADTIQG 456

Query: 511 SILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYT 690
           SILGCR+ RP WL L + KFK GD R+A+A++H EP LHFALCSG SDPAVR YT
Sbjct: 457 SILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIEHPEPLLHFALCSGNHSDPAVRAYT 516

Query: 691 PKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           PK+VFQELE AKE+YI+ +RKEQ++L+PK V
Sbjct: 517 PKRVFQELETAKEEYIRATFGIRKEQKILLPKIV 550

>ref|XP_007047079.2| PREDICTED: uncharacterized protein LOC18611014 isoform X1 [Theobroma cacao]
          Length = 619

 Score = 316 bits (810), Expect = e-101
 Identities = 156/274 (56%), Positives = 192/274 (70%), Gaps = 20/274 (7%)
 Frame = +1

Query: 31  ETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXXXX 150
           ET N LSE+MIKC+S +YC L+DPPL ++ D++
Sbjct: 285 ETPNKLSEDMIKCMSAIYCKLADPPLIQNDFSSPNSSVSSASAFSPQDQQDMWSPGFRNN 344

Query: 151 XXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVD 330
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLI +L +VD
Sbjct: 345 SSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLICRLEEVD 404

Query: 331 PGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQS 510
           P KL+HEEKLAFWINIHNALVMHAF+ YG+P+ N+KR L+L+AAYNIGGHT+S DTIQ
Sbjct: 405 PSKLKHEEKLAFWINIHNALVMHAFLAYGVPQNNMKRFFLLLRAAYNIGGHTISADTIQG 464

Query: 511 SILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYT 690
           SILGCR+ RP WL L + KFK GD R+A+A++H EP LHFALCSG SDPAVR YT
Sbjct: 465 SILGCRMSRPGQWLRLLLSSRAKFKTGDERQAYAIEHPEPLLHFALCSGNHSDPAVRAYT 524

Query: 691 PKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           PK+VFQELE AKE+YI+ +RKEQ++L+PK V
Sbjct: 525 PKRVFQELETAKEEYIRATFGIRKEQKILLPKIV 558

>ref|XP_021300264.1| uncharacterized protein LOC110428688 isoform X2 [Herrania umbratica]
          Length = 607

 Score = 316 bits (809), Expect = e-101
 Identities = 157/275 (57%), Positives = 194/275 (70%), Gaps = 15/275 (5%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHH---------------DIYXXXX 141
           SS V ET NWLSEEMIK IS +YC L+DPPL H D +
Sbjct: 268 SSISHHVPETPNWLSEEMIKTISAIYCELADPPLINHESLSSPVSNSSSQGQGDTWSPQC 327

Query: 142 XXXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLA 321
           NPF + S EFSG +MV++ + RDS++L D+E LQ YRSL+ +L
Sbjct: 328 GKFSSFNSHFDNPFSIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVYQLE 387

Query: 322 QVDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDT 501
           +VD +++HEEKLAFWIN+HNALVMHAF+VYGIP+ NLKR+SL+LKAAYN+GG T+S+DT
Sbjct: 388 EVDVRRMKHEEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDT 447

Query: 502 IQSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVR 681
           IQSSILGCRLPRP WL LF K KFK D R+A+A++ EP LHFALCSG SDPAVR
Sbjct: 448 IQSSILGCRLPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVR 507

Query: 682 LYTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPK 786
           +YTPKKVFQELE+AKE+YIQ N+ + KEQ++L+PK
Sbjct: 508 IYTPKKVFQELEIAKEEYIQSNLSVNKEQKILLPK 542
>ref|XP_021300263.1| uncharacterized protein LOC110428688 isoform X1 [Herrania umbratica]
          Length = 616

 Score = 316 bits (809), Expect = e-101
 Identities = 157/275 (57%), Positives = 194/275 (70%), Gaps = 15/275 (5%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHH---------------DIYXXXX 141
           SS V ET NWLSEEMIK IS +YC L+DPPL H D +
Sbjct: 277 SSISHHVPETPNWLSEEMIKTISAIYCELADPPLINHESLSSPVSNSSSQGQGDTWSPQC 336

Query: 142 XXXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLA 321
           NPF + S EFSG +MV++ + RDS++L D+E LQ YRSL+ +L
Sbjct: 337 GKFSSFNSHFDNPFSIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVYQLE 396

Query: 322 QVDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDT 501
           +VD +++HEEKLAFWIN+HNALVMHAF+VYGIP+ NLKR+SL+LKAAYN+GG T+S+DT
Sbjct: 397 EVDVRRMKHEEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDT 456

Query: 502 IQSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVR 681
           IQSSILGCRLPRP WL LF K KFK D R+A+A++ EP LHFALCSG SDPAVR
Sbjct: 457 IQSSILGCRLPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVR 516

Query: 682 LYTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPK 786
           +YTPKKVFQELE+AKE+YIQ N+ + KEQ++L+PK
Sbjct: 517 IYTPKKVFQELEIAKEEYIQSNLSVNKEQKILLPK 551

>ref|XP_007041397.2| PREDICTED: uncharacterized protein LOC18607270 isoform X1 [Theobroma cacao]
          Length = 567

 Score = 314 bits (805), Expect = e-101
 Identities = 156/275 (56%), Positives = 195/275 (70%), Gaps = 15/275 (5%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHH---------------DIYXXXX 141
           SS V ET NWLSEEMIK IS +YC L+DPPL H D++
Sbjct: 228 SSISHHVPETPNWLSEEMIKTISAIYCELADPPLINHGYLSSPVSNSSSQGQGDMWSPQC 287

Query: 142 XXXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLA 321
           +PF + S EFSG +MV++ + RDS++L D+E LQ YRSL+ +L
Sbjct: 288 GKFSSFNSHFDSPFGIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVCRLE 347

Query: 322 QVDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDT 501
           +VD +++HEEKLAFWIN+HNALVMHAF+VYGIP+ NLKR+SL+LKAAYN+GG T+S+DT
Sbjct: 348 EVDVRRMKHEEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDT 407
Query: 502 IQSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVR 681
           IQSSILGCRLPRP WL LF K KFK D R+A+A++ EP LHFALCSG SDPAVR
Sbjct: 408 IQSSILGCRLPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVR 467

Query: 682 LYTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPK 786
           +YTPKKVFQELE+AKE+YIQ N+ + KEQ++L+PK
Sbjct: 468 IYTPKKVFQELEVAKEEYIQSNLSVNKEQKILLPK 502

>gb|EOX97227.1| Uncharacterized protein TCM_006317 isoform 1 [Theobroma cacao]
 gb|EOX97228.1| Uncharacterized protein TCM_006317 isoform 1 [Theobroma cacao]
          Length = 567

 Score = 314 bits (805), Expect = e-101
 Identities = 156/275 (56%), Positives = 195/275 (70%), Gaps = 15/275 (5%)
 Frame = +1

Query: 7   SSFPGRVCETLNWLSEEMIKCISTVYCHLSDPPLFEHH---------------DIYXXXX 141
           SS V ET NWLSEEMIK IS +YC L+DPPL H D++
Sbjct: 228 SSISHHVPETPNWLSEEMIKTISAIYCELADPPLINHGYLSSPVSNSSSQGQGDMWSPQC 287

Query: 142 XXXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLA 321
           +PF + S EFSG +MV++ + RDS++L D+E LQ YRSL+ +L
Sbjct: 288 GKFSSFNSHFDSPFGIGESKEFSGPYCSMVKVQWICRDSKKLQDIEHKLQYYRSLVCRLE 347

Query: 322 QVDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDT 501
           +VD +++HEEKLAFWIN+HNALVMHAF+VYGIP+ NLKR+SL+LKAAYN+GG T+S+DT
Sbjct: 348 EVDVRRMKHEEKLAFWINVHNALVMHAFLVYGIPKNNLKRLSLLLKAAYNVGGQTISIDT 407

Query: 502 IQSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVR 681
           IQSSILGCRLPRP WL LF K KFK D R+A+A++ EP LHFALCSG SDPAVR
Sbjct: 408 IQSSILGCRLPRPGQWLRFLFPSKTKFKVVDARRAYAIESPEPLLHFALCSGSYSDPAVR 467

Query: 682 LYTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPK 786
           +YTPKKVFQELE+AKE+YIQ N+ + KEQ++L+PK
Sbjct: 468 IYTPKKVFQELEVAKEEYIQSNLSVNKEQKILLPK 502

>gb|OMO84200.1| hypothetical protein COLO4_22164 [Corchorus olitorius]
          Length = 628

 Score = 315 bits (808), Expect = e-100
 Identities = 156/276 (56%), Positives = 194/276 (70%), Gaps = 20/276 (7%)
 Frame = +1

Query: 25  VCETLNWLSEEMIKCISTVYCHLSDPPLFEH--------------------HDIYXXXXX 144
           V ET N LSE+MIKC+S +YC L+DPPL ++ HD++
Sbjct: 292 VPETPNKLSEDMIKCMSAIYCKLADPPLIQNGFSSPISSMSSASAFSPQDQHDMWSPGFR 351

Query: 145 XXXXXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQ 324
           NPFH+E EFSG TMVE+ +FRDSQ+L DVE LLQN+RSLIS+L +
Sbjct: 352 NNSSFDVRLDNPFHVEGLKEFSGPYSTMVEVPWIFRDSQKLGDVEHLLQNFRSLISRLEE 411

Query: 325 VDPGKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTI 504
           VDP KL+HEEKLAFWIN+HNALVMHAF+ YGIP+ N+KR+ L+L+AAYNIGGHT+S DTI
Sbjct: 412 VDPRKLKHEEKLAFWINVHNALVMHAFLAYGIPQNNVKRLFLLLRAAYNIGGHTISADTI 471

Query: 505 QSSILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRL 684
           QSSILGCR+ RP WL L + KFK GD R+A+A+ H EP LHFALCSG SDPAVR
Sbjct: 472 QSSILGCRMSRPGQWLRLLLPSRAKFKTGDERQAYAIDHPEPLLHFALCSGNHSDPAVRA 531

Query: 685 YTPKKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           YTPK++F ELE AKE+YI+ +RK+ ++L+PK V
Sbjct: 532 YTPKRIFHELETAKEEYIRATFGVRKDHKILLPKIV 567

>ref|XP_018629833.1| PREDICTED: uncharacterized protein LOC104106708 isoform X2 [Nicotiana tomentosiformis]
          Length = 564

 Score = 313 bits (803), Expect = e-100
 Identities = 154/273 (56%), Positives = 194/273 (71%), Gaps = 17/273 (6%)
 Frame = +1

Query: 25  VCETLNWLSEEMIKCISTVYCHLSDPPL-----------------FEHHDIYXXXXXXXX 153
           V ET N LSE+MIKC+ T+YC L+DPPL F DI+
Sbjct: 231 VPETPNKLSEDMIKCMCTIYCKLADPPLTNPGLSSPTSSLSSISAFSPKDIWSPGFRNDS 290

Query: 154 XXXXXXXNPFHLEASNEFSGSLFTMVEIHGLFRDSQRLNDVEDLLQNYRSLISKLAQVDP 333
           NPFH+E EFSG TMVE+ ++RD+Q+L D+E +LQN+RSLIS+L Q+DP
Sbjct: 291 SFDVRLDNPFHVEGLKEFSGPYSTMVEVQCVYRDTQKLGDIEPMLQNFRSLISRLEQIDP 350

Query: 334 GKLRHEEKLAFWINIHNALVMHAFMVYGIPRGNLKRISLVLKAAYNIGGHTVSVDTIQSS 513
           KL HEEKLAFWIN+HNALVMHAF+ YGIP+ N+KRI L+LKAAYN+GGH VS D IQ+S
Sbjct: 351 RKLTHEEKLAFWINVHNALVMHAFLAYGIPQNNVKRIYLLLKAAYNVGGHVVSADVIQNS 410

Query: 514 ILGCRLPRPTPWLHSLFFPKKKFKAGDPRKAFAVKHSEPRLHFALCSGCQSDPAVRLYTP 693
           ILGCR+ RP WL L K KFKAGD R+ +A++H EP LHF+LCSG SDPAVR+YTP
Sbjct: 411 ILGCRMSRPGQWLRLLLSSKGKFKAGDERQTYAIEHPEPLLHFSLCSGNHSDPAVRVYTP 470

Query: 694 KKVFQELEMAKEDYIQMNIKLRKEQRLLVPKNV 792
           K+VFQELE+AKE+YI+ +RK+Q++++PK V
Sbjct: 471 KRVFQELEVAKEEYIRATFGVRKDQKIVLPKVV 503