BLASTX nr result
ID: Rehmannia31_contig00012903
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00012903 (1567 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
  149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                      Score   E
Sequences producing significant alignments:                           (bits)  Value

gb|PIN10670.1| hypothetical protein CDL12_16736 [Handroanthus im...  521  0.0
ref|XP_012847189.1| PREDICTED: uncharacterized protein LOC105967...  476  e-163
ref|XP_011081734.1| uncharacterized protein LOC105164711 isoform...  463  e-158
ref|XP_011081733.1| uncharacterized protein LOC105164711 isoform...  463  e-158
ref|XP_022884477.1| uncharacterized protein LOC111401129 [Olea e...  442  e-150
ref|XP_022844642.1| uncharacterized protein LOC111367812 [Olea e...  435  e-147
emb|CDP15037.1| unnamed protein product [Coffea canephora]  431  e-145
ref|XP_022893926.1| uncharacterized protein LOC111408391 [Olea e...  422  e-142
ref|XP_021282170.1| uncharacterized protein LOC110415026 [Herran...  417  e-140
ref|XP_011072848.1| uncharacterized protein LOC105157974 [Sesamu...  414  e-139
gb|OMO58077.1| hypothetical protein CCACVL1_25598 [Corchorus cap...  413  e-139
ref|XP_007029708.2| PREDICTED: uncharacterized protein LOC185996...  413  e-139
ref|XP_007029707.2| PREDICTED: uncharacterized protein LOC185996...  413  e-138
ref|XP_022742444.1| uncharacterized protein LOC111293778 isoform...  412  e-138
gb|EOY10210.1| Intracellular protein transport protein USO1 isof...  412  e-138
gb|EOY10211.1| Intracellular protein transport protein USO1 isof...  412  e-138
ref|XP_017985313.1| PREDICTED: uncharacterized protein LOC185996... 
412 e-138 gb|EOY10209.1| Intracellular protein transport protein USO1 isof... 412 e-138 gb|OMO96726.1| hypothetical protein COLO4_15125 [Corchorus olito... 410 e-137 gb|PPS00570.1| hypothetical protein GOBAR_AA20087 [Gossypium bar... 407 e-136 >gb|PIN10670.1| hypothetical protein CDL12_16736 [Handroanthus impetiginosus] Length = 339 Score = 521 bits (1341), Expect = 0.0 Identities = 269/339 (79%), Positives = 282/339 (83%), Gaps = 3/339 (0%) Frame = -1 Query: 1273 MKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKA 1094 MKPR +EAPRR++NLQGEGPNW+LIAGSALLSTLSIRLG+KLKQVFDAKQ DN +R LK Sbjct: 1 MKPRATEAPRRSRNLQGEGPNWVLIAGSALLSTLSIRLGYKLKQVFDAKQTDNGNRGLKV 60 Query: 1093 NGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQ-MSEPEMALPL 917 NGKSTDR KSG+CHLHPNAFCFPQ EDGCYNHY+GSRN VE KQHCNGQ +SEPEMALPL Sbjct: 61 NGKSTDRNKSGSCHLHPNAFCFPQGEDGCYNHYTGSRNVVETKQHCNGQIISEPEMALPL 120 Query: 916 VTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQ 737 VTVPTSE NKENGVMWSSSPDRLELPQKPF SDIFSKREVIQKLRQ Sbjct: 121 VTVPTSELNKENGVMWSSSPDRLELPQKPFHQSNSSESPCVSESGSDIFSKREVIQKLRQ 180 Query: 736 QLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIAD 557 QLKRRDDMILEMQDQIAE LDAANRDLFDSEREIQRLRK IAD Sbjct: 181 QLKRRDDMILEMQDQIAELQNSLSAQLSHSSHLQSHLDAANRDLFDSEREIQRLRKVIAD 240 Query: 556 HCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG--GEKIEMLKREVSELK 383 HCVGH N+ EK PTVPVWP+E RNG NGYP+VE L S EKG GEKIEMLKREVSELK Sbjct: 241 HCVGHINSCEKPPTVPVWPAEGRNGHANGYPKVECILESPEKGREGEKIEMLKREVSELK 300 Query: 382 ELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 ELIEGKDYLLQSYKEQKSELSMKIK+LQQRLDSQLPNIL Sbjct: 301 ELIEGKDYLLQSYKEQKSELSMKIKDLQQRLDSQLPNIL 339 >ref|XP_012847189.1| PREDICTED: uncharacterized protein LOC105967153 [Erythranthe guttata] gb|EYU29300.1| hypothetical protein MIMGU_mgv1a009640mg [Erythranthe guttata] Length = 336 Score = 476 bits (1225), Expect = e-163 Identities = 254/339 (74%), Positives = 271/339 (79%), Gaps = 3/339 (0%) Frame = -1 Query: 1273 
MKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKA 1094 MKPR +EAPRR++N QGEG NW+LIAGSALLSTLSIRLG+KLKQVFDAKQ DNSS+ LKA Sbjct: 1 MKPRTNEAPRRSRNPQGEGNNWMLIAGSALLSTLSIRLGYKLKQVFDAKQVDNSSKKLKA 60 Query: 1093 NGKST-DRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQ-MSEPEMALP 920 NGKS DRKKSG+CHLH NA CFPQDEDGCYNHY SRNA +IKQHCN Q MSE EM LP Sbjct: 61 NGKSADDRKKSGSCHLHSNACCFPQDEDGCYNHYPASRNAADIKQHCNSQTMSESEMVLP 120 Query: 919 LVTVPTSEFNKE-NGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKL 743 LV+VPTSEFNK+ NGVMWSSSPDRLELP KPF SDIFSKREVI KL Sbjct: 121 LVSVPTSEFNKDNNGVMWSSSPDRLELPHKPFHQSNSSESPCVSEAGSDIFSKREVIHKL 180 Query: 742 RQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAI 563 RQQLKRRDDM+LEMQDQIAE LD+ANRDLFDSEREIQRLRKAI Sbjct: 181 RQQLKRRDDMVLEMQDQIAELQNSLSMQLSHSSHQQALLDSANRDLFDSEREIQRLRKAI 240 Query: 562 ADHCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKGGEKIEMLKREVSELK 383 ADHCVGH + P+VP+WP E RNG NGYPEVESNL SS GEKIEMLKREVSELK Sbjct: 241 ADHCVGH--VDKSPPSVPIWPPEGRNGHSNGYPEVESNLESS-LSGEKIEMLKREVSELK 297 Query: 382 ELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 ELI+GKDYLL SYKEQK ELS+KIKELQQRLDSQLPNIL Sbjct: 298 ELIDGKDYLLLSYKEQKCELSVKIKELQQRLDSQLPNIL 336 >ref|XP_011081734.1| uncharacterized protein LOC105164711 isoform X2 [Sesamum indicum] ref|XP_011081735.1| uncharacterized protein LOC105164711 isoform X2 [Sesamum indicum] Length = 354 Score = 463 bits (1192), Expect = e-158 Identities = 252/354 (71%), Positives = 274/354 (77%), Gaps = 6/354 (1%) Frame = -1 Query: 1309 RSFWSLTSIALAMKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDA 1130 RS WSLT I L MKPR E PR +N Q GPNWILIAG ALLSTLSIRLG+KLKQV DA Sbjct: 3 RSIWSLTLITLIMKPRTGEVPRG-RNFQEGGPNWILIAGGALLSTLSIRLGYKLKQVHDA 61 Query: 1129 KQPDNSSRSLKANGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNG 950 KQ DNSS+ LK NGKS D KKS +C LH N+FCF Q +DGCY+ Y+GSRN VEIK NG Sbjct: 62 KQLDNSSQRLK-NGKSDDWKKSESCPLHSNSFCFSQQDDGCYSRYNGSRNVVEIKPQHNG 120 Query: 949 QM-SEPEMALPLVTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDI 773 QM +EPE+ALPLVTVPT+EF 
KENGVMWSSSPD LELP KPF H SDI Sbjct: 121 QMMTEPEVALPLVTVPTAEFQKENGVMWSSSPDCLELPHKPFHHSNSSESPCVSDSGSDI 180 Query: 772 FSKREVIQKLRQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSE 593 FSKREVIQKLRQQLKRRDDMILEMQDQIAE LDAANRDLFDSE Sbjct: 181 FSKREVIQKLRQQLKRRDDMILEMQDQIAELQNSLSAQLSHSSHLQSLLDAANRDLFDSE 240 Query: 592 REIQRLRKAIADHCVGHNNTGEKTPTVPVWPSEM--RNGFVNGYPEVESNLGSSEKG--- 428 REIQRLRK IADHCVG N+G+K+ VPVWP++ NG+ NGY EVESNLGS EKG Sbjct: 241 REIQRLRKVIADHCVGDINSGDKSTAVPVWPAQADGMNGYTNGYLEVESNLGSLEKGRGD 300 Query: 427 GEKIEMLKREVSELKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 GEKIEMLK+EV+ELKELIEGK+YLLQSY+EQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 301 GEKIEMLKKEVNELKELIEGKNYLLQSYREQKTELSMKIKELQQRLDSQLPNIL 354 >ref|XP_011081733.1| uncharacterized protein LOC105164711 isoform X1 [Sesamum indicum] Length = 364 Score = 463 bits (1192), Expect = e-158 Identities = 252/354 (71%), Positives = 274/354 (77%), Gaps = 6/354 (1%) Frame = -1 Query: 1309 RSFWSLTSIALAMKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDA 1130 RS WSLT I L MKPR E PR +N Q GPNWILIAG ALLSTLSIRLG+KLKQV DA Sbjct: 13 RSIWSLTLITLIMKPRTGEVPRG-RNFQEGGPNWILIAGGALLSTLSIRLGYKLKQVHDA 71 Query: 1129 KQPDNSSRSLKANGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNG 950 KQ DNSS+ LK NGKS D KKS +C LH N+FCF Q +DGCY+ Y+GSRN VEIK NG Sbjct: 72 KQLDNSSQRLK-NGKSDDWKKSESCPLHSNSFCFSQQDDGCYSRYNGSRNVVEIKPQHNG 130 Query: 949 QM-SEPEMALPLVTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDI 773 QM +EPE+ALPLVTVPT+EF KENGVMWSSSPD LELP KPF H SDI Sbjct: 131 QMMTEPEVALPLVTVPTAEFQKENGVMWSSSPDCLELPHKPFHHSNSSESPCVSDSGSDI 190 Query: 772 FSKREVIQKLRQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSE 593 FSKREVIQKLRQQLKRRDDMILEMQDQIAE LDAANRDLFDSE Sbjct: 191 FSKREVIQKLRQQLKRRDDMILEMQDQIAELQNSLSAQLSHSSHLQSLLDAANRDLFDSE 250 Query: 592 REIQRLRKAIADHCVGHNNTGEKTPTVPVWPSEM--RNGFVNGYPEVESNLGSSEKG--- 428 REIQRLRK IADHCVG N+G+K+ VPVWP++ NG+ NGY EVESNLGS EKG Sbjct: 251 REIQRLRKVIADHCVGDINSGDKSTAVPVWPAQADGMNGYTNGYLEVESNLGSLEKGRGD 310 Query: 427 
GEKIEMLKREVSELKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 GEKIEMLK+EV+ELKELIEGK+YLLQSY+EQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 311 GEKIEMLKKEVNELKELIEGKNYLLQSYREQKTELSMKIKELQQRLDSQLPNIL 364 >ref|XP_022884477.1| uncharacterized protein LOC111401129 [Olea europaea var. sylvestris] Length = 340 Score = 442 bits (1138), Expect = e-150 Identities = 235/340 (69%), Positives = 261/340 (76%), Gaps = 4/340 (1%) Frame = -1 Query: 1273 MKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKA 1094 MK R +E PR ++ LQ EGPNW+LIAGSALLS LS+RLGFKLKQV DAK+P+NS LK Sbjct: 1 MKRRTNEVPRSSRGLQVEGPNWVLIAGSALLSALSVRLGFKLKQVLDAKRPENSGNLLKG 60 Query: 1093 NGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQ-MSEPEMALPL 917 NGKSTD KK H+H NAF FPQDE+GC+N YSGS N +EIKQ C+GQ M+EPEM LPL Sbjct: 61 NGKSTDEKKLRNSHMHSNAFRFPQDENGCHNCYSGSGNMLEIKQQCDGQMMTEPEMVLPL 120 Query: 916 VTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQ 737 VTVP SE KENGV+W+SSPDRLELPQKPF SDIFSKREVIQKLRQ Sbjct: 121 VTVPASELRKENGVIWASSPDRLELPQKPFHRSNSSDSPCVSESGSDIFSKREVIQKLRQ 180 Query: 736 QLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIAD 557 QLKRRDDMILEMQDQI E LDAANRD+FDSEREIQRLRKAIAD Sbjct: 181 QLKRRDDMILEMQDQITELQNSLSAQLSHSSHLQLLLDAANRDIFDSEREIQRLRKAIAD 240 Query: 556 HCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG-GE--KIEMLKREVSEL 386 HCVGH N+ PTVP WPS RNG +N Y +VESNL S EKG G+ KIEML++EVSEL Sbjct: 241 HCVGHVNSSNNPPTVPAWPSGGRNGHLNAYLKVESNLESLEKGRGDEGKIEMLRQEVSEL 300 Query: 385 KELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 KE+IEGK+YLLQSYKEQK+ELS KIKELQQRLDSQLP+IL Sbjct: 301 KEVIEGKEYLLQSYKEQKAELSEKIKELQQRLDSQLPHIL 340 >ref|XP_022844642.1| uncharacterized protein LOC111367812 [Olea europaea var. 
sylvestris] Length = 340 Score = 435 bits (1119), Expect = e-147 Identities = 227/340 (66%), Positives = 260/340 (76%), Gaps = 4/340 (1%) Frame = -1 Query: 1273 MKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKA 1094 MK R +E PR ++ LQGEGPNW+LIAG ALLSTLS+RLG+KLKQV DAK+P+ SS LK Sbjct: 1 MKRRTNEVPRSSRGLQGEGPNWVLIAGGALLSTLSVRLGYKLKQVLDAKRPETSSNLLKG 60 Query: 1093 NGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQ-MSEPEMALPL 917 +GK TD KKS HLH N++CFP+D GC+ YSGSRN VEIKQ CNGQ ++EPE+ALPL Sbjct: 61 SGKFTDEKKSRNSHLHSNSYCFPRDGHGCHKCYSGSRNMVEIKQQCNGQIVTEPEIALPL 120 Query: 916 VTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQ 737 VTVP SEF+KENG +W+SSPDRLEL QKPF SDIFSKREVIQKLR+ Sbjct: 121 VTVPASEFSKENGAIWASSPDRLELLQKPFHQSNSSDSPCVSESGSDIFSKREVIQKLRR 180 Query: 736 QLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIAD 557 QLKRRDDMILEMQDQI E LDAANRD+FDSEREIQRLRKAIAD Sbjct: 181 QLKRRDDMILEMQDQITELQNSLGAQLSHSSHLQSLLDAANRDIFDSEREIQRLRKAIAD 240 Query: 556 HCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKGG---EKIEMLKREVSEL 386 HCVGH ++ + P VP WP+ RNG NGY +VESN+ SSE GG KIEML+REVSEL Sbjct: 241 HCVGHVDSSDNPPMVPAWPTGGRNGHSNGYLKVESNVESSENGGGDEGKIEMLRREVSEL 300 Query: 385 KELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 KE++EGK+YLLQSYK QK+ELS KIKELQQRLDSQLP+IL Sbjct: 301 KEVVEGKEYLLQSYKGQKAELSEKIKELQQRLDSQLPHIL 340 >emb|CDP15037.1| unnamed protein product [Coffea canephora] Length = 341 Score = 431 bits (1107), Expect = e-145 Identities = 229/341 (67%), Positives = 259/341 (75%), Gaps = 5/341 (1%) Frame = -1 Query: 1273 MKPRNSEAPR--RTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSL 1100 MKP + PR R + Q EGPNW+LIAG ALLSTLSIRLG+KLKQV D K PDN+S SL Sbjct: 1 MKPIANGVPRTQRQKGFQSEGPNWVLIAGGALLSTLSIRLGYKLKQVLDMKPPDNTSNSL 60 Query: 1099 KANGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQ-MSEPEMAL 923 K +GK T+RKKSG+C LHPNA+ F QD + C N SGS N +EIKQ NGQ +SEPEMAL Sbjct: 61 KGSGKFTERKKSGSCSLHPNAYSFHQDGNACCNCLSGSVNVMEIKQQRNGQVLSEPEMAL 120 Query: 922 
PLVTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKL 743 PLV V +SEF+KENGV+W+SSPDRLELPQKPF H SDIFS REVIQKL Sbjct: 121 PLVKVSSSEFSKENGVIWASSPDRLELPQKPFHHSNSSDSPCVSEAGSDIFSNREVIQKL 180 Query: 742 RQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAI 563 RQQLKRRDDMI+EMQDQI E LDAANRDLFDSEREIQRLRK I Sbjct: 181 RQQLKRRDDMIIEMQDQIVELQNSLSTQLTHSTQLQALLDAANRDLFDSEREIQRLRKVI 240 Query: 562 ADHCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEK--GGEKIEMLKREVSE 389 ADHCVG +N G+K + PVWP+E+RNG +N Y EVE +L S EK G KIEML+REV+E Sbjct: 241 ADHCVGQDNCGDKLSSAPVWPAELRNGHLNEYSEVEGHLDSLEKDRNGGKIEMLRREVNE 300 Query: 388 LKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 L+E+I+GKDYLLQ+YKEQKSELSMKIKELQQRLDSQLPNIL Sbjct: 301 LREVIDGKDYLLQNYKEQKSELSMKIKELQQRLDSQLPNIL 341 >ref|XP_022893926.1| uncharacterized protein LOC111408391 [Olea europaea var. sylvestris] ref|XP_022893929.1| uncharacterized protein LOC111408391 [Olea europaea var. sylvestris] ref|XP_022893935.1| uncharacterized protein LOC111408391 [Olea europaea var. 
sylvestris] Length = 344 Score = 422 bits (1085), Expect = e-142 Identities = 223/342 (65%), Positives = 253/342 (73%), Gaps = 6/342 (1%) Frame = -1 Query: 1273 MKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKA 1094 MKPR +E RR++ LQ EGPNW+LIAGSALLSTL+IRLG+K+KQV D K+P+NS+ +LK Sbjct: 5 MKPRTNEVSRRSRGLQEEGPNWVLIAGSALLSTLAIRLGYKVKQVLDTKKPENSNNNLKG 64 Query: 1093 NGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPL 917 NGKSTD KS + H P+A+CFP DGCYN YSGSRN VEIKQ NG M E EM LPL Sbjct: 65 NGKSTDENKSSSFHFQPSAYCFPDHVDGCYNSYSGSRNVVEIKQQANGHMVPEHEMVLPL 124 Query: 916 VTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQ 737 VT P EF+KENGV+W+SSPD LELPQKPF SDIFSKREVIQKLRQ Sbjct: 125 VTRPAPEFSKENGVLWASSPDHLELPQKPFHQSNSSDSLCVSESGSDIFSKREVIQKLRQ 184 Query: 736 QLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIAD 557 QL+RRDD ILEMQDQI E LDAANRDLFDSE EIQRLRKAIAD Sbjct: 185 QLRRRDDTILEMQDQITELHNSLNSQLSCSSHLQSLLDAANRDLFDSESEIQRLRKAIAD 244 Query: 556 HCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESN-----LGSSEKGGEKIEMLKREVS 392 HCV H ++ K P VP+WP+E RNG N Y EVE++ G E+ EKIEMLKREVS Sbjct: 245 HCVEHIDSRYKPPAVPIWPTEGRNGHANEYLEVENSSMYPKTGKGER--EKIEMLKREVS 302 Query: 391 ELKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 ELKE+IEGK+YLL SYK+QK+ELS+KIKELQQRLDSQLPNIL Sbjct: 303 ELKEVIEGKEYLLSSYKDQKAELSVKIKELQQRLDSQLPNIL 344 >ref|XP_021282170.1| uncharacterized protein LOC110415026 [Herrania umbratica] Length = 340 Score = 417 bits (1071), Expect = e-140 Identities = 227/342 (66%), Positives = 253/342 (73%), Gaps = 6/342 (1%) Frame = -1 Query: 1273 MKPRNSEAPR--RTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSL 1100 M R+ R +++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SL Sbjct: 1 MNTRSGRVSRGEKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSL 60 Query: 1099 KANGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQ-MSEPEMAL 923 K +G S DR++S C LH N F F Q++DGC+N SG+ + E K NGQ + E E+ L Sbjct: 61 KGHGTS-DRRRSSGCRLHSNMFSFTQEDDGCFNCISGTESMGE-KHPPNGQILPESEVTL 118 Query: 922 
PLVTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKL 743 PLVTVPTSEFNK+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKL Sbjct: 119 PLVTVPTSEFNKDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKL 178 Query: 742 RQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAI 563 RQQLKRRDDMILEMQDQI E LDA+NRDLFDSEREIQRLRKAI Sbjct: 179 RQQLKRRDDMILEMQDQIMELQNSLNAQVAHSSHLQSQLDASNRDLFDSEREIQRLRKAI 238 Query: 562 ADHCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVS 392 ADHCVGH T EKT TV WP ++RNG NGY E ESN GS EKG GE+IEMLKREV Sbjct: 239 ADHCVGHVGTNEKTTTVTAWPPDIRNGHANGYLEGESNSGSPEKGRGDGERIEMLKREVG 298 Query: 391 ELKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 ELKE+IEGK+YLLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 299 ELKEVIEGKEYLLQSYKEQKTELSMKIKELQQRLDSQLPNIL 340 >ref|XP_011072848.1| uncharacterized protein LOC105157974 [Sesamum indicum] ref|XP_020547840.1| uncharacterized protein LOC105157974 [Sesamum indicum] ref|XP_020547841.1| uncharacterized protein LOC105157974 [Sesamum indicum] ref|XP_020547842.1| uncharacterized protein LOC105157974 [Sesamum indicum] Length = 339 Score = 414 bits (1065), Expect = e-139 Identities = 230/340 (67%), Positives = 250/340 (73%), Gaps = 4/340 (1%) Frame = -1 Query: 1273 MKPRNSEAPRRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKA 1094 M PR PRR++N Q G NWILIAG ALLSTLSIRLG+KLKQV D KQ +NSS+SLK Sbjct: 1 MNPRTRGVPRRSRNFQEGGLNWILIAGGALLSTLSIRLGYKLKQVLDTKQLNNSSQSLK- 59 Query: 1093 NGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPL 917 +GKS KKS +C LH + CF Q+EDGCY+ Y+GS N VEIKQ NGQM +E +M LPL Sbjct: 60 DGKSDIWKKSASCPLHADGLCFTQEEDGCYSGYNGSTNMVEIKQQDNGQMITERKMPLPL 119 Query: 916 VTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQ 737 V VPT EFNKENGVMWSSSPDRLELP KP H S IFSK EVIQKLRQ Sbjct: 120 VIVPTPEFNKENGVMWSSSPDRLELPHKPSHHSNSSESPCVSESGSCIFSKGEVIQKLRQ 179 Query: 736 QLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIAD 557 QLKRRDDMILEMQDQIAE LDAANRDLFDSEREIQRLRK IAD Sbjct: 180 QLKRRDDMILEMQDQIAELKNSLSSELSHSSHLQSLLDAANRDLFDSEREIQRLRKVIAD 
239 Query: 556 HCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKGG---EKIEMLKREVSEL 386 HCVGH +G+K PVW +E NG NGYP+VE L SSEKG +KIEMLK EVSEL Sbjct: 240 HCVGHIYSGQKLTADPVWLAEGMNGHTNGYPKVEGKLESSEKGRGEVDKIEMLKGEVSEL 299 Query: 385 KELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 +ELIEGKDYLLQSYKEQK ELS+KIKELQQRLDSQLPNIL Sbjct: 300 RELIEGKDYLLQSYKEQKWELSVKIKELQQRLDSQLPNIL 339 >gb|OMO58077.1| hypothetical protein CCACVL1_25598 [Corchorus capsularis] Length = 340 Score = 413 bits (1062), Expect = e-139 Identities = 223/331 (67%), Positives = 251/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 +R++ Q EGPNWILIAG ALLSTLSIRLG+KLKQ D KQ +N++ SLK NG ++ R+ Sbjct: 12 QRSKQFQAEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKNNATSSLKGNGNASRRRS 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N + F Q++DGC+N SG+ + E K NGQM E E+ALPLVTVPTSEFN Sbjct: 72 SG-CPLHSNMYSFAQEDDGCFNCISGTESIGE-KHPPNGQMLPESEVALPLVTVPTSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NR+LFDSEREIQRLRKAIADHCVG T Sbjct: 190 LEMQDQIMELRNSLNSQVAHSNHLQSQLDASNRELFDSEREIQRLRKAIADHCVGQVGTN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WPS+MRNG VNGY + ESNLGS EKG GE+IEML+REV ELKE+IEGK+Y Sbjct: 250 EKTSTVTAWPSDMRNGHVNGYLDGESNLGSPEKGRGDGERIEMLRREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKAELSMKIKELQQRLDSQLPNIL 340 >ref|XP_007029708.2| PREDICTED: uncharacterized protein LOC18599603 isoform X3 [Theobroma cacao] Length = 340 Score = 413 bits (1062), Expect = e-139 Identities = 224/331 (67%), Positives = 249/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 
RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 ++++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SLK +G S R+ Sbjct: 12 QKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSLKGHGTSDRRRL 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N F F Q+EDGC+N SG+ + E K NGQM E E+ALPLVTVP SEFN Sbjct: 72 SG-CRLHSNMFSFTQEEDGCFNCISGTESIGE-KHPPNGQMLPESEVALPLVTVPMSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NRDLFDSEREIQRLRKAIADHCVGH + Sbjct: 190 LEMQDQIMELQNSLNAQVAHSSHLQAQLDASNRDLFDSEREIQRLRKAIADHCVGHVSMN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WP ++RNG NGY + ESN GS EKG GE+IEMLKREV ELKE+IEGK+Y Sbjct: 250 EKTTTVTAWPPDIRNGHANGYLDGESNSGSPEKGRGDGERIEMLKREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKTELSMKIKELQQRLDSQLPNIL 340 >ref|XP_007029707.2| PREDICTED: uncharacterized protein LOC18599603 isoform X1 [Theobroma cacao] Length = 362 Score = 413 bits (1062), Expect = e-138 Identities = 224/331 (67%), Positives = 249/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 ++++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SLK +G S R+ Sbjct: 12 QKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSLKGHGTSDRRRL 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N F F Q+EDGC+N SG+ + E K NGQM E E+ALPLVTVP SEFN Sbjct: 72 SG-CRLHSNMFSFTQEEDGCFNCISGTESIGE-KHPPNGQMLPESEVALPLVTVPMSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 
LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NRDLFDSEREIQRLRKAIADHCVGH + Sbjct: 190 LEMQDQIMELQNSLNAQVAHSSHLQAQLDASNRDLFDSEREIQRLRKAIADHCVGHVSMN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WP ++RNG NGY + ESN GS EKG GE+IEMLKREV ELKE+IEGK+Y Sbjct: 250 EKTTTVTAWPPDIRNGHANGYLDGESNSGSPEKGRGDGERIEMLKREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKTELSMKIKELQQRLDSQLPNIL 340 >ref|XP_022742444.1| uncharacterized protein LOC111293778 isoform X2 [Durio zibethinus] Length = 339 Score = 412 bits (1059), Expect = e-138 Identities = 224/342 (65%), Positives = 254/342 (74%), Gaps = 6/342 (1%) Frame = -1 Query: 1273 MKPRNSEAPR--RTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSL 1100 M R+ + R +++N +GEGPNW+LIAG ALLSTLSIRLG+KLKQ D KQ DN++ +L Sbjct: 1 MNTRSGQVSRGEKSKNFKGEGPNWVLIAGGALLSTLSIRLGYKLKQSLDTKQQDNATTTL 60 Query: 1099 KANGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQMS-EPEMAL 923 K NG S+DR++S CHLH N + F Q++DGC+N SG+ + E H NGQM E E+AL Sbjct: 61 KGNG-SSDRRRSSGCHLHSNMYSF-QEDDGCFNCISGAESIGEKHPH-NGQMQLESEVAL 117 Query: 922 PLVTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKL 743 PLVTVPTSEFNK+NGVMW+SSPDR ELP KPF H SDIFSKREVIQKL Sbjct: 118 PLVTVPTSEFNKDNGVMWASSPDRHELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKL 177 Query: 742 RQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAI 563 RQQLKRRDDMILEMQDQI E +DAANRDLFDSEREIQRLRKAI Sbjct: 178 RQQLKRRDDMILEMQDQIMELQNSLNAQVAHSGHLQSLVDAANRDLFDSEREIQRLRKAI 237 Query: 562 ADHCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVS 392 ADHC GH T KT V WPS++RNG NGY + ESNLGS EKG GE+IEMLKREV Sbjct: 238 ADHCAGHVGTNGKTSAVTSWPSDIRNGHANGYLDGESNLGSPEKGRGDGERIEMLKREVG 297 Query: 391 ELKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 ELKE+IEGK+YLLQSYKEQK ELSMKIKELQQRLDSQLPNIL Sbjct: 298 ELKEVIEGKEYLLQSYKEQKMELSMKIKELQQRLDSQLPNIL 339 >gb|EOY10210.1| Intracellular protein transport protein USO1 isoform 2 
[Theobroma cacao] Length = 340 Score = 412 bits (1058), Expect = e-138 Identities = 223/331 (67%), Positives = 249/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 ++++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SLK +G S R+ Sbjct: 12 QKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSLKGHGTSDRRRL 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N F F Q++DGC+N SG+ + E K NG M E E+ALPLVTVPTSEFN Sbjct: 72 SG-CRLHSNMFSFTQEDDGCFNCISGTESIGE-KHPPNGLMLPESEVALPLVTVPTSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NRDLFDSEREIQRLRKAIADHCVGH + Sbjct: 190 LEMQDQIMELQNSLNAQVAHSSHLQAQLDASNRDLFDSEREIQRLRKAIADHCVGHVSMN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WP ++RNG NGY + ESN GS EKG GE+IEMLKREV ELKE+IEGK+Y Sbjct: 250 EKTTTVTAWPPDIRNGHANGYLDGESNSGSPEKGRGDGERIEMLKREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKTELSMKIKELQQRLDSQLPNIL 340 >gb|EOY10211.1| Intracellular protein transport protein USO1 isoform 3 [Theobroma cacao] Length = 344 Score = 412 bits (1058), Expect = e-138 Identities = 223/331 (67%), Positives = 249/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 ++++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SLK +G S R+ Sbjct: 12 QKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSLKGHGTSDRRRL 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N F F Q++DGC+N SG+ + E K NG M E E+ALPLVTVPTSEFN Sbjct: 72 SG-CRLHSNMFSFTQEDDGCFNCISGTESIGE-KHPPNGLMLPESEVALPLVTVPTSEFN 129 Query: 889 
KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NRDLFDSEREIQRLRKAIADHCVGH + Sbjct: 190 LEMQDQIMELQNSLNAQVAHSSHLQAQLDASNRDLFDSEREIQRLRKAIADHCVGHVSMN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WP ++RNG NGY + ESN GS EKG GE+IEMLKREV ELKE+IEGK+Y Sbjct: 250 EKTTTVTAWPPDIRNGHANGYLDGESNSGSPEKGRGDGERIEMLKREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKTELSMKIKELQQRLDSQLPNIL 340 >ref|XP_017985313.1| PREDICTED: uncharacterized protein LOC18599603 isoform X2 [Theobroma cacao] Length = 347 Score = 412 bits (1058), Expect = e-138 Identities = 223/330 (67%), Positives = 248/330 (75%), Gaps = 4/330 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 ++++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SLK +G S R+ Sbjct: 12 QKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSLKGHGTSDRRRL 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N F F Q+EDGC+N SG+ + E K NGQM E E+ALPLVTVP SEFN Sbjct: 72 SG-CRLHSNMFSFTQEEDGCFNCISGTESIGE-KHPPNGQMLPESEVALPLVTVPMSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NRDLFDSEREIQRLRKAIADHCVGH + Sbjct: 190 LEMQDQIMELQNSLNAQVAHSSHLQAQLDASNRDLFDSEREIQRLRKAIADHCVGHVSMN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WP ++RNG NGY + ESN GS EKG GE+IEMLKREV ELKE+IEGK+Y Sbjct: 250 EKTTTVTAWPPDIRNGHANGYLDGESNSGSPEKGRGDGERIEMLKREVGELKEVIEGKEY 309 Query: 358 
LLQSYKEQKSELSMKIKELQQRLDSQLPNI 269 LLQSYKEQK+ELSMKIKELQQRLDSQLPNI Sbjct: 310 LLQSYKEQKTELSMKIKELQQRLDSQLPNI 339 >gb|EOY10209.1| Intracellular protein transport protein USO1 isoform 1 [Theobroma cacao] Length = 362 Score = 412 bits (1058), Expect = e-138 Identities = 223/331 (67%), Positives = 249/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 ++++N QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ DN++ SLK +G S R+ Sbjct: 12 QKSKNFQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKDNATTSLKGHGTSDRRRL 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N F F Q++DGC+N SG+ + E K NG M E E+ALPLVTVPTSEFN Sbjct: 72 SG-CRLHSNMFSFTQEDDGCFNCISGTESIGE-KHPPNGLMLPESEVALPLVTVPTSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K+NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KDNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NRDLFDSEREIQRLRKAIADHCVGH + Sbjct: 190 LEMQDQIMELQNSLNAQVAHSSHLQAQLDASNRDLFDSEREIQRLRKAIADHCVGHVSMN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WP ++RNG NGY + ESN GS EKG GE+IEMLKREV ELKE+IEGK+Y Sbjct: 250 EKTTTVTAWPPDIRNGHANGYLDGESNSGSPEKGRGDGERIEMLKREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKTELSMKIKELQQRLDSQLPNIL 340 >gb|OMO96726.1| hypothetical protein COLO4_15125 [Corchorus olitorius] Length = 340 Score = 410 bits (1053), Expect = e-137 Identities = 223/331 (67%), Positives = 249/331 (75%), Gaps = 4/331 (1%) Frame = -1 Query: 1246 RRTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSLKANGKSTDRKK 1067 +R++ +QGEGPNWILIAG ALLSTLSIRLG+KLKQ D KQ +N++ SLK NG + R+ Sbjct: 12 QRSKQVQGEGPNWILIAGGALLSTLSIRLGYKLKQALDTKQKNNATTSLKGNGNAGRRRS 71 Query: 1066 SGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHCNGQM-SEPEMALPLVTVPTSEFN 890 SG C LH N + F 
Q++DGC+N SG+ E K NGQM E E+ALPLVTVPTSEFN Sbjct: 72 SG-CPLHSNMYSFAQEDDGCFNCISGTECIGE-KHPPNGQMLPESEVALPLVTVPTSEFN 129 Query: 889 KENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQKLRQQLKRRDDMI 710 K NGVMW+SSPDRLELP KPF H SDIFSKREVIQKLRQQLKRRDDMI Sbjct: 130 KHNGVMWASSPDRLELPPKPFHHSNCSDSPCVSESGSDIFSKREVIQKLRQQLKRRDDMI 189 Query: 709 LEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKAIADHCVGHNNTG 530 LEMQDQI E LDA+NR+LFDSEREIQRLRKAIADHCVG T Sbjct: 190 LEMQDQIMELQNSLNSQVAHSNHLQSQLDASNRELFDSEREIQRLRKAIADHCVGQVGTN 249 Query: 529 EKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREVSELKELIEGKDY 359 EKT TV WPS+MRNG VNGY + ESNL S EKG GE+IEML+REV ELKE+IEGK+Y Sbjct: 250 EKTSTVTAWPSDMRNGHVNGYLDGESNLDSPEKGRGDGERIEMLRREVGELKEVIEGKEY 309 Query: 358 LLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 LLQSYKEQK+ELSMKIKELQQRLDSQLPNIL Sbjct: 310 LLQSYKEQKAELSMKIKELQQRLDSQLPNIL 340 >gb|PPS00570.1| hypothetical protein GOBAR_AA20087 [Gossypium barbadense] Length = 339 Score = 407 bits (1047), Expect = e-136 Identities = 222/343 (64%), Positives = 254/343 (74%), Gaps = 7/343 (2%) Frame = -1 Query: 1273 MKPRNSEAPR--RTQNLQGEGPNWILIAGSALLSTLSIRLGFKLKQVFDAKQPDNSSRSL 1100 M RNS R +++N QGEGPNWILIAG ALLSTLS+RLG+KLKQ D KQ DN++ SL Sbjct: 1 MNTRNSRVSRGQKSKNFQGEGPNWILIAGGALLSTLSVRLGYKLKQALDTKQQDNATASL 60 Query: 1099 KANGKSTDRKKSGTCHLHPNAFCFPQDEDGCYNHYSGSRNAVEIKQHC-NGQ-MSEPEMA 926 K NG S DR++S C LH N + F +++DGC+N SG+ N I++H NGQ + E E+A Sbjct: 61 KENGTS-DRRRSSGCRLHSNMYAFTEEDDGCFNCMSGAEN---IEKHPPNGQILPESEVA 116 Query: 925 LPLVTVPTSEFNKENGVMWSSSPDRLELPQKPFQHXXXXXXXXXXXXXSDIFSKREVIQK 746 LPLVTVPTS+F+K+NGVMW+SSPDRLELP +PF H SDIFSKREVIQK Sbjct: 117 LPLVTVPTSDFSKDNGVMWASSPDRLELPPRPFHHSNCSDSPCVSESGSDIFSKREVIQK 176 Query: 745 LRQQLKRRDDMILEMQDQIAEXXXXXXXXXXXXXXXXXXLDAANRDLFDSEREIQRLRKA 566 LRQ LKRRDDMILEMQDQI E LDAANRDLFDSEREIQRLRKA Sbjct: 177 LRQHLKRRDDMILEMQDQIMELQNSLNAQVAHSTHLQSQLDAANRDLFDSEREIQRLRKA 236 Query: 565 IADHCVGHNNTGEKTPTVPVWPSEMRNGFVNGYPEVESNLGSSEKG---GEKIEMLKREV 395 IADHCVG+ T + T VWPS++RNG NGY +VESN S EKG GE+IEMLKREV 
Sbjct: 237 IADHCVGYGGTNKMTSIDTVWPSDIRNGHANGYLDVESNSDSPEKGRGDGERIEMLKREV 296 Query: 394 SELKELIEGKDYLLQSYKEQKSELSMKIKELQQRLDSQLPNIL 266 ELKE+IEGK+YLLQSYKEQK ELSMKIKELQQRLDSQLPNIL Sbjct: 297 GELKEVIEGKEYLLQSYKEQKLELSMKIKELQQRLDSQLPNIL 339