BLASTX nr result
ID: Rehmannia32_contig00007393
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia32_contig00007393 (2578 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PIN11890.1| hypothetical protein CDL12_15503 [Handroanthus im... 201 4e-51 ref|XP_011071645.1| uncharacterized protein LOC105157045 [Sesamu... 186 6e-47 ref|XP_011101871.1| uncharacterized protein LOC105179909 [Sesamu... 185 2e-45 ref|XP_020549416.1| uncharacterized protein LOC110011984 isoform... 170 9e-42 ref|XP_020549415.1| uncharacterized protein LOC110011984 isoform... 170 1e-41 ref|XP_020549414.1| uncharacterized protein LOC110011984 isoform... 170 1e-41 gb|KZV41985.1| hypothetical protein F511_16933 [Dorcoceras hygro... 168 3e-40 ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966... 165 3e-39 ref|XP_024035374.1| extensin-like [Citrus clementina] 162 4e-38 gb|KZV16468.1| hypothetical protein F511_10080 [Dorcoceras hygro... 151 7e-38 gb|PIN08004.1| hypothetical protein CDL12_19422 [Handroanthus im... 154 5e-37 ref|XP_022866206.1| uncharacterized protein LOC111386010 [Olea e... 150 6e-37 gb|PIM98497.1| hypothetical protein CDL12_29024 [Handroanthus im... 155 6e-37 emb|CDP20930.1| unnamed protein product [Coffea canephora] 156 8e-37 gb|KZV31111.1| hypothetical protein F511_14992 [Dorcoceras hygro... 152 3e-35 ref|XP_020552547.1| uncharacterized protein LOC110012599 [Sesamu... 145 3e-35 gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] 156 5e-35 ref|XP_019161197.1| PREDICTED: uncharacterized protein LOC109157... 147 1e-34 ref|XP_019159873.1| PREDICTED: uncharacterized protein LOC109156... 142 2e-34 ref|XP_019186523.1| PREDICTED: uncharacterized protein LOC109181... 142 3e-34 >gb|PIN11890.1| hypothetical protein CDL12_15503 [Handroanthus impetiginosus] Length = 694 Score = 201 bits (512), Expect = 4e-51 Identities = 108/284 (38%), Positives = 149/284 (52%), Gaps = 15/284 (5%) Frame = -3 Query: 2060 GSSPPDGQYTSD------NPPLKSYANV---------AGSSSPSQINLSFDPKKIIPIGT 1926 G +PP+ T + +PP +SYA A + +D K++ +G Sbjct: 9 GGTPPNQTSTENTPEKPPDPPQRSYAAALHGRPSMAEAHGGKRDALRSFYDEKQLKTLGQ 68 Query: 1925 SENKEGQKALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFA 1746 + +K + FS E L + LIGKFS P Q + ++ G+KG F+ Sbjct: 69 ISKYQTRKTIKFSEQEMADLSKPFNFALIGKFSHGYPPMQTLRLKMAGFGLKGDFNIGVL 128 Query: 1745 NQSHIIIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPI 1566 N H++I+L LEEDY LWM +W F PMR+ KWTPTFNP+ E+ L PVWI LP LPI Sbjct: 129 NIKHVLIRLTLEEDYTRLWMRQLWFFDGFPMRLFKWTPTFNPREESSLIPVWINLPNLPI 188 Query: 1565 QFFDYHALFAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHT 1386 QFF+ +ALF+IS +G PL++D TA TR S ARVC ++F + Sbjct: 189 QFFNKNALFSISSMIGTPLRIDEATAALTRPSNARVCIEIDLMQQLEREIDIRFNDALWI 248 Query: 1385 QKIIYERVPPYCNFCKHIGHSLEDCYMNGNKAKPPPPVRHPTTQ 1254 Q++ YER+P YC CKH+GH +E+CY N A P H Q Sbjct: 249 QRVEYERLPKYCMQCKHLGHGVEECY-EANPALKVAPRAHARKQ 291 >ref|XP_011071645.1| uncharacterized protein LOC105157045 [Sesamum indicum] Length = 507 Score = 186 bits (471), Expect = 6e-47 Identities = 109/317 (34%), Positives = 153/317 (48%), Gaps = 7/317 (2%) Frame = -3 Query: 2048 PDGQYTSDNPPLKSYANVAGSSSPSQINLSFDPKKIIP-------IGTSENKEGQKALLF 1890 P + N P K++A V + S+ + P K P IGT + LLF Sbjct: 58 PSSSIPTSNFPKKTFAEVLAPTRASK-PATPAPHKYFPVDLPSPGIGTVLTGDKGPTLLF 116 Query: 1889 SSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLE 1710 + ETE L A ++ L+GKFS P + K ++ GIK F+ S N H++I L E Sbjct: 117 TDDETEVLAAPFKFALVGKFSHGAPSYSILHKLIAGTGIKNKFTVSMLNTRHVLISLSCE 176 Query: 1709 EDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAIS 1530 D++ LW+ IW PMR+ KWTP F P E+ + PVW+ P LP F LF ++ Sbjct: 177 ADFSRLWLRRIWYIQGYPMRVFKWTPAFTPSKESSIVPVWVSFPELPAHLFRKEVLFTVA 236 Query: 1529 KELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPPYC 1350 +G PLQ+D T +++LS AR C ++ T+ Q+I YE +P YC Sbjct: 237 SMIGTPLQIDDATLNQSKLSKARACIELDLLKPRLENFQIQICGTTIVQRIEYEDIPHYC 296 Query: 1349 NFCKHIGHSLEDCYMNGNKAKPPPPVRHPTTQAGNSNCGGKQVVTSNPAWIMVNKKGAKE 1170 + CKH+GH DCY G+ KPPP R P SN GK+V + K AKE Sbjct: 297 SLCKHVGHQDSDCYTKGDAPKPPP--RKP------SNRAGKKVAEE----VGRGKAVAKE 344 Query: 1169 TGLVPKELLNTTKKSKH 1119 TG K + K ++ Sbjct: 345 TGESSKMMDQPAKDPRY 361 >ref|XP_011101871.1| uncharacterized protein LOC105179909 [Sesamum indicum] Length = 733 Score = 185 bits (470), Expect = 2e-45 Identities = 86/220 (39%), Positives = 125/220 (56%) Frame = -3 Query: 1937 PIGTSENKEGQKALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFS 1758 P+G +G+ + F++ ETE L A +R +L+GKFS P + + ++ LGI+G F+ Sbjct: 71 PLGIKSVNQGRPTISFTNTETEELAAPFRFSLVGKFSHGAPPYSQMHQLIARLGIQGAFT 130 Query: 1757 WSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLP 1578 S N H +I L E DY+ LW+ IW PMRI KWTPTF P E+ + P+++ P Sbjct: 131 VSMINSKHTLISLSCESDYSRLWLRRIWFLQGFPMRIFKWTPTFTPTQESSVVPIFVCFP 190 Query: 1577 GLPIQFFDYHALFAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGN 1398 LP F ALF+++ +G+PLQ+D T K++LS ARVC L + Sbjct: 191 KLPAHLFHKEALFSVASMVGSPLQIDALTLNKSKLSQARVCVEIDLLKPIIEEFDLHIND 250 Query: 1397 TSHTQKIIYERVPPYCNFCKHIGHSLEDCYMNGNKAKPPP 1278 + QK+++E +P YC CKH+GH DC+ GN KPPP Sbjct: 251 VTIVQKVVFEYLPKYCFLCKHVGHKDSDCFSKGNAPKPPP 290 >ref|XP_020549416.1| uncharacterized protein LOC110011984 isoform X3 [Sesamum indicum] Length = 468 Score = 170 bits (430), Expect = 9e-42 Identities = 86/244 (35%), Positives = 123/244 (50%) Frame = -3 Query: 2009 SYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTLIGKF 1830 S+A+ ++P + L+ P I GT E LLF E E L A +R L+GKF Sbjct: 177 SWASKIAKTAPHKYFLADSPPPAI--GTILTGEKGPTLLFIDAEIEVLAAPFRFALVGKF 234 Query: 1829 SFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPMR 1650 S P + K ++ IK F+ N H++I L E DY+ LW+ IW PMR Sbjct: 235 SHGAPSYSILHKLIAGTDIKNNFTVIMLNNRHVLISLSCEADYSRLWLRRIWYIQGYPMR 294 Query: 1649 ILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARKTRLS 1470 + KWTPTF P E+ + PVW+ P LP F LF ++ +G PLQ+D T +++ S Sbjct: 295 VFKWTPTFTPSQESSIVPVWVSFPELPAHLFRKKVLFTVASMIGTPLQIDDATLSQSKFS 354 Query: 1469 FARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPPYCNFCKHIGHSLEDCYMNGNKA 1290 AR C ++ ++I YE++P YC+ CKH+GH +CY G+ Sbjct: 355 KARACIKLDLLKPHLEEFQIQICGDIIVKRIEYEQIPHYCSLCKHVGHRDSECYSKGDAP 414 Query: 1289 KPPP 1278 KPPP Sbjct: 415 KPPP 418 >ref|XP_020549415.1| uncharacterized protein LOC110011984 isoform X2 [Sesamum indicum] Length = 474 Score = 170 bits (430), Expect = 1e-41 Identities = 86/244 (35%), Positives = 123/244 (50%) Frame = -3 Query: 2009 SYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTLIGKF 1830 S+A+ ++P + L+ P I GT E LLF E E L A +R L+GKF Sbjct: 177 SWASKIAKTAPHKYFLADSPPPAI--GTILTGEKGPTLLFIDAEIEVLAAPFRFALVGKF 234 Query: 1829 SFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPMR 1650 S P + K ++ IK F+ N H++I L E DY+ LW+ IW PMR Sbjct: 235 SHGAPSYSILHKLIAGTDIKNNFTVIMLNNRHVLISLSCEADYSRLWLRRIWYIQGYPMR 294 Query: 1649 ILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARKTRLS 1470 + KWTPTF P E+ + PVW+ P LP F LF ++ +G PLQ+D T +++ S Sbjct: 295 VFKWTPTFTPSQESSIVPVWVSFPELPAHLFRKKVLFTVASMIGTPLQIDDATLSQSKFS 354 Query: 1469 FARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPPYCNFCKHIGHSLEDCYMNGNKA 1290 AR C ++ ++I YE++P YC+ CKH+GH +CY G+ Sbjct: 355 KARACIKLDLLKPHLEEFQIQICGDIIVKRIEYEQIPHYCSLCKHVGHRDSECYSKGDAP 414 Query: 1289 KPPP 1278 KPPP Sbjct: 415 KPPP 418 >ref|XP_020549414.1| uncharacterized protein LOC110011984 isoform X1 [Sesamum indicum] Length = 474 Score = 170 bits (430), Expect = 1e-41 Identities = 86/244 (35%), Positives = 123/244 (50%) Frame = -3 Query: 2009 SYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTLIGKF 1830 S+A+ ++P + L+ P I GT E LLF E E L A +R L+GKF Sbjct: 177 SWASKIAKTAPHKYFLADSPPPAI--GTILTGEKGPTLLFIDAEIEVLAAPFRFALVGKF 234 Query: 1829 SFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPMR 1650 S P + K ++ IK F+ N H++I L E DY+ LW+ IW PMR Sbjct: 235 SHGAPSYSILHKLIAGTDIKNNFTVIMLNNRHVLISLSCEADYSRLWLRRIWYIQGYPMR 294 Query: 1649 ILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARKTRLS 1470 + KWTPTF P E+ + PVW+ P LP F LF ++ +G PLQ+D T +++ S Sbjct: 295 VFKWTPTFTPSQESSIVPVWVSFPELPAHLFRKKVLFTVASMIGTPLQIDDATLSQSKFS 354 Query: 1469 FARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPPYCNFCKHIGHSLEDCYMNGNKA 1290 AR C ++ ++I YE++P YC+ CKH+GH +CY G+ Sbjct: 355 KARACIKLDLLKPHLEEFQIQICGDIIVKRIEYEQIPHYCSLCKHVGHRDSECYSKGDAP 414 Query: 1289 KPPP 1278 KPPP Sbjct: 415 KPPP 418 >gb|KZV41985.1| hypothetical protein F511_16933 [Dorcoceras hygrometricum] Length = 583 Score = 168 bits (425), Expect = 3e-40 Identities = 122/401 (30%), Positives = 184/401 (45%), Gaps = 22/401 (5%) Frame = -3 Query: 2078 PFMAGPGSSPPDGQYTSDNPPLKSYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKA 1899 P GP PPD +S++ ++YA SSP SF + G E + ++ Sbjct: 22 PPPTGPPPKPPDEHGSSNS---RTYAEAFIDSSPRTKRKSF-----MEYGHGEQIQAKEI 73 Query: 1898 LLF--------SSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFAN 1743 ++F SS E + A+ LIGKFS P I K + L + GP+S F Sbjct: 74 VMFRNTPSIQFSSDEVLDMGRAYPFALIGKFSGGWPSRDHIMKAFADLELAGPYSIKFLR 133 Query: 1742 QSHIIIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQ 1563 + + +LEED + +W W +RILKWTP F+ E+ + PVW+R P LPI Sbjct: 134 PGFLFLDFKLEEDMSRIWSKGRWFICGHLLRILKWTPHFDYSKESSIVPVWVRFPDLPIP 193 Query: 1562 FFDYHALFAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQ 1383 F+ LF+++ +G PL++D T++ RL+FARVC L FG+ Q Sbjct: 194 MFERSRLFSLANIIGKPLKMDELTSKAERLTFARVCVEVDLLKPLQEKVYLLFGSNPVEQ 253 Query: 1382 KIIYERVPPYCNFCKHIGHSLEDCYMNGNKAKP---PPPVR-----------HPTTQAGN 1245 +++YE +P YC C H+GH +++CY G +P P R + + Sbjct: 254 RVVYENLPKYCLDCHHVGHDVQECYAFGKNPRPDRKKPAKRDLRDVLIQKRQNENQETYK 313 Query: 1244 SNCGGKQVVTSNPAWIMVNKKGAKETGLVPKELLNTTKKSKHTYFEASHSNIEGSSSNQF 1065 + G Q T N W VNK+G TGL E K+ N+EG++ NQF Sbjct: 314 NTEDGMQENTGN-VWRAVNKRG---TGLNINEPRTILKR---------QPNLEGNNINQF 360 Query: 1064 SILANEVFEPLDDRLQSIKGQDLSKGSNMDVSFPKNMDENQ 942 + L + L D Q ++ + S GS + M+EN+ Sbjct: 361 AALNEDAGFDLGDE-QEVENEK-SNGS-------EQMNENE 392 >ref|XP_012846704.1| PREDICTED: uncharacterized protein LOC105966659 [Erythranthe guttata] Length = 582 Score = 165 bits (417), Expect = 3e-39 Identities = 107/349 (30%), Positives = 169/349 (48%), Gaps = 6/349 (1%) Frame = -3 Query: 1943 IIPIGTSENKEGQKALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGP 1764 I PIGT + +G+ L FS E +++ + TLIGKFS I H + + K + L +G Sbjct: 88 IAPIGTIKVIDGKNVLYFSKEEVDKMLEPLKYTLIGKFSHGIHHYKVMEKFIYDLKPRGS 147 Query: 1763 FSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIR 1584 F N H++I+ + + Y+LL +I PMR+ K+TP FN K E +APVW+ Sbjct: 148 FELHKLNYRHVLIQFSVLDYYSLLLRRSICYIDGLPMRVFKYTPGFNLKNETSIAPVWVN 207 Query: 1583 LPGLPIQFFDYHALFAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKF 1404 +PG+P ++ A+F ++ +GNPL+ D TA + +LS AR C + Sbjct: 208 VPGVPPYMYNREAIFFLASSIGNPLEFDDFTADRKKLSVARFCVEIDLLKPRVEQIPVMT 267 Query: 1403 GNTS---HTQKIIYERVPPYCNFCKHIGHSLEDCYMNGNKAK---PPPPVRHPTTQAGNS 1242 G + + YE VP +C FC H+GHS+E+CYMNGN K PPPP R P A Sbjct: 268 GYDDVEMISLPVNYENVPKFCTFCSHLGHSVENCYMNGNAKKPDFPPPPQRIPKPTA--- 324 Query: 1241 NCGGKQVVTSNPAWIMVNKKGAKETGLVPKELLNTTKKSKHTYFEASHSNIEGSSSNQFS 1062 + W V K ++ +V + T +K T + I ++++ S Sbjct: 325 ------LPKEKQVWRRVEK---RKNVVVENMDIPKTSGTKSTENPSFSQAIVRTTADNIS 375 Query: 1061 ILANEVFEPLDDRLQSIKGQDLSKGSNMDVSFPKNMDENQEKGPHTTTN 915 E + P + L+S +D+ K++ ++ +KG T+N Sbjct: 376 EGDFEHYNPF-ELLESGAQEDIEAAQIQPDVASKHVTKSSKKGRKNTSN 423 >ref|XP_024035374.1| extensin-like [Citrus clementina] Length = 621 Score = 162 bits (410), Expect = 4e-38 Identities = 95/278 (34%), Positives = 140/278 (50%), Gaps = 12/278 (4%) Frame = -3 Query: 2039 QYTSDNPPLKSYANVAGSSSPSQINLSFD-PKKI----------IPIGTSENKEGQKALL 1893 Q S P SYAN+ S++ N S PK + IPI + G+ A+L Sbjct: 336 QNASQPLPKPSYANITKSTTVWNNNASTQYPKSLDSLPGLTPSDIPIKPTTVYHGEPAVL 395 Query: 1892 FSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQL 1713 FS LE + + A ++L+L+GKFSF P I + +SLG+KG + HI+I L+L Sbjct: 396 FSKLEVQSMAAPYKLSLVGKFSFGRPKMPIIRQFFTSLGLKGNAQVLLLDPKHILINLEL 455 Query: 1712 EEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFD-YHALFA 1536 +EDY+ +W+ W C MRI KWT F E+P+ PVW+ P LPI + A + Sbjct: 456 KEDYSRIWIRQFWCISGCTMRIFKWTTNFRCYEESPIVPVWVSFPFLPIHYIQCKSARIS 515 Query: 1535 ISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPP 1356 I+ +G PL+VD T R S ARV + G++ Q++I+E++P Sbjct: 516 IASAIGKPLRVDHATTAVIRPSVARVLIEYDVSRPPVPRIWIGAGDSGFWQEVIFEQIPA 575 Query: 1355 YCNFCKHIGHSLEDCYMNGNKAKPPPPVRHPTTQAGNS 1242 YC CKH G+S E+C++ P +R P G + Sbjct: 576 YCASCKHHGYSTEECFLAN------PGLRKPQQTCGET 607 >gb|KZV16468.1| hypothetical protein F511_10080 [Dorcoceras hygrometricum] gb|KZV35591.1| hypothetical protein F511_32757 [Dorcoceras hygrometricum] Length = 215 Score = 151 bits (381), Expect = 7e-38 Identities = 72/195 (36%), Positives = 106/195 (54%) Frame = -3 Query: 1868 LDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLW 1689 ++ A LIGKFS P I++ +L + GP+S F + + +LEED +W Sbjct: 1 MEKANPFALIGKFSGGWPSRDIITRAFGNLELAGPYSIRFLRPGFLFLDFKLEEDMTRIW 60 Query: 1688 MGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPL 1509 W G +RILKWTP F+ E+ + PVW+R P LPI F+ + LF+I+ +G PL Sbjct: 61 SKGRWFIGGHLLRILKWTPYFDYSKESSIVPVWVRFPDLPIPMFERNRLFSIANIVGKPL 120 Query: 1508 QVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPPYCNFCKHIG 1329 ++D T++ RL+ ARVC L FGNT Q+++YE +P YC C H+G Sbjct: 121 KMDELTSKAERLTMARVCIEVDLLKPLPDKIFLIFGNTPLEQRVVYENIPKYCTDCLHVG 180 Query: 1328 HSLEDCYMNGNKAKP 1284 H + +CY G +P Sbjct: 181 HEVVECYALGKNPRP 195 >gb|PIN08004.1| hypothetical protein CDL12_19422 [Handroanthus impetiginosus] Length = 371 Score = 154 bits (388), Expect = 5e-37 Identities = 95/295 (32%), Positives = 145/295 (49%), Gaps = 4/295 (1%) Frame = -3 Query: 2078 PFMAGPGSSPPDGQYTSDNPPLKSYANVAGSSSPSQ----INLSFDPKKIIPIGTSENKE 1911 P + P ++ PDG P +SYA+ +P++ +NL F PK IG + Sbjct: 21 PSSSFPSTAGPDGA-----PLARSYADTLREGAPNRKPQFLNLEF-PK----IGNLSSCN 70 Query: 1910 GQKALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHI 1731 G K +FS E +L ++ TL+GKFSF P +I + SL NQ H+ Sbjct: 71 GAKMAVFSEEEVSQLSVPFQHTLVGKFSFGRPPLASIKQHFLSLECFS-VRIQLLNQRHV 129 Query: 1730 IIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDY 1551 +I L +D+ LW+ + PMR+ KW+P+FN K E +AP+W+R+ GLPI FD Sbjct: 130 LIFLNNADDFAKLWLRREVFIDSLPMRLFKWSPSFNVKHEPSVAPLWVRISGLPIHLFDK 189 Query: 1550 HALFAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIY 1371 ALF I +G P ++D T +R+++ARVC + + ++Y Sbjct: 190 CALFTIDGLIGEPFKIDEATCNLSRINYARVCIEIDLKHQPPSEIRVMNAGELLSVLVVY 249 Query: 1370 ERVPPYCNFCKHIGHSLEDCYMNGNKAKPPPPVRHPTTQAGNSNCGGKQVVTSNP 1206 ER+P YC++C H+GH CY+ K P P R+ Q + G ++V P Sbjct: 250 ERLPKYCSYCHHLGHEENACYI---KQAGPRPRRNLRKQPDIHSKGKEKVTEVEP 301 >ref|XP_022866206.1| uncharacterized protein LOC111386010 [Olea europaea var. sylvestris] Length = 257 Score = 150 bits (378), Expect = 6e-37 Identities = 84/247 (34%), Positives = 121/247 (48%), Gaps = 6/247 (2%) Frame = -3 Query: 1910 GQKALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHI 1731 G+ A++ + E L ++ TL+GKF P + + + S G G F + +HI Sbjct: 2 GEPAVILTESEEATLADPYKYTLVGKFPHRKPTMKKVRENFSKFGFHGCFEVGLIDTTHI 61 Query: 1730 IIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDY 1551 +I L EEDY+ L++ W CPMR+LKWT F P+ E P+ PVW+ P LP+ Sbjct: 62 LIHLTHEEDYSRLFLKPSWYIDGCPMRVLKWTCDFTPEQETPIVPVWVSFPLLPVHLRAK 121 Query: 1550 HALFAISKELGNPLQVDPPTARKTRLSFARVC-XXXXXXXXXXXXXXLKFGNTSHTQKII 1374 LFA+S+ +G PL++D T R S ARVC + GN Q +I Sbjct: 122 GFLFALSRAIGMPLRIDEATTDLRRPSEARVCIEVNLEHKLPDRVWIERAGNRGFWQTVI 181 Query: 1373 YERVPPYCNFCKHIGHSLEDCYMNGNKAKPP-----PPVRHPTTQAGNSNCGGKQVVTSN 1209 YE+ P +C CKH+GHS + C PP PPVR P T+ + + G V Sbjct: 182 YEKRPIFCFSCKHLGHSYDQCTTVPVPPPPPTVVVHPPVRPPITRT-DKDKGKVTFVEPK 240 Query: 1208 PAWIMVN 1188 W+ V+ Sbjct: 241 KQWVPVS 247 >gb|PIM98497.1| hypothetical protein CDL12_29024 [Handroanthus impetiginosus] Length = 430 Score = 155 bits (391), Expect = 6e-37 Identities = 81/251 (32%), Positives = 132/251 (52%), Gaps = 5/251 (1%) Frame = -3 Query: 2021 PPLKSYANVAGSSSPSQINLSFDPKKIIP-----IGTSENKEGQKALLFSSLETERLDAA 1857 P KS+A+ ++ Q + P + G E +G+ ++FS+ E + L+ Sbjct: 160 PLQKSFADAVRGANVHQRPSNTSPTSFLNPQSPRFGRKETVDGESVVVFSANELQVLEEP 219 Query: 1856 WRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTI 1677 R +LIGKFSF P ++I ++ I G F NQ HI+I+L EDY +W+ Sbjct: 220 LRFSLIGKFSFGRPQLRSIRSYFTAQRI-GQFRVQLLNQKHILIELTNAEDYARIWLRRE 278 Query: 1676 WMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDP 1497 + PMR+ KW F+ + E+ +APVW+R GLP+ ++ ALF I + +G PL+VD Sbjct: 279 IVVEGLPMRLFKWMRNFDFQFESAIAPVWVRFEGLPLHLYNAAALFTIGELIGQPLKVDE 338 Query: 1496 PTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERVPPYCNFCKHIGHSLE 1317 T ++R +ARVC ++ + T +++E++P YC +CKH+GH + Sbjct: 339 ATRLRSRTGYARVCIEVDLLKPTPESVKIQQDDELTTVPVVFEKMPKYCTYCKHVGHDEQ 398 Query: 1316 DCYMNGNKAKP 1284 DCY+ G +P Sbjct: 399 DCYIKGPHPRP 409 Score = 65.5 bits (158), Expect = 3e-07 Identities = 34/107 (31%), Positives = 56/107 (52%) Frame = -3 Query: 1895 LFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQ 1716 +FS E E+L ++ TL+ ++KG S+ ++ NQ H++I L Sbjct: 3 VFSHEEVEKLSVLFQFTLVASIK-----QHVLNKGCPSVRVQ------LLNQRHVLIFLD 51 Query: 1715 LEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPG 1575 E++ LW+ + PMR+ KW+PTF+ K E +APVW+R+ G Sbjct: 52 NAEEFAKLWLCREIFIESLPMRLFKWSPTFDVKQEPSVAPVWVRISG 98 >emb|CDP20930.1| unnamed protein product [Coffea canephora] Length = 497 Score = 156 bits (394), Expect = 8e-37 Identities = 79/205 (38%), Positives = 114/205 (55%), Gaps = 1/205 (0%) Frame = -3 Query: 1913 EGQKALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSH 1734 +G+ A++FS + ++L A ++ L+GKFS P + I K +SL +K S + H Sbjct: 42 KGEAAVVFSKADADKLAAPFQWALVGKFSHGRPSLEDIRKFFASLNLKDHVSIGLMDYRH 101 Query: 1733 IIIKLQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFD 1554 ++IK E D+N +WM IW G PMR+ +WT F+ E+ LAPVW+ LP LPI +FD Sbjct: 102 VLIKCMAEADFNRIWMRGIWQLGKYPMRVFRWTREFHVLRESSLAPVWVVLPALPIHYFD 161 Query: 1553 YHALFAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKF-GNTSHTQKI 1377 H+LF+I +G PL +D TA TR S ARVC + G + Q+I Sbjct: 162 KHSLFSILSPVGRPLFLDSATAAGTRPSLARVCVELDVAKSFTQRVWVAVEGESGFWQRI 221 Query: 1376 IYERVPPYCNFCKHIGHSLEDCYMN 1302 + E +P YC+ C +GHS E C N Sbjct: 222 VPENMPLYCSSCSRLGHSQEQCKKN 246 >gb|KZV31111.1| hypothetical protein F511_14992 [Dorcoceras hygrometricum] Length = 547 Score = 152 bits (385), Expect = 3e-35 Identities = 75/206 (36%), Positives = 112/206 (54%) Frame = -3 Query: 1901 ALLFSSLETERLDAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIK 1722 ++ F++ E ++ A LIGKFS P + I++ L + GP+S F + ++ Sbjct: 22 SIQFTTDEVLEMEQAHPFALIGKFSGGWPSRENITRAFGELELAGPYSIRFLRPGFLFLE 81 Query: 1721 LQLEEDYNLLWMGTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHAL 1542 +LEED +W W +RILKWTP F+ E+ + PVW+R P LPI F+ + L Sbjct: 82 FKLEEDMARIWSKGRWFICGHLLRILKWTPFFDYAKESSIVPVWVRFPDLPIPMFERNRL 141 Query: 1541 FAISKELGNPLQVDPPTARKTRLSFARVCXXXXXXXXXXXXXXLKFGNTSHTQKIIYERV 1362 F+I+ +G PL++D T++ RL+ ARVC L FGNT Q+++YE + Sbjct: 142 FSIANIIGKPLKMDELTSKSERLTMARVCIEVDLLKSLPDKVYLIFGNTPVEQRVVYENM 201 Query: 1361 PPYCNFCKHIGHSLEDCYMNGNKAKP 1284 P YC C H+GH + DCY G KP Sbjct: 202 PKYCLDCHHVGHDVVDCYAFGKNPKP 227 >ref|XP_020552547.1| uncharacterized protein LOC110012599 [Sesamum indicum] Length = 269 Score = 145 bits (366), Expect = 3e-35 Identities = 77/196 (39%), Positives = 112/196 (57%), Gaps = 3/196 (1%) Frame = -3 Query: 2036 YTSDNPPLKSYANV-AGSSSPSQINLSFDPKKIIP--IGTSENKEGQKALLFSSLETERL 1866 +++ PP +++A V A S +P + F P IG E L+FS ETE L Sbjct: 50 FSTPMPPKRTFAEVVAPSKAPQTASQKFFFADSPPPSIGAVLTDEKGPTLVFSDAETETL 109 Query: 1865 DAAWRLTLIGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWM 1686 A +RL L+GKFS P + + + ++ LG+KG F+ S N H++I L E D++ LW+ Sbjct: 110 AAPFRLALVGKFSHGKPQFRHLHRLIAGLGVKGAFTVSMLNAKHVLICLSNESDFSYLWL 169 Query: 1685 GTIWMFGACPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQ 1506 IW PMRI KWTP+F P E+ + P+W+R P LP F ALFAI+ +G PLQ Sbjct: 170 RRIWHIQGFPMRIFKWTPSFTPTQESSIIPIWVRFPELPANLFHKDALFAIANMIGTPLQ 229 Query: 1505 VDPPTARKTRLSFARV 1458 +D T +++LS AR+ Sbjct: 230 IDDCTLNQSKLSNARI 245 >gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 156 bits (395), Expect = 5e-35 Identities = 100/374 (26%), Positives = 171/374 (45%), Gaps = 9/374 (2%) Frame = -3 Query: 2012 KSYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTLIGK 1833 KS+ ++ PS + L+ DP + + A F E + L ++L+L+GK Sbjct: 1757 KSFLSIITGEKPSVVPLTRDPFVF---------KDRPAAAFFEDEIQTLAKPFKLSLVGK 1807 Query: 1832 FSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPM 1653 FS +P Q + +G+ G + + + H++I L E+D+N +W W M Sbjct: 1808 FS-RMPKLQDVRAAFKGIGLAGAYEVRWLDYKHVLIHLSNEQDFNRIWTKQNWFIATQKM 1866 Query: 1652 RILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARKTRL 1473 R+ KWTP F P+ E+ + PVWI P L F+ AL I+K +G PL VD TA +R Sbjct: 1867 RVFKWTPEFEPEKESAVVPVWISFPNLKAHLFEKSALLLIAKTVGKPLFVDEATANGSRP 1926 Query: 1472 SFARVCXXXXXXXXXXXXXXLKFGN-------TSHTQKIIYERVPPYCNFCKHIGHSLED 1314 S ARVC + N ++Q++ + ++P YC+ C H+GH D Sbjct: 1927 SVARVCVEFDCRQPPLDQVWIVVQNRKTGEITNGYSQRVEFAQMPAYCDHCCHVGHKETD 1986 Query: 1313 CYMNGNKAKPPPPVRHPTTQAGNSNCGGKQVVTSNPAWIMVNKKGAKETGLVPK--ELLN 1140 C + GNKA+PP + P ++ + GG++V + K+ E P+ ++L Sbjct: 1987 CILLGNKARPPGITKQPNSRLED---GGRRVGSKEDGEFTTEKRKNIENSKKPQNDKILY 2043 Query: 1139 TTKKSKHTYFEASHSNIEGSSSNQFSILANEVFEPLDDRLQSIKGQDLSKGSNMDVSFPK 960 + KH G +N+ S ++++ + ++ SK N+ VS Sbjct: 2044 PEEPPKH--------QKRGQPANKGSTSGTKIWQG-----KKVQSDKASKDENISVSNRF 2090 Query: 959 NMDENQEKGPHTTT 918 ++ +E+ H+ T Sbjct: 2091 HIISEEEEDEHSRT 2104 Score = 119 bits (298), Expect = 2e-23 Identities = 103/469 (21%), Positives = 193/469 (41%), Gaps = 57/469 (12%) Frame = -3 Query: 1820 IPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGACPMRILK 1641 +P Q I + +G+ G + + + HI+I L E+D+N +W W MR+ K Sbjct: 1 MPKMQEIRQAFKGIGLTGAYVIRWLDYKHILIHLSNEQDFNRIWTKQQWFIANQKMRVFK 60 Query: 1640 WTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARKTRLSFAR 1461 W+P F + E+P+ PVWI P L ++ AL I+K +G PL +D T+ +R S AR Sbjct: 61 WSPDFEAEKESPIVPVWISFPNLKAHLYEKSALLLIAKTVGKPLFIDEATSNASRPSVAR 120 Query: 1460 VCXXXXXXXXXXXXXXLKFGNT-------SHTQKIIYERVPPYCNFCKHIGHSLEDCYMN 1302 VC + + + QK+ + ++P YC C H+GHS+ C + Sbjct: 121 VCVEYNCRNAPVEEIWIVIKDRVTGTVTGGYAQKVEFSKMPDYCEHCGHVGHSVSTCLVL 180 Query: 1301 GNKA----KPPPPVRHPTTQAG-----NSNCG---------------GKQVVTSNPAWIM 1194 GN++ K H + AG N + G +++ P M Sbjct: 181 GNRSENLRKEKLSNVHSKSLAGKKQTENDDKGLDSKPMDDLKRNKETDRKISEERP---M 237 Query: 1193 VNKKGAKETGLVPKELLNTTKKSKHTY---------------FEASHSNIEGSSS----- 1074 + + + T ++LN +KH+ F+ + ++E + Sbjct: 238 MTGRNTEATAEKRNKILNREVLAKHSLQWQAVGHLGQPKFNGFKGAERHLEDEGTKQFQN 297 Query: 1073 -NQFSILANEVFEPLDDRLQSIKGQDLSKGSNMDVSFPKNMDENQEKGPHTTTNPELVVC 897 N+FS L + +++++ K + + S D +D + H N E++ Sbjct: 298 VNRFSALGSVQDTENEEQIREGKQILMGESSGKDKQGKDGIDLIFKSEGHQKLNGEVLNA 357 Query: 896 AGNNMEGANHSFNGSQKSCIRNKEGFNILHTTKEDNLVIETFPNI--MANPIKENLDTFL 723 +GN+ ++ + FN+ + E+ V+ P++ ++LD Sbjct: 358 SGNH------------QAAVEKDATFNVTKSAGEE--VLPKVPHVHGARGMAGKSLDILE 403 Query: 722 KT--NTKSGNDLDEEALVEEG-EILSPKISKQTSGYVENGASCNLIQLN 585 +T T+ D +E + E G E+ +++K+ G + A + QL+ Sbjct: 404 ETVPETRVSRDSTKEFIEESGQELHQERVNKENRGISFDNAENSSNQLH 452 >ref|XP_019161197.1| PREDICTED: uncharacterized protein LOC109157811 [Ipomoea nil] Length = 408 Score = 147 bits (372), Expect = 1e-34 Identities = 82/238 (34%), Positives = 116/238 (48%), Gaps = 1/238 (0%) Frame = -3 Query: 2021 PPLKSYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTL 1842 PP +++A++ SSS S P + P E +G A+ F + +R L Sbjct: 10 PPPRTFADLLSSSSAS-------PPLLRP---PERFKGMPAVSFLDEDVRSFSHKFRFAL 59 Query: 1841 IGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGA 1662 IGKF+ + P I K +G G FS +Q HI++ E D+ W+ W Sbjct: 60 IGKFAKSRPPMAEIRKAFDLIGFGGAFSLGLLDQRHILMNFDYEADFQRCWLRKSWSIKG 119 Query: 1661 CPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARK 1482 MR+ KWTP F P+ E+P+ PVWI GLP D A+F+I+ +G+PL+VD T Sbjct: 120 AIMRVFKWTPDFRPEFESPIVPVWIAFEGLPAHLQDKRAIFSIANLIGSPLKVDSSTLLH 179 Query: 1481 TRLSFARVCXXXXXXXXXXXXXXLKFGN-TSHTQKIIYERVPPYCNFCKHIGHSLEDC 1311 R S ARVC + G+ TQK+ YE VPPYC C+ GH++ DC Sbjct: 180 NRPSVARVCVELDVSTDLTHQIWINNGSFGGFTQKVTYEFVPPYCKECRKFGHTVADC 237 >ref|XP_019159873.1| PREDICTED: uncharacterized protein LOC109156471 [Ipomoea nil] Length = 243 Score = 142 bits (358), Expect = 2e-34 Identities = 75/238 (31%), Positives = 116/238 (48%), Gaps = 1/238 (0%) Frame = -3 Query: 2021 PPLKSYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTL 1842 PP +++A++ SS S P+ +G A+ FS + ++ +R L Sbjct: 9 PPHRTFADILSKSSESS-----------PLRPPRRYKGMPAVSFSDDDVQQFSEKFRFAL 57 Query: 1841 IGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGA 1662 +GKF+ + P + K +G G F+ +Q HI+I E D+ W+ W Sbjct: 58 VGKFAKSRPPMADLRKTFDQIGFGGAFTLGLIDQRHILINFDHEIDFQRCWLRKTWSIKG 117 Query: 1661 CPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARK 1482 MR+ +WTP F P +E+P+ PVW+ L GLP D A+++I+ +GNPL+VD T Sbjct: 118 SLMRVFRWTPDFRPDVESPIIPVWLALEGLPAHLHDKRAIYSIANLVGNPLKVDTSTLVP 177 Query: 1481 TRLSFARVCXXXXXXXXXXXXXXLKFGN-TSHTQKIIYERVPPYCNFCKHIGHSLEDC 1311 R S ARVC + G+ TQ+I++E VPPYC C+ GH +C Sbjct: 178 NRPSVARVCVELNVSLPLVEQVWINNGSYGGFTQRIVHEFVPPYCLGCRKFGHQDSEC 235 >ref|XP_019186523.1| PREDICTED: uncharacterized protein LOC109181225 [Ipomoea nil] Length = 276 Score = 142 bits (359), Expect = 3e-34 Identities = 80/254 (31%), Positives = 117/254 (46%), Gaps = 1/254 (0%) Frame = -3 Query: 2021 PPLKSYANVAGSSSPSQINLSFDPKKIIPIGTSENKEGQKALLFSSLETERLDAAWRLTL 1842 PP ++A+V +S DP P+ T +G A+ F+ + ++ +R L Sbjct: 10 PPPHTFADVLSKTSS-------DPP---PLRTPGRYKGMPAISFTDQDIQQFPQKFRFAL 59 Query: 1841 IGKFSFAIPHAQAISKGLSSLGIKGPFSWSFANQSHIIIKLQLEEDYNLLWMGTIWMFGA 1662 +GKFS P + +G G FS +Q H++I LE ++ W+ W Sbjct: 60 VGKFSKGRPSMAELRMTFDQIGFGGAFSLGLLDQRHVLINFDLETNFQRCWLRKSWSVRG 119 Query: 1661 CPMRILKWTPTFNPKMEAPLAPVWIRLPGLPIQFFDYHALFAISKELGNPLQVDPPTARK 1482 MRI KW+P F +E+P+ PVWI GLP D A+++I+ +G PL+VD T Sbjct: 120 FIMRIFKWSPDFRLDIESPIVPVWIAFDGLPAHLQDKRAIYSIANLIGTPLKVDSSTLLH 179 Query: 1481 TRLSFARVCXXXXXXXXXXXXXXLKFGN-TSHTQKIIYERVPPYCNFCKHIGHSLEDCYM 1305 R S ARVC + G+ QK+ YE +PPYC C+ GH DC Sbjct: 180 NRPSLARVCVELNVSQSLPNQVWITNGSYGGFNQKVTYEYIPPYCMGCRKFGHLWSDCRS 239 Query: 1304 NGNKAKPPPPVRHP 1263 N +P P R P Sbjct: 240 NYVDDRPQEPYREP 253