BLASTX nr result
ID: Cheilocostus21_contig00040979
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00040979
         (1534 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                       (bits) Value

ref|XP_009388297.1| PREDICTED: uncharacterized protein LOC103975...  389   e-128
ref|XP_018686543.1| PREDICTED: helicase swr-1 isoform X2 [Musa a...  355   e-114
ref|XP_009411395.1| PREDICTED: helicase swr-1 isoform X1 [Musa a...  353   e-114
ref|XP_010920313.1| PREDICTED: uncharacterized protein LOC105044...  343   e-110
ref|XP_010934255.1| PREDICTED: protein bfr2-like [Elaeis guineen...  259   9e-78
ref|XP_008780393.1| PREDICTED: protein JASON-like [Phoenix dacty...  246   5e-75
ref|XP_020692266.1| protein JASON-like isoform X2 [Dendrobium ca...  249   4e-74
ref|XP_020692265.1| mediator of RNA polymerase II transcription ...  243   1e-71
gb|PKU72536.1| Protein BREAST CANCER SUSCEPTIBILITY 1 like [Dend...  249   4e-70
ref|XP_020093425.1| nuclear polyadenylated RNA-binding protein 3...  236   9e-69
ref|XP_009382207.1| PREDICTED: protein bfr2-like [Musa acuminata...  230   6e-67
ref|XP_010264923.1| PREDICTED: rRNA biogenesis protein rrp36 [Ne...  231   3e-66
gb|OVA02338.1| hypothetical protein BVC80_9099g138 [Macleaya cor...  228   1e-65
ref|XP_020247430.1| LOW QUALITY PROTEIN: uncharacterized protein...  225   4e-64
ref|XP_020584860.1| uncharacterized protein LOC110027684 [Phalae...  212   2e-59
ref|XP_017984350.1| PREDICTED: DNA ligase 1 [Theobroma cacao]        210   1e-58
gb|EOX94353.1| Uncharacterized protein TCM_003948 isoform 1 [The...
209 3e-58 ref|XP_010093087.1| putative vacuolar protein sorting-associated... 206 3e-57 gb|PON44348.1| eisosome protein [Parasponia andersonii] 205 6e-57 gb|EMS58738.1| hypothetical protein TRIUR3_23723 [Triticum urartu] 206 1e-56 >ref|XP_009388297.1| PREDICTED: uncharacterized protein LOC103975115 [Musa acuminata subsp. malaccensis] Length = 420 Score = 389 bits (999), Expect = e-128 Identities = 232/430 (53%), Positives = 264/430 (61%), Gaps = 7/430 (1%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDH-RECCRALQPSLSPKQITLDLSXXXXXXXXXX 234 MGCFL CFK KDRKR R ++SL ID E R LQ SLSPKQIT Sbjct: 1 MGCFLACFKGSKDRKRGRSPRKSLPIDRVHESYRPLQTSLSPKQIT-----PKAEVVAAP 55 Query: 235 XXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDE-RKLEEE 411 L+EN++QGS + RKKVTFDLNV TYE + D +CSSE + +E IDE RK + Sbjct: 56 LPALRENLEQGSCSSTRKKVTFDLNVTTYEEALVDNDPNCSSEVDKETEAIDEGRKANDG 115 Query: 412 RDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYDS 591 +D+SSP S AFP+N RY NC YDS Sbjct: 116 QDKSSPVSGAFPLNRRYHNCESSDDDVEHDEEDYWDSDFDEEEDTEVVIEGNEEES-YDS 174 Query: 592 FFSLPIDKEP-QGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVLNPVENL 768 FFSLPIDKEP Q + E NSP PK A SP D PILVAGG+T RDRSQFVHPVLNPVENL Sbjct: 175 FFSLPIDKEPSQSIQEANSPNPKSASSP--DRQPILVAGGST-RDRSQFVHPVLNPVENL 231 Query: 769 SQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKV-KTREPVCSTKQDVSVDASL 945 SQWK VK R P KN KKEN+ ++ N + FISE IK K++ P S KQ++S+DASL Sbjct: 232 SQWKVVKARAMPVKNPKKENVVGTEQENNVAFISEPVIKAKKSQAPNRSAKQEISMDASL 291 Query: 946 SNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXXXXXXXXXXXXXXX 1125 S WLPSSENSTA G QA RE+RPILGALT+EDIKQ Sbjct: 292 STWLPSSENSTAEGSQASNSHRSNSSFSREERPILGALTMEDIKQ-SSVTSSPVKRSPSR 350 Query: 1126 XXDEIPILGTVGRYWNCEEQGDDSATSR---GEVKGIPNATSKYQEDKNVNLHTTPFEVR 1296 DEIPILGTVG YWNC+ GDDSA+SR E KGIPN TSKY+EDK VN H+TPFEVR Sbjct: 351 SPDEIPILGTVGGYWNCKNHGDDSASSRPPSNEFKGIPNTTSKYREDKTVNWHSTPFEVR 410 Query: 1297 LEIALRNGVA 1326 LE AL+NG A Sbjct: 411 LERALKNGAA 420 >ref|XP_018686543.1| PREDICTED: helicase swr-1 isoform X2 [Musa acuminata 
subsp. malaccensis] Length = 438 Score = 355 bits (910), Expect = e-114 Identities = 217/443 (48%), Positives = 254/443 (57%), Gaps = 20/443 (4%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLS-----PKQITLDLSXXXXXX 222 MGCFL CF K+RKRRR ++SL ++ RE CR + ++S PK +T DLS Sbjct: 1 MGCFLACFGGSKNRKRRRSPRKSLPLE-RERCRPPRRNVSTKQITPKSLTPDLSPLNPAV 59 Query: 223 XXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDER-K 399 LKEN+ QGS RKKVTF+LNV+TYE V+ +ED C SE+ E E +DE K Sbjct: 60 EDNTLPELKENLAQGSLSRTRKKVTFNLNVQTYEEVSGDEDSKCLSEDDEGPEAVDEDGK 119 Query: 400 LEEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 E +DE SPK AFP+NHRYQNC Sbjct: 120 PHEGQDEFSPKLGAFPLNHRYQNCEISDDDGSDCGGDEEEDYSDSDFDEEEDDEVGTEEK 179 Query: 580 X---YDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVL 750 YDSFFSL ID+EPQ L EV SP+PK SP PIL+AGG KRDRSQ+VHPVL Sbjct: 180 EEESYDSFFSLAIDEEPQRLQEVRSPEPKCPSSPGRQ--PILLAGGGNKRDRSQYVHPVL 237 Query: 751 NPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKT--------REPV 906 NPV+NL+QWKEVKV +A KN K EN +E+EN + K K R P Sbjct: 238 NPVQNLTQWKEVKVHSARAKNAKVENAGTEKENQMMFCSEAMNNKAKKPQGSIGLDRTPN 297 Query: 907 CSTKQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXX 1086 CSTKQ++SVDASLSNWL SSENST GP+A REDRPILGALT+ED+KQ Sbjct: 298 CSTKQEISVDASLSNWLLSSENSTVEGPEASNSPRSNSAFSREDRPILGALTVEDLKQ-- 355 Query: 1087 XXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSR---GEVKGIPNATSKYQED 1257 DE PILGTVGRYWN QGDDSA+SR G PN T KY+ED Sbjct: 356 ASVTSSPRRSPSRSTDEKPILGTVGRYWNSRNQGDDSASSRQPSSGHNGSPNTTIKYRED 415 Query: 1258 KNVNLHTTPFEVRLEIALRNGVA 1326 K VN ++TPFEV LE AL G A Sbjct: 416 KPVNWNSTPFEVMLERALETGTA 438 >ref|XP_009411395.1| PREDICTED: helicase swr-1 isoform X1 [Musa acuminata subsp. 
malaccensis] Length = 440 Score = 353 bits (906), Expect = e-114 Identities = 216/444 (48%), Positives = 254/444 (57%), Gaps = 21/444 (4%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDH-RECCRALQPSLS-----PKQITLDLSXXXXX 219 MGCFL CF K+RKRRR ++SL ++ +E CR + ++S PK +T DLS Sbjct: 1 MGCFLACFGGSKNRKRRRSPRKSLPLERAQERCRPPRRNVSTKQITPKSLTPDLSPLNPA 60 Query: 220 XXXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDER- 396 LKEN+ QGS RKKVTF+LNV+TYE V+ +ED C SE+ E E +DE Sbjct: 61 VEDNTLPELKENLAQGSLSRTRKKVTFNLNVQTYEEVSGDEDSKCLSEDDEGPEAVDEDG 120 Query: 397 KLEEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 576 K E +DE SPK AFP+NHRYQNC Sbjct: 121 KPHEGQDEFSPKLGAFPLNHRYQNCEISDDDGSDCGGDEEEDYSDSDFDEEEDDEVGTEE 180 Query: 577 XX---YDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPV 747 YDSFFSL ID+EPQ L EV SP+PK SP PIL+AGG KRDRSQ+VHPV Sbjct: 181 KEEESYDSFFSLAIDEEPQRLQEVRSPEPKCPSSPGRQ--PILLAGGGNKRDRSQYVHPV 238 Query: 748 LNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKT--------REP 903 LNPV+NL+QWKEVKV +A KN K EN +E+EN + K K R P Sbjct: 239 LNPVQNLTQWKEVKVHSARAKNAKVENAGTEKENQMMFCSEAMNNKAKKPQGSIGLDRTP 298 Query: 904 VCSTKQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQX 1083 CSTKQ++SVDASLSNWL SSENST GP+A REDRPILGALT+ED+KQ Sbjct: 299 NCSTKQEISVDASLSNWLLSSENSTVEGPEASNSPRSNSAFSREDRPILGALTVEDLKQ- 357 Query: 1084 XXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSR---GEVKGIPNATSKYQE 1254 DE PILGTVGRYWN QGDDSA+SR G PN T KY+E Sbjct: 358 -ASVTSSPRRSPSRSTDEKPILGTVGRYWNSRNQGDDSASSRQPSSGHNGSPNTTIKYRE 416 Query: 1255 DKNVNLHTTPFEVRLEIALRNGVA 1326 DK VN ++TPFEV LE AL G A Sbjct: 417 DKPVNWNSTPFEVMLERALETGTA 440 >ref|XP_010920313.1| PREDICTED: uncharacterized protein LOC105044198 [Elaeis guineensis] Length = 438 Score = 343 bits (880), Expect = e-110 Identities = 216/444 (48%), Positives = 257/444 (57%), Gaps = 21/444 (4%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHR-ECCRALQPSLSPKQITLDLSXXXXXXXXXX 234 MGCFL CF KDRKRRRP +SLS+D E + L+ S PKQ+T S Sbjct: 1 
MGCFLACFGGAKDRKRRRPANKSLSVDRGCETYKPLRRSPPPKQLTPSPSKKRVSEAVLD 60 Query: 235 XXX-LKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDE-RKLEE 408 L+EN +QGS + +KKVTFDLNVKTYE V +ED SSE+ + I + RK EE Sbjct: 61 STPELRENYEQGSFSSSKKKVTFDLNVKTYEEVLIDEDPAHSSEDNKEDGAIRKGRKTEE 120 Query: 409 ERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-- 582 DES PKS AFP+NHRYQNC Sbjct: 121 GDDESIPKSGAFPLNHRYQNCESSDDDDDIDFGDEEDDFEEEDYEDSDLDEEDNDVNLEG 180 Query: 583 -----YDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPV 747 YDSFFSLP++KE Q + EVNSPK K A SP D P L+ GT RDRSQ+VH V Sbjct: 181 NEEESYDSFFSLPMEKEQQCIQEVNSPKSKCASSP--DRQPSLLPKGTA-RDRSQYVHSV 237 Query: 748 LNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKV-KTREPVCS---- 912 LNPVENL+QWKEVKVR APTKN KKENI S+ + N++TF E T K+ + ++P S Sbjct: 238 LNPVENLTQWKEVKVRAAPTKNPKKENINSDID-NQMTFTPEPTFKIDEFQKPASSNPSP 296 Query: 913 ---TKQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQX 1083 KQD++VDASLSNWL SS+NS GPQ RE+RPILGALT+ED+KQ Sbjct: 297 NFPAKQDIAVDASLSNWLVSSDNSAKEGPQQSNSHFSNSSVSREERPILGALTVEDLKQ- 355 Query: 1084 XXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSAT---SRGEVKGIPNATSKYQE 1254 DEIPI+GTVG YW+C Q DS + S E+KGIPN TSKY+E Sbjct: 356 -SSVTSSPRRSPSRSPDEIPIVGTVGSYWSCLNQRSDSGSSCQSGSEIKGIPNTTSKYRE 414 Query: 1255 DKNVNLHTTPFEVRLEIALRNGVA 1326 DK VN H TPF+VR+E AL G A Sbjct: 415 DKRVNWHATPFDVRVERALDKGAA 438 >ref|XP_010934255.1| PREDICTED: protein bfr2-like [Elaeis guineensis] Length = 420 Score = 259 bits (662), Expect = 9e-78 Identities = 178/415 (42%), Positives = 222/415 (53%), Gaps = 16/415 (3%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSP--KQITLDLSXXXXXXXXX 231 MGCFL CF KDRKRRRP +S +D C + +P +P KQ+TL+LS Sbjct: 1 MGCFLTCFGGAKDRKRRRPPNKSPPVDR--VCESYKPLRNPSSKQLTLNLSPKKLISEAV 58 Query: 232 XXXX--LKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKI-DERKL 402 ++N +QGS + +KKVTFDLNVKTYE V +ED SSE+ +E I +ERK Sbjct: 59 VGSTPESRDNDEQGSFGSSKKKVTFDLNVKTYEEVLVDEDPKHSSEDKRENEVIREERKP 118 Query: 403 
EEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 582 EE DES P S A +NHRYQNC Sbjct: 119 EEADDESLPNSGASLLNHRYQNCEGSDDDDDIDYGEEEEDEDEDYEDIDLDEDDGNVVDI 178 Query: 583 YD-------SFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVH 741 SF L ++K+ Q + EVNS K K S + D P L+A GT RDRSQ+VH Sbjct: 179 EGNEEESNASFLLLEVEKQQQSIQEVNSSKSK---SVSPDRQPPLLAKGTA-RDRSQYVH 234 Query: 742 PVLNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKTR-EPVCSTK 918 PVLNPVENL+QW+EVKVR AP+KN KKENI S+ E NK+TF E T K++ R C + Sbjct: 235 PVLNPVENLTQWREVKVRAAPSKNRKKENINSDIE-NKITFSPEPTFKIEGRLRTNCPAQ 293 Query: 919 QDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXXXXXX 1098 D +VDASLSNWL SS+NS P +E+RP G LT+EDIKQ Sbjct: 294 HDATVDASLSNWLVSSDNSDKERPHQSNSDFSKLSVGQEERPTSGVLTVEDIKQ--SSVT 351 Query: 1099 XXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSAT---SRGEVKGIPNATSKYQE 1254 DEIPI+GTVG + + Q DS + S +KGIPNATS Y++ Sbjct: 352 SSLRRSPSRSPDEIPIVGTVGGCGSWKNQRSDSGSSCQSGSTIKGIPNATSTYRK 406 >ref|XP_008780393.1| PREDICTED: protein JASON-like [Phoenix dactylifera] Length = 246 Score = 246 bits (628), Expect = 5e-75 Identities = 143/251 (56%), Positives = 163/251 (64%), Gaps = 11/251 (4%) Frame = +1 Query: 607 IDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVLNPVENLSQWKEV 786 ++KE Q + EVNSPKPK SP D P L+A GT RDRSQ+VH VLNPVENL+QWKEV Sbjct: 1 MEKEHQCIQEVNSPKPKSVSSP--DRQPSLLAIGTA-RDRSQYVHSVLNPVENLTQWKEV 57 Query: 787 KVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKTRE--------PVCSTKQDVSVDAS 942 KVR AP KN KKENI S+ EN K+TF E+T K++ E P C KQDV+VDAS Sbjct: 58 KVRAAPMKNPKKENINSDIEN-KMTFTPESTFKIEKFEKLASSNPSPNCPAKQDVAVDAS 116 Query: 943 LSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXXXXXXXXXXXXXX 1122 LSNWL SS+NST GPQ REDRPILGALT+ED+KQ Sbjct: 117 LSNWLVSSDNSTKEGPQKSNSHFSNSSVSREDRPILGALTVEDLKQ--SSVTSSPRRSPS 174 Query: 1123 XXXDEIPILGTVGRYWNCEEQGDDSAT---SRGEVKGIPNATSKYQEDKNVNLHTTPFEV 1293 DEIPI+GTVG YW+C Q DS + S E+KGIPN TSKY+EDK VN H TPFEV Sbjct: 175 RSPDEIPIVGTVGSYWSCMNQRSDSGSSCQSGSEIKGIPNTTSKYKEDKRVNWHATPFEV 234 Query: 1294 RLEIALRNGVA 1326 RLE AL G A 
Sbjct: 235 RLERALNKGAA 245 >ref|XP_020692266.1| protein JASON-like isoform X2 [Dendrobium catenatum] Length = 419 Score = 249 bits (637), Expect = 4e-74 Identities = 165/434 (38%), Positives = 224/434 (51%), Gaps = 16/434 (3%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CFK KDRKR R K+SLS++ E S+S I + +S Sbjct: 1 MGCFLVCFKGSKDRKRCRRSKKSLSVNKEEENGVYFHSISSSGIEI-ISETTAAATDNPF 59 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDERKLEEERD 417 L+EN +QG+ N RKKVTFDL+V T+E +ED +E+ E ++ E K ++E + Sbjct: 60 PKLRENQEQGTAANARKKVTFDLHVTTFEIAPVQEDPKDFTEDDEIEREVKEEKKDKEEE 119 Query: 418 ESS--PKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--Y 585 +SS PK +++P NHRYQNC Y Sbjct: 120 KSSDEPKLASYPPNHRYQNCESSDDDEGLNEDDEASDEDDEIDFDEGNDIGINGDEEESY 179 Query: 586 DSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVLNPVEN 765 DS+FSLP++ E + + E++SPKP + + +++ RDRS+++H VLNPVEN Sbjct: 180 DSYFSLPMESEQKHIQEISSPKPTIESNTERE---LVILAKAHVRDRSRYIHSVLNPVEN 236 Query: 766 LSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKTREPVCST---------- 915 +S WKEVKVR KNL KEN+ + FIS K+ +P S+ Sbjct: 237 ISHWKEVKVRPTSLKNLNKENL----DIKTKPFISPEPTFSKSEKPHKSSSSNSNSDDFG 292 Query: 916 KQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXXXXX 1095 KQ +++DASLS WL S NS + +E+RPILGALT+EDIKQ Sbjct: 293 KQGIALDASLSTWLASPNNSRLSNSS----------ISQEERPILGALTMEDIKQ--SSR 340 Query: 1096 XXXXXXXXXXXXDEIPILGTVGRYWN--CEEQGDDSATSRGEVKGIPNATSKYQEDKNVN 1269 +E+PI+GTVG YW+ +E S +S E +GIPN TSKY+EDK VN Sbjct: 341 TSSPRRSPSRSPEEVPIVGTVGGYWSSKSKEASASSWSSSSEKRGIPNTTSKYREDKMVN 400 Query: 1270 LHTTPFEVRLEIAL 1311 L PFE RLE AL Sbjct: 401 LCYIPFETRLERAL 414 >ref|XP_020692265.1| mediator of RNA polymerase II transcription subunit 13-like isoform X1 [Dendrobium catenatum] Length = 426 Score = 243 bits (621), Expect = 1e-71 Identities = 164/440 (37%), Positives = 222/440 (50%), Gaps = 22/440 (5%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQI------TLDLSXXXXX 219 MGCFL CFK KDRKR R 
K+SLS++ E S+S I T + Sbjct: 1 MGCFLVCFKGSKDRKRCRRSKKSLSVNKEEENGVYFHSISSSGIEIISETTAAATDNPFP 60 Query: 220 XXXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDERK 399 +EN +QG+ N RKKVTFDL+V T+E +ED +E+ E ++ E K Sbjct: 61 KLRHARFFYQENQEQGTAANARKKVTFDLHVTTFEIAPVQEDPKDFTEDDEIEREVKEEK 120 Query: 400 LEEERDESS--PKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 573 ++E ++SS PK +++P NHRYQNC Sbjct: 121 KDKEEEKSSDEPKLASYPPNHRYQNCESSDDDEGLNEDDEASDEDDEIDFDEGNDIGING 180 Query: 574 XXX--YDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPV 747 YDS+FSLP++ E + + E++SPKP + + +++ RDRS+++H V Sbjct: 181 DEEESYDSYFSLPMESEQKHIQEISSPKPTIESNTERE---LVILAKAHVRDRSRYIHSV 237 Query: 748 LNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKTREPVCST---- 915 LNPVEN+S WKEVKVR KNL KEN+ + FIS K+ +P S+ Sbjct: 238 LNPVENISHWKEVKVRPTSLKNLNKENL----DIKTKPFISPEPTFSKSEKPHKSSSSNS 293 Query: 916 ------KQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIK 1077 KQ +++DASLS WL S NS + +E+RPILGALT+EDIK Sbjct: 294 NSDDFGKQGIALDASLSTWLASPNNSRLSNSS----------ISQEERPILGALTMEDIK 343 Query: 1078 QXXXXXXXXXXXXXXXXXDEIPILGTVGRYWN--CEEQGDDSATSRGEVKGIPNATSKYQ 1251 Q +E+PI+GTVG YW+ +E S +S E +GIPN TSKY+ Sbjct: 344 Q--SSRTSSPRRSPSRSPEEVPIVGTVGGYWSSKSKEASASSWSSSSEKRGIPNTTSKYR 401 Query: 1252 EDKNVNLHTTPFEVRLEIAL 1311 EDK VNL PFE RLE AL Sbjct: 402 EDKMVNLCYIPFETRLERAL 421 >gb|PKU72536.1| Protein BREAST CANCER SUSCEPTIBILITY 1 like [Dendrobium catenatum] Length = 860 Score = 249 bits (637), Expect = 4e-70 Identities = 165/434 (38%), Positives = 224/434 (51%), Gaps = 16/434 (3%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CFK KDRKR R K+SLS++ E S+S I + +S Sbjct: 442 MGCFLVCFKGSKDRKRCRRSKKSLSVNKEEENGVYFHSISSSGIEI-ISETTAAATDNPF 500 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDERKLEEERD 417 L+EN +QG+ N RKKVTFDL+V T+E +ED +E+ E ++ E K ++E + Sbjct: 501 PKLRENQEQGTAANARKKVTFDLHVTTFEIAPVQEDPKDFTEDDEIEREVKEEKKDKEEE 560 Query: 418 
ESS--PKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX--Y 585 +SS PK +++P NHRYQNC Y Sbjct: 561 KSSDEPKLASYPPNHRYQNCESSDDDEGLNEDDEASDEDDEIDFDEGNDIGINGDEEESY 620 Query: 586 DSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVLNPVEN 765 DS+FSLP++ E + + E++SPKP + + +++ RDRS+++H VLNPVEN Sbjct: 621 DSYFSLPMESEQKHIQEISSPKPTIESNTERE---LVILAKAHVRDRSRYIHSVLNPVEN 677 Query: 766 LSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKTREPVCST---------- 915 +S WKEVKVR KNL KEN+ + FIS K+ +P S+ Sbjct: 678 ISHWKEVKVRPTSLKNLNKENL----DIKTKPFISPEPTFSKSEKPHKSSSSNSNSDDFG 733 Query: 916 KQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXXXXX 1095 KQ +++DASLS WL S NS + +E+RPILGALT+EDIKQ Sbjct: 734 KQGIALDASLSTWLASPNNSRLSNSS----------ISQEERPILGALTMEDIKQ--SSR 781 Query: 1096 XXXXXXXXXXXXDEIPILGTVGRYWN--CEEQGDDSATSRGEVKGIPNATSKYQEDKNVN 1269 +E+PI+GTVG YW+ +E S +S E +GIPN TSKY+EDK VN Sbjct: 782 TSSPRRSPSRSPEEVPIVGTVGGYWSSKSKEASASSWSSSSEKRGIPNTTSKYREDKMVN 841 Query: 1270 LHTTPFEVRLEIAL 1311 L PFE RLE AL Sbjct: 842 LCYIPFETRLERAL 855 >ref|XP_020093425.1| nuclear polyadenylated RNA-binding protein 3-like [Ananas comosus] Length = 430 Score = 236 bits (602), Expect = 9e-69 Identities = 173/436 (39%), Positives = 211/436 (48%), Gaps = 45/436 (10%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHR-ECCRALQPSLSP-----------KQITLDL 201 MGCFL C PKDRKRRR K+S D R E + LQ + SP Q++L+ Sbjct: 1 MGCFLACLGGPKDRKRRRSPKKSPPRDRRHESYKLLQQNASPLRAQPLLEQITPQLSLEK 60 Query: 202 SXXXXXXXXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASE 381 ++E +QGS + RKKVTFDLN+KTYE A + + C EE E +E Sbjct: 61 PSTEVAANSIPDQEIREINEQGSFSSCRKKVTFDLNIKTYEEAATQGNEKCILEEDEKNE 120 Query: 382 -KIDERKLEEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 558 + +E+ +E+ + + +S +P+NHRYQNC Sbjct: 121 AREEEKNPQEDAGKPTSQSGNYPLNHRYQNCSNSDDEDGGDGEEDGDGEDDDEDEYGEED 180 Query: 559 XXXXXXXXYDSFFSLPIDKE--PQGL----------------------PEVNSPKPKFAP 666 + + IDKE P GL EV SP P Sbjct: 181 ED-------EDYDDCGIDKERNPIGLQEDEESYESFFSLPIEKELENNQEVISPGPNCVS 233 Query: 667 
SPALDTIPILVAGGTTKRDRSQFVHPVLNPVENLSQWKEVKVRTAPTKNLKKENIFSEQE 846 SP D PIL+A G RDRS +V+ VLNPVENLSQWKEVKV P KN KENI EQE Sbjct: 234 SP--DKQPILLAKG-NPRDRSHYVNSVLNPVENLSQWKEVKVGAVPLKNTNKENINLEQE 290 Query: 847 NNKLTFISETTIKVKTREPVCST-----KQDVSVDASLSNWLPSSENSTAAGPQAXXXXX 1011 NK+ E T K K E + KQ+VSVDASLS WL SSENS+ Q Sbjct: 291 KNKMVLSPEPTFKTKKLEGTADSNNSPCKQEVSVDASLSTWLVSSENSSVQRIQESNSAR 350 Query: 1012 XXXXXXREDRPILGALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGD 1191 RE+RPILGALT+EDIKQ DEIPI+GTVG YW+C QGD Sbjct: 351 SNWSLSREERPILGALTVEDIKQ--SSVTSSPRRSPSRSPDEIPIVGTVGSYWSCRSQGD 408 Query: 1192 DSATS--RGEV-KGIP 1230 S+ S G V KGIP Sbjct: 409 TSSYSSRSGSVTKGIP 424 >ref|XP_009382207.1| PREDICTED: protein bfr2-like [Musa acuminata subsp. malaccensis] Length = 388 Score = 230 bits (586), Expect = 6e-67 Identities = 173/419 (41%), Positives = 210/419 (50%), Gaps = 20/419 (4%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDH-RECCRALQPSLSPKQIT-------------- 192 MGCFL CF KDRKRRR K+SL ++ E R L+ ++S KQIT Sbjct: 1 MGCFLACFGDVKDRKRRRSPKKSLPLERVHERYRLLRSNVSRKQITPMPLMPKVSPPNLP 60 Query: 193 LDLSXXXXXXXXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGE 372 +D + ++EN+ QGS RKKVTFDLNV+T+E V +E C SE+ E Sbjct: 61 VD-TVTDTLLQLKCHDLIQENLAQGSFRRTRKKVTFDLNVQTHEYVLGDEGPKCPSEDDE 119 Query: 373 ASEKIDE-RKLEEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXX 549 A+E IDE RK + +D S K +FP NHRYQNC Sbjct: 120 ATEVIDEERKPHKGQDTSFTKFGSFPSNHRYQNCGSSDDDEGVSEEDEEDYEDSDIDEEE 179 Query: 550 XXXXXXXXXXXYDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRS 729 YD FSL D EPQG+ E NSP S + D PIL+A G T RDR Sbjct: 180 DNEDGDEEGS-YDFLFSLANDDEPQGVQEANSP------SSSPDRRPILLARGNT-RDRR 231 Query: 730 QFVHPVLNPVENLSQWKEVKVRTAPT-KNLKKENIFSEQENNKLTFISETTIKVKTREPV 906 Q+VH VLNPVEN+SQWKEVKVR AP KN KEN+ +E+EN E +KVK Sbjct: 232 QYVHSVLNPVENVSQWKEVKVRAAPAKKNSTKENVGAEKENQ-----MEPMVKVKK---- 282 Query: 907 CSTKQDVSVDASLSNWLPSSENSTAAGPQ---AXXXXXXXXXXXREDRPILGALTIEDIK 1077 TKQ+VSV+ASLS+WL SENST GP+ + E RP+LG D+K Sbjct: 283 
-PTKQEVSVEASLSSWLSWSENSTVEGPELSNSRDRPNYSWFSQEERRPVLG-----DVK 336 Query: 1078 QXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSRGEVKGIPNATSKYQE 1254 Q +EIPILGTV Y C+ GD E GI N SKY E Sbjct: 337 Q--ASVASSPGRFPSRSTEEIPILGTVAVYRTCKNHGD-------EHTGIQNTISKYGE 386 >ref|XP_010264923.1| PREDICTED: rRNA biogenesis protein rrp36 [Nelumbo nucifera] Length = 470 Score = 231 bits (588), Expect = 3e-66 Identities = 179/472 (37%), Positives = 222/472 (47%), Gaps = 49/472 (10%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CF KDRKRRRP K+ L D R P Q T+ L Sbjct: 1 MGCFLACFGTKKDRKRRRPAKRVLPGDQRHGI------YEPLQPTVSLMQDIASKPISSV 54 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVA------YEEDRDCSSEEGEASEKIDERK 399 L ++ ++ RKKVTFDLNVKTYE V Y + D E G E+ +E + Sbjct: 55 SELGGKPEEQLSLSTRKKVTFDLNVKTYEEVVTHETVNYASESDDKKENGNKEEEEEEEE 114 Query: 400 LEEERDES--SPKSS------------AFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXX 537 EEE +E+ S +SS ++P NHRYQNC Sbjct: 115 EEEEGEETGKSKQSSVSEDDSVTSSLGSYPSNHRYQNCRDSDDEADGIESDESDLDDDDD 174 Query: 538 XXXXXXXXXXXXXXX-----YDSFFSLPIDKEPQGLP------EVNSPKPKFAPSPALDT 684 +SFFSLP++ P EVNSP P SP + Sbjct: 175 DEDYNDDEEDDVRGVPVEESSESFFSLPMESRNHTCPTPPFEKEVNSPMPTCG-SPGREL 233 Query: 685 IPILVAGGTTKRDRSQFVHPVLNPVENLSQWKEVKVRTAPT--KNLKKENIFSEQENNKL 858 + T RDRSQ+V+ VLNPVENL+QWK VK RTAP ++ KENI E+E ++ Sbjct: 234 KTLRPTQNT--RDRSQYVNSVLNPVENLTQWKAVKARTAPPLKQHQTKENINFEKEP-EI 290 Query: 859 TFISETTIKVKTREPVCS------TKQDVSVDASLSNWLPSS--------ENSTAAGPQA 996 F SE T K +P + K +++VDASLSNWL SS N Q+ Sbjct: 291 PFSSEPTFKPLPLKPNSNFNHSKPPKHEIAVDASLSNWLGSSGTTPITKTSNDVVRSEQS 350 Query: 997 XXXXXXXXXXXR--EDRPILGALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYW 1170 R EDRPILGALT+E++KQ +++PILGTVG YW Sbjct: 351 EKSISQRSNSPRSYEDRPILGALTMEELKQLSASSSPRRSPSRSP--EDMPILGTVGSYW 408 Query: 1171 NCEEQGDDSATSRGEVKGIPNATSKYQEDKNVNLHTTPFEVRLEIALRNGVA 1326 N Q DS+ S KGIPN TSKY+EDK VN H+TPFE RLE AL G A Sbjct: 409 NHTGQAMDSS-SGSSCKGIPNTTSKYREDKRVNWHSTPFETRLERALNTGSA 459 >gb|OVA02338.1| hypothetical protein 
BVC80_9099g138 [Macleaya cordata] Length = 451 Score = 228 bits (582), Expect = 1e-65 Identities = 164/460 (35%), Positives = 212/460 (46%), Gaps = 37/460 (8%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CF K+ KRR+P + + R S P Q T + Sbjct: 1 MGCFLACFGSSKNGKRRKPANKVIPRGPRH---QRHGSYEPLQSTDSIKQEIIETPISPV 57 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEE------DRDCSSEEGEASEKIDERK 399 ++ +++ RKKVTFDLNVK YE V +E + + +E E +++I Sbjct: 58 PESRDKIEEQLSFGTRKKVTFDLNVKAYEEVTTQEITQSEDEEEKEKKENEIAKQIRSTS 117 Query: 400 LEEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 579 L E+ SS S +FP NHRY NC Sbjct: 118 LSED---SSNSSGSFPSNHRYGNCRNSDDEYEELESDESDLDDDDEDDDDYEDDDDDDED 174 Query: 580 XYD----------SFFSLPIDKEPQGLP----EVNSPKPKFAPSPALDTIPILVAGGTTK 717 D SFFSLP+ P EVNSP + +LD + Sbjct: 175 DVDHRFGEPESSESFFSLPVGSRTSATPVAEKEVNSP---LKTNDSLDQEKKTLGSNRNA 231 Query: 718 RDRSQFVHPVLNPVENLSQWKEVKVRT-APTKNLKKENIFSEQENNKLTFISETTIKVK- 891 RDRSQ+VH VLNPVENL+QWK VK +T +P K+ KENI E E K+ F +E T K+ Sbjct: 232 RDRSQYVHSVLNPVENLTQWKAVKAKTPSPLKHQTKENINFETEP-KIPFSAEPTFKLSP 290 Query: 892 -----TREPVCSTKQDVSVDASLSNWLPSSENSTA----------AGPQAXXXXXXXXXX 1026 + K +++VDASLS WL SSE++ + + Sbjct: 291 FSFNPNIDQSTPPKPEIAVDASLSTWLGSSESTPSIKSSNISIGTGSAEVNKSPRSNSPR 350 Query: 1027 XREDRPILGALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATS 1206 +EDRPILGALT+E++KQ DEIPILGTVG YW+ Q DS++S Sbjct: 351 SQEDRPILGALTLEELKQFSATSSPRKSPTHCP--DEIPILGTVGSYWSHTNQAIDSSSS 408 Query: 1207 RGEVKGIPNATSKYQEDKNVNLHTTPFEVRLEIALRNGVA 1326 KGIPN TSKY+EDK VN H+TPFE RLE AL G A Sbjct: 409 SSSCKGIPNTTSKYREDKRVNWHSTPFETRLERALNKGAA 448 >ref|XP_020247430.1| LOW QUALITY PROTEIN: uncharacterized protein LOC109825109 [Asparagus officinalis] Length = 461 Score = 225 bits (573), Expect = 4e-64 Identities = 163/425 (38%), Positives = 206/425 (48%), Gaps = 2/425 (0%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CF KDRK+RR K+ I+ + + P QI S Sbjct: 80 
MGCFLACFGGAKDRKQRRRSKKPKPINR------VTQNYQPLQIQASPSPQKPEAAAVVP 133 Query: 238 XX-LKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDERKLEEER 414 L+EN +QG + RKKVTFDLNV+TYE V + +++ +++ + Sbjct: 134 VPELRENQEQGISGSCRKKVTFDLNVRTYENVT-------------VYDPLEDERVKGAK 180 Query: 415 DESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYDSF 594 +E K S+ N+RYQNC Sbjct: 181 EERETKVSS-DANYRYQNCDSSDDEGLDDDEEEVEYDVSDFEDEEGEAPVVVAXXXXQQ- 238 Query: 595 FSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVLNPVENLSQ 774 + EV+SP P I ++ + RDRSQ+VH VLNPVENLSQ Sbjct: 239 -----QRNQSNCEEVDSPAP------------ISISTKGSARDRSQYVHSVLNPVENLSQ 281 Query: 775 WKEVKVRTAPTKNLKKENIFSEQENNKLTFISETT-IKVKTREPVCSTKQDVSVDASLSN 951 WKEVKV P K KENI ++ENN+ S T + R+P STK+++SVDASLS Sbjct: 282 WKEVKVGALPLKAPVKENII-DKENNRSGLESRVTGFSPQPRKPRDSTKEEISVDASLST 340 Query: 952 WLPSSENSTAAGPQAXXXXXXXXXXXREDRPILGALTIEDIKQXXXXXXXXXXXXXXXXX 1131 WL SSE + G Q RED+PILGALT+ED+K Sbjct: 341 WLVSSEEKSPKGAQGSNSQWSNSSVSREDQPILGALTVEDLKH--SSVTSSPRRSPNKNP 398 Query: 1132 DEIPILGTVGRYWNCEEQGDDSATSRGEVKGIPNATSKYQEDKNVNLHTTPFEVRLEIAL 1311 DE+PILGTVG YWN + Q +S KGIPN TSKY+EDK VN H+TPFEVRLE AL Sbjct: 399 DEMPILGTVGSYWNSKAQYSES-------KGIPNTTSKYREDKKVNWHSTPFEVRLERAL 451 Query: 1312 RNGVA 1326 G A Sbjct: 452 NKGSA 456 >ref|XP_020584860.1| uncharacterized protein LOC110027684 [Phalaenopsis equestris] Length = 446 Score = 212 bits (540), Expect = 2e-59 Identities = 156/452 (34%), Positives = 217/452 (48%), Gaps = 32/452 (7%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDH--------------RECCRALQPSLSPKQITL 195 MGCF CFK R+R R K+S + + +E C + S +I Sbjct: 1 MGCFFLCFKGSNGRRRCRKHKKSSPVKYGGFVFLLPRAITQEKESCAYFRSISSGIEIIA 60 Query: 196 DLSXXXXXXXXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEA 375 + + L EN +QG+ N+RKKVTFDL+V T+E V + ED +E+GE Sbjct: 61 ETAAAAVNPFPK----LSENEEQGTVSNIRKKVTFDLHVTTFEIVPFHEDSKDFTEDGEI 116 Query: 376 SEKIDERKLEEERDESSPKS---SAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXX 546 + E K ++E +E + +S +A P NHRYQNC Sbjct: 117 
VREAKEGKKDKEEEECNHESKIAAACPPNHRYQNCDSSDDEGLDEEDEASDDDEENEFDE 176 Query: 547 XXXXXXXXXXXX-YDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRD 723 YDS+FSL ++ E + +V KP + + + + + + RD Sbjct: 177 GDDISINGDEEESYDSYFSLHMEGESKHFHDVT--KPNHTTTESNTDVELPFSSKASVRD 234 Query: 724 RSQFVHPVLNPVENLSQWKEVKVR-TAPTKNLKK-----------ENIFSEQENNKLTFI 867 RS++++ VLNPVEN+S WKE+KVR T+ + KK E FSE E +F Sbjct: 235 RSRYINSVLNPVENISHWKELKVRPTSKNPSNKKFEPNTRSFISPEPTFSEAERPHKSFS 294 Query: 868 SETTIKVKTREPVCSTKQDVSVDASLSNWLPSSENSTAAGPQAXXXXXXXXXXXREDRPI 1047 S P KQ++++DASLSNWL S NS PQ +E+RPI Sbjct: 295 SNPN-------PDDFGKQEIALDASLSNWLVSPNNSKPE-PQRSNSMLSNSSISQEERPI 346 Query: 1048 LGALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCE--EQGDDSATSRGEVK 1221 LGALTIEDIK +E+PI+GTVG YW+ + E+ S +S E + Sbjct: 347 LGALTIEDIKHSSRTSSPRKSPSWSA--EEVPIVGTVGGYWSSKSGEESSSSWSSISEER 404 Query: 1222 GIPNATSKYQEDKNVNLHTTPFEVRLEIALRN 1317 GIPN T KY+EDK VNL+ TPFE RLE AL + Sbjct: 405 GIPNTTGKYREDKRVNLYFTPFETRLERALNS 436 >ref|XP_017984350.1| PREDICTED: DNA ligase 1 [Theobroma cacao] Length = 441 Score = 210 bits (534), Expect = 1e-58 Identities = 158/447 (35%), Positives = 203/447 (45%), Gaps = 29/447 (6%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CF KDRK R+ + R Q ++S +Q L+ Sbjct: 1 MGCFLACFGSSKDRKTRKQRHKVQPRFQRNASYNAQSTVSLEQSNLEKPIGPVKEVRDD- 59 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCS------SEEGEASEKIDERK 399 + + GSG + RKKVTFD NVKTYE V +E D EEGE K++E Sbjct: 60 ---EAEEQLGSGSSNRKKVTFDTNVKTYEHVLIDESTDFELHNEEEEEEGENKGKVNEDN 116 Query: 400 LEEERDESSPK--------SSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXX 555 L + R+ + S+ +P NHRYQNC Sbjct: 117 LTKRRESENSSEHSSITSSSTFYPPNHRYQNCRESDNEDEDGELDYEESDLDDDEDDD-- 174 Query: 556 XXXXXXXXXYDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQF 735 Y+ F ++ + V K + PI + G RDRS Sbjct: 175 ---------YEDFDDGAVESRDM-IRGVRGVTEKVDGLVQEEVKPIGLIRGV--RDRSGN 222 Query: 736 VHPVLNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKV-------KT 894 V PVLNPVENL+QWK VK + AP L+KEN+ 
EQE +L+F S+ + K K+ Sbjct: 223 VPPVLNPVENLTQWKAVKAKGAPPPKLRKENLSLEQEEPRLSFSSDPSFKELSFSFKSKS 282 Query: 895 REPVCSTKQDVSVDASLSNWLPSSE--------NSTAAGPQAXXXXXXXXXXXREDRPIL 1050 Q+VSVDASLSNWL SSE N A+ P+ EDRPIL Sbjct: 283 DHEPMKLDQEVSVDASLSNWLSSSETTPVKKKSNFDASTPERSMSQGSNSLRSPEDRPIL 342 Query: 1051 GALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSRGEVKGIP 1230 GALT+E+I + DE+PI+GTVG YW+ + S KGIP Sbjct: 343 GALTLEEINKFSASSSPRKSPSRSP--DEMPIIGTVGTYWSHHVSTTKDSGSATSFKGIP 400 Query: 1231 NATSKYQEDKNVNLHTTPFEVRLEIAL 1311 N TSKY+EDKNV+ H+TPFE RLE AL Sbjct: 401 NTTSKYREDKNVSWHSTPFETRLERAL 427 >gb|EOX94353.1| Uncharacterized protein TCM_003948 isoform 1 [Theobroma cacao] gb|EOX94354.1| Uncharacterized protein TCM_003948 isoform 1 [Theobroma cacao] Length = 442 Score = 209 bits (531), Expect = 3e-58 Identities = 158/448 (35%), Positives = 202/448 (45%), Gaps = 30/448 (6%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CF KDRK R+ + R Q ++S +Q L+ Sbjct: 1 MGCFLACFGSSKDRKTRKQRHKVQPRFQRNASYNAQSTVSLEQSNLEKPIGPVKEVRDDD 60 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCS-------SEEGEASEKIDER 396 + GSG + RKKVTFD NVKTYE V +E D EEGE K++E Sbjct: 61 A----EEQLGSGSSNRKKVTFDTNVKTYEHVLIDESTDFELHNEEEEEEEGENKGKVNED 116 Query: 397 KLEEERDESSPK--------SSAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXX 552 L + R+ + S+ +P NHRYQNC Sbjct: 117 NLTKRRESENSSEHSSITSSSTFYPPNHRYQNCRESDNEDEDGELDYEESDLDDDEDDD- 175 Query: 553 XXXXXXXXXXYDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQ 732 Y+ F ++ + V K + PI + G RDRS Sbjct: 176 ----------YEDFDDGAVESRDM-IRGVRGVTEKVDGLVQEEVKPIGLIRGV--RDRSG 222 Query: 733 FVHPVLNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKV-------K 891 V PVLNPVENL+QWK VK + AP L+KEN+ EQE +L+F S+ + K K Sbjct: 223 NVPPVLNPVENLTQWKAVKAKGAPPPKLRKENLSLEQEEPRLSFSSDPSFKELSFSFKSK 282 Query: 892 TREPVCSTKQDVSVDASLSNWLPSSE--------NSTAAGPQAXXXXXXXXXXXREDRPI 1047 + Q+VSVDASLSNWL SSE N A+ P+ EDRPI Sbjct: 283 SDHEPMKLDQEVSVDASLSNWLSSSETTPVKKKSNFDASTPERSMSQGSNSLRSPEDRPI 342 
Query: 1048 LGALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSRGEVKGI 1227 LGALT+E+I + DE+PI+GTVG YW+ + S KGI Sbjct: 343 LGALTLEEINKFSASSSPRKSPSRSP--DEMPIIGTVGTYWSHHVSTTKDSGSATSFKGI 400 Query: 1228 PNATSKYQEDKNVNLHTTPFEVRLEIAL 1311 PN TSKY+EDKNV+ H+TPFE RLE AL Sbjct: 401 PNTTSKYREDKNVSWHSTPFETRLERAL 428 >ref|XP_010093087.1| putative vacuolar protein sorting-associated protein 13F [Morus notabilis] gb|EXB53515.1| hypothetical protein L484_005945 [Morus notabilis] Length = 424 Score = 206 bits (523), Expect = 3e-57 Identities = 164/449 (36%), Positives = 210/449 (46%), Gaps = 26/449 (5%) Frame = +1 Query: 58 MGCFLGCFKVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXXX 237 MGCFL CF K+ +RRR + + R Q + SPK + +S Sbjct: 1 MGCFLACFGTSKNDRRRRKQRNQVQP------RLHQRNESPKAVQSAVSSVQVESENLVS 54 Query: 238 XXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDC--SSEEGEASEKIDERKLEEE 411 +Q + ++ RKKVTFD NV+TYE V+ +D D SE+ E E+ D KL Sbjct: 55 LVSVVREEQPN-LSPRKKVTFDSNVRTYEHVSTYDDSDLLRESEDFEKKEEDDLGKLSLS 113 Query: 412 RDESSPKS-----SAFPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 576 + S S ++P NHRYQNC Sbjct: 114 KSPSEDSSVTSSLGSYPPNHRYQNCRESDDEDEELDFEDSDLDDEDENGDEDDGEVEYE- 172 Query: 577 XXYDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPVLNP 756 D L E EVNSP P S L++ + RDRS +VH VLNP Sbjct: 173 ---DEVIELSRASE-----EVNSPMPV---SGLLESEVL----NKNVRDRSAYVHSVLNP 217 Query: 757 VENLSQWKEVKVRTAPTKN--LKKENIFSEQENNKLTFISETTIK-------VKTREPVC 909 VENL+QWK VK R P ++KEN +QE +++F SE K KT +PV Sbjct: 218 VENLTQWKAVKARGKPKTRPQIQKENFTLDQEEPRISFNSEPAFKDLSLSSKSKTDQPV- 276 Query: 910 STKQDVSVDASLSNWLPSSEN------STAA----GPQAXXXXXXXXXXXREDRPILGAL 1059 KQ+++VDASLSNWL S E+ ST A P +EDRPILGAL Sbjct: 277 KPKQEMAVDASLSNWLVSPESTPVNKTSTFAFDNLSPGRCTSQGSNSVRSQEDRPILGAL 336 Query: 1060 TIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSRGEVKGIPNAT 1239 T+E+I+Q DE+PI+GTVG YWN DS ++ +KGIPN T Sbjct: 337 TVEEIRQFSASSSPRKSPSRSP--DEMPIIGTVGTYWNNTGPAKDSGSATS-LKGIPNTT 393 Query: 1240 SKYQEDKNVNLHTTPFEVRLEIALRNGVA 1326 SKY+EDK VN HTTPFE RLE AL G + 
Sbjct: 394 SKYREDKRVNWHTTPFETRLERALNRGAS 422 >gb|PON44348.1| eisosome protein [Parasponia andersonii] Length = 442 Score = 205 bits (522), Expect = 6e-57 Identities = 155/449 (34%), Positives = 209/449 (46%), Gaps = 26/449 (5%) Frame = +1 Query: 58 MGCFLGCF-KVPKDRKRRRPLKQSLSIDHRECCRALQPSLSPKQITLDLSXXXXXXXXXX 234 MGCFL CF KDRKRRR Q + R+ R S Q + Sbjct: 1 MGCFLACFGNSKKDRKRRRQRNQ---VQPRDQYR--NASFKAVQSVVSSVQEESERPISL 55 Query: 235 XXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVAYEEDRDCSSEEGEASEKIDERK----- 399 +++ ++ ++ RKKVTFD NVKTYE V+ ED + E +A++ ++++ Sbjct: 56 VSEVRDKPEEQPSLSTRKKVTFDSNVKTYEHVSTHEDSEKLPESEDATKTKEDKEDLANL 115 Query: 400 -LEEERDESSPKSSA---FPVNHRYQNCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 567 L + E S +S+ +P NHRYQNC Sbjct: 116 MLSKSFSEDSSITSSLGSYPPNHRYQNCRDSDDEDEELDFGDSDLDDEDDEYEDEDDGEI 175 Query: 568 XXXXXYDSFFSLPIDKEPQGLPEVNSPKPKFAPSPALDTIPILVAGGTTKRDRSQFVHPV 747 +D + + EVNSP P S L++ + RDRS +VH V Sbjct: 176 DYEDEI-------VDLKRRNSEEVNSPMPV---SGLLESEIQPIGLNRNARDRSGYVHSV 225 Query: 748 LNPVENLSQWKEVKVRTAPTKNLKKENIFSEQENNKLTFISETTIKVKT------REPVC 909 LNPVENLSQWK +K + P +KEN EQE ++F SE K + ++ Sbjct: 226 LNPVENLSQWKAIKAKGTPKMKPQKENFILEQEP-WISFSSEPAFKELSLSFKSKKDQPT 284 Query: 910 STKQDVSVDASLSNWLPSSE----NSTAA------GPQAXXXXXXXXXXXREDRPILGAL 1059 KQ+V+VDASLSNWL + E N T++ P +EDRPILGAL Sbjct: 285 RPKQEVAVDASLSNWLATPETKPVNKTSSFAFSTFSPDRSTSQGSNSVRSQEDRPILGAL 344 Query: 1060 TIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQGDDSATSRGEVKGIPNAT 1239 T+E+++Q DE+PI+GTVG YW+ DS S KG+PN T Sbjct: 345 TVEELRQYSASNSPRKSPSRSP--DEMPIIGTVGTYWSHSGNAKDSG-SASSFKGVPNTT 401 Query: 1240 SKYQEDKNVNLHTTPFEVRLEIALRNGVA 1326 SKY+EDKNVN H+TPFE RLE AL G A Sbjct: 402 SKYREDKNVNWHSTPFETRLERALNRGAA 430 >gb|EMS58738.1| hypothetical protein TRIUR3_23723 [Triticum urartu] Length = 477 Score = 206 bits (523), Expect = 1e-56 Identities = 143/409 (34%), Positives = 199/409 (48%), Gaps = 18/409 (4%) Frame = +1 Query: 154 RALQPSLSPKQITLDLSXXXXXXXXXXXXXLKENVKQGSGVNVRKKVTFDLNVKTYEAVA 333 + L P LSP + S L+E ++ S +KKVTFD+NV TYE + Sbjct: 
92 KVLTPPLSP----VKCSPVAAAVASTPDMELREVSEEDSHSGGKKKVTFDMNVTTYENTS 147 Query: 334 YEEDRDCSSEEGEASEKIDERKLEEERDESSPKSSAFPVNHRYQNCXXXXXXXXXXXXXX 513 + + SE + +E+E +E K+ F NHRY NC Sbjct: 148 SPDQEEIPSEL--------VKWMEDEEEEHMQKTVLFSENHRYGNCTDSDDDNGDEYGED 199 Query: 514 XXXXXXXXXXXXXXXXXXXXXXX------------YDSFFSLPIDKEPQGLPEVNSPKPK 657 +S FSLP+ Q +V+SP PK Sbjct: 200 DNYGDDSDAEEDFVDCKIDLLDEEEIRTEENPEESQESLFSLPMSNYTQDDQDVSSPVPK 259 Query: 658 FAPSPALDTIPILVAGGTTKRDRSQFVHPVLNPVENLSQWKEVKVRTAPTKNLKKENIFS 837 + +PA + P++ G RDRSQ+V PVLNPV+N QWKEVK + P K L KEN+ Sbjct: 260 SSVTPAQEESPLIQ--GNNHRDRSQYVRPVLNPVQNREQWKEVKAQAGPVKKLYKENV-- 315 Query: 838 EQENNKLTFISET-TIKVKTRE---PVCSTKQDVSVDASLSNWLPSSENSTAAGPQAXXX 1005 N + + T T KV + P S+K +VSVDASLS WL SS+NST Q+ Sbjct: 316 ----NSVPNVGATLTCKVANQTKIGPSNSSKGEVSVDASLSTWLVSSDNSTVDKAQS-KS 370 Query: 1006 XXXXXXXXREDRPILGALTIEDIKQXXXXXXXXXXXXXXXXXDEIPILGTVGRYWNCEEQ 1185 R++RP+LGALT++D+KQ +E+PILGTVG YW+ +Q Sbjct: 371 PRSVSSVCRQERPVLGALTVDDLKQ--SSATSSPRRSPSHNHEEVPILGTVGSYWSSTKQ 428 Query: 1186 GDDSATSRGE--VKGIPNATSKYQEDKNVNLHTTPFEVRLEIALRNGVA 1326 G++ +SR + GIPN+TSKY+ED+ VN H+TPF VRL+ A++ A Sbjct: 429 GNEYCSSRSDSGTNGIPNSTSKYREDRRVNWHSTPFNVRLDKAMKKSSA 477