BLASTX nr result
ID: Cheilocostus21_contig00026303
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00026303 (692 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

Sequences producing significant alignments:  Score (bits)  E Value

ref|XP_009404763.1| PREDICTED: uncharacterized protein LOC103988...  249  2e-72
ref|XP_018679559.1| PREDICTED: uncharacterized protein LOC103979...  209  1e-58
ref|XP_009393858.1| PREDICTED: uncharacterized protein LOC103979...  208  5e-58
ref|XP_010936375.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  196  7e-54
ref|XP_008777040.1| PREDICTED: uncharacterized protein LOC103697...  193  6e-53
ref|XP_019706661.1| PREDICTED: uncharacterized protein LOC105046...  193  6e-53
ref|XP_008777039.1| PREDICTED: uncharacterized protein LOC103697...  193  6e-53
ref|XP_010923576.1| PREDICTED: uncharacterized protein LOC105046...  193  6e-53
ref|XP_017978215.1| PREDICTED: uncharacterized protein LOC186139...  185  4e-50
ref|XP_020104050.1| uncharacterized protein LOC109721046 isoform...  182  3e-49
ref|XP_021281040.1| uncharacterized protein LOC110414270 [Herran...  181  1e-48
gb|OAY67602.1| hypothetical protein ACMD2_05229 [Ananas comosus]  180  3e-48
gb|EOX95734.1| Poly(A) RNA polymerase cid14, putative [Theobroma...  179  6e-48
ref|XP_019710291.1| PREDICTED: uncharacterized protein LOC105057...  179  8e-48
ref|XP_010938971.1| PREDICTED: uncharacterized protein LOC105057...  179  8e-48
ref|XP_010261538.1| PREDICTED: uncharacterized protein LOC104600...  177  2e-47
ref|XP_010261537.1| PREDICTED: uncharacterized protein LOC104600...  177  2e-47
gb|KHN06300.1| Poly(A) RNA polymerase cid14 [Glycine soja]  177  2e-47
gb|PKA49759.1| hypothetical protein AXF42_Ash004300 [Apostasia s...  177  3e-47
ref|XP_006583248.1| PREDICTED: uncharacterized protein LOC100809...  177  4e-47

>ref|XP_009404763.1| PREDICTED: uncharacterized protein LOC103988002 [Musa acuminata subsp. malaccensis]
Length = 1351
Score = 249 bits (636), Expect = 2e-72
Identities = 125/237 (52%), Positives = 155/237 (65%), Gaps = 8/237 (3%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
E EEP++DILNSDLN H QNL+YGR+CQNS H PF+Y+ PI L +H+ WDG GR
Sbjct: 1039 ETSEEPSSDILNSDLNDHWQNLIYGRYCQNSNHGPFVYSSPVAGQPIYLPSHHLWDGPGR 1098

Query: 183 SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
+A NLNY MGH PR+VPM PLQ+GPD++ FQR+VD A RYRGGTGTYLPNPKV
Sbjct: 1099 PLAANLNYIPIMGHGPRLVPMMPLQTGPDQAPGVFQRHVDGAPRYRGGTGTYLPNPKVSF 1158

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKN 518
NDP DR RA DRS+GR+QAER++ PD AA++N
Sbjct: 1159 RDRQSSLRSHRGNRNYDHNDPVDREGRWVHSKSRAFDRSYGRNQAERSSLPPDHLAASRN 1218

Query: 519 QEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPTN 689
Q+ K+W +Y HEPLA+++G T+ NSS++ ENA GMYP+ A SNGV TN
Sbjct: 1219 QDVKKWVSYGHEPLASYQGSFATI---NSSHNLENALGMYPQNAVGSNGVTPPDSTN 1272

>ref|XP_018679559.1| PREDICTED: uncharacterized protein LOC103979435 isoform X1 [Musa acuminata subsp. malaccensis]
ref|XP_018679560.1| PREDICTED: uncharacterized protein LOC103979435 isoform X1 [Musa acuminata subsp. malaccensis]
Length = 1338
Score = 209 bits (533), Expect = 1e-58
Identities = 113/235 (48%), Positives = 144/235 (61%), Gaps = 7/235 (2%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
E EE ++DI N D H QNLVYGR CQNSY P MY PI L +HYPWDG G+
Sbjct: 1039 ESSEELSSDIFNVDSVSHWQNLVYGRSCQNSYPGPVMYTSPVARPPIYLPSHYPWDGPGK 1098

Query: 183 SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVP- 359
++++L+YTQ GHNP +VP+ P Q G DR++ FQ DEA YRGGTGTYLPN KV
Sbjct: 1099 LLSSDLHYTQMTGHNPLLVPIMPFQPGSDRASGVFQDYADEAPTYRGGTGTYLPNSKVSF 1158

Query: 360 XXXXXXXXXXXXXXXXXXWNDPGDRF------RAVDRSHGRSQAERATFRPDRQAAAKNQ 521
+DP DR RA HGR++AER RPD+ AA+KNQ
Sbjct: 1159 GDHRQSGSRNHEGNCNYDKDDPVDRSWVSSKPRAFGHGHGRNRAERPRLRPDQLAASKNQ 1218

Query: 522 EEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
EK+W +++ EP+A++R R+ STNSS SE+A GM+P+TA S GVN++ PT
Sbjct: 1219 -EKKWESWRSEPVASNRRRGRSFASTNSSYISESAPGMHPQTASSSEGVNASDPT 1272

>ref|XP_009393858.1| PREDICTED: uncharacterized protein LOC103979435 isoform X2 [Musa acuminata subsp. malaccensis]
Length = 1337
Score = 208 bits (529), Expect = 5e-58
Identities = 111/234 (47%), Positives = 142/234 (60%), Gaps = 6/234 (2%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
E EE ++DI N D H QNLVYGR CQNSY P MY PI L +HYPWDG G+
Sbjct: 1039 ESSEELSSDIFNVDSVSHWQNLVYGRSCQNSYPGPVMYTSPVARPPIYLPSHYPWDGPGK 1098

Query: 183 SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
++++L+YTQ GHNP +VP+ P Q G DR++ FQ DEA YRGGTGTYLPN
Sbjct: 1099 LLSSDLHYTQMTGHNPLLVPIMPFQPGSDRASGVFQDYADEAPTYRGGTGTYLPNSVSFG 1158

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDRF------RAVDRSHGRSQAERATFRPDRQAAAKNQE 524
+DP DR RA HGR++AER RPD+ AA+KNQ
Sbjct: 1159 DHRQSGSRNHEGNCNYDKDDPVDRSWVSSKPRAFGHGHGRNRAERPRLRPDQLAASKNQ- 1217

Query: 525 EKRWATYKHEPLAAHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
EK+W +++ EP+A++R R+ STNSS SE+A GM+P+TA S GVN++ PT
Sbjct: 1218 EKKWESWRSEPVASNRRRGRSFASTNSSYISESAPGMHPQTASSSEGVNASDPT 1271

>ref|XP_010936375.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105056012 [Elaeis guineensis]
Length = 1356
Score = 196 bits (498), Expect = 7e-54
Identities = 104/234 (44%), Positives = 141/234 (60%), Gaps = 9/234 (3%)
Frame = +3

Query: 9 PEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGRSI 188
P E +DILNSD H QNL+YGRFCQ+ +H P +Y AV P L H+P DG GR +
Sbjct: 1045 PLEEKSDILNSDFASHLQNLLYGRFCQD-FHGPIIYPSPAVVPPSYLQGHFPLDGPGRPL 1103

Query: 189 ATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
+ N+N+TQ M + P++VP+ P+Q PDR+A FQR DEA RYRGGTGTYLPNPK+
Sbjct: 1104 SANVNFTQVMNYGPQLVPVMPIQPVPDRTAGVFQRYGDEAPRYRGGTGTYLPNPKMSFRD 1163

Query: 369 XXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQE 524
+D GDR RA RSH RSQAER T D+ AA+++
Sbjct: 1164 RQPTSRNHRGNYGYDRSDHGDREGSWINSKTRAAGRSHVRSQAERPTSWHDQLAASEHHA 1223

Query: 525 EKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRP 683
+++W + +HEP+A++ P+ + ST S++SS N A+ ++P S GVN RP
Sbjct: 1224 DRQWESQRHEPVASYLVPNNSFVSTKSAHSSTNMAYALHPPPVAGSEGVNPARP 1277

>ref|XP_008777040.1| PREDICTED: uncharacterized protein LOC103697050 isoform X2 [Phoenix dactylifera]
Length = 1330
Score = 193 bits (491), Expect = 6e-53
Identities = 103/234 (44%), Positives = 131/234 (55%), Gaps = 11/234 (4%)
Frame = +3

Query: 15 EPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRSIA 191
E +DILN D H +NL YGR CQN+ YH PFMY + P+ L H+PWDG GR +
Sbjct: 1019 EHKSDILNGDFLSHWENLQYGRSCQNAHYHGPFMYQSPVMVPPVYLQGHFPWDGPGRPFS 1078

Query: 192 TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
N N +TQ M + PR+VP+TPLQ GP R++ FQR DE RYRGGTGTYLPNPK+
Sbjct: 1079 ANGNLFTQIMSYGPRLVPVTPLQPGPHRTSGVFQRFGDEVPRYRGGTGTYLPNPKISFRD 1138

Query: 369 XXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQE 524
ND DR RA RS+GR+ AE+ R DR + N
Sbjct: 1139 RQSSTRNHRGNYNYDRNDKADREGSWIYEKSRASGRSYGRTPAEKRGLRSDRSSTTDNHV 1198

Query: 525 EKRWATYKHEPLAAHRGPDRTVPSTNS-SNSSENAFGMYPETAFRSNGVNSTRP 683
++ W ++HEPLA+ +G R+ NS NS A+GMYP S+GV+ T P
Sbjct: 1199 DRSWGPHRHEPLASDQGQSRSFGVANSLPNSPNMAYGMYPVPTVNSSGVSPTGP 1252

>ref|XP_019706661.1| PREDICTED: uncharacterized protein LOC105046626 isoform X2 [Elaeis guineensis]
Length = 1342
Score = 193 bits (491), Expect = 6e-53
Identities = 109/241 (45%), Positives = 135/241 (56%), Gaps = 11/241 (4%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
E PEE +DILN D H QNL Y R CQN+ YH F+Y + PI L H+P DG G
Sbjct: 1024 EPPEEHKSDILNGDFLSHWQNLQYVRSCQNTHYHGYFLYQSPVMVPPIYLQGHFPQDGPG 1083

Query: 180 RSIATNLNY-TQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
R + N N TQ M + PRVVP+TP+Q GP R++ FQ DE RYRGGTGTYLPNPKV
Sbjct: 1084 RPLTANANLLTQIMSYGPRVVPITPMQPGPHRTSGIFQNFGDEIFRYRGGTGTYLPNPKV 1143

Query: 357 PXXXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAA 512
ND DR RA +RSHGR QAE+ + RPDR + A
Sbjct: 1144 SFRDRQSSTKNHRRNCSYDRNDNADREGSWIYAKSRAANRSHGRIQAEKLSLRPDRLSTA 1203

Query: 513 KNQEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRPTN 689
N+ +K W ++HEP A+ + +R+ NSS SS N ++GMYP SNGV+ P
Sbjct: 1204 DNRIDKPWDPHRHEPPASKQAQNRSFGLANSSRSSPNLSYGMYPVPTVNSNGVSPVNPAV 1263

Query: 690 S 692
S
Sbjct: 1264 S 1264

>ref|XP_008777039.1| PREDICTED: uncharacterized protein LOC103697050 isoform X1 [Phoenix dactylifera]
Length = 1349
Score = 193 bits (491), Expect = 6e-53
Identities = 103/234 (44%), Positives = 131/234 (55%), Gaps = 11/234 (4%)
Frame = +3

Query: 15 EPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRSIA 191
E +DILN D H +NL YGR CQN+ YH PFMY + P+ L H+PWDG GR +
Sbjct: 1019 EHKSDILNGDFLSHWENLQYGRSCQNAHYHGPFMYQSPVMVPPVYLQGHFPWDGPGRPFS 1078

Query: 192 TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
N N +TQ M + PR+VP+TPLQ GP R++ FQR DE RYRGGTGTYLPNPK+
Sbjct: 1079 ANGNLFTQIMSYGPRLVPVTPLQPGPHRTSGVFQRFGDEVPRYRGGTGTYLPNPKISFRD 1138

Query: 369 XXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQE 524
ND DR RA RS+GR+ AE+ R DR + N
Sbjct: 1139 RQSSTRNHRGNYNYDRNDKADREGSWIYEKSRASGRSYGRTPAEKRGLRSDRSSTTDNHV 1198

Query: 525 EKRWATYKHEPLAAHRGPDRTVPSTNS-SNSSENAFGMYPETAFRSNGVNSTRP 683
++ W ++HEPLA+ +G R+ NS NS A+GMYP S+GV+ T P
Sbjct: 1199 DRSWGPHRHEPLASDQGQSRSFGVANSLPNSPNMAYGMYPVPTVNSSGVSPTGP 1252

>ref|XP_010923576.1| PREDICTED: uncharacterized protein LOC105046626 isoform X1 [Elaeis guineensis]
Length = 1387
Score = 193 bits (491), Expect = 6e-53
Identities = 109/241 (45%), Positives = 135/241 (56%), Gaps = 11/241 (4%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
E PEE +DILN D H QNL Y R CQN+ YH F+Y + PI L H+P DG G
Sbjct: 1024 EPPEEHKSDILNGDFLSHWQNLQYVRSCQNTHYHGYFLYQSPVMVPPIYLQGHFPQDGPG 1083

Query: 180 RSIATNLNY-TQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
R + N N TQ M + PRVVP+TP+Q GP R++ FQ DE RYRGGTGTYLPNPKV
Sbjct: 1084 RPLTANANLLTQIMSYGPRVVPITPMQPGPHRTSGIFQNFGDEIFRYRGGTGTYLPNPKV 1143

Query: 357 PXXXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAA 512
ND DR RA +RSHGR QAE+ + RPDR + A
Sbjct: 1144 SFRDRQSSTKNHRRNCSYDRNDNADREGSWIYAKSRAANRSHGRIQAEKLSLRPDRLSTA 1203

Query: 513 KNQEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRPTN 689
N+ +K W ++HEP A+ + +R+ NSS SS N ++GMYP SNGV+ P
Sbjct: 1204 DNRIDKPWDPHRHEPPASKQAQNRSFGLANSSRSSPNLSYGMYPVPTVNSNGVSPVNPAV 1263

Query: 690 S 692
S
Sbjct: 1264 S 1264

>ref|XP_017978215.1| PREDICTED: uncharacterized protein LOC18613995 [Theobroma cacao]
Length = 1347
Score = 185 bits (470), Expect = 4e-50
Identities = 101/242 (41%), Positives = 140/242 (57%), Gaps = 16/242 (6%)
Frame = +3

Query: 9 PEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRS 185
P EP DILN D+ H +NL YGR CQNS YH P +Y + P+ L H+PWDG GR
Sbjct: 1029 PSEPKRDILNGDIASHWKNLQYGRICQNSRYHPPLIYPSSVMVPPVCLQGHFPWDGPGRP 1088

Query: 186 IATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
++T++N ++Q M + PRVVP+TP QS +R AS +QR DE RYRGGTGTYLPNPKVP
Sbjct: 1089 LSTDVNLFSQLMNYGPRVVPVTPFQSVSNRPASVYQRYADEMPRYRGGTGTYLPNPKVPM 1148

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQA--A 509
+ GDR RA RSH R+Q E++ F D A A
Sbjct: 1149 RERHSTNTRRGKYNYDRNDHHGDREGSWTANSKSRAAGRSHSRNQNEKSRFTIDHLAAVA 1208

Query: 510 AKNQEEKRWATYKHEPLA---AHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTR 680
+++ E+ W++++H+ +H GP R+ +++ S+S+ +GMYP A +GV+S
Sbjct: 1209 GESRAERPWSSHRHDSFTSYQSHNGPVRS--NSSQSSSASMPYGMYPLPAMNPSGVSSNG 1266

Query: 681 PT 686
PT
Sbjct: 1267 PT 1268

>ref|XP_020104050.1| uncharacterized protein LOC109721046 isoform X1 [Ananas comosus]
ref|XP_020104051.1| uncharacterized protein LOC109721046 isoform X1 [Ananas comosus]
Length = 1340
Score = 182 bits (463), Expect = 3e-49
Identities = 99/237 (41%), Positives = 131/237 (55%), Gaps = 10/237 (4%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
E E DIL SD + H QNL+YGR+CQ++ PF+ P + P+ L H+PWDG GR
Sbjct: 1025 ESSNEHEPDILKSDFDSHLQNLLYGRYCQDTRQGPFICQPPVLVPPVYLQGHFPWDGPGR 1084

Query: 183 SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
++ N N TQ MG+ PR VPM PL+ G +R + FQR+ +EA RYRGGTGTYLPNPKVP
Sbjct: 1085 PVSANANLTQMMGYAPRFVPMVPLKPGSERPSGVFQRHGEEAPRYRGGTGTYLPNPKVPF 1144

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKN 518
D GDR RA R HG Q ER+ F+P+ Q+
Sbjct: 1145 RDRQASTRSYRRNYNSERGDQGDREGSWISAKARAAGRGHGH-QVERSNFQPENQSDRHR 1203

Query: 519 QEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRS-NGVNSTRP 683
Q K ++HEP++++ P + TNS+ S N +GMYP + NGV +T P
Sbjct: 1204 QSYKN-DPHRHEPVSSNLAPGNSFRPTNSNRSPRNLTYGMYPPPPVTNPNGVINTGP 1259

>ref|XP_021281040.1| uncharacterized protein LOC110414270 [Herrania umbratica]
Length = 1347
Score = 181 bits (459), Expect = 1e-48
Identities = 101/242 (41%), Positives = 141/242 (58%), Gaps = 16/242 (6%)
Frame = +3

Query: 9 PEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRS 185
P E ADILN D+ H +NL YGR CQNS Y +P + + P+ L H+PWDG GR
Sbjct: 1029 PSEHKADILNGDIASHWKNLQYGRICQNSRYPSPLICPSPVMVPPVYLQGHFPWDGPGRP 1088

Query: 186 IATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
++TN+N ++Q M + PRVVP+TPLQS +R AS +QR DE RYRGGTGTYLPNPKVP
Sbjct: 1089 LSTNVNLFSQLMNYGPRVVPVTPLQSVSNRPASVYQRYADEMPRYRGGTGTYLPNPKVPM 1148

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQA--A 509
+ GDR RA RSH R+Q E++ F D A A
Sbjct: 1149 RERHSTNTRRGKYNYDRNDHHGDREGNWTANSKSRAAGRSHSRNQNEKSRFTFDHLAAVA 1208

Query: 510 AKNQEEKRWATYKHEPLA---AHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTR 680
+++ E+ W++++H+ +H GP R+ +++ S+S+ +GMYP A ++GV+S
Sbjct: 1209 GESRAERPWSSHRHDSFTSYQSHNGPVRS--NSSQSSSASMPYGMYPLPAMNASGVSSNG 1266

Query: 681 PT 686
PT
Sbjct: 1267 PT 1268

>gb|OAY67602.1| hypothetical protein ACMD2_05229 [Ananas comosus]
Length = 1370
Score = 180 bits (456), Expect = 3e-48
Identities = 97/235 (41%), Positives = 129/235 (54%), Gaps = 10/235 (4%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
E E DIL SD + H QNL+YGR+CQ++ PF+ P + P+ L H+PWDG GR
Sbjct: 1025 ESSNEHEPDILKSDFDSHLQNLLYGRYCQDTRQGPFICQPPVLVPPVYLQGHFPWDGPGR 1084

Query: 183 SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
++ N N TQ MG+ PR VPM PL+ G +R + FQR+ +EA RYRGGTGTYLPNPKVP
Sbjct: 1085 PVSANANLTQMMGYAPRFVPMVPLKPGSERPSGVFQRHGEEAPRYRGGTGTYLPNPKVPF 1144

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKN 518
D GDR RA R HG Q ER+ F+P+ Q+
Sbjct: 1145 RDRQASTRSYRRNYNSERGDQGDREGSWISAKARAAGRGHGH-QVERSNFQPENQSDRHR 1203

Query: 519 QEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRS-NGVNST 677
Q K ++HEP++++ P + TNS+ S N +GMYP + NGV +
Sbjct: 1204 QSYKN-DPHRHEPVSSNLAPGNSFRPTNSNRSPRNLTYGMYPPPPVTNPNGVTGS 1257

>gb|EOX95734.1| Poly(A) RNA polymerase cid14, putative [Theobroma cacao]
Length = 1347
Score = 179 bits (454), Expect = 6e-48
Identities = 99/242 (40%), Positives = 138/242 (57%), Gaps = 16/242 (6%)
Frame = +3

Query: 9 PEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRS 185
P E DILN D+ H +NL YGR CQNS Y P +Y + P+ L H+PWDG GR
Sbjct: 1029 PSESKRDILNGDIASHWKNLQYGRICQNSRYRPPLIYPSSVMVPPVCLQGHFPWDGPGRP 1088

Query: 186 IATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
++T++N ++Q M + PRVVP+TP QS +R AS +QR DE RYRGGTGTYLPNPKVP
Sbjct: 1089 LSTDVNLFSQLMNYGPRVVPVTPFQSVSNRPASVYQRYADEMPRYRGGTGTYLPNPKVPM 1148

Query: 363 XXXXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQA--A 509
+ GDR RA RSH R+Q E++ F D A A
Sbjct: 1149 RERHSTNTRRGKYNYDRNDHHGDREGNWTANSKSRAAGRSHSRNQNEKSRFTIDHLAAVA 1208

Query: 510 AKNQEEKRWATYKHEPLA---AHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTR 680
+++ E+ W++++H+ +H GP R+ +++ S+S+ +GMYP A +GV+S
Sbjct: 1209 GESRAERPWSSHRHDSFTSYQSHNGPVRS--NSSQSSSASMPYGMYPLPAMNPSGVSSNG 1266

Query: 681 PT 686
PT
Sbjct: 1267 PT 1268

>ref|XP_019710291.1| PREDICTED: uncharacterized protein LOC105057942 isoform X2 [Elaeis guineensis]
Length = 1375
Score = 179 bits (453), Expect = 8e-48
Identities = 102/235 (43%), Positives = 130/235 (55%), Gaps = 12/235 (5%)
Frame = +3

Query: 15 EPNADILNSDLNGHRQNLVYGRFCQNSYH-APFMYAPRAVESPINLSAHYPWDGSGRSIA 191
E +DILN D H QNL YGR CQN++H PFMY + P+ L H+ DG GR A
Sbjct: 1021 EHKSDILNGDFLSHWQNLQYGRSCQNAHHHGPFMYQSPVMVPPVYLQGHFSCDGPGRPHA 1080

Query: 192 TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEA-QRYRGGTGTYLPNPKVPXX 365
N N +TQ M + P++VP+TPLQ GP R + FQ DE RYRGGTGTYLPNPK+
Sbjct: 1081 ANGNLFTQIMSYGPQLVPVTPLQPGPHRISGVFQHFGDEVLPRYRGGTGTYLPNPKISFR 1140

Query: 366 XXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
ND DR RA RSHGR+ AE+ + RPDR + NQ
Sbjct: 1141 DRQSSTRNHRGNYSYDRNDHADREGSWINAKSRASGRSHGRTPAEKPSLRPDRLSTTDNQ 1200

Query: 522 EEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRP 683
++ W +HE A+ +G +R+ NSS +S N A+GMYP + NGV+ T P
Sbjct: 1201 VDRPWGPRRHETPASDQGQNRSFGFANSSRNSPNMAYGMYPVSTVSPNGVSPTGP 1255

>ref|XP_010938971.1| PREDICTED: uncharacterized protein LOC105057942 isoform X1 [Elaeis guineensis]
Length = 1380
Score = 179 bits (453), Expect = 8e-48
Identities = 102/235 (43%), Positives = 130/235 (55%), Gaps = 12/235 (5%)
Frame = +3

Query: 15 EPNADILNSDLNGHRQNLVYGRFCQNSYH-APFMYAPRAVESPINLSAHYPWDGSGRSIA 191
E +DILN D H QNL YGR CQN++H PFMY + P+ L H+ DG GR A
Sbjct: 1021 EHKSDILNGDFLSHWQNLQYGRSCQNAHHHGPFMYQSPVMVPPVYLQGHFSCDGPGRPHA 1080

Query: 192 TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEA-QRYRGGTGTYLPNPKVPXX 365
N N +TQ M + P++VP+TPLQ GP R + FQ DE RYRGGTGTYLPNPK+
Sbjct: 1081 ANGNLFTQIMSYGPQLVPVTPLQPGPHRISGVFQHFGDEVLPRYRGGTGTYLPNPKISFR 1140

Query: 366 XXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
ND DR RA RSHGR+ AE+ + RPDR + NQ
Sbjct: 1141 DRQSSTRNHRGNYSYDRNDHADREGSWINAKSRASGRSHGRTPAEKPSLRPDRLSTTDNQ 1200

Query: 522 EEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRP 683
++ W +HE A+ +G +R+ NSS +S N A+GMYP + NGV+ T P
Sbjct: 1201 VDRPWGPRRHETPASDQGQNRSFGFANSSRNSPNMAYGMYPVSTVSPNGVSPTGP 1255

>ref|XP_010261538.1| PREDICTED: uncharacterized protein LOC104600345 isoform X2 [Nelumbo nucifera]
Length = 1367
Score = 177 bits (450), Expect = 2e-47
Identities = 99/239 (41%), Positives = 132/239 (55%), Gaps = 14/239 (5%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
E EE +DILNSD H QNL YGRFCQN Y P Y + P+ L H+PWDG G
Sbjct: 1048 EPTEEHKSDILNSDFASHWQNLQYGRFCQNPRYPGPLFYPSPVMVPPVYLQGHFPWDGPG 1107

Query: 180 RSIATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
R ++ N N +TQ + + PR+ P+ PLQ G +R A+QR DEA RYRGGTGTYLPNPKV
Sbjct: 1108 RPLSANGNLFTQLVNYGPRLFPVAPLQPGSNRPGGAYQRYGDEAPRYRGGTGTYLPNPKV 1167

Query: 357 PXXXXXXXXXXXXXXXXXXWND-PGDRF---------RAVDRSHGRSQAERATFRPDRQA 506
ND GDR RA R+HGR+Q E+ + +PD+ A
Sbjct: 1168 SFRDRQASTARNHRGNNYDRNDHHGDREGTWNTNSKPRAAGRNHGRNQVEKLSSKPDQLA 1227

Query: 507 AAKNQEEKRWATYKHEPLAAHRGPDRTVPSTNS--SNSSENAFGMYPETAFRSNGVNST 677
A N+ ++ W +Y+H +++ + ++NS S+S+ A+GMYP SNG T
Sbjct: 1228 ANDNRADRPWGSYRHNSFPSYQSQNGPFSASNSMHSSSANLAYGMYPLPPINSNGNTPT 1286

>ref|XP_010261537.1| PREDICTED: uncharacterized protein LOC104600345 isoform X1 [Nelumbo nucifera]
Length = 1413
Score = 177 bits (450), Expect = 2e-47
Identities = 99/239 (41%), Positives = 132/239 (55%), Gaps = 14/239 (5%)
Frame = +3

Query: 3 ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
E EE +DILNSD H QNL YGRFCQN Y P Y + P+ L H+PWDG G
Sbjct: 1048 EPTEEHKSDILNSDFASHWQNLQYGRFCQNPRYPGPLFYPSPVMVPPVYLQGHFPWDGPG 1107

Query: 180 RSIATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
R ++ N N +TQ + + PR+ P+ PLQ G +R A+QR DEA RYRGGTGTYLPNPKV
Sbjct: 1108 RPLSANGNLFTQLVNYGPRLFPVAPLQPGSNRPGGAYQRYGDEAPRYRGGTGTYLPNPKV 1167

Query: 357 PXXXXXXXXXXXXXXXXXXWND-PGDRF---------RAVDRSHGRSQAERATFRPDRQA 506
ND GDR RA R+HGR+Q E+ + +PD+ A
Sbjct: 1168 SFRDRQASTARNHRGNNYDRNDHHGDREGTWNTNSKPRAAGRNHGRNQVEKLSSKPDQLA 1227

Query: 507 AAKNQEEKRWATYKHEPLAAHRGPDRTVPSTNS--SNSSENAFGMYPETAFRSNGVNST 677
A N+ ++ W +Y+H +++ + ++NS S+S+ A+GMYP SNG T
Sbjct: 1228 ANDNRADRPWGSYRHNSFPSYQSQNGPFSASNSMHSSSANLAYGMYPLPPINSNGNTPT 1286

>gb|KHN06300.1| Poly(A) RNA polymerase cid14 [Glycine soja]
Length = 1496
Score = 177 bits (450), Expect = 2e-47
Identities = 99/236 (41%), Positives = 132/236 (55%), Gaps = 12/236 (5%)
Frame = +3

Query: 15 EPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVE-SPINLSAHYPWDGSGRSIA 191
E DILNSD H QNL YGRFCQNS H P M P V P+ L YPWDG GR I+
Sbjct: 1119 EHRPDILNSDFVSHWQNLQYGRFCQNSRHPPSMTYPSPVMVPPVYLQGRYPWDGPGRPIS 1178

Query: 192 TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
N+N ++Q M + PR+VP+ PLQS +R AS +QR VD+ RYR GTGTYLPNPKV
Sbjct: 1179 GNMNIFSQLMSYGPRLVPVAPLQSVSNRPASIYQRYVDDMPRYRSGTGTYLPNPKVSARD 1238

Query: 369 XXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
+ GDR R R H R+Q E+ + +R A ++++
Sbjct: 1239 RHSTNTRRGNYNYDRSDHHGDREGNWNTNSKLRGTGRGHNRNQTEKPNSKMERSATSESR 1298

Query: 522 EEKRWATYKHEPLAAHR-GPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
E+ W +++H+ H+ GP R+ +++ SN S A+GMYP A +GV+S PT
Sbjct: 1299 AERPWGSHRHDTFIPHQNGPVRS--NSSQSNPSNVAYGMYPMPAMNPSGVSSNGPT 1352

>gb|PKA49759.1| hypothetical protein AXF42_Ash004300 [Apostasia shenzhenica]
Length = 1363
Score = 177 bits (449), Expect = 3e-47
Identities = 99/230 (43%), Positives = 127/230 (55%), Gaps = 11/230 (4%)
Frame = +3

Query: 12 EEPNADILNSDLNGHRQNLVYGRFCQN-SYHAPFMYAPRAVESPINLSAHYPWDGSGRSI 188
EE +DILNSD H QNL YGR CQN H F+Y+P + P+ L H+P DG GR +
Sbjct: 1051 EEHKSDILNSDFASHWQNLQYGRLCQNIPNHGQFIYSPPVMVPPVYLQGHFPLDGPGRPL 1110

Query: 189 ATNLNY-TQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXX 365
A NLN+ TQ MG+ PR+VP+ PLQ GP R ++ F R DE RYR GTGTYLPNPK
Sbjct: 1111 APNLNFFTQMMGYGPRLVPVAPLQPGPSRPSNVFHRYGDEPPRYRAGTGTYLPNPKATFR 1170

Query: 366 XXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
DR R+ RSHGR+ E+ + RPDR AA+ +
Sbjct: 1171 DRHYSSTRNHRGTYNYDRGDADREGSWVNSKNRSGGRSHGRNHNEKLSLRPDRLAASDVR 1230

Query: 522 EEKRWATYKHEPLAAHRGPDRTV-PSTNSSNSSENAFGMYPETAFRSNGV 668
E+ W +Y+HE H+ + + ST S NS+ A GMYP + SNG+
Sbjct: 1231 NERVWESYRHEQAVPHQVQNSSFGSSTTSLNSANVAHGMYPVSGASSNGL 1280

>ref|XP_006583248.1| PREDICTED: uncharacterized protein LOC100809742 isoform X3 [Glycine max]
gb|KRH47923.1| hypothetical protein GLYMA_07G056700 [Glycine max]
Length = 1329
Score = 177 bits (448), Expect = 4e-47
Identities = 99/236 (41%), Positives = 132/236 (55%), Gaps = 12/236 (5%)
Frame = +3

Query: 15 EPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVE-SPINLSAHYPWDGSGRSIA 191
E DILNSD H QNL YGRFCQNS H P M P V P+ L YPWDG GR I+
Sbjct: 1017 EHRPDILNSDFVSHWQNLQYGRFCQNSRHPPSMTYPSPVMVPPVYLQGRYPWDGPGRPIS 1076

Query: 192 TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
N+N ++Q M + PR+VP+ PLQS +R AS +QR VD+ RYR GTGTYLPNPKV
Sbjct: 1077 GNMNIFSQLMSYGPRLVPVAPLQSVSNRPASIYQRYVDDMPRYRSGTGTYLPNPKVSARD 1136

Query: 369 XXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
+ GDR R R H R+Q E+ + +R A ++++
Sbjct: 1137 RHSTNTRRGNYPYDRSDHHGDREGNWNTNSKLRGTGRGHNRNQTEKPNSKMERLATSESR 1196

Query: 522 EEKRWATYKHEPLAAHR-GPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
E+ W +++H+ H+ GP R+ +++ SN S A+GMYP A +GV+S PT
Sbjct: 1197 AERPWGSHRHDTFIPHQNGPVRS--NSSQSNPSNVAYGMYPMPAMNPSGVSSNGPT 1250