BLASTX nr result

ID: Cheilocostus21_contig00026303 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00026303
         (692 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009404763.1| PREDICTED: uncharacterized protein LOC103988...   249   2e-72
ref|XP_018679559.1| PREDICTED: uncharacterized protein LOC103979...   209   1e-58
ref|XP_009393858.1| PREDICTED: uncharacterized protein LOC103979...   208   5e-58
ref|XP_010936375.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   196   7e-54
ref|XP_008777040.1| PREDICTED: uncharacterized protein LOC103697...   193   6e-53
ref|XP_019706661.1| PREDICTED: uncharacterized protein LOC105046...   193   6e-53
ref|XP_008777039.1| PREDICTED: uncharacterized protein LOC103697...   193   6e-53
ref|XP_010923576.1| PREDICTED: uncharacterized protein LOC105046...   193   6e-53
ref|XP_017978215.1| PREDICTED: uncharacterized protein LOC186139...   185   4e-50
ref|XP_020104050.1| uncharacterized protein LOC109721046 isoform...   182   3e-49
ref|XP_021281040.1| uncharacterized protein LOC110414270 [Herran...   181   1e-48
gb|OAY67602.1| hypothetical protein ACMD2_05229 [Ananas comosus]      180   3e-48
gb|EOX95734.1| Poly(A) RNA polymerase cid14, putative [Theobroma...   179   6e-48
ref|XP_019710291.1| PREDICTED: uncharacterized protein LOC105057...   179   8e-48
ref|XP_010938971.1| PREDICTED: uncharacterized protein LOC105057...   179   8e-48
ref|XP_010261538.1| PREDICTED: uncharacterized protein LOC104600...   177   2e-47
ref|XP_010261537.1| PREDICTED: uncharacterized protein LOC104600...   177   2e-47
gb|KHN06300.1| Poly(A) RNA polymerase cid14 [Glycine soja]            177   2e-47
gb|PKA49759.1| hypothetical protein AXF42_Ash004300 [Apostasia s...   177   3e-47
ref|XP_006583248.1| PREDICTED: uncharacterized protein LOC100809...   177   4e-47

>ref|XP_009404763.1| PREDICTED: uncharacterized protein LOC103988002 [Musa acuminata
            subsp. malaccensis]
          Length = 1351

 Score =  249 bits (636), Expect = 2e-72
 Identities = 125/237 (52%), Positives = 155/237 (65%), Gaps = 8/237 (3%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
            E  EEP++DILNSDLN H QNL+YGR+CQNS H PF+Y+      PI L +H+ WDG GR
Sbjct: 1039 ETSEEPSSDILNSDLNDHWQNLIYGRYCQNSNHGPFVYSSPVAGQPIYLPSHHLWDGPGR 1098

Query: 183  SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
             +A NLNY   MGH PR+VPM PLQ+GPD++   FQR+VD A RYRGGTGTYLPNPKV  
Sbjct: 1099 PLAANLNYIPIMGHGPRLVPMMPLQTGPDQAPGVFQRHVDGAPRYRGGTGTYLPNPKVSF 1158

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKN 518
                              NDP DR         RA DRS+GR+QAER++  PD  AA++N
Sbjct: 1159 RDRQSSLRSHRGNRNYDHNDPVDREGRWVHSKSRAFDRSYGRNQAERSSLPPDHLAASRN 1218

Query: 519  QEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPTN 689
            Q+ K+W +Y HEPLA+++G   T+   NSS++ ENA GMYP+ A  SNGV     TN
Sbjct: 1219 QDVKKWVSYGHEPLASYQGSFATI---NSSHNLENALGMYPQNAVGSNGVTPPDSTN 1272


>ref|XP_018679559.1| PREDICTED: uncharacterized protein LOC103979435 isoform X1 [Musa
            acuminata subsp. malaccensis]
 ref|XP_018679560.1| PREDICTED: uncharacterized protein LOC103979435 isoform X1 [Musa
            acuminata subsp. malaccensis]
          Length = 1338

 Score =  209 bits (533), Expect = 1e-58
 Identities = 113/235 (48%), Positives = 144/235 (61%), Gaps = 7/235 (2%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
            E  EE ++DI N D   H QNLVYGR CQNSY  P MY       PI L +HYPWDG G+
Sbjct: 1039 ESSEELSSDIFNVDSVSHWQNLVYGRSCQNSYPGPVMYTSPVARPPIYLPSHYPWDGPGK 1098

Query: 183  SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVP- 359
             ++++L+YTQ  GHNP +VP+ P Q G DR++  FQ   DEA  YRGGTGTYLPN KV  
Sbjct: 1099 LLSSDLHYTQMTGHNPLLVPIMPFQPGSDRASGVFQDYADEAPTYRGGTGTYLPNSKVSF 1158

Query: 360  XXXXXXXXXXXXXXXXXXWNDPGDRF------RAVDRSHGRSQAERATFRPDRQAAAKNQ 521
                               +DP DR       RA    HGR++AER   RPD+ AA+KNQ
Sbjct: 1159 GDHRQSGSRNHEGNCNYDKDDPVDRSWVSSKPRAFGHGHGRNRAERPRLRPDQLAASKNQ 1218

Query: 522  EEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
             EK+W +++ EP+A++R   R+  STNSS  SE+A GM+P+TA  S GVN++ PT
Sbjct: 1219 -EKKWESWRSEPVASNRRRGRSFASTNSSYISESAPGMHPQTASSSEGVNASDPT 1272


>ref|XP_009393858.1| PREDICTED: uncharacterized protein LOC103979435 isoform X2 [Musa
            acuminata subsp. malaccensis]
          Length = 1337

 Score =  208 bits (529), Expect = 5e-58
 Identities = 111/234 (47%), Positives = 142/234 (60%), Gaps = 6/234 (2%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
            E  EE ++DI N D   H QNLVYGR CQNSY  P MY       PI L +HYPWDG G+
Sbjct: 1039 ESSEELSSDIFNVDSVSHWQNLVYGRSCQNSYPGPVMYTSPVARPPIYLPSHYPWDGPGK 1098

Query: 183  SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
             ++++L+YTQ  GHNP +VP+ P Q G DR++  FQ   DEA  YRGGTGTYLPN     
Sbjct: 1099 LLSSDLHYTQMTGHNPLLVPIMPFQPGSDRASGVFQDYADEAPTYRGGTGTYLPNSVSFG 1158

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDRF------RAVDRSHGRSQAERATFRPDRQAAAKNQE 524
                              +DP DR       RA    HGR++AER   RPD+ AA+KNQ 
Sbjct: 1159 DHRQSGSRNHEGNCNYDKDDPVDRSWVSSKPRAFGHGHGRNRAERPRLRPDQLAASKNQ- 1217

Query: 525  EKRWATYKHEPLAAHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
            EK+W +++ EP+A++R   R+  STNSS  SE+A GM+P+TA  S GVN++ PT
Sbjct: 1218 EKKWESWRSEPVASNRRRGRSFASTNSSYISESAPGMHPQTASSSEGVNASDPT 1271


>ref|XP_010936375.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105056012
            [Elaeis guineensis]
          Length = 1356

 Score =  196 bits (498), Expect = 7e-54
 Identities = 104/234 (44%), Positives = 141/234 (60%), Gaps = 9/234 (3%)
 Frame = +3

Query: 9    PEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGRSI 188
            P E  +DILNSD   H QNL+YGRFCQ+ +H P +Y   AV  P  L  H+P DG GR +
Sbjct: 1045 PLEEKSDILNSDFASHLQNLLYGRFCQD-FHGPIIYPSPAVVPPSYLQGHFPLDGPGRPL 1103

Query: 189  ATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
            + N+N+TQ M + P++VP+ P+Q  PDR+A  FQR  DEA RYRGGTGTYLPNPK+    
Sbjct: 1104 SANVNFTQVMNYGPQLVPVMPIQPVPDRTAGVFQRYGDEAPRYRGGTGTYLPNPKMSFRD 1163

Query: 369  XXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQE 524
                            +D GDR         RA  RSH RSQAER T   D+ AA+++  
Sbjct: 1164 RQPTSRNHRGNYGYDRSDHGDREGSWINSKTRAAGRSHVRSQAERPTSWHDQLAASEHHA 1223

Query: 525  EKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRP 683
            +++W + +HEP+A++  P+ +  ST S++SS N A+ ++P     S GVN  RP
Sbjct: 1224 DRQWESQRHEPVASYLVPNNSFVSTKSAHSSTNMAYALHPPPVAGSEGVNPARP 1277


>ref|XP_008777040.1| PREDICTED: uncharacterized protein LOC103697050 isoform X2 [Phoenix
            dactylifera]
          Length = 1330

 Score =  193 bits (491), Expect = 6e-53
 Identities = 103/234 (44%), Positives = 131/234 (55%), Gaps = 11/234 (4%)
 Frame = +3

Query: 15   EPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRSIA 191
            E  +DILN D   H +NL YGR CQN+ YH PFMY    +  P+ L  H+PWDG GR  +
Sbjct: 1019 EHKSDILNGDFLSHWENLQYGRSCQNAHYHGPFMYQSPVMVPPVYLQGHFPWDGPGRPFS 1078

Query: 192  TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
             N N +TQ M + PR+VP+TPLQ GP R++  FQR  DE  RYRGGTGTYLPNPK+    
Sbjct: 1079 ANGNLFTQIMSYGPRLVPVTPLQPGPHRTSGVFQRFGDEVPRYRGGTGTYLPNPKISFRD 1138

Query: 369  XXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQE 524
                            ND  DR         RA  RS+GR+ AE+   R DR +   N  
Sbjct: 1139 RQSSTRNHRGNYNYDRNDKADREGSWIYEKSRASGRSYGRTPAEKRGLRSDRSSTTDNHV 1198

Query: 525  EKRWATYKHEPLAAHRGPDRTVPSTNS-SNSSENAFGMYPETAFRSNGVNSTRP 683
            ++ W  ++HEPLA+ +G  R+    NS  NS   A+GMYP     S+GV+ T P
Sbjct: 1199 DRSWGPHRHEPLASDQGQSRSFGVANSLPNSPNMAYGMYPVPTVNSSGVSPTGP 1252


>ref|XP_019706661.1| PREDICTED: uncharacterized protein LOC105046626 isoform X2 [Elaeis
            guineensis]
          Length = 1342

 Score =  193 bits (491), Expect = 6e-53
 Identities = 109/241 (45%), Positives = 135/241 (56%), Gaps = 11/241 (4%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
            E PEE  +DILN D   H QNL Y R CQN+ YH  F+Y    +  PI L  H+P DG G
Sbjct: 1024 EPPEEHKSDILNGDFLSHWQNLQYVRSCQNTHYHGYFLYQSPVMVPPIYLQGHFPQDGPG 1083

Query: 180  RSIATNLNY-TQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
            R +  N N  TQ M + PRVVP+TP+Q GP R++  FQ   DE  RYRGGTGTYLPNPKV
Sbjct: 1084 RPLTANANLLTQIMSYGPRVVPITPMQPGPHRTSGIFQNFGDEIFRYRGGTGTYLPNPKV 1143

Query: 357  PXXXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAA 512
                                ND  DR         RA +RSHGR QAE+ + RPDR + A
Sbjct: 1144 SFRDRQSSTKNHRRNCSYDRNDNADREGSWIYAKSRAANRSHGRIQAEKLSLRPDRLSTA 1203

Query: 513  KNQEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRPTN 689
             N+ +K W  ++HEP A+ +  +R+    NSS SS N ++GMYP     SNGV+   P  
Sbjct: 1204 DNRIDKPWDPHRHEPPASKQAQNRSFGLANSSRSSPNLSYGMYPVPTVNSNGVSPVNPAV 1263

Query: 690  S 692
            S
Sbjct: 1264 S 1264


>ref|XP_008777039.1| PREDICTED: uncharacterized protein LOC103697050 isoform X1 [Phoenix
            dactylifera]
          Length = 1349

 Score =  193 bits (491), Expect = 6e-53
 Identities = 103/234 (44%), Positives = 131/234 (55%), Gaps = 11/234 (4%)
 Frame = +3

Query: 15   EPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRSIA 191
            E  +DILN D   H +NL YGR CQN+ YH PFMY    +  P+ L  H+PWDG GR  +
Sbjct: 1019 EHKSDILNGDFLSHWENLQYGRSCQNAHYHGPFMYQSPVMVPPVYLQGHFPWDGPGRPFS 1078

Query: 192  TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
             N N +TQ M + PR+VP+TPLQ GP R++  FQR  DE  RYRGGTGTYLPNPK+    
Sbjct: 1079 ANGNLFTQIMSYGPRLVPVTPLQPGPHRTSGVFQRFGDEVPRYRGGTGTYLPNPKISFRD 1138

Query: 369  XXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQE 524
                            ND  DR         RA  RS+GR+ AE+   R DR +   N  
Sbjct: 1139 RQSSTRNHRGNYNYDRNDKADREGSWIYEKSRASGRSYGRTPAEKRGLRSDRSSTTDNHV 1198

Query: 525  EKRWATYKHEPLAAHRGPDRTVPSTNS-SNSSENAFGMYPETAFRSNGVNSTRP 683
            ++ W  ++HEPLA+ +G  R+    NS  NS   A+GMYP     S+GV+ T P
Sbjct: 1199 DRSWGPHRHEPLASDQGQSRSFGVANSLPNSPNMAYGMYPVPTVNSSGVSPTGP 1252


>ref|XP_010923576.1| PREDICTED: uncharacterized protein LOC105046626 isoform X1 [Elaeis
            guineensis]
          Length = 1387

 Score =  193 bits (491), Expect = 6e-53
 Identities = 109/241 (45%), Positives = 135/241 (56%), Gaps = 11/241 (4%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
            E PEE  +DILN D   H QNL Y R CQN+ YH  F+Y    +  PI L  H+P DG G
Sbjct: 1024 EPPEEHKSDILNGDFLSHWQNLQYVRSCQNTHYHGYFLYQSPVMVPPIYLQGHFPQDGPG 1083

Query: 180  RSIATNLNY-TQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
            R +  N N  TQ M + PRVVP+TP+Q GP R++  FQ   DE  RYRGGTGTYLPNPKV
Sbjct: 1084 RPLTANANLLTQIMSYGPRVVPITPMQPGPHRTSGIFQNFGDEIFRYRGGTGTYLPNPKV 1143

Query: 357  PXXXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAA 512
                                ND  DR         RA +RSHGR QAE+ + RPDR + A
Sbjct: 1144 SFRDRQSSTKNHRRNCSYDRNDNADREGSWIYAKSRAANRSHGRIQAEKLSLRPDRLSTA 1203

Query: 513  KNQEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRPTN 689
             N+ +K W  ++HEP A+ +  +R+    NSS SS N ++GMYP     SNGV+   P  
Sbjct: 1204 DNRIDKPWDPHRHEPPASKQAQNRSFGLANSSRSSPNLSYGMYPVPTVNSNGVSPVNPAV 1263

Query: 690  S 692
            S
Sbjct: 1264 S 1264


>ref|XP_017978215.1| PREDICTED: uncharacterized protein LOC18613995 [Theobroma cacao]
          Length = 1347

 Score =  185 bits (470), Expect = 4e-50
 Identities = 101/242 (41%), Positives = 140/242 (57%), Gaps = 16/242 (6%)
 Frame = +3

Query: 9    PEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRS 185
            P EP  DILN D+  H +NL YGR CQNS YH P +Y    +  P+ L  H+PWDG GR 
Sbjct: 1029 PSEPKRDILNGDIASHWKNLQYGRICQNSRYHPPLIYPSSVMVPPVCLQGHFPWDGPGRP 1088

Query: 186  IATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
            ++T++N ++Q M + PRVVP+TP QS  +R AS +QR  DE  RYRGGTGTYLPNPKVP 
Sbjct: 1089 LSTDVNLFSQLMNYGPRVVPVTPFQSVSNRPASVYQRYADEMPRYRGGTGTYLPNPKVPM 1148

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQA--A 509
                              +  GDR          RA  RSH R+Q E++ F  D  A  A
Sbjct: 1149 RERHSTNTRRGKYNYDRNDHHGDREGSWTANSKSRAAGRSHSRNQNEKSRFTIDHLAAVA 1208

Query: 510  AKNQEEKRWATYKHEPLA---AHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTR 680
             +++ E+ W++++H+      +H GP R+  +++ S+S+   +GMYP  A   +GV+S  
Sbjct: 1209 GESRAERPWSSHRHDSFTSYQSHNGPVRS--NSSQSSSASMPYGMYPLPAMNPSGVSSNG 1266

Query: 681  PT 686
            PT
Sbjct: 1267 PT 1268


>ref|XP_020104050.1| uncharacterized protein LOC109721046 isoform X1 [Ananas comosus]
 ref|XP_020104051.1| uncharacterized protein LOC109721046 isoform X1 [Ananas comosus]
          Length = 1340

 Score =  182 bits (463), Expect = 3e-49
 Identities = 99/237 (41%), Positives = 131/237 (55%), Gaps = 10/237 (4%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
            E   E   DIL SD + H QNL+YGR+CQ++   PF+  P  +  P+ L  H+PWDG GR
Sbjct: 1025 ESSNEHEPDILKSDFDSHLQNLLYGRYCQDTRQGPFICQPPVLVPPVYLQGHFPWDGPGR 1084

Query: 183  SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
             ++ N N TQ MG+ PR VPM PL+ G +R +  FQR+ +EA RYRGGTGTYLPNPKVP 
Sbjct: 1085 PVSANANLTQMMGYAPRFVPMVPLKPGSERPSGVFQRHGEEAPRYRGGTGTYLPNPKVPF 1144

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKN 518
                               D GDR         RA  R HG  Q ER+ F+P+ Q+    
Sbjct: 1145 RDRQASTRSYRRNYNSERGDQGDREGSWISAKARAAGRGHGH-QVERSNFQPENQSDRHR 1203

Query: 519  QEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRS-NGVNSTRP 683
            Q  K    ++HEP++++  P  +   TNS+ S  N  +GMYP     + NGV +T P
Sbjct: 1204 QSYKN-DPHRHEPVSSNLAPGNSFRPTNSNRSPRNLTYGMYPPPPVTNPNGVINTGP 1259


>ref|XP_021281040.1| uncharacterized protein LOC110414270 [Herrania umbratica]
          Length = 1347

 Score =  181 bits (459), Expect = 1e-48
 Identities = 101/242 (41%), Positives = 141/242 (58%), Gaps = 16/242 (6%)
 Frame = +3

Query: 9    PEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRS 185
            P E  ADILN D+  H +NL YGR CQNS Y +P +     +  P+ L  H+PWDG GR 
Sbjct: 1029 PSEHKADILNGDIASHWKNLQYGRICQNSRYPSPLICPSPVMVPPVYLQGHFPWDGPGRP 1088

Query: 186  IATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
            ++TN+N ++Q M + PRVVP+TPLQS  +R AS +QR  DE  RYRGGTGTYLPNPKVP 
Sbjct: 1089 LSTNVNLFSQLMNYGPRVVPVTPLQSVSNRPASVYQRYADEMPRYRGGTGTYLPNPKVPM 1148

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQA--A 509
                              +  GDR          RA  RSH R+Q E++ F  D  A  A
Sbjct: 1149 RERHSTNTRRGKYNYDRNDHHGDREGNWTANSKSRAAGRSHSRNQNEKSRFTFDHLAAVA 1208

Query: 510  AKNQEEKRWATYKHEPLA---AHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTR 680
             +++ E+ W++++H+      +H GP R+  +++ S+S+   +GMYP  A  ++GV+S  
Sbjct: 1209 GESRAERPWSSHRHDSFTSYQSHNGPVRS--NSSQSSSASMPYGMYPLPAMNASGVSSNG 1266

Query: 681  PT 686
            PT
Sbjct: 1267 PT 1268


>gb|OAY67602.1| hypothetical protein ACMD2_05229 [Ananas comosus]
          Length = 1370

 Score =  180 bits (456), Expect = 3e-48
 Identities = 97/235 (41%), Positives = 129/235 (54%), Gaps = 10/235 (4%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVESPINLSAHYPWDGSGR 182
            E   E   DIL SD + H QNL+YGR+CQ++   PF+  P  +  P+ L  H+PWDG GR
Sbjct: 1025 ESSNEHEPDILKSDFDSHLQNLLYGRYCQDTRQGPFICQPPVLVPPVYLQGHFPWDGPGR 1084

Query: 183  SIATNLNYTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
             ++ N N TQ MG+ PR VPM PL+ G +R +  FQR+ +EA RYRGGTGTYLPNPKVP 
Sbjct: 1085 PVSANANLTQMMGYAPRFVPMVPLKPGSERPSGVFQRHGEEAPRYRGGTGTYLPNPKVPF 1144

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKN 518
                               D GDR         RA  R HG  Q ER+ F+P+ Q+    
Sbjct: 1145 RDRQASTRSYRRNYNSERGDQGDREGSWISAKARAAGRGHGH-QVERSNFQPENQSDRHR 1203

Query: 519  QEEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRS-NGVNST 677
            Q  K    ++HEP++++  P  +   TNS+ S  N  +GMYP     + NGV  +
Sbjct: 1204 QSYKN-DPHRHEPVSSNLAPGNSFRPTNSNRSPRNLTYGMYPPPPVTNPNGVTGS 1257


>gb|EOX95734.1| Poly(A) RNA polymerase cid14, putative [Theobroma cacao]
          Length = 1347

 Score =  179 bits (454), Expect = 6e-48
 Identities = 99/242 (40%), Positives = 138/242 (57%), Gaps = 16/242 (6%)
 Frame = +3

Query: 9    PEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSGRS 185
            P E   DILN D+  H +NL YGR CQNS Y  P +Y    +  P+ L  H+PWDG GR 
Sbjct: 1029 PSESKRDILNGDIASHWKNLQYGRICQNSRYRPPLIYPSSVMVPPVCLQGHFPWDGPGRP 1088

Query: 186  IATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPX 362
            ++T++N ++Q M + PRVVP+TP QS  +R AS +QR  DE  RYRGGTGTYLPNPKVP 
Sbjct: 1089 LSTDVNLFSQLMNYGPRVVPVTPFQSVSNRPASVYQRYADEMPRYRGGTGTYLPNPKVPM 1148

Query: 363  XXXXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQA--A 509
                              +  GDR          RA  RSH R+Q E++ F  D  A  A
Sbjct: 1149 RERHSTNTRRGKYNYDRNDHHGDREGNWTANSKSRAAGRSHSRNQNEKSRFTIDHLAAVA 1208

Query: 510  AKNQEEKRWATYKHEPLA---AHRGPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTR 680
             +++ E+ W++++H+      +H GP R+  +++ S+S+   +GMYP  A   +GV+S  
Sbjct: 1209 GESRAERPWSSHRHDSFTSYQSHNGPVRS--NSSQSSSASMPYGMYPLPAMNPSGVSSNG 1266

Query: 681  PT 686
            PT
Sbjct: 1267 PT 1268


>ref|XP_019710291.1| PREDICTED: uncharacterized protein LOC105057942 isoform X2 [Elaeis
            guineensis]
          Length = 1375

 Score =  179 bits (453), Expect = 8e-48
 Identities = 102/235 (43%), Positives = 130/235 (55%), Gaps = 12/235 (5%)
 Frame = +3

Query: 15   EPNADILNSDLNGHRQNLVYGRFCQNSYH-APFMYAPRAVESPINLSAHYPWDGSGRSIA 191
            E  +DILN D   H QNL YGR CQN++H  PFMY    +  P+ L  H+  DG GR  A
Sbjct: 1021 EHKSDILNGDFLSHWQNLQYGRSCQNAHHHGPFMYQSPVMVPPVYLQGHFSCDGPGRPHA 1080

Query: 192  TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEA-QRYRGGTGTYLPNPKVPXX 365
             N N +TQ M + P++VP+TPLQ GP R +  FQ   DE   RYRGGTGTYLPNPK+   
Sbjct: 1081 ANGNLFTQIMSYGPQLVPVTPLQPGPHRISGVFQHFGDEVLPRYRGGTGTYLPNPKISFR 1140

Query: 366  XXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
                             ND  DR         RA  RSHGR+ AE+ + RPDR +   NQ
Sbjct: 1141 DRQSSTRNHRGNYSYDRNDHADREGSWINAKSRASGRSHGRTPAEKPSLRPDRLSTTDNQ 1200

Query: 522  EEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRP 683
             ++ W   +HE  A+ +G +R+    NSS +S N A+GMYP +    NGV+ T P
Sbjct: 1201 VDRPWGPRRHETPASDQGQNRSFGFANSSRNSPNMAYGMYPVSTVSPNGVSPTGP 1255


>ref|XP_010938971.1| PREDICTED: uncharacterized protein LOC105057942 isoform X1 [Elaeis
            guineensis]
          Length = 1380

 Score =  179 bits (453), Expect = 8e-48
 Identities = 102/235 (43%), Positives = 130/235 (55%), Gaps = 12/235 (5%)
 Frame = +3

Query: 15   EPNADILNSDLNGHRQNLVYGRFCQNSYH-APFMYAPRAVESPINLSAHYPWDGSGRSIA 191
            E  +DILN D   H QNL YGR CQN++H  PFMY    +  P+ L  H+  DG GR  A
Sbjct: 1021 EHKSDILNGDFLSHWQNLQYGRSCQNAHHHGPFMYQSPVMVPPVYLQGHFSCDGPGRPHA 1080

Query: 192  TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEA-QRYRGGTGTYLPNPKVPXX 365
             N N +TQ M + P++VP+TPLQ GP R +  FQ   DE   RYRGGTGTYLPNPK+   
Sbjct: 1081 ANGNLFTQIMSYGPQLVPVTPLQPGPHRISGVFQHFGDEVLPRYRGGTGTYLPNPKISFR 1140

Query: 366  XXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
                             ND  DR         RA  RSHGR+ AE+ + RPDR +   NQ
Sbjct: 1141 DRQSSTRNHRGNYSYDRNDHADREGSWINAKSRASGRSHGRTPAEKPSLRPDRLSTTDNQ 1200

Query: 522  EEKRWATYKHEPLAAHRGPDRTVPSTNSSNSSEN-AFGMYPETAFRSNGVNSTRP 683
             ++ W   +HE  A+ +G +R+    NSS +S N A+GMYP +    NGV+ T P
Sbjct: 1201 VDRPWGPRRHETPASDQGQNRSFGFANSSRNSPNMAYGMYPVSTVSPNGVSPTGP 1255


>ref|XP_010261538.1| PREDICTED: uncharacterized protein LOC104600345 isoform X2 [Nelumbo
            nucifera]
          Length = 1367

 Score =  177 bits (450), Expect = 2e-47
 Identities = 99/239 (41%), Positives = 132/239 (55%), Gaps = 14/239 (5%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
            E  EE  +DILNSD   H QNL YGRFCQN  Y  P  Y    +  P+ L  H+PWDG G
Sbjct: 1048 EPTEEHKSDILNSDFASHWQNLQYGRFCQNPRYPGPLFYPSPVMVPPVYLQGHFPWDGPG 1107

Query: 180  RSIATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
            R ++ N N +TQ + + PR+ P+ PLQ G +R   A+QR  DEA RYRGGTGTYLPNPKV
Sbjct: 1108 RPLSANGNLFTQLVNYGPRLFPVAPLQPGSNRPGGAYQRYGDEAPRYRGGTGTYLPNPKV 1167

Query: 357  PXXXXXXXXXXXXXXXXXXWND-PGDRF---------RAVDRSHGRSQAERATFRPDRQA 506
                                ND  GDR          RA  R+HGR+Q E+ + +PD+ A
Sbjct: 1168 SFRDRQASTARNHRGNNYDRNDHHGDREGTWNTNSKPRAAGRNHGRNQVEKLSSKPDQLA 1227

Query: 507  AAKNQEEKRWATYKHEPLAAHRGPDRTVPSTNS--SNSSENAFGMYPETAFRSNGVNST 677
            A  N+ ++ W +Y+H    +++  +    ++NS  S+S+  A+GMYP     SNG   T
Sbjct: 1228 ANDNRADRPWGSYRHNSFPSYQSQNGPFSASNSMHSSSANLAYGMYPLPPINSNGNTPT 1286


>ref|XP_010261537.1| PREDICTED: uncharacterized protein LOC104600345 isoform X1 [Nelumbo
            nucifera]
          Length = 1413

 Score =  177 bits (450), Expect = 2e-47
 Identities = 99/239 (41%), Positives = 132/239 (55%), Gaps = 14/239 (5%)
 Frame = +3

Query: 3    ELPEEPNADILNSDLNGHRQNLVYGRFCQNS-YHAPFMYAPRAVESPINLSAHYPWDGSG 179
            E  EE  +DILNSD   H QNL YGRFCQN  Y  P  Y    +  P+ L  H+PWDG G
Sbjct: 1048 EPTEEHKSDILNSDFASHWQNLQYGRFCQNPRYPGPLFYPSPVMVPPVYLQGHFPWDGPG 1107

Query: 180  RSIATNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKV 356
            R ++ N N +TQ + + PR+ P+ PLQ G +R   A+QR  DEA RYRGGTGTYLPNPKV
Sbjct: 1108 RPLSANGNLFTQLVNYGPRLFPVAPLQPGSNRPGGAYQRYGDEAPRYRGGTGTYLPNPKV 1167

Query: 357  PXXXXXXXXXXXXXXXXXXWND-PGDRF---------RAVDRSHGRSQAERATFRPDRQA 506
                                ND  GDR          RA  R+HGR+Q E+ + +PD+ A
Sbjct: 1168 SFRDRQASTARNHRGNNYDRNDHHGDREGTWNTNSKPRAAGRNHGRNQVEKLSSKPDQLA 1227

Query: 507  AAKNQEEKRWATYKHEPLAAHRGPDRTVPSTNS--SNSSENAFGMYPETAFRSNGVNST 677
            A  N+ ++ W +Y+H    +++  +    ++NS  S+S+  A+GMYP     SNG   T
Sbjct: 1228 ANDNRADRPWGSYRHNSFPSYQSQNGPFSASNSMHSSSANLAYGMYPLPPINSNGNTPT 1286


>gb|KHN06300.1| Poly(A) RNA polymerase cid14 [Glycine soja]
          Length = 1496

 Score =  177 bits (450), Expect = 2e-47
 Identities = 99/236 (41%), Positives = 132/236 (55%), Gaps = 12/236 (5%)
 Frame = +3

Query: 15   EPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVE-SPINLSAHYPWDGSGRSIA 191
            E   DILNSD   H QNL YGRFCQNS H P M  P  V   P+ L   YPWDG GR I+
Sbjct: 1119 EHRPDILNSDFVSHWQNLQYGRFCQNSRHPPSMTYPSPVMVPPVYLQGRYPWDGPGRPIS 1178

Query: 192  TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
             N+N ++Q M + PR+VP+ PLQS  +R AS +QR VD+  RYR GTGTYLPNPKV    
Sbjct: 1179 GNMNIFSQLMSYGPRLVPVAPLQSVSNRPASIYQRYVDDMPRYRSGTGTYLPNPKVSARD 1238

Query: 369  XXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
                            +  GDR          R   R H R+Q E+   + +R A ++++
Sbjct: 1239 RHSTNTRRGNYNYDRSDHHGDREGNWNTNSKLRGTGRGHNRNQTEKPNSKMERSATSESR 1298

Query: 522  EEKRWATYKHEPLAAHR-GPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
             E+ W +++H+    H+ GP R+  +++ SN S  A+GMYP  A   +GV+S  PT
Sbjct: 1299 AERPWGSHRHDTFIPHQNGPVRS--NSSQSNPSNVAYGMYPMPAMNPSGVSSNGPT 1352


>gb|PKA49759.1| hypothetical protein AXF42_Ash004300 [Apostasia shenzhenica]
          Length = 1363

 Score =  177 bits (449), Expect = 3e-47
 Identities = 99/230 (43%), Positives = 127/230 (55%), Gaps = 11/230 (4%)
 Frame = +3

Query: 12   EEPNADILNSDLNGHRQNLVYGRFCQN-SYHAPFMYAPRAVESPINLSAHYPWDGSGRSI 188
            EE  +DILNSD   H QNL YGR CQN   H  F+Y+P  +  P+ L  H+P DG GR +
Sbjct: 1051 EEHKSDILNSDFASHWQNLQYGRLCQNIPNHGQFIYSPPVMVPPVYLQGHFPLDGPGRPL 1110

Query: 189  ATNLNY-TQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXX 365
            A NLN+ TQ MG+ PR+VP+ PLQ GP R ++ F R  DE  RYR GTGTYLPNPK    
Sbjct: 1111 APNLNFFTQMMGYGPRLVPVAPLQPGPSRPSNVFHRYGDEPPRYRAGTGTYLPNPKATFR 1170

Query: 366  XXXXXXXXXXXXXXXXWNDPGDR--------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
                                 DR         R+  RSHGR+  E+ + RPDR AA+  +
Sbjct: 1171 DRHYSSTRNHRGTYNYDRGDADREGSWVNSKNRSGGRSHGRNHNEKLSLRPDRLAASDVR 1230

Query: 522  EEKRWATYKHEPLAAHRGPDRTV-PSTNSSNSSENAFGMYPETAFRSNGV 668
             E+ W +Y+HE    H+  + +   ST S NS+  A GMYP +   SNG+
Sbjct: 1231 NERVWESYRHEQAVPHQVQNSSFGSSTTSLNSANVAHGMYPVSGASSNGL 1280


>ref|XP_006583248.1| PREDICTED: uncharacterized protein LOC100809742 isoform X3 [Glycine
            max]
 gb|KRH47923.1| hypothetical protein GLYMA_07G056700 [Glycine max]
          Length = 1329

 Score =  177 bits (448), Expect = 4e-47
 Identities = 99/236 (41%), Positives = 132/236 (55%), Gaps = 12/236 (5%)
 Frame = +3

Query: 15   EPNADILNSDLNGHRQNLVYGRFCQNSYHAPFMYAPRAVE-SPINLSAHYPWDGSGRSIA 191
            E   DILNSD   H QNL YGRFCQNS H P M  P  V   P+ L   YPWDG GR I+
Sbjct: 1017 EHRPDILNSDFVSHWQNLQYGRFCQNSRHPPSMTYPSPVMVPPVYLQGRYPWDGPGRPIS 1076

Query: 192  TNLN-YTQFMGHNPRVVPMTPLQSGPDRSASAFQRNVDEAQRYRGGTGTYLPNPKVPXXX 368
             N+N ++Q M + PR+VP+ PLQS  +R AS +QR VD+  RYR GTGTYLPNPKV    
Sbjct: 1077 GNMNIFSQLMSYGPRLVPVAPLQSVSNRPASIYQRYVDDMPRYRSGTGTYLPNPKVSARD 1136

Query: 369  XXXXXXXXXXXXXXXWNDPGDR---------FRAVDRSHGRSQAERATFRPDRQAAAKNQ 521
                            +  GDR          R   R H R+Q E+   + +R A ++++
Sbjct: 1137 RHSTNTRRGNYPYDRSDHHGDREGNWNTNSKLRGTGRGHNRNQTEKPNSKMERLATSESR 1196

Query: 522  EEKRWATYKHEPLAAHR-GPDRTVPSTNSSNSSENAFGMYPETAFRSNGVNSTRPT 686
             E+ W +++H+    H+ GP R+  +++ SN S  A+GMYP  A   +GV+S  PT
Sbjct: 1197 AERPWGSHRHDTFIPHQNGPVRS--NSSQSNPSNVAYGMYPMPAMNPSGVSSNGPT 1250

