BLASTX nr result

ID: Rehmannia31_contig00009672 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00009672
         (743 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011094935.1| uncharacterized protein LOC105174506 [Sesamu...   355   e-120
gb|PIN01452.1| hypothetical protein CDL12_26041 [Handroanthus im...   320   e-106
ref|XP_012832070.1| PREDICTED: uncharacterized protein LOC105953...   295   4e-96
ref|XP_011077613.1| uncharacterized protein LOC105161579 isoform...   216   1e-65
ref|XP_011077612.1| uncharacterized protein LOC105161579 isoform...   216   2e-65
ref|XP_012850440.1| PREDICTED: uncharacterized protein LOC105970...   200   2e-59
ref|XP_022846619.1| uncharacterized protein LOC111369366 isoform...   188   4e-54
ref|XP_022846618.1| uncharacterized protein LOC111369366 isoform...   188   5e-54
gb|OMO66447.1| hypothetical protein CCACVL1_21133 [Corchorus cap...   176   2e-49
gb|KZV18544.1| hypothetical protein F511_21572 [Dorcoceras hygro...   172   2e-48
dbj|GAV65464.1| hypothetical protein CFOL_v3_08979 [Cephalotus f...   174   3e-48
ref|XP_022842721.1| uncharacterized protein LOC111366238 isoform...   172   5e-48
ref|XP_009797440.1| PREDICTED: uncharacterized protein LOC104243...   170   3e-47
ref|XP_021897389.1| uncharacterized protein LOC110814282 [Carica...   170   4e-47
ref|XP_022842720.1| uncharacterized protein LOC111366238 isoform...   169   6e-47
gb|OMO97344.1| hypothetical protein COLO4_14683 [Corchorus olito...   169   1e-46
ref|XP_016490474.1| PREDICTED: uncharacterized protein LOC107810...   168   2e-46
gb|EOY07362.1| Uncharacterized protein TCM_021816 [Theobroma cacao]   169   2e-46
ref|XP_017977724.1| PREDICTED: uncharacterized protein LOC185976...   168   3e-46
ref|XP_009596622.1| PREDICTED: uncharacterized protein LOC104092...   166   7e-46

>ref|XP_011094935.1| uncharacterized protein LOC105174506 [Sesamum indicum]
          Length = 383

 Score =  355 bits (912), Expect = e-120
 Identities = 176/246 (71%), Positives = 191/246 (77%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           KGWKLSGSPQSTLC CKPGS + SPN VSRVSSPPD KD VGWDLLYAAAGEV+RMRMIE
Sbjct: 98  KGWKLSGSPQSTLCGCKPGSSRGSPNSVSRVSSPPDGKDAVGWDLLYAAAGEVARMRMIE 157

Query: 186 ETTPFHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXXXXXXXXX 365
           ETTPFHS KLFAP PKPSPV +PV+K  P TGFYP Q QTQAH+AYL             
Sbjct: 158 ETTPFHSNKLFAPPPKPSPVAVPVKKPNPATGFYPNQAQTQAHLAYLQLQATQFQRMKQQ 217

Query: 366 XXXRRGVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTLXXXXXXXXXXPGSGMRAFFL 545
              +  V  QGK++ QFQNG+ TG  GRTQGLSMAAWPTL          PGSGMRA FL
Sbjct: 218 QMVKSNVWRQGKLDYQFQNGR-TGIVGRTQGLSMAAWPTLQQSQQQQHQQPGSGMRAVFL 276

Query: 546 GETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLNMDSVDAQSQHR 725
           G+TG KKER GTGVFLPRR+G++PTET KKPGCSTVLLPDRVVHALNLN++SVD QSQ R
Sbjct: 277 GDTGAKKERTGTGVFLPRRFGSNPTETRKKPGCSTVLLPDRVVHALNLNLESVDDQSQLR 336

Query: 726 GNRNFT 743
           GN NFT
Sbjct: 337 GNGNFT 342


>gb|PIN01452.1| hypothetical protein CDL12_26041 [Handroanthus impetiginosus]
          Length = 373

 Score =  320 bits (820), Expect = e-106
 Identities = 165/246 (67%), Positives = 182/246 (73%), Gaps = 1/246 (0%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           KGWKLSGSPQSTLC CKP S   SPN VSRVSSP +AKDVVGWDLLYAAAGEV+RMRMIE
Sbjct: 88  KGWKLSGSPQSTLCGCKPVSGGGSPNCVSRVSSPSNAKDVVGWDLLYAAAGEVARMRMIE 147

Query: 186 ETTPFHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXXXXXXXXX 365
           ETT  +S K F P+PKPSPVTLPV K  P +GFYPTQ QT+AH+AYL             
Sbjct: 148 ETTQLNSNKRFTPSPKPSPVTLPVRKPIPASGFYPTQNQTEAHLAYLQLQATQFQRMRQQ 207

Query: 366 XXXRRGVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTLXXXXXXXXXXPGSGMRAFFL 545
              + GV  Q K++ QFQNG+  G  GR QGLS AAWPTL          PGSGMRA FL
Sbjct: 208 QMMKTGVWGQRKMDYQFQNGRTAG-TGRNQGLSTAAWPTLQQSEQQQQQQPGSGMRAVFL 266

Query: 546 GETGVKKERIGTGVFLPRRYGASPTETHKKP-GCSTVLLPDRVVHALNLNMDSVDAQSQH 722
           GETGVKKER GTGVFLPRR+G +P ET KKP GCST LLPDRVVHALNLN++++DAQS  
Sbjct: 267 GETGVKKERTGTGVFLPRRFGTNPAETRKKPAGCSTALLPDRVVHALNLNLETLDAQSPL 326

Query: 723 RGNRNF 740
           RGN NF
Sbjct: 327 RGNGNF 332


>ref|XP_012832070.1| PREDICTED: uncharacterized protein LOC105953004 [Erythranthe
           guttata]
 gb|EYU41797.1| hypothetical protein MIMGU_mgv1a008453mg [Erythranthe guttata]
          Length = 373

 Score =  295 bits (755), Expect = 4e-96
 Identities = 158/254 (62%), Positives = 176/254 (69%), Gaps = 8/254 (3%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           KG KLSGSPQSTLC CKP S Q+SPN  SRVS+PPD KDV GWDLLYAAAGEV+R+RM +
Sbjct: 94  KGIKLSGSPQSTLCGCKPASSQSSPNSASRVSTPPDEKDVAGWDLLYAAAGEVARIRMSD 153

Query: 186 ETTPFHSTKLFAPA--PKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXXXXXXX 359
           ETTPFHS KLF P   P PSP+ +PV+K T        QTQTQA I+YL           
Sbjct: 154 ETTPFHSAKLFPPTLNPNPSPIAVPVQKNT------NNQTQTQAQISYLQFQATRFQRMK 207

Query: 360 XXXXXRR-GVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTL-----XXXXXXXXXXPG 521
                 R GV   G+ E +FQN  ++ E  RTQGLSMAAWPTL               PG
Sbjct: 208 QQQQMMRDGVWRHGETEYKFQNMNISAEPVRTQGLSMAAWPTLKQSQHQHQHQQQQQQPG 267

Query: 522 SGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLNMDS 701
           SGMRA FLGETG KKER GTGVFLPRR G  PT+T KKPGCSTVLLPDRVVHALNLN+D+
Sbjct: 268 SGMRAVFLGETGAKKERTGTGVFLPRRVGPDPTQTRKKPGCSTVLLPDRVVHALNLNLDT 327

Query: 702 VDAQSQHRGNRNFT 743
           +D+QS  RGNRNFT
Sbjct: 328 MDSQSHLRGNRNFT 341


>ref|XP_011077613.1| uncharacterized protein LOC105161579 isoform X2 [Sesamum indicum]
          Length = 358

 Score =  216 bits (551), Expect = 1e-65
 Identities = 122/253 (48%), Positives = 148/253 (58%), Gaps = 11/253 (4%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           KGW+ SGSP+STLC  KP S +   N VSRV SP D +  + WD L AA  EV RMRMIE
Sbjct: 94  KGWRPSGSPRSTLCGLKPAS-RGGSNSVSRVCSPQDGRSALTWDWLNAAGEEVMRMRMIE 152

Query: 186 ETTPFHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXXXXXXXXX 365
           E   F+STKLF P PKP+PVT+P       +GFY +Q Q QA ++               
Sbjct: 153 EAASFYSTKLFGPPPKPNPVTMPQHNPNSASGFYLSQFQRQAQLS-CEQFQAAQFQQMNQ 211

Query: 366 XXXRRGVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTLXXXXXXXXXXPGSGMRAFFL 545
                GV  QG +  Q  NG + G  GR QGLSM+AWPTL          P  GMRA   
Sbjct: 212 HHVIDGVNGQGGLRYQLHNGSINGAGGRAQGLSMSAWPTL---QQSQQQQPVLGMRAVLF 268

Query: 546 GETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLNMDS-------- 701
           GETG KKER+GTGVF+PRR+ ++PT T   PGCSTVLLPDR+ H +NL++D+        
Sbjct: 269 GETGAKKERVGTGVFMPRRFSSNPTGT---PGCSTVLLPDRLDHTMNLDLDAQVQSNGGF 325

Query: 702 ---VDAQSQHRGN 731
               DA S+ RGN
Sbjct: 326 KTQYDAVSKQRGN 338


>ref|XP_011077612.1| uncharacterized protein LOC105161579 isoform X1 [Sesamum indicum]
          Length = 362

 Score =  216 bits (551), Expect = 2e-65
 Identities = 122/254 (48%), Positives = 149/254 (58%), Gaps = 12/254 (4%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           KGW+ SGSP+STLC  KP S +   N VSRV SP D +  + WD L AA  EV RMRMIE
Sbjct: 94  KGWRPSGSPRSTLCGLKPAS-RGGSNSVSRVCSPQDGRSALTWDWLNAAGEEVMRMRMIE 152

Query: 186 ETTPFHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXXXXXXXXX 365
           E   F+STKLF P PKP+PVT+P       +GFY +Q Q QA ++               
Sbjct: 153 EAASFYSTKLFGPPPKPNPVTMPQHNPNSASGFYLSQFQRQAQLS-CEQFQAAQFQQMNQ 211

Query: 366 XXXRRGVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTLXXXXXXXXXXPGSGMRAFFL 545
                GV  QG +  Q  NG + G  GR QGLSM+AWPTL          P  GMRA   
Sbjct: 212 HHVIDGVNGQGGLRYQLHNGSINGAGGRAQGLSMSAWPTL---QQSQQQQPVLGMRAVLF 268

Query: 546 GETGVKKERIGTGVFLPRRYGASPTETHKK-PGCSTVLLPDRVVHALNLNMDS------- 701
           GETG KKER+GTGVF+PRR+ ++PT T K+  GCSTVLLPDR+ H +NL++D+       
Sbjct: 269 GETGAKKERVGTGVFMPRRFSSNPTGTRKESAGCSTVLLPDRLDHTMNLDLDAQVQSNGG 328

Query: 702 ----VDAQSQHRGN 731
                DA S+ RGN
Sbjct: 329 FKTQYDAVSKQRGN 342


>ref|XP_012850440.1| PREDICTED: uncharacterized protein LOC105970184 [Erythranthe
           guttata]
 gb|EYU26624.1| hypothetical protein MIMGU_mgv1a009884mg [Erythranthe guttata]
          Length = 328

 Score =  200 bits (508), Expect = 2e-59
 Identities = 116/225 (51%), Positives = 139/225 (61%), Gaps = 1/225 (0%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           KG KLS SP+STLC  KPGS + SPN VS VSSPPDA D +  +LLYAAAGEV+R+++IE
Sbjct: 91  KGLKLSSSPKSTLCGYKPGS-RGSPNSVSGVSSPPDANDAIQLELLYAAAGEVARLQLIE 149

Query: 186 ETTPFHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXXXXXXXXX 365
           ET  F+S   FAP PK SPVTLP       +GFY  Q Q QA   Y              
Sbjct: 150 ETAAFYSGNFFAPPPKRSPVTLPQPNPNLASGFYHNQPQKQARFTYEQLQAAKFRQMKQY 209

Query: 366 XXXRRGVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTLXXXXXXXXXXPGSGMRAFFL 545
                   EQG ++ Q QN +  G   + Q LS AAWPTL           GSGMRA FL
Sbjct: 210 QMMNNW--EQGIVDYQLQN-RRNGGGQKCQSLSPAAWPTL----QQSHHQQGSGMRAVFL 262

Query: 546 GETGVKKERIGTGVFLPRRYGASPTETHKK-PGCSTVLLPDRVVH 677
           GE  +KKER+GTGVF+PRR+ ++PT+T KK  GCST LLP++VVH
Sbjct: 263 GENVLKKERVGTGVFMPRRFCSNPTDTPKKTAGCSTALLPEKVVH 307


>ref|XP_022846619.1| uncharacterized protein LOC111369366 isoform X2 [Olea europaea var.
           sylvestris]
          Length = 394

 Score =  188 bits (477), Expect = 4e-54
 Identities = 124/265 (46%), Positives = 152/265 (57%), Gaps = 25/265 (9%)
 Frame = +3

Query: 18  LSGSPQSTLCACKP------GSIQTSPNGVSRVSSPP--DAKDVVGWDLLYAAAGEVSRM 173
           LS SPQSTLC          GS   SPN VS VSSPP  + K    WD LY AAGE++RM
Sbjct: 97  LSRSPQSTLCGAMGAHGSNFGSSGASPNCVSMVSSPPGVNRKGAADWDALYEAAGELARM 156

Query: 174 RMIEETTPFHST-----KLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXX 338
           R+IEET  F+S      ++F    K +P++LP+E     +GFYP Q   +  ++Y     
Sbjct: 157 RIIEETNGFYSGNRCAGEVFRAPKKLNPISLPLENPQRGSGFYPNQ---EPQLSYRQLQA 213

Query: 339 XXXXXXXXXXXXRR----GVREQGK------IENQFQNGKMTGEAGRTQGLSMAAWPTLX 488
                       ++    GV  QGK      + N  +NG   GE    +  SMA WPTL 
Sbjct: 214 TQFQQLKMQLMMKQQQGSGVWGQGKGGYPQMVPNVRRNG---GERTLPRDFSMATWPTLQ 270

Query: 489 XXXXXXXXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDR 668
                    PG+GMRA FLG+TG KKER GTGVFLPRR+G +PTET KKPGCSTVLLPDR
Sbjct: 271 QSQRQ----PGAGMRAMFLGDTGAKKERAGTGVFLPRRFG-TPTETRKKPGCSTVLLPDR 325

Query: 669 VVHALNLNMDSVDAQSQ--HRGNRN 737
           VV ALNL++D+VD Q Q   RG+ N
Sbjct: 326 VVQALNLHLDNVDPQPQVHSRGSAN 350


>ref|XP_022846618.1| uncharacterized protein LOC111369366 isoform X1 [Olea europaea var.
           sylvestris]
          Length = 395

 Score =  188 bits (477), Expect = 5e-54
 Identities = 124/265 (46%), Positives = 152/265 (57%), Gaps = 25/265 (9%)
 Frame = +3

Query: 18  LSGSPQSTLCACKP------GSIQTSPNGVSRVSSPP--DAKDVVGWDLLYAAAGEVSRM 173
           LS SPQSTLC          GS   SPN VS VSSPP  + K    WD LY AAGE++RM
Sbjct: 97  LSRSPQSTLCGAMGAHGSNFGSSGASPNCVSMVSSPPGVNRKGAADWDALYEAAGELARM 156

Query: 174 RMIEETTPFHST-----KLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXX 338
           R+IEET  F+S      ++F    K +P++LP+E     +GFYP Q   +  ++Y     
Sbjct: 157 RIIEETNGFYSGNRCAGEVFRAPKKLNPISLPLENPQRGSGFYPNQ---EPQLSYRQLQA 213

Query: 339 XXXXXXXXXXXXRR----GVREQGK------IENQFQNGKMTGEAGRTQGLSMAAWPTLX 488
                       ++    GV  QGK      + N  +NG   GE    +  SMA WPTL 
Sbjct: 214 TQFQQLKMQLMMKQQQGSGVWGQGKGGYPQMVPNVRRNG---GERTLPRDFSMATWPTLQ 270

Query: 489 XXXXXXXXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDR 668
                    PG+GMRA FLG+TG KKER GTGVFLPRR+G +PTET KKPGCSTVLLPDR
Sbjct: 271 QSQRQ----PGAGMRAMFLGDTGAKKERAGTGVFLPRRFG-TPTETRKKPGCSTVLLPDR 325

Query: 669 VVHALNLNMDSVDAQSQ--HRGNRN 737
           VV ALNL++D+VD Q Q   RG+ N
Sbjct: 326 VVQALNLHLDNVDPQPQVHSRGSAN 350


>gb|OMO66447.1| hypothetical protein CCACVL1_21133 [Corchorus capsularis]
          Length = 408

 Score =  176 bits (446), Expect = 2e-49
 Identities = 120/269 (44%), Positives = 144/269 (53%), Gaps = 23/269 (8%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVS 167
           KGW LSGSPQSTLC       CK GS + SPN  SRVSSPP       WDLLYAAAGEV+
Sbjct: 119 KGWVLSGSPQSTLCGLGNGCGCKQGSSRGSPNCQSRVSSPPGT-----WDLLYAAAGEVA 173

Query: 168 RMRMIEETTPFHSTK--LFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXX 341
           RMRM EE+    + +  L  PA KPSP           + +YP   Q+ +H   L     
Sbjct: 174 RMRMNEESYGGFNNRGLLGPPARKPSP-------NLDVSSYYPPSHQSFSHHQKLQATQF 226

Query: 342 XXXXXXXXXXXRR-------GVREQGK------IENQFQNGKMTGEAGRTQGLSMAAWPT 482
                      ++       G ++Q        ++N+ +NG        + GLS +AWP 
Sbjct: 227 QQLKQQQQQLMKQQSASVWGGQKQQQHQNNHHVVQNRGRNGNSNSNNRPSLGLSPSAWPP 286

Query: 483 LXXXXXXXXXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLP 662
           L           GSGMRA FLG    K+E  GTGVFLPRR GA PTET KK GCSTVLLP
Sbjct: 287 LQQQQQQTQSQNGSGMRAVFLGNPAGKRECAGTGVFLPRRVGA-PTETRKKTGCSTVLLP 345

Query: 663 DRVVHALNLNMDSVDAQSQ--HRGNRNFT 743
            RVV ALNLN+D ++AQ Q   R N +FT
Sbjct: 346 ARVVQALNLNLDEINAQPQLHPRFNASFT 374


>gb|KZV18544.1| hypothetical protein F511_21572 [Dorcoceras hygrometricum]
          Length = 364

 Score =  172 bits (437), Expect = 2e-48
 Identities = 111/249 (44%), Positives = 139/249 (55%), Gaps = 5/249 (2%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVSRMRMIE 185
           +G KLSGSPQSTL    PGS + SP   S+V+SP + KD  GWDLLYAAAGEV+RM+  +
Sbjct: 98  QGLKLSGSPQSTLSGYYPGSSRGSPKCTSQVASPTEVKDPAGWDLLYAAAGEVARMKAFQ 157

Query: 186 ETTPFHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPT-QTQTQAHIAYLXXXXXXXXXXXX 362
           E   F+ST       KP+P T P    +  +G Y   +TQ Q    ++            
Sbjct: 158 EAKAFYST-------KPNPFTTPQTNPSIMSGIYSNHKTQVQLSQQHM-------QAANF 203

Query: 363 XXXXRRGVREQGKIENQFQNGKMTGEAGRTQGLSMAAWPTL--XXXXXXXXXXPGSGMRA 536
               R G+  Q +  +QFQNG+  GE  R  G+S  AWPTL            P SGMRA
Sbjct: 204 QKMTRSGIWGQQQAGSQFQNGRSIGE--RRNGVSTGAWPTLQQSHSHKQPPLIPRSGMRA 261

Query: 537 FFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLNMDSVD--A 710
            + GE G KKER GTGVFLPRRY  +P E+ KKP        DRVVH LNLN++S+D  A
Sbjct: 262 TYPGENGAKKERTGTGVFLPRRYETTPPESRKKPA-------DRVVH-LNLNLESIDIPA 313

Query: 711 QSQHRGNRN 737
           ++  RG  N
Sbjct: 314 EAHFRGISN 322


>dbj|GAV65464.1| hypothetical protein CFOL_v3_08979 [Cephalotus follicularis]
          Length = 425

 Score =  174 bits (440), Expect = 3e-48
 Identities = 115/261 (44%), Positives = 139/261 (53%), Gaps = 16/261 (6%)
 Frame = +3

Query: 6   KGWKLSGSPQSTL------CACKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVS 167
           KGW LSGSPQSTL      C C+ GS + S N  SRVSSPP+      WDLLYAAAGEV+
Sbjct: 130 KGWGLSGSPQSTLSAIASGCGCRQGSSKGSSNCQSRVSSPPET-----WDLLYAAAGEVA 184

Query: 168 RMRMIEETTPFHSTKLFAPAPKPSP----VTLPVEKQTPTTGFYPTQTQTQAHIAYLXXX 335
           RM+M E+   F+  K     PK +     V++P++  +  TGFYP           L   
Sbjct: 185 RMKMNEDMYGFNHNKGLLGLPKNNTRKPSVSVPLKNPSSDTGFYPHDKLQAIQYQQLKQQ 244

Query: 336 XXXXXXXXXXXXXRRGVREQGKIENQFQ-----NGKMTGEAGRTQGLSMAAWPTLXXXXX 500
                          G + +G   +  Q       + +G  GR  GLS +AWP L     
Sbjct: 245 QIMKQQQQQHISAVWGGQTKGSTGSYSQLVQRSGVRNSGVVGRPLGLSPSAWPPLQQPQQ 304

Query: 501 XXXXXP-GSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVH 677
                  GSGMRA FLG  G K+E +GTGVFLPRR G + TE  KKPGCSTVLLP RVV 
Sbjct: 305 QYNNQRNGSGMRAVFLGSNGTKRESVGTGVFLPRRIG-TVTECRKKPGCSTVLLPARVVQ 363

Query: 678 ALNLNMDSVDAQSQHRGNRNF 740
           ALNLN+D   AQ Q R N +F
Sbjct: 364 ALNLNLDDPRAQLQPRFNGSF 384


>ref|XP_022842721.1| uncharacterized protein LOC111366238 isoform X2 [Olea europaea var.
           sylvestris]
          Length = 375

 Score =  172 bits (435), Expect = 5e-48
 Identities = 111/254 (43%), Positives = 143/254 (56%), Gaps = 17/254 (6%)
 Frame = +3

Query: 3   PKGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPP--DAKDVVGWDLLYAAAG 158
           PKG  LS SPQSTLC+      CK      S   +SRVSS P  + KD   WDLLYAA  
Sbjct: 86  PKGMGLSCSPQSTLCSVTGWCECK----HRSSCCISRVSSHPRVNLKDGAAWDLLYAATE 141

Query: 159 EVSRMRMIEETTPFHSTK-----LFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAY 323
           E +RMRMI+E   F+S+      +  P  KP  VT P++K    +G +  Q+    H++Y
Sbjct: 142 EAARMRMIQEANGFYSSNKRVGGVLVPPRKPGYVTAPLKKPNYGSGIFANQS----HLSY 197

Query: 324 LXXXXXXXXXXXXXXXXRRGV----REQGKIENQFQNGKMTGEAGRTQGLSMAAWPTLXX 491
                            R+G     +E+   +   Q+ +  GE  R+QGLS+AAWPTL  
Sbjct: 198 RQLQATQELKLHSMMKQRQGSGFLGQEKIGYQQMVQSVRKNGE--RSQGLSVAAWPTLPK 255

Query: 492 XXXXXXXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRV 671
                   PGSGMRA FLG+ G KKER GTGVFLPRR+  + T T KKPGCST L+PD+V
Sbjct: 256 SQQKRELQPGSGMRAVFLGDRGTKKERSGTGVFLPRRFEPA-TGTRKKPGCSTFLIPDQV 314

Query: 672 VHALNLNMDSVDAQ 713
             ALNLN++S++ Q
Sbjct: 315 AQALNLNLESMNTQ 328


>ref|XP_009797440.1| PREDICTED: uncharacterized protein LOC104243877 [Nicotiana
           sylvestris]
          Length = 384

 Score =  170 bits (430), Expect = 3e-47
 Identities = 110/253 (43%), Positives = 138/253 (54%), Gaps = 11/253 (4%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPD-AKDVVGWDLLYAAAGEVSRMRMI 182
           KGW LS SPQSTLC CK GS + SPNG S+ SSPP   +  V  DLLYAAAGEV+R+RM+
Sbjct: 94  KGWGLSRSPQSTLCGCKQGSSRGSPNGPSQASSPPRMTRPEVSMDLLYAAAGEVARIRMM 153

Query: 183 EETTPFHSTK------LFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXX 344
           EE+T  +S          AP  K SPV +  +   P  G +      Q  ++Y       
Sbjct: 154 EESTALYSHNRAGGGIWAAPPRKTSPVPVGPKNTKPNLGSF---NSNQPPLSYQQLQVAQ 210

Query: 345 XXXXXXXXXXRRGVREQGKIENQFQ---NGKMTGEAGRTQGLSMAAWPTL-XXXXXXXXX 512
                     ++G   Q  +    Q   N    G  G    L  +AWPTL          
Sbjct: 211 FQRLKQQQMMKQGGYRQFPVNQNQQVAANRARNGVNGSLMNLPNSAWPTLQQSQSQQQQP 270

Query: 513 XPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLN 692
             GSGMRA FLG  G K+E +GTGVFLPRR G + TET KKPGC  V+LPDRVV ALNLN
Sbjct: 271 NTGSGMRAVFLGNPGPKRECVGTGVFLPRRIG-TQTETRKKPGC-PVILPDRVVQALNLN 328

Query: 693 MDSVDAQSQHRGN 731
           ++++DA ++ + N
Sbjct: 329 LETMDAATRPQSN 341


>ref|XP_021897389.1| uncharacterized protein LOC110814282 [Carica papaya]
          Length = 411

 Score =  170 bits (431), Expect = 4e-47
 Identities = 111/249 (44%), Positives = 135/249 (54%), Gaps = 11/249 (4%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVS 167
           KG  +SGSPQSTLC       CK GS Q SPN  SRVSSPP A     WDLLYAAAGEV+
Sbjct: 118 KGRVMSGSPQSTLCTVGSGCGCKQGSNQGSPNCQSRVSSPPGA-----WDLLYAAAGEVA 172

Query: 168 RMRMIEETTPF-HSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHI--AYLXXXX 338
           RMRM E++  + H   L  P  KPSPV++P +      G+Y  Q+ +   +         
Sbjct: 173 RMRMNEKSYGYNHGRGLLGPPGKPSPVSVPAKNPNIDVGYYAYQSLSHQKMQATQFHQLK 232

Query: 339 XXXXXXXXXXXXRRGVREQGKIENQFQNGKMTGE--AGRTQGLSMAAWPTLXXXXXXXXX 512
                        +G  +   + N+ +N +      +GR  GLS +AWP L         
Sbjct: 233 QQQMMKLRPPAWSKGTGQYQMVPNRGRNIEFIANRASGRPLGLSPSAWPPLQQAPQQQN- 291

Query: 513 XPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLN 692
             GSGMRA FLG    K+E  GTGVFLPRR G S TET KKP CSTVLLP RVV ALNLN
Sbjct: 292 --GSGMRAVFLGNPAAKRECTGTGVFLPRRVGTS-TETRKKPACSTVLLPARVVQALNLN 348

Query: 693 MDSVDAQSQ 719
           ++ +    Q
Sbjct: 349 VEEMGVHPQ 357


>ref|XP_022842720.1| uncharacterized protein LOC111366238 isoform X1 [Olea europaea var.
           sylvestris]
          Length = 377

 Score =  169 bits (428), Expect = 6e-47
 Identities = 112/256 (43%), Positives = 143/256 (55%), Gaps = 19/256 (7%)
 Frame = +3

Query: 3   PKGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPP--DAKDVVGWDLLYAAAG 158
           PKG  LS SPQSTLC+      CK      S   +SRVSS P  + KD   WDLLYAA  
Sbjct: 86  PKGMGLSCSPQSTLCSVTGWCECK----HRSSCCISRVSSHPRVNLKDGAAWDLLYAATE 141

Query: 159 EVSRMRMIEETTPFHSTK-----LFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAY 323
           E +RMRMI+E   F+S+      +  P  KP  VT P++K    +G +  Q+    H++Y
Sbjct: 142 EAARMRMIQEANGFYSSNKRVGGVLVPPRKPGYVTAPLKKPNYGSGIFANQS----HLSY 197

Query: 324 LXXXXXXXXXXXXXXXXRR----GVREQGKI--ENQFQNGKMTGEAGRTQGLSMAAWPTL 485
                            ++    G   Q KI  +   Q+ +  GE  R+QGLS+AAWPTL
Sbjct: 198 RQLQATQFQELKLHSMMKQRQGSGFLGQEKIGYQQMVQSVRKNGE--RSQGLSVAAWPTL 255

Query: 486 XXXXXXXXXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPD 665
                     PGSGMRA FLG+ G KKER GTGVFLPRR+  + T T KKPGCST L+PD
Sbjct: 256 PKSQQKRELQPGSGMRAVFLGDRGTKKERSGTGVFLPRRFEPA-TGTRKKPGCSTFLIPD 314

Query: 666 RVVHALNLNMDSVDAQ 713
           +V  ALNLN++S++ Q
Sbjct: 315 QVAQALNLNLESMNTQ 330


>gb|OMO97344.1| hypothetical protein COLO4_14683 [Corchorus olitorius]
          Length = 402

 Score =  169 bits (428), Expect = 1e-46
 Identities = 119/265 (44%), Positives = 141/265 (53%), Gaps = 19/265 (7%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVS 167
           KGW LSGSPQSTLC       CK GS + SPN  SRVSSPP       WDLLYAAAGEV+
Sbjct: 116 KGWVLSGSPQSTLCGLGNGCGCKQGSSRGSPNCQSRVSSPPGT-----WDLLYAAAGEVA 170

Query: 168 RMRMIEETTP-FHSTKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXX 344
           RMRM EE+   F++  L  P  + +   L V      + +YP   Q+ +H   L      
Sbjct: 171 RMRMNEESYGGFNNRGLLGPPARKASPNLDV------SSYYPPH-QSFSHHQKLQATQFQ 223

Query: 345 XXXXXXXXXXRRGVREQGKIENQFQNG----KMTGEAGRTQ------GLSMAAWPTLXXX 494
                     ++     G  + Q QN     +  G  G +       GLS +AWP L   
Sbjct: 224 QLKQQQQLMKQQNASVWGGQKQQHQNNHHVVQNRGRNGNSNSNRPSLGLSPSAWPPLQQQ 283

Query: 495 XXXXXXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVV 674
                   GSGMRA FLG    K+E  GTGVFLPRR GA PTET KK GCSTVLLP RVV
Sbjct: 284 QQQTQSQNGSGMRAVFLGNPAGKRECAGTGVFLPRRVGA-PTETRKKTGCSTVLLPARVV 342

Query: 675 HALNLNMDSVDAQSQ--HRGNRNFT 743
            ALNLN+D ++AQ Q   R N +FT
Sbjct: 343 QALNLNLDEINAQPQLHPRFNASFT 367


>ref|XP_016490474.1| PREDICTED: uncharacterized protein LOC107810232 [Nicotiana tabacum]
          Length = 384

 Score =  168 bits (425), Expect = 2e-46
 Identities = 109/253 (43%), Positives = 141/253 (55%), Gaps = 11/253 (4%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPD-AKDVVGWDLLYAAAGEVSRMRMI 182
           KGW LS SPQSTLC CK GS + SPNG S+ SSPP   +  V  DLLYAAAGEV+R+RM+
Sbjct: 94  KGWGLSRSPQSTLCGCKQGSSRGSPNGPSQASSPPRMTRPEVSMDLLYAAAGEVARIRMM 153

Query: 183 EETTPFHSTK------LFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXX 344
           EE+T  ++          AP  K SPV +  +   P  G +      Q  ++Y       
Sbjct: 154 EESTALYNHNKAGGGIWAAPPRKTSPVPVGPKNTKPNLGSF---NSNQPSLSYQQLQVAQ 210

Query: 345 XXXXXXXXXXRRGVREQGKI-ENQ--FQNGKMTGEAGRTQGLSMAAWPTL-XXXXXXXXX 512
                     ++G   Q  + +NQ   +N    G  G    L  ++WPTL          
Sbjct: 211 FQRLKQQQMMKQGGYRQFPVNQNQQVAENRARNGVNGSLMNLPNSSWPTLQQSQSQQQQP 270

Query: 513 XPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLN 692
             GSGMRA FLG  G K+E +GTGVFLPRR G + TET KKPGC  V+LPDRVV ALNLN
Sbjct: 271 NTGSGMRAVFLGNPGPKRECVGTGVFLPRRIG-TQTETRKKPGC-PVILPDRVVQALNLN 328

Query: 693 MDSVDAQSQHRGN 731
           ++++DA ++ + N
Sbjct: 329 LETMDAATRPQSN 341


>gb|EOY07362.1| Uncharacterized protein TCM_021816 [Theobroma cacao]
          Length = 437

 Score =  169 bits (428), Expect = 2e-46
 Identities = 121/261 (46%), Positives = 138/261 (52%), Gaps = 15/261 (5%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVS 167
           KGW LS SPQSTLCA      CK GS + SPN  SRVSSPP       WDLLYAAAGEV+
Sbjct: 118 KGWILSRSPQSTLCAVGSGCGCKQGSSRGSPNSQSRVSSPPGT-----WDLLYAAAGEVA 172

Query: 168 RMRMIEETTP-FHSTKLFAP-APKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXX 341
           RMRM EE+   F++  L  P A KPSP            G+YP   Q+ +H         
Sbjct: 173 RMRMNEESYGGFNNRSLLGPPARKPSP-------NLDVPGYYPPH-QSLSHQKLQATQFQ 224

Query: 342 XXXXXXXXXXXRRGVREQGKIENQF-----QNGKMTGEAGRTQGLSMAAWPTLXXXXXXX 506
                         V    K ++Q      QN    G + R  GLS +AWP L       
Sbjct: 225 QLKQEQLMKQQNASVWGGQKQQHQHHHHVVQNRGRNGNSNRPLGLSPSAWPPLQQQQQPQ 284

Query: 507 XXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALN 686
               GSGMRA FLG    K+E  GTGVFLPRR G +P ET KKP CSTVLLP RVV ALN
Sbjct: 285 TQN-GSGMRAVFLGNPTAKRECAGTGVFLPRRIG-TPAETRKKPACSTVLLPARVVQALN 342

Query: 687 LNMDSVDAQSQ--HRGNRNFT 743
           LN+D + AQ Q   R N +FT
Sbjct: 343 LNLDEIGAQPQLHPRFNASFT 363


>ref|XP_017977724.1| PREDICTED: uncharacterized protein LOC18597650 [Theobroma cacao]
          Length = 405

 Score =  168 bits (425), Expect = 3e-46
 Identities = 121/261 (46%), Positives = 138/261 (52%), Gaps = 15/261 (5%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCA------CKPGSIQTSPNGVSRVSSPPDAKDVVGWDLLYAAAGEVS 167
           KGW LS SPQSTLCA      CK GS + SPN  SRVSSPP       WDLLYAAAGEV+
Sbjct: 118 KGWILSRSPQSTLCAVGSGCGCKQGSSRGSPNSQSRVSSPPGT-----WDLLYAAAGEVA 172

Query: 168 RMRMIEETTP-FHSTKLFAP-APKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXX 341
           RMRM EE+   F++  L  P A KPSP            G+YP   Q+ +H         
Sbjct: 173 RMRMNEESYGGFNNRSLLGPPARKPSP-------NLDVPGYYPPH-QSLSHQKLQATQFQ 224

Query: 342 XXXXXXXXXXXRRGVREQGKIENQF-----QNGKMTGEAGRTQGLSMAAWPTLXXXXXXX 506
                         V    K ++Q      QN    G + R  GLS +AWP L       
Sbjct: 225 QLKQQQLMKQQNASVWGGQKQQHQHHHHVVQNRGRNGNSNRPLGLSPSAWPPLQQQQQPQ 284

Query: 507 XXXPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALN 686
               GSGMRA FLG    K+E  GTGVFLPRR G +P ET KKP CSTVLLP RVV ALN
Sbjct: 285 TQN-GSGMRAVFLGNPTAKRECAGTGVFLPRRIG-TPAETCKKPACSTVLLPARVVQALN 342

Query: 687 LNMDSVDAQSQ--HRGNRNFT 743
           LN+D + AQ Q   R N +FT
Sbjct: 343 LNLDEIGAQPQLHPRFNASFT 363


>ref|XP_009596622.1| PREDICTED: uncharacterized protein LOC104092672 isoform X1
           [Nicotiana tomentosiformis]
 ref|XP_016490812.1| PREDICTED: uncharacterized protein LOC107810532 [Nicotiana tabacum]
          Length = 383

 Score =  166 bits (421), Expect = 7e-46
 Identities = 111/253 (43%), Positives = 141/253 (55%), Gaps = 11/253 (4%)
 Frame = +3

Query: 6   KGWKLSGSPQSTLCACKPGSIQTSPNGVSRVSSPPD-AKDVVGWDLLYAAAGEVSRMRMI 182
           KGW LS SPQSTLC CK GS + SPNG S+ SSPP  A+  V  DLLYAAAGEV+R+RM+
Sbjct: 94  KGWGLSRSPQSTLCGCKQGSSRGSPNGPSQASSPPRMARPEVSMDLLYAAAGEVARIRMM 153

Query: 183 EETTPFHS------TKLFAPAPKPSPVTLPVEKQTPTTGFYPTQTQTQAHIAYLXXXXXX 344
           EE+T  ++          AP  K SPV +  +   P  G +      Q  ++Y       
Sbjct: 154 EESTGLYNHIKAGGGIWAAPPRKTSPVPVGPKNTKPNPGSF---NSNQPPLSYQQLQVAQ 210

Query: 345 XXXXXXXXXXRRGVREQGKI-ENQFQ---NGKMTGEAGRTQGLSMAAWPTLXXXXXXXXX 512
                     ++G   Q  + +NQ Q   N    G  G    L  +AWPTL         
Sbjct: 211 FQRLKQQQMMKQGGYRQFPVNQNQQQVAENRAGNGVKGSLMNLPNSAWPTL-QQSQQQQP 269

Query: 513 XPGSGMRAFFLGETGVKKERIGTGVFLPRRYGASPTETHKKPGCSTVLLPDRVVHALNLN 692
             GSGMRA FLG  G K+E  GTGVFLPRR G + TET KKPGC  V+LPDRVV ALNLN
Sbjct: 270 NTGSGMRAVFLGNPGPKRECAGTGVFLPRRIG-TQTETRKKPGC-PVILPDRVVQALNLN 327

Query: 693 MDSVDAQSQHRGN 731
           +++++A ++ + N
Sbjct: 328 LEAMEAATRPKSN 340


Top