BLASTX nr result

ID: Rehmannia32_contig00020478 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia32_contig00020478
         (1738 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011091419.1| uncharacterized protein LOC105171867 [Sesamu...   500   e-171
gb|PIN01885.1| hypothetical protein CDL12_25602 [Handroanthus im...   483   e-165
ref|XP_022852835.1| protein disulfide isomerase pTAC5, chloropla...   426   e-142
ref|XP_022852834.1| protein disulfide isomerase pTAC5, chloropla...   420   e-139
ref|XP_012844590.1| PREDICTED: uncharacterized protein LOC105964...   412   e-137
ref|XP_022852836.1| protein disulfide isomerase pTAC5, chloropla...   407   e-135
gb|KZV18808.1| hypothetical protein F511_25735 [Dorcoceras hygro...   405   e-134
ref|XP_019156893.1| PREDICTED: uncharacterized protein LOC109153...   374   e-122
emb|CDO99443.1| unnamed protein product [Coffea canephora]            367   e-119
ref|XP_010241120.1| PREDICTED: uncharacterized protein LOC104585...   355   e-114
ref|XP_018837990.1| PREDICTED: uncharacterized protein LOC109004...   354   e-114
ref|XP_009601807.1| PREDICTED: uncharacterized protein LOC104097...   350   e-112
ref|XP_007034404.2| PREDICTED: golgin subfamily A member 6-like ...   348   e-112
gb|EOY05330.1| Plastid transcriptionally active 5, putative isof...   347   e-111
gb|PHT39288.1| hypothetical protein CQW23_22861 [Capsicum baccatum]   346   e-110
ref|XP_016452615.1| PREDICTED: uncharacterized protein LOC107777...   346   e-110
gb|PHU21429.1| hypothetical protein BC332_06536 [Capsicum chinense]   345   e-110
ref|XP_017617412.1| PREDICTED: golgin subfamily A member 6-like ...   345   e-110
ref|XP_012456811.1| PREDICTED: putative golgin subfamily A membe...   345   e-110
ref|XP_016566118.1| PREDICTED: uncharacterized protein LOC107864...   344   e-110

>ref|XP_011091419.1| uncharacterized protein LOC105171867 [Sesamum indicum]
          Length = 387

 Score =  500 bits (1288), Expect = e-171
 Identities = 249/364 (68%), Positives = 285/364 (78%), Gaps = 3/364 (0%)
 Frame = +2

Query: 167  KPHFLAVSFRPLSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAER 346
            KPH LA S RPLSKSH+CF F+ SN+NPIG                          NAER
Sbjct: 19   KPHILAFSLRPLSKSHVCFTFSSSNENPIGEEARWLREEQRWLREEQRWLREESRWNAER 78

Query: 347  QTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTS 526
            Q LL EI++LKLR+QELERLNSLQGASVSETVA+IAKLLQV+K+ DLGKN NRIADSGT+
Sbjct: 79   QALLQEINTLKLRIQELERLNSLQGASVSETVASIAKLLQVVKEADLGKNVNRIADSGTT 138

Query: 527  AVPLXXXXXXXXXXXXXXXXXSIPDKKEILKTRATLRKGSEGDQVREMQEALQKLGFYSG 706
            AVPL                 SIP KKE ++ RATLR GSEGD+VR MQEALQKLGFYSG
Sbjct: 139  AVPLVVEAAKEEEEVVIKEVISIPHKKEAVQKRATLRIGSEGDEVRVMQEALQKLGFYSG 198

Query: 707  EEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHKSRGPD 886
            EEDIE+SSFSSGTERAVKTWQATLGA EDGIMT +LLE LFG+SG G+T NQEH+  GP 
Sbjct: 199  EEDIEYSSFSSGTERAVKTWQATLGAPEDGIMTGDLLERLFGNSGLGMTNNQEHEGIGPK 258

Query: 887  KIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSGSSKKTI- 1063
            K  NG PV +I+++SKV+QTVVT+EGV++V  SQHRVFLLGENRWEEPSRLSG+SK+T+ 
Sbjct: 259  KSTNGVPVGAITDISKVEQTVVTEEGVSKVEASQHRVFLLGENRWEEPSRLSGNSKRTVT 318

Query: 1064 --KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVGYSTCDV 1237
               S N  TKCL+CRGEGRLLCMECDGTGEPN+EPQF+EWVDEG+KCPYC+G+GY TCDV
Sbjct: 319  KSTSANVATKCLSCRGEGRLLCMECDGTGEPNVEPQFLEWVDEGSKCPYCEGLGYVTCDV 378

Query: 1238 CGGK 1249
            CGG+
Sbjct: 379  CGGR 382


>gb|PIN01885.1| hypothetical protein CDL12_25602 [Handroanthus impetiginosus]
          Length = 387

 Score =  483 bits (1244), Expect = e-165
 Identities = 254/368 (69%), Positives = 279/368 (75%), Gaps = 6/368 (1%)
 Frame = +2

Query: 164  HKPHFLAVSFRPLSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAE 343
            HKPHFL  SFR  SKSHICFAF+PSNQNPIG                           AE
Sbjct: 18   HKPHFLTDSFRASSKSHICFAFSPSNQNPIGDEARWLREEQRWLREEQRWLREESRWKAE 77

Query: 344  RQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGT 523
            RQ LL EISSLKLR+QELERLNSLQGASVSETVANIAKLLQVLK+GD GK  NRIADS +
Sbjct: 78   RQALLLEISSLKLRIQELERLNSLQGASVSETVANIAKLLQVLKEGDFGKKVNRIADSVS 137

Query: 524  SAVPLXXXXXXXXXXXXXXXXX---SIPDKKEILKTRATLRKGSEGDQVREMQEALQKLG 694
            SAVPL                    SIPDKKE +K R TLR GSEGD+VR MQEALQKLG
Sbjct: 138  SAVPLEVEAAKDEEEEEEVVVKEVISIPDKKETVKRRTTLRTGSEGDEVRVMQEALQKLG 197

Query: 695  FYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHKS 874
            FYSGEED+E+SSFSSGTERAVKTWQATLG  E+GIMTAELLE LFGSS   + K QE K+
Sbjct: 198  FYSGEEDMEYSSFSSGTERAVKTWQATLGVPENGIMTAELLERLFGSS---VAKTQEPKA 254

Query: 875  RGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSGSSK 1054
               +K ANGA V SI+++S+V+QT+VT+EGV E   SQHRVFLLGENRWEEPSRLSG SK
Sbjct: 255  TDQEKSANGAVVTSIADISEVKQTIVTEEGVTEFEASQHRVFLLGENRWEEPSRLSGGSK 314

Query: 1055 KTI---KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVGYS 1225
            +T     +GNATT CL+CRGEGRLLCMECDGTGEPNIEPQF+EWVD+GAKCPYC+GVGY 
Sbjct: 315  RTTAKSTTGNATTTCLSCRGEGRLLCMECDGTGEPNIEPQFLEWVDDGAKCPYCEGVGYI 374

Query: 1226 TCDVCGGK 1249
            TCDVCGGK
Sbjct: 375  TCDVCGGK 382


>ref|XP_022852835.1| protein disulfide isomerase pTAC5, chloroplastic-like isoform X2
            [Olea europaea var. sylvestris]
          Length = 394

 Score =  426 bits (1094), Expect = e-142
 Identities = 228/374 (60%), Positives = 261/374 (69%), Gaps = 13/374 (3%)
 Frame = +2

Query: 164  HKPHFLA-------VSFRPLSKSHICFAFTPS---NQNPIGXXXXXXXXXXXXXXXXXXX 313
            HKP FL+       +  RP SKSHICF+  PS   N N                      
Sbjct: 17   HKPQFLSHFATSLSLPSRPFSKSHICFSIPPSSNSNSNSPQEEARWLREEQRWLREEQRW 76

Query: 314  XXXXXXXNAERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGK 493
                   NAER+TLLHEI SLKLR+QELERLNSLQGAS SETVANIAKLLQVLK+GDLGK
Sbjct: 77   IREESRWNAERETLLHEIHSLKLRIQELERLNSLQGASASETVANIAKLLQVLKEGDLGK 136

Query: 494  NTNRIADSGTSAVPLXXXXXXXXXXXXXXXXXSIPDKKEILKTRATLRKGSEGDQVREMQ 673
            N NR+A+SG+ AVPL                  + D++E ++ R TLRKGSEGD+VR MQ
Sbjct: 137  NVNRLAESGSIAVPLVIDAAKRGEEVIVKEVIKVADREENVRKRETLRKGSEGDEVRVMQ 196

Query: 674  EALQKLGFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLT 853
            EALQKLGFYSGEED+E+SSFSSGTERAVKTWQATLGA EDGIMT+ELLE L       + 
Sbjct: 197  EALQKLGFYSGEEDMEYSSFSSGTERAVKTWQATLGAPEDGIMTSELLERL--CIEQKVE 254

Query: 854  KNQEHKSRGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPS 1033
             +Q  K +  +K  NGA  AS + +S+VQQTV  ++G  EV VS HRVFLLGENRWEEPS
Sbjct: 255  SSQGLKPKLLEKSENGALTASTTAISEVQQTVSKEDGTTEVEVSHHRVFLLGENRWEEPS 314

Query: 1034 RLSGSSKKTI---KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPY 1204
            RLSG S +T      GN  TKCLTCRGEGRLLCMECDG+GEPNIEPQF+EWVDEG KCPY
Sbjct: 315  RLSGKSNQTAVKNDKGNTKTKCLTCRGEGRLLCMECDGSGEPNIEPQFLEWVDEGTKCPY 374

Query: 1205 CDGVGYSTCDVCGG 1246
            C+G+GY+TCDVC G
Sbjct: 375  CEGLGYTTCDVCAG 388


>ref|XP_022852834.1| protein disulfide isomerase pTAC5, chloroplastic-like isoform X1
            [Olea europaea var. sylvestris]
          Length = 398

 Score =  420 bits (1079), Expect = e-139
 Identities = 228/378 (60%), Positives = 261/378 (69%), Gaps = 17/378 (4%)
 Frame = +2

Query: 164  HKPHFLA-------VSFRPLSKSHICFAFTPS---NQNPIGXXXXXXXXXXXXXXXXXXX 313
            HKP FL+       +  RP SKSHICF+  PS   N N                      
Sbjct: 17   HKPQFLSHFATSLSLPSRPFSKSHICFSIPPSSNSNSNSPQEEARWLREEQRWLREEQRW 76

Query: 314  XXXXXXXNAERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQV----LKDG 481
                   NAER+TLLHEI SLKLR+QELERLNSLQGAS SETVANIAKLLQV    LK+G
Sbjct: 77   IREESRWNAERETLLHEIHSLKLRIQELERLNSLQGASASETVANIAKLLQVYIEVLKEG 136

Query: 482  DLGKNTNRIADSGTSAVPLXXXXXXXXXXXXXXXXXSIPDKKEILKTRATLRKGSEGDQV 661
            DLGKN NR+A+SG+ AVPL                  + D++E ++ R TLRKGSEGD+V
Sbjct: 137  DLGKNVNRLAESGSIAVPLVIDAAKRGEEVIVKEVIKVADREENVRKRETLRKGSEGDEV 196

Query: 662  REMQEALQKLGFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSG 841
            R MQEALQKLGFYSGEED+E+SSFSSGTERAVKTWQATLGA EDGIMT+ELLE L     
Sbjct: 197  RVMQEALQKLGFYSGEEDMEYSSFSSGTERAVKTWQATLGAPEDGIMTSELLERL--CIE 254

Query: 842  SGLTKNQEHKSRGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRW 1021
              +  +Q  K +  +K  NGA  AS + +S+VQQTV  ++G  EV VS HRVFLLGENRW
Sbjct: 255  QKVESSQGLKPKLLEKSENGALTASTTAISEVQQTVSKEDGTTEVEVSHHRVFLLGENRW 314

Query: 1022 EEPSRLSGSSKKTI---KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGA 1192
            EEPSRLSG S +T      GN  TKCLTCRGEGRLLCMECDG+GEPNIEPQF+EWVDEG 
Sbjct: 315  EEPSRLSGKSNQTAVKNDKGNTKTKCLTCRGEGRLLCMECDGSGEPNIEPQFLEWVDEGT 374

Query: 1193 KCPYCDGVGYSTCDVCGG 1246
            KCPYC+G+GY+TCDVC G
Sbjct: 375  KCPYCEGLGYTTCDVCAG 392


>ref|XP_012844590.1| PREDICTED: uncharacterized protein LOC105964637 [Erythranthe guttata]
 gb|EYU31404.1| hypothetical protein MIMGU_mgv1a008663mg [Erythranthe guttata]
          Length = 366

 Score =  412 bits (1059), Expect = e-137
 Identities = 223/368 (60%), Positives = 262/368 (71%), Gaps = 7/368 (1%)
 Frame = +2

Query: 167  KPHFLAVSFRPLSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAER 346
            KP FLAVS+RP SKSH CF+F+PS+QNPIG                          NAER
Sbjct: 19   KPQFLAVSYRPFSKSHTCFSFSPSDQNPIGEEARWLREEQRWLREEQRWLREESRWNAER 78

Query: 347  QTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTS 526
            Q LL          QE++RLNSLQG  VSETVA IAKL+QVLK+GD  K+ NRIADSGTS
Sbjct: 79   QILL----------QEIQRLNSLQGLPVSETVATIAKLMQVLKEGDSAKSVNRIADSGTS 128

Query: 527  AVPLXXXXXXXXXXXXXXXXX---SIPDKKEILKTRATLRKGSEGDQVREMQEALQKLGF 697
            AVPL                    +IPDKK+I+K RATLR GSEGD+VRE+Q+ALQKLGF
Sbjct: 129  AVPLPLMFESAKMEEEEVVVKEVINIPDKKKIVKKRATLRVGSEGDEVRELQDALQKLGF 188

Query: 698  YSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHKSR 877
            YSGEED+EFSSF+SGT RAVKTWQA++G  EDG MTA+LLE LFG+SG+     +E++S 
Sbjct: 189  YSGEEDVEFSSFASGTARAVKTWQASVGVPEDGTMTAQLLEMLFGNSGT-----EEYEST 243

Query: 878  GPDKIANGAP-VASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSGSSK 1054
             P+K ANGAP  AS++EVS+V+ +VV +       VS +RVFLLGENRWE+PSRLSGSS+
Sbjct: 244  DPEKSANGAPFTASVTEVSEVKPSVVGE------YVSDNRVFLLGENRWEDPSRLSGSSQ 297

Query: 1055 KTI---KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVGYS 1225
             T    KSGNA   CL+CRGEGRLLCMECDGTGEPNIE QFMEWVDEGAKCPYC G+G+ 
Sbjct: 298  NTTNKSKSGNAKNNCLSCRGEGRLLCMECDGTGEPNIEEQFMEWVDEGAKCPYCIGLGFV 357

Query: 1226 TCDVCGGK 1249
             CD+CG K
Sbjct: 358  ACDLCGVK 365


>ref|XP_022852836.1| protein disulfide isomerase pTAC5, chloroplastic-like isoform X3
            [Olea europaea var. sylvestris]
          Length = 394

 Score =  407 bits (1046), Expect = e-135
 Identities = 224/378 (59%), Positives = 257/378 (67%), Gaps = 17/378 (4%)
 Frame = +2

Query: 164  HKPHFLA-------VSFRPLSKSHICFAFTPS---NQNPIGXXXXXXXXXXXXXXXXXXX 313
            HKP FL+       +  RP SKSHICF+  PS   N N                      
Sbjct: 17   HKPQFLSHFATSLSLPSRPFSKSHICFSIPPSSNSNSNSPQEEARWLREEQRWLREEQRW 76

Query: 314  XXXXXXXNAERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQV----LKDG 481
                   NAER+TLLHEI SLKLR+QELERLNSLQGAS SETVANIAKLLQV    LK+G
Sbjct: 77   IREESRWNAERETLLHEIHSLKLRIQELERLNSLQGASASETVANIAKLLQVYIEVLKEG 136

Query: 482  DLGKNTNRIADSGTSAVPLXXXXXXXXXXXXXXXXXSIPDKKEILKTRATLRKGSEGDQV 661
            DLGKN NR+A+SG+ AVPL                  + D++E ++ R TLRKGSEGD+V
Sbjct: 137  DLGKNVNRLAESGSIAVPLVIDAAKRGEEVIVKEVIKVADREENVRKRETLRKGSEGDEV 196

Query: 662  REMQEALQKLGFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSG 841
            R MQ    KLGFYSGEED+E+SSFSSGTERAVKTWQATLGA EDGIMT+ELLE L     
Sbjct: 197  RVMQ----KLGFYSGEEDMEYSSFSSGTERAVKTWQATLGAPEDGIMTSELLERL--CIE 250

Query: 842  SGLTKNQEHKSRGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRW 1021
              +  +Q  K +  +K  NGA  AS + +S+VQQTV  ++G  EV VS HRVFLLGENRW
Sbjct: 251  QKVESSQGLKPKLLEKSENGALTASTTAISEVQQTVSKEDGTTEVEVSHHRVFLLGENRW 310

Query: 1022 EEPSRLSGSSKKTI---KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGA 1192
            EEPSRLSG S +T      GN  TKCLTCRGEGRLLCMECDG+GEPNIEPQF+EWVDEG 
Sbjct: 311  EEPSRLSGKSNQTAVKNDKGNTKTKCLTCRGEGRLLCMECDGSGEPNIEPQFLEWVDEGT 370

Query: 1193 KCPYCDGVGYSTCDVCGG 1246
            KCPYC+G+GY+TCDVC G
Sbjct: 371  KCPYCEGLGYTTCDVCAG 388


>gb|KZV18808.1| hypothetical protein F511_25735 [Dorcoceras hygrometricum]
          Length = 389

 Score =  405 bits (1040), Expect = e-134
 Identities = 220/374 (58%), Positives = 265/374 (70%), Gaps = 13/374 (3%)
 Frame = +2

Query: 164  HKPHFLAVSFRPLSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNA- 340
            H     ++S +   KSH+CF+F+PSN++PIG                             
Sbjct: 17   HNSQHSSISSKFWPKSHVCFSFSPSNESPIGYGPQEETRWLREEQRWLREEQRWLREELR 76

Query: 341  ---ERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIA 511
               ERQ L+ EI+SL+LRLQELE+LNSL  +SVS+TVANIA LLQVLKDGD GK  NRI 
Sbjct: 77   WKEERQRLVQEINSLRLRLQELEKLNSLPASSVSDTVANIANLLQVLKDGDSGKIVNRIT 136

Query: 512  DSGTSAVPLXXXXXXXXXXXXXXXXX-SIPDKKEILKTRATLRKGSEGDQVREMQEALQK 688
            +SG+SAVPL                  S+ D KE +K R TLR GSEG++V+ MQEALQK
Sbjct: 137  ESGSSAVPLVVEASIKNEEEDVVKEIISVSDGKETVKRRVTLRSGSEGEEVQAMQEALQK 196

Query: 689  LGFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETL-----FGSSGSGLT 853
            LGFYSGEEDIE+SSFS+GTERAVKTWQ+++GAREDGIMTAELLE L     F SSG G T
Sbjct: 197  LGFYSGEEDIEYSSFSTGTERAVKTWQSSIGAREDGIMTAELLEQLYKEQKFRSSGLGAT 256

Query: 854  KNQEHKSRGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPS 1033
            K +E+KS GP K  NGAP++S + +S+ Q+  V   G +EV+ S HRVFLLGENRWEEPS
Sbjct: 257  KIKENKSSGPKKTENGAPISSTTSISENQKREV---GASEVSNSPHRVFLLGENRWEEPS 313

Query: 1034 RLSGSSKKTIKSG--NAT-TKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPY 1204
            RLSGSSK+ +K G  NAT  KCL+C GEGR+LCMECDGTGEPNIEPQF+EWVDEG KCPY
Sbjct: 314  RLSGSSKR-MKPGSINATVVKCLSCHGEGRVLCMECDGTGEPNIEPQFLEWVDEGTKCPY 372

Query: 1205 CDGVGYSTCDVCGG 1246
            C+G+G++TCDVC G
Sbjct: 373  CEGLGFTTCDVCCG 386


>ref|XP_019156893.1| PREDICTED: uncharacterized protein LOC109153479 [Ipomoea nil]
          Length = 408

 Score =  374 bits (961), Expect = e-122
 Identities = 210/376 (55%), Positives = 245/376 (65%), Gaps = 15/376 (3%)
 Frame = +2

Query: 170  PHFLAVSFRPLSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXX----- 334
            P F   S   LSKSHICFA  PSN                                    
Sbjct: 30   PFFFKPSLEILSKSHICFALPPSNSGDSSGFEQRAEVRWLREEQRWLREEQRWLREERRW 89

Query: 335  NAERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIAD 514
            NAER+ LL EI +L+L++QEL+  + LQ ASVSET ANI KLLQVLK+GDL  N NRIA+
Sbjct: 90   NAEREALLREIQALQLQIQELKSRSPLQEASVSETFANIVKLLQVLKEGDLVNNVNRIAE 149

Query: 515  SGTSAVPLXXXXXXXXXXXXXXXXXSIPDKKE-ILKTRATLRKGSEGDQVREMQEALQKL 691
            SG+ AVPL                    +++   +K R TLRKGSEGD VR MQEAL KL
Sbjct: 150  SGSIAVPLVMEATKEEEEVVIKEIVKEVEREAGEVKKRKTLRKGSEGDDVRFMQEALLKL 209

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETL-----FGSSGSGLTK 856
            GFY GEED+E+SSFSSGTERAVKTWQA+ GA EDGIM++ELLE L     F  S   L K
Sbjct: 210  GFYCGEEDMEYSSFSSGTERAVKTWQASFGATEDGIMSSELLERLYMEQTFEKSDLRLNK 269

Query: 857  NQE-HKSRGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPS 1033
            N E H    P+  ANGAPVASI E+S+ +Q VV  EG  E +++QHRVFLLGENRWEEPS
Sbjct: 270  NPEQHDVNSPEARANGAPVASIMEISEFEQKVVK-EGDTETDITQHRVFLLGENRWEEPS 328

Query: 1034 RLSGSSKKTI---KSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPY 1204
            RL G+ +       SG ATTKCL+CRGEGRLLCMECDG+GEPNIEPQF+EWVDE  KCPY
Sbjct: 329  RLKGNMQPAATKSSSGKATTKCLSCRGEGRLLCMECDGSGEPNIEPQFLEWVDEDTKCPY 388

Query: 1205 CDGVGYSTCDVCGGKQ 1252
            C+G+G+ TCDVC GK+
Sbjct: 389  CEGLGFVTCDVCEGKK 404


>emb|CDO99443.1| unnamed protein product [Coffea canephora]
          Length = 405

 Score =  367 bits (943), Expect = e-119
 Identities = 187/315 (59%), Positives = 235/315 (74%), Gaps = 10/315 (3%)
 Frame = +2

Query: 338  AERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADS 517
            AER  LL+EI  L LR+QELE+LNS++  S+SET+ANIA LLQVLK+G LGK+ N+I +S
Sbjct: 87   AERLALLNEIQRLNLRVQELEQLNSVRETSLSETIANIATLLQVLKEGGLGKSINKIPES 146

Query: 518  GTSAVPLXXXXXXXXXXXXXXXXXSIPDKKEILKTRATLRKGSEGDQVREMQEALQKLGF 697
             + A+P                   +P+K+     RATLR GSEGD V+ MQEAL KLGF
Sbjct: 147  RSGALPFGVETAQKVEEMFVKEVGEVPEKENNGNKRATLRMGSEGDDVQAMQEALLKLGF 206

Query: 698  YSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTK------N 859
            YSGEED+EFSSFS+GTERAVKTWQA++GA EDGIMTAEL+E L+     G          
Sbjct: 207  YSGEEDMEFSSFSTGTERAVKTWQASIGAPEDGIMTAELVERLYIEQNGGTPSFTGGKGP 266

Query: 860  QEHKSRGPD-KIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSR 1036
            Q+  +  P+ K ANGA  ASI+E+S+V++TVV ++GV  +N+S+HRVFLLGENRWEEPSR
Sbjct: 267  QDSSTTNPEEKGANGAATASITEISEVKETVVGEDGVTGINMSEHRVFLLGENRWEEPSR 326

Query: 1037 LSGSSKKT---IKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYC 1207
            L+G +K+    I SG   TKC +CRGEGRLLCMECDGTGEPN+EPQF+EWVDEGAKCPYC
Sbjct: 327  LAGRNKQPETKIGSGKTMTKCPSCRGEGRLLCMECDGTGEPNVEPQFLEWVDEGAKCPYC 386

Query: 1208 DGVGYSTCDVCGGKQ 1252
            +G+G++TCDVC G++
Sbjct: 387  EGLGHTTCDVCEGRK 401


>ref|XP_010241120.1| PREDICTED: uncharacterized protein LOC104585818 [Nelumbo nucifera]
          Length = 415

 Score =  355 bits (912), Expect = e-114
 Identities = 191/326 (58%), Positives = 230/326 (70%), Gaps = 21/326 (6%)
 Frame = +2

Query: 335  NAERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGK-NTNRIA 511
            N+ER+ LL EIS LKL+++ LER NSLQGAS+SETVANIA LLQ LKD +L     +RIA
Sbjct: 89   NSEREALLREISELKLQIEALERQNSLQGASMSETVANIASLLQTLKDAELNSPKQHRIA 148

Query: 512  DSGTSAVPLXXXXXXXXXXXXXXXXXSI---------PDKKEILKTRATLRKGSEGDQVR 664
            +SG+   P+                  +          +K+E +K   +LR GSEG+ V+
Sbjct: 149  ESGSGPTPMLLGLDSEKVKETVVEEVRVLESSKENKEEEKQEKVKEVRSLRMGSEGEDVQ 208

Query: 665  EMQEALQKLGFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGS 844
             MQEAL  LGFYSGEED+EFSSFSSGTERAVKTWQA++GAREDGIMTAELL+ L+   GS
Sbjct: 209  AMQEALLILGFYSGEEDMEFSSFSSGTERAVKTWQASIGAREDGIMTAELLQRLYMKQGS 268

Query: 845  GLTKNQEHKSRG--------PDKI-ANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRV 997
               KN+  K+          PDK  ANGA V S++E+S++Q+TVV ++G  E+ VSQ RV
Sbjct: 269  ---KNESLKTNAYQKGDVVVPDKEGANGAAVVSVTEISEIQETVVKEDGATEIEVSQQRV 325

Query: 998  FLLGENRWEEPSRLSGSSKKT--IKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFM 1171
            FLLGENRWEEPSRL G  KK    K  + TTKCL CRGEG LLC ECDGTGEPNIEPQF+
Sbjct: 326  FLLGENRWEEPSRLRGRDKKIGGDKPESPTTKCLACRGEGHLLCTECDGTGEPNIEPQFL 385

Query: 1172 EWVDEGAKCPYCDGVGYSTCDVCGGK 1249
            EWVDEGAKCPYC+G+GY+ CDVC GK
Sbjct: 386  EWVDEGAKCPYCEGLGYTVCDVCEGK 411


>ref|XP_018837990.1| PREDICTED: uncharacterized protein LOC109004053 [Juglans regia]
 ref|XP_018837991.1| PREDICTED: uncharacterized protein LOC109004053 [Juglans regia]
          Length = 394

 Score =  354 bits (908), Expect = e-114
 Identities = 193/317 (60%), Positives = 229/317 (72%), Gaps = 13/317 (4%)
 Frame = +2

Query: 341  ERQTLLHEISSLKLRLQELERLNSL--QGASVSETVANIAKLLQVLKDGDLGKNTNRIAD 514
            ER +LL +I+ LKLR+Q LE  NS   +GASVSET+++IA LLQVLK+  L      IA+
Sbjct: 82   ERDSLLGQIAELKLRIQHLEHQNSTLGEGASVSETISSIAGLLQVLKEKGL------IAE 135

Query: 515  SGTSAVPLXXXXXXXXXXXXXXXXXSIPDKKEIL--KTRATLRKGSEGDQVREMQEALQK 688
            S +SA P+                  +    E +  K R TLRKGSEGD+VR +Q ALQK
Sbjct: 136  SSSSASPMVLLEEDLKEKEVVVVDKEVVRVLEDVAKKRRKTLRKGSEGDEVRALQVALQK 195

Query: 689  LGFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLF-------GSSGSG 847
            LGFYSGEED+EFSSFSSGTERAVKTWQ+TLGA EDGIMTAELLE LF         S + 
Sbjct: 196  LGFYSGEEDMEFSSFSSGTERAVKTWQSTLGATEDGIMTAELLERLFMEQQIVGARSNTD 255

Query: 848  LTKNQEHKSRGPDKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEE 1027
              +   + S  P   ANGAPVA+I+EVS+ QQ VV +E V EV VSQHRVFLLGENRWEE
Sbjct: 256  ADQKGSNVSVSPKVGANGAPVAAITEVSEFQQKVVNEESVTEVEVSQHRVFLLGENRWEE 315

Query: 1028 PSRLSGSSKKT--IKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCP 1201
            PSR++G +K+    K+ +ATTKCLTCRGEGRLLC ECDGTGEPNIEPQF+EWVDEG KCP
Sbjct: 316  PSRIAGRNKQVGNSKTKDATTKCLTCRGEGRLLCTECDGTGEPNIEPQFLEWVDEGMKCP 375

Query: 1202 YCDGVGYSTCDVCGGKQ 1252
            YC+GVG++ CDVCGGK+
Sbjct: 376  YCEGVGFTVCDVCGGKR 392


>ref|XP_009601807.1| PREDICTED: uncharacterized protein LOC104097013 isoform X2 [Nicotiana
            tomentosiformis]
 ref|XP_016452623.1| PREDICTED: uncharacterized protein LOC107777139 isoform X2 [Nicotiana
            tabacum]
          Length = 408

 Score =  350 bits (899), Expect = e-112
 Identities = 197/365 (53%), Positives = 237/365 (64%), Gaps = 14/365 (3%)
 Frame = +2

Query: 200  LSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAERQTLLHEISSLK 379
            LS SHIC++  P  +                               AER+ LL +I SL+
Sbjct: 45   LSVSHICYSSLPEREES-----RWLREEQRWLREEARWLREEKRWEAEREALLLQIQSLQ 99

Query: 380  LRLQELE-RLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTSAVPLXXXXXX 556
            LRL+E+E R+NSL   SV+ETVANIAKLLQ+LK+G+LGKN N I +SG+ +VPL      
Sbjct: 100  LRLKEVENRMNSLPETSVTETVANIAKLLQLLKEGELGKNVNVITESGSISVPLILEAAK 159

Query: 557  XXXXXXXXXXXS------IPDKKEI----LKTRATLRKGSEGDQVREMQEALQKLGFYSG 706
                              +P + E      K R  LRKGSEGD+VR +QE L KLGFY G
Sbjct: 160  ENEVIVKEAIEQEKVIREVPKESEREGKEAKKRRPLRKGSEGDEVRLLQEQLLKLGFYCG 219

Query: 707  EEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSG-SGLTKNQEHKSRGP 883
            EED+EFSSF+SGTE AVKTWQA+ G  EDGIMT+ELLETL+      G+ +N +      
Sbjct: 220  EEDMEFSSFTSGTESAVKTWQASSGVPEDGIMTSELLETLYMVQNIDGVRENPKQPDGTE 279

Query: 884  DKI-ANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSGSSKKT 1060
             K  ANGAPVASI E+ +VQQT+V ++ V+E  VS HRVFLLGENRWEEPSRL+ S K  
Sbjct: 280  AKTSANGAPVASIMEIEEVQQTIVKEDSVSETEVSHHRVFLLGENRWEEPSRLTTSRKPA 339

Query: 1061 -IKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVGYSTCDV 1237
                G+ TTKCLTCRGEGRLLCMECDGTGEPNIE QFMEW+DEG KCPYC+G G+ TCDV
Sbjct: 340  ETTGGSTTTKCLTCRGEGRLLCMECDGTGEPNIEEQFMEWIDEGMKCPYCEGHGFVTCDV 399

Query: 1238 CGGKQ 1252
            C GK+
Sbjct: 400  CEGKK 404


>ref|XP_007034404.2| PREDICTED: golgin subfamily A member 6-like protein 2 [Theobroma
            cacao]
          Length = 384

 Score =  348 bits (893), Expect = e-112
 Identities = 185/312 (59%), Positives = 232/312 (74%), Gaps = 9/312 (2%)
 Frame = +2

Query: 341  ERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSG 520
            E+++LL +IS LKL++Q LE  NS+QGASVSET+++I  LLQVLK+       NRIADSG
Sbjct: 79   EKESLLRQISELKLQIQTLENRNSVQGASVSETISSIGALLQVLKE------KNRIADSG 132

Query: 521  --TSAVPLXXXXXXXXXXXXXXXXXSIPDKKEILKT-RATLRKGSEGDQVREMQEALQKL 691
              TS + L                 +  +K+E  K  R  LR GSEG+QVREMQEAL+KL
Sbjct: 133  ESTSEMVLEEVKEKEVVVEEGVRVLARREKEEEKKIERKALRVGSEGEQVREMQEALEKL 192

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFYSGEED+EFSSFSSGTERAVKTWQAT+GAREDGIM+AELL+ LF  +G  + K+ +H+
Sbjct: 193  GFYSGEEDMEFSSFSSGTERAVKTWQATVGAREDGIMSAELLQRLF--TGQQI-KSSDHE 249

Query: 872  SRGP----DKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRL 1039
               P     +  NGA +AS++E+S++QQ VV +EG  +  VSQHRVFLLGENRWEEPSRL
Sbjct: 250  VAPPTVPEKEQTNGAAIASLTEISEIQQEVVKEEGFTQAEVSQHRVFLLGENRWEEPSRL 309

Query: 1040 SGSSKKTIKSGN--ATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDG 1213
            +G  KK ++S N  +TT+C  CRGEGRL+C ECDGTGEPN+EPQF+EWV+EGA CPYC+G
Sbjct: 310  AGKDKKAMESKNRDSTTRCHACRGEGRLMCTECDGTGEPNVEPQFLEWVEEGANCPYCEG 369

Query: 1214 VGYSTCDVCGGK 1249
            +GY+ CDVC GK
Sbjct: 370  LGYTICDVCQGK 381


>gb|EOY05330.1| Plastid transcriptionally active 5, putative isoform 1 [Theobroma
            cacao]
          Length = 384

 Score =  347 bits (890), Expect = e-111
 Identities = 185/312 (59%), Positives = 231/312 (74%), Gaps = 9/312 (2%)
 Frame = +2

Query: 341  ERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSG 520
            E++ LL +IS LKL++Q LE  NS+QGASVSET+++I  LLQVLK+       NRIADSG
Sbjct: 79   EKEFLLRQISELKLQIQALENRNSVQGASVSETISSIGALLQVLKE------KNRIADSG 132

Query: 521  --TSAVPLXXXXXXXXXXXXXXXXXSIPDKKEILKT-RATLRKGSEGDQVREMQEALQKL 691
              TS + L                 +  +K+E  K  R  LR GSEG+QVREMQEAL+KL
Sbjct: 133  ESTSEMVLEEVKEKEVVVEEGVRVLARREKEEEKKIERKALRVGSEGEQVREMQEALEKL 192

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFYSGEED+EFSSFSSGTERAVKTWQAT+GAREDGIM+AELL+ LF  +G  + K+ +H+
Sbjct: 193  GFYSGEEDMEFSSFSSGTERAVKTWQATVGAREDGIMSAELLQRLF--TGQQI-KSSDHE 249

Query: 872  SRGP----DKIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRL 1039
               P     +  NGA +AS++E+S++QQ VV +EG  +  VSQHRVFLLGENRWEEPSRL
Sbjct: 250  VAPPTVPEKEQTNGAAIASLTEISEIQQEVVKEEGFTQAEVSQHRVFLLGENRWEEPSRL 309

Query: 1040 SGSSKKTIKSGN--ATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDG 1213
            +G  KK ++S N  +TT+C  CRGEGRL+C ECDGTGEPN+EPQF+EWV+EGA CPYC+G
Sbjct: 310  AGKDKKAMESKNRDSTTRCHACRGEGRLMCTECDGTGEPNVEPQFLEWVEEGANCPYCEG 369

Query: 1214 VGYSTCDVCGGK 1249
            +GY+ CDVC GK
Sbjct: 370  LGYTICDVCQGK 381


>gb|PHT39288.1| hypothetical protein CQW23_22861 [Capsicum baccatum]
          Length = 402

 Score =  346 bits (887), Expect = e-110
 Identities = 191/371 (51%), Positives = 236/371 (63%), Gaps = 20/371 (5%)
 Frame = +2

Query: 200  LSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAERQTLLHEISSLK 379
            LS SHIC++ +P  +                                +R+ LL EI  L+
Sbjct: 36   LSISHICYSTSPEREEE----SRWLREEARWLREEQRWLREEKRWEKQREALLVEIQKLQ 91

Query: 380  LRLQELERLNSL--QGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTSAVPLXXXXX 553
            LR++ELE  NS+  QG+SVSETV+ IAKLLQ+LK+G+ GKN   IA+SG+ A+PL     
Sbjct: 92   LRVKELESRNSVVVQGSSVSETVSAIAKLLQLLKEGEAGKNVTVIAESGSIALPL----V 147

Query: 554  XXXXXXXXXXXXSIPDKKEIL--------------KTRATLRKGSEGDQVREMQEALQKL 691
                          P +++++              K R TL+KGSEGD+VR MQE L KL
Sbjct: 148  LEAAKQNEVVVKEAPQQEKVIREAPKEAEGDGNKAKKRRTLKKGSEGDEVRLMQEQLLKL 207

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFY GEED+EFSSF+ GTERAVKTWQA+   REDGIMT+ELLE L+        K    +
Sbjct: 208  GFYCGEEDMEFSSFAGGTERAVKTWQASCDVREDGIMTSELLEKLYTVQKIDTVKENPKQ 267

Query: 872  SRGPD--KIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSG 1045
              G +    ANGAP+ASI E+ +VQQT+V ++GV+E  VS HRVFLLGENRWEEPSRLS 
Sbjct: 268  PDGTEAKASANGAPIASIMEIEEVQQTIVKEDGVSETEVSHHRVFLLGENRWEEPSRLST 327

Query: 1046 SSK--KTIKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVG 1219
            S K  +T    + T KCLTCRGEGRLLCMECDGTGEPNIE QFMEW+DEG KCPYC+G G
Sbjct: 328  SKKPAETTNGSSTTVKCLTCRGEGRLLCMECDGTGEPNIEEQFMEWIDEGMKCPYCEGHG 387

Query: 1220 YSTCDVCGGKQ 1252
            + TCDVC GK+
Sbjct: 388  FVTCDVCDGKK 398


>ref|XP_016452615.1| PREDICTED: uncharacterized protein LOC107777139 isoform X1 [Nicotiana
            tabacum]
 ref|XP_018626626.1| PREDICTED: uncharacterized protein LOC104097013 isoform X1 [Nicotiana
            tomentosiformis]
          Length = 409

 Score =  346 bits (887), Expect = e-110
 Identities = 197/366 (53%), Positives = 237/366 (64%), Gaps = 15/366 (4%)
 Frame = +2

Query: 200  LSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAERQTLLHEISSLK 379
            LS SHIC++  P  +                               AER+ LL +I SL+
Sbjct: 45   LSVSHICYSSLPEREES-----RWLREEQRWLREEARWLREEKRWEAEREALLLQIQSLQ 99

Query: 380  LRLQELE-RLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTSAVPLXXXXXX 556
            LRL+E+E R+NSL   SV+ETVANIAKLLQ+LK+G+LGKN N I +SG+ +VPL      
Sbjct: 100  LRLKEVENRMNSLPETSVTETVANIAKLLQLLKEGELGKNVNVITESGSISVPLILEAAK 159

Query: 557  XXXXXXXXXXXS------IPDKKEI----LKTRATLRKGSEGDQVREMQEALQKLGFYSG 706
                              +P + E      K R  LRKGSEGD+VR +QE L KLGFY G
Sbjct: 160  ENEVIVKEAIEQEKVIREVPKESEREGKEAKKRRPLRKGSEGDEVRLLQEQLLKLGFYCG 219

Query: 707  EEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSG-SGLTKNQEHKSRGP 883
            EED+EFSSF+SGTE AVKTWQA+ G  EDGIMT+ELLETL+      G+ +N +      
Sbjct: 220  EEDMEFSSFTSGTESAVKTWQASSGVPEDGIMTSELLETLYMVQNIDGVRENPKQPDGTE 279

Query: 884  DKI-ANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSGSSKKT 1060
             K  ANGAPVASI E+ +VQQT+V ++ V+E  VS HRVFLLGENRWEEPSRL+ S K  
Sbjct: 280  AKTSANGAPVASIMEIEEVQQTIVKEDSVSETEVSHHRVFLLGENRWEEPSRLTTSRKPA 339

Query: 1061 -IKSGNATTKCLTCRGEGRLLCMECDGTGEPNI-EPQFMEWVDEGAKCPYCDGVGYSTCD 1234
                G+ TTKCLTCRGEGRLLCMECDGTGEPNI E QFMEW+DEG KCPYC+G G+ TCD
Sbjct: 340  ETTGGSTTTKCLTCRGEGRLLCMECDGTGEPNIEEQQFMEWIDEGMKCPYCEGHGFVTCD 399

Query: 1235 VCGGKQ 1252
            VC GK+
Sbjct: 400  VCEGKK 405


>gb|PHU21429.1| hypothetical protein BC332_06536 [Capsicum chinense]
          Length = 402

 Score =  345 bits (885), Expect = e-110
 Identities = 190/371 (51%), Positives = 237/371 (63%), Gaps = 20/371 (5%)
 Frame = +2

Query: 200  LSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAERQTLLHEISSLK 379
            LS SHIC++ +P  +                                +R+ LL EI  L+
Sbjct: 36   LSISHICYSTSPEREEE----SRWLREEARWLREEQRWLREERRWEKQREALLVEIQKLQ 91

Query: 380  LRLQELERLNSL--QGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTSAVPLXXXXX 553
            LR++ELE  NS+  +G+SVSETV+ IAKLLQ+LK+G++GKN   IA+SG+ A+PL     
Sbjct: 92   LRVKELESRNSVVVEGSSVSETVSAIAKLLQLLKEGEVGKNVTVIAESGSIALPL----V 147

Query: 554  XXXXXXXXXXXXSIPDKKEIL--------------KTRATLRKGSEGDQVREMQEALQKL 691
                          P +++++              K R TL+KGSEGD+VR MQE L KL
Sbjct: 148  LEAAKQNEVVVKEAPQQEKVIREAPKEAEGDGNKAKKRRTLKKGSEGDEVRLMQEQLLKL 207

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFY GEED+EFSSF+ GTERAVKTWQA+   REDGIMT+ELLE L+        K    +
Sbjct: 208  GFYCGEEDMEFSSFAGGTERAVKTWQASCDVREDGIMTSELLEKLYTVQKIDTVKENPKQ 267

Query: 872  SRGPDKI--ANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSG 1045
              G +    ANGAP+ASI E+ +VQQT+V ++GV+E  VS HRVFLLGENRWEEPSRLS 
Sbjct: 268  PDGTEAKAGANGAPIASIMEIEEVQQTIVKEDGVSETEVSHHRVFLLGENRWEEPSRLST 327

Query: 1046 SSK--KTIKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVG 1219
            S K  +T    + T KCLTCRGEGRLLCMECDGTGEPNIE QFMEW+DEG KCPYC+G G
Sbjct: 328  SKKPAETTNGSSTTVKCLTCRGEGRLLCMECDGTGEPNIEEQFMEWIDEGMKCPYCEGHG 387

Query: 1220 YSTCDVCGGKQ 1252
            + TCDVC GK+
Sbjct: 388  FVTCDVCDGKK 398


>ref|XP_017617412.1| PREDICTED: golgin subfamily A member 6-like protein 6 [Gossypium
            arboreum]
 gb|KHG13022.1| Spore cortex-lytic enzyme [Gossypium arboreum]
          Length = 397

 Score =  345 bits (884), Expect = e-110
 Identities = 188/315 (59%), Positives = 219/315 (69%), Gaps = 13/315 (4%)
 Frame = +2

Query: 341  ERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSG 520
            E+++LL EIS LKL++Q LE  NS  GASV+ET++ I  LLQVLKD       NRIA+SG
Sbjct: 93   EKESLLWEISQLKLQIQALENRNSFHGASVTETISRIGALLQVLKD------KNRIAESG 146

Query: 521  TSAVPLXXXXXXXXXXXXXXXXXSIPDK-KEILKT--RATLRKGSEGDQVREMQEALQKL 691
             SA  +                  +  K KE+ K   R TLR GSEG+QVREMQEAL KL
Sbjct: 147  ESARDMVFEEVKEKEVVVEEGVRVLEKKAKEVEKKIERKTLRVGSEGEQVREMQEALGKL 206

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFYSGEEDIEFSSFSSGTERAVKTWQAT+GAREDGIMTAELL+ LF        + QE K
Sbjct: 207  GFYSGEEDIEFSSFSSGTERAVKTWQATIGAREDGIMTAELLQRLF--------EEQEVK 258

Query: 872  SRGPDKIA--------NGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEE 1027
            S     IA        NG  + S++E+S++QQ VV +EG  E  VSQHRVFLLGENRWEE
Sbjct: 259  SSSSSNIATIREKEGTNGTAITSLTEISEIQQKVVKEEGFTEAEVSQHRVFLLGENRWEE 318

Query: 1028 PSRLSGSSKKTIKSGN--ATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCP 1201
            PSRL+G  K+   S N  A T C  CRGEGRL+C ECDGTGEPN+EPQF+EWVDEGA CP
Sbjct: 319  PSRLTGKDKQATGSKNIDAKTSCHACRGEGRLMCAECDGTGEPNVEPQFLEWVDEGANCP 378

Query: 1202 YCDGVGYSTCDVCGG 1246
            YCDG+GY+TC+VC G
Sbjct: 379  YCDGLGYTTCEVCQG 393


>ref|XP_012456811.1| PREDICTED: putative golgin subfamily A member 6-like protein 6
            [Gossypium raimondii]
 gb|KJB72117.1| hypothetical protein B456_011G160100 [Gossypium raimondii]
          Length = 399

 Score =  345 bits (884), Expect = e-110
 Identities = 188/315 (59%), Positives = 219/315 (69%), Gaps = 13/315 (4%)
 Frame = +2

Query: 341  ERQTLLHEISSLKLRLQELERLNSLQGASVSETVANIAKLLQVLKDGDLGKNTNRIADSG 520
            E+++LL EIS LKL++Q LE  NS  GASV+ET++ I  LLQVLKD       NRIA+SG
Sbjct: 95   EKESLLWEISQLKLQIQALENRNSFHGASVTETISRIGALLQVLKD------KNRIAESG 148

Query: 521  TSAVPLXXXXXXXXXXXXXXXXXSIPDK-KEILKT--RATLRKGSEGDQVREMQEALQKL 691
             SA  +                  +  K KE+ K   R TLR GSEG+QVREMQEAL KL
Sbjct: 149  ESARDMVFEEVKEKEVVVEEGVRVLEKKAKEVEKKIERKTLRVGSEGEQVREMQEALGKL 208

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFYSGEEDIEFSSFSSGTERAVKTWQAT+GAREDGIMTAELL+ LF        + QE K
Sbjct: 209  GFYSGEEDIEFSSFSSGTERAVKTWQATIGAREDGIMTAELLQRLF--------EEQEVK 260

Query: 872  SRGPDKIA--------NGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEE 1027
            S     IA        NG  + S++E+S++QQ VV +EG  E  VSQHRVFLLGENRWEE
Sbjct: 261  SSSSSNIATIWEKEGTNGTAITSLTEISEIQQKVVKEEGFTEAEVSQHRVFLLGENRWEE 320

Query: 1028 PSRLSGSSKKTIKSGN--ATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCP 1201
            PSRL+G  K+   S N  A T C  CRGEGRL+C ECDGTGEPN+EPQF+EWVDEGA CP
Sbjct: 321  PSRLTGKDKQATGSENIDAKTSCHACRGEGRLMCAECDGTGEPNVEPQFLEWVDEGANCP 380

Query: 1202 YCDGVGYSTCDVCGG 1246
            YCDG+GY+TC+VC G
Sbjct: 381  YCDGLGYTTCEVCQG 395


>ref|XP_016566118.1| PREDICTED: uncharacterized protein LOC107864304 [Capsicum annuum]
 gb|PHT85366.1| hypothetical protein T459_07472 [Capsicum annuum]
          Length = 402

 Score =  344 bits (883), Expect = e-110
 Identities = 190/371 (51%), Positives = 236/371 (63%), Gaps = 20/371 (5%)
 Frame = +2

Query: 200  LSKSHICFAFTPSNQNPIGXXXXXXXXXXXXXXXXXXXXXXXXXXNAERQTLLHEISSLK 379
            LS SHIC++ +P  +                                +R+ LL EI  L+
Sbjct: 36   LSISHICYSTSPEREEE----SRWLREEARWLREEQRWLREERRWEKQREALLVEIQKLQ 91

Query: 380  LRLQELERLNSL--QGASVSETVANIAKLLQVLKDGDLGKNTNRIADSGTSAVPLXXXXX 553
            LR++ELE  NS+  +G+SVSETV+ IAKLLQ+LK+G+ GKN   IA+SG+ A+PL     
Sbjct: 92   LRVKELESRNSVVVEGSSVSETVSAIAKLLQLLKEGEAGKNVTVIAESGSIALPL----V 147

Query: 554  XXXXXXXXXXXXSIPDKKEIL--------------KTRATLRKGSEGDQVREMQEALQKL 691
                          P +++++              K R TL+KGSEGD+VR MQE L KL
Sbjct: 148  LEAAKQNEVVVKEAPQQEKVIREAPKEAEGDGNKAKKRRTLKKGSEGDEVRLMQEQLLKL 207

Query: 692  GFYSGEEDIEFSSFSSGTERAVKTWQATLGAREDGIMTAELLETLFGSSGSGLTKNQEHK 871
            GFY GEED+EFSSF+ GTERAVKTWQA+   REDGIMT+ELLE L+        K    +
Sbjct: 208  GFYCGEEDMEFSSFAGGTERAVKTWQASSDVREDGIMTSELLEKLYTVQKIDTVKENPKQ 267

Query: 872  SRGPD--KIANGAPVASISEVSKVQQTVVTDEGVNEVNVSQHRVFLLGENRWEEPSRLSG 1045
              G +    ANGAP+ASI E+ +VQQT+V ++GV+E  VS HRVFLLGENRWEEPSRLS 
Sbjct: 268  PDGTEAKASANGAPIASIMEIEEVQQTIVKEDGVSETEVSHHRVFLLGENRWEEPSRLST 327

Query: 1046 SSK--KTIKSGNATTKCLTCRGEGRLLCMECDGTGEPNIEPQFMEWVDEGAKCPYCDGVG 1219
            S K  +T    + T KCLTCRGEGRLLCMECDGTGEPNIE QFMEW+DEG KCPYC+G G
Sbjct: 328  SKKPAETTNGSSTTVKCLTCRGEGRLLCMECDGTGEPNIEEQFMEWIDEGMKCPYCEGHG 387

Query: 1220 YSTCDVCGGKQ 1252
            + TCDVC GK+
Sbjct: 388  FVTCDVCDGKK 398

