BLASTX nr result

ID: Cornus23_contig00015673 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00015673
         (1658 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002266002.1| PREDICTED: uncharacterized protein LOC100258...   303   4e-79
ref|XP_012087305.1| PREDICTED: uncharacterized protein LOC105646...   288   1e-74
ref|XP_012087307.1| PREDICTED: uncharacterized protein LOC105646...   286   3e-74
ref|XP_007023644.1| Tetratricopeptide repeat (TPR)-like superfam...   283   2e-73
ref|XP_010088748.1| hypothetical protein L484_001367 [Morus nota...   280   3e-72
ref|XP_007023645.1| Tetratricopeptide repeat-like superfamily pr...   279   6e-72
gb|KDO41699.1| hypothetical protein CISIN_1g015907mg [Citrus sin...   278   1e-71
ref|XP_006468126.1| PREDICTED: uncharacterized protein LOC102618...   278   1e-71
ref|XP_002310923.2| hypothetical protein POPTR_0008s00440g [Popu...   274   1e-70
ref|XP_011033077.1| PREDICTED: uncharacterized protein LOC105131...   270   2e-69
ref|XP_003529421.1| PREDICTED: uncharacterized protein LOC100790...   270   2e-69
ref|XP_004303801.1| PREDICTED: uncharacterized protein LOC101309...   267   2e-68
emb|CAN63743.1| hypothetical protein VITISV_041630 [Vitis vinifera]   266   4e-68
gb|KHN39380.1| hypothetical protein glysoja_018152 [Glycine soja]     265   9e-68
gb|KHN38106.1| hypothetical protein glysoja_007148 [Glycine soja]     265   1e-67
ref|XP_006379387.1| hypothetical protein POPTR_0008s00440g [Popu...   265   1e-67
ref|XP_003531780.1| PREDICTED: uncharacterized protein LOC100777...   262   8e-67
ref|XP_012450807.1| PREDICTED: uncharacterized protein LOC105773...   261   2e-66
gb|KJB66490.1| hypothetical protein B456_010G141500 [Gossypium r...   261   2e-66
gb|KJB66489.1| hypothetical protein B456_010G141500 [Gossypium r...   261   2e-66

>ref|XP_002266002.1| PREDICTED: uncharacterized protein LOC100258138 isoform X1 [Vitis
            vinifera] gi|297735765|emb|CBI18452.3| unnamed protein
            product [Vitis vinifera]
          Length = 418

 Score =  303 bits (775), Expect = 4e-79
 Identities = 167/269 (62%), Positives = 197/269 (73%), Gaps = 1/269 (0%)
 Frame = -1

Query: 1295 VYTARIGVNRKLPPPRRLHMFRLHCS-DSDPGRGFGRPSTDKNSKKATRDTKSRREKDVV 1119
            VY    G+NRK   P  L  FR+HCS DS P RGFG P   +   K ++ T S+  K  V
Sbjct: 13   VYGYGCGLNRK---PPSLLTFRIHCSSDSKPTRGFG-PQPPQRDNKMSKSTTSKEGKGGV 68

Query: 1118 VQQRKSATNQSGSIPRQAPGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEE 939
            +QQRKS + QSGS+P QAPGL+ R+ GKS + A D++FEERLEAV+R+ALEQKKADE +E
Sbjct: 69   LQQRKSTSKQSGSVPTQAPGLSSRSGGKSNDAAIDLDFEERLEAVRRTALEQKKADEKKE 128

Query: 938  YGAIDYDAPIESKGSTIGLGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKP 759
            YGAIDYD P+ES+  TIGLGTKI               GDFLPSGS SP+EEA VV++K 
Sbjct: 129  YGAIDYDTPVESEEKTIGLGTKIGVGVAVVVFGLVFALGDFLPSGSDSPSEEATVVSKKL 188

Query: 758  SGEEKLKLQTRLQQYEATLKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDP 579
            S EEK  LQ RLQQYEATL  SP+D TALE AAVTL ELGEY+RAASLLED  K++P+DP
Sbjct: 189  SEEEKSTLQARLQQYEATLSSSPKDQTALEAAAVTLVELGEYTRAASLLEDFVKEKPNDP 248

Query: 578  DALRLLGEVKYQLKDYEGSAAAYKSSAMV 492
            +A RLLGEVK+ LKDYEGSAAAY+SSA V
Sbjct: 249  EAFRLLGEVKFALKDYEGSAAAYRSSAKV 277


>ref|XP_012087305.1| PREDICTED: uncharacterized protein LOC105646135 isoform X1 [Jatropha
            curcas] gi|643711533|gb|KDP25040.1| hypothetical protein
            JCGZ_22575 [Jatropha curcas]
          Length = 395

 Score =  288 bits (737), Expect = 1e-74
 Identities = 153/248 (61%), Positives = 188/248 (75%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            FR+ C+DS P RGFG  + D N+ K  + T SR EK + +QQRKS + QSG  P QAPGL
Sbjct: 17   FRVQCADSKPRRGFGAKN-DPNNNKTKKVTASREEKGMALQQRKSTSRQSGPSPTQAPGL 75

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
            + R +GK +++  D+EFEERLEAV+RSALEQKKADE++E+G IDYDAP+ES   TIGLGT
Sbjct: 76   SFRIDGKPKSM--DLEFEERLEAVRRSALEQKKADEIKEFGPIDYDAPVESDKKTIGLGT 133

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGS SP EEA  V++K S EEK  L T+L+QYE TL +
Sbjct: 134  KIGVGVAVLVFGLVFALGDFLPSGSDSPPEEAATVDKKLSKEEKAILLTQLKQYETTLAV 193

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP+DP ALEGAAVTL+ELG+Y++AASLL+DL K++P+DPD  RLLGEVKY+LKDYEGSA 
Sbjct: 194  SPKDPVALEGAAVTLSELGKYTQAASLLQDLAKEKPNDPDVFRLLGEVKYELKDYEGSAN 253

Query: 515  AYKSSAMV 492
            AY+SSAMV
Sbjct: 254  AYRSSAMV 261


>ref|XP_012087307.1| PREDICTED: uncharacterized protein LOC105646135 isoform X2 [Jatropha
            curcas]
          Length = 370

 Score =  286 bits (733), Expect = 3e-74
 Identities = 152/247 (61%), Positives = 187/247 (75%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            FR+ C+DS P RGFG  + D N+ K  + T SR EK + +QQRKS + QSG  P QAPGL
Sbjct: 17   FRVQCADSKPRRGFGAKN-DPNNNKTKKVTASREEKGMALQQRKSTSRQSGPSPTQAPGL 75

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
            + R +GK +++  D+EFEERLEAV+RSALEQKKADE++E+G IDYDAP+ES   TIGLGT
Sbjct: 76   SFRIDGKPKSM--DLEFEERLEAVRRSALEQKKADEIKEFGPIDYDAPVESDKKTIGLGT 133

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGS SP EEA  V++K S EEK  L T+L+QYE TL +
Sbjct: 134  KIGVGVAVLVFGLVFALGDFLPSGSDSPPEEAATVDKKLSKEEKAILLTQLKQYETTLAV 193

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP+DP ALEGAAVTL+ELG+Y++AASLL+DL K++P+DPD  RLLGEVKY+LKDYEGSA 
Sbjct: 194  SPKDPVALEGAAVTLSELGKYTQAASLLQDLAKEKPNDPDVFRLLGEVKYELKDYEGSAN 253

Query: 515  AYKSSAM 495
            AY+SSAM
Sbjct: 254  AYRSSAM 260


>ref|XP_007023644.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
            [Theobroma cacao] gi|508779010|gb|EOY26266.1|
            Tetratricopeptide repeat (TPR)-like superfamily protein
            isoform 1 [Theobroma cacao]
          Length = 406

 Score =  283 bits (725), Expect = 2e-73
 Identities = 149/252 (59%), Positives = 188/252 (74%)
 Frame = -1

Query: 1247 RLHMFRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQ 1068
            R   F++ CSDS   RGFG   + K ++KA + + SR EK + +QQRKS + QSG  P Q
Sbjct: 17   RFLSFQIQCSDSKAKRGFG---SKKPNQKANKVSASREEKGMKLQQRKSTSKQSGPSPAQ 73

Query: 1067 APGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTI 888
            APGL+ + +GKS + + D++FEERLEA++R+A++QKKA+E +E+G IDYDAP ES   TI
Sbjct: 74   APGLSAQFDGKSNSSSLDIDFEERLEAIRRAAVQQKKAEEQKEFGPIDYDAPAESDKKTI 133

Query: 887  GLGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEA 708
            GLGT+I               GDFLPSGS +P EEA V+++K S EEK  LQTRL+Q+EA
Sbjct: 134  GLGTQIGVGVAVVVFGLVFALGDFLPSGSTNPPEEAAVIDKKLSNEEKATLQTRLKQFEA 193

Query: 707  TLKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYE 528
             L ISP+DPTALEGAAVTLTELG+Y+RAASLL+DL K++ SDPD  RLLGEVKY LKDY+
Sbjct: 194  MLSISPKDPTALEGAAVTLTELGDYARAASLLQDLAKEKTSDPDVFRLLGEVKYALKDYD 253

Query: 527  GSAAAYKSSAMV 492
            GSAAAYK SAMV
Sbjct: 254  GSAAAYKLSAMV 265


>ref|XP_010088748.1| hypothetical protein L484_001367 [Morus notabilis]
            gi|587950074|gb|EXC36067.1| hypothetical protein
            L484_001367 [Morus notabilis]
          Length = 398

 Score =  280 bits (716), Expect = 3e-72
 Identities = 151/250 (60%), Positives = 187/250 (74%), Gaps = 4/250 (1%)
 Frame = -1

Query: 1229 LHCSDSD----PGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAP 1062
            + CSDS+    P RGFG P+T+ N+K      K ++ K +V+ QRKSA  +SGS P QAP
Sbjct: 16   IRCSDSNSNSNPKRGFG-PNTNDNNKT----NKGKKNKGLVIDQRKSAARRSGSEPAQAP 70

Query: 1061 GLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGL 882
            GL  +  GKS+N + DV+FEERL+A+KR+ALEQKK +E +E+GAIDYD PIES+  TIGL
Sbjct: 71   GLRSQFGGKSKNSSIDVDFEERLKAIKRAALEQKKVEEEKEFGAIDYDVPIESEKKTIGL 130

Query: 881  GTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATL 702
            GTKI               GDFLP+GS+ P+EEA VV+ + S EEK  LQT+L++YEATL
Sbjct: 131  GTKIGVGVAVAVFGLVFALGDFLPTGSIGPSEEAAVVDNQLSKEEKTILQTQLKEYEATL 190

Query: 701  KISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGS 522
              SP+DPTALEGAAVTL ELGEY+RAASLLEDLTK++PSDPD   LLGEVKY+LKDYEGS
Sbjct: 191  SNSPKDPTALEGAAVTLAELGEYTRAASLLEDLTKEKPSDPDVFLLLGEVKYKLKDYEGS 250

Query: 521  AAAYKSSAMV 492
            A AYK+S+ V
Sbjct: 251  ADAYKTSSKV 260


>ref|XP_007023645.1| Tetratricopeptide repeat-like superfamily protein isoform 2
            [Theobroma cacao] gi|508779011|gb|EOY26267.1|
            Tetratricopeptide repeat-like superfamily protein isoform
            2 [Theobroma cacao]
          Length = 373

 Score =  279 bits (713), Expect = 6e-72
 Identities = 149/253 (58%), Positives = 188/253 (74%), Gaps = 1/253 (0%)
 Frame = -1

Query: 1247 RLHMFRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQ 1068
            R   F++ CSDS   RGFG   + K ++KA + + SR EK + +QQRKS + QSG  P Q
Sbjct: 17   RFLSFQIQCSDSKAKRGFG---SKKPNQKANKVSASREEKGMKLQQRKSTSKQSGPSPAQ 73

Query: 1067 APGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTI 888
            APGL+ + +GKS + + D++FEERLEA++R+A++QKKA+E +E+G IDYDAP ES   TI
Sbjct: 74   APGLSAQFDGKSNSSSLDIDFEERLEAIRRAAVQQKKAEEQKEFGPIDYDAPAESDKKTI 133

Query: 887  GLGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQ-TRLQQYE 711
            GLGT+I               GDFLPSGS +P EEA V+++K S EEK  LQ TRL+Q+E
Sbjct: 134  GLGTQIGVGVAVVVFGLVFALGDFLPSGSTNPPEEAAVIDKKLSNEEKATLQQTRLKQFE 193

Query: 710  ATLKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDY 531
            A L ISP+DPTALEGAAVTLTELG+Y+RAASLL+DL K++ SDPD  RLLGEVKY LKDY
Sbjct: 194  AMLSISPKDPTALEGAAVTLTELGDYARAASLLQDLAKEKTSDPDVFRLLGEVKYALKDY 253

Query: 530  EGSAAAYKSSAMV 492
            +GSAAAYK SAMV
Sbjct: 254  DGSAAAYKLSAMV 266


>gb|KDO41699.1| hypothetical protein CISIN_1g015907mg [Citrus sinensis]
          Length = 398

 Score =  278 bits (711), Expect = 1e-71
 Identities = 151/248 (60%), Positives = 181/248 (72%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            FR+ CSDS P RGFG   TDK +K+  +   S+        +RKS + QSGS+P QAP L
Sbjct: 16   FRIQCSDSKPRRGFGN-KTDKTNKEEKKGVMSQ-------PKRKSLSKQSGSLPTQAPFL 67

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
            +     KS N + D+ FEERL AV+RSALEQKKA+E++E+G IDYDAPIE++  TIGLGT
Sbjct: 68   SSGYNSKSNNSSSDINFEERLAAVRRSALEQKKAEEIKEFGPIDYDAPIETEKKTIGLGT 127

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGSVSPTEEA VVN++ S EEK  LQTRL++YE TL I
Sbjct: 128  KIGVGVAVVIFGLVFALGDFLPSGSVSPTEEAGVVNKELSEEEKNVLQTRLKKYEETLSI 187

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP+D TALEGAAVTL ELG+Y+RA SLL+DL K++PSDPD  RLLGEVKY+LKDYEGSAA
Sbjct: 188  SPKDSTALEGAAVTLAELGDYTRAVSLLQDLAKEKPSDPDVFRLLGEVKYELKDYEGSAA 247

Query: 515  AYKSSAMV 492
            AY+ S MV
Sbjct: 248  AYRVSTMV 255


>ref|XP_006468126.1| PREDICTED: uncharacterized protein LOC102618377 [Citrus sinensis]
          Length = 398

 Score =  278 bits (711), Expect = 1e-71
 Identities = 151/248 (60%), Positives = 181/248 (72%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            FR+ CSDS P RGFG   TDK +K+  +   S+        +RKS + QSGS+P QAP L
Sbjct: 16   FRIQCSDSKPRRGFGN-KTDKTNKEEKKGVMSQ-------PKRKSLSKQSGSLPTQAPIL 67

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
                  KS N + D++FEERL AV+RSALEQKKA+E++E+G IDYDAPIE++  TIGLGT
Sbjct: 68   GSGYNSKSNNSSSDIDFEERLAAVRRSALEQKKAEEIKEFGPIDYDAPIETEKKTIGLGT 127

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGSVSPTEEA VVN++ S EEK  LQTRL++YE TL I
Sbjct: 128  KIGVGVAVVIFGLVFALGDFLPSGSVSPTEEAGVVNKELSEEEKNVLQTRLKKYEETLSI 187

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP+D TALEGAAVTL ELG+Y+RA SLL+DL K++PSDPD  RLLGEVKY+LKDYEGSAA
Sbjct: 188  SPKDSTALEGAAVTLAELGDYTRAVSLLQDLAKEKPSDPDVFRLLGEVKYELKDYEGSAA 247

Query: 515  AYKSSAMV 492
            AY+ S MV
Sbjct: 248  AYRVSTMV 255


>ref|XP_002310923.2| hypothetical protein POPTR_0008s00440g [Populus trichocarpa]
            gi|550332069|gb|EEE88290.2| hypothetical protein
            POPTR_0008s00440g [Populus trichocarpa]
          Length = 411

 Score =  274 bits (701), Expect = 1e-70
 Identities = 145/250 (58%), Positives = 180/250 (72%), Gaps = 2/250 (0%)
 Frame = -1

Query: 1235 FRLHCSD-SDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSG-SIPRQAP 1062
            F + CSD S P RGFG  S +  + K  R + SR EK + +QQRKS T QSG S+P QAP
Sbjct: 21   FGVQCSDNSSPRRGFGSKSDNNTNNKKVRSSSSREEKGMALQQRKSTTKQSGASLPSQAP 80

Query: 1061 GLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGL 882
            GL+ R +GKS   + D +FEERL+AV+RSALEQKK + ++E+G IDYD P++++  TIGL
Sbjct: 81   GLSSRFDGKSSRNSADTDFEERLQAVRRSALEQKKTEAIKEFGPIDYDEPVKTENKTIGL 140

Query: 881  GTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATL 702
            GTKI               GDFLPSGS  PTEEA VVN+K S EE+  L+ RL+QYE TL
Sbjct: 141  GTKIGVGVAVLVFGLVFALGDFLPSGSDGPTEEATVVNKKLSEEEQNTLRARLKQYELTL 200

Query: 701  KISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGS 522
              +P+D  ALEGAAVTL ELGEY+RAASLL+DL K++P DPD  RLLGE+KY+LKDY+GS
Sbjct: 201  STAPKDSIALEGAAVTLAELGEYTRAASLLQDLAKEKPGDPDVFRLLGEIKYELKDYDGS 260

Query: 521  AAAYKSSAMV 492
            AAAY+ SA V
Sbjct: 261  AAAYRISAAV 270


>ref|XP_011033077.1| PREDICTED: uncharacterized protein LOC105131691 isoform X1 [Populus
            euphratica]
          Length = 411

 Score =  270 bits (691), Expect = 2e-69
 Identities = 142/248 (57%), Positives = 178/248 (71%), Gaps = 2/248 (0%)
 Frame = -1

Query: 1235 FRLHCSD-SDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSG-SIPRQAP 1062
            F + CSD S P RGFG  S    + K  R   SR EK + +QQRKS T QSG S+P QAP
Sbjct: 21   FGVQCSDNSSPRRGFGSKSDTNTNNKKVRSGSSREEKGMALQQRKSTTKQSGASLPSQAP 80

Query: 1061 GLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGL 882
            GL+ R +GKS   + D++FE+RL+A++RSALEQKK + ++E+G IDYDAP++++  TIGL
Sbjct: 81   GLSSRFDGKSSRNSADIDFEQRLQAIRRSALEQKKTESIKEFGPIDYDAPVKTENKTIGL 140

Query: 881  GTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATL 702
            GTKI               GDFLPSGS  P EEA VVN+K S EE+  L+ RL+QYE TL
Sbjct: 141  GTKIGVGVAVLVFGLVFALGDFLPSGSDGPAEEATVVNKKLSEEEQNTLRARLKQYELTL 200

Query: 701  KISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGS 522
              +P+D  ALEGAAVTL ELGEY+RAASLL+DL K++P DPD  RLLGE+KY+LKDY+GS
Sbjct: 201  STAPKDSIALEGAAVTLFELGEYTRAASLLQDLAKEKPGDPDVFRLLGEIKYELKDYDGS 260

Query: 521  AAAYKSSA 498
            AAAY+ SA
Sbjct: 261  AAAYRISA 268


>ref|XP_003529421.1| PREDICTED: uncharacterized protein LOC100790462 [Glycine max]
            gi|947098487|gb|KRH46979.1| hypothetical protein
            GLYMA_07G001600 [Glycine max]
          Length = 389

 Score =  270 bits (691), Expect = 2e-69
 Identities = 149/249 (59%), Positives = 183/249 (73%), Gaps = 1/249 (0%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRK-SATNQSGSIPRQAPG 1059
            F+++CSDS  GRGFG  +T+ N  K  +      +K +V QQ K SA  QS  +  QAP 
Sbjct: 19   FQINCSDSKQGRGFGE-NTNSNRIKTNKS-----DKGLVSQQSKGSANKQSRPLSSQAPR 72

Query: 1058 LNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLG 879
            L+ + +GKSRN   DV+FEERL+AV+RSALEQKKA+E +E+GAIDYDAPI S  +TIG+G
Sbjct: 73   LSSQLDGKSRNDFLDVDFEERLKAVRRSALEQKKAEEEKEFGAIDYDAPIPSDNTTIGVG 132

Query: 878  TKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLK 699
            TKI               GDFLPSGSVSPTE++ VVN K S E+K  LQ+RL+++EATL 
Sbjct: 133  TKIGVGVAVAVFGLVFAFGDFLPSGSVSPTEDSAVVNSKLSEEDKATLQSRLKEFEATLS 192

Query: 698  ISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSA 519
             S  DPTALEGAA+TL ELGEY+RAASLL+DLTK++P+D D  RLLGEVKY+LKDYEGS 
Sbjct: 193  NSSRDPTALEGAAITLAELGEYARAASLLDDLTKEKPNDADVFRLLGEVKYELKDYEGSV 252

Query: 518  AAYKSSAMV 492
            AAYKSSA V
Sbjct: 253  AAYKSSARV 261


>ref|XP_004303801.1| PREDICTED: uncharacterized protein LOC101309273 [Fragaria vesca
            subsp. vesca]
          Length = 404

 Score =  267 bits (683), Expect = 2e-68
 Identities = 156/287 (54%), Positives = 188/287 (65%), Gaps = 6/287 (2%)
 Frame = -1

Query: 1334 MSIIAATTTTVCFVYTARIGVNRKLPPPRRLHMFRLHCSDSD-PGRGFGRPSTDKNSKK- 1161
            M I AATTT V                  R HM R+ C+DS  P  GFG  + +K+ KK 
Sbjct: 1    MFIAAATTTAVTTC---------------RFHMLRIQCADSSKPRPGFGTKTNNKSKKKN 45

Query: 1160 ----ATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGLNPRTEGKSRNIAFDVEFEERL 993
                  + T S  +K    QQ KS TN+S  +  QAPGL+ R +GK +    D+EFEERL
Sbjct: 46   NLNQTNKATTSVDQKGTGFQQGKSTTNRS--VTNQAPGLSSRFDGKVKRNLGDLEFEERL 103

Query: 992  EAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGTKIXXXXXXXXXXXXXXXGDFL 813
            EAV+ SAL+QKK  E EEYGAIDYDAP++S+   IGLG +I               GDFL
Sbjct: 104  EAVRSSALQQKKTVEKEEYGAIDYDAPVKSEKKKIGLGAQIGVGVAVLVFGLVFALGDFL 163

Query: 812  PSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKISPEDPTALEGAAVTLTELGEY 633
            PS SVSPTE+A + + K S EEK  LQTRL++YEATL  SP+DPTALEGAAVTL ELGEY
Sbjct: 164  PSSSVSPTEDAALTSNKLSEEEKASLQTRLKEYEATLSNSPKDPTALEGAAVTLAELGEY 223

Query: 632  SRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAAAYKSSAMV 492
            SRA++LLEDLTK++PSDP+  RLLGEVKY+LKDYEGS AAYK S+ V
Sbjct: 224  SRASTLLEDLTKEKPSDPEVFRLLGEVKYELKDYEGSVAAYKVSSKV 270


>emb|CAN63743.1| hypothetical protein VITISV_041630 [Vitis vinifera]
          Length = 410

 Score =  266 bits (680), Expect = 4e-68
 Identities = 155/269 (57%), Positives = 185/269 (68%), Gaps = 1/269 (0%)
 Frame = -1

Query: 1295 VYTARIGVNRKLPPPRRLHMFRLHCS-DSDPGRGFGRPSTDKNSKKATRDTKSRREKDVV 1119
            VY    G+NRK   P  L  FR+HCS DS P RGFG P   +  KK  +   S    ++ 
Sbjct: 13   VYGYGCGLNRK---PPSLLTFRIHCSSDSKPTRGFG-PQPPQRDKKYFQSLMSIDAGNLH 68

Query: 1118 VQQRKSATNQSGSIPRQAPGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEE 939
            +           ++ R APGL+ R+ GKS + A D++FEERLEAV+R+ALEQKKADE +E
Sbjct: 69   LNNL--------ALYRLAPGLSSRSGGKSNDAAIDLDFEERLEAVRRTALEQKKADEKKE 120

Query: 938  YGAIDYDAPIESKGSTIGLGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKP 759
            YGAIDYD P+ES+  TIGLGTKI               GDFLPSGS SP+EEA VV++K 
Sbjct: 121  YGAIDYDTPVESEEKTIGLGTKIGVGVAVVVFGLVFALGDFLPSGSDSPSEEATVVSKKL 180

Query: 758  SGEEKLKLQTRLQQYEATLKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDP 579
            S EEK  LQ RLQQYEATL  SP+D TALE AAVTL ELGEY+RAASLLED  K++P+DP
Sbjct: 181  SEEEKATLQARLQQYEATLSSSPKDQTALEAAAVTLVELGEYTRAASLLEDFVKEKPNDP 240

Query: 578  DALRLLGEVKYQLKDYEGSAAAYKSSAMV 492
            +A RLLGEVK+ LKDYEGSAAAY+SSA V
Sbjct: 241  EAFRLLGEVKFALKDYEGSAAAYRSSAKV 269


>gb|KHN39380.1| hypothetical protein glysoja_018152 [Glycine soja]
          Length = 379

 Score =  265 bits (677), Expect = 9e-68
 Identities = 146/248 (58%), Positives = 178/248 (71%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            F+++CSDS  GRGFG             +T S R K V +    S+  QS  +  QAP L
Sbjct: 17   FQINCSDSKQGRGFGE------------NTNSNRIK-VSLSVSYSSNKQSRPLSSQAPRL 63

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
            + + +GKSRN   DV+FEERL+AV+RSALEQKKA+E +E+GAIDYDAPI S  +TIG+GT
Sbjct: 64   SSQLDGKSRNDFLDVDFEERLKAVRRSALEQKKAEEEKEFGAIDYDAPIPSDNTTIGVGT 123

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGSVSPTE++ VVN K S E+K  LQ+RL+++EATL  
Sbjct: 124  KIGVGVAVAVFGLVFAFGDFLPSGSVSPTEDSAVVNSKLSEEDKATLQSRLKEFEATLSN 183

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP DPTALEGAA+TL ELGEY+RAASLL+DLTK++P+D D  RLLGEVKY+LKDYEGS A
Sbjct: 184  SPRDPTALEGAAITLAELGEYARAASLLDDLTKEKPNDADVFRLLGEVKYELKDYEGSVA 243

Query: 515  AYKSSAMV 492
            AYKSSA V
Sbjct: 244  AYKSSARV 251


>gb|KHN38106.1| hypothetical protein glysoja_007148 [Glycine soja]
          Length = 393

 Score =  265 bits (676), Expect = 1e-67
 Identities = 143/248 (57%), Positives = 175/248 (70%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            F+++CSDS  GRGFG  +   ++    +  KS          + S T QS  +  QAP L
Sbjct: 20   FQINCSDSKQGRGFGENTNSNSNSNRIKTNKS---------DKGSTTKQSRPLSSQAPRL 70

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
            + + +GKSRN   DV+FEERL+AV+RSALEQKKA+E +E+GAIDYDAPI S   TIGLGT
Sbjct: 71   SSQLDGKSRNDFLDVDFEERLKAVRRSALEQKKAEEEKEFGAIDYDAPIPSDNKTIGLGT 130

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGSVSPTE++ VVN K S E+K  LQ+RL+++EATL  
Sbjct: 131  KIGVGVAVAVFGLVFAFGDFLPSGSVSPTEDSAVVNSKLSEEDKATLQSRLKEFEATLSN 190

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP D  ALEGAAVTL ELGEY+RA+SLL+DLTK++P+D D  RLLGEVKY+LKDYEGS A
Sbjct: 191  SPRDQIALEGAAVTLAELGEYARASSLLDDLTKEKPNDADVFRLLGEVKYELKDYEGSVA 250

Query: 515  AYKSSAMV 492
            AYKSSA V
Sbjct: 251  AYKSSARV 258


>ref|XP_006379387.1| hypothetical protein POPTR_0008s00440g [Populus trichocarpa]
            gi|118486611|gb|ABK95143.1| unknown [Populus trichocarpa]
            gi|550332068|gb|ERP57184.1| hypothetical protein
            POPTR_0008s00440g [Populus trichocarpa]
          Length = 405

 Score =  265 bits (676), Expect = 1e-67
 Identities = 141/249 (56%), Positives = 176/249 (70%), Gaps = 1/249 (0%)
 Frame = -1

Query: 1235 FRLHCSD-SDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPG 1059
            F + CSD S P RGFG  S +  + K  R + SR EK + +QQRKS T QS     +APG
Sbjct: 21   FGVQCSDNSSPRRGFGSKSDNNTNNKKVRSSSSREEKGMALQQRKSTTKQS-----EAPG 75

Query: 1058 LNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLG 879
            L+ R +GKS   + D +FEERL+AV+RSALEQKK + ++E+G IDYD P++++  TIGLG
Sbjct: 76   LSSRFDGKSSRNSADTDFEERLQAVRRSALEQKKTEAIKEFGPIDYDEPVKTENKTIGLG 135

Query: 878  TKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLK 699
            TKI               GDFLPSGS  PTEEA VVN+K S EE+  L+ RL+QYE TL 
Sbjct: 136  TKIGVGVAVLVFGLVFALGDFLPSGSDGPTEEATVVNKKLSEEEQNTLRARLKQYELTLS 195

Query: 698  ISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSA 519
             +P+D  ALEGAAVTL ELGEY+RAASLL+DL K++P DPD  RLLGE+KY+LKDY+GSA
Sbjct: 196  TAPKDSIALEGAAVTLAELGEYTRAASLLQDLAKEKPGDPDVFRLLGEIKYELKDYDGSA 255

Query: 518  AAYKSSAMV 492
            AAY+ SA V
Sbjct: 256  AAYRISAAV 264


>ref|XP_003531780.1| PREDICTED: uncharacterized protein LOC100777868 [Glycine max]
            gi|947096099|gb|KRH44684.1| hypothetical protein
            GLYMA_08G225400 [Glycine max] gi|947096100|gb|KRH44685.1|
            hypothetical protein GLYMA_08G225400 [Glycine max]
          Length = 393

 Score =  262 bits (669), Expect = 8e-67
 Identities = 142/248 (57%), Positives = 174/248 (70%)
 Frame = -1

Query: 1235 FRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQAPGL 1056
            F+++CSDS  GRGFG  +   ++    +  KS          + S T QS  +  QAP L
Sbjct: 20   FQINCSDSKQGRGFGENTNSNSNSNRIKTNKS---------DKGSTTKQSRPLSSQAPRL 70

Query: 1055 NPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIGLGT 876
            + + +GKSRN   DV+FEERL+AV+RSALEQKKA+E +E+GAIDY API S   TIGLGT
Sbjct: 71   SSQLDGKSRNDFLDVDFEERLKAVRRSALEQKKAEEEKEFGAIDYGAPIPSDNKTIGLGT 130

Query: 875  KIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEATLKI 696
            KI               GDFLPSGSVSPTE++ VVN K S E+K  LQ+RL+++EATL  
Sbjct: 131  KIGVGVAVAVFGLVFAFGDFLPSGSVSPTEDSAVVNSKLSEEDKATLQSRLKEFEATLSN 190

Query: 695  SPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEGSAA 516
            SP D  ALEGAAVTL ELGEY+RA+SLL+DLTK++P+D D  RLLGEVKY+LKDYEGS A
Sbjct: 191  SPRDQIALEGAAVTLAELGEYARASSLLDDLTKEKPNDADVFRLLGEVKYELKDYEGSVA 250

Query: 515  AYKSSAMV 492
            AYKSSA V
Sbjct: 251  AYKSSARV 258


>ref|XP_012450807.1| PREDICTED: uncharacterized protein LOC105773438 isoform X2 [Gossypium
            raimondii]
          Length = 404

 Score =  261 bits (666), Expect = 2e-66
 Identities = 143/251 (56%), Positives = 183/251 (72%)
 Frame = -1

Query: 1244 LHMFRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQA 1065
            L  F++ CSDS+  RGFG     K+++KAT    S+ EK   +QQRK A+ QSG  P  A
Sbjct: 13   LSFFQIQCSDSNRKRGFG---PRKSNQKAT----SKEEKGFNLQQRKLASKQSGPSPAAA 65

Query: 1064 PGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIG 885
            PGL+ + +GKS + + D++FEERL+A++R+ALEQKK +E +E+G IDYDAP+ES+  TIG
Sbjct: 66   PGLSVQFDGKSNSRSLDIDFEERLKAIRRAALEQKKVEEQKEFGPIDYDAPVESEKKTIG 125

Query: 884  LGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEAT 705
            LGTKI               GDFLP GSV+P EEA V+++K S E+K  L+TRL Q+EAT
Sbjct: 126  LGTKIGVGIAVAVFGLVFSLGDFLP-GSVNPPEEAAVIDKKLSREQKAILETRLAQFEAT 184

Query: 704  LKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEG 525
            L  SPED TALEGAAVTLTELG+Y+RA SLL++L K + SDP+  RLLGEVKY+LKDY+G
Sbjct: 185  LSTSPEDETALEGAAVTLTELGDYARATSLLQELVKVKTSDPEVFRLLGEVKYELKDYDG 244

Query: 524  SAAAYKSSAMV 492
            SAAAYK SA V
Sbjct: 245  SAAAYKLSAAV 255


>gb|KJB66490.1| hypothetical protein B456_010G141500 [Gossypium raimondii]
          Length = 364

 Score =  261 bits (666), Expect = 2e-66
 Identities = 143/251 (56%), Positives = 183/251 (72%)
 Frame = -1

Query: 1244 LHMFRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQA 1065
            L  F++ CSDS+  RGFG     K+++KAT    S+ EK   +QQRK A+ QSG  P  A
Sbjct: 13   LSFFQIQCSDSNRKRGFG---PRKSNQKAT----SKEEKGFNLQQRKLASKQSGPSPAAA 65

Query: 1064 PGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIG 885
            PGL+ + +GKS + + D++FEERL+A++R+ALEQKK +E +E+G IDYDAP+ES+  TIG
Sbjct: 66   PGLSVQFDGKSNSRSLDIDFEERLKAIRRAALEQKKVEEQKEFGPIDYDAPVESEKKTIG 125

Query: 884  LGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEAT 705
            LGTKI               GDFLP GSV+P EEA V+++K S E+K  L+TRL Q+EAT
Sbjct: 126  LGTKIGVGIAVAVFGLVFSLGDFLP-GSVNPPEEAAVIDKKLSREQKAILETRLAQFEAT 184

Query: 704  LKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEG 525
            L  SPED TALEGAAVTLTELG+Y+RA SLL++L K + SDP+  RLLGEVKY+LKDY+G
Sbjct: 185  LSTSPEDETALEGAAVTLTELGDYARATSLLQELVKVKTSDPEVFRLLGEVKYELKDYDG 244

Query: 524  SAAAYKSSAMV 492
            SAAAYK SA V
Sbjct: 245  SAAAYKLSAAV 255


>gb|KJB66489.1| hypothetical protein B456_010G141500 [Gossypium raimondii]
          Length = 393

 Score =  261 bits (666), Expect = 2e-66
 Identities = 143/251 (56%), Positives = 183/251 (72%)
 Frame = -1

Query: 1244 LHMFRLHCSDSDPGRGFGRPSTDKNSKKATRDTKSRREKDVVVQQRKSATNQSGSIPRQA 1065
            L  F++ CSDS+  RGFG     K+++KAT    S+ EK   +QQRK A+ QSG  P  A
Sbjct: 13   LSFFQIQCSDSNRKRGFG---PRKSNQKAT----SKEEKGFNLQQRKLASKQSGPSPAAA 65

Query: 1064 PGLNPRTEGKSRNIAFDVEFEERLEAVKRSALEQKKADEVEEYGAIDYDAPIESKGSTIG 885
            PGL+ + +GKS + + D++FEERL+A++R+ALEQKK +E +E+G IDYDAP+ES+  TIG
Sbjct: 66   PGLSVQFDGKSNSRSLDIDFEERLKAIRRAALEQKKVEEQKEFGPIDYDAPVESEKKTIG 125

Query: 884  LGTKIXXXXXXXXXXXXXXXGDFLPSGSVSPTEEAPVVNEKPSGEEKLKLQTRLQQYEAT 705
            LGTKI               GDFLP GSV+P EEA V+++K S E+K  L+TRL Q+EAT
Sbjct: 126  LGTKIGVGIAVAVFGLVFSLGDFLP-GSVNPPEEAAVIDKKLSREQKAILETRLAQFEAT 184

Query: 704  LKISPEDPTALEGAAVTLTELGEYSRAASLLEDLTKKRPSDPDALRLLGEVKYQLKDYEG 525
            L  SPED TALEGAAVTLTELG+Y+RA SLL++L K + SDP+  RLLGEVKY+LKDY+G
Sbjct: 185  LSTSPEDETALEGAAVTLTELGDYARATSLLQELVKVKTSDPEVFRLLGEVKYELKDYDG 244

Query: 524  SAAAYKSSAMV 492
            SAAAYK SA V
Sbjct: 245  SAAAYKLSAAV 255


Top