BLASTX nr result

ID: Cornus23_contig00018079

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00018079
         (903 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008234961.1| PREDICTED: uncharacterized protein LOC103333...   223   2e-55
ref|XP_007205638.1| hypothetical protein PRUPE_ppa009241mg [Prun...   221   6e-55
ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592...   219   2e-54
emb|CDP01491.1| unnamed protein product [Coffea canephora]            218   4e-54
ref|XP_012081607.1| PREDICTED: uncharacterized protein LOC105641...   216   2e-53
ref|XP_010312631.1| PREDICTED: uncharacterized protein LOC101259...   216   2e-53
ref|XP_009802512.1| PREDICTED: uncharacterized protein LOC104248...   207   1e-50
ref|XP_010245835.1| PREDICTED: uncharacterized protein LOC104589...   206   1e-50
ref|XP_009604212.1| PREDICTED: uncharacterized protein LOC104099...   204   1e-49
ref|XP_010087627.1| hypothetical protein L484_022154 [Morus nota...   201   5e-49
ref|XP_010260511.1| PREDICTED: uncharacterized protein LOC104599...   198   4e-48
ref|XP_007047892.1| Uncharacterized protein isoform 1 [Theobroma...   198   5e-48
ref|XP_010245837.1| PREDICTED: uncharacterized protein LOC104589...   196   2e-47
ref|XP_010027912.1| PREDICTED: uncharacterized protein LOC104418...   195   3e-47
ref|XP_011097551.1| PREDICTED: uncharacterized protein LOC105176...   194   1e-46
ref|XP_011022815.1| PREDICTED: uncharacterized protein LOC105124...   194   1e-46
ref|XP_007047893.1| Uncharacterized protein isoform 2 [Theobroma...   191   8e-46
gb|KGN48737.1| hypothetical protein Csa_6G499810 [Cucumis sativus]    188   6e-45
ref|XP_004143460.1| PREDICTED: uncharacterized protein LOC101207...   188   6e-45
ref|XP_012469812.1| PREDICTED: uncharacterized protein LOC105787...   185   5e-44

>ref|XP_008234961.1| PREDICTED: uncharacterized protein LOC103333834 [Prunus mume]
          Length = 300

 Score =  223 bits (567), Expect = 2e-55
 Identities = 131/217 (60%), Positives = 146/217 (67%), Gaps = 5/217 (2%)
 Frame = -2

Query: 638 PHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGIDN-FNFDVFLSIAEFLCLX 462
           PHF I     P  ++  S  +H   LRVY+S+GT+  +D ++  FN D FLS+AEFLCL 
Sbjct: 39  PHFPISNHSLPNPHNSTSLSSHHSRLRVYESDGTLQSNDVVNGAFNLDYFLSVAEFLCLA 98

Query: 461 XXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVAFGASIRRRQWRRICVDY 282
                             +K  +V +GN VL    V LV AV  GA IR RQWRRIC + 
Sbjct: 99  SSALVSVGFALNCAVLSLKKTALVAMGNNVLASGAVALVMAVGIGAWIRMRQWRRICRES 158

Query: 281 XXXXXXXX----IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKDPIAETAALAQ 114
                       IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRK LK+PIAETAALAQ
Sbjct: 159 VKGGLEVNLFERIEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQ 218

Query: 113 KNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           KNSEATRALAVQED LEKELGEIQKVLLAMQEQQQKQ
Sbjct: 219 KNSEATRALAVQEDNLEKELGEIQKVLLAMQEQQQKQ 255


>ref|XP_007205638.1| hypothetical protein PRUPE_ppa009241mg [Prunus persica]
           gi|462401280|gb|EMJ06837.1| hypothetical protein
           PRUPE_ppa009241mg [Prunus persica]
          Length = 300

 Score =  221 bits (563), Expect = 6e-55
 Identities = 130/217 (59%), Positives = 146/217 (67%), Gaps = 5/217 (2%)
 Frame = -2

Query: 638 PHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGIDN-FNFDVFLSIAEFLCLX 462
           PHF I     P  ++  S  +H   LRVY+S+GT+  +D ++  FN D FL++AEFLCL 
Sbjct: 39  PHFPISNHSLPNPHNSTSLSSHHSRLRVYESDGTLQSNDVVNGAFNLDYFLTVAEFLCLA 98

Query: 461 XXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVAFGASIRRRQWRRICVDY 282
                             +K  +V +GN VL    V LV AV  GA IR RQWRRIC + 
Sbjct: 99  SSAIVSVGFALNCAVLSLKKTALVAMGNSVLASGAVALVMAVGIGAWIRMRQWRRICRES 158

Query: 281 XXXXXXXX----IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKDPIAETAALAQ 114
                       IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRK LK+PIAETAALAQ
Sbjct: 159 VKGGLEVNLFERIEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQ 218

Query: 113 KNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           KNSEATRALAVQED LEKELGEIQKVLLAMQEQQQKQ
Sbjct: 219 KNSEATRALAVQEDNLEKELGEIQKVLLAMQEQQQKQ 255


>ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592816 [Solanum tuberosum]
          Length = 313

 Score =  219 bits (558), Expect = 2e-54
 Identities = 133/239 (55%), Positives = 151/239 (63%), Gaps = 9/239 (3%)
 Frame = -2

Query: 692 LKNPGFHFSIHSKCFSNPPHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGID 513
           LKNP     +H K    P    I +++ P L+     + HQW ++ +DSEGTV      +
Sbjct: 21  LKNPKISTPLHLKPLKTP---LIFRTQKPHLDKIEFLQCHQWKVKSFDSEGTVNGQVSAE 77

Query: 512 -NFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAV 336
             FNFD FLSI EFLCL                  SQK    WLGNRVL  Q V+LV  V
Sbjct: 78  YEFNFDGFLSILEFLCLLSSAVVAIGFAVNSWVLGSQK----WLGNRVLAAQCVVLVGGV 133

Query: 335 AFGASIRRRQWRRICV--------DYXXXXXXXXIEKLEEDLRSSATIIRVLSRQLEKLG 180
             G+ IRRRQWRRIC+        D         IEK+EEDLRSSATIIRVLSRQLEKLG
Sbjct: 134 IIGSVIRRRQWRRICMNKFSRSGSDLKGVNLLERIEKVEEDLRSSATIIRVLSRQLEKLG 193

Query: 179 IRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           IRFRVTRK LKDPI E A LAQKNSEATRALA+Q++ LEKELGEIQKVLLAMQ+QQ KQ
Sbjct: 194 IRFRVTRKTLKDPITEAAMLAQKNSEATRALALQDERLEKELGEIQKVLLAMQDQQHKQ 252


>emb|CDP01491.1| unnamed protein product [Coffea canephora]
          Length = 326

 Score =  218 bits (556), Expect = 4e-54
 Identities = 133/243 (54%), Positives = 151/243 (62%), Gaps = 11/243 (4%)
 Frame = -2

Query: 698 TPLKNPGFHFSIHSKCFSNPPHFQILKSKSPKLNDFIS-FRTHQWPLRV--YDSEGTVPK 528
           T L+NP      + K F NP  F   K    K   FI   + H W ++V   D +GTV +
Sbjct: 25  THLRNPRLPLLSYPKPFLNPSRFHAQKPHFLKFATFIPPIQNHTWSIQVKSLDLDGTVGE 84

Query: 527 SDGIDN-----FNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVW 363
               +N      NFD FLSI EF CL                   Q+ V  WLG + +VW
Sbjct: 85  ESRSENPANWDVNFDAFLSILEFFCLVSSIAISGILAVNSGFLGGQRMVFRWLGEKGMVW 144

Query: 362 QFVLLVSAVAFGASIRRRQWRRIC-VDYXXXXXXXX--IEKLEEDLRSSATIIRVLSRQL 192
           Q V+LV+ V  GA IRRRQWRRIC   Y          IEKLEE+ +SSAT+IR LSRQL
Sbjct: 145 QCVVLVAGVLVGAVIRRRQWRRICQAKYFSRPVNLVERIEKLEENFKSSATVIRALSRQL 204

Query: 191 EKLGIRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQ 12
           EKLGIRFRV RK LK+PIAETAALAQKNSEATRALA+QEDILEKELGEIQKVLLAMQEQQ
Sbjct: 205 EKLGIRFRVFRKALKEPIAETAALAQKNSEATRALAIQEDILEKELGEIQKVLLAMQEQQ 264

Query: 11  QKQ 3
           QKQ
Sbjct: 265 QKQ 267


>ref|XP_012081607.1| PREDICTED: uncharacterized protein LOC105641632 [Jatropha curcas]
           gi|802673470|ref|XP_012081608.1| PREDICTED:
           uncharacterized protein LOC105641632 [Jatropha curcas]
           gi|643718519|gb|KDP29713.1| hypothetical protein
           JCGZ_18648 [Jatropha curcas]
          Length = 319

 Score =  216 bits (549), Expect = 2e-53
 Identities = 131/249 (52%), Positives = 156/249 (62%), Gaps = 6/249 (2%)
 Frame = -2

Query: 731 RISFSFRIPHVTPLKNPGFHFSIHSKCFSNPPHFQILKSKSPKLNDFISFRTHQWPLRVY 552
           RI+F F  P    L+NP     I S+  +   H Q  + K PK +   + +++ +PL+ Y
Sbjct: 15  RITFYFTTPISLSLQNPNISPRILSRHLATSFHCQTFRYK-PKSSLNFTLKSNSFPLKAY 73

Query: 551 DSEGTVPKSDGIDNFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLG-NR 375
             +G VP     D FN D FLSIAE LC+                  S++ V   +G NR
Sbjct: 74  QYDGVVPTPSS-DGFNLDAFLSIAEILCIISSAVVTVCYAVNSTFLSSKRTVFAVIGSNR 132

Query: 374 VLVWQFVLLVSAVAFGASIRRRQWRRIC-----VDYXXXXXXXXIEKLEEDLRSSATIIR 210
            L W  V+++  V  GA IR+RQW R C                IEKLEEDLRSSATIIR
Sbjct: 133 ALAWGLVVMMGGVLIGALIRKRQWLRFCRVTVREGRESVNLVERIEKLEEDLRSSATIIR 192

Query: 209 VLSRQLEKLGIRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLL 30
           VLSRQLEKLGIRFRVTRK LK+PIAETAALA+KNSEATRALA+QEDILEKELGEIQKVLL
Sbjct: 193 VLSRQLEKLGIRFRVTRKALKEPIAETAALAKKNSEATRALAMQEDILEKELGEIQKVLL 252

Query: 29  AMQEQQQKQ 3
           AMQEQQ+KQ
Sbjct: 253 AMQEQQEKQ 261


>ref|XP_010312631.1| PREDICTED: uncharacterized protein LOC101259600 [Solanum
           lycopersicum]
          Length = 310

 Score =  216 bits (549), Expect = 2e-53
 Identities = 131/239 (54%), Positives = 150/239 (62%), Gaps = 9/239 (3%)
 Frame = -2

Query: 692 LKNPGFHFSIHSKCFSNPPHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGID 513
           LKNP     +H K  + P    I +++ P L+     + HQW ++ +DSEGTV      +
Sbjct: 21  LKNPKISTPLHLKPLNTP---LIFRTQKPHLDKIEFLQCHQWKVKSFDSEGTVNGQVSAE 77

Query: 512 -NFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAV 336
             FNFD FLSI EFLCL                  S K    WLGNRVL  Q V+LV  V
Sbjct: 78  YEFNFDGFLSILEFLCLLSSAVVAIGFAVNCWFLGSHK----WLGNRVLAAQCVVLVGGV 133

Query: 335 AFGASIRRRQWRRICV--------DYXXXXXXXXIEKLEEDLRSSATIIRVLSRQLEKLG 180
             G+ IRRRQWRRIC+        D         IEK+EEDLRSSATIIRVLSRQLEKLG
Sbjct: 134 IIGSVIRRRQWRRICMNNFSRPGSDLKGVNMLERIEKVEEDLRSSATIIRVLSRQLEKLG 193

Query: 179 IRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           IRFRVTRK LKDPI E A LAQKNSEATRALA+Q + LEKELGE+QKVLLAMQ+QQ KQ
Sbjct: 194 IRFRVTRKTLKDPITEAAMLAQKNSEATRALALQGERLEKELGEVQKVLLAMQDQQHKQ 252


>ref|XP_009802512.1| PREDICTED: uncharacterized protein LOC104248030 [Nicotiana
           sylvestris]
          Length = 363

 Score =  207 bits (526), Expect = 1e-50
 Identities = 128/239 (53%), Positives = 146/239 (61%), Gaps = 16/239 (6%)
 Frame = -2

Query: 671 FSIHSKCFSNPPHFQILKSK------SPKLNDFISFRTHQWPLRVYDSEGTVPKSDGID- 513
           F   +  FS P  F+  K++       P L    S + HQW ++ ++SEG+V +    + 
Sbjct: 68  FEFKNPIFSTPLDFKPFKTRLCFPTQKPHLLKIESLQCHQWKVKAFESEGSVKEQSLAEF 127

Query: 512 NFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVA 333
            FN D FLSI EFLCL                  SQK    WLGNRVL  Q V+LV  V 
Sbjct: 128 EFNIDAFLSILEFLCLFSSAVVAIGYAVNSWFWGSQK----WLGNRVLGAQCVVLVGGVI 183

Query: 332 FGASIRRRQWRRICV---------DYXXXXXXXXIEKLEEDLRSSATIIRVLSRQLEKLG 180
            G+ IRRRQW RIC                    IEKLEEDLRSSAT+IRVLSRQLEKLG
Sbjct: 184 IGSVIRRRQWSRICTFEFSSRSGSGSRGVNLVERIEKLEEDLRSSATLIRVLSRQLEKLG 243

Query: 179 IRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           IRFRVTRK LKDP+ E AALAQKNSEATRALA+Q + LEKELGEIQKVLLAMQEQQ KQ
Sbjct: 244 IRFRVTRKTLKDPVTEAAALAQKNSEATRALALQGERLEKELGEIQKVLLAMQEQQHKQ 302


>ref|XP_010245835.1| PREDICTED: uncharacterized protein LOC104589273 isoform X1 [Nelumbo
           nucifera]
          Length = 334

 Score =  206 bits (525), Expect = 1e-50
 Identities = 129/227 (56%), Positives = 150/227 (66%), Gaps = 12/227 (5%)
 Frame = -2

Query: 647 SNPPHFQILKSKSPKL-NDFISFRTH-QWPLRVYDSEGTVPKSDGIDN-FNFDVFLSIAE 477
           S+  HF IL+ + P L N   +F    QW +RV++S     + +G+D+  N + FLSI E
Sbjct: 49  SSSLHFGILRRRPPWLRNQQQNFEAGGQWRVRVFESSHDALQREGLDSKLNLEAFLSIVE 108

Query: 476 FLCLXXXXXXXXXXXXXXXXXXS--QKPVIVWLGNRVLVWQFVLLVSAVAFGASIRRRQW 303
            LC+                  S  QK + V L NR+ VWQFVLLV AVA GA +RRRQW
Sbjct: 109 VLCIVPSAVLSVGYAVNWAFFSSPLQKSLQVSLVNRIFVWQFVLLVGAVAAGALVRRRQW 168

Query: 302 RRICVDYXXXXXXXX-------IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKD 144
           RRIC D                IEK+EEDLRSSATIIRVLSRQLEKLG RFRVTRK LK+
Sbjct: 169 RRICRDTIKTGAGGSSVNLIERIEKIEEDLRSSATIIRVLSRQLEKLGTRFRVTRKALKE 228

Query: 143 PIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           PI +TAALAQKNSEATR+LAVQED LEKEL EIQKVLLAMQ+QQQKQ
Sbjct: 229 PITQTAALAQKNSEATRSLAVQEDNLEKELVEIQKVLLAMQDQQQKQ 275


>ref|XP_009604212.1| PREDICTED: uncharacterized protein LOC104099043 [Nicotiana
           tomentosiformis]
          Length = 362

 Score =  204 bits (518), Expect = 1e-49
 Identities = 126/238 (52%), Positives = 147/238 (61%), Gaps = 15/238 (6%)
 Frame = -2

Query: 671 FSIHSKCFSNPPHFQILKSK----SPKLN--DFISFRTHQWPLRVYDSEGTVPKSDGID- 513
           F   +  FS P  F+ LK++    + KL+     S + HQW ++ ++SEG V +    + 
Sbjct: 68  FEFKNPIFSTPLDFKPLKNRLCFPTQKLHLLTIESLQCHQWKVKAFESEGAVKEQSLAEF 127

Query: 512 NFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVA 333
            FN D FLSI EFLCL                  SQK    WLGNRVL  Q V+LV  V 
Sbjct: 128 EFNIDAFLSILEFLCLFSSAVVSIGYAVNSWFLGSQK----WLGNRVLAAQCVVLVGGVV 183

Query: 332 FGASIRRRQWRRICV--------DYXXXXXXXXIEKLEEDLRSSATIIRVLSRQLEKLGI 177
            G+ IRRRQW RIC+                  IEKLEEDLRSS T+IRVLSRQLEKLGI
Sbjct: 184 IGSVIRRRQWSRICMVEFSRSGSGSRGVNLVERIEKLEEDLRSSTTLIRVLSRQLEKLGI 243

Query: 176 RFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           RFR+TRK LKDP+ E A LAQKNSEATRALA+Q + LEKELGEIQKVLLAMQEQQ KQ
Sbjct: 244 RFRITRKTLKDPVTEAATLAQKNSEATRALALQGEHLEKELGEIQKVLLAMQEQQHKQ 301


>ref|XP_010087627.1| hypothetical protein L484_022154 [Morus notabilis]
           gi|587838793|gb|EXB29482.1| hypothetical protein
           L484_022154 [Morus notabilis]
          Length = 374

 Score =  201 bits (512), Expect = 5e-49
 Identities = 123/226 (54%), Positives = 149/226 (65%), Gaps = 5/226 (2%)
 Frame = -2

Query: 665 IHSKCFSNPPHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGIDNFNFDVFLS 486
           I S+ F++  HF I   + P      S RTH++ L V++SEG V +   +D   FD FLS
Sbjct: 36  IASRHFASRSHFHISNRQLPSPCCSNSPRTHRFRLGVFESEGPVRRDGDLD---FDSFLS 92

Query: 485 IAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVW-LGNRVLVWQFVLLVSAVAFGASIRRR 309
           I E LC+                  S+K V+   +GN +L    +++V+ +  GA IRRR
Sbjct: 93  IVETLCVFSSAVVSLGFAVNCVVSSSKKTVMAAAMGNGILSCGMLVMVAGLGIGAWIRRR 152

Query: 308 QWRRICVDYXXXXXXXX----IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKDP 141
           QWRR C               +EKLEEDLR+SAT+IRV+SRQLEKLGIRFRVTRK LK+P
Sbjct: 153 QWRRFCSGSVRGGLEVNLLERVEKLEEDLRNSATLIRVISRQLEKLGIRFRVTRKALKEP 212

Query: 140 IAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           +AETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ
Sbjct: 213 LAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 258


>ref|XP_010260511.1| PREDICTED: uncharacterized protein LOC104599595 [Nelumbo nucifera]
          Length = 333

 Score =  198 bits (504), Expect = 4e-48
 Identities = 124/226 (54%), Positives = 141/226 (62%), Gaps = 11/226 (4%)
 Frame = -2

Query: 647 SNPPHFQILKSKSP----KLNDFISFRTHQWPLRVYDSEGTVPKSDGIDNFNFDVFLSIA 480
           SN  HF++L  +      +L  F + R  QW  R ++ EGT+         N D FLSIA
Sbjct: 49  SNSLHFEVLGQRDSWQRKQLQPFEAGR--QWRSRAFEPEGTLQGERLDGKLNLDAFLSIA 106

Query: 479 EFLCLXXXXXXXXXXXXXXXXXXS-QKPVIVWLGNRVLVWQFVLLVSAVAFGASIRRRQW 303
           E LCL                  S QK + V L N V VWQ+VLLV AVA G  IRRRQW
Sbjct: 107 EVLCLVPSTILTVGYAVNWIYLSSSQKAIQVPLVNGVFVWQYVLLVIAVAVGTLIRRRQW 166

Query: 302 RRICVDYXXXXXXXX------IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKDP 141
           RRIC +               IEKLEEDLR+SATI+RVLSRQLEKLGIRFR+TRK LK+P
Sbjct: 167 RRICRESFKSGGRSSLNLVERIEKLEEDLRNSATIVRVLSRQLEKLGIRFRITRKALKEP 226

Query: 140 IAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
             +T AL QKNSE TRALA +ED LEKELGEIQKVLLAMQEQQQKQ
Sbjct: 227 TTQTTALVQKNSEVTRALAAREDNLEKELGEIQKVLLAMQEQQQKQ 272


>ref|XP_007047892.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508700153|gb|EOX92049.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 316

 Score =  198 bits (503), Expect = 5e-48
 Identities = 131/252 (51%), Positives = 156/252 (61%), Gaps = 17/252 (6%)
 Frame = -2

Query: 707 PHVTP-LKNPGFHFSIHSKCFS-NPPHFQILKSKSPKLNDFISFRTHQWP----LRVYDS 546
           PH+ P LKNP     I ++  S    + QIL  ++    +F++F++        L+ Y+S
Sbjct: 14  PHLNPNLKNPNSFPPITTRHLSFTLSNSQILHFRT---RNFLNFKSPHPSSHSLLKAYES 70

Query: 545 EGTVPKSDG----IDNFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGN 378
           + ++  S       ++FN D FLSIAEFLC+                      ++  +  
Sbjct: 71  DSSIAASQEQNPIFNDFNLDSFLSIAEFLCILSSAVVSVVGAVSGWKGV----ILGGIWR 126

Query: 377 RVLVWQFVLLVSAVAFGASIRRRQWRRICVDYXXXXXXXX-------IEKLEEDLRSSAT 219
           RV+VW  V LVS VA GA IRRRQWRRIC +                IEKLEEDLRS AT
Sbjct: 127 RVMVWGIVGLVSGVAIGAWIRRRQWRRICAETVKGGGGGKNLNLIGRIEKLEEDLRSYAT 186

Query: 218 IIRVLSRQLEKLGIRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQK 39
           I R LSRQLEKLGIRFRVTRK LK+PIAETAALAQKNSEATRALAVQEDILEKELGEIQK
Sbjct: 187 ITRALSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQEDILEKELGEIQK 246

Query: 38  VLLAMQEQQQKQ 3
           VLLAMQEQQ KQ
Sbjct: 247 VLLAMQEQQGKQ 258


>ref|XP_010245837.1| PREDICTED: uncharacterized protein LOC104589273 isoform X2 [Nelumbo
           nucifera]
          Length = 277

 Score =  196 bits (499), Expect = 2e-47
 Identities = 124/223 (55%), Positives = 145/223 (65%), Gaps = 12/223 (5%)
 Frame = -2

Query: 647 SNPPHFQILKSKSPKL-NDFISFRTH-QWPLRVYDSEGTVPKSDGIDN-FNFDVFLSIAE 477
           S+  HF IL+ + P L N   +F    QW +RV++S     + +G+D+  N + FLSI E
Sbjct: 49  SSSLHFGILRRRPPWLRNQQQNFEAGGQWRVRVFESSHDALQREGLDSKLNLEAFLSIVE 108

Query: 476 FLCLXXXXXXXXXXXXXXXXXXS--QKPVIVWLGNRVLVWQFVLLVSAVAFGASIRRRQW 303
            LC+                  S  QK + V L NR+ VWQFVLLV AVA GA +RRRQW
Sbjct: 109 VLCIVPSAVLSVGYAVNWAFFSSPLQKSLQVSLVNRIFVWQFVLLVGAVAAGALVRRRQW 168

Query: 302 RRICVDYXXXXXXXX-------IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKD 144
           RRIC D                IEK+EEDLRSSATIIRVLSRQLEKLG RFRVTRK LK+
Sbjct: 169 RRICRDTIKTGAGGSSVNLIERIEKIEEDLRSSATIIRVLSRQLEKLGTRFRVTRKALKE 228

Query: 143 PIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQ 15
           PI +TAALAQKNSEATR+LAVQED LEKEL EIQKVLLAMQ +
Sbjct: 229 PITQTAALAQKNSEATRSLAVQEDNLEKELVEIQKVLLAMQAE 271


>ref|XP_010027912.1| PREDICTED: uncharacterized protein LOC104418308 [Eucalyptus
           grandis] gi|629088299|gb|KCW54552.1| hypothetical
           protein EUGRSUZ_I00513 [Eucalyptus grandis]
          Length = 306

 Score =  195 bits (496), Expect = 3e-47
 Identities = 115/192 (59%), Positives = 132/192 (68%), Gaps = 5/192 (2%)
 Frame = -2

Query: 563 LRVYDSEGTVPKSDGIDNFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWL 384
           +R Y S+G V + +G+ +F+FD FLSIAE LCL                   ++      
Sbjct: 76  VRAYPSDGAV-RGEGVSHFDFDSFLSIAELLCLVSSAVVSVVFAV-------KRAAFGAA 127

Query: 383 GNRVLVWQFVLLVSAVAFGASIRRRQWRRICVD-----YXXXXXXXXIEKLEEDLRSSAT 219
           G+RVL W  V LV  VA GA +RRRQWRR+                 +EKLEEDLRSS T
Sbjct: 128 GDRVLGWLVVALVGGVASGAWVRRRQWRRVFEQPGKAAVPNVNLVERVEKLEEDLRSSTT 187

Query: 218 IIRVLSRQLEKLGIRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQK 39
           +IRVLSRQLEKLGIRFRVTRK LK+PIAETAALAQKNSEATRALA+QEDILEKELGEIQK
Sbjct: 188 MIRVLSRQLEKLGIRFRVTRKTLKEPIAETAALAQKNSEATRALAMQEDILEKELGEIQK 247

Query: 38  VLLAMQEQQQKQ 3
           VLLAMQ+QQQKQ
Sbjct: 248 VLLAMQDQQQKQ 259


>ref|XP_011097551.1| PREDICTED: uncharacterized protein LOC105176447 [Sesamum indicum]
          Length = 318

 Score =  194 bits (492), Expect = 1e-46
 Identities = 115/180 (63%), Positives = 129/180 (71%), Gaps = 8/180 (4%)
 Frame = -2

Query: 518 IDNFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSA 339
           +++F+FD FLS  EFL L                   Q  V+  +G+++LVWQ V+LVS+
Sbjct: 83  LNHFSFDAFLSTLEFLSLASSAAISVYVALSSGVQ--QGGVLGRVGSKILVWQCVVLVSS 140

Query: 338 VAFGASIRRRQWRRIC--------VDYXXXXXXXXIEKLEEDLRSSATIIRVLSRQLEKL 183
           V  GA IRRRQWRRIC          Y        +EKLEEDLRSSATIIRVLSRQLEKL
Sbjct: 141 VVVGAVIRRRQWRRICGAGFSRSSASYGVNLLGR-VEKLEEDLRSSATIIRVLSRQLEKL 199

Query: 182 GIRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           GIR RVTRK L++PIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ
Sbjct: 200 GIRVRVTRKALQEPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 259


>ref|XP_011022815.1| PREDICTED: uncharacterized protein LOC105124481 isoform X1 [Populus
           euphratica]
          Length = 317

 Score =  194 bits (492), Expect = 1e-46
 Identities = 129/270 (47%), Positives = 158/270 (58%), Gaps = 7/270 (2%)
 Frame = -2

Query: 791 HTEHHYXXXXXXXXXXXXXSRISFSFRIPHVTPLKNPGFHFSIHSKCFSNPPHFQILKSK 612
           H  HH+                S S  + ++T  ++   + S+HS  F    HF   K +
Sbjct: 7   HHHHHHHLLFNNSPHRITLLFTSTSLSLRNLTLSRH--LNTSLHSHNF----HF---KPQ 57

Query: 611 SPKLNDFISFRTHQWPLRVYDSEGTVPKSDGIDNFNFDVFLSIAEFLCLXXXXXXXXXXX 432
           +PK        +  + L+ Y S+ T+P  D    FN D FLS+AE LC+           
Sbjct: 58  TPK-------SSFNFTLKAYQSDPTIPTQDS-KQFNLDQFLSVAELLCIFSSSIITISYA 109

Query: 431 XXXXXXXSQKPVIVWLG-NRVLVWQFVLLVSAVAFGASIRRRQWRRICVDYXXXXXXXX- 258
                  S++ V+  +G N    W  V++VS V  GA IRRRQW ++  +          
Sbjct: 110 LNYTVLNSKRGVLGVIGSNTGFAWGMVVMVSGVVIGAWIRRRQWWQVSRETGREGSRESL 169

Query: 257 -----IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKDPIAETAALAQKNSEATR 93
                IEKLEED+RSSATIIRVLSRQLEKLGIRFRVTRK LK+PIAETAALAQKNS+ATR
Sbjct: 170 NLVGRIEKLEEDVRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSDATR 229

Query: 92  ALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           ALAVQEDILEKELGEIQKVLLAMQEQQQKQ
Sbjct: 230 ALAVQEDILEKELGEIQKVLLAMQEQQQKQ 259


>ref|XP_007047893.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508700154|gb|EOX92050.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 313

 Score =  191 bits (484), Expect = 8e-46
 Identities = 127/249 (51%), Positives = 153/249 (61%), Gaps = 17/249 (6%)
 Frame = -2

Query: 707 PHVTP-LKNPGFHFSIHSKCFS-NPPHFQILKSKSPKLNDFISFRTHQWP----LRVYDS 546
           PH+ P LKNP     I ++  S    + QIL  ++    +F++F++        L+ Y+S
Sbjct: 14  PHLNPNLKNPNSFPPITTRHLSFTLSNSQILHFRT---RNFLNFKSPHPSSHSLLKAYES 70

Query: 545 EGTVPKSDG----IDNFNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGN 378
           + ++  S       ++FN D FLSIAEFLC+                      ++  +  
Sbjct: 71  DSSIAASQEQNPIFNDFNLDSFLSIAEFLCILSSAVVSVVGAVSGWKGV----ILGGIWR 126

Query: 377 RVLVWQFVLLVSAVAFGASIRRRQWRRICVDYXXXXXXXX-------IEKLEEDLRSSAT 219
           RV+VW  V LVS VA GA IRRRQWRRIC +                IEKLEEDLRS AT
Sbjct: 127 RVMVWGIVGLVSGVAIGAWIRRRQWRRICAETVKGGGGGKNLNLIGRIEKLEEDLRSYAT 186

Query: 218 IIRVLSRQLEKLGIRFRVTRKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQK 39
           I R LSRQLEKLGIRFRVTRK LK+PIAETAALAQKNSEATRALAVQEDILEKELGEIQK
Sbjct: 187 ITRALSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQEDILEKELGEIQK 246

Query: 38  VLLAMQEQQ 12
           VLLAMQ +Q
Sbjct: 247 VLLAMQGKQ 255


>gb|KGN48737.1| hypothetical protein Csa_6G499810 [Cucumis sativus]
          Length = 378

 Score =  188 bits (477), Expect = 6e-45
 Identities = 118/233 (50%), Positives = 136/233 (58%), Gaps = 4/233 (1%)
 Frame = -2

Query: 689 KNPGFHFSIHSKCFSNPPHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGIDN 510
           +NP          F N  HFQIL  K     +F S   H +  RV  S G   +  G+ +
Sbjct: 101 RNPCVSLPFPPSRFPNTLHFQILDYKFRSPFNFGSINAHHFCPRVSTSGGVGRRPGGVAD 160

Query: 509 FNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVAF 330
           F+ D  LS  EF CL                  S+   +   G+ VLV   + LV+ VA 
Sbjct: 161 FDIDSLLSATEFFCLVASLIGSVGFALNCAKTRSKSLFLAVFGDGVLVGTILFLVAGVAI 220

Query: 329 GASIRRRQWRRICVDYXXXXXXXXI----EKLEEDLRSSATIIRVLSRQLEKLGIRFRVT 162
           GA IRRRQW R+  +         +     KLEEDLRSSAT+IRVLSRQLEKLGIRFRVT
Sbjct: 221 GAWIRRRQWNRVFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVT 280

Query: 161 RKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           RK LK P+ ETAALAQK SEATRALAV+ DILEKEL EIQKVLLAMQEQQQKQ
Sbjct: 281 RKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQ 333


>ref|XP_004143460.1| PREDICTED: uncharacterized protein LOC101207421 [Cucumis sativus]
          Length = 323

 Score =  188 bits (477), Expect = 6e-45
 Identities = 118/233 (50%), Positives = 136/233 (58%), Gaps = 4/233 (1%)
 Frame = -2

Query: 689 KNPGFHFSIHSKCFSNPPHFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDGIDN 510
           +NP          F N  HFQIL  K     +F S   H +  RV  S G   +  G+ +
Sbjct: 46  RNPCVSLPFPPSRFPNTLHFQILDYKFRSPFNFGSINAHHFCPRVSTSGGVGRRPGGVAD 105

Query: 509 FNFDVFLSIAEFLCLXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVAF 330
           F+ D  LS  EF CL                  S+   +   G+ VLV   + LV+ VA 
Sbjct: 106 FDIDSLLSATEFFCLVASLIGSVGFALNCAKTRSKSLFLAVFGDGVLVGTILFLVAGVAI 165

Query: 329 GASIRRRQWRRICVDYXXXXXXXXI----EKLEEDLRSSATIIRVLSRQLEKLGIRFRVT 162
           GA IRRRQW R+  +         +     KLEEDLRSSAT+IRVLSRQLEKLGIRFRVT
Sbjct: 166 GAWIRRRQWNRVFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIRFRVT 225

Query: 161 RKGLKDPIAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           RK LK P+ ETAALAQK SEATRALAV+ DILEKEL EIQKVLLAMQEQQQKQ
Sbjct: 226 RKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQ 278


>ref|XP_012469812.1| PREDICTED: uncharacterized protein LOC105787802 [Gossypium
           raimondii] gi|763750833|gb|KJB18221.1| hypothetical
           protein B456_003G040700 [Gossypium raimondii]
          Length = 311

 Score =  185 bits (469), Expect = 5e-44
 Identities = 117/220 (53%), Positives = 132/220 (60%), Gaps = 9/220 (4%)
 Frame = -2

Query: 635 HFQILKSKSPKLNDFISFRTHQWPLRVYDSEGTVPKSDG----IDNFNFDVFLSIAEFLC 468
           +F  L S+ P  + FI         + YDS  ++  S       D+ N D FLS AE  C
Sbjct: 47  NFLSLSSRHPSSHSFI---------KAYDSGSSIATSSEQNPTFDSINLDSFLSAAELFC 97

Query: 467 LXXXXXXXXXXXXXXXXXXSQKPVIVWLGNRVLVWQFVLLVSAVAFGASIRRRQWRRICV 288
           +                      V+  +  RV+ W  + LVS  A GA IRRRQWRRICV
Sbjct: 98  IFSSAVVSVVYVISDWKGV----VLGGVWRRVMAWNVLGLVSGFAIGAWIRRRQWRRICV 153

Query: 287 DYXXXXXXXX-----IEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKGLKDPIAETAA 123
           +              IEKLEEDL+SS  IIRVLSRQLEKLGIRFRVTRKGLK PI ETAA
Sbjct: 154 ETAKAGGKRLNLVDRIEKLEEDLKSSVAIIRVLSRQLEKLGIRFRVTRKGLKQPIEETAA 213

Query: 122 LAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQ 3
           LAQKNSEATRALA QE+ILEKEL EIQKVLLAMQEQQQKQ
Sbjct: 214 LAQKNSEATRALAAQEEILEKELEEIQKVLLAMQEQQQKQ 253

