BLASTX nr result

ID: Catharanthus23_contig00003221 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00003221
         (1593 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592...   234   6e-59
ref|XP_004250106.1| PREDICTED: uncharacterized protein LOC101259...   224   8e-56
gb|EOX92049.1| Uncharacterized protein isoform 1 [Theobroma cacao]    219   3e-54
gb|EOX92050.1| Uncharacterized protein isoform 2 [Theobroma cacao]    214   6e-53
ref|XP_006466252.1| PREDICTED: uncharacterized protein LOC102620...   207   8e-51
ref|XP_006426358.1| hypothetical protein CICLE_v10026087mg [Citr...   207   1e-50
gb|EXB29482.1| hypothetical protein L484_022154 [Morus notabilis]     199   3e-48
ref|XP_002307074.1| hypothetical protein POPTR_0005s07470g [Popu...   193   2e-46
gb|EMJ06837.1| hypothetical protein PRUPE_ppa009241mg [Prunus pe...   192   3e-46
ref|XP_003531342.1| PREDICTED: uncharacterized protein LOC100809...   190   1e-45
ref|XP_003525047.1| PREDICTED: uncharacterized protein LOC100790...   185   5e-44
ref|XP_004288180.1| PREDICTED: uncharacterized protein LOC101314...   184   9e-44
ref|XP_002269557.2| PREDICTED: uncharacterized protein LOC100244...   183   2e-43
ref|XP_004143460.1| PREDICTED: uncharacterized protein LOC101207...   182   3e-43
ref|XP_004167033.1| PREDICTED: uncharacterized LOC101207421, par...   181   6e-43
ref|XP_004504099.1| PREDICTED: uncharacterized protein LOC101515...   178   5e-42
ref|XP_002309969.1| hypothetical protein POPTR_0007s05170g [Popu...   178   7e-42
ref|XP_006585264.1| PREDICTED: uncharacterized protein LOC100809...   171   8e-40
ref|XP_006394057.1| hypothetical protein EUTSA_v10004756mg [Eutr...   169   3e-39
ref|XP_006280879.1| hypothetical protein CARUB_v10026870mg [Caps...   169   4e-39

>ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592816 [Solanum tuberosum]
          Length = 313

 Score =  234 bits (598), Expect = 6e-59
 Identities = 150/314 (47%), Positives = 184/314 (58%), Gaps = 9/314 (2%)
 Frame = +3

Query: 324  FSLVNCTIDSPKFHCENVKFS-IFRGPQPRLNSLNFCTRNSRFTNFYRVTSHAHRWAIAV 500
            F L N  I +P  H + +K   IFR  +P L+ + F                 H+W   V
Sbjct: 19   FCLKNPKISTP-LHLKPLKTPLIFRTQKPHLDKIEFL--------------QCHQWK--V 61

Query: 501  KPLEQDEMVGRRSYGANEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWF 680
            K  + +  V  +     EFNFD FLSILEFL L SSAV+++ FAVN  + GSQ+    W 
Sbjct: 62   KSFDSEGTVNGQVSAEYEFNFDGFLSILEFLCLLSSAVVAIGFAVNSWVLGSQK----WL 117

Query: 681  GDRFLVWQCXXXXXXXXXXXXIRRRQWRRICSADFSRP------VNLVERIEKLEEDLKN 842
            G+R L  QC            IRRRQWRRIC   FSR       VNL+ERIEK+EEDL++
Sbjct: 118  GNRVLAAQCVVLVGGVIIGSVIRRRQWRRICMNKFSRSGSDLKGVNLLERIEKVEEDLRS 177

Query: 843  SATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGE 1022
            SATIIRVLSRQLEKLGIRFRVTRK LK+P+ E A LAQKNSEATRALA+Q++ LEKELGE
Sbjct: 178  SATIIRVLSRQLEKLGIRFRVTRKTLKDPITEAAMLAQKNSEATRALALQDERLEKELGE 237

Query: 1023 IQKVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHNHEDN--AANCSNPPVDQVSQLDRN 1196
            IQKV                  GK+GKL+EN++  + + N  + + SN   D   QL  N
Sbjct: 238  IQKVLLAMQDQQHKQLELILAIGKTGKLFENKRGLSQDPNKKSNDVSNTAADGFPQLGVN 297

Query: 1197 QIQAFATQKEAGND 1238
            QIQA   Q+E  ND
Sbjct: 298  QIQALKRQRETNND 311


>ref|XP_004250106.1| PREDICTED: uncharacterized protein LOC101259600 [Solanum
            lycopersicum]
          Length = 310

 Score =  224 bits (571), Expect = 8e-56
 Identities = 144/308 (46%), Positives = 177/308 (57%), Gaps = 9/308 (2%)
 Frame = +3

Query: 324  FSLVNCTIDSPKFHCENVKFS-IFRGPQPRLNSLNFCTRNSRFTNFYRVTSHAHRWAIAV 500
            F L N  I +P  H + +    IFR  +P L+ + F                 H+W   V
Sbjct: 19   FCLKNPKISTP-LHLKPLNTPLIFRTQKPHLDKIEFL--------------QCHQWK--V 61

Query: 501  KPLEQDEMVGRRSYGANEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWF 680
            K  + +  V  +     EFNFD FLSILEFL L SSAV+++ FAVN    GS +    W 
Sbjct: 62   KSFDSEGTVNGQVSAEYEFNFDGFLSILEFLCLLSSAVVAIGFAVNCWFLGSHK----WL 117

Query: 681  GDRFLVWQCXXXXXXXXXXXXIRRRQWRRICSADFSRP------VNLVERIEKLEEDLKN 842
            G+R L  QC            IRRRQWRRIC  +FSRP      VN++ERIEK+EEDL++
Sbjct: 118  GNRVLAAQCVVLVGGVIIGSVIRRRQWRRICMNNFSRPGSDLKGVNMLERIEKVEEDLRS 177

Query: 843  SATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGE 1022
            SATIIRVLSRQLEKLGIRFRVTRK LK+P+ E A LAQKNSEATRALA+Q + LEKELGE
Sbjct: 178  SATIIRVLSRQLEKLGIRFRVTRKTLKDPITEAAMLAQKNSEATRALALQGERLEKELGE 237

Query: 1023 IQKVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHNHEDN--AANCSNPPVDQVSQLDRN 1196
            +QKV                  GK+GKL+EN++  + + N    + SN   D   QL  N
Sbjct: 238  VQKVLLAMQDQQHKQLELILAIGKTGKLFENKRGPSQDPNQKTNDMSNTAADGFPQLGVN 297

Query: 1197 QIQAFATQ 1220
            QIQA   Q
Sbjct: 298  QIQALKRQ 305


>gb|EOX92049.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 316

 Score =  219 bits (557), Expect = 3e-54
 Identities = 138/281 (49%), Positives = 171/281 (60%), Gaps = 8/281 (2%)
 Frame = +3

Query: 420  LNFCTRNSRFTNFYRVTSHAHRWAIAVKPLEQDEMVG---RRSYGANEFNFDAFLSILEF 590
            L+F TRN  F NF      +H     +K  E D  +     ++   N+FN D+FLSI EF
Sbjct: 44   LHFRTRN--FLNFKSPHPSSHS---LLKAYESDSSIAASQEQNPIFNDFNLDSFLSIAEF 98

Query: 591  LSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXXXXIRRRQWRRI 770
            L + SSAV+SV+ AV    SG +  + G    R +VW              IRRRQWRRI
Sbjct: 99   LCILSSAVVSVVGAV----SGWKGVILGGIWRRVMVWGIVGLVSGVAIGAWIRRRQWRRI 154

Query: 771  CSADFS-----RPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLA 935
            C+         + +NL+ RIEKLEEDL++ ATI R LSRQLEKLGIRFRVTRKALKEP+A
Sbjct: 155  CAETVKGGGGGKNLNLIGRIEKLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIA 214

Query: 936  ETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXGKSGKLWEN 1115
            ETAALAQKNSEATRALAVQEDILEKELGEIQKV                  GKSGKL+E+
Sbjct: 215  ETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQGKQLELILAIGKSGKLFED 274

Query: 1116 RQQHNHEDNAANCSNPPVDQVSQLDRNQIQAFATQKEAGND 1238
            +++ + E N     N   ++V+Q++ NQ Q   T K +GND
Sbjct: 275  KREPSQEKNTVEACN-LTEEVNQMEINQTQPLGTSKGSGND 314


>gb|EOX92050.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 313

 Score =  214 bits (546), Expect = 6e-53
 Identities = 138/281 (49%), Positives = 171/281 (60%), Gaps = 8/281 (2%)
 Frame = +3

Query: 420  LNFCTRNSRFTNFYRVTSHAHRWAIAVKPLEQDEMVG---RRSYGANEFNFDAFLSILEF 590
            L+F TRN  F NF      +H     +K  E D  +     ++   N+FN D+FLSI EF
Sbjct: 44   LHFRTRN--FLNFKSPHPSSHS---LLKAYESDSSIAASQEQNPIFNDFNLDSFLSIAEF 98

Query: 591  LSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXXXXIRRRQWRRI 770
            L + SSAV+SV+ AV    SG +  + G    R +VW              IRRRQWRRI
Sbjct: 99   LCILSSAVVSVVGAV----SGWKGVILGGIWRRVMVWGIVGLVSGVAIGAWIRRRQWRRI 154

Query: 771  CSADFS-----RPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLA 935
            C+         + +NL+ RIEKLEEDL++ ATI R LSRQLEKLGIRFRVTRKALKEP+A
Sbjct: 155  CAETVKGGGGGKNLNLIGRIEKLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIA 214

Query: 936  ETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXGKSGKLWEN 1115
            ETAALAQKNSEATRALAVQEDILEKELGEIQKV                  GKSGKL+E+
Sbjct: 215  ETAALAQKNSEATRALAVQEDILEKELGEIQKV---LLAMQGKQLELILAIGKSGKLFED 271

Query: 1116 RQQHNHEDNAANCSNPPVDQVSQLDRNQIQAFATQKEAGND 1238
            +++ + E N     N   ++V+Q++ NQ Q   T K +GND
Sbjct: 272  KREPSQEKNTVEACN-LTEEVNQMEINQTQPLGTSKGSGND 311


>ref|XP_006466252.1| PREDICTED: uncharacterized protein LOC102620591 [Citrus sinensis]
          Length = 322

 Score =  207 bits (528), Expect = 8e-51
 Identities = 118/234 (50%), Positives = 151/234 (64%), Gaps = 4/234 (1%)
 Frame = +3

Query: 549  NEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXX 728
            + FN D+ LSI E L L SS+VI++ FAV YG+ G + S+FG  G R L           
Sbjct: 88   DNFNLDSLLSISEVLCLFSSSVIAIGFAVYYGMFGLKSSLFGLIGSRVLACGVVSLVCGV 147

Query: 729  XXXXXIRRRQWRRICS----ADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIR 896
                 IRRRQWRR+C     A+    VNLV RIEKLEED+K+SATI+RVLSRQLEKLG+R
Sbjct: 148  WIGAIIRRRQWRRVCGEKARAEGRESVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVR 207

Query: 897  FRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXX 1076
            FRVTRKALK+P+ + AALAQKN+EATRALA+QED+LEKELGEIQKV              
Sbjct: 208  FRVTRKALKDPITQAAALAQKNAEATRALAMQEDVLEKELGEIQKVLLAMQEQQQKQLEL 267

Query: 1077 XXXXGKSGKLWENRQQHNHEDNAANCSNPPVDQVSQLDRNQIQAFATQKEAGND 1238
                GK+GKL+ENRQ+ + E +    S+  +D   Q++  + +A  + +   ND
Sbjct: 268  ILAIGKTGKLFENRQEPSQEQDKLKTSD-FIDGAKQMETQETEALGSSRGNKND 320


>ref|XP_006426358.1| hypothetical protein CICLE_v10026087mg [Citrus clementina]
            gi|557528348|gb|ESR39598.1| hypothetical protein
            CICLE_v10026087mg [Citrus clementina]
          Length = 322

 Score =  207 bits (526), Expect = 1e-50
 Identities = 118/234 (50%), Positives = 150/234 (64%), Gaps = 4/234 (1%)
 Frame = +3

Query: 549  NEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXX 728
            + FN D+ LSI E + L SS+VI++ FAV YG+ G + S+FG  G R L           
Sbjct: 88   DNFNLDSLLSISEVVCLFSSSVIAIGFAVYYGIFGLKNSLFGLIGSRVLACGVVSLVCGV 147

Query: 729  XXXXXIRRRQWRRICS----ADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIR 896
                 IRRRQWRR+C      +    VNLV RIEKLEED+K+SATI+RVLSRQLEKLG+R
Sbjct: 148  WVGAVIRRRQWRRVCGETVRVEGRERVNLVGRIEKLEEDMKSSATILRVLSRQLEKLGVR 207

Query: 897  FRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXX 1076
            FRVTRKALK+P+ E AALAQKNSEATRALA+Q D+LEKELGEIQKV              
Sbjct: 208  FRVTRKALKDPITEAAALAQKNSEATRALAMQGDVLEKELGEIQKVLLAMQEQQQKQLEL 267

Query: 1077 XXXXGKSGKLWENRQQHNHEDNAANCSNPPVDQVSQLDRNQIQAFATQKEAGND 1238
                GK+GKL+ENRQ+ + E +    S+  +D   Q++  + +AF + +   ND
Sbjct: 268  ILAIGKTGKLFENRQEPSQEQDKLKTSD-FIDGAKQMETQETEAFGSSRGNKND 320


>gb|EXB29482.1| hypothetical protein L484_022154 [Morus notabilis]
          Length = 374

 Score =  199 bits (506), Expect = 3e-48
 Identities = 117/219 (53%), Positives = 143/219 (65%), Gaps = 3/219 (1%)
 Frame = +3

Query: 552  EFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGW-FGDRFLVWQCXXXXXXX 728
            + +FD+FLSI+E L + SSAV+S+ FAVN  +S S+++V     G+  L           
Sbjct: 84   DLDFDSFLSIVETLCVFSSAVVSLGFAVNCVVSSSKKTVMAAAMGNGILSCGMLVMVAGL 143

Query: 729  XXXXXIRRRQWRRICSADF--SRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFR 902
                 IRRRQWRR CS        VNL+ER+EKLEEDL+NSAT+IRV+SRQLEKLGIRFR
Sbjct: 144  GIGAWIRRRQWRRFCSGSVRGGLEVNLLERVEKLEEDLRNSATLIRVISRQLEKLGIRFR 203

Query: 903  VTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXX 1082
            VTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKV                
Sbjct: 204  VTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVLLAMQEQQQKQLELIL 263

Query: 1083 XXGKSGKLWENRQQHNHEDNAANCSNPPVDQVSQLDRNQ 1199
              GK+GKL+E R + + E       +   + + Q + +Q
Sbjct: 264  AIGKTGKLFETRPERSQEQERIEIHDSTAESLKQKESHQ 302


>ref|XP_002307074.1| hypothetical protein POPTR_0005s07470g [Populus trichocarpa]
            gi|222856523|gb|EEE94070.1| hypothetical protein
            POPTR_0005s07470g [Populus trichocarpa]
          Length = 307

 Score =  193 bits (490), Expect = 2e-46
 Identities = 119/272 (43%), Positives = 162/272 (59%), Gaps = 3/272 (1%)
 Frame = +3

Query: 432  TRNSRFTNF-YRVTSHAHRWAIAVKPLEQDEMVGRRSYGANEFNFDAFLSILEFLSLASS 608
            T +   +NF ++  +  + ++  +K  + D  +  R+  +N+FN D FLSI E L + SS
Sbjct: 39   TTSLHSSNFHFKPQTPRNSFSFTLKAYQSDPTI--RTQVSNQFNLDQFLSIAELLCIISS 96

Query: 609  AVISVIFAVNYGLSGSQRSVFGWFGDRF-LVWQCXXXXXXXXXXXXIRRRQWRRIC-SAD 782
            ++I++ +A+N   + S+    G  G      W              IRRRQW RIC    
Sbjct: 97   SIITISYALN--CTFSKTGALGVIGSNTGFAWGMVVMVSGVVIGAWIRRRQWWRICRETG 154

Query: 783  FSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAALAQKN 962
                +NLV RIEKLE+D+++SATIIRVLSRQLEKLGIRFRVTRKALKEP+ ETAALAQKN
Sbjct: 155  REGSLNLVGRIEKLEQDMRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIVETAALAQKN 214

Query: 963  SEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHNHEDN 1142
            SEATRALA+QE+ILEKELGE QK+                  GKSGK W+NR++   E  
Sbjct: 215  SEATRALALQENILEKELGETQKILLAMQEQQQKQLELILAIGKSGKSWDNRRERVEEQE 274

Query: 1143 AANCSNPPVDQVSQLDRNQIQAFATQKEAGND 1238
                S+   + V+QL+ ++ Q   T K + N+
Sbjct: 275  LIKTSD-LTEGVNQLESHEAQPSVTSKRSNNN 305


>gb|EMJ06837.1| hypothetical protein PRUPE_ppa009241mg [Prunus persica]
          Length = 300

 Score =  192 bits (488), Expect = 3e-46
 Identities = 122/252 (48%), Positives = 147/252 (58%), Gaps = 2/252 (0%)
 Frame = +3

Query: 453  NFYRVTSHAHRWAIAVKPLEQDEMVGRRSYGANEFNFDAFLSILEFLSLASSAVISVIFA 632
            N   ++SH  R    ++  E D  +         FN D FL++ EFL LASSA++SV FA
Sbjct: 53   NSTSLSSHHSR----LRVYESDGTLQSNDVVNGAFNLDYFLTVAEFLCLASSAIVSVGFA 108

Query: 633  VNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXXXXIRRRQWRRIC--SADFSRPVNLV 806
            +N  +   +++     G+  L                IR RQWRRIC  S      VNL 
Sbjct: 109  LNCAVLSLKKTALVAMGNSVLASGAVALVMAVGIGAWIRMRQWRRICRESVKGGLEVNLF 168

Query: 807  ERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAALAQKNSEATRALA 986
            ERIEKLEEDL++SATIIRVLSRQLEKLGIRFRVTRKALKEP+AETAALAQKNSEATRALA
Sbjct: 169  ERIEKLEEDLRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALA 228

Query: 987  VQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHNHEDNAANCSNPP 1166
            VQED LEKELGEIQKV                    SGKL E+RQ  + E +     +  
Sbjct: 229  VQEDNLEKELGEIQKVLLAMQEQQQKQLELILAIATSGKLRESRQVRDQEQSTTIIRDSS 288

Query: 1167 VDQVSQLDRNQI 1202
             +   Q + +QI
Sbjct: 289  EEDSKQKEAHQI 300


>ref|XP_003531342.1| PREDICTED: uncharacterized protein LOC100809936 isoform X1 [Glycine
            max]
          Length = 287

 Score =  190 bits (483), Expect = 1e-45
 Identities = 121/258 (46%), Positives = 150/258 (58%), Gaps = 2/258 (0%)
 Frame = +3

Query: 417  SLNFCTRNSRFTNFYRVTSHAHRWAIAVKPLEQDEMVGRRSYGANEFNFDAFLSILEFLS 596
            SL+F    SR    +  T  AHR+      +  D    R  + A + NFD+ LS+LEF  
Sbjct: 32   SLSFSIVTSR--PLHLTTHTAHRFNSLT--VRADSFRLRSEHAAADSNFDSLLSLLEFSC 87

Query: 597  LASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXXXXIRRRQWRRIC- 773
            L SSA+ S   AV   L+GS+  +    G R   +              IRRRQWRR+  
Sbjct: 88   LLSSAISSAAAAV---LAGSKNELIAGIGARAAPFGGALLVVGVLVGAWIRRRQWRRVSV 144

Query: 774  -SADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAAL 950
             +      VNL+ERIEKLEEDL++SAT++RVLSRQLEKLG+RFRVTRK LK+P+AETAAL
Sbjct: 145  EAGKGGLEVNLLERIEKLEEDLRSSATVVRVLSRQLEKLGVRFRVTRKGLKDPIAETAAL 204

Query: 951  AQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHN 1130
            AQKNSEA RALAVQ DILEKELGEIQ+V                  GK+ KLWE++Q+ N
Sbjct: 205  AQKNSEAARALAVQSDILEKELGEIQQVLLAMQEQQRKQLDLILAVGKASKLWESKQETN 264

Query: 1131 HEDNAANCSNPPVDQVSQ 1184
               +    SN   D V Q
Sbjct: 265  ERHDTLELSNSAEDGVKQ 282


>ref|XP_003525047.1| PREDICTED: uncharacterized protein LOC100790782 isoform X1 [Glycine
            max]
          Length = 293

 Score =  185 bits (469), Expect = 5e-44
 Identities = 108/211 (51%), Positives = 133/211 (63%), Gaps = 2/211 (0%)
 Frame = +3

Query: 558  NFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXX 737
            NFD+ LS+LEF  L SSAV S   AV   ++GS+  +    G R   +            
Sbjct: 81   NFDSLLSLLEFSCLLSSAVASAAAAV---VAGSKNELLVGIGTRAAPFGGALLVVGVLVG 137

Query: 738  XXIRRRQWRRIC--SADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTR 911
              IRRRQWRR C  +      VNL+ERIEKLEED+++SAT++RVLSRQLEKLG+RFRVTR
Sbjct: 138  AWIRRRQWRRACVETGKGGLEVNLLERIEKLEEDMRSSATVVRVLSRQLEKLGVRFRVTR 197

Query: 912  KALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXG 1091
            KALK+P+AETAALAQKNSEA RALAVQ DILEKELGEIQ+V                  G
Sbjct: 198  KALKDPIAETAALAQKNSEAARALAVQSDILEKELGEIQQVLLAMQEQQRKQLDLILAIG 257

Query: 1092 KSGKLWENRQQHNHEDNAANCSNPPVDQVSQ 1184
            K+ KLWE++ + +   +    SN   D+V Q
Sbjct: 258  KASKLWESKHETSERHDTLEMSNSAEDEVKQ 288


>ref|XP_004288180.1| PREDICTED: uncharacterized protein LOC101314793 [Fragaria vesca
            subsp. vesca]
          Length = 300

 Score =  184 bits (467), Expect = 9e-44
 Identities = 110/218 (50%), Positives = 132/218 (60%), Gaps = 2/218 (0%)
 Frame = +3

Query: 555  FNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXX 734
            FN D FLS+ E L LASSAV+S+ + ++  + G + + F   G   L             
Sbjct: 85   FNLDYFLSVAELLCLASSAVVSIGYGLSSAVPGWKNAAF--IGGTALGGGAAALVMAVGI 142

Query: 735  XXXIRRRQWRRIC--SADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVT 908
               IRRRQWRR+   +      VNL ERIEKLEEDL++S TI+RVLSRQLEKLGIRFRVT
Sbjct: 143  GAWIRRRQWRRVSRETVKGGLEVNLFERIEKLEEDLRSSVTIVRVLSRQLEKLGIRFRVT 202

Query: 909  RKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXX 1088
            RKALKEP+AETAALAQKNSEATRALA QEDILEKELGE QKV                  
Sbjct: 203  RKALKEPIAETAALAQKNSEATRALAAQEDILEKELGETQKVLLALQEQQQKQFDLILAI 262

Query: 1089 GKSGKLWENRQQHNHEDNAANCSNPPVDQVSQLDRNQI 1202
             KSGKL +N+  H+  ++     +   D   Q +  QI
Sbjct: 263  AKSGKLLDNKHAHDQPESTIRTHDSSTDNSKQKEPQQI 300


>ref|XP_002269557.2| PREDICTED: uncharacterized protein LOC100244969 [Vitis vinifera]
          Length = 193

 Score =  183 bits (464), Expect = 2e-43
 Identities = 104/190 (54%), Positives = 128/190 (67%), Gaps = 4/190 (2%)
 Frame = +3

Query: 681  GDRFLVWQCXXXXXXXXXXXXIRRRQWRRICSADFSRP----VNLVERIEKLEEDLKNSA 848
            G+R L+WQ             IRRRQW RI + D ++P    VNLVER+EK+EED+++ A
Sbjct: 5    GNRILLWQAVALVGGVVVGSWIRRRQWWRIFN-DTAKPGIESVNLVERMEKMEEDIRSMA 63

Query: 849  TIIRVLSRQLEKLGIRFRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQ 1028
            T+IRV+SRQLEKLGIRFRVTRKALK+P+AETA LAQKNSEATRALA+QEDILEKELGEIQ
Sbjct: 64   TLIRVMSRQLEKLGIRFRVTRKALKQPIAETAVLAQKNSEATRALAIQEDILEKELGEIQ 123

Query: 1029 KVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHNHEDNAANCSNPPVDQVSQLDRNQIQA 1208
            KV                  GK+GKLWENR+  + E +A    +    +V Q+  +QI A
Sbjct: 124  KVLLAMQEQQQKQLDLILAIGKAGKLWENRRGQSEEQDAIEACDSA--EVGQMKAHQIPA 181

Query: 1209 FATQKEAGND 1238
             A QK + ND
Sbjct: 182  AARQKGSNND 191


>ref|XP_004143460.1| PREDICTED: uncharacterized protein LOC101207421 [Cucumis sativus]
          Length = 323

 Score =  182 bits (463), Expect = 3e-43
 Identities = 119/258 (46%), Positives = 151/258 (58%), Gaps = 3/258 (1%)
 Frame = +3

Query: 366  CENVKFSIFRGPQPRLNSLNFCTRNSRFTNFYRVTS-HAHRWAIAVKPLEQDEMVGRRSY 542
            C ++ F   R P    N+L+F   + +F + +   S +AH +   V        VGRR  
Sbjct: 49   CVSLPFPPSRFP----NTLHFQILDYKFRSPFNFGSINAHHFCPRVST---SGGVGRRPG 101

Query: 543  GANEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXX 722
            G  +F+ D+ LS  EF  L +S + SV FA+N   + S+      FGD  LV        
Sbjct: 102  GVADFDIDSLLSATEFFCLVASLIGSVGFALNCAKTRSKSLFLAVFGDGVLVGTILFLVA 161

Query: 723  XXXXXXXIRRRQWRRIC--SADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIR 896
                   IRRRQW R+   +A     VNL+E+  KLEEDL++SAT+IRVLSRQLEKLGIR
Sbjct: 162  GVAIGAWIRRRQWNRVFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQLEKLGIR 221

Query: 897  FRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXX 1076
            FRVTRKALK+P+ ETAALAQK SEATRALAV+ DILEKEL EIQKV              
Sbjct: 222  FRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLDL 281

Query: 1077 XXXXGKSGKLWENRQQHN 1130
                G SGK+WE+RQ+H+
Sbjct: 282  ILAIGNSGKMWESRQEHS 299


>ref|XP_004167033.1| PREDICTED: uncharacterized LOC101207421, partial [Cucumis sativus]
          Length = 246

 Score =  181 bits (460), Expect = 6e-43
 Identities = 107/204 (52%), Positives = 129/204 (63%), Gaps = 2/204 (0%)
 Frame = +3

Query: 525  VGRRSYGANEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQ 704
            VGRR  G  +F+ D+ LS  EF  L +S + SV FA+N   + S+      FGD  LV  
Sbjct: 19   VGRRPGGVADFDIDSLLSATEFFCLVASLIGSVGFALNCAKTRSKSLFLAVFGDGVLVGT 78

Query: 705  CXXXXXXXXXXXXIRRRQWRRIC--SADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQL 878
                         IRRRQW R+   +A     VNL+E+  KLEEDL++SAT+IRVLSRQL
Sbjct: 79   ILFLVAGVAIGAWIRRRQWNRVFRETAKGVLEVNLMEKTNKLEEDLRSSATLIRVLSRQL 138

Query: 879  EKLGIRFRVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXX 1058
            EKLGIRFRVTRKALK+P+ ETAALAQK SEATRALAV+ DILEKEL EIQKV        
Sbjct: 139  EKLGIRFRVTRKALKKPVEETAALAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQ 198

Query: 1059 XXXXXXXXXXGKSGKLWENRQQHN 1130
                      G SGK+WE+RQ+H+
Sbjct: 199  QKQLDLILAIGNSGKMWESRQEHS 222


>ref|XP_004504099.1| PREDICTED: uncharacterized protein LOC101515258 [Cicer arietinum]
          Length = 296

 Score =  178 bits (452), Expect = 5e-42
 Identities = 110/258 (42%), Positives = 148/258 (57%), Gaps = 2/258 (0%)
 Frame = +3

Query: 417  SLNFCTRNSRFTNFYRVTSHAHRWAIAVKPLEQDEMVGRRSYGANEFNFDAFLSILEFLS 596
            +L   T   RFT+   +T HA  +        +   +    +   + NFD+FLS LE   
Sbjct: 47   NLKLPTTTHRFTS---ITVHADSF-------RRRSQISVPKHVTGDSNFDSFLSFLELSC 96

Query: 597  LASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXXXXIRRRQWRR-IC 773
            L SS ++S   AV   ++  ++ +F   G+R   W              IRRR+WR+ + 
Sbjct: 97   LLSSVIVSASVAV---IAVWKKELFVAIGNRVSPWSVLLLVVGVLTGALIRRRKWRQTVV 153

Query: 774  SADFS-RPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAAL 950
               F    VN ++R+EKLEEDL++SA ++RVLSRQLEKLGIRFRVTRK+LKEP+ ETAAL
Sbjct: 154  DGGFPVSEVNFLQRMEKLEEDLRSSAMVVRVLSRQLEKLGIRFRVTRKSLKEPITETAAL 213

Query: 951  AQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXGKSGKLWENRQQHN 1130
            AQKNSEA RALA+Q DILEKELGEIQKV                  GK+GKLWE++++ +
Sbjct: 214  AQKNSEAARALAMQSDILEKELGEIQKVLLAMQEQQRKQLDLILAIGKAGKLWESKRETS 273

Query: 1131 HEDNAANCSNPPVDQVSQ 1184
             E      SN   ++V Q
Sbjct: 274  EEHGTIEMSNSAANEVKQ 291


>ref|XP_002309969.1| hypothetical protein POPTR_0007s05170g [Populus trichocarpa]
            gi|222852872|gb|EEE90419.1| hypothetical protein
            POPTR_0007s05170g [Populus trichocarpa]
          Length = 267

 Score =  178 bits (451), Expect = 7e-42
 Identities = 109/222 (49%), Positives = 138/222 (62%), Gaps = 16/222 (7%)
 Frame = +3

Query: 417  SLNFCTRNSRFTNFYRVTSHAHRWA-----------IAVKPLEQDEMVGRRSYGANEFNF 563
            S +   RN   +     + H+H +            + +K  + D  +  +   + +FN 
Sbjct: 28   STSLSLRNLTLSRHVNTSLHSHNFHFKPQTPKSSFNLTLKAYQSDPTIPTQD--SKQFNL 85

Query: 564  DAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRF-LVWQCXXXXXXXXXXX 740
            D FLS+ E L + SS++I++ +A+NY +  S+R V G  G      W             
Sbjct: 86   DHFLSVAELLCIFSSSIITISYALNYTVLNSKRGVLGVIGSNTGFAWGMVVMVSGVVIGA 145

Query: 741  XIRRRQWRRIC---SADFSRP-VNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVT 908
             IRRR W R+      + SR  +NLV RIEKLEEDL++SATIIRVLSRQLEKLGIRFRVT
Sbjct: 146  WIRRRMWWRVSRETGREGSRESLNLVGRIEKLEEDLRSSATIIRVLSRQLEKLGIRFRVT 205

Query: 909  RKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKV 1034
            RKALKEP+AETAALAQKNS+ATRALAVQEDILEKELGEIQKV
Sbjct: 206  RKALKEPIAETAALAQKNSDATRALAVQEDILEKELGEIQKV 247


>ref|XP_006585264.1| PREDICTED: uncharacterized protein LOC100809936 isoform X2 [Glycine
            max]
          Length = 248

 Score =  171 bits (433), Expect = 8e-40
 Identities = 108/208 (51%), Positives = 132/208 (63%), Gaps = 2/208 (0%)
 Frame = +3

Query: 417  SLNFCTRNSRFTNFYRVTSHAHRWAIAVKPLEQDEMVGRRSYGANEFNFDAFLSILEFLS 596
            SL+F    SR    +  T  AHR+      +  D    R  + A + NFD+ LS+LEF  
Sbjct: 32   SLSFSIVTSR--PLHLTTHTAHRFNSLT--VRADSFRLRSEHAAADSNFDSLLSLLEFSC 87

Query: 597  LASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXXXXXIRRRQWRRIC- 773
            L SSA+ S   AV   L+GS+  +    G R   +              IRRRQWRR+  
Sbjct: 88   LLSSAISSAAAAV---LAGSKNELIAGIGARAAPFGGALLVVGVLVGAWIRRRQWRRVSV 144

Query: 774  -SADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTRKALKEPLAETAAL 950
             +      VNL+ERIEKLEEDL++SAT++RVLSRQLEKLG+RFRVTRK LK+P+AETAAL
Sbjct: 145  EAGKGGLEVNLLERIEKLEEDLRSSATVVRVLSRQLEKLGVRFRVTRKGLKDPIAETAAL 204

Query: 951  AQKNSEATRALAVQEDILEKELGEIQKV 1034
            AQKNSEA RALAVQ DILEKELGEIQ+V
Sbjct: 205  AQKNSEAARALAVQSDILEKELGEIQQV 232


>ref|XP_006394057.1| hypothetical protein EUTSA_v10004756mg [Eutrema salsugineum]
            gi|557090696|gb|ESQ31343.1| hypothetical protein
            EUTSA_v10004756mg [Eutrema salsugineum]
          Length = 276

 Score =  169 bits (428), Expect = 3e-39
 Identities = 99/192 (51%), Positives = 121/192 (63%), Gaps = 1/192 (0%)
 Frame = +3

Query: 543  GANEFNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXX 722
            G++ F+  +F+S  E L + SSAVISV+ AVNY       +V G  G + L         
Sbjct: 87   GSDGFDLGSFISFAEVLCILSSAVISVVLAVNY-------AVVGEIGKKVLSLGFVGLVG 139

Query: 723  XXXXXXXIRRRQWRRICS-ADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRF 899
                   +RRRQW RIC  A      NL+ R+EKLEEDLK S TI+R+LS+ LEKLGIRF
Sbjct: 140  SVASGSWLRRRQWMRICKGAREGEGTNLISRLEKLEEDLKTSTTIVRLLSKHLEKLGIRF 199

Query: 900  RVTRKALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXX 1079
            RVTRKALKEP++ETAALAQKNSEATR LA Q++ILEKELGEIQKV               
Sbjct: 200  RVTRKALKEPISETAALAQKNSEATRVLAAQQEILEKELGEIQKVLLAMQDQQRKQLELI 259

Query: 1080 XXXGKSGKLWEN 1115
                K+GKL+E+
Sbjct: 260  LTIAKNGKLFES 271


>ref|XP_006280879.1| hypothetical protein CARUB_v10026870mg [Capsella rubella]
            gi|482549583|gb|EOA13777.1| hypothetical protein
            CARUB_v10026870mg [Capsella rubella]
          Length = 299

 Score =  169 bits (427), Expect = 4e-39
 Identities = 105/207 (50%), Positives = 126/207 (60%), Gaps = 3/207 (1%)
 Frame = +3

Query: 555  FNFDAFLSILEFLSLASSAVISVIFAVNYGLSGSQRSVFGWFGDRFLVWQCXXXXXXXXX 734
            F+  +F+S  E L + SSAVISV+ AVNY        V G  G + L             
Sbjct: 91   FDLGSFVSFAEALCIISSAVISVVLAVNY-------VVVGEIGKKVLSLGFVGLVGSVAT 143

Query: 735  XXXIRRRQWRRICS-ADFSRPVNLVERIEKLEEDLKNSATIIRVLSRQLEKLGIRFRVTR 911
               +RRRQW+RIC  A  S   NL+ R+EKLEEDLK+S TI+RVLSR LEKLGIRFRVTR
Sbjct: 144  GSWLRRRQWKRICKGARKSEGTNLICRLEKLEEDLKSSTTIVRVLSRHLEKLGIRFRVTR 203

Query: 912  KALKEPLAETAALAQKNSEATRALAVQEDILEKELGEIQKVXXXXXXXXXXXXXXXXXXG 1091
            KALKEP++ETAALAQKNSEATR LA Q++ILEKELGEIQKV                   
Sbjct: 204  KALKEPISETAALAQKNSEATRVLAAQQEILEKELGEIQKVLLAMQEQQRKQLELILTIA 263

Query: 1092 KSGKLWE--NRQQHNHEDNAANCSNPP 1166
            KS KL+E  + +Q  +E  A   +  P
Sbjct: 264  KSSKLFESSSSKQAPNEQKANKAAEEP 290

