BLASTX nr result

ID: Catharanthus23_contig00014759

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00014759
         (1665 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606...   329   2e-87
ref|XP_004245966.1| PREDICTED: uncharacterized protein LOC101264...   328   4e-87
gb|EMJ10522.1| hypothetical protein PRUPE_ppa008010mg [Prunus pe...   327   1e-86
ref|XP_002264344.1| PREDICTED: uncharacterized protein LOC100266...   325   5e-86
ref|XP_002326587.1| predicted protein [Populus trichocarpa]           325   5e-86
ref|XP_006368288.1| hypothetical protein POPTR_0001s01320g [Popu...   325   5e-86
ref|XP_004300141.1| PREDICTED: uncharacterized protein LOC101302...   316   2e-83
gb|EXB49813.1| hypothetical protein L484_006351 [Morus notabilis]     314   6e-83
ref|XP_006432860.1| hypothetical protein CICLE_v10002048mg [Citr...   312   3e-82
ref|XP_002303468.2| hypothetical protein POPTR_0003s10200g [Popu...   311   7e-82
gb|EOY25327.1| Uncharacterized protein isoform 2 [Theobroma cacao]    306   1e-80
gb|EOY25326.1| Uncharacterized protein isoform 1 [Theobroma cacao]    296   2e-77
ref|XP_004146124.1| PREDICTED: uncharacterized protein LOC101211...   296   2e-77
emb|CBI37457.3| unnamed protein product [Vitis vinifera]              294   7e-77
ref|XP_004512397.1| PREDICTED: uncharacterized protein LOC101511...   287   8e-75
ref|XP_003534316.1| PREDICTED: uncharacterized protein LOC100813...   287   1e-74
gb|ESW30148.1| hypothetical protein PHAVU_002G128700g [Phaseolus...   285   3e-74
gb|EPS59472.1| hypothetical protein M569_15336, partial [Genlise...   281   8e-73
ref|NP_001241403.1| uncharacterized protein LOC100811221 [Glycin...   279   3e-72
ref|XP_006414004.1| hypothetical protein EUTSA_v10025843mg [Eutr...   272   3e-70

>ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606400 [Solanum tuberosum]
          Length = 301

 Score =  329 bits (844), Expect = 2e-87
 Identities = 159/246 (64%), Positives = 192/246 (78%), Gaps = 4/246 (1%)
 Frame = +3

Query: 489  GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668
            GNLSI +PIS+PSQCKIVSSSVDLRSSKVCELG LNYKAKHV YP +RKKFRCHYDYYWA
Sbjct: 51   GNLSISSPISLPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVLYPSERKKFRCHYDYYWA 110

Query: 669  SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            SVFKVEYMDHSGQ R A AEAP+EALPS+CRPNFS AWLTKDKF+VN+TY CWYT+GIS 
Sbjct: 111  SVFKVEYMDHSGQARSALAEAPNEALPSDCRPNFSGAWLTKDKFEVNKTYECWYTLGISK 170

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V++YQ G F+C AKDPS +EM  RY IL MR+LKS++ +G     HWRW+AV G+I GF 
Sbjct: 171  VHIYQAGFFDCDAKDPSTIEMFIRYLILFMRILKSWYVSGV-LYWHWRWEAVAGVIAGFC 229

Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGR----VHSAVHFKRICLFIAYFSFMGWLGIQYVKR 1196
            TS+++  L A+L +  S I QL   R      + +  +R+C  +AY SF  WL +QY++R
Sbjct: 230  TSIMTVILFALLRKFFSCIYQLSVVRRFTLPFNKIRLRRVCFLLAYVSFTSWLAVQYLRR 289

Query: 1197 LGLIKI 1214
            +GL +I
Sbjct: 290  IGLPEI 295


>ref|XP_004245966.1| PREDICTED: uncharacterized protein LOC101264543 [Solanum
            lycopersicum]
          Length = 301

 Score =  328 bits (841), Expect = 4e-87
 Identities = 159/246 (64%), Positives = 192/246 (78%), Gaps = 4/246 (1%)
 Frame = +3

Query: 489  GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668
            GNLSI +PIS+PSQCKIVSSSVDLRSSKVCELG LNYKAKHV YP +RKKFRCHYDYYWA
Sbjct: 51   GNLSISSPISLPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVLYPSERKKFRCHYDYYWA 110

Query: 669  SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            SVFKVEY+DHSGQ R A AEAP+EALPS+CRPNFS AWLTKDKF+VN+TY CWYT+GIS 
Sbjct: 111  SVFKVEYVDHSGQARSALAEAPNEALPSDCRPNFSGAWLTKDKFEVNKTYKCWYTLGISK 170

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V++YQ G F+C AKDPS +EM  RY IL MR+LKS++ +G     HWRW+AV G+I GF 
Sbjct: 171  VHIYQAGFFDCDAKDPSTIEMFIRYLILFMRILKSWYVSGV-LYWHWRWEAVAGVIAGFC 229

Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGR----VHSAVHFKRICLFIAYFSFMGWLGIQYVKR 1196
            TS+++  L A+L ++ S I QL   R      + V  +R+C  +AY SF  WL +QY +R
Sbjct: 230  TSIMTVILFALLRKLFSCIYQLSVVRRFTLPFNKVRLRRVCFLLAYVSFTSWLAVQYFRR 289

Query: 1197 LGLIKI 1214
            +GL +I
Sbjct: 290  IGLPEI 295


>gb|EMJ10522.1| hypothetical protein PRUPE_ppa008010mg [Prunus persica]
          Length = 349

 Score =  327 bits (837), Expect = 1e-86
 Identities = 162/284 (57%), Positives = 197/284 (69%), Gaps = 3/284 (1%)
 Frame = +3

Query: 381  KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560
            K  + Y++ RC                          N SI +PISV SQC+I+SSSVDL
Sbjct: 64   KGNMAYLILRCSLALVLPIVAIFALSLLVGFVAIFVANSSIPSPISVSSQCRILSSSVDL 123

Query: 561  RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737
            +SSKVCELG  NYKAKHVFYPF+ ++FRC YDYYWAS+FKVEY D S GQ +LA AEAP+
Sbjct: 124  KSSKVCELGLFNYKAKHVFYPFEGRRFRCRYDYYWASIFKVEYKDQSSGQTQLALAEAPN 183

Query: 738  EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917
            EALP +CRPNF AAWLTKDKFKVNETY CWYT GIS V++Y DG F+CQAKDPS  EM+R
Sbjct: 184  EALPLDCRPNFGAAWLTKDKFKVNETYDCWYTYGISKVSLYHDGFFSCQAKDPSTFEMIR 243

Query: 918  RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKSSICQLF 1097
            RYFIL+ ++L S+F         WRW+ V G+I GFSTSL+S   + +L QMKS + QLF
Sbjct: 244  RYFILATKILHSWFV-AQERAGFWRWETVAGVIAGFSTSLISISFIRLLQQMKSRLPQLF 302

Query: 1098 SGRVHS--AVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKIFSL 1223
            + RV     + F+R C  +AY SFM WL IQY KRLGL++I +L
Sbjct: 303  AARVLPLYMIRFRRTCFLVAYISFMSWLVIQYGKRLGLLEIITL 346


>ref|XP_002264344.1| PREDICTED: uncharacterized protein LOC100266685 [Vitis vinifera]
          Length = 291

 Score =  325 bits (832), Expect = 5e-86
 Identities = 158/251 (62%), Positives = 192/251 (76%), Gaps = 4/251 (1%)
 Frame = +3

Query: 489  GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668
            GNLS+ +P+SVPSQCKIVSSSVDLRSSKVCELG LNYKAKHVFYP +++KFRCHYDYYWA
Sbjct: 42   GNLSVSSPVSVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPLEKRKFRCHYDYYWA 101

Query: 669  SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            SVFKVEY D  GQ RL   EAP+EALP +CRPNF AAWLTKDKFKVNETY CWY  GIS 
Sbjct: 102  SVFKVEYKDSLGQTRLTLTEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCWYASGISK 161

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFF-SNGTSSLVHWRWDAVVGLITGF 1025
            V++YQD  F+CQAK+PS +EM+RRY ILS R+L+S+  S G     +WR + V G+ITGF
Sbjct: 162  VSIYQDSFFSCQAKEPSTIEMIRRYSILSTRILQSWLASQGRGK--YWRLETVAGVITGF 219

Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSA---VHFKRICLFIAYFSFMGWLGIQYVKR 1196
            STSL+S  LV +LHQ+KS + ++   ++  A   V F+R    + Y  FMGWL I+Y KR
Sbjct: 220  STSLISISLVRILHQVKSWLPRILKRQLLLAVKSVRFRRAFFLVTYVIFMGWLAIEYGKR 279

Query: 1197 LGLIKIFSLSY 1229
            LG+  I+ + Y
Sbjct: 280  LGISNIYRVYY 290


>ref|XP_002326587.1| predicted protein [Populus trichocarpa]
          Length = 299

 Score =  325 bits (832), Expect = 5e-86
 Identities = 160/284 (56%), Positives = 196/284 (69%), Gaps = 6/284 (2%)
 Frame = +3

Query: 381  KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560
            KKG  Y+ FR                          G+LSI  P+S+PSQCKI+SSSVDL
Sbjct: 12   KKGFIYLAFRLTLALLFPIFAFLSLSILLGFLAIFMGHLSITTPLSLPSQCKILSSSVDL 71

Query: 561  RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737
            RSSK+CE GFLNYKAKHVFYP++R KFRC YDYYWASVF+VEY D+S GQ + A AEAP+
Sbjct: 72   RSSKICEPGFLNYKAKHVFYPYNRSKFRCRYDYYWASVFEVEYKDYSLGQTQFALAEAPN 131

Query: 738  EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917
            EALP NCRPNF AAWLTKDKFKVN+TY CWYT GI  V++Y+D LF+CQAKDPS VEM++
Sbjct: 132  EALPLNCRPNFGAAWLTKDKFKVNKTYDCWYTSGILKVSLYRDDLFSCQAKDPSQVEMIK 191

Query: 918  RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKS-----S 1082
            R+FILS  ML S          +WRW+ + G+I GFSTS+++   + +L  +KS     S
Sbjct: 192  RFFILSKEMLHSSLVQKKGKAGYWRWETIAGVIAGFSTSIITISFIRILQHIKSWFRLPS 251

Query: 1083 ICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKI 1214
            + ++FS    + V FKR C  +AY SFMGWL IQY KRLGL +I
Sbjct: 252  VARMFSHT--NIVFFKRACFLVAYISFMGWLTIQYGKRLGLPEI 293


>ref|XP_006368288.1| hypothetical protein POPTR_0001s01320g [Populus trichocarpa]
            gi|118483148|gb|ABK93480.1| unknown [Populus trichocarpa]
            gi|550346193|gb|ERP64857.1| hypothetical protein
            POPTR_0001s01320g [Populus trichocarpa]
          Length = 306

 Score =  325 bits (832), Expect = 5e-86
 Identities = 160/284 (56%), Positives = 196/284 (69%), Gaps = 6/284 (2%)
 Frame = +3

Query: 381  KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560
            KKG  Y+ FR                          G+LSI  P+S+PSQCKI+SSSVDL
Sbjct: 19   KKGFIYLAFRLTLALLFPIFAFLSLSILLGFLAIFMGHLSITTPLSLPSQCKILSSSVDL 78

Query: 561  RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737
            RSSK+CE GFLNYKAKHVFYP++R KFRC YDYYWASVF+VEY D+S GQ + A AEAP+
Sbjct: 79   RSSKICEPGFLNYKAKHVFYPYNRSKFRCRYDYYWASVFEVEYKDYSLGQTQFALAEAPN 138

Query: 738  EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917
            EALP NCRPNF AAWLTKDKFKVN+TY CWYT GI  V++Y+D LF+CQAKDPS VEM++
Sbjct: 139  EALPLNCRPNFGAAWLTKDKFKVNKTYDCWYTSGILKVSLYRDDLFSCQAKDPSQVEMIK 198

Query: 918  RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKS-----S 1082
            R+FILS  ML S          +WRW+ + G+I GFSTS+++   + +L  +KS     S
Sbjct: 199  RFFILSKEMLHSSLVQKKGKAGYWRWETIAGVIAGFSTSIITISFIRILQHIKSWFRLPS 258

Query: 1083 ICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKI 1214
            + ++FS    + V FKR C  +AY SFMGWL IQY KRLGL +I
Sbjct: 259  VARMFSHT--NIVFFKRACFLVAYISFMGWLTIQYGKRLGLPEI 300


>ref|XP_004300141.1| PREDICTED: uncharacterized protein LOC101302166 [Fragaria vesca
            subsp. vesca]
          Length = 290

 Score =  316 bits (810), Expect = 2e-83
 Identities = 152/249 (61%), Positives = 195/249 (78%), Gaps = 5/249 (2%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            N S+  PISV SQC+IVSSSVDL+S+KVCELG LNYKAK+VFYP + ++FRC YDYYWAS
Sbjct: 40   NSSVPGPISVSSQCRIVSSSVDLKSAKVCELGLLNYKAKNVFYPLEGRRFRCRYDYYWAS 99

Query: 672  VFKVEYMD-HSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            VFKVEY D  SGQ R+A AEAPSEALP NCRPNF AAWLTKDKFKVN+TY CWYT G+S 
Sbjct: 100  VFKVEYQDLSSGQTRVALAEAPSEALPLNCRPNFGAAWLTKDKFKVNKTYDCWYTYGVSQ 159

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V++Y+DG F+CQAKDPS  EM+RRYFIL  ++L+S+F +   ++  WRW+ +VG++TGFS
Sbjct: 160  VSLYEDGFFSCQAKDPSTFEMIRRYFILLTKILQSWFLSQEPAM-FWRWETMVGVVTGFS 218

Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGRV----HSAVHFKRICLFIAYFSFMGWLGIQYVKR 1196
            T+++S   + ++  +KS + Q+ + R+     SAV F+R C  +AY SFMGWL I+Y KR
Sbjct: 219  TAMISITFIRLMQLLKSQLPQISARRMLTQSVSAVLFRRTCFLVAYISFMGWLTIEYGKR 278

Query: 1197 LGLIKIFSL 1223
            LGL +I +L
Sbjct: 279  LGLPEILTL 287


>gb|EXB49813.1| hypothetical protein L484_006351 [Morus notabilis]
          Length = 298

 Score =  314 bits (805), Expect = 6e-83
 Identities = 156/250 (62%), Positives = 183/250 (73%), Gaps = 6/250 (2%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            N S+ +PISVPSQCKIVSSSVDLRSSKVCELG LNYKAKHVFYPF + KFRC YDYYWAS
Sbjct: 50   NSSVSSPISVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPFGKNKFRCRYDYYWAS 109

Query: 672  VFKVEYMD-HSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            VFKVEY D  SG  R A AEAP+EALP NCRPNF AAWL KDKFKVNETY CWYT GI  
Sbjct: 110  VFKVEYKDLSSGVNRFASAEAPNEALPLNCRPNFGAAWLNKDKFKVNETYDCWYTHGIPK 169

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V++  DG F+CQA DPS  EM++RY ILS+++L+S+  +   S  HWRWD + G+  GFS
Sbjct: 170  VSLPDDGFFSCQANDPSTFEMIKRYSILSVKVLQSWLLSREKS-KHWRWDVLAGVFVGFS 228

Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSG-----RVHSAVHFKRICLFIAYFSFMGWLGIQYVK 1193
            TSL+S   V  L Q+KSS   LFS      R    + F R C  + YFS M WL +QY K
Sbjct: 229  TSLISITFVVFLRQLKSS---LFSAAKSLLRAFLRITFTRACFLVVYFSVMAWLAVQYGK 285

Query: 1194 RLGLIKIFSL 1223
            R+GL++IF++
Sbjct: 286  RIGLLEIFTI 295


>ref|XP_006432860.1| hypothetical protein CICLE_v10002048mg [Citrus clementina]
            gi|557534982|gb|ESR46100.1| hypothetical protein
            CICLE_v10002048mg [Citrus clementina]
          Length = 292

 Score =  312 bits (799), Expect = 3e-82
 Identities = 151/244 (61%), Positives = 187/244 (76%), Gaps = 1/244 (0%)
 Frame = +3

Query: 489  GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668
            G  S+ + I VPSQCKIVSSSVD+RSSKVCELG LNYKAK VFYPF+  KFRC YDYYWA
Sbjct: 51   GESSVSSSIFVPSQCKIVSSSVDIRSSKVCELGVLNYKAKRVFYPFEASKFRCRYDYYWA 110

Query: 669  SVFKVEYMDHS-GQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIS 845
            S+FKVEY+DHS GQ RLA AEAP+EALP +CRPNF AAWLTKDKFKVNETY CWYT+G+S
Sbjct: 111  SIFKVEYLDHSLGQTRLAFAEAPNEALPHSCRPNFGAAWLTKDKFKVNETYGCWYTIGMS 170

Query: 846  TVNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGF 1025
             V++Y+DG F+CQAKDPS  EM+RRY ILS+++L+S+F++      +  W+ V GL TGF
Sbjct: 171  KVSLYRDGFFSCQAKDPSMAEMIRRYSILSVKILQSWFTS-KKKAKYLSWEIVAGLTTGF 229

Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGL 1205
             TSL++  +V +L QMK  +      RV + + FKR C  + Y S MGW+ I Y ++LGL
Sbjct: 230  LTSLITISVVGILQQMKPWML----ARVFTRICFKRACFLVVYLSVMGWVAILYGEKLGL 285

Query: 1206 IKIF 1217
            +KIF
Sbjct: 286  LKIF 289


>ref|XP_002303468.2| hypothetical protein POPTR_0003s10200g [Populus trichocarpa]
            gi|550342886|gb|EEE78447.2| hypothetical protein
            POPTR_0003s10200g [Populus trichocarpa]
          Length = 364

 Score =  311 bits (796), Expect = 7e-82
 Identities = 158/287 (55%), Positives = 195/287 (67%), Gaps = 8/287 (2%)
 Frame = +3

Query: 381  KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560
            KKG  Y+ FR                          G+ SI  P S+P QC+I+SSSVDL
Sbjct: 77   KKGFMYLAFRLTSALLFPIFAFLFLSILLGFLAILMGHFSITTPPSLPFQCRILSSSVDL 136

Query: 561  RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737
            RSSK+CELG LNYKAKHVFYP +R KFRC YDYYWASVF+VEY D+S GQ + A AEAP+
Sbjct: 137  RSSKICELGLLNYKAKHVFYPNNRSKFRCRYDYYWASVFEVEYEDYSLGQTQFALAEAPN 196

Query: 738  EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917
            EALP NCRPNF AAWL KDKFKVN+TY CWYT GIS V++Y+D LF+CQAKDPS  EM++
Sbjct: 197  EALPLNCRPNFGAAWLAKDKFKVNKTYDCWYTSGISKVSLYRDDLFSCQAKDPSQAEMIK 256

Query: 918  RYFILSMRMLKS--FFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKS---- 1079
            RYFILS  ML S   +  G +S  +W W+ + G+ITGFSTS+++   + +L  +KS    
Sbjct: 257  RYFILSKEMLHSSPVWKKGKAS--YWGWETIAGVITGFSTSIITISFIKILQYIKSWLRL 314

Query: 1080 -SICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKIF 1217
             S+ ++FS    + V FKR C  +AYFSFMGWL IQ  KR GL +I+
Sbjct: 315  TSVARMFSRA--NVVFFKRACFLVAYFSFMGWLTIQCGKRFGLPEIY 359


>gb|EOY25327.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 259

 Score =  306 bits (785), Expect = 1e-80
 Identities = 144/248 (58%), Positives = 183/248 (73%), Gaps = 1/248 (0%)
 Frame = +3

Query: 489  GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668
            G LSI N I++P+QCKIVSSSVD+RSSK+CELG LNYKAKHV Y F+R KFRC YDYYW 
Sbjct: 18   GELSIPNSITIPTQCKIVSSSVDIRSSKICELGLLNYKAKHVLYHFERSKFRCRYDYYWT 77

Query: 669  SVFKVEYMDHS-GQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIS 845
            SVF+VEY DHS GQ RLA  EAP+EALP +CRPNF AAWLTKDKFKVNETY CWYT GIS
Sbjct: 78   SVFEVEYRDHSLGQTRLAFTEAPNEALPLSCRPNFGAAWLTKDKFKVNETYDCWYTSGIS 137

Query: 846  TVNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGF 1025
             V +Y DG F+CQAKDPS +EM++RY ++S +++ S+ S+     ++WRW+ + G++TGF
Sbjct: 138  KVKLYNDGFFSCQAKDPSTIEMIKRYLMISSKIVYSWLSSKGRG-IYWRWETIAGVVTGF 196

Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGL 1205
            STS+++   + +L  MKS + Q       + VH KR+C  + Y S MGWL  QY +RL +
Sbjct: 197  STSIITISFIRILQHMKSWLPQAL-----NTVHIKRVCFLLVYVSVMGWLVSQYWRRLNI 251

Query: 1206 IKIFSLSY 1229
              I   +Y
Sbjct: 252  PLINVYNY 259


>gb|EOY25326.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 313

 Score =  296 bits (758), Expect = 2e-77
 Identities = 149/308 (48%), Positives = 190/308 (61%), Gaps = 26/308 (8%)
 Frame = +3

Query: 384  KGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDLR 563
            KG  ++ FR  F                       G LSI N I++P+QCKIVSSSVD+R
Sbjct: 12   KGFLFMFFRIAFALLFPIFAFFFLSFLVGFVAVFIGELSIPNSITIPTQCKIVSSSVDIR 71

Query: 564  SSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPSE 740
            SSK+CELG LNYKAKHV Y F+R KFRC YDYYW SVF+VEY DHS GQ RLA  EAP+E
Sbjct: 72   SSKICELGLLNYKAKHVLYHFERSKFRCRYDYYWTSVFEVEYRDHSLGQTRLAFTEAPNE 131

Query: 741  ALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLRR 920
            ALP +CRPNF AAWLTKDKFKVNETY CWYT GIS V +Y DG F+CQAKDPS +EM++R
Sbjct: 132  ALPLSCRPNFGAAWLTKDKFKVNETYDCWYTSGISKVKLYNDGFFSCQAKDPSTIEMIKR 191

Query: 921  YFIL-------------------------SMRMLKSFFSNGTSSLVHWRWDAVVGLITGF 1025
            Y ++                         S +++ S+ S+     ++WRW+ + G++TGF
Sbjct: 192  YLMIEQYSGTSRTALSETWEERIRISECRSSKIVYSWLSSKGRG-IYWRWETIAGVVTGF 250

Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGL 1205
            STS+++   + +L  MKS + Q       + VH KR+C  + Y S MGWL  QY +RL +
Sbjct: 251  STSIITISFIRILQHMKSWLPQAL-----NTVHIKRVCFLLVYVSVMGWLVSQYWRRLNI 305

Query: 1206 IKIFSLSY 1229
              I   +Y
Sbjct: 306  PLINVYNY 313


>ref|XP_004146124.1| PREDICTED: uncharacterized protein LOC101211843 [Cucumis sativus]
          Length = 303

 Score =  296 bits (758), Expect = 2e-77
 Identities = 147/282 (52%), Positives = 190/282 (67%), Gaps = 1/282 (0%)
 Frame = +3

Query: 381  KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560
            K+G   +  R C                         N SI +PIS+ SQCKIVSSSVDL
Sbjct: 16   KRGFLAVTMRWCAALLLPVVSFFVVTLSLSLVAVFVANSSITSPISLRSQCKIVSSSVDL 75

Query: 561  RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDH-SGQQRLAQAEAPS 737
            RSSKVCELG LNYKAK+VFYP++R KFRC YDYYWASVFKVE  DH SG+ R+A AEAP+
Sbjct: 76   RSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARVALAEAPN 135

Query: 738  EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917
            EALP  CRPNF AAWL K KFKVNETY CWY+ GIS V++  DG   CQA++P+ +EM++
Sbjct: 136  EALPHKCRPNFGAAWLAKYKFKVNETYDCWYSSGISKVSLDYDGFSGCQAQEPTTIEMIK 195

Query: 918  RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKSSICQLF 1097
            RY+ L  ++L S++S+     + WRWD + GL+TGFSTSL++  ++ +L  +   + + F
Sbjct: 196  RYYFLCTKILLSWYSS-KEKAIFWRWDMLGGLVTGFSTSLITITVLRILQPLIPWMLRYF 254

Query: 1098 SGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKIFSL 1223
            + R    +H  R C  +AYFSF+GWL IQY KRL L +IF++
Sbjct: 255  TTRFF--IHLNRACFLVAYFSFVGWLIIQYGKRLSLPEIFNI 294


>emb|CBI37457.3| unnamed protein product [Vitis vinifera]
          Length = 310

 Score =  294 bits (753), Expect = 7e-77
 Identities = 140/198 (70%), Positives = 163/198 (82%), Gaps = 1/198 (0%)
 Frame = +3

Query: 489  GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668
            GNLS+ +P+SVPSQCKIVSSSVDLRSSKVCELG LNYKAKHVFYP +++KFRCHYDYYWA
Sbjct: 42   GNLSVSSPVSVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPLEKRKFRCHYDYYWA 101

Query: 669  SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            SVFKVEY D  GQ RL   EAP+EALP +CRPNF AAWLTKDKFKVNETY CWY  GIS 
Sbjct: 102  SVFKVEYKDSLGQTRLTLTEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCWYASGISK 161

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFF-SNGTSSLVHWRWDAVVGLITGF 1025
            V++YQD  F+CQAK+PS +EM+RRY ILS R+L+S+  S G     +WR + V G+ITGF
Sbjct: 162  VSIYQDSFFSCQAKEPSTIEMIRRYSILSTRILQSWLASQGRGK--YWRLETVAGVITGF 219

Query: 1026 STSLLSFGLVAVLHQMKS 1079
            STSL+S  LV +LHQ+KS
Sbjct: 220  STSLISISLVRILHQVKS 237


>ref|XP_004512397.1| PREDICTED: uncharacterized protein LOC101511402 [Cicer arietinum]
          Length = 305

 Score =  287 bits (735), Expect = 8e-75
 Identities = 139/249 (55%), Positives = 180/249 (72%), Gaps = 5/249 (2%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            + S+ NPIS+PS C+IVS+ VD+RSSK+CELG  NYKAK +F  F+R KFRC YDYYWAS
Sbjct: 54   DFSVPNPISLPSHCRIVSTGVDIRSSKICELGLSNYKAKDIFRHFERSKFRCRYDYYWAS 113

Query: 672  VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            VFKVEY DH SGQ++ A AEAPSEALP  CRPNF AAWLT+ KFKVNETY CWYT GIS 
Sbjct: 114  VFKVEYKDHFSGQRQFAFAEAPSEALPLYCRPNFGAAWLTQYKFKVNETYDCWYTSGISK 173

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V++YQD LF C+A + S +E + +Y   +M M+  + S+    + +WRW+ +VG+++GF+
Sbjct: 174  VHLYQDNLFGCRADEQSTIEKIIQYSTQAMEMINYWISDIGRRVKYWRWEVIVGVVSGFA 233

Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGRVHS----AVHFKRICLFIAYFSFMGWLGIQYVKR 1196
            TSL+S   +  L  + SS+ Q F+  + S    AV  +R C F AY SF+GWL I+Y KR
Sbjct: 234  TSLISITFIMFLKLLLSSLRQSFAAWILSWRVNAVLIRRTCFFFAYLSFVGWLAIEYGKR 293

Query: 1197 LGLIKIFSL 1223
            LGL+ IF L
Sbjct: 294  LGLMDIFRL 302


>ref|XP_003534316.1| PREDICTED: uncharacterized protein LOC100813000 [Glycine max]
          Length = 303

 Score =  287 bits (734), Expect = 1e-74
 Identities = 144/250 (57%), Positives = 181/250 (72%), Gaps = 6/250 (2%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            + SI NPIS+PSQCKIVS+ VD+RSSK+CELG L+YKAK VF+ F+R KFRC YDYYWAS
Sbjct: 53   DFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLDYKAKDVFHHFERSKFRCRYDYYWAS 112

Query: 672  VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            VFKVEY DH SGQ ++A AEAP+EALP  CRPNF AAWLT+ KFKVNETY CWYT GIS 
Sbjct: 113  VFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWLTQYKFKVNETYDCWYTSGISK 172

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V ++QD LF C A + S +E  R+Y  ++M M+ S+FS+      HWRW+ + G++TGF 
Sbjct: 173  VRLHQDSLFGCDAHEQSTLEKSRQYSTMAMEMVISWFSS-RGRTKHWRWETLAGVVTGFL 231

Query: 1029 TSLLSFGLVAVLHQMKSSICQ-----LFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVK 1193
            TSL+S   +  L  +  S+ Q     +FS RV +AV  +R C  +AY SF+ WL I+Y K
Sbjct: 232  TSLISITFIRFLQLLLPSLYQSFTTWIFSWRV-NAVLIRRACFLLAYLSFVAWLAIEYGK 290

Query: 1194 RLGLIKIFSL 1223
            RLGL+ IF L
Sbjct: 291  RLGLMDIFRL 300


>gb|ESW30148.1| hypothetical protein PHAVU_002G128700g [Phaseolus vulgaris]
          Length = 305

 Score =  285 bits (730), Expect = 3e-74
 Identities = 143/248 (57%), Positives = 175/248 (70%), Gaps = 5/248 (2%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            + SI NPIS+PSQCKIVS+ VD+RSSK+CELG LNYKAK VF  F+R KFRC YDYYWAS
Sbjct: 55   DFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLNYKAKDVFQHFERSKFRCRYDYYWAS 114

Query: 672  VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            VFKVEY DH SGQ ++A AEAP+EALP  CRPNF AAWLT+ KFKVNETY CWYT GIS 
Sbjct: 115  VFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWLTQYKFKVNETYNCWYTSGISK 174

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V + QD LF C A   S +E  R+Y  ++M M  S+ S G     HWRW+ + G+++GF 
Sbjct: 175  VRLRQDNLFGCHAHQQSTLEKSRQYSTMAMEMAISWLS-GRGRTKHWRWETLAGVVSGFL 233

Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGRVH----SAVHFKRICLFIAYFSFMGWLGIQYVKR 1196
            TSL+S   +   H + SSI Q F+  +     +AV  +R C  +AY SF+ WL I+Y KR
Sbjct: 234  TSLISITFIRFAHILLSSIYQSFTTWIFPWRVNAVFIRRSCFLLAYLSFVAWLAIEYGKR 293

Query: 1197 LGLIKIFS 1220
            LGL+ IFS
Sbjct: 294  LGLMDIFS 301


>gb|EPS59472.1| hypothetical protein M569_15336, partial [Genlisea aurea]
          Length = 260

 Score =  281 bits (718), Expect = 8e-73
 Identities = 131/240 (54%), Positives = 172/240 (71%), Gaps = 4/240 (1%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            +L +W+PISV   C++VSSSVDLRSSKVC +G LNY AK+V YP ++ K+RCHYDYYWA+
Sbjct: 22   SLRLWSPISVRCLCRVVSSSVDLRSSKVCAIGVLNYNAKNVLYPLEKNKYRCHYDYYWAA 81

Query: 672  VFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTV 851
            + KVE+ DH G +R A AEAP+EALP NCRP+FS AWLTK KF +NETY CWYT+GIS V
Sbjct: 82   ILKVEFTDHLGHERFALAEAPNEALPYNCRPSFSGAWLTKSKFMINETYDCWYTLGISKV 141

Query: 852  NMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFST 1031
            N+  +GLFNC+A DPS +EM++ +  L +R+LKS FS G  SL+HWRWD + GL TGF T
Sbjct: 142  NINHEGLFNCRADDPSTLEMMKLHSTLLIRILKSSFS-GWGSLMHWRWDVIAGLTTGFIT 200

Query: 1032 SLLSFGLVAVLHQMKSSICQLFSG----RVHSAVHFKRICLFIAYFSFMGWLGIQYVKRL 1199
            +LL   LV+++  +  S  +L       R    +  KR  +F  Y  FM W+ +QY++RL
Sbjct: 201  ALLVIALVSLIWPLIQSTTRLLGSWLFIRYPITLFLKRAFVFTVYLMFMCWITLQYLRRL 260


>ref|NP_001241403.1| uncharacterized protein LOC100811221 [Glycine max]
            gi|255642352|gb|ACU21440.1| unknown [Glycine max]
          Length = 305

 Score =  279 bits (713), Expect = 3e-72
 Identities = 141/248 (56%), Positives = 177/248 (71%), Gaps = 6/248 (2%)
 Frame = +3

Query: 492  NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671
            + SI NPIS+PSQCKIVS+ VD+RSSK+CELG LNYKAK VF+ F+R KFRC YDYYWAS
Sbjct: 55   DFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLNYKAKDVFHHFERSKFRCRYDYYWAS 114

Query: 672  VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848
            VFKVEY DH SGQ ++A AEAP+EALP  CRPNF AAW T+ KFKVNE+Y CWYT G S 
Sbjct: 115  VFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWFTQYKFKVNESYDCWYTSGNSK 174

Query: 849  VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028
            V+++QD LF C A + S +E  R+Y  ++M M  S+ S+      HWRW+ + G++TGF 
Sbjct: 175  VHLHQDNLFGCDAHEQSTLEKSRQYSTMAMEMAISWLSS-RGRTKHWRWETLAGVVTGFL 233

Query: 1029 TSLLSFGLVAVLHQMKSSICQ-----LFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVK 1193
            TSL+S   +  L    SS+ Q     +FS RV +AV  +R C  +AY SF+ WL I+Y K
Sbjct: 234  TSLISITFIRFLQLFLSSLHQSFTTWIFSWRV-NAVLIRRACFLLAYLSFVAWLVIEYGK 292

Query: 1194 RLGLIKIF 1217
            RLGL+ IF
Sbjct: 293  RLGLMDIF 300


>ref|XP_006414004.1| hypothetical protein EUTSA_v10025843mg [Eutrema salsugineum]
            gi|557115174|gb|ESQ55457.1| hypothetical protein
            EUTSA_v10025843mg [Eutrema salsugineum]
          Length = 299

 Score =  272 bits (696), Expect = 3e-70
 Identities = 131/235 (55%), Positives = 169/235 (71%), Gaps = 7/235 (2%)
 Frame = +3

Query: 516  SVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMD 695
            S+ S+CKIVSSSVDLRSSKVC +G LN KA+HVFYPF+R KFRC YDYYWASVFKVEY D
Sbjct: 62   SLASRCKIVSSSVDLRSSKVCGIGLLNIKAQHVFYPFERDKFRCRYDYYWASVFKVEYKD 121

Query: 696  H-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGL 872
            H  GQ RLA +EAP+EALP  CRPNF AA LTKD FKVNETY CWYT+GI  + +YQDG 
Sbjct: 122  HLMGQTRLAFSEAPNEALPPECRPNFGAALLTKDNFKVNETYDCWYTLGIPKIKLYQDGF 181

Query: 873  FNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGL 1052
            F CQA D S  ++ ++Y +L  R+L+S+F NG     +WR+D + G+++GFSTS+++  +
Sbjct: 182  FGCQANDRSFTDIFKQYAVLFSRLLQSWF-NGKGRPKYWRYDVIAGIVSGFSTSIITVFV 240

Query: 1053 VAVLHQMKSSICQLFS------GRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRL 1199
            + +L   KS + + F        +V+  V  KR CL + YFS +GW+  QY+K L
Sbjct: 241  MRILRHAKSWVPRAFCSVKSQYSKVNLVVQMKRACLVLVYFSALGWMATQYLKIL 295

