BLASTX nr result
ID: Catharanthus23_contig00014759
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00014759 (1665 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

Sequences producing significant alignments:    Score (bits)    E Value
ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606... 329 2e-87
ref|XP_004245966.1| PREDICTED: uncharacterized protein LOC101264... 328 4e-87
gb|EMJ10522.1| hypothetical protein PRUPE_ppa008010mg [Prunus pe... 327 1e-86
ref|XP_002264344.1| PREDICTED: uncharacterized protein LOC100266... 325 5e-86
ref|XP_002326587.1| predicted protein [Populus trichocarpa] 325 5e-86
ref|XP_006368288.1| hypothetical protein POPTR_0001s01320g [Popu... 325 5e-86
ref|XP_004300141.1| PREDICTED: uncharacterized protein LOC101302... 316 2e-83
gb|EXB49813.1| hypothetical protein L484_006351 [Morus notabilis] 314 6e-83
ref|XP_006432860.1| hypothetical protein CICLE_v10002048mg [Citr... 312 3e-82
ref|XP_002303468.2| hypothetical protein POPTR_0003s10200g [Popu... 311 7e-82
gb|EOY25327.1| Uncharacterized protein isoform 2 [Theobroma cacao] 306 1e-80
gb|EOY25326.1| Uncharacterized protein isoform 1 [Theobroma cacao] 296 2e-77
ref|XP_004146124.1| PREDICTED: uncharacterized protein LOC101211... 296 2e-77
emb|CBI37457.3| unnamed protein product [Vitis vinifera] 294 7e-77
ref|XP_004512397.1| PREDICTED: uncharacterized protein LOC101511... 287 8e-75
ref|XP_003534316.1| PREDICTED: uncharacterized protein LOC100813... 287 1e-74
gb|ESW30148.1| hypothetical protein PHAVU_002G128700g [Phaseolus... 285 3e-74
gb|EPS59472.1| hypothetical protein M569_15336, partial [Genlise... 281 8e-73
ref|NP_001241403.1| uncharacterized protein LOC100811221 [Glycin...
279 3e-72 ref|XP_006414004.1| hypothetical protein EUTSA_v10025843mg [Eutr... 272 3e-70 >ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606400 [Solanum tuberosum] Length = 301 Score = 329 bits (844), Expect = 2e-87 Identities = 159/246 (64%), Positives = 192/246 (78%), Gaps = 4/246 (1%) Frame = +3 Query: 489 GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668 GNLSI +PIS+PSQCKIVSSSVDLRSSKVCELG LNYKAKHV YP +RKKFRCHYDYYWA Sbjct: 51 GNLSISSPISLPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVLYPSERKKFRCHYDYYWA 110 Query: 669 SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 SVFKVEYMDHSGQ R A AEAP+EALPS+CRPNFS AWLTKDKF+VN+TY CWYT+GIS Sbjct: 111 SVFKVEYMDHSGQARSALAEAPNEALPSDCRPNFSGAWLTKDKFEVNKTYECWYTLGISK 170 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V++YQ G F+C AKDPS +EM RY IL MR+LKS++ +G HWRW+AV G+I GF Sbjct: 171 VHIYQAGFFDCDAKDPSTIEMFIRYLILFMRILKSWYVSGV-LYWHWRWEAVAGVIAGFC 229 Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGR----VHSAVHFKRICLFIAYFSFMGWLGIQYVKR 1196 TS+++ L A+L + S I QL R + + +R+C +AY SF WL +QY++R Sbjct: 230 TSIMTVILFALLRKFFSCIYQLSVVRRFTLPFNKIRLRRVCFLLAYVSFTSWLAVQYLRR 289 Query: 1197 LGLIKI 1214 +GL +I Sbjct: 290 IGLPEI 295 >ref|XP_004245966.1| PREDICTED: uncharacterized protein LOC101264543 [Solanum lycopersicum] Length = 301 Score = 328 bits (841), Expect = 4e-87 Identities = 159/246 (64%), Positives = 192/246 (78%), Gaps = 4/246 (1%) Frame = +3 Query: 489 GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668 GNLSI +PIS+PSQCKIVSSSVDLRSSKVCELG LNYKAKHV YP +RKKFRCHYDYYWA Sbjct: 51 GNLSISSPISLPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVLYPSERKKFRCHYDYYWA 110 Query: 669 SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 SVFKVEY+DHSGQ R A AEAP+EALPS+CRPNFS AWLTKDKF+VN+TY CWYT+GIS Sbjct: 111 SVFKVEYVDHSGQARSALAEAPNEALPSDCRPNFSGAWLTKDKFEVNKTYKCWYTLGISK 170 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V++YQ G F+C AKDPS +EM RY IL MR+LKS++ +G HWRW+AV G+I GF Sbjct: 171 
VHIYQAGFFDCDAKDPSTIEMFIRYLILFMRILKSWYVSGV-LYWHWRWEAVAGVIAGFC 229 Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGR----VHSAVHFKRICLFIAYFSFMGWLGIQYVKR 1196 TS+++ L A+L ++ S I QL R + V +R+C +AY SF WL +QY +R Sbjct: 230 TSIMTVILFALLRKLFSCIYQLSVVRRFTLPFNKVRLRRVCFLLAYVSFTSWLAVQYFRR 289 Query: 1197 LGLIKI 1214 +GL +I Sbjct: 290 IGLPEI 295 >gb|EMJ10522.1| hypothetical protein PRUPE_ppa008010mg [Prunus persica] Length = 349 Score = 327 bits (837), Expect = 1e-86 Identities = 162/284 (57%), Positives = 197/284 (69%), Gaps = 3/284 (1%) Frame = +3 Query: 381 KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560 K + Y++ RC N SI +PISV SQC+I+SSSVDL Sbjct: 64 KGNMAYLILRCSLALVLPIVAIFALSLLVGFVAIFVANSSIPSPISVSSQCRILSSSVDL 123 Query: 561 RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737 +SSKVCELG NYKAKHVFYPF+ ++FRC YDYYWAS+FKVEY D S GQ +LA AEAP+ Sbjct: 124 KSSKVCELGLFNYKAKHVFYPFEGRRFRCRYDYYWASIFKVEYKDQSSGQTQLALAEAPN 183 Query: 738 EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917 EALP +CRPNF AAWLTKDKFKVNETY CWYT GIS V++Y DG F+CQAKDPS EM+R Sbjct: 184 EALPLDCRPNFGAAWLTKDKFKVNETYDCWYTYGISKVSLYHDGFFSCQAKDPSTFEMIR 243 Query: 918 RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKSSICQLF 1097 RYFIL+ ++L S+F WRW+ V G+I GFSTSL+S + +L QMKS + QLF Sbjct: 244 RYFILATKILHSWFV-AQERAGFWRWETVAGVIAGFSTSLISISFIRLLQQMKSRLPQLF 302 Query: 1098 SGRVHS--AVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKIFSL 1223 + RV + F+R C +AY SFM WL IQY KRLGL++I +L Sbjct: 303 AARVLPLYMIRFRRTCFLVAYISFMSWLVIQYGKRLGLLEIITL 346 >ref|XP_002264344.1| PREDICTED: uncharacterized protein LOC100266685 [Vitis vinifera] Length = 291 Score = 325 bits (832), Expect = 5e-86 Identities = 158/251 (62%), Positives = 192/251 (76%), Gaps = 4/251 (1%) Frame = +3 Query: 489 GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668 GNLS+ +P+SVPSQCKIVSSSVDLRSSKVCELG LNYKAKHVFYP +++KFRCHYDYYWA Sbjct: 42 GNLSVSSPVSVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPLEKRKFRCHYDYYWA 101 Query: 669 
SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 SVFKVEY D GQ RL EAP+EALP +CRPNF AAWLTKDKFKVNETY CWY GIS Sbjct: 102 SVFKVEYKDSLGQTRLTLTEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCWYASGISK 161 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFF-SNGTSSLVHWRWDAVVGLITGF 1025 V++YQD F+CQAK+PS +EM+RRY ILS R+L+S+ S G +WR + V G+ITGF Sbjct: 162 VSIYQDSFFSCQAKEPSTIEMIRRYSILSTRILQSWLASQGRGK--YWRLETVAGVITGF 219 Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSA---VHFKRICLFIAYFSFMGWLGIQYVKR 1196 STSL+S LV +LHQ+KS + ++ ++ A V F+R + Y FMGWL I+Y KR Sbjct: 220 STSLISISLVRILHQVKSWLPRILKRQLLLAVKSVRFRRAFFLVTYVIFMGWLAIEYGKR 279 Query: 1197 LGLIKIFSLSY 1229 LG+ I+ + Y Sbjct: 280 LGISNIYRVYY 290 >ref|XP_002326587.1| predicted protein [Populus trichocarpa] Length = 299 Score = 325 bits (832), Expect = 5e-86 Identities = 160/284 (56%), Positives = 196/284 (69%), Gaps = 6/284 (2%) Frame = +3 Query: 381 KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560 KKG Y+ FR G+LSI P+S+PSQCKI+SSSVDL Sbjct: 12 KKGFIYLAFRLTLALLFPIFAFLSLSILLGFLAIFMGHLSITTPLSLPSQCKILSSSVDL 71 Query: 561 RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737 RSSK+CE GFLNYKAKHVFYP++R KFRC YDYYWASVF+VEY D+S GQ + A AEAP+ Sbjct: 72 RSSKICEPGFLNYKAKHVFYPYNRSKFRCRYDYYWASVFEVEYKDYSLGQTQFALAEAPN 131 Query: 738 EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917 EALP NCRPNF AAWLTKDKFKVN+TY CWYT GI V++Y+D LF+CQAKDPS VEM++ Sbjct: 132 EALPLNCRPNFGAAWLTKDKFKVNKTYDCWYTSGILKVSLYRDDLFSCQAKDPSQVEMIK 191 Query: 918 RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKS-----S 1082 R+FILS ML S +WRW+ + G+I GFSTS+++ + +L +KS S Sbjct: 192 RFFILSKEMLHSSLVQKKGKAGYWRWETIAGVIAGFSTSIITISFIRILQHIKSWFRLPS 251 Query: 1083 ICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKI 1214 + ++FS + V FKR C +AY SFMGWL IQY KRLGL +I Sbjct: 252 VARMFSHT--NIVFFKRACFLVAYISFMGWLTIQYGKRLGLPEI 293 >ref|XP_006368288.1| hypothetical protein POPTR_0001s01320g [Populus trichocarpa] gi|118483148|gb|ABK93480.1| unknown [Populus trichocarpa] 
gi|550346193|gb|ERP64857.1| hypothetical protein POPTR_0001s01320g [Populus trichocarpa] Length = 306 Score = 325 bits (832), Expect = 5e-86 Identities = 160/284 (56%), Positives = 196/284 (69%), Gaps = 6/284 (2%) Frame = +3 Query: 381 KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560 KKG Y+ FR G+LSI P+S+PSQCKI+SSSVDL Sbjct: 19 KKGFIYLAFRLTLALLFPIFAFLSLSILLGFLAIFMGHLSITTPLSLPSQCKILSSSVDL 78 Query: 561 RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737 RSSK+CE GFLNYKAKHVFYP++R KFRC YDYYWASVF+VEY D+S GQ + A AEAP+ Sbjct: 79 RSSKICEPGFLNYKAKHVFYPYNRSKFRCRYDYYWASVFEVEYKDYSLGQTQFALAEAPN 138 Query: 738 EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917 EALP NCRPNF AAWLTKDKFKVN+TY CWYT GI V++Y+D LF+CQAKDPS VEM++ Sbjct: 139 EALPLNCRPNFGAAWLTKDKFKVNKTYDCWYTSGILKVSLYRDDLFSCQAKDPSQVEMIK 198 Query: 918 RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKS-----S 1082 R+FILS ML S +WRW+ + G+I GFSTS+++ + +L +KS S Sbjct: 199 RFFILSKEMLHSSLVQKKGKAGYWRWETIAGVIAGFSTSIITISFIRILQHIKSWFRLPS 258 Query: 1083 ICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKI 1214 + ++FS + V FKR C +AY SFMGWL IQY KRLGL +I Sbjct: 259 VARMFSHT--NIVFFKRACFLVAYISFMGWLTIQYGKRLGLPEI 300 >ref|XP_004300141.1| PREDICTED: uncharacterized protein LOC101302166 [Fragaria vesca subsp. 
vesca] Length = 290 Score = 316 bits (810), Expect = 2e-83 Identities = 152/249 (61%), Positives = 195/249 (78%), Gaps = 5/249 (2%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 N S+ PISV SQC+IVSSSVDL+S+KVCELG LNYKAK+VFYP + ++FRC YDYYWAS Sbjct: 40 NSSVPGPISVSSQCRIVSSSVDLKSAKVCELGLLNYKAKNVFYPLEGRRFRCRYDYYWAS 99 Query: 672 VFKVEYMD-HSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 VFKVEY D SGQ R+A AEAPSEALP NCRPNF AAWLTKDKFKVN+TY CWYT G+S Sbjct: 100 VFKVEYQDLSSGQTRVALAEAPSEALPLNCRPNFGAAWLTKDKFKVNKTYDCWYTYGVSQ 159 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V++Y+DG F+CQAKDPS EM+RRYFIL ++L+S+F + ++ WRW+ +VG++TGFS Sbjct: 160 VSLYEDGFFSCQAKDPSTFEMIRRYFILLTKILQSWFLSQEPAM-FWRWETMVGVVTGFS 218 Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGRV----HSAVHFKRICLFIAYFSFMGWLGIQYVKR 1196 T+++S + ++ +KS + Q+ + R+ SAV F+R C +AY SFMGWL I+Y KR Sbjct: 219 TAMISITFIRLMQLLKSQLPQISARRMLTQSVSAVLFRRTCFLVAYISFMGWLTIEYGKR 278 Query: 1197 LGLIKIFSL 1223 LGL +I +L Sbjct: 279 LGLPEILTL 287 >gb|EXB49813.1| hypothetical protein L484_006351 [Morus notabilis] Length = 298 Score = 314 bits (805), Expect = 6e-83 Identities = 156/250 (62%), Positives = 183/250 (73%), Gaps = 6/250 (2%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 N S+ +PISVPSQCKIVSSSVDLRSSKVCELG LNYKAKHVFYPF + KFRC YDYYWAS Sbjct: 50 NSSVSSPISVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPFGKNKFRCRYDYYWAS 109 Query: 672 VFKVEYMD-HSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 VFKVEY D SG R A AEAP+EALP NCRPNF AAWL KDKFKVNETY CWYT GI Sbjct: 110 VFKVEYKDLSSGVNRFASAEAPNEALPLNCRPNFGAAWLNKDKFKVNETYDCWYTHGIPK 169 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V++ DG F+CQA DPS EM++RY ILS+++L+S+ + S HWRWD + G+ GFS Sbjct: 170 VSLPDDGFFSCQANDPSTFEMIKRYSILSVKVLQSWLLSREKS-KHWRWDVLAGVFVGFS 228 Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSG-----RVHSAVHFKRICLFIAYFSFMGWLGIQYVK 1193 TSL+S V L Q+KSS LFS R + F R C + YFS M WL +QY K 
Sbjct: 229 TSLISITFVVFLRQLKSS---LFSAAKSLLRAFLRITFTRACFLVVYFSVMAWLAVQYGK 285 Query: 1194 RLGLIKIFSL 1223 R+GL++IF++ Sbjct: 286 RIGLLEIFTI 295 >ref|XP_006432860.1| hypothetical protein CICLE_v10002048mg [Citrus clementina] gi|557534982|gb|ESR46100.1| hypothetical protein CICLE_v10002048mg [Citrus clementina] Length = 292 Score = 312 bits (799), Expect = 3e-82 Identities = 151/244 (61%), Positives = 187/244 (76%), Gaps = 1/244 (0%) Frame = +3 Query: 489 GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668 G S+ + I VPSQCKIVSSSVD+RSSKVCELG LNYKAK VFYPF+ KFRC YDYYWA Sbjct: 51 GESSVSSSIFVPSQCKIVSSSVDIRSSKVCELGVLNYKAKRVFYPFEASKFRCRYDYYWA 110 Query: 669 SVFKVEYMDHS-GQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIS 845 S+FKVEY+DHS GQ RLA AEAP+EALP +CRPNF AAWLTKDKFKVNETY CWYT+G+S Sbjct: 111 SIFKVEYLDHSLGQTRLAFAEAPNEALPHSCRPNFGAAWLTKDKFKVNETYGCWYTIGMS 170 Query: 846 TVNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGF 1025 V++Y+DG F+CQAKDPS EM+RRY ILS+++L+S+F++ + W+ V GL TGF Sbjct: 171 KVSLYRDGFFSCQAKDPSMAEMIRRYSILSVKILQSWFTS-KKKAKYLSWEIVAGLTTGF 229 Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGL 1205 TSL++ +V +L QMK + RV + + FKR C + Y S MGW+ I Y ++LGL Sbjct: 230 LTSLITISVVGILQQMKPWML----ARVFTRICFKRACFLVVYLSVMGWVAILYGEKLGL 285 Query: 1206 IKIF 1217 +KIF Sbjct: 286 LKIF 289 >ref|XP_002303468.2| hypothetical protein POPTR_0003s10200g [Populus trichocarpa] gi|550342886|gb|EEE78447.2| hypothetical protein POPTR_0003s10200g [Populus trichocarpa] Length = 364 Score = 311 bits (796), Expect = 7e-82 Identities = 158/287 (55%), Positives = 195/287 (67%), Gaps = 8/287 (2%) Frame = +3 Query: 381 KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560 KKG Y+ FR G+ SI P S+P QC+I+SSSVDL Sbjct: 77 KKGFMYLAFRLTSALLFPIFAFLFLSILLGFLAILMGHFSITTPPSLPFQCRILSSSVDL 136 Query: 561 RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPS 737 RSSK+CELG LNYKAKHVFYP +R KFRC YDYYWASVF+VEY D+S GQ + A AEAP+ Sbjct: 137 
RSSKICELGLLNYKAKHVFYPNNRSKFRCRYDYYWASVFEVEYEDYSLGQTQFALAEAPN 196 Query: 738 EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917 EALP NCRPNF AAWL KDKFKVN+TY CWYT GIS V++Y+D LF+CQAKDPS EM++ Sbjct: 197 EALPLNCRPNFGAAWLAKDKFKVNKTYDCWYTSGISKVSLYRDDLFSCQAKDPSQAEMIK 256 Query: 918 RYFILSMRMLKS--FFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKS---- 1079 RYFILS ML S + G +S +W W+ + G+ITGFSTS+++ + +L +KS Sbjct: 257 RYFILSKEMLHSSPVWKKGKAS--YWGWETIAGVITGFSTSIITISFIKILQYIKSWLRL 314 Query: 1080 -SICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKIF 1217 S+ ++FS + V FKR C +AYFSFMGWL IQ KR GL +I+ Sbjct: 315 TSVARMFSRA--NVVFFKRACFLVAYFSFMGWLTIQCGKRFGLPEIY 359 >gb|EOY25327.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 259 Score = 306 bits (785), Expect = 1e-80 Identities = 144/248 (58%), Positives = 183/248 (73%), Gaps = 1/248 (0%) Frame = +3 Query: 489 GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668 G LSI N I++P+QCKIVSSSVD+RSSK+CELG LNYKAKHV Y F+R KFRC YDYYW Sbjct: 18 GELSIPNSITIPTQCKIVSSSVDIRSSKICELGLLNYKAKHVLYHFERSKFRCRYDYYWT 77 Query: 669 SVFKVEYMDHS-GQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIS 845 SVF+VEY DHS GQ RLA EAP+EALP +CRPNF AAWLTKDKFKVNETY CWYT GIS Sbjct: 78 SVFEVEYRDHSLGQTRLAFTEAPNEALPLSCRPNFGAAWLTKDKFKVNETYDCWYTSGIS 137 Query: 846 TVNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGF 1025 V +Y DG F+CQAKDPS +EM++RY ++S +++ S+ S+ ++WRW+ + G++TGF Sbjct: 138 KVKLYNDGFFSCQAKDPSTIEMIKRYLMISSKIVYSWLSSKGRG-IYWRWETIAGVVTGF 196 Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGL 1205 STS+++ + +L MKS + Q + VH KR+C + Y S MGWL QY +RL + Sbjct: 197 STSIITISFIRILQHMKSWLPQAL-----NTVHIKRVCFLLVYVSVMGWLVSQYWRRLNI 251 Query: 1206 IKIFSLSY 1229 I +Y Sbjct: 252 PLINVYNY 259 >gb|EOY25326.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 313 Score = 296 bits (758), Expect = 2e-77 Identities = 149/308 (48%), Positives = 190/308 (61%), Gaps = 26/308 (8%) Frame = +3 Query: 384 
KGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDLR 563 KG ++ FR F G LSI N I++P+QCKIVSSSVD+R Sbjct: 12 KGFLFMFFRIAFALLFPIFAFFFLSFLVGFVAVFIGELSIPNSITIPTQCKIVSSSVDIR 71 Query: 564 SSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDHS-GQQRLAQAEAPSE 740 SSK+CELG LNYKAKHV Y F+R KFRC YDYYW SVF+VEY DHS GQ RLA EAP+E Sbjct: 72 SSKICELGLLNYKAKHVLYHFERSKFRCRYDYYWTSVFEVEYRDHSLGQTRLAFTEAPNE 131 Query: 741 ALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLRR 920 ALP +CRPNF AAWLTKDKFKVNETY CWYT GIS V +Y DG F+CQAKDPS +EM++R Sbjct: 132 ALPLSCRPNFGAAWLTKDKFKVNETYDCWYTSGISKVKLYNDGFFSCQAKDPSTIEMIKR 191 Query: 921 YFIL-------------------------SMRMLKSFFSNGTSSLVHWRWDAVVGLITGF 1025 Y ++ S +++ S+ S+ ++WRW+ + G++TGF Sbjct: 192 YLMIEQYSGTSRTALSETWEERIRISECRSSKIVYSWLSSKGRG-IYWRWETIAGVVTGF 250 Query: 1026 STSLLSFGLVAVLHQMKSSICQLFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGL 1205 STS+++ + +L MKS + Q + VH KR+C + Y S MGWL QY +RL + Sbjct: 251 STSIITISFIRILQHMKSWLPQAL-----NTVHIKRVCFLLVYVSVMGWLVSQYWRRLNI 305 Query: 1206 IKIFSLSY 1229 I +Y Sbjct: 306 PLINVYNY 313 >ref|XP_004146124.1| PREDICTED: uncharacterized protein LOC101211843 [Cucumis sativus] Length = 303 Score = 296 bits (758), Expect = 2e-77 Identities = 147/282 (52%), Positives = 190/282 (67%), Gaps = 1/282 (0%) Frame = +3 Query: 381 KKGLFYILFRCCFGXXXXXXXXXXXXXXXXXXXXXXGNLSIWNPISVPSQCKIVSSSVDL 560 K+G + R C N SI +PIS+ SQCKIVSSSVDL Sbjct: 16 KRGFLAVTMRWCAALLLPVVSFFVVTLSLSLVAVFVANSSITSPISLRSQCKIVSSSVDL 75 Query: 561 RSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMDH-SGQQRLAQAEAPS 737 RSSKVCELG LNYKAK+VFYP++R KFRC YDYYWASVFKVE DH SG+ R+A AEAP+ Sbjct: 76 RSSKVCELGLLNYKAKNVFYPYERNKFRCRYDYYWASVFKVEMKDHFSGKARVALAEAPN 135 Query: 738 EALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGLFNCQAKDPSNVEMLR 917 EALP CRPNF AAWL K KFKVNETY CWY+ GIS V++ DG CQA++P+ +EM++ Sbjct: 136 EALPHKCRPNFGAAWLAKYKFKVNETYDCWYSSGISKVSLDYDGFSGCQAQEPTTIEMIK 195 Query: 918 RYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGLVAVLHQMKSSICQLF 1097 RY+ L ++L S++S+ + WRWD + GL+TGFSTSL++ ++ +L + + + 
F Sbjct: 196 RYYFLCTKILLSWYSS-KEKAIFWRWDMLGGLVTGFSTSLITITVLRILQPLIPWMLRYF 254 Query: 1098 SGRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRLGLIKIFSL 1223 + R +H R C +AYFSF+GWL IQY KRL L +IF++ Sbjct: 255 TTRFF--IHLNRACFLVAYFSFVGWLIIQYGKRLSLPEIFNI 294 >emb|CBI37457.3| unnamed protein product [Vitis vinifera] Length = 310 Score = 294 bits (753), Expect = 7e-77 Identities = 140/198 (70%), Positives = 163/198 (82%), Gaps = 1/198 (0%) Frame = +3 Query: 489 GNLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWA 668 GNLS+ +P+SVPSQCKIVSSSVDLRSSKVCELG LNYKAKHVFYP +++KFRCHYDYYWA Sbjct: 42 GNLSVSSPVSVPSQCKIVSSSVDLRSSKVCELGLLNYKAKHVFYPLEKRKFRCHYDYYWA 101 Query: 669 SVFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 SVFKVEY D GQ RL EAP+EALP +CRPNF AAWLTKDKFKVNETY CWY GIS Sbjct: 102 SVFKVEYKDSLGQTRLTLTEAPNEALPLDCRPNFGAAWLTKDKFKVNETYDCWYASGISK 161 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFF-SNGTSSLVHWRWDAVVGLITGF 1025 V++YQD F+CQAK+PS +EM+RRY ILS R+L+S+ S G +WR + V G+ITGF Sbjct: 162 VSIYQDSFFSCQAKEPSTIEMIRRYSILSTRILQSWLASQGRGK--YWRLETVAGVITGF 219 Query: 1026 STSLLSFGLVAVLHQMKS 1079 STSL+S LV +LHQ+KS Sbjct: 220 STSLISISLVRILHQVKS 237 >ref|XP_004512397.1| PREDICTED: uncharacterized protein LOC101511402 [Cicer arietinum] Length = 305 Score = 287 bits (735), Expect = 8e-75 Identities = 139/249 (55%), Positives = 180/249 (72%), Gaps = 5/249 (2%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 + S+ NPIS+PS C+IVS+ VD+RSSK+CELG NYKAK +F F+R KFRC YDYYWAS Sbjct: 54 DFSVPNPISLPSHCRIVSTGVDIRSSKICELGLSNYKAKDIFRHFERSKFRCRYDYYWAS 113 Query: 672 VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 VFKVEY DH SGQ++ A AEAPSEALP CRPNF AAWLT+ KFKVNETY CWYT GIS Sbjct: 114 VFKVEYKDHFSGQRQFAFAEAPSEALPLYCRPNFGAAWLTQYKFKVNETYDCWYTSGISK 173 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V++YQD LF C+A + S +E + +Y +M M+ + S+ + +WRW+ +VG+++GF+ Sbjct: 174 VHLYQDNLFGCRADEQSTIEKIIQYSTQAMEMINYWISDIGRRVKYWRWEVIVGVVSGFA 233 
Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGRVHS----AVHFKRICLFIAYFSFMGWLGIQYVKR 1196 TSL+S + L + SS+ Q F+ + S AV +R C F AY SF+GWL I+Y KR Sbjct: 234 TSLISITFIMFLKLLLSSLRQSFAAWILSWRVNAVLIRRTCFFFAYLSFVGWLAIEYGKR 293 Query: 1197 LGLIKIFSL 1223 LGL+ IF L Sbjct: 294 LGLMDIFRL 302 >ref|XP_003534316.1| PREDICTED: uncharacterized protein LOC100813000 [Glycine max] Length = 303 Score = 287 bits (734), Expect = 1e-74 Identities = 144/250 (57%), Positives = 181/250 (72%), Gaps = 6/250 (2%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 + SI NPIS+PSQCKIVS+ VD+RSSK+CELG L+YKAK VF+ F+R KFRC YDYYWAS Sbjct: 53 DFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLDYKAKDVFHHFERSKFRCRYDYYWAS 112 Query: 672 VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 VFKVEY DH SGQ ++A AEAP+EALP CRPNF AAWLT+ KFKVNETY CWYT GIS Sbjct: 113 VFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWLTQYKFKVNETYDCWYTSGISK 172 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V ++QD LF C A + S +E R+Y ++M M+ S+FS+ HWRW+ + G++TGF Sbjct: 173 VRLHQDSLFGCDAHEQSTLEKSRQYSTMAMEMVISWFSS-RGRTKHWRWETLAGVVTGFL 231 Query: 1029 TSLLSFGLVAVLHQMKSSICQ-----LFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVK 1193 TSL+S + L + S+ Q +FS RV +AV +R C +AY SF+ WL I+Y K Sbjct: 232 TSLISITFIRFLQLLLPSLYQSFTTWIFSWRV-NAVLIRRACFLLAYLSFVAWLAIEYGK 290 Query: 1194 RLGLIKIFSL 1223 RLGL+ IF L Sbjct: 291 RLGLMDIFRL 300 >gb|ESW30148.1| hypothetical protein PHAVU_002G128700g [Phaseolus vulgaris] Length = 305 Score = 285 bits (730), Expect = 3e-74 Identities = 143/248 (57%), Positives = 175/248 (70%), Gaps = 5/248 (2%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 + SI NPIS+PSQCKIVS+ VD+RSSK+CELG LNYKAK VF F+R KFRC YDYYWAS Sbjct: 55 DFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLNYKAKDVFQHFERSKFRCRYDYYWAS 114 Query: 672 VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 VFKVEY DH SGQ ++A AEAP+EALP CRPNF AAWLT+ KFKVNETY CWYT GIS Sbjct: 115 
VFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWLTQYKFKVNETYNCWYTSGISK 174 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V + QD LF C A S +E R+Y ++M M S+ S G HWRW+ + G+++GF Sbjct: 175 VRLRQDNLFGCHAHQQSTLEKSRQYSTMAMEMAISWLS-GRGRTKHWRWETLAGVVSGFL 233 Query: 1029 TSLLSFGLVAVLHQMKSSICQLFSGRVH----SAVHFKRICLFIAYFSFMGWLGIQYVKR 1196 TSL+S + H + SSI Q F+ + +AV +R C +AY SF+ WL I+Y KR Sbjct: 234 TSLISITFIRFAHILLSSIYQSFTTWIFPWRVNAVFIRRSCFLLAYLSFVAWLAIEYGKR 293 Query: 1197 LGLIKIFS 1220 LGL+ IFS Sbjct: 294 LGLMDIFS 301 >gb|EPS59472.1| hypothetical protein M569_15336, partial [Genlisea aurea] Length = 260 Score = 281 bits (718), Expect = 8e-73 Identities = 131/240 (54%), Positives = 172/240 (71%), Gaps = 4/240 (1%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 +L +W+PISV C++VSSSVDLRSSKVC +G LNY AK+V YP ++ K+RCHYDYYWA+ Sbjct: 22 SLRLWSPISVRCLCRVVSSSVDLRSSKVCAIGVLNYNAKNVLYPLEKNKYRCHYDYYWAA 81 Query: 672 VFKVEYMDHSGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTV 851 + KVE+ DH G +R A AEAP+EALP NCRP+FS AWLTK KF +NETY CWYT+GIS V Sbjct: 82 ILKVEFTDHLGHERFALAEAPNEALPYNCRPSFSGAWLTKSKFMINETYDCWYTLGISKV 141 Query: 852 NMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFST 1031 N+ +GLFNC+A DPS +EM++ + L +R+LKS FS G SL+HWRWD + GL TGF T Sbjct: 142 NINHEGLFNCRADDPSTLEMMKLHSTLLIRILKSSFS-GWGSLMHWRWDVIAGLTTGFIT 200 Query: 1032 SLLSFGLVAVLHQMKSSICQLFSG----RVHSAVHFKRICLFIAYFSFMGWLGIQYVKRL 1199 +LL LV+++ + S +L R + KR +F Y FM W+ +QY++RL Sbjct: 201 ALLVIALVSLIWPLIQSTTRLLGSWLFIRYPITLFLKRAFVFTVYLMFMCWITLQYLRRL 260 >ref|NP_001241403.1| uncharacterized protein LOC100811221 [Glycine max] gi|255642352|gb|ACU21440.1| unknown [Glycine max] Length = 305 Score = 279 bits (713), Expect = 3e-72 Identities = 141/248 (56%), Positives = 177/248 (71%), Gaps = 6/248 (2%) Frame = +3 Query: 492 NLSIWNPISVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWAS 671 + SI NPIS+PSQCKIVS+ VD+RSSK+CELG LNYKAK VF+ F+R KFRC YDYYWAS Sbjct: 55 
DFSIPNPISLPSQCKIVSTGVDIRSSKICELGLLNYKAKDVFHHFERSKFRCRYDYYWAS 114 Query: 672 VFKVEYMDH-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGIST 848 VFKVEY DH SGQ ++A AEAP+EALP CRPNF AAW T+ KFKVNE+Y CWYT G S Sbjct: 115 VFKVEYKDHFSGQTQVAFAEAPNEALPLYCRPNFGAAWFTQYKFKVNESYDCWYTSGNSK 174 Query: 849 VNMYQDGLFNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFS 1028 V+++QD LF C A + S +E R+Y ++M M S+ S+ HWRW+ + G++TGF Sbjct: 175 VHLHQDNLFGCDAHEQSTLEKSRQYSTMAMEMAISWLSS-RGRTKHWRWETLAGVVTGFL 233 Query: 1029 TSLLSFGLVAVLHQMKSSICQ-----LFSGRVHSAVHFKRICLFIAYFSFMGWLGIQYVK 1193 TSL+S + L SS+ Q +FS RV +AV +R C +AY SF+ WL I+Y K Sbjct: 234 TSLISITFIRFLQLFLSSLHQSFTTWIFSWRV-NAVLIRRACFLLAYLSFVAWLVIEYGK 292 Query: 1194 RLGLIKIF 1217 RLGL+ IF Sbjct: 293 RLGLMDIF 300 >ref|XP_006414004.1| hypothetical protein EUTSA_v10025843mg [Eutrema salsugineum] gi|557115174|gb|ESQ55457.1| hypothetical protein EUTSA_v10025843mg [Eutrema salsugineum] Length = 299 Score = 272 bits (696), Expect = 3e-70 Identities = 131/235 (55%), Positives = 169/235 (71%), Gaps = 7/235 (2%) Frame = +3 Query: 516 SVPSQCKIVSSSVDLRSSKVCELGFLNYKAKHVFYPFDRKKFRCHYDYYWASVFKVEYMD 695 S+ S+CKIVSSSVDLRSSKVC +G LN KA+HVFYPF+R KFRC YDYYWASVFKVEY D Sbjct: 62 SLASRCKIVSSSVDLRSSKVCGIGLLNIKAQHVFYPFERDKFRCRYDYYWASVFKVEYKD 121 Query: 696 H-SGQQRLAQAEAPSEALPSNCRPNFSAAWLTKDKFKVNETYACWYTMGISTVNMYQDGL 872 H GQ RLA +EAP+EALP CRPNF AA LTKD FKVNETY CWYT+GI + +YQDG Sbjct: 122 HLMGQTRLAFSEAPNEALPPECRPNFGAALLTKDNFKVNETYDCWYTLGIPKIKLYQDGF 181 Query: 873 FNCQAKDPSNVEMLRRYFILSMRMLKSFFSNGTSSLVHWRWDAVVGLITGFSTSLLSFGL 1052 F CQA D S ++ ++Y +L R+L+S+F NG +WR+D + G+++GFSTS+++ + Sbjct: 182 FGCQANDRSFTDIFKQYAVLFSRLLQSWF-NGKGRPKYWRYDVIAGIVSGFSTSIITVFV 240 Query: 1053 VAVLHQMKSSICQLFS------GRVHSAVHFKRICLFIAYFSFMGWLGIQYVKRL 1199 + +L KS + + F +V+ V KR CL + YFS +GW+ QY+K L Sbjct: 241 MRILRHAKSWVPRAFCSVKSQYSKVNLVVQMKRACLVLVYFSALGWMATQYLKIL 295
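
The one-line hit summaries near the top of this report (accession, truncated description, bit score, E-value) can be read programmatically. The following is only a minimal sketch, assuming each summary row sits on its own line in the layout "accession description bits E-value"; the names `Hit`, `ROW`, and `parse_hit` are hypothetical helpers introduced here for illustration and are not part of BLAST or of any library.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' list."""
    accession: str
    description: str
    bits: float
    evalue: float


# Assumed row layout: "<db>|<accession>| <description> <bits> <E-value>".
# The description is matched lazily so the last two fields are the numbers.
ROW = re.compile(
    r"^(?P<acc>\w+\|[^|\s]+\|)\s+"
    r"(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>[\d.]*e-\d+|[\d.]+)$"
)


def parse_hit(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line is not a row."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    ev = m["evalue"]
    # Some legacy BLAST versions print very small E-values as "e-105";
    # prepend a mantissa so float() accepts the value.
    if ev.startswith("e"):
        ev = "1" + ev
    return Hit(m["acc"], m["desc"], float(m["bits"]), float(ev))


# Two rows copied from the report above.
rows = [
    "ref|XP_006364460.1| PREDICTED: uncharacterized protein LOC102606... 329 2e-87",
    "ref|XP_002326587.1| predicted protein [Populus trichocarpa] 325 5e-86",
]
hits = [parse_hit(r) for r in rows]
for h in hits:
    print(h.accession, h.bits, h.evalue)
```

Lines that do not match the assumed layout (for example the `Searching...done` line or the column header) return `None`, so they can be skipped. For production use, an established parser such as the one in Biopython is likely a better choice than this regular expression.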