BLASTX nr result
ID: Catharanthus22_contig00009194
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00009194
         (1205 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006357258.1| PREDICTED: uncharacterized protein LOC102597...   357   5e-96
ref|XP_004238763.1| PREDICTED: uncharacterized protein LOC101255...   347   7e-93
ref|XP_006438571.1| hypothetical protein CICLE_v10032129mg [Citr...   333   1e-88
ref|XP_006483250.1| PREDICTED: uncharacterized protein LOC102616...   330   5e-88
gb|EMJ25322.1| hypothetical protein PRUPE_ppa015818mg [Prunus pe...   325   2e-86
ref|XP_003631283.1| PREDICTED: uncharacterized protein LOC100853...   317   8e-84
ref|XP_002311878.1| predicted protein [Populus trichocarpa] gi|5...   313   7e-83
ref|XP_004297341.1| PREDICTED: uncharacterized protein LOC101291...   313   9e-83
ref|XP_002520148.1| conserved hypothetical protein [Ricinus comm...   311   3e-82
gb|EXB63806.1| hypothetical protein L484_021078 [Morus notabilis...   308   2e-81
gb|EOY00205.1| Uncharacterized protein TCM_009967 [Theobroma cacao]   308   4e-81
ref|XP_004135199.1| PREDICTED: uncharacterized protein LOC101204...   305   2e-80
gb|ESW29572.1| hypothetical protein PHAVU_002G080900g [Phaseolus...   295   3e-77
ref|XP_004489960.1| PREDICTED: uncharacterized protein LOC101489...   290   6e-76
ref|XP_006395772.1| hypothetical protein EUTSA_v10004613mg [Eutr...   290   8e-76
ref|NP_001240072.1| uncharacterized protein LOC100813905 [Glycin...   287   7e-75
gb|EPS63886.1| hypothetical protein M569_10896, partial [Genlise...   286   2e-74
ref|XP_006291472.1| hypothetical protein CARUB_v10017608mg [Caps...
281 3e-73 ref|NP_178363.1| uncharacterized protein [Arabidopsis thaliana] ... 278 2e-72 gb|AAT68342.1| hypothetical protein At2g02590 [Arabidopsis thali... 278 2e-72 >ref|XP_006357258.1| PREDICTED: uncharacterized protein LOC102597342 [Solanum tuberosum] Length = 313 Score = 357 bits (916), Expect = 5e-96 Identities = 195/312 (62%), Positives = 222/312 (71%), Gaps = 1/312 (0%) Frame = +3 Query: 153 MAYSLARPWMLLTLTPGKSHFKFTPPSPRICFPSFPDQQIRIKFKSFPTFYTHNHLGHFE 332 MA SL+ LLT + S+ F+ P+I S ++QI +K +SFP T +HLG Sbjct: 1 MATSLSHSPQLLTFSYRNSNPSFS--FPKIHSFSHQNRQIHLKTQSFPILQTFSHLGR-- 56 Query: 333 TSSFPIARAIRVASND-FLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAK 509 + R IR + +D FLE IEE+E +L ++E P KFL WVL WASVS+G+FAVSG+AK Sbjct: 57 -----VQRVIRASDDDSFLEVIEEEEGLLANEEKPLKFLFWVLLWASVSVGLFAVSGDAK 111 Query: 510 AAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTV 689 AAADSIRAS FGVK A +LR GWPDEAVVFALATLPVIELRGAIPVGYWLQLKPT+LTV Sbjct: 112 AAADSIRASGFGVKVANSLRSSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTVLTV 171 Query: 690 LSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLF 869 LSVLGNMVPVPFI+LYLK A FLAG NKSAS LD LFERAK KAGPV+EFQWLGLMLF Sbjct: 172 LSVLGNMVPVPFIVLYLKKLAIFLAGTNKSASKLLDLLFERAKDKAGPVKEFQWLGLMLF 231 Query: 870 VAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXX 1049 VAVPFPGTGAWTGAI+AS+LDM FWS +SA KYA Sbjct: 232 VAVPFPGTGAWTGAIIASVLDMPFWSAVSANFVGVVLAGLLVNLLVNLGLKYAIITGIIL 291 Query: 1050 XXXSTFMWSILR 1085 STFMWSILR Sbjct: 292 FIISTFMWSILR 303 >ref|XP_004238763.1| PREDICTED: uncharacterized protein LOC101255587 [Solanum lycopersicum] Length = 314 Score = 347 bits (889), Expect = 7e-93 Identities = 192/307 (62%), Positives = 217/307 (70%), Gaps = 7/307 (2%) Frame = +3 Query: 186 LTLTPGKSHFKFTPPSPRICFP---SFPDQ--QIRIKFKSFPTFYTHNHLGHFETSSFPI 350 L+ +P F + +P FP SF Q +I++K +SFP T +HLG + Sbjct: 5 LSHSPQLWTFSYRNTNPSFSFPKIHSFLHQNPKIQLKTQSFPILQTFSHLGR-------V 57 Query: 351 ARAIRVASND-FLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAA-DS 524 R IR +S+D FLE IEE+E +L + E P KFL 
WVL WASVS+G+FAVSG+AKAAA DS Sbjct: 58 QRLIRASSSDSFLEVIEEEEGLLANDEKPLKFLFWVLLWASVSVGLFAVSGDAKAAAADS 117 Query: 525 IRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLG 704 IRAS FGVK A ALR GWPDEAVVFALATLPVIELRGAIPVGYWLQLKP++LTVLSVLG Sbjct: 118 IRASGFGVKVANALRSSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPSVLTVLSVLG 177 Query: 705 NMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPF 884 NMVPVPFI+LYLK A FLAG NKSAS LD LFERAK KAGPV+EFQWLGLMLFVAVPF Sbjct: 178 NMVPVPFIVLYLKKLAIFLAGTNKSASKLLDLLFERAKDKAGPVKEFQWLGLMLFVAVPF 237 Query: 885 PGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXST 1064 PGTGAWTGAI+AS+LDM FWS +SA KYA ST Sbjct: 238 PGTGAWTGAIIASVLDMPFWSAVSANFVGVVLAGLLVNLLVNLGLKYAIITGIILFIIST 297 Query: 1065 FMWSILR 1085 FMWSILR Sbjct: 298 FMWSILR 304 >ref|XP_006438571.1| hypothetical protein CICLE_v10032129mg [Citrus clementina] gi|557540767|gb|ESR51811.1| hypothetical protein CICLE_v10032129mg [Citrus clementina] Length = 322 Score = 333 bits (853), Expect = 1e-88 Identities = 175/270 (64%), Positives = 200/270 (74%), Gaps = 1/270 (0%) Frame = +3 Query: 279 KFKSFPTFYTHNHLGHFETSSFPIARAIRVASNDFLETIEEKEKIL-VSKETPTKFLLWV 455 K K F TF + HLG +S FP + +S+ F + I E+E+IL V++ETP KFLLWV Sbjct: 45 KSKPFSTFQSRRHLGPLVSSCFPTRASF--SSDMFPDNITEEERILPVTEETPLKFLLWV 102 Query: 456 LFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELR 635 +FWAS+S+ F+ SG+A AA DSIRAS+ G+K ATALR GWPDEAVVFALATLPV+ELR Sbjct: 103 VFWASLSLVWFSTSGDANAAVDSIRASAIGLKIATALRRSGWPDEAVVFALATLPVLELR 162 Query: 636 GAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERA 815 GAIPVGYW+QLKP LLTVLSVLGNMVPVPFIILYLK FA+FLAGKN+SAS FLD LF++A Sbjct: 163 GAIPVGYWMQLKPVLLTVLSVLGNMVPVPFIILYLKKFASFLAGKNRSASQFLDMLFQKA 222 Query: 816 KAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXX 995 K KAGPVEEFQWLGLMLFVAVPFPGTGAWTGA +A+ILDM FWS LSA Sbjct: 223 KEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAFIAAILDMPFWSALSANFFGVVIAGLLV 282 Query: 996 XXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 KYA STFMWS LR Sbjct: 283 
NLLVNLGLKYAIVTGAILFIISTFMWSTLR 312 >ref|XP_006483250.1| PREDICTED: uncharacterized protein LOC102616695 [Citrus sinensis] Length = 322 Score = 330 bits (847), Expect = 5e-88 Identities = 174/270 (64%), Positives = 199/270 (73%), Gaps = 1/270 (0%) Frame = +3 Query: 279 KFKSFPTFYTHNHLGHFETSSFPIARAIRVASNDFLETIEEKEKIL-VSKETPTKFLLWV 455 K K F TF + HLG +S FP + +S+ F + I E+E+IL V++ETP KFLLWV Sbjct: 45 KSKPFSTFQSRRHLGPLVSSCFPTRASF--SSDMFPDNITEEERILPVTEETPLKFLLWV 102 Query: 456 LFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELR 635 +FWAS+S+ F+ SG+A AA DSIRAS+ G+K ATALR WPDEAVVFALATLPV+ELR Sbjct: 103 VFWASLSLVWFSTSGDANAAVDSIRASAIGLKIATALRRSSWPDEAVVFALATLPVLELR 162 Query: 636 GAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERA 815 GAIPVGYW+QLKP LLTVLSVLGNMVPVPFIILYLK FA+FLAGKN+SAS FLD LF++A Sbjct: 163 GAIPVGYWMQLKPVLLTVLSVLGNMVPVPFIILYLKKFASFLAGKNRSASQFLDMLFQKA 222 Query: 816 KAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXX 995 K KAGPVEEFQWLGLMLFVAVPFPGTGAWTGA +A+ILDM FWS LSA Sbjct: 223 KEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAFIAAILDMPFWSALSANFFGVVIAGLLV 282 Query: 996 XXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 KYA STFMWS LR Sbjct: 283 NLLVNLGLKYAIVTGAILFIISTFMWSTLR 312 >gb|EMJ25322.1| hypothetical protein PRUPE_ppa015818mg [Prunus persica] Length = 325 Score = 325 bits (833), Expect = 2e-86 Identities = 184/304 (60%), Positives = 211/304 (69%), Gaps = 6/304 (1%) Frame = +3 Query: 192 LTPGKSHFKFTPPSPRICFPSFPDQQIRIKFKSFP--TFYTHNHLGHFETSSFPIARAIR 365 L+ GK+ F+F+P R PS I+ F S F T + L +S A R Sbjct: 16 LSLGKTRFRFSPKHGR---PSIA-HSIQPPFNSNADLNFQTLSPLNPLLANSPLSHAATR 71 Query: 366 VASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAA----DSIRA 533 V+S+ FL+ E+ + + V +E P KF+ WVL WASVS+ +FA SG+A AAA DSIRA Sbjct: 72 VSSHGFLDKDEKDDILPVFEERPVKFVFWVLVWASVSLALFAASGDANAAAAAAADSIRA 131 Query: 534 SSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMV 713 SSFG+K A+ALRG GWPDEAVVFALATLPVIELRGAIPVGYWLQLKP +LTVLSVLGNMV Sbjct: 132 
SSFGLKIASALRGSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPVMLTVLSVLGNMV 191 Query: 714 PVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGT 893 PVPFIILYLK FA+FLAGKNK+A+ FLD LF RAK KAGPVEEFQWLGLMLFVAVPFPGT Sbjct: 192 PVPFIILYLKRFASFLAGKNKAAARFLDILFVRAKEKAGPVEEFQWLGLMLFVAVPFPGT 251 Query: 894 GAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMW 1073 GAWTGAI+ASILDM FW+ +SA KYA STFMW Sbjct: 252 GAWTGAIIASILDMPFWAAVSANFFGVVLAGLLVNLLVNLGLKYAIITGIILFIISTFMW 311 Query: 1074 SILR 1085 SILR Sbjct: 312 SILR 315 >ref|XP_003631283.1| PREDICTED: uncharacterized protein LOC100853229 [Vitis vinifera] gi|296086436|emb|CBI32025.3| unnamed protein product [Vitis vinifera] Length = 311 Score = 317 bits (811), Expect = 8e-84 Identities = 171/277 (61%), Positives = 193/277 (69%), Gaps = 2/277 (0%) Frame = +3 Query: 261 DQQIRIKFKSFPTFYTHN--HLGHFETSSFPIARAIRVASNDFLETIEEKEKILVSKETP 434 D Q R+ FK P+ N H H T S P + + + ++FL+ + + E P Sbjct: 32 DNQHRL-FKPNPSLALRNSRHSRHPLTISPPHSTPAQASPDEFLDKVGDFEG------PP 84 Query: 435 TKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALAT 614 KFL WVLFWAS+S+ FA SG+A AA DSIRASSFG+K A+ALR GWPDEAVV ALAT Sbjct: 85 VKFLFWVLFWASLSVAWFAASGDANAATDSIRASSFGLKVASALRSSGWPDEAVVVALAT 144 Query: 615 LPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFL 794 LPVIELRGAIPVGYW+QLKP LT+LSVLGNM+PVPFIILYLK FATFLAGKNKSAS FL Sbjct: 145 LPVIELRGAIPVGYWMQLKPATLTILSVLGNMIPVPFIILYLKRFATFLAGKNKSASRFL 204 Query: 795 DKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXX 974 D LFE+AK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGAI+ASILDM FW +SA Sbjct: 205 DMLFEKAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWPAVSANFFGV 264 Query: 975 XXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 KYA STFMWS+LR Sbjct: 265 VLAGLLVNLLVNLGLKYAIVTGVILFFISTFMWSVLR 301 >ref|XP_002311878.1| predicted protein [Populus trichocarpa] gi|566189003|ref|XP_006378162.1| hypothetical protein POPTR_0010s04340g [Populus trichocarpa] gi|550329033|gb|ERP55959.1| hypothetical protein POPTR_0010s04340g [Populus trichocarpa] Length = 
310 Score = 313 bits (803), Expect = 7e-83 Identities = 163/267 (61%), Positives = 190/267 (71%) Frame = +3 Query: 285 KSFPTFYTHNHLGHFETSSFPIARAIRVASNDFLETIEEKEKILVSKETPTKFLLWVLFW 464 KS P+F + L S + + R +SN F +T ++KE + + P KFL WV FW Sbjct: 43 KSKPSFLAFHRLDGLRFLSS--STSTRASSNGFFDTTQDKEILPSFEPKPAKFLFWVAFW 100 Query: 465 ASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAI 644 AS+S+ FA SG+A AA DSI+AS FG+K ATA R LGWPDEAVVFALATLPV+ELRGAI Sbjct: 101 ASLSLVWFAASGDANAAVDSIKASGFGLKIATAFRRLGWPDEAVVFALATLPVLELRGAI 160 Query: 645 PVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAK 824 PVGYW+QLKP +LT+LSV+GNMVPVPFIILYLK FA+FLAG+N+ AS FLD LFE AK K Sbjct: 161 PVGYWMQLKPIMLTILSVVGNMVPVPFIILYLKPFASFLAGRNQPASRFLDMLFENAKEK 220 Query: 825 AGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXX 1004 +GPV+EFQWLGLMLFVAVPFPGTGAWTGAI+ASILDM FWS +SA Sbjct: 221 SGPVKEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSAVSANFCGVVLAGLLVNLL 280 Query: 1005 XXXXXKYAXXXXXXXXXXSTFMWSILR 1085 KYA STFMWSILR Sbjct: 281 VNLGLKYATITGIILFFISTFMWSILR 307 >ref|XP_004297341.1| PREDICTED: uncharacterized protein LOC101291815 [Fragaria vesca subsp. 
vesca] Length = 314 Score = 313 bits (802), Expect = 9e-83 Identities = 165/235 (70%), Positives = 176/235 (74%), Gaps = 5/235 (2%) Frame = +3 Query: 396 EEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAA-----DSIRASSFGVKGAT 560 EE E + + +E P KF LWVLFWASVS+ FA SG+A AAA DSIRASSFGVK A Sbjct: 70 EEDEVLSIFEEKPVKFGLWVLFWASVSLAWFAASGDANAAANAAAADSIRASSFGVKIAN 129 Query: 561 ALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYL 740 ALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQL P +LTVL+VLGNMVPVP IILYL Sbjct: 130 ALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLTPVMLTVLAVLGNMVPVPIIILYL 189 Query: 741 KSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVA 920 K FATFLAGKN + S FLD LFE+AK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGAI+A Sbjct: 190 KRFATFLAGKNNATSRFLDLLFEKAKKKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIIA 249 Query: 921 SILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 SILDM FWS +SA KYA STFMWSILR Sbjct: 250 SILDMPFWSAVSANFFGVVLAGLLVNLLVNLGLKYAIVTGIALFFISTFMWSILR 304 >ref|XP_002520148.1| conserved hypothetical protein [Ricinus communis] gi|223540640|gb|EEF42203.1| conserved hypothetical protein [Ricinus communis] Length = 401 Score = 311 bits (797), Expect = 3e-82 Identities = 165/263 (62%), Positives = 197/263 (74%), Gaps = 3/263 (1%) Frame = +3 Query: 180 MLLTLTPGKSHFKFTPPSPRIC--FPSFPDQ-QIRIKFKSFPTFYTHNHLGHFETSSFPI 350 +LL+ + K++ +F P + +PS + Q +K K F +F T N S+ P Sbjct: 8 LLLSASFRKTYLRFLPNHVKNLNLYPSIAQKKQSFVKSKPFLSFQTVNFPSCNPLSAAPF 67 Query: 351 ARAIRVASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIR 530 +S+ FL + E++E + +E P KFL WV+FWASVS+ FAVS +A AA DSI+ Sbjct: 68 TTTRASSSHGFLNSAEDEEILPSFEEKPVKFLFWVVFWASVSLAWFAVSRDANAAVDSIK 127 Query: 531 ASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNM 710 ASSFG+K A +LRGLGWPDEAVVFALATLPVIELRGAIPVGYW+QLKP +LTVLSV GNM Sbjct: 128 ASSFGLKIANSLRGLGWPDEAVVFALATLPVIELRGAIPVGYWMQLKPLILTVLSVAGNM 187 Query: 711 VPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPG 890 VPVPFIILYLK FA+FLAG+N+SAS FLD LFE AK KA PVEEFQWLGLMLFVAVPFPG Sbjct: 188 
VPVPFIILYLKRFASFLAGRNQSASRFLDMLFENAKQKADPVEEFQWLGLMLFVAVPFPG 247 Query: 891 TGAWTGAIVASILDMTFWSGLSA 959 TGAWTGAI+ASILDM FW +SA Sbjct: 248 TGAWTGAIIASILDMPFWPAVSA 270 >gb|EXB63806.1| hypothetical protein L484_021078 [Morus notabilis] gi|587990949|gb|EXC75168.1| hypothetical protein L484_000412 [Morus notabilis] Length = 323 Score = 308 bits (790), Expect = 2e-81 Identities = 174/303 (57%), Positives = 200/303 (66%), Gaps = 9/303 (2%) Frame = +3 Query: 204 KSHFKFTPPS--PRICFP--SFPDQQIRIKFKSFPTFYTHNHLGHFETSSFPIARAIRVA 371 K H + +P P I FP S P+ + TF T HL SS + + Sbjct: 21 KIHRRISPNHKVPTIIFPKKSLPNPN------ALATFQTSPHLKPPRASS----SSSYSS 70 Query: 372 SNDFLETIEEKEKILV-----SKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRAS 536 S +T EE + ++ + P KF WVLFWAS+S+ FA S +A AAADSI+AS Sbjct: 71 SGGLHDTAEENDTDIIITASFDHQRPVKFAFWVLFWASLSLLWFATSKDANAAADSIKAS 130 Query: 537 SFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVP 716 SFG+K A ALRG GWPDEAVVFALATLP++ELRGAIPVGYW+QLKP +LTVLSVLGNMVP Sbjct: 131 SFGLKIANALRGSGWPDEAVVFALATLPLLELRGAIPVGYWMQLKPVVLTVLSVLGNMVP 190 Query: 717 VPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTG 896 VPFIILYLKSFA+FLAGKNK+AS +D LF+ AKAKAGPVEEFQWLGLMLFVAVPFPGTG Sbjct: 191 VPFIILYLKSFASFLAGKNKTASRLIDLLFKNAKAKAGPVEEFQWLGLMLFVAVPFPGTG 250 Query: 897 AWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWS 1076 AWTGA +A+ILDM FWSG SA KYA STFMWS Sbjct: 251 AWTGAFIAAILDMPFWSGFSANFIGVVLAGLLVNLLVNLGLKYAIITGIILFFVSTFMWS 310 Query: 1077 ILR 1085 ILR Sbjct: 311 ILR 313 >gb|EOY00205.1| Uncharacterized protein TCM_009967 [Theobroma cacao] Length = 316 Score = 308 bits (788), Expect = 4e-81 Identities = 176/317 (55%), Positives = 207/317 (65%), Gaps = 6/317 (1%) Frame = +3 Query: 153 MAYSLARPWMLLTLTPGKSHFKFTPPSPRICFPSFPDQQIRIKFKSFPTFYTHNHLGHFE 332 MA S + LL L P +PRI FP+ QI F + + +++ Sbjct: 1 MAASASAATSLLVLAPS-----LRKTNPRI-FPT----QIHWPTTRSKQFLSRSKFQNWQ 50 Query: 333 TSSFPIARAIRVASNDFLETIE---EKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGE 503 P+ R +SN FL+T EKE + +E P KFL WV+ WAS+S+ FA 
S + Sbjct: 51 RFPLPLT-ITRASSNVFLDTAHTSREKEILPTFEEKPVKFLFWVVLWASLSLVWFAASSD 109 Query: 504 AKA---AADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKP 674 A A AADSIRASSFG+K A+ALRG GWPDEAVVF LATLP++ELRGAIPVGYW+QLKP Sbjct: 110 ANASAAAADSIRASSFGLKIASALRGSGWPDEAVVFTLATLPILELRGAIPVGYWMQLKP 169 Query: 675 TLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWL 854 LLT+LS+LGNMVPVPFIILYLK FATFLAG+N+SAS L+ +FE+AK KAGPVEEFQWL Sbjct: 170 RLLTILSILGNMVPVPFIILYLKRFATFLAGRNQSASGLLNMIFEKAKEKAGPVEEFQWL 229 Query: 855 GLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXX 1034 GLMLFVAVPFPGTGAWTG I+ASILDM FWS +SA KYA Sbjct: 230 GLMLFVAVPFPGTGAWTGGIIASILDMPFWSAVSANFFGVVLAGLLVNLLVNMGLKYAIV 289 Query: 1035 XXXXXXXXSTFMWSILR 1085 STFMWSILR Sbjct: 290 TGIILFFISTFMWSILR 306 >ref|XP_004135199.1| PREDICTED: uncharacterized protein LOC101204187 [Cucumis sativus] gi|449478468|ref|XP_004155326.1| PREDICTED: uncharacterized LOC101204187 [Cucumis sativus] Length = 315 Score = 305 bits (782), Expect = 2e-80 Identities = 166/254 (65%), Positives = 184/254 (72%), Gaps = 7/254 (2%) Frame = +3 Query: 345 PIARAIRV-------ASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGE 503 P++R R+ +SN FLE + E I +E P K LL VLFWAS+S+ FA SG+ Sbjct: 55 PVSRTSRIIRTVPRSSSNGFLE---DDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGD 111 Query: 504 AKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLL 683 AKAA DSIRAS+FG+K A+AL+ GWP EAVVFALATLPVIELRGAIPVGYW+QLKP L Sbjct: 112 AKAAVDSIRASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVAL 171 Query: 684 TVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLM 863 TVLSVLGNMVPVPFIILYLK FATFLAG+N SAS FLD LF+RAK KA PVEEFQWLGLM Sbjct: 172 TVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLM 231 Query: 864 LFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXX 1043 LFVAVPFPGTGAWTGAI+ASILDM FWSG+SA K A Sbjct: 232 LFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGV 291 Query: 1044 XXXXXSTFMWSILR 1085 STFMWSILR Sbjct: 292 ILFIISTFMWSILR 305 >gb|ESW29572.1| 
hypothetical protein PHAVU_002G080900g [Phaseolus vulgaris] Length = 317 Score = 295 bits (754), Expect = 3e-77 Identities = 155/278 (55%), Positives = 187/278 (67%), Gaps = 4/278 (1%) Frame = +3 Query: 264 QQIRIKFKSFPTFYTHNHLGHFETS----SFPIARAIRVASNDFLETIEEKEKILVSKET 431 Q+ R KS +F T + HF S P+ + R +S++ + +E E++L+S E Sbjct: 31 QKGRESLKSNFSFSTLHGSPHFRPSIAISPSPLTQT-RASSDECFDPADEAERLLLSGEK 89 Query: 432 PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611 P KF WV+FWAS+S+ FAVS +A AA DSI+AS FG+ A +LR LGWPD VVF LA Sbjct: 90 PVKFAFWVIFWASLSLAWFAVSKDANAAVDSIKASGFGLNIANSLRKLGWPDGVVVFTLA 149 Query: 612 TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791 TLPV+ELRGAIPVGYW+QL PT LT+LS+LGNMVPVPFI+LYLK FA+FLA ++ S Sbjct: 150 TLPVLELRGAIPVGYWMQLNPTTLTILSILGNMVPVPFIVLYLKRFASFLAARSSYVSRL 209 Query: 792 LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971 LD LFE AK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGA +A+ILDM FW+ +SA Sbjct: 210 LDMLFENAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAFIAAILDMPFWAAVSANFFG 269 Query: 972 XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 KYA STFMWSILR Sbjct: 270 VVFAGLLVNLIVNLGLKYAIITGIILFFVSTFMWSILR 307 >ref|XP_004489960.1| PREDICTED: uncharacterized protein LOC101489688 [Cicer arietinum] Length = 320 Score = 290 bits (743), Expect = 6e-76 Identities = 148/242 (61%), Positives = 175/242 (72%) Frame = +3 Query: 360 IRVASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASS 539 IRV+S + L+ I+E E++++ E P KF +WV+FWAS+S+ FA S +A AA DSI+AS Sbjct: 66 IRVSSVECLDAIDEPERLMLYDEKPVKFAIWVIFWASMSLAWFAYSKDANAAVDSIKASG 125 Query: 540 FGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPV 719 FG+K A +LR G PD VVF LATLPV+ELRGAIPVGYWLQL P LTV+S++GNMVPV Sbjct: 126 FGLKIANSLRKFGLPDWVVVFTLATLPVLELRGAIPVGYWLQLNPATLTVVSIIGNMVPV 185 Query: 720 PFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGA 899 PFIILYLK FA+FLA K+ SAS FLD LF+ AK KAGPVEEFQWLGLMLFVAVPFPGTGA Sbjct: 186 PFIILYLKRFASFLASKSPSASRFLDILFKNAKEKAGPVEEFQWLGLMLFVAVPFPGTGA 245 Query: 900 
WTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSI 1079 W+GAI+ASILDM FW +SA KYA STFMW+I Sbjct: 246 WSGAIIASILDMPFWIAVSANFFGVVFAGLLVNLLVNLGLKYAIITGIVLFFVSTFMWTI 305 Query: 1080 LR 1085 LR Sbjct: 306 LR 307 >ref|XP_006395772.1| hypothetical protein EUTSA_v10004613mg [Eutrema salsugineum] gi|557092411|gb|ESQ33058.1| hypothetical protein EUTSA_v10004613mg [Eutrema salsugineum] Length = 318 Score = 290 bits (742), Expect = 8e-76 Identities = 153/242 (63%), Positives = 177/242 (73%), Gaps = 3/242 (1%) Frame = +3 Query: 369 ASNDFLETIEEKEKILVSKE---TPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASS 539 +SN FL EE+E+I+ P KF + V+FWAS S+ FA SG+AKAAADSI++SS Sbjct: 66 SSNGFLGKTEEEEEIIKLPSIGVNPLKFAICVVFWASFSLLWFARSGDAKAAADSIKSSS 125 Query: 540 FGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPV 719 FG++ A LR GWPDEAVVFALATLPVIELRGAIPVGYW+QLKPT+LT SVLGNMVPV Sbjct: 126 FGLRIAATLRRFGWPDEAVVFALATLPVIELRGAIPVGYWMQLKPTVLTFFSVLGNMVPV 185 Query: 720 PFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGA 899 P IILYLK FA+FLAGK+++AS L+ LF+RAK KAGPVEEFQWLGLMLFVAVPFPGTGA Sbjct: 186 PVIILYLKKFASFLAGKSRTASKLLEILFKRAKEKAGPVEEFQWLGLMLFVAVPFPGTGA 245 Query: 900 WTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSI 1079 WTGAI+ASILDM FWS +S+ K A STFMWS+ Sbjct: 246 WTGAIIASILDMPFWSAVSSNFCGVVLAGLLVNFLVNLGLKEAIVAGIALFFVSTFMWSV 305 Query: 1080 LR 1085 LR Sbjct: 306 LR 307 >ref|NP_001240072.1| uncharacterized protein LOC100813905 [Glycine max] gi|255635459|gb|ACU18082.1| unknown [Glycine max] Length = 321 Score = 287 bits (734), Expect = 7e-75 Identities = 160/304 (52%), Positives = 193/304 (63%), Gaps = 11/304 (3%) Frame = +3 Query: 207 SHFKFTPP----SPRICFPSFPDQQIRIKFKSFPTFYTHNHLGHFET------SSFPIAR 356 S F F P SP P Q+ + KS +F T N HF S+ + R Sbjct: 10 SPFHFRKPHNRVSPLNAHPLILIQKGKQSLKSNFSFSTLNASPHFRPPIAIAPSTLTLTR 69 Query: 357 AIRVASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADS-IRA 533 A +S++ + E +++L+S+E P F WV+FWAS+S+ FAVS +A AA +S I+A Sbjct: 70 
AS--SSDECFDPAGEAQRLLLSEEKPVNFAFWVIFWASLSLAWFAVSRDANAAVESSIKA 127 Query: 534 SSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMV 713 S FG A +LR LGWPD VVF LATLPV+ELRGAIPVGYW+QL P LTVLS+LGNMV Sbjct: 128 SGFGFNIANSLRKLGWPDWVVVFTLATLPVLELRGAIPVGYWMQLNPVTLTVLSILGNMV 187 Query: 714 PVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGT 893 PVPFI+LYLK A+F+A ++ SAS FLD LFE AK KAGPVEEFQWLGLMLFVAVPFPGT Sbjct: 188 PVPFIVLYLKKIASFVAARSPSASRFLDMLFENAKEKAGPVEEFQWLGLMLFVAVPFPGT 247 Query: 894 GAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMW 1073 GAWTGA +ASILDM FW+ +SA KYA STFMW Sbjct: 248 GAWTGAFIASILDMPFWAAVSANFFGVVFAGLLVNLLVNLGLKYAIITGVILFFVSTFMW 307 Query: 1074 SILR 1085 S+LR Sbjct: 308 SVLR 311 >gb|EPS63886.1| hypothetical protein M569_10896, partial [Genlisea aurea] Length = 243 Score = 286 bits (731), Expect = 2e-74 Identities = 151/239 (63%), Positives = 172/239 (71%) Frame = +3 Query: 369 ASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGV 548 AS E IE+ E + P KFLL VL WASVSIG +A SG+AKAA+DSIRAS FG+ Sbjct: 2 ASRGLTEYIEKAEPDV--DVNPAKFLLMVLLWASVSIGFYAFSGDAKAASDSIRASGFGI 59 Query: 549 KGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFI 728 K A+ALR GWP+EA+VF+LATLPVIELRGAIPVGYWL LKP LT+LS+LGNMVPVPFI Sbjct: 60 KVASALRASGWPNEAIVFSLATLPVIELRGAIPVGYWLHLKPLTLTLLSILGNMVPVPFI 119 Query: 729 ILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTG 908 +LYLK AT+L NK++S FL+ L +RAK KAGPVEEFQWLGLMLFVAVPFPGTGAWTG Sbjct: 120 LLYLKKLATYLTSDNKTSS-FLEMLLKRAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTG 178 Query: 909 AIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 AIVAS+LDM FW G+SA K+A STFMW ILR Sbjct: 179 AIVASVLDMPFWEGVSANLAGVVLAGLLVNLLVNLGVKHAIFTGVLLFGFSTFMWRILR 237 >ref|XP_006291472.1| hypothetical protein CARUB_v10017608mg [Capsella rubella] gi|482560179|gb|EOA24370.1| hypothetical protein CARUB_v10017608mg [Capsella rubella] Length = 332 Score = 281 bits (720), Expect = 3e-73 Identities = 143/218 (65%), Positives = 165/218 (75%) Frame = +3 
Query: 432 PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611 P KF + V+ WAS+S+ FA SG+AKAA DSI++SSFG++ A LR GWPDEAVVFALA Sbjct: 104 PVKFAVCVVLWASLSLLWFARSGDAKAATDSIKSSSFGLRIAATLRRFGWPDEAVVFALA 163 Query: 612 TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791 TLPVIELRGAIPVGYW+QLKPT+LT SVLGNMVPVPFI+LYLK FA+FLAGK+++AS Sbjct: 164 TLPVIELRGAIPVGYWMQLKPTVLTFFSVLGNMVPVPFIVLYLKKFASFLAGKSQTASKL 223 Query: 792 LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971 LD LF+RAK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGAI+ASIL+M FWS +S+ Sbjct: 224 LDILFKRAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILEMPFWSAVSSNFCG 283 Query: 972 XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 K A STFMWS+LR Sbjct: 284 VVLAGLLVNLLVNLGLKQAIVAGIALFFVSTFMWSVLR 321 >ref|NP_178363.1| uncharacterized protein [Arabidopsis thaliana] gi|3184280|gb|AAC18927.1| putative transport protein [Arabidopsis thaliana] gi|21554968|gb|AAM63742.1| putative transport protein [Arabidopsis thaliana] gi|61742564|gb|AAX55103.1| hypothetical protein At2g02590 [Arabidopsis thaliana] gi|110741534|dbj|BAE98716.1| putative transport protein [Arabidopsis thaliana] gi|330250508|gb|AEC05602.1| uncharacterized protein AT2G02590 [Arabidopsis thaliana] Length = 324 Score = 278 bits (712), Expect = 2e-72 Identities = 141/218 (64%), Positives = 165/218 (75%) Frame = +3 Query: 432 PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611 P KF + V+ WAS S+ FA SG+AKAA DSI++SSFG++ A+ LR GWPDEAVVFALA Sbjct: 96 PVKFAICVVLWASFSLLWFARSGDAKAATDSIKSSSFGLRIASTLRRFGWPDEAVVFALA 155 Query: 612 TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791 TLPVIELRGAIPVGYW+QLKP +LT SVLGNMVPVPFI+LYLK+FA+F+AGK+++AS Sbjct: 156 TLPVIELRGAIPVGYWMQLKPVVLTSFSVLGNMVPVPFIVLYLKTFASFVAGKSQTASKL 215 Query: 792 LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971 LD LF+RAK KAGPVEEF+WLGLMLFVAVPFPGTGAWTGAI+ASILDM FWS +S+ Sbjct: 216 LDILFKRAKEKAGPVEEFKWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSAVSSNFCG 275 Query: 972 
XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 K A STFMWS+LR Sbjct: 276 VVLAGLLVNLLVNLGLKQAIVAGIALFFVSTFMWSVLR 313 >gb|AAT68342.1| hypothetical protein At2g02590 [Arabidopsis thaliana] Length = 324 Score = 278 bits (712), Expect = 2e-72 Identities = 141/218 (64%), Positives = 165/218 (75%) Frame = +3 Query: 432 PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611 P KF + V+ WAS S+ FA SG+AKAA DSI++SSFG++ A+ LR GWPDEAVVFALA Sbjct: 96 PVKFAICVVLWASFSLLWFARSGDAKAATDSIKSSSFGLRIASTLRRFGWPDEAVVFALA 155 Query: 612 TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791 TLPVIELRGAIPVGYW+QLKP +LT SVLGNMVPVPFI+LYLK+FA+F+AGK+++AS Sbjct: 156 TLPVIELRGAIPVGYWMQLKPVVLTSFSVLGNMVPVPFIVLYLKTFASFVAGKSQTASKL 215 Query: 792 LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971 LD LF+RAK KAGPVEEF+WLGLMLFVAVPFPGTGAWTGAI+ASILDM FWS +S+ Sbjct: 216 LDILFKRAKEKAGPVEEFKWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSAVSSNFCG 275 Query: 972 XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085 K A STFMWS+LR Sbjct: 276 VVLAGLLVNLLVNLGLKQAIVAGIALFFVSTFMWSVLR 313