BLASTX nr result
ID: Akebia22_contig00016530
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00016530
         (1820 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CAN68624.1| hypothetical protein VITISV_010682 [Vitis vinifera]   367   1e-98
emb|CBI21048.3| unnamed protein product [Vitis vinifera]              361   5e-97
ref|XP_002285150.2| PREDICTED: uncharacterized protein LOC100266...   361   7e-97
ref|XP_007011854.1| Uncharacterized protein isoform 3 [Theobroma...   330   2e-87
ref|XP_002282244.1| PREDICTED: uncharacterized protein LOC100250...   313   1e-82
gb|EXB96866.1| hypothetical protein L484_016640 [Morus notabilis]     300   1e-78
ref|XP_007011852.1| Uncharacterized protein isoform 1 [Theobroma...   298   7e-78
ref|XP_004288790.1| PREDICTED: uncharacterized protein LOC101293...   293   1e-76
ref|XP_007011853.1| Uncharacterized protein isoform 2 [Theobroma...   290   1e-75
ref|XP_006450882.1| hypothetical protein CICLE_v10008357mg [Citr...   288   7e-75
ref|XP_002309630.1| hypothetical protein POPTR_0006s27080g [Popu...   288   7e-75
ref|XP_006600747.1| PREDICTED: uncharacterized protein LOC100806...   287   1e-74
ref|XP_007202061.1| hypothetical protein PRUPE_ppa006809mg [Prun...   286   2e-74
ref|XP_004139410.1| PREDICTED: uncharacterized protein LOC101205...   285   5e-74
ref|XP_002324862.2| hypothetical protein POPTR_0018s01770g [Popu...   283   2e-73
ref|XP_007222724.1| hypothetical protein PRUPE_ppa006474mg [Prun...   277   1e-71
ref|XP_002527188.1| conserved hypothetical protein [Ricinus comm...   276   2e-71
ref|XP_002324645.2| hypothetical protein POPTR_0018s12880g [Popu...   270   1e-69
ref|XP_007013493.1| Uncharacterized protein TCM_038116 [Theobrom... 
267 1e-68 ref|XP_002515508.1| conserved hypothetical protein [Ricinus comm... 263 2e-67 >emb|CAN68624.1| hypothetical protein VITISV_010682 [Vitis vinifera] Length = 526 Score = 367 bits (942), Expect = 1e-98 Identities = 216/444 (48%), Positives = 259/444 (58%), Gaps = 8/444 (1%) Frame = +2 Query: 341 FELLLKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKL 520 F +MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV EIHSSATKKNKEW+EKL Sbjct: 19 FNFNKRMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKL 78 Query: 521 PVVVLKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALI 700 P+VVLKAEEIMYSKANSEAEYMD++TLWDR NDAIN LQPCIEA+L Sbjct: 79 PIVVLKAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLN 138 Query: 701 LGCTARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGS---SHFLPHHSS-- 865 LGC R ASRSQRNNN RCYL+P+TQEP S+ P + +N QG S + +++ Sbjct: 139 LGCPQRRASRSQRNNNPRCYLTPSTQEPISISP--SILENSPQGNHTTISQVMSRYATFI 196 Query: 866 NPTSSPKFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSN 1045 P+S P G E + H N+ T E +P+SN Sbjct: 197 KPSSMSVIQP---GLEPHSTAFHNNDCPT----XKFLFSSENCPPSGNKCLQMEVYPASN 249 Query: 1046 LARVYPLYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDE 1216 L VYPLY G Q E Q GF + P SN +EP +Q+LF Sbjct: 250 LCAVYPLYDGNQLQCEESQCGFGVQSHPKSN-----------PMEPAGMGTIQNLF--SY 296 Query: 1217 AVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXX 1396 A++ + + D EN P+I CDLSLRLG +S P + WP+E EDVG Sbjct: 297 AIDPTKKPSQTDFGHVTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREG 356 Query: 1397 XXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAG 1576 DLS R+D +F FFPR N DDPL+SC SKRSSEG+ LN E + RKRK S + Sbjct: 357 SKFSDLSPRVDKQFPFFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDR 416 Query: 1577 QFSQKPKFSQNICFSRMKKPDQ*R 1648 QF +PK N RM+ D+ R Sbjct: 417 QFCCQPKLPYNYLPGRMRNADEGR 440 >emb|CBI21048.3| unnamed protein product [Vitis vinifera] Length = 451 Score = 361 bits (927), Expect = 5e-97 Identities = 210/435 (48%), Positives = 255/435 (58%), Gaps = 8/435 (1%) Frame = +2 Query: 359 
MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV EIHSSATKKNKEW+EKLP+VVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYMD++TLWDR NDAIN LQPCIEA+L LGC R Sbjct: 61 AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNLGCPQR 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGS---SHFLPHHSS--NPTSSP 883 ASRSQRNNN RCYL+P+TQEP S+ P + +N QG S + +++ P+S Sbjct: 121 RASRSQRNNNPRCYLTPSTQEPISISP--SILENSPQGNHTTISQVMSRYATFIKPSSMS 178 Query: 884 KFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYP 1063 P G E + H N+ T+ E +P+SN+ VYP Sbjct: 179 VIQP---GLEPHSTAFHNNDCPTS----KFLFSSENCPPSGNKCLQMEVYPASNVCAVYP 231 Query: 1064 LYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASN 1234 LY G Q E Q GF + P SN +EP +Q+LF A++ + Sbjct: 232 LYDGNQLQCEESQCGFGVQSHPKSN-----------PMEPAGMGTIQNLF--SYAIDPTK 278 Query: 1235 SMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414 + D EN P+I CDLSLRLG +S P + WP+E EDVG DL Sbjct: 279 KPSQTDFGHVTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDL 338 Query: 1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKP 1594 S ++D +F FFPR N DDPL+SC SKRSSEG+ LN E + RKRK S + QF +P Sbjct: 339 SPQVDKQFPFFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQP 398 Query: 1595 KFSQNICFSRMKKPD 1639 K N RM+ + Sbjct: 399 KLPYNYLPGRMRNAE 413 >ref|XP_002285150.2| PREDICTED: uncharacterized protein LOC100266444 [Vitis vinifera] Length = 414 Score = 361 bits (926), Expect = 7e-97 Identities = 210/432 (48%), Positives = 254/432 (58%), Gaps = 8/432 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV EIHSSATKKNKEW+EKLP+VVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYMD++TLWDR NDAIN LQPCIEA+L LGC 
R Sbjct: 61 AEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESTETGEFLQPCIEASLNLGCPQR 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGS---SHFLPHHSS--NPTSSP 883 ASRSQRNNN RCYL+P+TQEP S+ P + +N QG S + +++ P+S Sbjct: 121 RASRSQRNNNPRCYLTPSTQEPISISP--SILENSPQGNHTTISQVMSRYATFIKPSSMS 178 Query: 884 KFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYP 1063 P G E + H N+ T+ E +P+SN+ VYP Sbjct: 179 VIQP---GLEPHSTAFHNNDCPTS----KFLFSSENCPPSGNKCLQMEVYPASNVCAVYP 231 Query: 1064 LYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASN 1234 LY G Q E Q GF + P SN +EP +Q+LF A++ + Sbjct: 232 LYDGNQLQCEESQCGFGVQSHPKSN-----------PMEPAGMGTIQNLF--SYAIDPTK 278 Query: 1235 SMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414 + D EN P+I CDLSLRLG +S P + WP+E EDVG DL Sbjct: 279 KPSQTDFGHVTENSPKIDCDLSLRLGPLSIPCVSVENSWPQEFEDVGSSCSREGSKFSDL 338 Query: 1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKP 1594 S ++D +F FFPR N DDPL+SC SKRSSEG+ LN E + RKRK S + QF +P Sbjct: 339 SPQVDKQFPFFPRGNTDDPLDSCLSKRSSEGENLNMEATMRKRKAVISYPLEDRQFCCQP 398 Query: 1595 KFSQNICFSRMK 1630 K N RM+ Sbjct: 399 KLPYNYLPGRMR 410 >ref|XP_007011854.1| Uncharacterized protein isoform 3 [Theobroma cacao] gi|508782217|gb|EOY29473.1| Uncharacterized protein isoform 3 [Theobroma cacao] Length = 447 Score = 330 bits (845), Expect = 2e-87 Identities = 210/440 (47%), Positives = 244/440 (55%), Gaps = 14/440 (3%) Frame = +2 Query: 353 LKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVV 532 LKMPRPGPRPY C RRAWHSDRHQPMRGSLI+EIFRVV EIHSSATKKNKEW+EKLPVVV Sbjct: 41 LKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVV 100 Query: 533 LKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCT 712 LKAEEIMYSKANSEAEYMD+++LWDRTNDAIN LLQPCIEAAL LGCT Sbjct: 101 LKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCT 160 Query: 713 ARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM 892 R RSQRN N RCYLSP TQE +N TQ +N T++P FM Sbjct: 161 
PRRTLRSQRNCNPRCYLSPGTQE----------AENTTQ-----------ANLTTNPNFM 199 Query: 893 PYY-------------LGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAH 1033 Y LGSES I +N TT E + Sbjct: 200 ASYSGFMKSTIMNVTHLGSESQKHIAQDSNCTT---YKFPFASENGPLPSNSQCLPMEKY 256 Query: 1034 PSSNLARVYPLYYGKLQQTTEPQFGFQI-PHSNAASLGTPHVQFSVEPTEKSFLQDLFPR 1210 P NL VYPLYYG + E Q GF I P S + +VEP + + +LF Sbjct: 257 PPPNLYSVYPLYYGNHLKFEEMQHGFGIFPKS---------ISNTVEPAKMGVIDNLFSS 307 Query: 1211 DEAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXX 1390 D V++SN+M + D+S T NP E CDLSLRLG +S P + G P+ +ED G Sbjct: 308 D--VDSSNNMNQTDVSNTSNNPHENACDLSLRLGPLSIPCLSVGKSRPQVIEDTGSTSLE 365 Query: 1391 XXXXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVV 1570 DL+ ID FPR N DDPL S ++ S EG+ +N + + RKRK V Sbjct: 366 WNRFG-DLTPSIDKMLSSFPRSNRDDPLNSSLNRWSLEGEHVNVDATMRKRKTVYGP-TV 423 Query: 1571 AGQFSQKPKFSQNICFSRMK 1630 QF PK + RMK Sbjct: 424 DQQFCLPPKLPYSHLTGRMK 443 >ref|XP_002282244.1| PREDICTED: uncharacterized protein LOC100250879 [Vitis vinifera] Length = 424 Score = 313 bits (803), Expect = 1e-82 Identities = 187/419 (44%), Positives = 228/419 (54%), Gaps = 6/419 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRVV + HSSATKKN+EW+EKLP+VVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVTDTHSSATKKNREWQEKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSE EYMD+ TLWDR NDA+N LL PCIEAAL LGC Sbjct: 61 AEEIMYSKANSETEYMDLGTLWDRVNDAVNTIIRRDESTETGELLPPCIEAALNLGCVPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM-- 892 ASRSQR+NN R YL+ TQEPTSV P RV DN P + N + + Sbjct: 121 RASRSQRHNNPRSYLTHRTQEPTSVSP--RVLDNAVNERCPQLQPPSAGNQLTFGRLNMD 178 Query: 893 PYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYY 1072 +L +S +T N+ TT + E + N VYPLYY Sbjct: 179 STHLVLDSDRHVTQNNSLATTRN---FHFPYENFPLGSNQSMTVETNTPLNFGSVYPLYY 235 Query: 1073 
GKLQQTTEPQFGFQIP---HSNAASLGTPHVQFSVEPTEKS-FLQDLFPRDEAVNASNSM 1240 G Q E GFQ+P ++N +G P EP+E LQ+LF D N N Sbjct: 236 GTHFQNEESHLGFQMPETANANTVFVGAPIGTSIAEPSEMGIILQNLFSSDGTENVLNKN 295 Query: 1241 IRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSL 1420 + + T P +CDLSLRLG S P + EDVG LS Sbjct: 296 AQENFRDTCGKEPVAECDLSLRLGLSSDPCMRKEKCSAPDTEDVGSSSSQEGAKVSGLSP 355 Query: 1421 RIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPK 1597 FCFFP + + P SCS+K +S +G N + + RKRK P ++ + GQF P+ Sbjct: 356 GKSKGFCFFPSETANSPFGSCSNKWNSGDEGQNMDATVRKRKAPFNNDLEGGQFFLSPE 414 >gb|EXB96866.1| hypothetical protein L484_016640 [Morus notabilis] Length = 409 Score = 300 bits (768), Expect = 1e-78 Identities = 188/425 (44%), Positives = 225/425 (52%), Gaps = 8/425 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRV E HS+ TKKNKEW+EKLP+VVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSVIQQIFRVANEAHSATTKKNKEWQEKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEY +++TLWDR NDAIN LL PC+EAAL LGC Sbjct: 61 AEEIMYSKANSEAEYTNLDTLWDRVNDAINTIIRREETTETGDLLPPCVEAALNLGCIPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTS---SPKF 889 ASRSQR++N R YL+ EP S +RV D + LP HS N + +P Sbjct: 121 RASRSQRHSNPRTYLTARAHEPFSA--GTRVLDRTSDERRPQLLPLHSGNQLTFARAPIA 178 Query: 890 MPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069 P SES T + NN T P + + S NL VYPLY Sbjct: 179 NPANFLSESNTHVNRNNNNLTAPR--SHAFSPENVVSGHSQATTIDTNASLNLGSVYPLY 236 Query: 1070 YGKLQQTTEPQFGFQIP---HSNAASLGTPHV--QFSVEPTEKSFLQDLFPRDEAVNASN 1234 +G +T + G IP HS +GTP V + EPT +QD F R +A+ Sbjct: 237 HGTNYRTEKYHSGSPIPENVHSKTIYVGTPVVTPAATAEPT----MQDCFTRVDAM---- 288 Query: 1235 SMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414 T ENP E +CDLSLRL S P+ + E EDVG D+ Sbjct: 289 --------GTQENPQEAECDLSLRLSLFSNPFGRTQKNFATETEDVGSSSSQDAGKVNDV 340 Query: 1415 
SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKP 1594 + EFCFFP + D ES S +S G+G N E RK K S GQF +P Sbjct: 341 RQSMGREFCFFPGKSACDLSESSSRMWNSGGEGQNLEAFVRKGKETFSSNEKDGQFCWQP 400 Query: 1595 KFSQN 1609 N Sbjct: 401 GVPSN 405 >ref|XP_007011852.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508782215|gb|EOY29471.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 417 Score = 298 bits (762), Expect = 7e-78 Identities = 197/440 (44%), Positives = 230/440 (52%), Gaps = 14/440 (3%) Frame = +2 Query: 353 LKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVV 532 LKMPRPGPRPY C RRAWHSDRHQPMRGSLI+EIFRVV EIHSSATKKNKEW+EKLPVVV Sbjct: 41 LKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVV 100 Query: 533 LKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCT 712 LKAEEIMYSKANSEAEYMD+++LWDRTNDAIN LLQPCIEAAL LGCT Sbjct: 101 LKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCT 160 Query: 713 ARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM 892 R RSQRN N RCYLSP TQE +N TQ +N T++P FM Sbjct: 161 PRRTLRSQRNCNPRCYLSPGTQE----------AENTTQ-----------ANLTTNPNFM 199 Query: 893 PYY-------------LGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAH 1033 Y LGSES I +N TT E + Sbjct: 200 ASYSGFMKSTIMNVTHLGSESQKHIAQDSNCTT---YKFPFASENGPLPSNSQCLPMEKY 256 Query: 1034 PSSNLARVYPLYYGKLQQTTEPQFGFQI-PHSNAASLGTPHVQFSVEPTEKSFLQDLFPR 1210 P NL VYPLYYG + E Q GF I P S + +VEP + + +LF Sbjct: 257 PPPNLYSVYPLYYGNHLKFEEMQHGFGIFPKS---------ISNTVEPAKMGVIDNLFSS 307 Query: 1211 DEAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXX 1390 D V++SN+M + D+S T NP E CDLSLRLG +S P + G P+ +ED G Sbjct: 308 D--VDSSNNMNQTDVSNTSNNPHENACDLSLRLGPLSIPCLSVGKSRPQVIEDTGSTSLE 365 Query: 1391 XXXXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVV 1570 ++ S EG+ +N + + RKRK V Sbjct: 366 W-------------------------------NRWSLEGEHVNVDATMRKRKTVYGP-TV 393 Query: 1571 AGQFSQKPKFSQNICFSRMK 1630 QF PK + RMK Sbjct: 394 DQQFCLPPKLPYSHLTGRMK 413 >ref|XP_004288790.1| PREDICTED: 
uncharacterized protein LOC101293823 [Fragaria vesca subsp. vesca] Length = 421 Score = 293 bits (751), Expect = 1e-76 Identities = 182/423 (43%), Positives = 230/423 (54%), Gaps = 6/423 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQPMRGS+I+++FR V E+HS+ TK NKEW+EKLP+VV K Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQMFRAVNEVHSAKTKNNKEWQEKLPMVVFK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEY++ +TLWDR NDAIN LL PC+EAAL LGC A Sbjct: 61 AEEIMYSKANSEAEYINSDTLWDRANDAINTIIRREEGNETGDLLPPCVEAALNLGCVAV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFL-PHHSSNPTSSPKFMP 895 ASRSQR++N R YL P QEP S PP+RV D + F PHH N ++ + P Sbjct: 121 RASRSQRHSNPRSYLMPRPQEPPS--PPTRVLDRPSDERRPPFSPPHHPGNQSNFAR--P 176 Query: 896 YYLGSESCTP--ITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069 + S P ++HAN + + NL VYPLY Sbjct: 177 STVNSAHLVPESLSHANQSSNLSSPRHYPFSVENVPGGHNQITTISTNNQLNLGSVYPLY 236 Query: 1070 YGKLQQTTEPQFGFQIP---HSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSM 1240 +G T PQ Q+P HS +GTP V EPT+K +F A N S+ + Sbjct: 237 HGFSYPTEAPQ--LQVPENVHSRTIYVGTP-VTSIQEPTKK----HIFTSQRAENVSHRI 289 Query: 1241 IRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSL 1420 + D+ E P + DLSLRLG +S ++ ++ED+G + S Sbjct: 290 PQVDVMDIQEKPRDEGYDLSLRLGPVSHLCTDRSL--ASQMEDIGSSNSQEGGKLHNYSP 347 Query: 1421 RIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKF 1600 I EFCFFP DP ES S+ +SEG+ + E + RKRK + GQF +P Sbjct: 348 SISKEFCFFPTKTAYDPSESTSNMWNSEGEDRSLEATLRKRKATFRNNEEDGQFFSQPPG 407 Query: 1601 SQN 1609 N Sbjct: 408 QPN 410 >ref|XP_007011853.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508782216|gb|EOY29472.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 359 Score = 290 bits (742), Expect = 1e-75 Identities = 178/349 (51%), Positives = 203/349 (58%), Gaps = 14/349 (4%) Frame = +2 Query: 353 LKMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVV 532 LKMPRPGPRPY C 
RRAWHSDRHQPMRGSLI+EIFRVV EIHSSATKKNKEW+EKLPVVV Sbjct: 41 LKMPRPGPRPYVCERRAWHSDRHQPMRGSLIQEIFRVVNEIHSSATKKNKEWQEKLPVVV 100 Query: 533 LKAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCT 712 LKAEEIMYSKANSEAEYMD+++LWDRTNDAIN LLQPCIEAAL LGCT Sbjct: 101 LKAEEIMYSKANSEAEYMDLKSLWDRTNDAINTIIKRDESTETGELLQPCIEAALNLGCT 160 Query: 713 ARSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM 892 R RSQRN N RCYLSP TQE +N TQ +N T++P FM Sbjct: 161 PRRTLRSQRNCNPRCYLSPGTQE----------AENTTQ-----------ANLTTNPNFM 199 Query: 893 PYY-------------LGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAH 1033 Y LGSES I +N TT E + Sbjct: 200 ASYSGFMKSTIMNVTHLGSESQKHIAQDSNCTT---YKFPFASENGPLPSNSQCLPMEKY 256 Query: 1034 PSSNLARVYPLYYGKLQQTTEPQFGFQI-PHSNAASLGTPHVQFSVEPTEKSFLQDLFPR 1210 P NL VYPLYYG + E Q GF I P S + +VEP + + +LF Sbjct: 257 PPPNLYSVYPLYYGNHLKFEEMQHGFGIFPKS---------ISNTVEPAKMGVIDNLFSS 307 Query: 1211 DEAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPR 1357 D V++SN+M + D+S T NP E CDLSLRLG +S P + G P+ Sbjct: 308 D--VDSSNNMNQTDVSNTSNNPHENACDLSLRLGPLSIPCLSVGKSRPQ 354 >ref|XP_006450882.1| hypothetical protein CICLE_v10008357mg [Citrus clementina] gi|568843993|ref|XP_006475881.1| PREDICTED: uncharacterized protein LOC102614540 [Citrus sinensis] gi|557554108|gb|ESR64122.1| hypothetical protein CICLE_v10008357mg [Citrus clementina] Length = 430 Score = 288 bits (736), Expect = 7e-75 Identities = 183/440 (41%), Positives = 225/440 (51%), Gaps = 14/440 (3%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHS+RHQPMRGS I++IFRV E HS+ TKKNKEW+EKLP+VVLK Sbjct: 1 MPRPGPRPYECVRRAWHSERHQPMRGSTIQQIFRVADEFHSTQTKKNKEWQEKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEI+YSKANSE EYM+++TL DR NDA+N LL PC+EAAL LGC Sbjct: 61 AEEILYSKANSEDEYMNLDTLRDRVNDAVNTIIRRDESTETGELLPPCVEAALNLGCIPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNP--------- 871 ASRSQR+++ R YL+ QEP +PP ++ D + F PHHS N Sbjct: 121 
RASRSQRHSHPRTYLNLRPQEPAPLPP--KIVDKTIEDQCPRFSPHHSGNQFNFSRSFKN 178 Query: 872 TSSPKFMPYYLGSESCTPITHAN--NPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSN 1045 +S +P SC I + N P++ P +A+ Sbjct: 179 ANSTTLVP----ESSCQVIGNDNLAAPSSYP------SSYENIPSRHSKMMRVDANVQLK 228 Query: 1046 LARVYPLYYGKLQQTTEPQFGFQIP---HSNAASLGTPHVQFSVEPTEKSFLQDLFPRDE 1216 L VYPLYYG QT + + G I +S +G P EP E L L+ Sbjct: 229 LGSVYPLYYGTRFQTEDTKLGSGISENVNSTTIFVGKPIGTSIPEPGEMGALPRLYSCSG 288 Query: 1217 AVNASNSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXX 1396 + AS + D P CDLSLRLG P + RE EDVG Sbjct: 289 SDTASKPTTKPDFLELQRRPHVTGCDLSLRLGLSGDPCMSLDRSSARETEDVGSIRSQEG 348 Query: 1397 XXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAG 1576 DLS + EFCFFP D P ESCSSKR E + N E S RK K SD + Sbjct: 349 NKLKDLSSERNKEFCFFPEKTADKPHESCSSKRFPEHECRNLEASIRKPKALFSDNLEDV 408 Query: 1577 QFSQKPKFSQNICFSRMKKP 1636 QF +P N +++ P Sbjct: 409 QFCWQPGPPSNQFTGQIRGP 428 >ref|XP_002309630.1| hypothetical protein POPTR_0006s27080g [Populus trichocarpa] gi|222855606|gb|EEE93153.1| hypothetical protein POPTR_0006s27080g [Populus trichocarpa] Length = 407 Score = 288 bits (736), Expect = 7e-75 Identities = 193/428 (45%), Positives = 236/428 (55%), Gaps = 4/428 (0%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFR+V E HSS TKKNKEW+EKLPVVVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHSSTTKKNKEWQEKLPVVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYM+++TLWDRTNDAIN LLQPCIEAAL LGCT R Sbjct: 61 AEEIMYSKANSEAEYMELKTLWDRTNDAINTIIRRDESTEIGELLQPCIEAALNLGCTPR 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPS-RVPDNVTQGGSSHFLPHHSS--NPTSSPKF 889 ASRSQRN N YLSP+TQEP ++ S + +SH LP++SS P Sbjct: 121 RASRSQRNCNPSFYLSPSTQEPNTLSSGSVHSAIQANRTSNSHVLPNYSSMVKPIIMNST 180 Query: 890 MPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069 P GSES + +N T+ PS L VYPLY Sbjct: 181 PP---GSESQDFVGQSNG--TSNRFLFIDDSIPLSNANQCLPLGNYRIPS--LCSVYPLY 233 
Query: 1070 YGKLQQTTEPQFGF-QIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMIR 1246 YG EPQ G +P + ++ EP + + +Q+ FP +E + Sbjct: 234 YG---CCLEPQRGCGALPKTFPGTM---------EPVKVAVMQNFFPCNE--DTPVKTCH 279 Query: 1247 GDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRI 1426 D + P EI CDLSLRLG++ AP + T+ ++ +D G D ++ Sbjct: 280 ADHKDSPLQPQEIGCDLSLRLGSLPAPMLSVKTKQLKDAKDGGHDCSQEGGKVDDWMPQV 339 Query: 1427 DNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKFSQ 1606 D E FF R NV DPL S SSK + +N + + +KRK D V QF +PK Sbjct: 340 DKELPFFTRVNVADPLVSHSSK---SREHVNIDETKKKRKA-VLDHHVEDQFCWQPKLHC 395 Query: 1607 NICFSRMK 1630 N RMK Sbjct: 396 NQLTCRMK 403 >ref|XP_006600747.1| PREDICTED: uncharacterized protein LOC100806760 [Glycine max] Length = 405 Score = 287 bits (734), Expect = 1e-74 Identities = 178/424 (41%), Positives = 224/424 (52%), Gaps = 2/424 (0%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHS+RHQP+RGS+I++IFRVV + HS ATKKNKEW+EKLPVVVLK Sbjct: 1 MPRPGPRPYECVRRAWHSERHQPVRGSIIQQIFRVVNDAHSPATKKNKEWQEKLPVVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEY++ +TLWDR NDA+N LL PC+EAAL LGC A Sbjct: 61 AEEIMYSKANSEAEYLNPDTLWDRLNDAVNTIIRRDETTETGGLLPPCVEAALNLGCKAV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898 SRS R+NN R YLSP Q+P +PP + + + P SP Sbjct: 121 RTSRSDRHNNPRTYLSPRIQQPPCLPPKPVAGNPLNYA--------KVTTPAVSP----- 167 Query: 899 YLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYYGK 1078 P+ + +P + EA PS NL VYPLYYG Sbjct: 168 -------IPVPDSIHPNSRLMGSSKYPFSEGIPSGHHQPLTMEARPSLNLGSVYPLYYGY 220 Query: 1079 LQQTTEP-QFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMIRGDL 1255 + EP S+ +G P + SV L + F + +N M + Sbjct: 221 EAREPEPWTTATDTTCSDTIFVGRPVI--SVPEPSGIGLSENFSYGTFHHVANRMRKETA 278 Query: 1256 SATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRIDNE 1435 T E P+ +CDLSLRLG P S++ + EV+DVG LSL+ + E Sbjct: 279 
VGTQEAAPDRECDLSLRLGQCLHPCSSSKSSSAYEVDDVGLGVSPESCKFSHLSLQRNRE 338 Query: 1436 FCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVP-SSDLVVAGQFSQKPKFSQNI 1612 FCF+PR+ +ES S K + EG+ LN E + RKRK P + GQF + P N Sbjct: 339 FCFYPRETGYGTIESTSGKCNVEGEDLNLEATLRKRKAPLCGNNEEDGQFCRNPGVPSNR 398 Query: 1613 CFSR 1624 SR Sbjct: 399 FTSR 402 >ref|XP_007202061.1| hypothetical protein PRUPE_ppa006809mg [Prunus persica] gi|462397592|gb|EMJ03260.1| hypothetical protein PRUPE_ppa006809mg [Prunus persica] Length = 394 Score = 286 bits (732), Expect = 2e-74 Identities = 177/427 (41%), Positives = 215/427 (50%), Gaps = 1/427 (0%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRVV E+H S TKKNKEW+EKLP+VV K Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVSEVHGSVTKKNKEWQEKLPMVVFK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYM++ETLWDR NDA+N LL PC+EAAL LGC Sbjct: 61 AEEIMYSKANSEAEYMNLETLWDRVNDAVNTIIRRDEGTETGELLPPCVEAALNLGCIPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898 ASRSQR++N R YL+ QEP S P+R+ D T F PH S N + K Sbjct: 121 RASRSQRHSNPRIYLTSRAQEPPSA--PTRILDRTTDERRPQFPPHRSGNQLNFAKASTG 178 Query: 899 YLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYYGK 1078 + N T + S +L VYPLYYG Sbjct: 179 NSAHSVPESYSRINQNTNLNSRRNDPFSRENLPAGHNQLTTMSTNNSLDLGSVYPLYYG- 237 Query: 1079 LQQTTEPQFGFQIPHSNAASLGTPHVQF-SVEPTEKSFLQDLFPRDEAVNASNSMIRGDL 1255 H QF +PT+ +LF A N S+ + + ++ Sbjct: 238 -----------------------AHYQFEECQPTK----HNLFSSQTAENVSHRITQVEV 270 Query: 1256 SATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRIDNE 1435 + P E +CDLSLRLG + P E ED+G DLS I E Sbjct: 271 ---HDKPLETECDLSLRLGPVLHPCIQRSL--ASETEDIGSSSSQDGGKLNDLSPSISKE 325 Query: 1436 FCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKFSQNIC 1615 CFFP D ES SS +SEG+G + E + RKRK P G+F ++P N Sbjct: 326 ICFFPTKTACDRFESTSSMWNSEGEGRSLEATVRKRKAPFCSNEEDGKFCEQPDVLPNRL 385 Query: 1616 FSRMKKP 1636 R P Sbjct: 386 
TDRTTGP 392 >ref|XP_004139410.1| PREDICTED: uncharacterized protein LOC101205660 [Cucumis sativus] Length = 423 Score = 285 bits (729), Expect = 5e-74 Identities = 178/412 (43%), Positives = 218/412 (52%), Gaps = 14/412 (3%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQPMRGS+I++IFRVV E H+ ATKKNKEW+EKLP+VV + Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRVVNENHTPATKKNKEWQEKLPIVVFR 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSE EYM++ETLWDR NDA+N LL PC+EAAL LGC Sbjct: 61 AEEIMYSKANSEVEYMNLETLWDRLNDAVNTIIRRDESSESGELLPPCVEAALNLGCVPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFM-- 892 ASRSQR++N R YL+P QEPTS + + + LP +P + F Sbjct: 121 RASRSQRHSNPRTYLTPRGQEPTST-----LATTLNKATDERRLPVSPLHPGNQLNFARA 175 Query: 893 ----PYYLGSESCTPITHANNPT--TTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLAR 1054 + SE + I NNPT +TP E + NL Sbjct: 176 KSMNSSFFASERSSQIKQHNNPTIPSTP-----AFLIENVPVVHNNYSMTETNTPLNLGS 230 Query: 1055 VYPLYYGKLQQTTEPQFGFQI---PHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVN 1225 VYPLYYG QT EP QI + LG P + S EP E S + Sbjct: 231 VYPLYYGIRCQTEEPNLSSQISADANQQTIFLGRPIIS-SAEPAEHSL--------RSYK 281 Query: 1226 ASNSMIRGD---LSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXX 1396 N+M R ++A E P+ +CDLSLRLG S P ++ W E DV Sbjct: 282 TGNAMSRFPSEFITAREEKLPDTECDLSLRLGVPSLPCVSSRKTWALETGDVAPSSSRER 341 Query: 1397 XXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVP 1552 D + + EF FFP + SCS+ SS+G G SE S++KRK P Sbjct: 342 HQFHDQTTYANEEFSFFPTRTEFERFGSCSNMWSSDGGGQISESSTKKRKEP 393 >ref|XP_002324862.2| hypothetical protein POPTR_0018s01770g [Populus trichocarpa] gi|550317816|gb|EEF03427.2| hypothetical protein POPTR_0018s01770g [Populus trichocarpa] Length = 448 Score = 283 bits (723), Expect = 2e-73 Identities = 191/433 (44%), Positives = 238/433 (54%), Gaps = 8/433 (1%) Frame = +2 Query: 356 KMPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVL 535 KMPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFR+V E H 
ATKKNKEW+EKLPVVVL Sbjct: 41 KMPRPGPRPYECVRRAWHSDRHQPIRGSLIQEIFRLVNEAHCPATKKNKEWQEKLPVVVL 100 Query: 536 KAEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTA 715 KAEEIMYSKANSEAEYMD++TLWDR NDAIN LLQPCIEAAL LGCT Sbjct: 101 KAEEIMYSKANSEAEYMDLKTLWDRANDAINTIIRRDESLETGELLQPCIEAALNLGCTP 160 Query: 716 RSASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQG--GSSHFLPHHSS--NPT--- 874 R ASRSQRN N R YLSP+TQE ++ P + V + + +SH L +S+ PT Sbjct: 161 RRASRSQRNCNLRFYLSPSTQESNTLSPAA-VHNAIRANHISNSHCLRDYSNLVKPTIMN 219 Query: 875 SSPKFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLAR 1054 S+P GSES + N+ + E + +L Sbjct: 220 SAPS------GSESQDLVGQGNDTSNR----FLFRSDNIPPSNVNRCLPLENYRIPSLCS 269 Query: 1055 VYPLYYGKLQQTTEPQFGF-QIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNAS 1231 VYPLYYG EPQ G +P + + +EP + +Q+ FP +E Sbjct: 270 VYPLYYGSC---LEPQRGCGALPKTFPGT---------IEPVKVVAVQNFFPCNEDTPVR 317 Query: 1232 NSMIRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCD 1411 S + G P EI+CDLSLRLG+I AP A T+ ++ +D G D Sbjct: 318 TSQV-GHKDCL--QPQEIECDLSLRLGSILAPVPRAKTKQIKDAKDGGHDCSQEGGKFDD 374 Query: 1412 LSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQK 1591 ++D E FFP+ +V DP S SSK + + + + +KRK+ V QF + Sbjct: 375 WMPQMDKELSFFPKVDVVDPQVSHSSK---SREHIIVDVTMKKRKLVFDHHVEDQQFLWQ 431 Query: 1592 PKFSQNICFSRMK 1630 PK N RMK Sbjct: 432 PKLPCNKLTGRMK 444 >ref|XP_007222724.1| hypothetical protein PRUPE_ppa006474mg [Prunus persica] gi|462419660|gb|EMJ23923.1| hypothetical protein PRUPE_ppa006474mg [Prunus persica] Length = 410 Score = 277 bits (708), Expect = 1e-71 Identities = 179/416 (43%), Positives = 225/416 (54%), Gaps = 3/416 (0%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPR GPRPYECVRRAWHS+RHQPMRGSLIKEIFRVV EIHSSAT+KNKEW++KLP+VVLK Sbjct: 1 MPRSGPRPYECVRRAWHSERHQPMRGSLIKEIFRVVNEIHSSATRKNKEWQDKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYMD++TLWDRTNDAIN LQPCIEAAL LGC R Sbjct: 61 
AEEIMYSKANSEAEYMDLKTLWDRTNDAINTIIRRDEGTETGDFLQPCIEAALNLGCIPR 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQ---GGSSHFLPHHSSNPTSSPKF 889 SRSQR+ N CYL P T + + P V +N +Q +S + PH + PK Sbjct: 121 RTSRSQRHANPSCYLIPITSDVPGISP--SVVENASQRDYTSNSQYRPHCPN--FVKPKS 176 Query: 890 MPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLY 1069 M LG ES P+ N+ TT E+ +SN + YPL+ Sbjct: 177 MTTQLGFESRFPVVQNNDCTT---MKFRIASENIPPSGYDQFSPRESMATSNFSS-YPLH 232 Query: 1070 YGKLQQTTEPQFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMIRG 1249 Y Q E + GF I L P V +EP + + +L + SN + Sbjct: 233 YRNFPQFEELKPGFVI-------LPKP-VSDPIEPAKMGVISNLLCNGD---KSNDNTQT 281 Query: 1250 DLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLRID 1429 D ENP + CDLSLRLG +S +S P EV+DVG + D Sbjct: 282 DTRDYTENPCTVGCDLSLRLGPLSTQHSIGENSQPEEVKDVGAQEGTMCSD--QSQPQFD 339 Query: 1430 NEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPK 1597 F + N P +S SS+ + EG+ +N + + RKRK + +F ++P+ Sbjct: 340 RRPSFIGKGNEYGPRDSYSSRLNFEGEYMNVQATMRKRKAAFNHPTGDTKFYRQPE 395 >ref|XP_002527188.1| conserved hypothetical protein [Ricinus communis] gi|223533453|gb|EEF35201.1| conserved hypothetical protein [Ricinus communis] Length = 373 Score = 276 bits (706), Expect = 2e-71 Identities = 165/377 (43%), Positives = 197/377 (52%), Gaps = 6/377 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQPMRGS+I +IFRVV E HS+ TKKNKEW+EKLP+VVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSVIHQIFRVVSETHSAITKKNKEWQEKLPIVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYM+ +TLWDR NDAIN LL PCIEAAL LGC Sbjct: 61 AEEIMYSKANSEAEYMNPDTLWDRVNDAINTIIRRDESNETGELLPPCIEAALNLGCIPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898 ASRSQR++N R YLSP EP VP R+ + P SS S F Sbjct: 121 RASRSQRHSNPRSYLSPRMHEP--VPAALRIVERANDKQCPQLSPPQSS---SQLNFARP 175 Query: 899 YLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXX----EAHPSSNLARVYPL 1066 S P++ +N T E + NL VYPL 
Sbjct: 176 TTAVNSTLPVSESNCHLTESSNIDASCSYPLLYDNISLGSSQLMSKEINKQLNLGSVYPL 235 Query: 1067 YYGKLQQTTEPQFGFQIP--HSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSM 1240 YYG +P Q+P +SN +GTP + EP E S D A ++ + Sbjct: 236 YYGNNYHIKQPHLASQVPEKNSNTIFVGTPISTSAAEPAEMSVFHDFLTCPSAEISAKRI 295 Query: 1241 IRGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSL 1420 + DL T E P ++CDLSLRLG + N +E EDVG + L Sbjct: 296 SQADLGNTHEKPSGVQCDLSLRLGLFTDLSVNMKRSLVQETEDVGSSNSQDRSKSSNFYL 355 Query: 1421 RIDNEFCFFPRDNVDDP 1471 + + E F P N +DP Sbjct: 356 QKNKELFFSPSRNTNDP 372 >ref|XP_002324645.2| hypothetical protein POPTR_0018s12880g [Populus trichocarpa] gi|550318626|gb|EEF03210.2| hypothetical protein POPTR_0018s12880g [Populus trichocarpa] Length = 395 Score = 270 bits (691), Expect = 1e-69 Identities = 170/416 (40%), Positives = 208/416 (50%), Gaps = 8/416 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRH+P+RGS+I +I R+ Y+ HS+ATK N+EW++KL +VV + Sbjct: 1 MPRPGPRPYECVRRAWHSDRHKPIRGSMIGQILRMAYDTHSAATKGNREWQDKLLLVVYR 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEY+ +TLWDR NDA+N LL PCIEAAL LGC Sbjct: 61 AEEIMYSKANSEAEYVSQDTLWDRVNDAVNTIIRRDESTETGDLLPPCIEAALNLGCKVE 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNP-----TSSP 883 ASRSQR+NN R YLSP TQEP SV P R D +P HS NP ++ Sbjct: 121 RASRSQRHNNPRSYLSPRTQEPASVAP--RAVDRTHDEQGPRLMPIHSINPLNFAARATT 178 Query: 884 KFMPYYLGSESCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYP 1063 P SES + ++N PH + H N VYP Sbjct: 179 IVNPNLPVSESSHRLAESSN-AAPPHSCPILYENIPPGSDQLTTKEADMH--QNFGSVYP 235 Query: 1064 LYYGKLQQTTEPQFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMI 1243 L+YG Q+ +E + D+ SN+++ Sbjct: 236 LFYGD--------------------------QYQIEAS------DMVSEVSTRMNSNTIL 263 Query: 1244 RG---DLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDL 1414 G D E P +CDLSLRLG S P + +E E VG Sbjct: 264 VGKPIDFRNIHEKPTGTQCDLSLRLGPCSDPCISTERNQAQENEIVGSSSSQERDKFSVF 323 Query: 
1415 SLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQF 1582 S + EFCFFP + DP ESC K +SEG N E + RKRK P SD V G F Sbjct: 324 SQHRNKEFCFFPSTSNRDPSESCPDKWASEGDVQNLEANIRKRKAPFSDNVEDGHF 379 >ref|XP_007013493.1| Uncharacterized protein TCM_038116 [Theobroma cacao] gi|508783856|gb|EOY31112.1| Uncharacterized protein TCM_038116 [Theobroma cacao] Length = 407 Score = 267 bits (683), Expect = 1e-68 Identities = 169/422 (40%), Positives = 215/422 (50%), Gaps = 5/422 (1%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHS+RHQP+RGS+I++I R+ + HS+ATKKNKEW++K+ V+ K Sbjct: 1 MPRPGPRPYECVRRAWHSERHQPIRGSIIQQILRLAIDTHSTATKKNKEWQDKILTVIFK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSE+EYM+ ETLWDR NDAIN LL PC+EAAL LGC Sbjct: 61 AEEIMYSKANSESEYMNPETLWDRVNDAINTIIRRDESTETGELLPPCVEAALNLGCHPV 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898 ASRSQR+ R YL+P QEP S P RV D +GG P+ P Sbjct: 121 RASRSQRHCIPRTYLTPRAQEPISAAP--RVLD---KGGEER-----------CPQLSPV 164 Query: 899 YLGSESCTPITHANN--PTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPSSNLARVYPLYY 1072 + GS+ T+ N+ + + E + NL +VYPLYY Sbjct: 165 HSGSQFTRIATNVNSNISVSQTNRHSYPFLSDNCPPGHDQLTRMETNTRPNLGQVYPLYY 224 Query: 1073 GKLQQTTEPQFGFQIPHSNAAS---LGTPHVQFSVEPTEKSFLQDLFPRDEAVNASNSMI 1243 G Q E Q G + + A+ +G P +P E LQ+LF + + Sbjct: 225 GIHYQNVESQTGSPVQENIASDNIIVGRPIGTSVAQPVEMGSLQNLFSSSDVDVGGKRIG 284 Query: 1244 RGDLSATFENPPEIKCDLSLRLGTISAPYSNAGTRWPREVEDVGXXXXXXXXXXCDLSLR 1423 + D+ T E +CDLSLRLG S P + E EDVG + + Sbjct: 285 QQDIRHTNEKSFGTECDLSLRLGLFSDPCMHVEKNSIGETEDVGPSSSQEGGKVNEAFQQ 344 Query: 1424 IDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRKVPSSDLVVAGQFSQKPKFS 1603 EFCFFP NV+D ES S K + +G N + RKRK QF +P S Sbjct: 345 KSKEFCFFPERNVNDHYESFSRKWIIDIEGRNLGATMRKRKATFGGNSEDEQFCWQPGPS 404 Query: 1604 QN 1609 N Sbjct: 405 SN 406 >ref|XP_002515508.1| conserved hypothetical protein [Ricinus communis] gi|223545452|gb|EEF46957.1| conserved hypothetical 
protein [Ricinus communis] Length = 391 Score = 263 bits (672), Expect = 2e-67 Identities = 175/412 (42%), Positives = 213/412 (51%), Gaps = 16/412 (3%) Frame = +2 Query: 359 MPRPGPRPYECVRRAWHSDRHQPMRGSLIKEIFRVVYEIHSSATKKNKEWKEKLPVVVLK 538 MPRPGPRPYECVRRAWHSDRHQP+RGSLI+EIFRVV E+HS ATKKNKEW+EKLPVVVLK Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPVRGSLIQEIFRVVNEVHSPATKKNKEWQEKLPVVVLK 60 Query: 539 AEEIMYSKANSEAEYMDVETLWDRTNDAINXXXXXXXXXXXXXLLQPCIEAALILGCTAR 718 AEEIMYSKANSEAEYMD++TLW+R ND IN LL PCIEAAL LGCT R Sbjct: 61 AEEIMYSKANSEAEYMDLKTLWERVNDVINTIVRRDESTETGDLLLPCIEAALNLGCTPR 120 Query: 719 SASRSQRNNNHRCYLSPNTQEPTSVPPPSRVPDNVTQGGSSHFLPHHSSNPTSSPKFMPY 898 SRSQRN N RCYL+P TQEP ++PP +P H+++ P ++ Sbjct: 121 RTSRSQRNCNPRCYLTPGTQEPNTLPPA---------------MPTHTTSLQCIPNYLDL 165 Query: 899 ---------YLGSE----SCTPITHANNPTTTPHXXXXXXXXXXXXXXXXXXXXXEAHPS 1039 +LGSE C I+ +N E +P Sbjct: 166 IKPAIVNSTHLGSELQNLVCQNISVTSN-------KFLLATDNGCLSNYNQSFPMENYPM 218 Query: 1040 SNLARVYPLYYGKLQQTTEPQFGFQIPHSNAASLGTPHVQFSVEPTEKSFLQDLFP--RD 1213 S+L VYPL YG + V ++EP + Q+LF D Sbjct: 219 SSLYSVYPLCYGLIP-----------------------VSSTLEPGKVGVEQNLFSFGDD 255 Query: 1214 EAVNASNSMIRGDLSATFENPPEIKCDLSLRLGTIS-APYSNAGTRWPREVEDVGXXXXX 1390 AV + + L N E CDLSLRLG++S AP + R ++ ED G Sbjct: 256 AAVKFNQPDPQSPL-----NQHETGCDLSLRLGSLSAAPLPSDKNRQLQDFEDNGHGSFQ 310 Query: 1391 XXXXXCDLSLRIDNEFCFFPRDNVDDPLESCSSKRSSEGQGLNSEGSSRKRK 1546 D+E F R N D+ L+SCSSK S + G +KRK Sbjct: 311 EGIKFKTQMQHTDDELHFVTRLNTDNSLDSCSSKLSEHA---SVNGIIKKRK 359