BLASTX nr result
ID: Akebia27_contig00026819
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00026819
         (888 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   192   2e-46
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   174   5e-41
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   165   2e-38
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   165   3e-38
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   162   1e-37
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   162   2e-37
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   160   5e-37
ref|XP_006470788.1| PREDICTED: uncharacterized protein LOC102629...   156   9e-36
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   156   9e-36
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   156   9e-36
ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma...   156   9e-36
ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma...   156   9e-36
ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma...   156   9e-36
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   155   2e-35
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   155   2e-35
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     152   2e-34
ref|XP_007022707.1| Uncharacterized protein TCM_033523 [Theobrom...   152   2e-34
ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phas...   151   4e-34
ref|XP_006299074.1| hypothetical protein CARUB_v10015214mg [Caps...   148   2e-33
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   144   4e-32

>ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
 gi|548856677|gb|ERN14505.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
          Length = 458

 Score = 192 bits (487), Expect = 2e-46
 Identities = 124/285 (43%), Positives = 159/285 (55%), Gaps = 5/285 (1%)
 Frame = +1

Query: 46  SFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 225
           SF+LEKAVCSHG FMMAPNLW S++TLQRP
Sbjct: 17  SFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQLSLSSQKSLQILVL 76

Query: 226 XXXXPL--DQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFEDMV 399
           DQQ+LL QVARMLR+SE D++ + +FH+++P AK GFGRVFRSPTLFEDMV
Sbjct: 77  GASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFGRVFRSPTLFEDMV 136

Query: 400 KCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPI 579
           K +LLCNCQW+RTL+MARALCELQL L +S + +++D + K + P+TP+
Sbjct: 137 KSILLCNCQWTRTLSMARALCELQLELNGNSLRQ-----SNKDTDFSK--SVNLSPVTPM 189

Query: 580 GRELK--RKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISVEE 753
           E K RK + I NL KFSENET L A+ + SK P+ + E
Sbjct: 190 QLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPT----MFSSE 245

Query: 754 DDSNGKRNSCQLLNDNNKVDACSISDRTLSEGRT-DFSYRIGDFP 885
           + NGK N Q+ K+ +I D L E +T F G+FP
Sbjct: 246 EGRNGKLNYDQV--SEEKLGDGAILDNQLLENKTLSFFLEAGNFP 288

>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum lycopersicum]
          Length = 483

 Score = 174 bits (440), Expect = 5e-41
 Identities = 120/302 (39%), Positives = 157/302 (51%), Gaps = 11/302 (3%)
 Frame = +1

Query: 16  LKLELGDSY-SSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 192
           L LE G+ Y +SFDLEKAVCSHGLFMMAPN WD +KTL+RP
Sbjct: 17  LPLEDGNGYCASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDDDHEQSVLV 76

Query: 193 XXXXXXXXXXXXXXXPLD--------QQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKN 348
           LD Q+ LLGQV RM+RLS + +K F +I EAK
Sbjct: 77  QITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQEICGEAKE 136

Query: 349 RGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQD 528
           RGFGRVFRSPTLFEDMVKCMLLCNCQWSRTL+MA ALCELQL L S + +Q+
Sbjct: 137 RGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQN 196

Query: 529 P-NCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCF 705
           + +E F P TP G+EL+++ NL + +E E ++ +
Sbjct: 197 QLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPGV------- 249

Query: 706 LSKEKPSPSFLISVEEDDSNGKRNSCQLLNDNNKVDACSISDRTLSEGRTDFSY-RIGDF 882
           +P+F + ++ K N CQ + +V + + SE R S+ ++G+F
Sbjct: 250 ----TVTPAFSVG---EEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNF 302

Query: 883 PS 888
           PS
Sbjct: 303 PS 304

>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
 gi|550342350|gb|EEE79091.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
          Length = 489

 Score = 165 bits (418), Expect = 2e-38
 Identities = 103/242 (42%), Positives = 125/242 (51%), Gaps = 20/242 (8%)
 Frame = +1

Query: 7   SCLLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXX 186
           S + ++ LGD+ +F+LEKAVCSHGLFMM+PN WDP + T RP
Sbjct: 16  SVVFEIPLGDAAETFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLRLSLSDSDPQVSTPT 75

Query: 187 XXXXXXXXXXXXXXXXX-----------PLDQQFLLGQVARMLRLSESDEMCIKEFHKIH 333
           P Q+ L+ QV RMLRLSE+DE +EF KI
Sbjct: 76  TSLFVSISHPPHLPRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLSETDERNAREFRKIA 135

Query: 334 PEAKNR-------GFG-RVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLK-S 486
           A GFG RVFRSPTLFEDMVKC+LLCNCQW RTL+MARALCELQ L+
Sbjct: 136 EAAAAEENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARALCELQCELQCK 195

Query: 487 DSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLE 666
           S ++ V + N F+P T G+E KR K+ NL K E ET LE
Sbjct: 196 SSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRASKVTKNLASKIVETETLLE 255

Query: 667 AE 672
           A+
Sbjct: 256 AD 257

>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score = 165 bits (417), Expect = 3e-38
 Identities = 95/210 (45%), Positives = 116/210 (55%), Gaps = 8/210 (3%)
 Frame = +1

Query: 49  FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 228
           FDLE AVCSHGLFMMAPN WDP+++ L RP
Sbjct: 37  FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 229 XXXP-------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLF 387
           LDQ +L QV RMLRL E D + EF +H A+ GFGR+FRSPTLF
Sbjct: 97  LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 388 EDMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKPNTEGFLP 567
           EDM+KC+LLCNCQW+RTL+M+ ALCELQL L+S S +TE F
Sbjct: 157 EDMIKCILLCNCQWTRTLSMSTALCELQLELRSSS------------------STENFQS 198

Query: 568 ITPIGRELKRKRSMKK-IPANLDCKFSENE 654
           TP RE KRKRS K+ + L+ KF+E++
Sbjct: 199 RTPPIRECKRKRSNKRNVRVKLETKFNEDK 228

>ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum tuberosum]
 gi|565385158|ref|XP_006358485.1| PREDICTED: uncharacterized protein LOC102593287 isoform X2 [Solanum tuberosum]
          Length = 485

 Score = 162 bits (411), Expect = 1e-37
 Identities = 100/214 (46%), Positives = 122/214 (57%), Gaps = 16/214 (7%)
 Frame = +1

Query: 7   SCLLKLELGDS-----YSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXX 171
           S +++L LGD ++FDLEKAVCSHGLFMMAPN WD +KTL+RP
Sbjct: 12  SVVVELPLGDGDGDGGCATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDD 71

Query: 172 XXXXXXXXXXXXXXXXXXXXXX--------PLDQQFLLGQVARMLRLSESDEMCIKEFHK 327
           + Q+ LLGQV RM+RLS + +K+F +
Sbjct: 72  HEQSVLVQINQPSDSPHSLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQE 131

Query: 328 IHPEAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLG 507
           I EAK+RG GRVFRSPTLFEDMVKCMLLCNCQWSRTL+MA ALCELQL L S
Sbjct: 132 ICGEAKDRGLGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASF 191

Query: 508 TEVTSQDPNCLKPNT---EGFLPITPIGRELKRK 600
           + +Q N LK T E F P TP G+E +++
Sbjct: 192 PDPDNQ--NQLKGVTFKSEHFTPRTPAGKESRKR 223

>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score = 162 bits (410), Expect = 2e-37
 Identities = 95/209 (45%), Positives = 114/209 (54%), Gaps = 7/209 (3%)
 Frame = +1

Query: 49  FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 228
           FDLE AVCSHGLFMMAPN WDP+++ L RP
Sbjct: 37  FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 229 XXXP------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFE 390
           DQ +L QV RMLRL E D EF +H A+ GFGR+FRSPTLFE
Sbjct: 97  LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156

Query: 391 DMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKPNTEGFLPI 570
           DMVKC+LLCNCQW+RTL+M+ ALCELQL L+S S +TE F
Sbjct: 157 DMVKCILLCNCQWTRTLSMSTALCELQLELRSSS------------------STENFQSR 198

Query: 571 TPIGRELKRKRSMKK-IPANLDCKFSENE 654
           TP RE KRKRS K+ + L+ KF+E++
Sbjct: 199 TPPIRECKRKRSNKRNVRVKLETKFNEDK 227

>ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica]
          Length = 461

 Score = 160 bits (406), Expect = 5e-37
 Identities = 97/231 (41%), Positives = 125/231 (54%), Gaps = 10/231 (4%)
 Frame = +1

Query: 49  FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 228
           FDL AVCSHGLFMMAPN WDP+ + L RP
Sbjct: 36  FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95

Query: 229 XXXP----LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFEDM 396
           LD+ ++L QV RMLRLSE D + EF +H A+ GFGR+FRSPTLFEDM
Sbjct: 96  EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155

Query: 397 VKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITP 576
           VKC+LLCNCQW+RTL+MA ALCE+QL LK S + E F TP
Sbjct: 156 VKCILLCNCQWTRTLSMATALCEIQLELKCSS------------------SVEDFQSRTP 197

Query: 577 IGRELKRKRSMKK-IPANLDCKFSENETK---LEAETTN--CHQQTTCFLS 711
           RE KRKRS ++ + L+ +F+E++ + + + T+N H +T +LS
Sbjct: 198 PIRERKRKRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLS 248

>ref|XP_006470788.1| PREDICTED: uncharacterized protein LOC102629917 isoform X3 [Citrus sinensis]
          Length = 382

 Score = 156 bits (395), Expect = 9e-36
 Identities = 116/316 (36%), Positives = 152/316 (48%), Gaps = 24/316 (7%)
 Frame = +1

Query: 13  LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 192
           LLKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP
Sbjct: 7   LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 193 XXXXXXXXXXXXXXXPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 330
           + Q LL QV RMLRLSE+DE ++EF +I
Sbjct: 64  VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123

Query: 331 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNL 480
           + A+ G GRVFRSPTLFEDMVKCMLLCNCQW RTL+MARALCELQ L
Sbjct: 124 VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183

Query: 481 KSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 660
           + +C +E F+P TP G+E KR++ + K+ + L + +E++
Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 661 LEAETTNCHQQTTCFLSKEKPSPSFLISVEEDDSNGKRNSCQLLNDNNKVDACSISDRTL 840
           E + N L +E PSF + E D +G LN+ + D S D
Sbjct: 228 SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275

Query: 841 SEGRTDFSYRIGDFPS 888
           RIG+FPS
Sbjct: 276 ---------RIGNFPS 282

>ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus sinensis]
          Length = 409

 Score = 156 bits (395), Expect = 9e-36
 Identities = 116/316 (36%), Positives = 152/316 (48%), Gaps = 24/316 (7%)
 Frame = +1

Query: 13  LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 192
           LLKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP
Sbjct: 7   LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 193 XXXXXXXXXXXXXXXPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 330
           + Q LL QV RMLRLSE+DE ++EF +I
Sbjct: 64  VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123

Query: 331 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNL 480
           + A+ G GRVFRSPTLFEDMVKCMLLCNCQW RTL+MARALCELQ L
Sbjct: 124 VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183

Query: 481 KSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 660
           + +C +E F+P TP G+E KR++ + K+ + L + +E++
Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 661 LEAETTNCHQQTTCFLSKEKPSPSFLISVEEDDSNGKRNSCQLLNDNNKVDACSISDRTL 840
           E + N L +E PSF + E D +G LN+ + D S D
Sbjct: 228 SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275

Query: 841 SEGRTDFSYRIGDFPS 888
           RIG+FPS
Sbjct: 276 ---------RIGNFPS 282

>ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus sinensis]
          Length = 454

 Score = 156 bits (395), Expect = 9e-36
 Identities = 116/316 (36%), Positives = 152/316 (48%), Gaps = 24/316 (7%)
 Frame = +1

Query: 13  LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 192
           LLKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP
Sbjct: 7   LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 193 XXXXXXXXXXXXXXXPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 330
           + Q LL QV RMLRLSE+DE ++EF +I
Sbjct: 64  VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123

Query: 331 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNL 480
           + A+ G GRVFRSPTLFEDMVKCMLLCNCQW RTL+MARALCELQ L
Sbjct: 124 VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183

Query: 481 KSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 660
           + +C +E F+P TP G+E KR++ + K+ + L + +E++
Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 661 LEAETTNCHQQTTCFLSKEKPSPSFLISVEEDDSNGKRNSCQLLNDNNKVDACSISDRTL 840
           E + N L +E PSF + E D +G LN+ + D S D
Sbjct: 228 SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275

Query: 841 SEGRTDFSYRIGDFPS 888
           RIG+FPS
Sbjct: 276 ---------RIGNFPS 282

>ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma cacao]
 gi|508778585|gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 406

 Score = 156 bits (395), Expect = 9e-36
 Identities = 103/237 (43%), Positives = 130/237 (54%), Gaps = 21/237 (8%)
 Frame = +1

Query: 1   SSSC---LLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXX 156
           SSSC L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP
Sbjct: 26  SSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDH 85

Query: 157 XXXXXXXXXXXXXXXXXXXXXXXXXXXPLDQQF---LLGQVARMLRLSESDEMCIKEFHK 327
           L Q LL QV+RMLRLSE +E ++EF K
Sbjct: 86  HSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRK 145

Query: 328 I----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLN 477
           I H E + R F GRVFRSPTLFEDMVKC+LLCNCQ+SRTL+MA+ALCELQ
Sbjct: 146 IVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFE 205

Query: 478 LKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSE 648
           + + G D F+P TP G ELKRK + K+ L+ KF+E
Sbjct: 206 TQR---PFSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKFAE 249

>ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma cacao]
 gi|508778584|gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 421

 Score = 156 bits (395), Expect = 9e-36
 Identities = 103/237 (43%), Positives = 130/237 (54%), Gaps = 21/237 (8%)
 Frame = +1

Query: 1   SSSC---LLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXX 156
           SSSC L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP
Sbjct: 41  SSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDH 100

Query: 157 XXXXXXXXXXXXXXXXXXXXXXXXXXXPLDQQF---LLGQVARMLRLSESDEMCIKEFHK 327
           L Q LL QV+RMLRLSE +E ++EF K
Sbjct: 101 HSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRK 160

Query: 328 I----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLN 477
           I H E + R F GRVFRSPTLFEDMVKC+LLCNCQ+SRTL+MA+ALCELQ
Sbjct: 161 IVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFE 220

Query: 478 LKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSE 648
           + + G D F+P TP G ELKRK + K+ L+ KF+E
Sbjct: 221 TQR---PFSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKFAE 264

>ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma cacao]
 gi|508778582|gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 467

 Score = 156 bits (395), Expect = 9e-36
 Identities = 103/237 (43%), Positives = 130/237 (54%), Gaps = 21/237 (8%)
 Frame = +1

Query: 1   SSSC---LLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXX 156
           SSSC L++L +G++ ++ F+LEKAVCSHGLFMMAPN WDP +++L RP
Sbjct: 41  SSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDH 100

Query: 157 XXXXXXXXXXXXXXXXXXXXXXXXXXXPLDQQF---LLGQVARMLRLSESDEMCIKEFHK 327
           L Q LL QV+RMLRLSE +E ++EF K
Sbjct: 101 HSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRK 160

Query: 328 I----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLN 477
           I H E + R F GRVFRSPTLFEDMVKC+LLCNCQ+SRTL+MA+ALCELQ
Sbjct: 161 IVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFE 220

Query: 478 LKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSE 648
           + + G D F+P TP G ELKRK + K+ L+ KF+E
Sbjct: 221 TQR---PFSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKFAE 264

>ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
 gi|557533482|gb|ESR44600.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
          Length = 454

 Score = 155 bits (393), Expect = 2e-35
 Identities = 115/316 (36%), Positives = 152/316 (48%), Gaps = 24/316 (7%)
 Frame = +1

Query: 13  LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 192
           +LKL L ++ F+LE AVCSHGLFMM+PN WDP +++L RP
Sbjct: 7   VLKLPLAET---FNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 193 XXXXXXXXXXXXXXXPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 330
           + Q LL QV RMLRLSE+DE +++F +I
Sbjct: 64  VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRI 123

Query: 331 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNL 480
           + A+ G GRVFRSPTLFEDMVKCMLLCNCQW RTL MARALCELQ L
Sbjct: 124 VRQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQWEL 183

Query: 481 KSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 660
           + +C +E F+P TP G+E KR++ + K+ + L + +E++
Sbjct: 184 Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 661 LEAETTNCHQQTTCFLSKEKPSPSFLISVEEDDSNGKRNSCQLLNDNNKVDACSISDRTL 840
           E + N T L +E PSF + E D +G LN+ + D S D
Sbjct: 228 SE-DDMNLKLDCTGAL-EENVQPSFPRNDIESDLHG-------LNELSTTDPPSACD--- 275

Query: 841 SEGRTDFSYRIGDFPS 888
           RIG+FPS
Sbjct: 276 ---------RIGNFPS 282

>ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 443

 Score = 155 bits (392), Expect = 2e-35
 Identities = 107/284 (37%), Positives = 141/284 (49%), Gaps = 2/284 (0%)
 Frame = +1

Query: 43  SSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXX 222
           S F LE+AVCSHGLFMM PN WDP +KTL RP
Sbjct: 22  SPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLRSSPSSFLVSLSQHSQSLAVRVHATHA 81

Query: 223 XXXXXPLDQQFLLGQVARMLRLSESDEMCIKEFHKIHP-EAKNRGF-GRVFRSPTLFEDM 396
           P Q + QV+RMLR SE++E ++EF +H + NR F GRVFRSPTLFEDM
Sbjct: 82  LS---PQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFEDM 138

Query: 397 VKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITP 576
           VKC+LLCNCQW RTL+MA+ALCELQL L++ G+ T K +EGF+P TP
Sbjct: 139 VKCILLCNCQWPRTLSMAQALCELQLELQN------GSPCTIAVSGNSKGESEGFIPKTP 192

Query: 577 IGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISVEED 756
           +E +R + K F + + +L+ H + + + L++ +
Sbjct: 193 ASKETRRNKVSTK------GMFCKKKLELDGNLQIDH------VVASSSTATTLLTTDNG 240

Query: 757 DSNGKRNSCQLLNDNNKVDACSISDRTLSEGRTDFSYRIGDFPS 888
           DS R+ D+C S G FS R G+FPS
Sbjct: 241 DSEELRSH----------DSC----HEFSNGNEYFS-RTGNFPS 269

>gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]
          Length = 472

 Score = 152 bits (384), Expect = 2e-34
 Identities = 113/308 (36%), Positives = 157/308 (50%), Gaps = 17/308 (5%)
 Frame = +1

Query: 16  LKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXX 195
           L+L LGD+ ++F LE AVCSHGLFMMAPN WDP +KTL RP
Sbjct: 5   LELPLGDAAATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDS 64

Query: 196 XXXXXXXXXXXXXXPL-------------DQQFLLGQVARMLRLSESDEMCIKEFHKIHP 336
           ++Q LL QV+RMLRLS+++E +EF +++
Sbjct: 65  VMARISQPHDRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVY- 123

Query: 337 EAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEV 516
           G GRVFRSPTLFEDMVKC+LLCNCQW RTL+MA+ALC+LQ L+ S
Sbjct: 124 -GCGSGLGRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQS-------- 174

Query: 517 TSQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF-SENETKLEAETTNCH-- 687
           + T F+P TP G+E KRK K L +F +++ LE+ + +
Sbjct: 175 -------VPSKTVDFVPKTPAGKEPKRKVEKLKASTCLTSQFDAQSNEGLESHSNDLSID 227

Query: 688 -QQTTCFLSKEKPSPSFLISVEEDDSNGKRNSCQLLNDNNKVDACSISDRTLSEGRTDFS 864
           Q T S + SPS L+SV ++ +C+ ++ VD+ S+ + + R +F
Sbjct: 228 ISQPT--PSAQNLSPSSLLSVPMENV-----TCE---ESYGVDSASLCNPQILRDR-EFE 276

Query: 865 YRIGDFPS 888
           GDFP+
Sbjct: 277 -GTGDFPT 283

>ref|XP_007022707.1| Uncharacterized protein TCM_033523 [Theobroma cacao]
 gi|508722335|gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]
          Length = 374

 Score = 152 bits (383), Expect = 2e-34
 Identities = 78/160 (48%), Positives = 100/160 (62%), Gaps = 3/160 (1%)
 Frame = +1

Query: 16  LKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXX 195
           L++ LG+ SSF++EKAVC+HGLFMM+PN+W PSTK+L+RP
Sbjct: 7   LQVALGECSSSFNMEKAVCNHGLFMMSPNVWIPSTKSLRRPLRLADSSGSVYVTISHPAP 66

Query: 196 XXXXXXXXXXXXXXPL---DQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRV 366
           D+ ++ QVARMLR+S DE ++EF +H AK+RGFGR+
Sbjct: 67  NHPFLVIQVNGLQNSISSADKAVIMEQVARMLRISSKDERDVREFQTLHGSAKDRGFGRI 126

Query: 367 FRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLKS 486
           FRSP+ FED VK +LLCNC W RTLTMARALC LQL L S
Sbjct: 127 FRSPSFFEDAVKSILLCNCGWKRTLTMARALCALQLQLAS 166

>ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
 gi|561020766|gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
          Length = 474

 Score = 151 bits (381), Expect = 4e-34
 Identities = 87/203 (42%), Positives = 111/203 (54%), Gaps = 5/203 (2%)
 Frame = +1

Query: 22  LELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXX 201
           +EL F L++AVCSHG FMMAPN WDP +KTL RP
Sbjct: 37  MELPSETEPFQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQR 96

Query: 202 XXXXXXXXXXXX---PLDQQFLLGQVARMLRLSESDEMCIKEFHKIHP-EAKNRGFG-RV 366
           P Q+ + Q+ RMLRLSE++E ++EF +H + NR FG RV
Sbjct: 97  PQSLAVRVHSVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRV 156

Query: 367 FRSPTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKP 546
           FRSPTLFEDMVKC+LLCNCQW RTL+MA+ALCELQ L++ G + K
Sbjct: 157 FRSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQSGLQN------GLPCAVEGSGNPKV 210

Query: 547 NTEGFLPITPIGRELKRKRSMKK 615
           E F+P TP +E +RK++ K
Sbjct: 211 EAEEFVPKTPASKENRRKKAPTK 233

>ref|XP_006299074.1| hypothetical protein CARUB_v10015214mg [Capsella rubella]
 gi|482567783|gb|EOA31972.1| hypothetical protein CARUB_v10015214mg [Capsella rubella]
          Length = 350

 Score = 148 bits (374), Expect = 2e-33
 Identities = 74/163 (45%), Positives = 97/163 (59%)
 Frame = +1

Query: 16  LKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXX 195
           L+L LG+ +FD+EKAVC+HG FMMAPN+W+PSTK+L RP
Sbjct: 3   LRLHLGEKKGTFDMEKAVCNHGFFMMAPNVWNPSTKSLHRPLTLSDSSSTDVTISHPSGL 62

Query: 196 XXXXXXXXXXXXXXPLDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRS 375
           +D++ +L QV RMLRLS+ DE + EF ++H A+ GFGR+FRS
Sbjct: 63  SFLVIQVHAINNVSRVDEELILKQVERMLRLSDKDERDMFEFQQVHEAARESGFGRIFRS 122

Query: 376 PTLFEDMVKCMLLCNCQWSRTLTMARALCELQLNLKSDSFKYL 504
           P+LFEDMVK +LLCN W +TL MA LC+LQ L + K L
Sbjct: 123 PSLFEDMVKSILLCNADWGKTLLMASRLCQLQSKLADGTVKPL 165

>dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 501

 Score = 144 bits (364), Expect = 4e-32
 Identities = 95/252 (37%), Positives = 116/252 (46%), Gaps = 50/252 (19%)
 Frame = +1

Query: 49  FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 228
           FDLE AVCSHGLFMMAPN WDP+++ L RP
Sbjct: 37  FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 229 XXXP-------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLF 387
           LDQ +L QV RMLRL E D + EF +H A+ GFGR+FRSPTLF
Sbjct: 97  LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 388 EDMVKCMLLCNCQ------------------------------------------WSRTL 441
           EDM+KC+LLCNCQ W+RTL
Sbjct: 157 EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216

Query: 442 TMARALCELQLNLKSDSFKYLGTEVTSQDPNCLKPNTEGFLPITPIGRELKRKRSMKK-I 618
           +M+ ALCELQL L+S S +TE F TP RE KRKRS K+ +
Sbjct: 217 SMSTALCELQLELRSSS------------------STENFQSRTPPIRECKRKRSNKRNV 258

Query: 619 PANLDCKFSENE 654
           L+ KF+E++
Sbjct: 259 RVKLETKFNEDK 270