BLASTX nr result
ID: Akebia26_contig00022724
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00022724
         (928 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]      190   8e-46
ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prun...  175   3e-41
emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera]  159   1e-36
ref|XP_002530358.1| conserved hypothetical protein [Ricinus comm...  157   6e-36
ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797...  149   1e-33
ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205...  145   2e-32
ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613...  142   2e-31
ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305...  142   2e-31
ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phas...  141   4e-31
ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citr...  140   9e-31
ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago ...  138   4e-30
ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cuc...  137   6e-30
ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [A...  136   1e-29
ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Popu...  135   2e-29
ref|XP_007026747.1| TATA box-binding protein-associated factor R...  132   2e-28
ref|XP_006837237.1| hypothetical protein AMTR_s00602p00003840 [A...  127   8e-27
ref|XP_006841229.1| hypothetical protein AMTR_s00135p00060200 [A...  119   2e-24
ref|XP_006299498.1| hypothetical protein CARUB_v10015667mg [Caps...  115   2e-23
ref|XP_006406618.1| hypothetical protein EUTSA_v10020051mg [Eutr...
114 4e-23 ref|XP_002885248.1| hypothetical protein ARALYDRAFT_479330 [Arab... 114 6e-23 >gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis] Length = 1000 Score = 190 bits (482), Expect = 8e-46 Identities = 117/286 (40%), Positives = 160/286 (55%), Gaps = 7/286 (2%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPF-GXXXXXXXXXXXXXXXXXXXXXXXXXXX 694 MNFSEEWKSL+ IS+V PLLLSGPSA+ G Sbjct: 1 MNFSEEWKSLFPISAVFKSPLLLSGPSARTILGPLVFNPKESTITCLFSSPSLLPPFTPL 60 Query: 693 XXFCLHKALRNFTRPSKDPFILPSVISSITADFDS---QSDETSPIVNNNLQILRCPNND 523 + L S D LPS SSI + F Q D S +N LQ+L CP D Sbjct: 61 PRLSFPRFLLT---SSDDSSQLPSTSSSIASVFGPHHYQDDVASAFSHNRLQLLHCPRTD 117 Query: 522 -ILLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSG 346 ++FF TG+N++ VGF+ LS+K S + VD+ F D G + +IL+I + V DSG Sbjct: 118 KFIVFFPTGDNANQVGFMLLSIKNSCLDVRVDDNGEAFMVDCGSNHQILRISINPVVDSG 177 Query: 345 LGFSSMSGNSS-TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHT 172 ++ GNSS T+G+LLA T+YSV+W+ +E+ LG +L P L +GTK F + +VH Sbjct: 178 SALLALGGNSSGTIGYLLASTMYSVHWYVIEVKELGLNLH-PSLTCVGTKVFKTCCIVHA 236 Query: 171 CWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 CW+PH+ EES++LL++G LFLFDL++C L KGT++ VSW Sbjct: 237 CWSPHILEESIILLESGALFLFDLESCLKTNTLSPHFKGTRLKVSW 282 >ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica] gi|462415669|gb|EMJ20406.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica] Length = 925 Score = 175 bits (443), Expect = 3e-41 Identities = 118/291 (40%), Positives = 161/291 (55%), Gaps = 13/291 (4%) Frame = -2 Query: 861 SEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFC 682 +EEWKSL+ ISSV PPLLLS PS KP Sbjct: 8 TEEWKSLFPISSVFKPPLLLSNPSLKPI--LGPLIFNPKPNSTTLLFSSSSSLLAPLPPL 65 Query: 681 LHKALRNF--TRPSKDPFILPSVISSITA---DFDSQSDETSPIVNNNLQILRCPN-NDI 520 H +L F T PS D LPS + S+ + +SD +S ++ N L+ L+CP N + Sbjct: 66 PHLSLPRFLLTSPS-DSAPLPSSVPSVASFLGPHHPKSDVSSSLLYNRLEFLQCPQINTV 124 Query: 519 LLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 340 ++FF TGENSD 
VGF++L LKGS + VD VF + RI +I V + G Sbjct: 125 VVFFPTGENSDQVGFLQLVLKGSTFDVKVDENGGVFASRRWFSYRISRISVNPIP----G 180 Query: 339 FSSMSGNSS--TVGFLLACTLYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNSS-VVHT 172 FSS+ GN S T+G+LLA T+YSV+WF V++ + G + + + LV+LG+K F + VVH Sbjct: 181 FSSLRGNGSCVTIGYLLASTMYSVHWFIVKVGDFGPNSDSRVSLVHLGSKIFKTCCVVHA 240 Query: 171 CWNPHVPEESMVLLDNGELFLFDLDA---CSGVEKLPVKLKGTKVGVSWAI 28 CW+PH+ EES+VLL+NG+LFLFDLD+ K GT++ V W I Sbjct: 241 CWSPHLLEESVVLLENGDLFLFDLDSRLKTPHTLNANFKFNGTRLKVPWDI 291 >emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera] Length = 865 Score = 159 bits (402), Expect = 1e-36 Identities = 106/281 (37%), Positives = 147/281 (52%), Gaps = 2/281 (0%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSG-PSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXX 694 M+FSEEWKS+W ISSV +PPLL+S PS P Sbjct: 1 MDFSEEWKSIWPISSVFTPPLLISSKPSLGPL---------------------------- 32 Query: 693 XXFCLHKALRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILL 514 F PS P L + S + F +S ++++ L +LRCPN +L Sbjct: 33 -----------FFNPS--PNTLTPLFSKPSFSFPPHLPRSS-LLHDRLHLLRCPNAAVLA 78 Query: 513 FFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFS 334 F TG NSD +GF+ LS+K S + D +VF + L+ RI++IL + +G+ Sbjct: 79 LFPTGVNSDQIGFLLLSVKDSCLDVRADRNGDVFVSKKRLNHRIVQILA-----TPIGY- 132 Query: 333 SMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPH 157 S SGN +VG +LACT+YSV+WF V N+ S+ P L+ LG K F S +VV CW+PH Sbjct: 133 SFSGNPDSVGLVLACTMYSVHWFSVRNDNIDSE---PGLIYLGGKVFKSCAVVSACWSPH 189 Query: 156 VPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 + EE +VLL++GELFLFDLD C KG ++ + W Sbjct: 190 LSEECLVLLESGELFLFDLDYCCSNS----NFKGNRLKIMW 226 >ref|XP_002530358.1| conserved hypothetical protein [Ricinus communis] gi|223530105|gb|EEF32019.1| conserved hypothetical protein [Ricinus communis] Length = 912 Score = 157 bits (397), Expect = 6e-36 Identities = 110/283 (38%), Positives = 152/283 (53%), Gaps = 4/283 (1%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M+ SEEWKSL+ I SV PLLLS 
P++K Sbjct: 1 MDLSEEWKSLFPIGSVFDAPLLLSSPTSKSILGPLFFNPNRKTLTQLYKSPSLFPPLLNP 60 Query: 690 XFCLHKALRNFTRPSKDPFILPSVISSITADFDSQSDETSP--IVNNNLQILRCPN-NDI 520 L + T + D I S SSIT+ SQ + S + +N LQ L CP+ N + Sbjct: 61 PPRLSLSRFLTTSTTFDSPIPLSTASSITSRLGSQFHDNSASLLAHNQLQFLNCPHDNSV 120 Query: 519 LLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 340 ++FF TG N D VGF+ LS+ + + D+ VF A+ L+ RI+KILV V DSG Sbjct: 121 IVFFSTGCNHDQVGFLLLSVNDKRLCAVGDSRGGVFVANKCLNQRIVKILVNPVVDSG-- 178 Query: 339 FSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWN 163 + + +S VG+LL TL+SV+WF V+I + E+PIL ++G K F S S+V CW+ Sbjct: 179 YFEGNASSKIVGYLLVYTLFSVHWFCVKIGEIN---ERPILGHVGCKTFKSCSIVDACWS 235 Query: 162 PHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 PH+ EES+VLL+NG LFLFDL++ S +GTK+ V W Sbjct: 236 PHLIEESVVLLENGGLFLFDLNSDSS----NAYFRGTKLKVLW 274 >ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797045 isoform X1 [Glycine max] gi|571481421|ref|XP_006588649.1| PREDICTED: uncharacterized protein LOC100797045 isoform X2 [Glycine max] Length = 894 Score = 149 bits (377), Expect = 1e-33 Identities = 105/287 (36%), Positives = 146/287 (50%), Gaps = 8/287 (2%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M SEEWKS + + PLLLS + P G Sbjct: 1 MELSEEWKSFFPTGASTVSPLLLSRSHSLPLGPLLFNPNPNSLSVLFSSPSLVP------ 54 Query: 690 XFCLHKALR----NFTRPSKDPFILPSVISSITA--DFDSQSDETSPIVNNNLQILRCPN 529 CLH F S ILPS SS+ + F +Q+D S + N L +L PN Sbjct: 55 --CLHLPPHLFPSRFLLTSHPHSILPSTASSVASLFSFPNQNDAASLFLRNRLHLLYYPN 112 Query: 528 N-DILLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTD 352 + ++FF TG N D +GF L++K S+ I++D+ +VF A G RIL I V V D Sbjct: 113 RPNAVVFFPTGANDDKLGFFILAVKDSRLDILLDSNGDVFRASTGSAHRILNISVNPVAD 172 Query: 351 SGLGFSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVH 175 SGL S +G+LLA LYSV+WF V+ +++ L++P + LG K F + VVH Sbjct: 173 SGL-----FNESHVIGYLLASALYSVHWFAVKHNSV---LDRPSVFYLGGKTFKTCPVVH 224 Query: 174 TCWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 
34 CW+PH+ EES+VLL+NG+LFLFDL++ + KGT++ V W Sbjct: 225 ACWSPHILEESLVLLENGQLFLFDLES---HDTTGAAFKGTRLKVPW 268 >ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205354 [Cucumis sativus] Length = 907 Score = 145 bits (366), Expect = 2e-32 Identities = 104/284 (36%), Positives = 148/284 (52%), Gaps = 9/284 (3%) Frame = -2 Query: 858 EEWKSLWSISSVHSPPLLLSGPSAK-PFGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFC 682 EEWKSL+ I +V PLL+SG S K G Sbjct: 4 EEWKSLFPIGTVFKSPLLISGSSVKNSIGPLVFNPVPTSLTRLFSSQSLLPSLSPPSVLN 63 Query: 681 LHKALRNFTRPSKDPFILPSVISSITADFDSQ---SDETSPIVNNNLQILRCPNND-ILL 514 L + L + ++PS SS+ + F Q SD S + N LQ L CPN+ +++ Sbjct: 64 LPRFLLTSSS------VVPSTSSSVASLFGEQQCCSDPPSVLRYNRLQCLPCPNSSSVVV 117 Query: 513 FFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFS 334 FF TG NSD VGF+ +S GS + D ++VF+ + L+ +I I V + GF Sbjct: 118 FFPTGPNSDHVGFLVVSSNGSGLDVQSDCSNDVFSVESELNYQIFGIAV----NPNSGF- 172 Query: 333 SMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHTCWNP 160 + + +GFLLA T+YSV WF V+ +GS + + LV++G+K F + SVVH CWNP Sbjct: 173 -VDDSYEDIGFLLAYTMYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNP 231 Query: 159 HVPEESMVLLDNGELFLFDLDACSGVE--KLPVKLKGTKVGVSW 34 H+ EES+VLL++G LFLFD++ + V LKG K+ VSW Sbjct: 232 HLSEESVVLLEDGSLFLFDMEPLLKTKDYNANVNLKGIKLKVSW 275 >ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613824 [Citrus sinensis] Length = 910 Score = 142 bits (358), Expect = 2e-31 Identities = 87/216 (40%), Positives = 126/216 (58%), Gaps = 9/216 (4%) Frame = -2 Query: 633 ILPSVISSITADFDSQSDETSPIVN------NNLQILRCP-NNDILLFFLTGENSDFVGF 475 +LPS +SI + FD P + N L++L CP NN + FF TG+N+D +GF Sbjct: 75 LLPSTSTSIASQFDDVGTHQHPNGSLSDQDYNRLRLLYCPLNNTAIAFFPTGDNNDQLGF 134 Query: 474 VKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNS-STVGFL 298 + +S KGS+ ++ D D VFT + L+ RI ILV V + +S+ GNS VG+L Sbjct: 135 LVISAKGSRFDVLSDEDDAVFTVVNRLNGRIRGILVNPVEEF---YSAFQGNSLVNVGYL 191 Query: 297 LACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESMVLLDNG 121 LA T+YSV+WF V++S KP++ LG K F + SVV 
CW+PH+PEES+VLL +G Sbjct: 192 LAFTMYSVHWFSVKVSKASESTIKPVVSYLGFKLFKTCSVVGACWSPHLPEESVVLLQSG 251 Query: 120 ELFLFDLDACSGVEKLPVKLKGTKVGVSWAISQLQN 13 +LF+FD++ KG ++ VSW L + Sbjct: 252 DLFMFDVNGRES--------KGKRLRVSWTDDDLSS 279 >ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305856 [Fragaria vesca subsp. vesca] Length = 914 Score = 142 bits (357), Expect = 2e-31 Identities = 109/284 (38%), Positives = 147/284 (51%), Gaps = 9/284 (3%) Frame = -2 Query: 858 EEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFCL 679 EEWKSL+ ISSV PPLL+S PS G L Sbjct: 6 EEWKSLFPISSVFKPPLLISNPSI--LGPLIFNPKANSTTLLFSSPTLLPPLTPLPHLSL 63 Query: 678 HKALRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVN---NNLQILRCPN-NDILLF 511 + L + P P LPS SSI A F + +++ N L+ L+CP N IL+F Sbjct: 64 PRFLST-SSPESAP--LPSTSSSI-APFLGPHQYKNDLLSSFRNRLEFLQCPKTNTILIF 119 Query: 510 FLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRI---LKILVISVTDSGLG 340 F TGENSD VG ++L LK S + V GL R +IL ISV Sbjct: 120 FPTGENSDQVGLLELVLKDSTFDVKVG----------GLSTRCQFKYQILRISVNPLP-S 168 Query: 339 FSSMSGNSS-TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHTCW 166 S+++GN T+G++LA T+YSV+WF V++ + GS+ + LV +G + F + VVH CW Sbjct: 169 LSNLTGNGPVTIGYVLASTMYSVHWFIVKLGDFGSNSDSIRLVYVGDRVFKACCVVHACW 228 Query: 165 NPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 +PHVPEES+VLL+NG LFLFDL++ KGT++ V W Sbjct: 229 SPHVPEESVVLLENGALFLFDLESRLRNTISNANFKGTRLKVLW 272 >ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|593199831|ref|XP_007132390.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|593199873|ref|XP_007132391.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|561005389|gb|ESW04383.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|561005390|gb|ESW04384.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|561005391|gb|ESW04385.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] Length = 894 Score = 141 bits (355), Expect = 4e-31 Identities = 101/285 (35%), Positives 
= 134/285 (47%), Gaps = 6/285 (2%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M SEEWKS + + S PLLLS + P G Sbjct: 1 MELSEEWKSFFPVGSSTVAPLLLSNSPSLPLGPLLFNPNPNSLSLLFSSPSLLPSLYCPP 60 Query: 690 XFCLHKALRNFTRPSKDPFILPSVISSITADFDS--QSDETSPIVNNNLQILRCPNND-I 520 + F S P ILPS SSI + F S Q+D P ++N L +L P+ Sbjct: 61 ----YLLPSRFLLSSHPPSILPSTASSIASLFSSTHQNDAAPPFLHNRLHLLTYPHRPYA 116 Query: 519 LLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 340 LL F G N + F L K S+ +D +VF A G RIL I V V D G Sbjct: 117 LLLFPAGSNDHKLAFFTLRFKDSRFHTQLDTKGDVFYASTGSSHRILNISVNPVADFGFT 176 Query: 339 FSSMSGN--SSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTC 169 S + S +G+LLA TLYSV+WF ++ L++P +V LG K F + V H C Sbjct: 177 GSDDEDDDASRVIGYLLATTLYSVHWF---VARHNQILDRPSVVCLGDKMFKTCPVAHAC 233 Query: 168 WNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 W+PH+ EES+VLL++G+LFLFDL+ C KGT++ V W Sbjct: 234 WSPHILEESVVLLESGQLFLFDLECCGA----GAGFKGTRLKVPW 274 >ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citrus clementina] gi|557533804|gb|ESR44922.1| hypothetical protein CICLE_v10000213mg [Citrus clementina] Length = 910 Score = 140 bits (352), Expect = 9e-31 Identities = 102/302 (33%), Positives = 146/302 (48%), Gaps = 16/302 (5%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M+F+EE KS + I PPLL S S + Sbjct: 1 MDFTEELKSQFPIGKFLKPPLLQSSESIQ------------GPLFFNPNPETLTLLSSSK 48 Query: 690 XFCLHKALRNFTRPSKDPFI-------LPSVISSITADFDSQSDETSPIVN------NNL 550 C H R + F+ LPS +SI + F P + N L Sbjct: 49 TLCPHSLFSPLPRLTLSRFLSTSSSSLLPSTSTSIASQFGDVGTHQHPDGSLSDQDYNRL 108 Query: 549 QILRCP-NNDILLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKI 373 ++L CP NN + FF TG+N+D +GF+ +S KGS+ ++ D D +F + L+ RI I Sbjct: 109 RLLYCPLNNTAIAFFPTGDNNDQLGFLVISAKGSRFDVLSDEDDAIFMVLNRLNGRIRGI 168 Query: 372 LVISVTDSGLGFSSMSGNSST-VGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKK 196 LV V + S+ GNS VG+LLA T+YSV+WF V++S KP++ LG K Sbjct: 169 
LVNPVEEFD---SAFQGNSLVNVGYLLAFTMYSVHWFSVKVSKASESTTKPVVSYLGFKL 225 Query: 195 FNS-SVVHTCWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWAISQL 19 F + SVV CW+PH+PEES+VLL +G+LF+FD++A KG ++ VSW L Sbjct: 226 FKTCSVVGACWSPHLPEESVVLLQSGDLFMFDVNARES--------KGKRLRVSWTDDDL 277 Query: 18 QN 13 + Sbjct: 278 SS 279 >ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago truncatula] gi|355489812|gb|AES71015.1| hypothetical protein MTR_3g069120 [Medicago truncatula] Length = 884 Score = 138 bits (347), Expect = 4e-30 Identities = 103/286 (36%), Positives = 142/286 (49%), Gaps = 7/286 (2%) Frame = -2 Query: 870 MNFSEEWKSLWSI-SSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXX 694 M FSEEWKSL+ I +S S LL S P + Sbjct: 1 MEFSEEWKSLFPIGASTVSNLLLHSDPDS---------LGPLFFNPNSNSPTPIFSSTIP 51 Query: 693 XXFCLHKALRNFTRPSKDPFILPSVISSITADFDS----QSDETSPIVNNNLQILRCPNN 526 H L + DP ILPS S+I FDS D S ++N +Q+L+CPN Sbjct: 52 SLHLPHNLLTERYLLTSDPSILPSTASTIAHLFDSTPELDDDNVSHFLHNRIQLLKCPNT 111 Query: 525 D-ILLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDS 349 ++ F TG N + +GF L +K S + +D +VF A G RIL++ V VT+ Sbjct: 112 PKAVVIFPTGANDETIGFFMLGVKDSLLETRLDVKGDVFRASTGSSSRILRMSVNPVTED 171 Query: 348 GLGFSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKK-FNSSVVHT 172 S +S +G++LA + YSV WF V+ NL SD P + LG K F +VV Sbjct: 172 ----DSEPDSSPVIGYVLASSRYSVCWFDVK-HNLSSD--SPSMSYLGRSKVFKEAVVRA 224 Query: 171 CWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 CW+PH+ EESMVLL++G+LFLFD+DA ++ KGT++ V W Sbjct: 225 CWSPHILEESMVLLESGQLFLFDVDAQGSMK----TFKGTRLRVPW 266 >ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cucumis sativus] Length = 862 Score = 137 bits (345), Expect = 6e-30 Identities = 86/208 (41%), Positives = 125/208 (60%), Gaps = 8/208 (3%) Frame = -2 Query: 633 ILPSVISSITADFDSQ---SDETSPIVNNNLQILRCPNND-ILLFFLTGENSDFVGFVKL 466 ++PS SS+ + F Q SD S + N LQ L CPN+ +++FF TG NSD VGF+ + Sbjct: 69 VVPSTSSSVASLFGEQQCYSDPPSVLRYNRLQCLPCPNSSSVVVFFPTGPNSDHVGFLVV 128 Query: 465 
SLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACT 286 S GS + D ++VF+ + L+ +I I V + GF + + +GFLLA T Sbjct: 129 SSNGSGLDVQSDCSNDVFSVESELNYQIFGIAV----NPNSGF--VDDSYEDIGFLLAYT 182 Query: 285 LYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHTCWNPHVPEESMVLLDNGELF 112 +YSV WF V+ +GS + + LV++G+K F + SVVH CWNPH+ EES+VLL++G LF Sbjct: 183 MYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPHLSEESVVLLEDGSLF 242 Query: 111 LFDLDACSGVE--KLPVKLKGTKVGVSW 34 LFD++ + V LKG K+ VSW Sbjct: 243 LFDMEPLLKTKDYNANVNLKGIKLKVSW 270 >ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [Amborella trichopoda] gi|548846551|gb|ERN05827.1| hypothetical protein AMTR_s00006p00260920 [Amborella trichopoda] Length = 929 Score = 136 bits (342), Expect = 1e-29 Identities = 98/282 (34%), Positives = 131/282 (46%), Gaps = 7/282 (2%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M+FSE+WKS + + SV S P L++G SA G Sbjct: 1 MDFSEDWKSQFPVGSVFSCPRLITGESAHSLGPLCFSPINPATHFLSLANTPVCYSPPPT 60 Query: 690 XFCLHKALRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILLF 511 + F R S D FI +I S T + N L +L C N + LL Sbjct: 61 AQDVFSTADWFYRRSDDDFIPFPLIFSTTKSAAGKHSSRH-FFGNPLHLLTCRNGEFLLL 119 Query: 510 FLTGENSDFVGFVKLSLKGSKPKIMVDNG-------DNVFTADHGLDCRILKILVISVTD 352 F +GENSD + V G + + DNG D+VF RI+++ VIS D Sbjct: 120 FPSGENSDRLACVV----GRRER---DNGGGFSLVKDSVFLLSPSFKNRIIRVSVISTAD 172 Query: 351 SGLGFSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVVHT 172 SS + T GF+L C+ Y V+W RV + N P+ NL + F + V H Sbjct: 173 CAS--SSEVCDQFTEGFVLLCSHYEVHWLRVGVRN-----STPLSQNLASATFKNQVAHA 225 Query: 171 CWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKV 46 CW+P++PEES VLL NGEL L+DL+ C GV+ LPVK KG V Sbjct: 226 CWSPYLPEESAVLLVNGELRLYDLNYCVGVKNLPVKFKGELV 267 >ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa] gi|222858389|gb|EEE95936.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa] Length = 906 Score = 135 bits (341), Expect = 2e-29 Identities = 101/289 (34%), Positives = 
148/289 (51%), Gaps = 10/289 (3%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 + FS+EWKS + I +V PLLLS +++ Sbjct: 4 IEFSQEWKSGFPIDTVSKAPLLLSKQTSESLIGPLVFNPIPESLAHLFTSPALSPPLLNP 63 Query: 690 XFCLHKALRNFTRPSK--DPFILPSVISSITADFDSQSDE-TSPIVN-NNLQILRCPNND 523 H +L F S D + S SSI F Q +SP++ N LQ L+CP++D Sbjct: 64 PP--HLSLTRFISTSTLADSPLPLSTASSIAFSFGPQDLHFSSPLLAYNRLQFLKCPHDD 121 Query: 522 -ILLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSG 346 +++FF TG N D VGF+ LS+K D +FTA L +I+++LV + D Sbjct: 122 TVVVFFSTGTNLDRVGFLLLSVKDKSLVATGDQKGGIFTASKSLGSKIVRVLVNPIEDD- 180 Query: 345 LGFSSMSGN---SSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVV 178 S ++GN S + G+LL T+YSVNWF V+ S +++P+L LG K F S + Sbjct: 181 ---SFLNGNYSFSGSFGYLLVYTMYSVNWFCVKYSE---SMKRPVLSYLGCKNFKSCGIA 234 Query: 177 HTCWNPHVPEESMVLLDNGELFLFDLDA-CSGVEKLPVKLKGTKVGVSW 34 CW+P++ +S+VLL+NG LFLFDL+A CS + +GTK+ VSW Sbjct: 235 SACWSPYIKVQSVVLLENGTLFLFDLEADCS-----DMYFRGTKLKVSW 278 >ref|XP_007026747.1| TATA box-binding protein-associated factor RNA polymerase I subunit C, putative [Theobroma cacao] gi|508715352|gb|EOY07249.1| TATA box-binding protein-associated factor RNA polymerase I subunit C, putative [Theobroma cacao] Length = 910 Score = 132 bits (332), Expect = 2e-28 Identities = 94/283 (33%), Positives = 138/283 (48%), Gaps = 4/283 (1%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M SEEWKS + I PPLLLS S P Sbjct: 1 MELSEEWKSYFPIGKSLDPPLLLSSASPGPLFFIPKPRTLPKTLFSSPSLFPPLHPPPSR 60 Query: 690 XFCLHKALRNFTRPSKDPFILPSVISSITA--DFDSQSDETSPIVNNNLQILRCPNNDI- 520 + F S P+ S I+S F + +S + +N L +L CP+ +I Sbjct: 61 L-----SFSRFLSTSSVPYSASSSIASRFGLESFYDDAASSSFLSHNRLHLLHCPDQNIA 115 Query: 519 LLFFLTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 340 ++FF TG N D +GF + ++ + K + D ++ + + + +IL+ILV V D Sbjct: 116 VVFFTTGANHDRIGFFAVHVQDNDFKFLGDRDGDILISHNHCNHKILRILVSPVDDDD-- 173 Query: 339 FSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKF-NSSVVHTCWN 
163 F SG+S VG+L+ACTLYSV+W+ V+ + P L LG K F +SS+V C++ Sbjct: 174 FEENSGDS-VVGYLMACTLYSVHWYSVKFVKSS---KSPALDYLGCKLFKSSSIVSACFS 229 Query: 162 PHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSW 34 PH+P+ESMVLL+NG LF FDL++ + KG K+ V W Sbjct: 230 PHLPQESMVLLENGALFFFDLESDVNCQIPNAYFKGNKLRVLW 272 >ref|XP_006837237.1| hypothetical protein AMTR_s00602p00003840 [Amborella trichopoda] gi|548839836|gb|ERN00091.1| hypothetical protein AMTR_s00602p00003840 [Amborella trichopoda] Length = 703 Score = 127 bits (318), Expect = 8e-27 Identities = 96/282 (34%), Positives = 130/282 (46%), Gaps = 7/282 (2%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M+FSEEWKS +S+ SV P L++G SA G Sbjct: 1 MDFSEEWKSQFSVGSVFPCPRLITGESAHSLGPLCFSPINLATHFLSLANTPVCYSPPPT 60 Query: 690 XFCLHKALRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILLF 511 + F R S D FI P +S T + N L +L C N + L+ Sbjct: 61 AQDVFSTADCFYRRSDDDFI-PFPLSFSTTKSAVGKHSSRHFSGNPLHLLTCRNGESLIL 119 Query: 510 FLTGENSDFVGFVKLSLKGSKPKIMVDNG-------DNVFTADHGLDCRILKILVISVTD 352 F +GENSD + V G + + DNG D+VF RI+++ VIS Sbjct: 120 FPSGENSDPLTCVV----GRRER---DNGGGFSLLKDSVFLLSPSFKNRIIRVSVIST-- 170 Query: 351 SGLGFSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVVHT 172 +G SS + T GF+L C+ Y V+ RV + N P+ NL + F + V H Sbjct: 171 AGCASSSEVCDQFTEGFVLLCSHYEVHQLRVGVRN-----STPLSQNLASATFKNQVAHA 225 Query: 171 CWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKV 46 CW+P++ EES VLL NGEL L+DL+ C GV+ LPVK KG V Sbjct: 226 CWSPYLLEESAVLLVNGELRLYDLNYCVGVKNLPVKFKGELV 267 >ref|XP_006841229.1| hypothetical protein AMTR_s00135p00060200 [Amborella trichopoda] gi|548843145|gb|ERN02904.1| hypothetical protein AMTR_s00135p00060200 [Amborella trichopoda] Length = 397 Score = 119 bits (297), Expect = 2e-24 Identities = 91/284 (32%), Positives = 126/284 (44%), Gaps = 9/284 (3%) Frame = -2 Query: 870 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 691 M+FSEEWKS + + SV P L++G SA Sbjct: 1 MDFSEEWKSQFPVGSVFPYPCLITGESAH------------------------------- 29 
Query: 690 XFCLHKALRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILLF 511 S D +P + T + + N L +L C N +IL+ Sbjct: 30 --------------SLDDDFIPFPLIFSTTKSAAGKHSSRHFSGNPLHLLTCRNGEILIL 75 Query: 510 FLTGENSDFVGFVKLSLKGSKPKIMVDNG-------DNVFTADHGLDCRILKILVISVTD 352 F + ENSD + V G + + DNG D+VF RI+ + VIS D Sbjct: 76 FPSRENSDRLACVV----GRRER---DNGGGFSLLKDSVFLLSPSFKNRIIGVSVISTAD 128 Query: 351 SGLGFSSMSG--NSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVV 178 ++S S + T GF+L C+ Y V+W RV + N P+ NL + F + V Sbjct: 129 ----YASCSEVCDQFTKGFVLLCSHYEVHWLRVGVRN-----STPLSQNLASATFKNQVA 179 Query: 177 HTCWNPHVPEESMVLLDNGELFLFDLDACSGVEKLPVKLKGTKV 46 H CW+P++PEES VLL NGEL L+DL+ C GV+ LPVK KG V Sbjct: 180 HACWSPYLPEESAVLLVNGELRLYDLNYCVGVKNLPVKFKGELV 223 >ref|XP_006299498.1| hypothetical protein CARUB_v10015667mg [Capsella rubella] gi|482568207|gb|EOA32396.1| hypothetical protein CARUB_v10015667mg [Capsella rubella] Length = 866 Score = 115 bits (289), Expect = 2e-23 Identities = 81/202 (40%), Positives = 116/202 (57%), Gaps = 5/202 (2%) Frame = -2 Query: 624 SVISSITADFDSQSDETSPIVN-NNLQILRCPN-NDILLFFLTGENSDFVGFVKLSL--K 457 S I++ + + D+T+ +++ N LQ L P+ N +L+FF TG N D +GF+ LS Sbjct: 76 SAIAAASLSVPNPPDDTAKVLSYNRLQFLPFPSKNSVLVFFPTGTNLDRIGFLLLSTGDS 135 Query: 456 GSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACTLYS 277 G + D GD VF A L RILKILV V+ SS +S +G++L +LYS Sbjct: 136 GGLQVLGSDEGD-VFVATERLFSRILKILVQPVSTFAADDSS---SSVELGYVLVYSLYS 191 Query: 276 VNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHTCWNPHVPEESMVLLDNGELFLFDL 100 ++WF V N KP+L NLG K+F VV W+PH+ ES+VLL+NGE+FLFD Sbjct: 192 IHWFCV---NYDESQGKPVLRNLGCKQFKMCMVVSAAWSPHITGESLVLLENGEVFLFD- 247 Query: 99 DACSGVEKLPVKLKGTKVGVSW 34 V + +L+G+K+ VSW Sbjct: 248 -----VNQRLSRLRGSKLKVSW 264 >ref|XP_006406618.1| hypothetical protein EUTSA_v10020051mg [Eutrema salsugineum] gi|557107764|gb|ESQ48071.1| hypothetical protein EUTSA_v10020051mg [Eutrema salsugineum] Length = 852 Score = 114 bits (286), Expect = 4e-23 Identities = 80/204 (39%), Positives = 115/204 (56%), 
Gaps = 6/204 (2%) Frame = -2 Query: 627 PSVISSITADF--DSQSDETSPIVN-NNLQILRCP-NNDILLFFLTGENSDFVGFVKLSL 460 PS S+I A F +D+ +++ N LQ+LRCP N +L+FF TG N D +GFV LS Sbjct: 64 PSDSSAIEASFRIPHPNDDAERVLSYNRLQLLRCPVKNCVLVFFPTGSNLDQIGFVLLST 123 Query: 459 KGSKP-KIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACTL 283 S ++M + VF A RILKI V + S LG SSM G+++ TL Sbjct: 124 GDSGAIRVMGTDEGYVFVAKERFFSRILKIFVQPI--SNLGASSME-----FGYVMVYTL 176 Query: 282 YSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESMVLLDNGELFLF 106 YS++WF V+ L +P+L LG K+F S+ W+PH P E +VLL+NGE+F+F Sbjct: 177 YSIHWFSVKYDE---SLGRPVLSYLGQKQFKRCSIASASWSPHFPGECLVLLENGEVFVF 233 Query: 105 DLDACSGVEKLPVKLKGTKVGVSW 34 DL+ ++ + +G K+ VSW Sbjct: 234 DLN-----QRHLGRFRGCKMKVSW 252 >ref|XP_002885248.1| hypothetical protein ARALYDRAFT_479330 [Arabidopsis lyrata subsp. lyrata] gi|297331088|gb|EFH61507.1| hypothetical protein ARALYDRAFT_479330 [Arabidopsis lyrata subsp. lyrata] Length = 856 Score = 114 bits (285), Expect = 6e-23 Identities = 81/205 (39%), Positives = 119/205 (58%), Gaps = 7/205 (3%) Frame = -2 Query: 627 PSVISSITADFD--SQSDETSPIVN-NNLQILRCPN-NDILLFFLTGENSDFVGFVKLSL 460 PS S+I + F+ + D+T+ +++ N LQ L P+ N +L+FF TG N D +GF+ LS Sbjct: 64 PSDSSAIHSSFNILNPHDDTARVLSYNRLQFLPFPSKNSVLVFFPTGTNLDQIGFLLLST 123 Query: 459 KGSKPKIMVDNGD--NVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACT 286 G + V D +VF A L RILKILV V+D G S SG +G++L Sbjct: 124 -GDSGGLQVTGSDEGDVFVATERLFYRILKILVQPVSDFGAYKCSSSGE---LGYVLVYC 179 Query: 285 LYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESMVLLDNGELFL 109 LYS++W+ V+ KP+L NLG+K+F +V W+PHV E ++LLDNGE+F+ Sbjct: 180 LYSIHWYCVKYDESQG---KPVLRNLGSKQFKRFMIVSASWSPHVTGECLLLLDNGEVFV 236 Query: 108 FDLDACSGVEKLPVKLKGTKVGVSW 34 FDL+ + +L+G K+ VSW Sbjct: 237 FDLN------QRHCRLRGCKLKVSW 255