BLASTX nr result
ID: Akebia23_contig00038191
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00038191
         (756 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]    159   8e-37
ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prun...  154   4e-35
emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera]  134   3e-29
ref|XP_002530358.1| conserved hypothetical protein [Ricinus comm...  132   1e-28
ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613...  125   2e-26
ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797...  124   4e-26
ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205...  122   1e-25
ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citr...  121   3e-25
ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305...  119   1e-24
ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phas...  117   3e-24
ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cuc...  116   7e-24
ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago ...  110   4e-22
ref|XP_007026747.1| TATA box-binding protein-associated factor R...  109   1e-21
ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Popu...  107   6e-21
ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [A...  105   2e-20
ref|XP_006837237.1| hypothetical protein AMTR_s00602p00003840 [A...   96   2e-17
ref|XP_006406618.1| hypothetical protein EUTSA_v10020051mg [Eutr...   94   5e-17
ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260...   94   5e-17
ref|NP_001049507.1| Os03g0240400 [Oryza sativa Japonica Group] g...
93 9e-17 ref|XP_006299498.1| hypothetical protein CARUB_v10015667mg [Caps... 92 2e-16 >gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis] Length = 1000 Score = 159 bits (403), Expect = 8e-37 Identities = 99/251 (39%), Positives = 135/251 (53%), Gaps = 6/251 (2%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 MNFSEEWKSL+ IS+V PLLLSGPSA+ Sbjct: 1 MNFSEEWKSLFPISAVFKSPLLLSGPSARTILGPLVFNPKESTITCLFSSPSLLPPFTPL 60 Query: 555 XXXLHKAFRNFTRPSKDPFILPSVISSITADFDS---QSDETSPIVNNNLQILRCPNND- 388 F S D LPS SSI + F Q D S +N LQ+L CP D Sbjct: 61 PRLSFPRF--LLTSSDDSSQLPSTSSSIASVFGPHHYQDDVASAFSHNRLQLLHCPRTDK 118 Query: 387 ILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGL 208 ++FFPTG+N++ VGF+ LS+K S + VD+ F D G + +IL+I + V DSG Sbjct: 119 FIVFFPTGDNANQVGFMLLSIKNSCLDVRVDDNGEAFMVDCGSNHQILRISINPVVDSGS 178 Query: 207 GFSSMSGNCS-TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTC 34 ++ GN S T+G+LLA T+YSV+W+ +E+ LG +L P L +GTK F + +VH C Sbjct: 179 ALLALGGNSSGTIGYLLASTMYSVHWYVIEVKELGLNLH-PSLTCVGTKVFKTCCIVHAC 237 Query: 33 WNPHVPEESVV 1 W+PH+ EES++ Sbjct: 238 WSPHILEESII 248 >ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica] gi|462415669|gb|EMJ20406.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica] Length = 925 Score = 154 bits (388), Expect = 4e-35 Identities = 102/252 (40%), Positives = 139/252 (55%), Gaps = 10/252 (3%) Frame = -1 Query: 726 SEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 547 +EEWKSL+ ISSV PPLLLS PS KP Sbjct: 8 TEEWKSLFPISSVFKPPLLLSNPSLKPI--LGPLIFNPKPNSTTLLFSSSSSLLAPLPPL 65 Query: 546 LHKAFRNF--TRPSKDPFILPSVISSITA---DFDSQSDETSPIVNNNLQILRCPN-NDI 385 H + F T PS D LPS + S+ + +SD +S ++ N L+ L+CP N + Sbjct: 66 PHLSLPRFLLTSPS-DSAPLPSSVPSVASFLGPHHPKSDVSSSLLYNRLEFLQCPQINTV 124 Query: 384 LLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 205 ++FFPTGENSD VGF++L LKGS + VD VF + RI +I V + G Sbjct: 125 VVFFPTGENSDQVGFLQLVLKGSTFDVKVDENGGVFASRRWFSYRISRISVNPIP----G 180 
Query: 204 FSSMSGN--CSTVGFLLACTLYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHT 37 FSS+ GN C T+G+LLA T+YSV+WF V++ + G + + + LV+LG+K F + VVH Sbjct: 181 FSSLRGNGSCVTIGYLLASTMYSVHWFIVKVGDFGPNSDSRVSLVHLGSKIFKTCCVVHA 240 Query: 36 CWNPHVPEESVV 1 CW+PH+ EESVV Sbjct: 241 CWSPHLLEESVV 252 >emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera] Length = 865 Score = 134 bits (338), Expect = 3e-29 Identities = 92/247 (37%), Positives = 128/247 (51%), Gaps = 2/247 (0%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSG-PSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXX 559 M+FSEEWKS+W ISSV +PPLL+S PS P Sbjct: 1 MDFSEEWKSIWPISSVFTPPLLISSKPSLGPL---------------------------- 32 Query: 558 XXXXLHKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILL 379 F PS P L + S + F +S ++++ L +LRCPN +L Sbjct: 33 -----------FFNPS--PNTLTPLFSKPSFSFPPHLPRSS-LLHDRLHLLRCPNAAVLA 78 Query: 378 FFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFS 199 FPTG NSD +GF+ LS+K S + D +VF + L+ RI++IL + +G+ Sbjct: 79 LFPTGVNSDQIGFLLLSVKDSCLDVRADRNGDVFVSKKRLNHRIVQILA-----TPIGY- 132 Query: 198 SMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPH 22 S SGN +VG +LACT+YSV+WF V N+ S+ P L+ LG K F S +VV CW+PH Sbjct: 133 SFSGNPDSVGLVLACTMYSVHWFSVRNDNIDSE---PGLIYLGGKVFKSCAVVSACWSPH 189 Query: 21 VPEESVV 1 + EE +V Sbjct: 190 LSEECLV 196 >ref|XP_002530358.1| conserved hypothetical protein [Ricinus communis] gi|223530105|gb|EEF32019.1| conserved hypothetical protein [Ricinus communis] Length = 912 Score = 132 bits (332), Expect = 1e-28 Identities = 97/252 (38%), Positives = 130/252 (51%), Gaps = 7/252 (2%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPF-GXXXXXXXXXXXXXXXXXXXXXXXXXXX 559 M+ SEEWKSL+ I SV PLLLS P++K G Sbjct: 1 MDLSEEWKSLFPIGSVFDAPLLLSSPTSKSILGPLFFNPNRKTLTQLYKSPSLFPPLLNP 60 Query: 558 XXXXLHKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSP--IVNNNLQILRCPN-ND 388 F + P L S SSIT+ SQ + S + +N LQ L CP+ N Sbjct: 61 PPRLSLSRFLTTSTTFDSPIPL-STASSITSRLGSQFHDNSASLLAHNQLQFLNCPHDNS 119 Query: 387 
ILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGL 208 +++FF TG N D VGF+ LS+ + + D+ VF A+ L+ RI+KILV V DSG Sbjct: 120 VIVFFSTGCNHDQVGFLLLSVNDKRLCAVGDSRGGVFVANKCLNQRIVKILVNPVVDSGY 179 Query: 207 GFSSMSGNCST--VGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHT 37 GN S+ VG+LL TL+SV+WF V+I + E+PIL ++G K F S S+V Sbjct: 180 ----FEGNASSKIVGYLLVYTLFSVHWFCVKIGEIN---ERPILGHVGCKTFKSCSIVDA 232 Query: 36 CWNPHVPEESVV 1 CW+PH+ EESVV Sbjct: 233 CWSPHLIEESVV 244 >ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613824 [Citrus sinensis] Length = 910 Score = 125 bits (313), Expect = 2e-26 Identities = 75/175 (42%), Positives = 105/175 (60%), Gaps = 9/175 (5%) Frame = -1 Query: 498 ILPSVISSITADFDSQSDETSPIVN------NNLQILRCP-NNDILLFFPTGENSDFVGF 340 +LPS +SI + FD P + N L++L CP NN + FFPTG+N+D +GF Sbjct: 75 LLPSTSTSIASQFDDVGTHQHPNGSLSDQDYNRLRLLYCPLNNTAIAFFPTGDNNDQLGF 134 Query: 339 VKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGN-CSTVGFL 163 + +S KGS+ ++ D D VFT + L+ RI ILV V + +S+ GN VG+L Sbjct: 135 LVISAKGSRFDVLSDEDDAVFTVVNRLNGRIRGILVNPVEEF---YSAFQGNSLVNVGYL 191 Query: 162 LACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESVV 1 LA T+YSV+WF V++S KP++ LG K F + SVV CW+PH+PEESVV Sbjct: 192 LAFTMYSVHWFSVKVSKASESTIKPVVSYLGFKLFKTCSVVGACWSPHLPEESVV 246 >ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797045 isoform X1 [Glycine max] gi|571481421|ref|XP_006588649.1| PREDICTED: uncharacterized protein LOC100797045 isoform X2 [Glycine max] Length = 894 Score = 124 bits (311), Expect = 4e-26 Identities = 89/249 (35%), Positives = 123/249 (49%), Gaps = 4/249 (1%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 M SEEWKS + + PLLLS + P G Sbjct: 1 MELSEEWKSFFPTGASTVSPLLLSRSHSLPLGPLLFNPNPNSLSVLFSSPSLVPCLHLPP 60 Query: 555 XXXLHKAFRNFTRPSKDPFILPSVISSITA--DFDSQSDETSPIVNNNLQILRCPNN-DI 385 H F S ILPS SS+ + F +Q+D S + N L +L PN + Sbjct: 61 ----HLFPSRFLLTSHPHSILPSTASSVASLFSFPNQNDAASLFLRNRLHLLYYPNRPNA 116 Query: 384 
LLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 205 ++FFPTG N D +GF L++K S+ I++D+ +VF A G RIL I V V DSGL Sbjct: 117 VVFFPTGANDDKLGFFILAVKDSRLDILLDSNGDVFRASTGSAHRILNISVNPVADSGLF 176 Query: 204 FSSMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWN 28 S +G+LLA LYSV+WF V+ +++ L++P + LG K F + VVH CW+ Sbjct: 177 NES-----HVIGYLLASALYSVHWFAVKHNSV---LDRPSVFYLGGKTFKTCPVVHACWS 228 Query: 27 PHVPEESVV 1 PH+ EES+V Sbjct: 229 PHILEESLV 237 >ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205354 [Cucumis sativus] Length = 907 Score = 122 bits (306), Expect = 1e-25 Identities = 90/247 (36%), Positives = 125/247 (50%), Gaps = 6/247 (2%) Frame = -1 Query: 723 EEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL 544 EEWKSL+ I +V PLL+SG S K Sbjct: 4 EEWKSLFPIGTVFKSPLLISGSSVKNSIGPLVFNPVPTSLTRLFSSQSLLPSLSPPSVLN 63 Query: 543 HKAFRNFTRPSKDPFILPSVISSITADFDSQ---SDETSPIVNNNLQILRCPNND-ILLF 376 F T S ++PS SS+ + F Q SD S + N LQ L CPN+ +++F Sbjct: 64 LPRFL-LTSSS----VVPSTSSSVASLFGEQQCCSDPPSVLRYNRLQCLPCPNSSSVVVF 118 Query: 375 FPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSS 196 FPTG NSD VGF+ +S GS + D ++VF+ + L+ +I I V + GF Sbjct: 119 FPTGPNSDHVGFLVVSSNGSGLDVQSDCSNDVFSVESELNYQIFGIAV----NPNSGF-- 172 Query: 195 MSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHTCWNPH 22 + + +GFLLA T+YSV WF V+ +GS + + LV++G+K F + SVVH CWNPH Sbjct: 173 VDDSYEDIGFLLAYTMYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPH 232 Query: 21 VPEESVV 1 + EESVV Sbjct: 233 LSEESVV 239 >ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citrus clementina] gi|557533804|gb|ESR44922.1| hypothetical protein CICLE_v10000213mg [Citrus clementina] Length = 910 Score = 121 bits (303), Expect = 3e-25 Identities = 88/260 (33%), Positives = 122/260 (46%), Gaps = 15/260 (5%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 M+F+EE KS + I PPLL S S + Sbjct: 1 MDFTEELKSQFPIGKFLKPPLLQSSESIQ------------GPLFFNPNPETLTLLSSSK 48 Query: 555 
XXXLHKAFRNFTRPSKDPFI-------LPSVISSITADFDSQSDETSPIVN------NNL 415 H F R + F+ LPS +SI + F P + N L Sbjct: 49 TLCPHSLFSPLPRLTLSRFLSTSSSSLLPSTSTSIASQFGDVGTHQHPDGSLSDQDYNRL 108 Query: 414 QILRCP-NNDILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKI 238 ++L CP NN + FFPTG+N+D +GF+ +S KGS+ ++ D D +F + L+ RI I Sbjct: 109 RLLYCPLNNTAIAFFPTGDNNDQLGFLVISAKGSRFDVLSDEDDAIFMVLNRLNGRIRGI 168 Query: 237 LVISVTDSGLGFSSMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKF 58 LV V + F S VG+LLA T+YSV+WF V++S KP++ LG K F Sbjct: 169 LVNPVEEFDSAFQGNS--LVNVGYLLAFTMYSVHWFSVKVSKASESTTKPVVSYLGFKLF 226 Query: 57 NS-SVVHTCWNPHVPEESVV 1 + SVV CW+PH+PEESVV Sbjct: 227 KTCSVVGACWSPHLPEESVV 246 >ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305856 [Fragaria vesca subsp. vesca] Length = 914 Score = 119 bits (298), Expect = 1e-24 Identities = 96/250 (38%), Positives = 129/250 (51%), Gaps = 9/250 (3%) Frame = -1 Query: 723 EEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXL 544 EEWKSL+ ISSV PPLL+S PS G L Sbjct: 6 EEWKSLFPISSVFKPPLLISNPSI--LGPLIFNPKANSTTLLFSSPTLLPPLTPLPHLSL 63 Query: 543 HKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVN---NNLQILRCPN-NDILLF 376 + F + + P P LPS SSI A F + +++ N L+ L+CP N IL+F Sbjct: 64 PR-FLSTSSPESAP--LPSTSSSI-APFLGPHQYKNDLLSSFRNRLEFLQCPKTNTILIF 119 Query: 375 FPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRI---LKILVISVTDSGLG 205 FPTGENSD VG ++L LK S + V GL R +IL ISV Sbjct: 120 FPTGENSDQVGLLELVLKDSTFDVKVG----------GLSTRCQFKYQILRISVNPLP-S 168 Query: 204 FSSMSGNCS-TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCW 31 S+++GN T+G++LA T+YSV+WF V++ + GS+ + LV +G + F + VVH CW Sbjct: 169 LSNLTGNGPVTIGYVLASTMYSVHWFIVKLGDFGSNSDSIRLVYVGDRVFKACCVVHACW 228 Query: 30 NPHVPEESVV 1 +PHVPEESVV Sbjct: 229 SPHVPEESVV 238 >ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|593199831|ref|XP_007132390.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|593199873|ref|XP_007132391.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] 
gi|561005389|gb|ESW04383.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|561005390|gb|ESW04384.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] gi|561005391|gb|ESW04385.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris] Length = 894 Score = 117 bits (294), Expect = 3e-24 Identities = 88/251 (35%), Positives = 114/251 (45%), Gaps = 6/251 (2%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 M SEEWKS + + S PLLLS + P G Sbjct: 1 MELSEEWKSFFPVGSSTVAPLLLSNSPSLPLGPLLFNPNPNSLSLLFSSPSLLPSLYCPP 60 Query: 555 XXXLHKAFRNFTRPSKDPFILPSVISSITADFDS--QSDETSPIVNNNLQILRCPNNDI- 385 + F S P ILPS SSI + F S Q+D P ++N L +L P+ Sbjct: 61 YLLPSR----FLLSSHPPSILPSTASSIASLFSSTHQNDAAPPFLHNRLHLLTYPHRPYA 116 Query: 384 LLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGL- 208 LL FP G N + F L K S+ +D +VF A G RIL I V V D G Sbjct: 117 LLLFPAGSNDHKLAFFTLRFKDSRFHTQLDTKGDVFYASTGSSHRILNISVNPVADFGFT 176 Query: 207 GFSSMSGNCS-TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTC 34 G + S +G+LLA TLYSV+WF ++ L++P +V LG K F + V H C Sbjct: 177 GSDDEDDDASRVIGYLLATTLYSVHWF---VARHNQILDRPSVVCLGDKMFKTCPVAHAC 233 Query: 33 WNPHVPEESVV 1 W+PH+ EESVV Sbjct: 234 WSPHILEESVV 244 >ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cucumis sativus] Length = 862 Score = 116 bits (291), Expect = 7e-24 Identities = 72/172 (41%), Positives = 104/172 (60%), Gaps = 6/172 (3%) Frame = -1 Query: 498 ILPSVISSITADFDSQ---SDETSPIVNNNLQILRCPNND-ILLFFPTGENSDFVGFVKL 331 ++PS SS+ + F Q SD S + N LQ L CPN+ +++FFPTG NSD VGF+ + Sbjct: 69 VVPSTSSSVASLFGEQQCYSDPPSVLRYNRLQCLPCPNSSSVVVFFPTGPNSDHVGFLVV 128 Query: 330 SLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNCSTVGFLLACT 151 S GS + D ++VF+ + L+ +I I V + GF + + +GFLLA T Sbjct: 129 SSNGSGLDVQSDCSNDVFSVESELNYQIFGIAV----NPNSGF--VDDSYEDIGFLLAYT 182 Query: 150 LYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHTCWNPHVPEESVV 1 +YSV WF V+ +GS + + LV++G+K F + SVVH CWNPH+ EESVV Sbjct: 183 
MYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPHLSEESVV 234 >ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago truncatula] gi|355489812|gb|AES71015.1| hypothetical protein MTR_3g069120 [Medicago truncatula] Length = 884 Score = 110 bits (276), Expect = 4e-22 Identities = 86/252 (34%), Positives = 119/252 (47%), Gaps = 7/252 (2%) Frame = -1 Query: 735 MNFSEEWKSLWSI-SSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXX 559 M FSEEWKSL+ I +S S LL S P + Sbjct: 1 MEFSEEWKSLFPIGASTVSNLLLHSDPDS---------LGPLFFNPNSNSPTPIFSSTIP 51 Query: 558 XXXXLHKAFRNFTRPSKDPFILPSVISSITADFDS----QSDETSPIVNNNLQILRCPNN 391 H + DP ILPS S+I FDS D S ++N +Q+L+CPN Sbjct: 52 SLHLPHNLLTERYLLTSDPSILPSTASTIAHLFDSTPELDDDNVSHFLHNRIQLLKCPNT 111 Query: 390 D-ILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDS 214 ++ FPTG N + +GF L +K S + +D +VF A G RIL++ V VT+ Sbjct: 112 PKAVVIFPTGANDETIGFFMLGVKDSLLETRLDVKGDVFRASTGSSSRILRMSVNPVTED 171 Query: 213 GLGFSSMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLG-TKKFNSSVVHT 37 S + +G++LA + YSV WF V+ NL SD P + LG +K F +VV Sbjct: 172 ----DSEPDSSPVIGYVLASSRYSVCWFDVK-HNLSSD--SPSMSYLGRSKVFKEAVVRA 224 Query: 36 CWNPHVPEESVV 1 CW+PH+ EES+V Sbjct: 225 CWSPHILEESMV 236 >ref|XP_007026747.1| TATA box-binding protein-associated factor RNA polymerase I subunit C, putative [Theobroma cacao] gi|508715352|gb|EOY07249.1| TATA box-binding protein-associated factor RNA polymerase I subunit C, putative [Theobroma cacao] Length = 910 Score = 109 bits (272), Expect = 1e-21 Identities = 80/249 (32%), Positives = 120/249 (48%), Gaps = 4/249 (1%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 M SEEWKS + I PPLLLS S P Sbjct: 1 MELSEEWKSYFPIGKSLDPPLLLSSASPGPL-----FFIPKPRTLPKTLFSSPSLFPPLH 55 Query: 555 XXXLHKAFRNFTRPSKDPFILPSVISSITA--DFDSQSDETSPIVNNNLQILRCPNNDI- 385 +F F S P+ S I+S F + +S + +N L +L CP+ +I Sbjct: 56 PPPSRLSFSRFLSTSSVPYSASSSIASRFGLESFYDDAASSSFLSHNRLHLLHCPDQNIA 115 Query: 384 LLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLG 205 
++FF TG N D +GF + ++ + K + D ++ + + + +IL+ILV V D Sbjct: 116 VVFFTTGANHDRIGFFAVHVQDNDFKFLGDRDGDILISHNHCNHKILRILVSPVDDD--D 173 Query: 204 FSSMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKF-NSSVVHTCWN 28 F SG+ S VG+L+ACTLYSV+W+ V+ + P L LG K F +SS+V C++ Sbjct: 174 FEENSGD-SVVGYLMACTLYSVHWYSVKFV---KSSKSPALDYLGCKLFKSSSIVSACFS 229 Query: 27 PHVPEESVV 1 PH+P+ES+V Sbjct: 230 PHLPQESMV 238 >ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa] gi|222858389|gb|EEE95936.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa] Length = 906 Score = 107 bits (266), Expect = 6e-21 Identities = 82/254 (32%), Positives = 123/254 (48%), Gaps = 9/254 (3%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 + FS+EWKS + I +V PLLLS +++ Sbjct: 4 IEFSQEWKSGFPIDTVSKAPLLLSKQTSESL--IGPLVFNPIPESLAHLFTSPALSPPLL 61 Query: 555 XXXLHKAFRNFTRPSK--DPFILPSVISSITADFDSQSDE-TSPIVN-NNLQILRCPNND 388 H + F S D + S SSI F Q +SP++ N LQ L+CP++D Sbjct: 62 NPPPHLSLTRFISTSTLADSPLPLSTASSIAFSFGPQDLHFSSPLLAYNRLQFLKCPHDD 121 Query: 387 -ILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSG 211 +++FF TG N D VGF+ LS+K D +FTA L +I+++LV + D Sbjct: 122 TVVVFFSTGTNLDRVGFLLLSVKDKSLVATGDQKGGIFTASKSLGSKIVRVLVNPIEDD- 180 Query: 210 LGFSSMSGNCS---TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVV 43 S ++GN S + G+LL T+YSVNWF V+ S +++P+L LG K F S + Sbjct: 181 ---SFLNGNYSFSGSFGYLLVYTMYSVNWFCVKYS---ESMKRPVLSYLGCKNFKSCGIA 234 Query: 42 HTCWNPHVPEESVV 1 CW+P++ +SVV Sbjct: 235 SACWSPYIKVQSVV 248 >ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [Amborella trichopoda] gi|548846551|gb|ERN05827.1| hypothetical protein AMTR_s00006p00260920 [Amborella trichopoda] Length = 929 Score = 105 bits (262), Expect = 2e-20 Identities = 81/254 (31%), Positives = 111/254 (43%), Gaps = 9/254 (3%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 M+FSE+WKS + + SV S P L++G SA G Sbjct: 1 
MDFSEDWKSQFPVGSVFSCPRLITGESAHSLGPLCFSPINPATHFLSLANTPVCYSPPPT 60 Query: 555 XXXLHKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILLF 376 + F R S D FI +I S T + N L +L C N + LL Sbjct: 61 AQDVFSTADWFYRRSDDDFIPFPLIFSTTKSAAGKHSSRH-FFGNPLHLLTCRNGEFLLL 119 Query: 375 FPTGENSDFVGFVKLSLKGSKPKIMVDNG-------DNVFTADHGLDCRILKILVISVTD 217 FP+GENSD + V G + + DNG D+VF RI+++ VIS D Sbjct: 120 FPSGENSDRLACVV----GRRER---DNGGGFSLVKDSVFLLSPSFKNRIIRVSVISTAD 172 Query: 216 SGLGFSSMSGNCS--TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVV 43 +S S C T GF+L C+ Y V+W RV + N P+ NL + F + V Sbjct: 173 C----ASSSEVCDQFTEGFVLLCSHYEVHWLRVGVRN-----STPLSQNLASATFKNQVA 223 Query: 42 HTCWNPHVPEESVV 1 H CW+P++PEES V Sbjct: 224 HACWSPYLPEESAV 237 >ref|XP_006837237.1| hypothetical protein AMTR_s00602p00003840 [Amborella trichopoda] gi|548839836|gb|ERN00091.1| hypothetical protein AMTR_s00602p00003840 [Amborella trichopoda] Length = 703 Score = 95.5 bits (236), Expect = 2e-17 Identities = 79/254 (31%), Positives = 109/254 (42%), Gaps = 9/254 (3%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXXXXX 556 M+FSEEWKS +S+ SV P L++G SA G Sbjct: 1 MDFSEEWKSQFSVGSVFPCPRLITGESAHSLGPLCFSPINLATHFLSLANTPVCYSPPPT 60 Query: 555 XXXLHKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILLF 376 + F R S D FI P +S T + N L +L C N + L+ Sbjct: 61 AQDVFSTADCFYRRSDDDFI-PFPLSFSTTKSAVGKHSSRHFSGNPLHLLTCRNGESLIL 119 Query: 375 FPTGENSDFVGFVKLSLKGSKPKIMVDNG-------DNVFTADHGLDCRILKILVISVTD 217 FP+GENSD + V G + + DNG D+VF RI+++ VIS Sbjct: 120 FPSGENSDPLTCVV----GRRER---DNGGGFSLLKDSVFLLSPSFKNRIIRVSVISTA- 171 Query: 216 SGLGFSSMSGNCS--TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVV 43 G +S S C T GF+L C+ Y V+ RV + N P+ NL + F + V Sbjct: 172 ---GCASSSEVCDQFTEGFVLLCSHYEVHQLRVGVRN-----STPLSQNLASATFKNQVA 223 Query: 42 HTCWNPHVPEESVV 1 H CW+P++ EES V Sbjct: 224 HACWSPYLLEESAV 237 >ref|XP_006406618.1| hypothetical protein EUTSA_v10020051mg [Eutrema salsugineum] gi|557107764|gb|ESQ48071.1| hypothetical protein 
EUTSA_v10020051mg [Eutrema salsugineum] Length = 852 Score = 94.0 bits (232), Expect = 5e-17 Identities = 67/170 (39%), Positives = 93/170 (54%), Gaps = 6/170 (3%) Frame = -1 Query: 492 PSVISSITADF--DSQSDETSPIVN-NNLQILRCP-NNDILLFFPTGENSDFVGFVKLSL 325 PS S+I A F +D+ +++ N LQ+LRCP N +L+FFPTG N D +GFV LS Sbjct: 64 PSDSSAIEASFRIPHPNDDAERVLSYNRLQLLRCPVKNCVLVFFPTGSNLDQIGFVLLST 123 Query: 324 KGSKP-KIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNCSTVGFLLACTL 148 S ++M + VF A RILKI V + S LG SSM G+++ TL Sbjct: 124 GDSGAIRVMGTDEGYVFVAKERFFSRILKIFVQPI--SNLGASSME-----FGYVMVYTL 176 Query: 147 YSVNWFRVEISNLGSDLEKPILVNLGTKKF-NSSVVHTCWNPHVPEESVV 1 YS++WF V+ L +P+L LG K+F S+ W+PH P E +V Sbjct: 177 YSIHWFSVKYD---ESLGRPVLSYLGQKQFKRCSIASASWSPHFPGECLV 223 >ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260775 [Solanum lycopersicum] Length = 907 Score = 94.0 bits (232), Expect = 5e-17 Identities = 85/268 (31%), Positives = 119/268 (44%), Gaps = 23/268 (8%) Frame = -1 Query: 735 MNFSEEWKSLWSISSVHSPPLLLSGPSAK----------PFGXXXXXXXXXXXXXXXXXX 586 M+ S++WK+LW I S S PLLLS + P G Sbjct: 1 MDSSDKWKALWKIWSSFSSPLLLSNSHEESSSKRRRIDSPIGPLIFRPCEETLTPLLRSP 60 Query: 585 XXXXXXXXXXXXXLHKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVN-NNLQI 409 + F + S +L S SSI +F Q +T I N N++Q Sbjct: 61 LLSTRIPSPVPDL---SLPRFLQTSSG--MLFSTASSIATEFSPQVSDT--IHNFNSIQF 113 Query: 408 LRCPN-------NDILLFFPTGENSDFVGFVKLSLKGSK-PKIMVDNGDNVFTADHGLDC 253 L PN N I+ PTGEN D VG L + ++ NG ++ +H L+ Sbjct: 114 LPLPNFGENSKPNSIIGISPTGENYDQVGLFMLCSEDTQFVAKKFKNGTSILVHNHKLNF 173 Query: 252 RILKILVISVTDSGLGFSSMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNL 73 RIL++LV V++ S S +C T G+LL CTLYSV+W+ V+I G E +L + Sbjct: 174 RILRLLVNPVSEID---DSCSSSCITFGYLLVCTLYSVHWYSVKIGVKGD--ENVMLDYV 228 Query: 72 GTKKFN----SSVVHTCWNPHVPEESVV 1 G+ N V H CW+PH+ EE VV Sbjct: 229 GSADRNLFKGGIVSHACWSPHLREECVV 256 >ref|NP_001049507.1| Os03g0240400 [Oryza sativa Japonica Group] gi|113547978|dbj|BAF11421.1| Os03g0240400, partial [Oryza sativa Japonica Group] Length = 928 Score = 
93.2 bits (230), Expect = 9e-17 Identities = 72/252 (28%), Positives = 113/252 (44%), Gaps = 4/252 (1%) Frame = -1 Query: 744 PISMNFSEEWKSLWSISSVHSPPLLLSGPSAKPFGXXXXXXXXXXXXXXXXXXXXXXXXX 565 P +MN S++W+ L+ +SSV +PP L + +A Sbjct: 50 PPAMNLSDDWRFLFPVSSVFAPPSLATSSAAAASYGPLLFSPLPPHATLLALPSPFQPPH 109 Query: 564 XXXXXXLHKAFRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNND- 388 H R+F R + F+ + + ++ + P +N L +LR P++ Sbjct: 110 PSRRGLRH-LLRHFVRSTS--FLPFADLDPLSGALLTAPSPPFPAPSNLLAVLRAPSSSR 166 Query: 387 -ILLFFPTGENSDFVGFVKLS--LKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTD 217 +++FFP+GEN++ V +V L + P D H +I ++ T Sbjct: 167 SLVVFFPSGENAEQVSYVTLDPVADPTTPLSHSVQSDGFMHPRH-------RIQQLATTA 219 Query: 216 SGLGFSSMSGNCSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVVHT 37 S + S S + S GFLLA TLYSVNWF+VE GS P LV + F+++VVH Sbjct: 220 SWSSWPSRSRDSSIEGFLLAATLYSVNWFKVESRGSGS----PALVPAAKQAFDAAVVHA 275 Query: 36 CWNPHVPEESVV 1 CW+ H+ E VV Sbjct: 276 CWSKHLQSECVV 287 >ref|XP_006299498.1| hypothetical protein CARUB_v10015667mg [Capsella rubella] gi|482568207|gb|EOA32396.1| hypothetical protein CARUB_v10015667mg [Capsella rubella] Length = 866 Score = 92.0 bits (227), Expect = 2e-16 Identities = 66/168 (39%), Positives = 93/168 (55%), Gaps = 5/168 (2%) Frame = -1 Query: 489 SVISSITADFDSQSDETSPIVN-NNLQILRCPN-NDILLFFPTGENSDFVGFVKLSL--K 322 S I++ + + D+T+ +++ N LQ L P+ N +L+FFPTG N D +GF+ LS Sbjct: 76 SAIAAASLSVPNPPDDTAKVLSYNRLQFLPFPSKNSVLVFFPTGTNLDRIGFLLLSTGDS 135 Query: 321 GSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNCSTVGFLLACTLYS 142 G + D GD VF A L RILKILV V+ SS S +G++L +LYS Sbjct: 136 GGLQVLGSDEGD-VFVATERLFSRILKILVQPVSTFAADDSSSS---VELGYVLVYSLYS 191 Query: 141 VNWFRVEISNLGSDLEKPILVNLGTKKFN-SSVVHTCWNPHVPEESVV 1 ++WF V N KP+L NLG K+F VV W+PH+ ES+V Sbjct: 192 IHWFCV---NYDESQGKPVLRNLGCKQFKMCMVVSAAWSPHITGESLV 236