BLASTX nr result
ID: Akebia27_contig00018129
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia27_contig00018129 (1212 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267... 327 5e-87 gb|EYU25683.1| hypothetical protein MIMGU_mgv1a024294mg, partial... 320 8e-85 ref|XP_007203754.1| hypothetical protein PRUPE_ppa014878mg [Prun... 301 3e-79 ref|XP_007012013.1| Uncharacterized protein TCM_037120 [Theobrom... 298 2e-78 ref|XP_002523533.1| conserved hypothetical protein [Ricinus comm... 297 7e-78 ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Popu... 292 2e-76 ref|XP_007138434.1| hypothetical protein PHAVU_009G208600g [Phas... 287 7e-75 gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea] 283 1e-73 ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [A... 269 2e-69 ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779... 255 3e-65 ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353... 255 3e-65 ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [S... 246 2e-62 gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indi... 241 5e-61 ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Sela... 151 6e-34 ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Popu... 147 1e-32 ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Sela... 138 4e-30 gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis] 137 9e-30 ref|NP_484331.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7... 120 1e-24 ref|YP_323579.1| uroporphyrinogen-III synthase [Anabaena variabi... 119 3e-24 ref|WP_016872683.1| hypothetical protein [Chlorogloeopsis fritsc... 112 3e-22 >ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267674 [Solanum lycopersicum] Length = 312 Score = 327 bits (839), Expect = 5e-87 Identities = 175/295 (59%), Positives = 218/295 (73%), Gaps = 6/295 (2%) Frame = -1 Query: 1140 SPLSHKR---IAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISK 970 SP + +R IA+TTP NYA RLS + LKG + +WCPT++VE+T+QT SI YL + Sbjct: 13 SPENSRRNCVIAFTTPQNYAPRLSELIHLKGWT-PLWCPTVIVESTEQTISSIHHYLNPQ 71 Query: 969 SR-PRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELL 793 + + L+ FSA+AFTSR GI+AFS+ L PPL + E TI+ALG DAELL Sbjct: 72 AGIDEPNSFLEEFSALAFTSRTGITAFSQALSMNPTPPL--TPNGEILTIAALGNDAELL 129 Query: 792 LNGDFVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDF 613 + DF+ K+C+NP RIRVLVP VATP+ LV++L LG GR+VLCPVPLVIG+ EPPV+P F Sbjct: 130 -DRDFIRKMCENPERIRVLVPSVATPSGLVEALGLGQGRKVLCPVPLVIGLNEPPVVPKF 188 Query: 612 IQSLVANGWIPVRVSAYETRWAGPNCA-EIFVRSEEE-RVDAIVFTSTGEVEGLLKSLRG 439 + L GWIP+R+ AYETRWAG CA ++ +SEEE DAIVFTSTGEVEGLLKSL Sbjct: 189 LDDLSKRGWIPLRLDAYETRWAGATCAVDVVAKSEEECGFDAIVFTSTGEVEGLLKSLEE 248 Query: 438 YGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALALK 274 +G DW MV++R P +VVAAHGP+TAAGAE LGV +DV+ S FGSFDGVV+ALA K Sbjct: 249 FGLDWSMVRRRCPRMVVAAHGPVTAAGAESLGVGIDVVSSNFGSFDGVVDALAHK 303 >gb|EYU25683.1| hypothetical protein MIMGU_mgv1a024294mg, partial [Mimulus guttatus] Length = 299 Score = 320 bits (820), Expect = 8e-85 Identities = 167/289 (57%), Positives = 209/289 (72%), Gaps = 2/289 (0%) Frame = -1 Query: 1143 TSPLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSR 964 ++P + IA+TTP NYA RLS + LKG + +WCPT+ V+TT T SI+ Y +S Sbjct: 5 SAPTTAPVIAFTTPKNYASRLSDVIRLKGWT-PLWCPTLSVDTTPHTTSSIQHYFLS--- 60 Query: 963 PRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNG 784 + P L+HFSA+AFTSR GI+AFSE L I P F + FT+SALG+D+ELL Sbjct: 61 --LDPPLRHFSAVAFTSRTGITAFSEALSAIAAAPPF-GPDGDLFTLSALGKDSELLTES 117 Query: 783 DFVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQS 604 FV+KLC NP R+RVLVPP+ATP+ LV++L LG+GR+VLCPVPLVIG++EPPV+P+F+ Sbjct: 118 -FVAKLCVNPARVRVLVPPIATPSGLVEALGLGLGRKVLCPVPLVIGLKEPPVVPEFLAG 176 Query: 603 LVANGWIPVRVSAYETRWAGPNCAEIFVRSEEER--VDAIVFTSTGEVEGLLKSLRGYGF 430 L GW+PVRV+AYETRW G A + EE VDAIVFTST EVEGLLKSL G Sbjct: 177 LARRGWVPVRVNAYETRWRGGGVAGLVAGMMEEHCGVDAIVFTSTAEVEGLLKSLEELGL 236 Query: 429 DWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEAL 283 DWGMV++ P LV AAHGP+TA GAE+LGV +DV+ SKF SF GVV+AL Sbjct: 237 DWGMVRRMCPRLVAAAHGPVTAVGAEQLGVEIDVVSSKFHSFYGVVDAL 285 >ref|XP_007203754.1| hypothetical protein PRUPE_ppa014878mg [Prunus persica] gi|462399285|gb|EMJ04953.1| hypothetical protein PRUPE_ppa014878mg [Prunus persica] Length = 287 Score = 301 bits (772), Expect = 3e-79 Identities = 162/294 (55%), Positives = 208/294 (70%), Gaps = 1/294 (0%) Frame = -1 Query: 1137 PLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPR 958 P + +A+TTPPNYA RL+H L LKG N I PT++V+ T T ++KPYL S P Sbjct: 4 PTAAPTVAFTTPPNYAARLAHLLALKGF-NPISSPTLIVQPTPSTISALKPYL---SPP- 58 Query: 957 ISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNGDF 778 P L FSAIAF SR I++ S +I P L S + F I+ALG+DAEL+ + +F Sbjct: 59 --PSLDLFSAIAFPSRTAITSLSAAAADISHPLL--SPHGDAFIIAALGKDAELM-DDNF 113 Query: 777 VSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSLV 598 V KLC N NR+R+LVPP ATP+ LV++L G RRVLCPVP+V+G+ EPPV+PDF++ L Sbjct: 114 VHKLCSNTNRVRILVPPTATPSGLVEALGDGRNRRVLCPVPVVVGLVEPPVVPDFLRDLE 173 Query: 597 ANGWIPVRVSAYETRWAGPNCA-EIFVRSEEERVDAIVFTSTGEVEGLLKSLRGYGFDWG 421 A W+PVRV+AYETRWAGP CA ++ R EE +DA+VFTST EVEGLLKS + +G DW Sbjct: 174 AKRWVPVRVNAYETRWAGPGCAKQVVERIEEGALDAMVFTSTAEVEGLLKSFKEFGLDWE 233 Query: 420 MVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALALKCSGVS 259 + KKR P ++VAAHGPITAAGA LGV VD++ S+F SF GVV+AL + S +S Sbjct: 234 IAKKRCPKMLVAAHGPITAAGAHMLGVRVDLVSSQFDSFQGVVDALHTEISRLS 287 >ref|XP_007012013.1| Uncharacterized protein TCM_037120 [Theobroma cacao] gi|508782376|gb|EOY29632.1| Uncharacterized protein TCM_037120 [Theobroma cacao] Length = 301 Score = 298 bits (764), Expect = 2e-78 Identities = 160/296 (54%), Positives = 202/296 (68%), Gaps = 3/296 (1%) Frame = -1 Query: 1161 PVLVKKTSPLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPY 982 P L +S + +TTPPNYA RLS+ L LKG +WCPTI TT T S+ + Sbjct: 6 PNLTPLSSSTVKPTVIFTTPPNYAARLSNLLTLKG-HTPLWCPTI---TTHPTPHSLSTH 61 Query: 981 LISKSRPRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDA 802 L S L SAI F SRA I++FS + + +P L TF ++ALG+D+ Sbjct: 62 LSPHS-------LSLLSAITFPSRASITSFSLAALSLPKPLL--PSHGPTFILAALGKDS 112 Query: 801 ELLLNGDFVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVI 622 EL+ N F+S++C N RI+VLVPP ATP SL SL G GRRVLCPVP V+G+ EPPV+ Sbjct: 113 ELI-NTPFISQICSNLQRIKVLVPPTATPNSLALSLGKGYGRRVLCPVPKVVGLNEPPVV 171 Query: 621 PDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVR---SEEERVDAIVFTSTGEVEGLLK 451 PDF++ L + GW+P+RV AYETRW GP+CAE VR EE V+A+VFTS+GEVEG LK Sbjct: 172 PDFLKDLESGGWVPIRVDAYETRWVGPSCAEEVVRKGEEHEEEVNAVVFTSSGEVEGFLK 231 Query: 450 SLRGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEAL 283 SLR +G+DWGMV++RW LVVAAHGP+TA GA+RLGV+VDV+ S F SF GVV+AL Sbjct: 232 SLREFGWDWGMVRRRWSRLVVAAHGPVTAVGAKRLGVDVDVVSSNFDSFQGVVDAL 287 >ref|XP_002523533.1| conserved hypothetical protein [Ricinus communis] gi|223537240|gb|EEF38872.1| conserved hypothetical protein [Ricinus communis] Length = 295 Score = 297 bits (760), Expect = 7e-78 Identities = 157/287 (54%), Positives = 202/287 (70%) Frame = -1 Query: 1119 IAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPRISPLLQ 940 +A+TTP NYA RLSH L LK + +WCPTI+ + T QT S+ +L S ISP+ Sbjct: 18 VAFTTPQNYASRLSHLLTLKSLT-PLWCPTIITQPTPQTLSSLALHLAPHS---ISPI-- 71 Query: 939 HFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNGDFVSKLCD 760 SAI F SR I+AFS+ + + P L S ++ I ALG+DAEL+ + F+ +C Sbjct: 72 --SAILFPSRTAITAFSKAICSLATPLLHPS--HDAMIIGALGKDAELI-DSAFLLNICS 126 Query: 759 NPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSLVANGWIP 580 + NRIR LVP ATP+ LV SL G GRRVLC VP ++G++EPPV+PDF++ L A GW+P Sbjct: 127 SINRIRALVPQTATPSGLVQSLGAGGGRRVLCLVPKIVGLKEPPVVPDFLRELEAAGWVP 186 Query: 579 VRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEGLLKSLRGYGFDWGMVKKRWP 400 +RV AYETRW GP CAE V+ EE +D +VFTS+ EVEGLLKSL Y +DW MVK+RWP Sbjct: 187 IRVDAYETRWLGPTCAEGIVK--EEGLDGVVFTSSAEVEGLLKSLSEYRWDWKMVKQRWP 244 Query: 399 GLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALALKCSGVS 259 LVVAAHGP+TAAGAERLGV+VDV+ +F SF+GVV+AL + G+S Sbjct: 245 ELVVAAHGPVTAAGAERLGVDVDVVSDRFSSFEGVVDALYSRLQGLS 291 >ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa] gi|222866001|gb|EEF03132.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa] Length = 302 Score = 292 bits (747), Expect = 2e-76 Identities = 156/291 (53%), Positives = 199/291 (68%), Gaps = 2/291 (0%) Frame = -1 Query: 1119 IAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPRISPLLQ 940 +A+TTPPNYA RLSH L LK + +WCPTI E TQQT S+ +L S L Sbjct: 21 VAFTTPPNYATRLSHLLTLKSFT-PLWCPTITTEPTQQTLSSLALHLSPHS-------LS 72 Query: 939 HFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNGDFVSKLC- 763 SAIAF SR I+AFS + + P L +TF I+ALG+D EL+ + F+ C Sbjct: 73 LLSAIAFPSRTAITAFSTAALSLTTPLL--PPREDTFIIAALGKDVELI-DSTFLLTFCG 129 Query: 762 DNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSLVANGWI 583 D+ + + VLVP +ATP+ LV L G GR+VLCPVP V+G+EEPPV+PDF++ L GW+ Sbjct: 130 DDISWVNVLVPTIATPSGLVQLLGTGRGRKVLCPVPRVVGLEEPPVVPDFLRELEGAGWV 189 Query: 582 PVRVSAYETRWAGPNCAEIFVR-SEEERVDAIVFTSTGEVEGLLKSLRGYGFDWGMVKKR 406 P+RV AYETRW GP C + V SE +DA+VFTS+GEVEGLLKSLR +G+DW MV++R Sbjct: 190 PIRVDAYETRWLGPACGKGVVELSEGGLLDAMVFTSSGEVEGLLKSLREFGWDWEMVRRR 249 Query: 405 WPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALALKCSGVSDA 253 WP LVVAAHGP+TAAGAERLGV VDV+ +F SF GVV+A+ K G+ + Sbjct: 250 WPHLVVAAHGPVTAAGAERLGVTVDVVSGRFDSFQGVVDAVEAKLRGLDSS 300 >ref|XP_007138434.1| hypothetical protein PHAVU_009G208600g [Phaseolus vulgaris] gi|561011521|gb|ESW10428.1| hypothetical protein PHAVU_009G208600g [Phaseolus vulgaris] Length = 280 Score = 287 bits (734), Expect = 7e-75 Identities = 152/285 (53%), Positives = 196/285 (68%), Gaps = 1/285 (0%) Frame = -1 Query: 1134 LSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPRI 955 L + +A+TTPPNYA RLS+ L L + +WCPT+L++ T + P+L S Sbjct: 3 LHNPTVAFTTPPNYAARLSNLLSLSAYT-PLWCPTLLIQPLPST---LAPFLSPHS---- 54 Query: 954 SPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNGDFV 775 L FSAIAFTSR I AF + + PPL TFT++ALG+DA+L+ + F+ Sbjct: 55 ---LHRFSAIAFTSRTAIQAFLQAATSLSHPPL--PPEGSTFTLAALGKDADLI-DAQFL 108 Query: 774 SKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSLVA 595 S C N NR+ VLVPP ATP++L +L G GR VLCPVP VIG+ EPPV+P F++ L Sbjct: 109 SAFCSNSNRLCVLVPPTATPSALAAALGDGCGRGVLCPVPRVIGVNEPPVVPGFLEELRR 168 Query: 594 NGWIPVRVSAYETRWAGPNCAEIFVR-SEEERVDAIVFTSTGEVEGLLKSLRGYGFDWGM 418 W+PVRV AYETRWAGP CAE VR SEE +DA+VFTST EVEGLL+SL+ +G + Sbjct: 169 GRWVPVRVEAYETRWAGPGCAEGIVRASEEGGLDAVVFTSTAEVEGLLQSLKDFGLGFAD 228 Query: 417 VKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEAL 283 +++R P LVVAAHGP+TAAGA+RLGV VDV+ S+FGSFDGV++ L Sbjct: 229 LRRRCPRLVVAAHGPVTAAGAQRLGVEVDVVSSRFGSFDGVIDVL 273 >gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea] Length = 299 Score = 283 bits (723), Expect = 1e-73 Identities = 156/293 (53%), Positives = 201/293 (68%), Gaps = 5/293 (1%) Frame = -1 Query: 1137 PLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPR 958 P + IA+TTP NYA +LS +++KG + +WCPTI VE+T T +++ Y+ Sbjct: 12 PAKARLIAFTTPENYAGKLSRLIQVKGWT-PLWCPTIAVESTASTVGALRRYVQPPD--- 67 Query: 957 ISPLLQHFSAIAFTSRAGISAFSEILVEIV-QPPLFQSQTNETFTISALGRDAELLLNGD 781 P+L+ F+A+AFTSR GI+AF+E + PPL T E FTISALG+DAELL + Sbjct: 68 --PILREFAAVAFTSRTGITAFAEAIHSSGGSPPL--DPTGEIFTISALGKDAELL-DDS 122 Query: 780 FVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRR-VLCPVPLVIGIEEPPVIPDFIQS 604 F+ LC+N RIRVLVP VATP++L ++L G GRR VLCPVP+VIG+EEPPV+P F+ Sbjct: 123 FIKSLCENAARIRVLVPAVATPSALAEALGSGEGRRKVLCPVPVVIGLEEPPVVPKFLTD 182 Query: 603 LVANGWIPVRVSAYETRWAGPNCA---EIFVRSEEERVDAIVFTSTGEVEGLLKSLRGYG 433 L GWIPVRV AYETR + E E +VDAIVFTST EVEGLLKSL+ G Sbjct: 183 LHRRGWIPVRVDAYETRRSHNGTGKLVEAMAAGAECKVDAIVFTSTAEVEGLLKSLQEIG 242 Query: 432 FDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALALK 274 DW +++ PG+V AA GP+TAAGAE+LGV +DV+ S+F SFDGVV+AL K Sbjct: 243 LDWETIRRTCPGMVAAAQGPVTAAGAEQLGVGIDVVSSRFDSFDGVVDALEYK 295 >ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda] gi|548853455|gb|ERN11438.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda] Length = 308 Score = 269 bits (687), Expect = 2e-69 Identities = 146/291 (50%), Positives = 198/291 (68%), Gaps = 3/291 (1%) Frame = -1 Query: 1143 TSPLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSR 964 +SPLSH+ + YTTP +YA L L ++ +W PTI V +T TK I+ +L Sbjct: 22 SSPLSHRHVVYTTPAHYAPSLERRLRAHQ-AHPLWLPTISVLSTPHTKTLIRNHLQKT-- 78 Query: 963 PRISPLLQHFSAIAFTSRAGISAFSEILVEIVQ---PPLFQSQTNETFTISALGRDAELL 793 L+ SAIAFTSRA I++FSE L EI+ PPL S E F + ALGRD+ELL Sbjct: 79 -----LINQSSAIAFTSRAAINSFSEALSEILTLNGPPL--SGEGEPFYLCALGRDSELL 131 Query: 792 LNGDFVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDF 613 + FV LC+N +R+RV VP V TP ++ + L G+ R +LC VPLV G++EP V+PDF Sbjct: 132 -DQRFVLSLCENLDRVRVFVPSVPTPKAMAEELGDGLNREILCLVPLVTGLDEPSVVPDF 190 Query: 612 IQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEGLLKSLRGYG 433 + +L W P+R+++YETRWAG +CAE + +E DAIVFTST EV+GL+K L+ G Sbjct: 191 LGALKDQNWRPIRLNSYETRWAGLDCAEFLI--SDEASDAIVFTSTAEVQGLIKGLKKLG 248 Query: 432 FDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALA 280 F+W MV+++ PGLVVAAHGP+TA GA++LGV++D++ S+F SFDGVV ALA Sbjct: 249 FEWVMVREKRPGLVVAAHGPVTALGAKKLGVDIDLVSSRFDSFDGVVNALA 299 >ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779932 [Setaria italica] Length = 299 Score = 255 bits (651), Expect = 3e-65 Identities = 145/300 (48%), Positives = 191/300 (63%), Gaps = 11/300 (3%) Frame = -1 Query: 1146 KTSPLSHKRIAYTTPP----NYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYL 979 +T PL+ +R+A+TTP +Y RL L +G + + PTI V+ D ++P+L Sbjct: 7 ETLPLAGRRVAFTTPQTGGASYGGRLGALLRQRG-ARPVPVPTIAVQP--HDPDRLRPFL 63 Query: 978 ISKSRPRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPL------FQSQTNETFTISA 817 + + L F+A+AFTSR+GISAF+ L PP + FT++A Sbjct: 64 LPGA-------LDPFAALAFTSRSGISAFARAL-----PPSSSHHRPLSDASALPFTVAA 111 Query: 816 LGRDAELLLNGDFVSKLC-DNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGI 640 LG DA+LL + F+S+LC D R+ VLVP V TP LV++L G GRRVLCPVP V+G+ Sbjct: 112 LGSDADLL-DRAFLSRLCGDAGTRVAVLVPAVPTPAGLVEALGPGSGRRVLCPVPDVVGL 170 Query: 639 EEPPVIPDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEG 460 EPPV+PDF+ L A GW+ VR AY T WAGP CAE V ++ DA+VFTST EVEG Sbjct: 171 REPPVVPDFLAGLEAAGWVAVRAPAYTTSWAGPGCAEALVGADAAAPDAVVFTSTAEVEG 230 Query: 459 LLKSLRGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALA 280 LLK L G+ W ++ RWPG+VVAAHGP+TAAGA LGV VDV+ ++F SF GVV+ALA Sbjct: 231 LLKGLDAAGWTWARLRARWPGMVVAAHGPVTAAGARSLGVEVDVVSARFSSFHGVVDALA 290 >ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353|gb|ACG46144.1| hypothetical protein [Zea mays] gi|414589847|tpg|DAA40418.1| TPA: hypothetical protein ZEAMMB73_114348 [Zea mays] Length = 297 Score = 255 bits (651), Expect = 3e-65 Identities = 143/295 (48%), Positives = 187/295 (63%), Gaps = 6/295 (2%) Frame = -1 Query: 1146 KTSPLSHKRIAYTTPPN-----YADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPY 982 +T L+ +R+A+TTP Y RL L +G ++ + PTI V D ++PY Sbjct: 7 ETPSLAGRRVAFTTPQTGGGGAYGGRLGALLRQRG-AHPVAVPTIAVHP--HDPDRLRPY 63 Query: 981 LISKSRPRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDA 802 L+ + L F+A+AFTSR+GISAF+ L +P S FT++ALG DA Sbjct: 64 LLPSA-------LDPFAALAFTSRSGISAFARALSSSHRPLSHASAL--PFTVAALGSDA 114 Query: 801 ELLLNGDFVSKLC-DNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPV 625 +LL + F+S+LC D R+ VLVP V TP LV++L G GRRVLCPVP V+G+ EPPV Sbjct: 115 DLLDHA-FLSRLCGDAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPV 173 Query: 624 IPDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEGLLKSL 445 +PDF+ L A GW+ VR AY T WAGP CAE V + +DA+VFTST EVEGLLK L Sbjct: 174 VPDFLAGLEAAGWVAVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKGL 233 Query: 444 RGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALA 280 G+ W + RWPG+VVAAHGP+TA GA LGV VD++ ++F SF GVV+ALA Sbjct: 234 EAVGWTWARLAARWPGMVVAAHGPVTAGGARSLGVEVDIVSTRFSSFHGVVDALA 288 >ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor] gi|241925970|gb|EER99114.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor] Length = 299 Score = 246 bits (627), Expect = 2e-62 Identities = 141/296 (47%), Positives = 185/296 (62%), Gaps = 7/296 (2%) Frame = -1 Query: 1146 KTSPLSHKRIAYTTPPN-----YADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPY 982 +T L+ +R+A+TTP Y RL L +G ++ + PTI V D ++P+ Sbjct: 7 ETPSLTGRRVAFTTPQTGGGGAYGGRLGALLRQRG-AHPVPVPTIAVHP--HDPDRLRPF 63 Query: 981 LISKSRPRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDA 802 L+ + L F+A+AFTSR+GISAF+ L PL + FT++ALG DA Sbjct: 64 LLPGA-------LDPFAALAFTSRSGISAFARALSSSSHHPLADASALP-FTVAALGSDA 115 Query: 801 ELLLNGDFVSKLCDNP--NRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPP 628 +LL + F+S+LC R+ VLVP V TP LV++L G GRRVLCPVP V+G+ EPP Sbjct: 116 DLLDHA-FLSRLCGAAAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPP 174 Query: 627 VIPDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEGLLKS 448 V+PDF+ L A GW+ VR AY T WAGP CAE V + +DA+VFTST EVEGLLK Sbjct: 175 VVPDFLAGLEAAGWVAVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKR 234 Query: 447 LRGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALA 280 L G+ W + R PG+VVAAHGP+TA GA LGV VDV+ ++F SF GVV+ALA Sbjct: 235 LESAGWTWARLTARCPGMVVAAHGPVTAGGARSLGVEVDVVSARFSSFHGVVDALA 290 >gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indica Group] Length = 301 Score = 241 bits (615), Expect = 5e-61 Identities = 140/306 (45%), Positives = 186/306 (60%), Gaps = 15/306 (4%) Frame = -1 Query: 1146 KTSPLSHKRIAYTTPPN------YADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKP 985 +T L+ +R+A+TTP Y RL L +G + + PTI + D ++P Sbjct: 5 ETLSLAGRRVAFTTPQTDAGGGGYGGRLHAILRQRG-ARPVPVPTIAIRA--HDPDILRP 61 Query: 984 YLISKSRPRISPLLQHFSAIAFTSRAGISAFSEILVEIVQP--------PLFQSQTNETF 829 ++ L F+A+AFTSR+GISAFS L+ P+ + T F Sbjct: 62 FVAPGG-------LDAFAALAFTSRSGISAFSRALLPSSSSSPARRPRHPVSDAATALPF 114 Query: 828 TISALGRDAELLLNGDFVSKLC-DNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPL 652 T++ALG DA+LL + F+S+LC D R+ VLVP V TP LV++L G GRRVLCPVP Sbjct: 115 TVAALGSDADLL-DAAFLSRLCGDAGGRVSVLVPDVPTPAGLVEALGSGSGRRVLCPVPD 173 Query: 651 VIGIEEPPVIPDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTG 472 V+G+ EPPV+P F+ L A GW+ VR AY T WAGP CAE V + DA+VFTST Sbjct: 174 VVGLREPPVVPGFLSGLEAAGWVAVRAPAYVTCWAGPRCAEALV--DAAAPDAVVFTSTA 231 Query: 471 EVEGLLKSLRGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVV 292 EVEGLLK L G+ W ++ RWP +VVAAHGP+TA G RLG+ VDV+G++F SF GV+ Sbjct: 232 EVEGLLKGLDAAGWSWPRLRARWPRMVVAAHGPVTADGVRRLGIEVDVVGARFSSFHGVL 291 Query: 291 EALALK 274 +ALA K Sbjct: 292 DALAAK 297 >ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Selaginella moellendorffii] gi|300151328|gb|EFJ17974.1| hypothetical protein SELMODRAFT_420371 [Selaginella moellendorffii] Length = 231 Score = 151 bits (381), Expect = 6e-34 Identities = 96/236 (40%), Positives = 136/236 (57%), Gaps = 1/236 (0%) Frame = -1 Query: 987 PYLISKSRPRISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGR 808 P+ S R +S L +S IAFTSR+GI++ + L E+ + + ALG+ Sbjct: 2 PHTQSSVRRAVSAL-HTYSCIAFTSRSGIASIAHALEEV------RLSGCAELVVGALGK 54 Query: 807 DAELLLNGDFVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVI-GIEEP 631 DAEL+ D + + R+ V+VP VATP +LV+ L G GRR+LCPVP V G+ EP Sbjct: 55 DAELIQELDLFKEHREQ-QRLTVVVPLVATPDALVEELGDGAGRRLLCPVPYVCGGLSEP 113 Query: 630 PVIPDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEGLLK 451 V+P+F+ +L +GW R+ AY T W G + VDA+VFTST EVEGLL Sbjct: 114 DVVPNFVAALQRHGWDVERLDAYATSWTGSASVTPLLAG---AVDALVFTSTAEVEGLLM 170 Query: 450 SLRGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEAL 283 +L+ + + WP V+ A GP+TA GA++LGV+VDVIG +F F + + L Sbjct: 171 ALQAHHL---TLASLWP-CVLVAFGPVTARGAKQLGVDVDVIGHRFNGFTDLADLL 222 >ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Populus trichocarpa] gi|550336711|gb|ERP59695.1| hypothetical protein POPTR_0006s20320g [Populus trichocarpa] Length = 150 Score = 147 bits (370), Expect = 1e-32 Identities = 77/158 (48%), Positives = 103/158 (65%) Frame = -1 Query: 753 NRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSLVANGWIPVR 574 +R++VLVP + T V L G R+VLCPVP V+G+EEPPV+PDF++ L A Sbjct: 12 SRVKVLVPTITTRNG-VHLLGTGRCRKVLCPVPRVVGLEEPPVVPDFLRELEA------- 63 Query: 573 VSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGEVEGLLKSLRGYGFDWGMVKKRWPGL 394 + RS+E +DA+VF S+GEVEGLLKSL+ G++W M+++RWP L Sbjct: 64 --------------AVVERSDEGLLDAMVFASSGEVEGLLKSLKELGWEWEMMRRRWPNL 109 Query: 393 VVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALA 280 VV AHGP+TAAGAE LGVNV+V+ +F SF G V L+ Sbjct: 110 VVVAHGPVTAAGAESLGVNVNVVSERFDSFQGTVWMLS 147 >ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii] gi|300170521|gb|EFJ37122.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii] Length = 262 Score = 138 bits (348), Expect = 4e-30 Identities = 80/182 (43%), Positives = 110/182 (60%), Gaps = 1/182 (0%) Frame = -1 Query: 825 ISALGRDAELLLNGDFVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVI 646 + ALG+DAEL+ D + + R+ V+VP VATP +LV+ L G GRR+LCPVP Sbjct: 80 VGALGKDAELIQELDLFKEHREQ-QRLTVVVPRVATPDALVEELGDGAGRRLLCPVPYAC 138 Query: 645 G-IEEPPVIPDFIQSLVANGWIPVRVSAYETRWAGPNCAEIFVRSEEERVDAIVFTSTGE 469 G + EP V+P+F+ +L +GW R+ AY T W G + VDA+VFTST E Sbjct: 139 GGLSEPDVVPNFVAALQRHGWDVERLDAYATSWTGSASVTPLLAGA---VDALVFTSTAE 195 Query: 468 VEGLLKSLRGYGFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVE 289 VEGLL +L + + WP V+ A GP+TA GA+RLGV+VDV+G +F SF + + Sbjct: 196 VEGLLMALHAHHLT---IASLWP-CVLVAFGPVTARGAKRLGVDVDVVGHRFNSFTDLAD 251 Query: 288 AL 283 L Sbjct: 252 LL 253 >gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis] Length = 183 Score = 137 bits (345), Expect = 9e-30 Identities = 83/192 (43%), Positives = 116/192 (60%) Frame = -1 Query: 1131 SHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPRIS 952 S+ +A+TTPPNYA RLSH L G N + PT+LVE T +T ++K YL P Sbjct: 13 SNPTVAFTTPPNYAGRLSHLLAANGL-NPLSSPTLLVEPTPRTISALKSYL-----PPPH 66 Query: 951 PLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNGDFVS 772 L FSA+A ++ PL + FTI+ALG+D+ELL + ++++ Sbjct: 67 SLNALFSAVASD---------------LECPLLSPFGDREFTIAALGKDSELLYD-EYLT 110 Query: 771 KLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSLVAN 592 K N +RIRVLVP VA P+ LV SL G +RVLC VP+++ +EEPPV+P+F++ L ++ Sbjct: 111 KFGKNRDRIRVLVPLVAMPSGLVRSLRDGRRQRVLCTVPIIVDLEEPPVVPNFLRELESS 170 Query: 591 GWIPVRVSAYET 556 WIPV V YET Sbjct: 171 RWIPVLVGTYET 182 >ref|NP_484331.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7120] gi|499303689|ref|WP_010994464.1| uroporphyrinogen III synthase [Nostoc sp. PCC 7120] gi|17135265|dbj|BAB77811.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7120] Length = 276 Score = 120 bits (301), Expect = 1e-24 Identities = 103/291 (35%), Positives = 146/291 (50%), Gaps = 6/291 (2%) Frame = -1 Query: 1137 PLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPR 958 PL KRI T P NYA RLS + KG I PTI ET + S +IS Sbjct: 12 PLYGKRILVTAPRNYASRLSAQIICKG-GLPILMPTI--ETCYLSNFSKLDAVISS---- 64 Query: 957 ISPLLQHFSAIAFTSRAGISAFSEIL--VEIVQPPLFQSQTNETFTISALGRDAELLLNG 784 + F IAFTSR GI AF E L ++I L Q + ALG+D ++LL+ Sbjct: 65 ----INEFDWIAFTSRNGIIAFFERLHNLDISITKLQNCQ------LCALGKDIDILLS- 113 Query: 783 DFVSKLCDNPNRIRVLVPPVATPTSLVD--SLVLGMG-RRVLCPVPLVIGIEEPPVIPDF 613 K+ L+P ++P +V S + G+ +++L PVP VIGI EP ++P+F Sbjct: 114 -LFGKVD--------LIPDESSPAGIVAEFSQICGIREQKILVPVPEVIGIPEPNIVPNF 164 Query: 612 IQSLVANGWIPVRVSAYETRWAGPNCAEIFVRS-EEERVDAIVFTSTGEVEGLLKSLRGY 436 I+ L G +RV AY T+ + + + ++ +D I F+ST E+E L Sbjct: 165 IKDLEELGMQVIRVPAYITQSLDKDIYSVEINLIQQGLIDIIAFSSTAEIESFLAMFNS- 223 Query: 435 GFDWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEAL 283 K + VVA GP TAA AE+LG+NV ++ + F SF+G VEA+ Sbjct: 224 -------KSEFQHCVVACFGPYTAANAEQLGLNVSIVSTDFSSFEGFVEAI 267 >ref|YP_323579.1| uroporphyrinogen-III synthase [Anabaena variabilis ATCC 29413] gi|499639080|ref|WP_011319814.1| uroporphyrinogen III synthase [Anabaena variabilis] gi|75703008|gb|ABA22684.1| uroporphyrinogen-III synthase [Anabaena variabilis ATCC 29413] Length = 276 Score = 119 bits (297), Expect = 3e-24 Identities = 100/289 (34%), Positives = 142/289 (49%), Gaps = 4/289 (1%) Frame = -1 Query: 1137 PLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPR 958 PL KRI T P NYA RLS + KG I PTI ET S +IS Sbjct: 12 PLYGKRILVTAPRNYASRLSAQIICKG-GLPILMPTI--ETCYLPNFSQLDAVIS----- 63 Query: 957 ISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLNGDF 778 + F IAFTSR GI AF E L + + + ALG+D ++LL+ Sbjct: 64 ---CINEFDWIAFTSRNGIIAFFERLHNLD----ISINKLQNCQLCALGKDIDVLLS--- 113 Query: 777 VSKLCDNPNRIRVLVPPVATPTSLVD--SLVLGMGR-RVLCPVPLVIGIEEPPVIPDFIQ 607 L + L+P ++P +V S + G+ R ++L PVP VIGI EP ++P+FI+ Sbjct: 114 ---LFGRVD----LIPDESSPAGIVAKFSQIHGISRQKILVPVPEVIGIPEPNIVPNFIK 166 Query: 606 SLVANGWIPVRVSAYETRWAGPNCAEIFVRS-EEERVDAIVFTSTGEVEGLLKSLRGYGF 430 L G +RV Y T+ N + + ++ +D I F+ST E+E LK Sbjct: 167 DLEKLGMQVIRVPTYITQSLDKNIYSVEINLIQQGLIDVIAFSSTAEIESFLKMFNS--- 223 Query: 429 DWGMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEAL 283 K + VVA GP TAA A++LG++V ++ + F SF+G VEA+ Sbjct: 224 -----KNEFQHCVVACFGPYTAANAQKLGLDVSLVSTDFSSFEGFVEAI 267 >ref|WP_016872683.1| hypothetical protein [Chlorogloeopsis fritschii] Length = 313 Score = 112 bits (280), Expect = 3e-22 Identities = 95/288 (32%), Positives = 131/288 (45%), Gaps = 2/288 (0%) Frame = -1 Query: 1137 PLSHKRIAYTTPPNYADRLSHALELKGCSNTIWCPTILVETTQQTKDSIKPYLISKSRPR 958 PL KRI T P NYA RLS L +G I PTI ET + + K Sbjct: 35 PLHSKRILVTAPRNYAARLSEQLINQGAL-PILMPTI--ETCVLENFAQLDIALQK---- 87 Query: 957 ISPLLQHFSAIAFTSRAGISAFSEILVEIVQPPLFQSQTNETFTISALGRDAELLLN-GD 781 + F IAFTSR GI AF + L + + + +SA+G+DAE L G Sbjct: 88 ----IDTFDWIAFTSRNGIDAFFQRLESLG----LNHRVLKNCRLSAIGKDAERLAAFGV 139 Query: 780 FVSKLCDNPNRIRVLVPPVATPTSLVDSLVLGMGRRVLCPVPLVIGIEEPPVIPDFIQSL 601 V + P+ ++ P G+++L PVP V+G+ EP V+P+F+ L Sbjct: 140 EVDLIPQQPSPQGIIAELAQIPNI--------QGKKILVPVPEVVGVPEPDVVPNFVAGL 191 Query: 600 VANGWIPVRVSAYETRWAGPNCAEIFVRS-EEERVDAIVFTSTGEVEGLLKSLRGYGFDW 424 G RV Y TR + E+ + + +VD I F+ST EV L+ Sbjct: 192 KNLGMSVTRVPTYLTRCLDKSFYEVELNLIRQGKVDVIAFSSTAEVASFLQMFTA----- 246 Query: 423 GMVKKRWPGLVVAAHGPITAAGAERLGVNVDVIGSKFGSFDGVVEALA 280 K + V+A GP TAA A +LGVNV +I + SF G EA+A Sbjct: 247 ---KADYQQCVIACFGPYTAANANKLGVNVSIIAQDYSSFAGFAEAIA 291