BLASTX nr result
ID: Mentha27_contig00034898
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00034898
         (875 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EYU25683.1| hypothetical protein MIMGU_mgv1a024294mg, partial...  355   1e-95
gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea]      337   4e-90
ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267...  330   6e-88
ref|XP_007203754.1| hypothetical protein PRUPE_ppa014878mg [Prun...  278   2e-72
ref|XP_007138434.1| hypothetical protein PHAVU_009G208600g [Phas...  269   9e-70
ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Popu...  249   1e-63
ref|XP_007012013.1| Uncharacterized protein TCM_037120 [Theobrom...  248   2e-63
ref|XP_002523533.1| conserved hypothetical protein [Ricinus comm...  248   2e-63
ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [A...  214   5e-53
ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [S...  212   2e-52
ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779...  207   3e-51
ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353...  206   9e-51
gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indi...  199   9e-49
gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis]    152   2e-34
ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Sela...  125   3e-26
ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Sela...  112   2e-22
ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Popu...  107   8e-21
ref|XP_005644693.1| tetrapyrrole biosynthesis, uroporphyrinogen ...  103   1e-19
ref|XP_006386962.1| hypothetical protein POPTR_2609s00200g, part...
101 4e-19 ref|YP_001865256.1| uroporphyrinogen III synthase HEM4 [Nostoc p... 99 3e-18 >gb|EYU25683.1| hypothetical protein MIMGU_mgv1a024294mg, partial [Mimulus guttatus] Length = 299 Score = 355 bits (912), Expect = 1e-95 Identities = 178/246 (72%), Positives = 204/246 (82%), Gaps = 3/246 (1%) Frame = +1 Query: 145 LVAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAV 324 ++AFTTP NYA RLS +I L GWTPLWCPT++V+ TP T S++QH+ L PPLRHFSAV Sbjct: 12 VIAFTTPKNYASRLSDVIRLKGWTPLWCPTLSVDTTPHTTSSIQHYFLSLDPPLRHFSAV 71 Query: 325 AFTSRTGISAFSEALAGIST-PPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIRV 501 AFTSRTGI+AFSEAL+ I+ PP G++FT+SALGKDSELL ESF+ +LC PAR+RV Sbjct: 72 AFTSRTGITAFSEALSAIAAAPPFGPDGDLFTLSALGKDSELLTESFVAKLCVNPARVRV 131 Query: 502 LVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYE 681 LVPP+ATPS LVEALGLGL RKVLCP+P VIGL+EPPVVP+ L L RR W VRV+AYE Sbjct: 132 LVPPIATPSGLVEALGLGLGRKVLCPVPLVIGLKEPPVVPEFLAGLARRGWVPVRVNAYE 191 Query: 682 TRWR-NGVAELVE-RIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVAA 855 TRWR GVA LV +EE CGVDAIVFTSTAEVEGLLKSL E+GLDWGMVR MCPR+VAA Sbjct: 192 TRWRGGGVAGLVAGMMEEHCGVDAIVFTSTAEVEGLLKSLEELGLDWGMVRRMCPRLVAA 251 Query: 856 AHGPVT 873 AHGPVT Sbjct: 252 AHGPVT 257 >gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea] Length = 299 Score = 337 bits (864), Expect = 4e-90 Identities = 180/270 (66%), Positives = 204/270 (75%), Gaps = 5/270 (1%) Frame = +1 Query: 79 MVILNFQQYXXXXXXXXXXXXRLVAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPA 258 M++LN QQY RL+AFTTP NYAG+LSRLI + GWTPLWCPT+ VE T + Sbjct: 1 MILLNIQQYPPPAKA------RLIAFTTPENYAGKLSRLIQVKGWTPLWCPTIAVESTAS 54 Query: 259 TISALQHFLLPPYPPLRHFSAVAFTSRTGISAFSEAL-AGISTPPLPSIGEIFTISALGK 435 T+ AL+ ++ PP P LR F+AVAFTSRTGI+AF+EA+ + +PPL GEIFTISALGK Sbjct: 55 TVGALRRYVQPPDPILREFAAVAFTSRTGITAFAEAIHSSGGSPPLDPTGEIFTISALGK 114 Query: 436 DSELLDESFIGRLCETPARIRVLVPPVATPSALVEALGLGL-ERKVLCPIPAVIGLEEPP 612 D+ELLD+SFI LCE ARIRVLVP VATPSAL EALG G RKVLCP+P VIGLEEPP Sbjct: 115 
DAELLDDSFIKSLCENAARIRVLVPAVATPSALAEALGSGEGRRKVLCPVPVVIGLEEPP 174 Query: 613 VVPKLLTDLERRSWAAVRVDAYET-RWRNGVAELVERIE--EECGVDAIVFTSTAEVEGL 783 VVPK LTDL RR W VRVDAYET R NG +LVE + EC VDAIVFTSTAEVEGL Sbjct: 175 VVPKFLTDLHRRGWIPVRVDAYETRRSHNGTGKLVEAMAAGAECKVDAIVFTSTAEVEGL 234 Query: 784 LKSLAEVGLDWGMVRGMCPRMVAAAHGPVT 873 LKSL E+GLDW +R CP MVAAA GPVT Sbjct: 235 LKSLQEIGLDWETIRRTCPGMVAAAQGPVT 264 >ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267674 [Solanum lycopersicum] Length = 312 Score = 330 bits (845), Expect = 6e-88 Identities = 168/251 (66%), Positives = 188/251 (74%), Gaps = 8/251 (3%) Frame = +1 Query: 145 LVAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLP------PYPPL 306 ++AFTTP NYA RLS LIHL GWTPLWCPTV VE T TIS++ H+L P P L Sbjct: 22 VIAFTTPQNYAPRLSELIHLKGWTPLWCPTVIVESTEQTISSIHHYLNPQAGIDEPNSFL 81 Query: 307 RHFSAVAFTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCETP 486 FSA+AFTSRTGI+AFS+AL+ TPPL GEI TI+ALG D+ELLD FI ++CE P Sbjct: 82 EEFSALAFTSRTGITAFSQALSMNPTPPLTPNGEILTIAALGNDAELLDRDFIRKMCENP 141 Query: 487 ARIRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVR 666 RIRVLVP VATPS LVEALGLG RKVLCP+P VIGL EPPVVPK L DL +R W +R Sbjct: 142 ERIRVLVPSVATPSGLVEALGLGQGRKVLCPVPLVIGLNEPPVVPKFLDDLSKRGWIPLR 201 Query: 667 VDAYETRWRNG--VAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCP 840 +DAYETRW ++V + EEECG DAIVFTST EVEGLLKSL E GLDW MVR CP Sbjct: 202 LDAYETRWAGATCAVDVVAKSEEECGFDAIVFTSTGEVEGLLKSLEEFGLDWSMVRRRCP 261 Query: 841 RMVAAAHGPVT 873 RMV AAHGPVT Sbjct: 262 RMVVAAHGPVT 272 >ref|XP_007203754.1| hypothetical protein PRUPE_ppa014878mg [Prunus persica] gi|462399285|gb|EMJ04953.1| hypothetical protein PRUPE_ppa014878mg [Prunus persica] Length = 287 Score = 278 bits (712), Expect = 2e-72 Identities = 145/244 (59%), Positives = 181/244 (74%), Gaps = 2/244 (0%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVA 327 VAFTTP NYA RL+ L+ L G+ P+ PT+ V+PTP+TISAL+ +L PP P L FSA+A Sbjct: 10 
VAFTTPPNYAARLAHLLALKGFNPISSPTLIVQPTPSTISALKPYLSPP-PSLDLFSAIA 68 Query: 328 FTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIRVLV 507 F SRT I++ S A A IS P L G+ F I+ALGKD+EL+D++F+ +LC R+R+LV Sbjct: 69 FPSRTAITSLSAAAADISHPLLSPHGDAFIIAALGKDAELMDDNFVHKLCSNTNRVRILV 128 Query: 508 PPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYETR 687 PP ATPS LVEALG G R+VLCP+P V+GL EPPVVP L DLE + W VRV+AYETR Sbjct: 129 PPTATPSGLVEALGDGRNRRVLCPVPVVVGLVEPPVVPDFLRDLEAKRWVPVRVNAYETR 188 Query: 688 WRN-GVA-ELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVAAAH 861 W G A ++VERIEE +DA+VFTSTAEVEGLLKS E GLDW + + CP+M+ AAH Sbjct: 189 WAGPGCAKQVVERIEEG-ALDAMVFTSTAEVEGLLKSFKEFGLDWEIAKKRCPKMLVAAH 247 Query: 862 GPVT 873 GP+T Sbjct: 248 GPIT 251 >ref|XP_007138434.1| hypothetical protein PHAVU_009G208600g [Phaseolus vulgaris] gi|561011521|gb|ESW10428.1| hypothetical protein PHAVU_009G208600g [Phaseolus vulgaris] Length = 280 Score = 269 bits (688), Expect = 9e-70 Identities = 139/243 (57%), Positives = 170/243 (69%), Gaps = 1/243 (0%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVA 327 VAFTTP NYA RLS L+ L+ +TPLWCPT+ ++P P+T++ FL P L FSA+A Sbjct: 8 VAFTTPPNYAARLSNLLSLSAYTPLWCPTLLIQPLPSTLAP---FLSPH--SLHRFSAIA 62 Query: 328 FTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIRVLV 507 FTSRT I AF +A +S PPLP G FT++ALGKD++L+D F+ C R+ VLV Sbjct: 63 FTSRTAIQAFLQAATSLSHPPLPPEGSTFTLAALGKDADLIDAQFLSAFCSNSNRLCVLV 122 Query: 508 PPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYETR 687 PP ATPSAL ALG G R VLCP+P VIG+ EPPVVP L +L R W VRV+AYETR Sbjct: 123 PPTATPSALAAALGDGCGRGVLCPVPRVIGVNEPPVVPGFLEELRRGRWVPVRVEAYETR 182 Query: 688 WRN-GVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVAAAHG 864 W G AE + R EE G+DA+VFTSTAEVEGLL+SL + GL + +R CPR+V AAHG Sbjct: 183 WAGPGCAEGIVRASEEGGLDAVVFTSTAEVEGLLQSLKDFGLGFADLRRRCPRLVVAAHG 242 Query: 865 PVT 873 PVT Sbjct: 243 PVT 245 >ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa] 
gi|222866001|gb|EEF03132.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa] Length = 302 Score = 249 bits (635), Expect = 1e-63 Identities = 135/244 (55%), Positives = 164/244 (67%), Gaps = 2/244 (0%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVA 327 VAFTTP NYA RLS L+ L +TPLWCPT+T EPT T+S+L L P L SA+A Sbjct: 21 VAFTTPPNYATRLSHLLTLKSFTPLWCPTITTEPTQQTLSSLALHLSPH--SLSLLSAIA 78 Query: 328 FTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLC-ETPARIRVL 504 F SRT I+AFS A ++TP LP + F I+ALGKD EL+D +F+ C + + + VL Sbjct: 79 FPSRTAITAFSTAALSLTTPLLPPREDTFIIAALGKDVELIDSTFLLTFCGDDISWVNVL 138 Query: 505 VPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYET 684 VP +ATPS LV+ LG G RKVLCP+P V+GLEEPPVVP L +LE W +RVDAYET Sbjct: 139 VPTIATPSGLVQLLGTGRGRKVLCPVPRVVGLEEPPVVPDFLRELEGAGWVPIRVDAYET 198 Query: 685 RWRN-GVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVAAAH 861 RW + V + E +DA+VFTS+ EVEGLLKSL E G DW MVR P +V AAH Sbjct: 199 RWLGPACGKGVVELSEGGLLDAMVFTSSGEVEGLLKSLREFGWDWEMVRRRWPHLVVAAH 258 Query: 862 GPVT 873 GPVT Sbjct: 259 GPVT 262 >ref|XP_007012013.1| Uncharacterized protein TCM_037120 [Theobroma cacao] gi|508782376|gb|EOY29632.1| Uncharacterized protein TCM_037120 [Theobroma cacao] Length = 301 Score = 248 bits (633), Expect = 2e-63 Identities = 134/245 (54%), Positives = 164/245 (66%), Gaps = 3/245 (1%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVA 327 V FTTP NYA RLS L+ L G TPLWCPT+T PTP ++S L P+ L SA+ Sbjct: 20 VIFTTPPNYAARLSNLLTLKGHTPLWCPTITTHPTPHSLSTH----LSPHS-LSLLSAIT 74 Query: 328 FTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIRVLV 507 F SR I++FS A + P LPS G F ++ALGKDSEL++ FI ++C RI+VLV Sbjct: 75 FPSRASITSFSLAALSLPKPLLPSHGPTFILAALGKDSELINTPFISQICSNLQRIKVLV 134 Query: 508 PPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYETR 687 PP ATP++L +LG G R+VLCP+P V+GL EPPVVP L DLE W +RVDAYETR Sbjct: 135 PPTATPNSLALSLGKGYGRRVLCPVPKVVGLNEPPVVPDFLKDLESGGWVPIRVDAYETR 194 Query: 
688 W--RNGVAELVERIEE-ECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVAAA 858 W + E+V + EE E V+A+VFTS+ EVEG LKSL E G DWGMVR R+V AA Sbjct: 195 WVGPSCAEEVVRKGEEHEEEVNAVVFTSSGEVEGFLKSLREFGWDWGMVRRRWSRLVVAA 254 Query: 859 HGPVT 873 HGPVT Sbjct: 255 HGPVT 259 >ref|XP_002523533.1| conserved hypothetical protein [Ricinus communis] gi|223537240|gb|EEF38872.1| conserved hypothetical protein [Ricinus communis] Length = 295 Score = 248 bits (633), Expect = 2e-63 Identities = 128/242 (52%), Positives = 163/242 (67%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVA 327 VAFTTP NYA RLS L+ L TPLWCPT+ +PTP T+S+L L P + SA+ Sbjct: 18 VAFTTPQNYASRLSHLLTLKSLTPLWCPTIITQPTPQTLSSLALHLAPH--SISPISAIL 75 Query: 328 FTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIRVLV 507 F SRT I+AFS+A+ ++TP L + I ALGKD+EL+D +F+ +C + RIR LV Sbjct: 76 FPSRTAITAFSKAICSLATPLLHPSHDAMIIGALGKDAELIDSAFLLNICSSINRIRALV 135 Query: 508 PPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYETR 687 P ATPS LV++LG G R+VLC +P ++GL+EPPVVP L +LE W +RVDAYETR Sbjct: 136 PQTATPSGLVQSLGAGGGRRVLCLVPKIVGLKEPPVVPDFLRELEAAGWVPIRVDAYETR 195 Query: 688 WRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVAAAHGP 867 W E I +E G+D +VFTS+AEVEGLLKSL+E DW MV+ P +V AAHGP Sbjct: 196 WLGPTC--AEGIVKEEGLDGVVFTSSAEVEGLLKSLSEYRWDWKMVKQRWPELVVAAHGP 253 Query: 868 VT 873 VT Sbjct: 254 VT 255 >ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda] gi|548853455|gb|ERN11438.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda] Length = 308 Score = 214 bits (544), Expect = 5e-53 Identities = 118/247 (47%), Positives = 159/247 (64%), Gaps = 3/247 (1%) Frame = +1 Query: 142 RLVAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSA 321 R V +TTP +YA L R + + PLW PT++V TP T + +++ L + SA Sbjct: 28 RHVVYTTPAHYAPSLERRLRAHQAHPLWLPTISVLSTPHTKTLIRNHLQKTL--INQSSA 85 Query: 322 VAFTSRTGISAFSEALAGIST---PPLPSIGEIFTISALGKDSELLDESFIGRLCETPAR 492 +AFTSR I++FSEAL+ I T PPL GE F + ALG+DSELLD+ 
F+ LCE R Sbjct: 86 IAFTSRAAINSFSEALSEILTLNGPPLSGEGEPFYLCALGRDSELLDQRFVLSLCENLDR 145 Query: 493 IRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVD 672 +RV VP V TP A+ E LG GL R++LC +P V GL+EP VVP L L+ ++W +R++ Sbjct: 146 VRVFVPSVPTPKAMAEELGDGLNREILCLVPLVTGLDEPSVVPDFLGALKDQNWRPIRLN 205 Query: 673 AYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRMVA 852 +YETRW + E + + DAIVFTSTAEV+GL+K L ++G +W MVR P +V Sbjct: 206 SYETRWAG--LDCAEFLISDEASDAIVFTSTAEVQGLIKGLKKLGFEWVMVREKRPGLVV 263 Query: 853 AAHGPVT 873 AAHGPVT Sbjct: 264 AAHGPVT 270 >ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor] gi|241925970|gb|EER99114.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor] Length = 299 Score = 212 bits (539), Expect = 2e-52 Identities = 125/252 (49%), Positives = 150/252 (59%), Gaps = 8/252 (3%) Frame = +1 Query: 142 RLVAFTTPHN-----YAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPL 306 R VAFTTP Y GRL L+ G P+ PT+ V P L+ FLLP L Sbjct: 14 RRVAFTTPQTGGGGAYGGRLGALLRQRGAHPVPVPTIAVHPHDP--DRLRPFLLPG--AL 69 Query: 307 RHFSAVAFTSRTGISAFSEALAGISTPPLPSIGEI-FTISALGKDSELLDESFIGRLCET 483 F+A+AFTSR+GISAF+ AL+ S PL + FT++ALG D++LLD +F+ RLC Sbjct: 70 DPFAALAFTSRSGISAFARALSSSSHHPLADASALPFTVAALGSDADLLDHAFLSRLCGA 129 Query: 484 PA--RIRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWA 657 A R+ VLVP V TP+ LVEALG G R+VLCP+P V+GL EPPVVP L LE W Sbjct: 130 AAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPVVPDFLAGLEAAGWV 189 Query: 658 AVRVDAYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMC 837 AVR AY T W + +DA+VFTSTAEVEGLLK L G W + C Sbjct: 190 AVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKRLESAGWTWARLTARC 249 Query: 838 PRMVAAAHGPVT 873 P MV AAHGPVT Sbjct: 250 PGMVVAAHGPVT 261 >ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779932 [Setaria italica] Length = 299 Score = 207 bits (528), Expect = 3e-51 Identities = 127/253 (50%), Positives = 155/253 (61%), Gaps = 9/253 (3%) Frame = +1 Query: 142 
RLVAFTTPH----NYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLR 309 R VAFTTP +Y GRL L+ G P+ PT+ V+P L+ FLLP L Sbjct: 14 RRVAFTTPQTGGASYGGRLGALLRQRGARPVPVPTIAVQPHDP--DRLRPFLLPG--ALD 69 Query: 310 HFSAVAFTSRTGISAFSEALAGISTP--PLPSIGEI-FTISALGKDSELLDESFIGRLC- 477 F+A+AFTSR+GISAF+ AL S+ PL + FT++ALG D++LLD +F+ RLC Sbjct: 70 PFAALAFTSRSGISAFARALPPSSSHHRPLSDASALPFTVAALGSDADLLDRAFLSRLCG 129 Query: 478 ETPARIRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWA 657 + R+ VLVP V TP+ LVEALG G R+VLCP+P V+GL EPPVVP L LE W Sbjct: 130 DAGTRVAVLVPAVPTPAGLVEALGPGSGRRVLCPVPDVVGLREPPVVPDFLAGLEAAGWV 189 Query: 658 AVRVDAYETRWRN-GVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGM 834 AVR AY T W G AE + + DA+VFTSTAEVEGLLK L G W +R Sbjct: 190 AVRAPAYTTSWAGPGCAEALVG-ADAAAPDAVVFTSTAEVEGLLKGLDAAGWTWARLRAR 248 Query: 835 CPRMVAAAHGPVT 873 P MV AAHGPVT Sbjct: 249 WPGMVVAAHGPVT 261 >ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353|gb|ACG46144.1| hypothetical protein [Zea mays] gi|414589847|tpg|DAA40418.1| TPA: hypothetical protein ZEAMMB73_114348 [Zea mays] Length = 297 Score = 206 bits (524), Expect = 9e-51 Identities = 121/250 (48%), Positives = 148/250 (59%), Gaps = 6/250 (2%) Frame = +1 Query: 142 RLVAFTTPHN-----YAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPL 306 R VAFTTP Y GRL L+ G P+ PT+ V P L+ +LLP L Sbjct: 14 RRVAFTTPQTGGGGAYGGRLGALLRQRGAHPVAVPTIAVHPHDP--DRLRPYLLPS--AL 69 Query: 307 RHFSAVAFTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLC-ET 483 F+A+AFTSR+GISAF+ AL+ P + FT++ALG D++LLD +F+ RLC + Sbjct: 70 DPFAALAFTSRSGISAFARALSSSHRPLSHASALPFTVAALGSDADLLDHAFLSRLCGDA 129 Query: 484 PARIRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAV 663 R+ VLVP V TP+ LVEALG G R+VLCP+P V+GL EPPVVP L LE W AV Sbjct: 130 GTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPVVPDFLAGLEAAGWVAV 189 Query: 664 RVDAYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPR 843 R AY T W + +DA+VFTSTAEVEGLLK L VG W + P Sbjct: 190 RAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKGLEAVGWTWARLAARWPG 249 Query: 844 MVAAAHGPVT 
873 MV AAHGPVT Sbjct: 250 MVVAAHGPVT 259 >gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indica Group] Length = 301 Score = 199 bits (507), Expect = 9e-49 Identities = 123/261 (47%), Positives = 154/261 (59%), Gaps = 17/261 (6%) Frame = +1 Query: 142 RLVAFTTPHN------YAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPP 303 R VAFTTP Y GRL ++ G P+ PT+ + I L+ F+ P Sbjct: 12 RRVAFTTPQTDAGGGGYGGRLHAILRQRGARPVPVPTIAIRAHDPDI--LRPFVAPG--G 67 Query: 304 LRHFSAVAFTSRTGISAFSEAL--AGISTP---PLPSIGEI-----FTISALGKDSELLD 453 L F+A+AFTSR+GISAFS AL + S+P P + + FT++ALG D++LLD Sbjct: 68 LDAFAALAFTSRSGISAFSRALLPSSSSSPARRPRHPVSDAATALPFTVAALGSDADLLD 127 Query: 454 ESFIGRLC-ETPARIRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLL 630 +F+ RLC + R+ VLVP V TP+ LVEALG G R+VLCP+P V+GL EPPVVP L Sbjct: 128 AAFLSRLCGDAGGRVSVLVPDVPTPAGLVEALGSGSGRRVLCPVPDVVGLREPPVVPGFL 187 Query: 631 TDLERRSWAAVRVDAYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGL 810 + LE W AVR AY T W E + + DA+VFTSTAEVEGLLK L G Sbjct: 188 SGLEAAGWVAVRAPAYVTCWAG--PRCAEALVDAAAPDAVVFTSTAEVEGLLKGLDAAGW 245 Query: 811 DWGMVRGMCPRMVAAAHGPVT 873 W +R PRMV AAHGPVT Sbjct: 246 SWPRLRARWPRMVVAAHGPVT 266 >gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis] Length = 183 Score = 152 bits (384), Expect = 2e-34 Identities = 89/180 (49%), Positives = 110/180 (61%), Gaps = 1/180 (0%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVA 327 VAFTTP NYAGRLS L+ NG PL PT+ VEPTP TISAL+ +L PP+ FSAVA Sbjct: 17 VAFTTPPNYAGRLSHLLAANGLNPLSSPTLLVEPTPRTISALKSYLPPPHSLNALFSAVA 76 Query: 328 FTSRTGISAFSEALAGISTPPLPSIGEI-FTISALGKDSELLDESFIGRLCETPARIRVL 504 + + P L G+ FTI+ALGKDSELL + ++ + + RIRVL Sbjct: 77 --------------SDLECPLLSPFGDREFTIAALGKDSELLYDEYLTKFGKNRDRIRVL 122 Query: 505 VPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYET 684 VP VA PS LV +L G ++VLC +P ++ LEEPPVVP L +LE W V V YET Sbjct: 123 VPLVAMPSGLVRSLRDGRRQRVLCTVPIIVDLEEPPVVPNFLRELESSRWIPVLVGTYET 182 >ref|XP_002980789.1| hypothetical protein 
SELMODRAFT_420371 [Selaginella moellendorffii] gi|300151328|gb|EFJ17974.1| hypothetical protein SELMODRAFT_420371 [Selaginella moellendorffii] Length = 231 Score = 125 bits (313), Expect = 3e-26 Identities = 84/191 (43%), Positives = 111/191 (58%), Gaps = 1/191 (0%) Frame = +1 Query: 304 LRHFSAVAFTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCET 483 L +S +AFTSR+GI++ + AL + L E+ + ALGKD+EL+ E + + Sbjct: 15 LHTYSCIAFTSRSGIASIAHALEEVR---LSGCAEL-VVGALGKDAELIQELDLFKEHRE 70 Query: 484 PARIRVLVPPVATPSALVEALGLGLERKVLCPIPAVI-GLEEPPVVPKLLTDLERRSWAA 660 R+ V+VP VATP ALVE LG G R++LCP+P V GL EP VVP + L+R W Sbjct: 71 QQRLTVVVPLVATPDALVEELGDGAGRRLLCPVPYVCGGLSEPDVVPNFVAALQRHGWDV 130 Query: 661 VRVDAYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCP 840 R+DAY T W G A + + VDA+VFTSTAEVEGLL +L L + + P Sbjct: 131 ERLDAYATSW-TGSASVTPLLAG--AVDALVFTSTAEVEGLLMALQAHHL---TLASLWP 184 Query: 841 RMVAAAHGPVT 873 V A GPVT Sbjct: 185 -CVLVAFGPVT 194 >ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii] gi|300170521|gb|EFJ37122.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii] Length = 262 Score = 112 bits (280), Expect = 2e-22 Identities = 81/201 (40%), Positives = 110/201 (54%), Gaps = 1/201 (0%) Frame = +1 Query: 274 QHFLLPPYPPLRHFSAVAFTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLD 453 +H L+P + H ++GI++ + AL + L E+ + ALGKD+EL+ Sbjct: 39 RHSLVPHHSRGAH---ATHPIQSGIASIAHALGEVR---LSGCAEL-VVGALGKDAELIQ 91 Query: 454 ESFIGRLCETPARIRVLVPPVATPSALVEALGLGLERKVLCPIP-AVIGLEEPPVVPKLL 630 E + + R+ V+VP VATP ALVE LG G R++LCP+P A GL EP VVP + Sbjct: 92 ELDLFKEHREQQRLTVVVPRVATPDALVEELGDGAGRRLLCPVPYACGGLSEPDVVPNFV 151 Query: 631 TDLERRSWAAVRVDAYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGL 810 L+R W R+DAY T W G A + + VDA+VFTSTAEVEGLL +L L Sbjct: 152 AALQRHGWDVERLDAYATSW-TGSASVTPLLAG--AVDALVFTSTAEVEGLLMALHAHHL 208 Query: 811 DWGMVRGMCPRMVAAAHGPVT 873 + + P V A GPVT Sbjct: 209 ---TIASLWP-CVLVAFGPVT 225 >ref|XP_006381898.1| hypothetical protein 
POPTR_0006s20320g [Populus trichocarpa] gi|550336711|gb|ERP59695.1| hypothetical protein POPTR_0006s20320g [Populus trichocarpa] Length = 150 Score = 107 bits (266), Expect = 8e-21 Identities = 62/129 (48%), Positives = 79/129 (61%) Frame = +1 Query: 487 ARIRVLVPPVATPSALVEALGLGLERKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVR 666 +R++VLVP + T + V LG G RKVLCP+P V+GLEEPPVVP L +LE Sbjct: 12 SRVKVLVPTITTRNG-VHLLGTGRCRKVLCPVPRVVGLEEPPVVPDFLRELE-------- 62 Query: 667 VDAYETRWRNGVAELVERIEEECGVDAIVFTSTAEVEGLLKSLAEVGLDWGMVRGMCPRM 846 A +VER +E +DA+VF S+ EVEGLLKSL E+G +W M+R P + Sbjct: 63 ------------AAVVERSDEGL-LDAMVFASSGEVEGLLKSLKELGWEWEMMRRRWPNL 109 Query: 847 VAAAHGPVT 873 V AHGPVT Sbjct: 110 VVVAHGPVT 118 >ref|XP_005644693.1| tetrapyrrole biosynthesis, uroporphyrinogen III synthase [Coccomyxa subellipsoidea C-169] gi|384246660|gb|EIE20149.1| tetrapyrrole biosynthesis, uroporphyrinogen III synthase [Coccomyxa subellipsoidea C-169] Length = 247 Score = 103 bits (256), Expect = 1e-19 Identities = 90/257 (35%), Positives = 122/257 (47%), Gaps = 17/257 (6%) Frame = +1 Query: 154 FTTPHNYAGRLSRLIHLNGWTPLWCPTVTVEPTPATISALQHFLLPPYPPLRHFSAVAFT 333 FT+P YA +L+ + G P+W P + + S Q L L ++ +AFT Sbjct: 2 FTSPRQYALKLAARLAERGARPVWVPAIEIARLSDAQSMQQ--LDDELASLDSYTHLAFT 59 Query: 334 SRTGISAFSEALAGISTPPLPSIGEIFTI----SALGKDSELLDESFIGRLCETPARIRV 501 SR GI A E LA +I + + +ALG D+E+L E+ + V Sbjct: 60 SRNGIQAVLERLAAAHGSLQSAIAHLNALPLRCAALGADAEMLAEAGVRD---------V 110 Query: 502 LVPPVATPSALVEAL---GLGLERKVLCPIPAVIG-LEEPPVVPKLLTDLERRSWAAVRV 669 L P A+ LV L G +VLCP+P V G L EPPVVP+ L L+ AVRV Sbjct: 111 LTPQEASTQGLVAELQRRGEAEGARVLCPVPLVSGGLTEPPVVPRFLASLQAAGAHAVRV 170 Query: 670 DAYETRWRNGVAE---LVERIEEECGVDAIVFTSTAEVEGLL------KSLAEVGLDWGM 822 DAYETR AE ++ + V A+ FTSTAE EGLL ++L ++ WG Sbjct: 171 DAYETR-PGATAEQCAAERQLLADGHVYAVAFTSTAEAEGLLQIMGGREALQQMLEKWG- 228 Query: 823 VRGMCPRMVAAAHGPVT 873 + AAHGP T Sbjct: 229 -------TILAAHGPYT 238 >ref|XP_006386962.1| hypothetical protein POPTR_2609s00200g, partial [Populus 
trichocarpa] gi|550303614|gb|ERP45876.1| hypothetical protein POPTR_2609s00200g, partial [Populus trichocarpa] Length = 119 Score = 101 bits (251), Expect = 4e-19 Identities = 65/148 (43%), Positives = 85/148 (57%) Frame = +1 Query: 385 PPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIRVLVPPVATPSALVEALGLGLER 564 P LP IF I +LGK+ +L+D + R++VLVP + T + V LG G R Sbjct: 1 PLLPPRENIFIIVSLGKNVQLIDTT----------RVKVLVPTITTRNG-VHLLGTGRCR 49 Query: 565 KVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRVDAYETRWRNGVAELVERIEEECGVD 744 KVLCP+P V+GLEEPPVVP L +LE A +VER +E +D Sbjct: 50 KVLCPVPRVVGLEEPPVVPDFLRELE--------------------AAVVERSDEGL-LD 88 Query: 745 AIVFTSTAEVEGLLKSLAEVGLDWGMVR 828 A+VF S+ EVEGLLKSL E+G +W M+R Sbjct: 89 AMVFASSGEVEGLLKSLKELGWEWEMMR 116 >ref|YP_001865256.1| uroporphyrinogen III synthase HEM4 [Nostoc punctiforme PCC 73102] gi|501376765|ref|WP_012408331.1| uroporphyrinogen III synthase [Nostoc punctiforme] gi|186464512|gb|ACC80313.1| Uroporphyrinogen III synthase HEM4 [Nostoc punctiforme PCC 73102] Length = 300 Score = 98.6 bits (244), Expect = 3e-18 Identities = 76/220 (34%), Positives = 111/220 (50%), Gaps = 7/220 (3%) Frame = +1 Query: 148 VAFTTPHNYAGRLSRLIHLNGWTPLWCPTVT---VEPTPATISALQHFLLPPYPPLRHFS 318 + T P NYA RLS I G P++ PT+ + +AL H + F Sbjct: 40 ILVTAPRNYAYRLSEQIIKQGGLPVFMPTIETCYLSNYAKLDAALNH--------IAEFD 91 Query: 319 AVAFTSRTGISAFSEALAGISTPPLPSIGEIFTISALGKDSELLDESFIGRLCETPARIR 498 + FTSR GI+AF + ++ P S+ E + ALGKD+E L SF G++ Sbjct: 92 WIVFTSRNGITAFFHRMNDLNIPV--SVVEKCQLCALGKDAESL-LSFCGKVD------- 141 Query: 499 VLVPPVATPSALVEALGLGLE---RKVLCPIPAVIGLEEPPVVPKLLTDLERRSWAAVRV 669 L+P ++P+ +V L + +KVL P P V+GL EP VVP L+TDL++ +RV Sbjct: 142 -LIPTESSPAGIVAELAKIPQIHNKKVLIPAPEVVGLPEPDVVPNLITDLQQLGTEVIRV 200 Query: 670 DAYETRWRNGVAELVE-RIEEECGVDAIVFTSTAEVEGLL 786 Y T+ N +E + + +D I F+STAEVE L Sbjct: 201 PTYITQGLNTSIYSIELNLIHQGMIDVIAFSSTAEVESFL 240
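
Each alignment in this report opens with a header of the form `>subject-id description Length = N Score = S bits (raw), Expect = E Identities = a/b (p%)`. As a minimal sketch, assuming the report text is available as a single Python string (line breaks may fall anywhere, as they do here), those per-hit statistics could be pulled out with a regular expression. The names `HIT_HEADER`, `parse_hits`, and the dictionary keys are illustrative assumptions, not part of BLAST or of any BLAST parsing library.

```python
import re

# Hypothetical pattern for the per-alignment header of a legacy BLAST text
# report. re.DOTALL lets ".*?" cross line breaks, since the description can
# wrap (for example inside "[Populus trichocarpa]").
HIT_HEADER = re.compile(
    r">(?P<subject>\S+)\s.*?Length\s*=\s*(?P<slen>\d+)\s+"
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\(\d+\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<alen>\d+)",
    re.DOTALL,
)


def parse_hits(report_text):
    """Return one dict per alignment header found in report_text."""
    hits = []
    for match in HIT_HEADER.finditer(report_text):
        evalue = match.group("evalue")
        # Some legacy reports print values such as "e-100" without a
        # leading digit; float() needs "1e-100".
        if evalue.lower().startswith("e"):
            evalue = "1" + evalue
        identities = int(match.group("ident"))
        align_length = int(match.group("alen"))
        hits.append(
            {
                "subject": match.group("subject"),
                "subject_length": int(match.group("slen")),
                "bits": float(match.group("bits")),
                "evalue": float(evalue),
                "identities": identities,
                "align_length": align_length,
                "percent_identity": round(100.0 * identities / align_length, 1),
            }
        )
    return hits
```

Calling `parse_hits` on the text of this report should give one record per alignment header, starting with `gb|EYU25683.1|` (355 bits, E = 1e-95, 178/246 identities). Only the first accession of each header is kept; the additional `gi|...` identifiers that follow some descriptions are skipped by this sketch.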