BLASTX nr result
ID: Catharanthus23_contig00005440
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005440
         (1076 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267...    336   1e-89
gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea]        295   2e-77
gb|EMJ04953.1| hypothetical protein PRUPE_ppa014878mg [Prunus pe...    286   1e-74
ref|XP_002523533.1| conserved hypothetical protein [Ricinus comm...    272   1e-70
ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Popu...    267   5e-69
gb|ESW10428.1| hypothetical protein PHAVU_009G208600g [Phaseolus...    264   5e-68
gb|EOY29632.1| Uncharacterized protein TCM_037120 [Theobroma cacao]    260   6e-67
ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [A...    246   9e-63
ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353...    229   1e-57
ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779...    227   7e-57
ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [S...    226   9e-57
gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indi...    225   3e-56
gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis]      128   3e-27
ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Sela...    125   4e-26
ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Popu...    118   4e-24
ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Sela...    112   2e-22
ref|XP_002309320.1| predicted protein [Populus trichocarpa] gi|2...    100   1e-18
ref|YP_323579.1| uroporphyrinogen-III synthase [Anabaena variabi...
100 1e-18 ref|WP_016872683.1| hypothetical protein [Chlorogloeopsis fritsc... 95 4e-17 ref|NP_484331.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7... 92 5e-16 >ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267674 [Solanum lycopersicum] Length = 312 Score = 336 bits (861), Expect = 1e-89 Identities = 182/299 (60%), Positives = 217/299 (72%), Gaps = 8/299 (2%) Frame = +3 Query: 165 PGISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNS 344 P S R+ +IAFTTP NYA RLS++I L GW+PL CP++IVE T QTISSI +YL N Sbjct: 14 PENSRRNCVIAFTTPQNYAPRLSELIHLKGWTPLWCPTVIVESTEQTISSIHHYL---NP 70 Query: 345 HPG----KSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELL 512 G SFLE+FSALAFTSRTGITAFS+AL+ PPL P GE TI+ALG D+ELL Sbjct: 71 QAGIDEPNSFLEEFSALAFTSRTGITAFSQALSMNPTPPLTPNGEI-LTIAALGNDAELL 129 Query: 513 DESFLVKICENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFL 692 D F+ K+CENP RI+VL+P +ATP GLV++LGLGQGRK FL Sbjct: 130 DRDFIRKMCENPERIRVLVPSVATPSGLVEALGLGQGRKVLCPVPLVIGLNEPPVVPKFL 189 Query: 693 NDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRGN----LDAIVFTSTGEVEGFLKSLKEL 860 +DL+K+GWIP+R++AY+TRWAG CA V + DAIVFTSTGEVEG LKSL+E Sbjct: 190 DDLSKRGWIPLRLDAYETRWAGATCAVDVVAKSEEECGFDAIVFTSTGEVEGLLKSLEEF 249 Query: 861 GLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNKSL 1037 GL+W M+R+R PRMVVAAHGPVTA GAE LGV IDVVSS F SFDGV+DAL ++W KSL Sbjct: 250 GLDWSMVRRRCPRMVVAAHGPVTAAGAESLGVGIDVVSSNFGSFDGVVDALAHKW-KSL 307 >gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea] Length = 299 Score = 295 bits (756), Expect = 2e-77 Identities = 161/309 (52%), Positives = 214/309 (69%), Gaps = 7/309 (2%) Frame = +3 Query: 120 MVALNLQNPQMNAAVPGISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITP 299 M+ LN+Q A ++RLIAFTTP NYAG+LS++I++ GW+PL CP+I VE T Sbjct: 1 MILLNIQQYPPPA-------KARLIAFTTPENYAGKLSRLIQVKGWTPLWCPTIAVESTA 53 Query: 300 QTISSIQNYLLTPNSHPGKSFLEDFSALAFTSRTGITAFSEAL-TPIEKPPLDPKGENPF 476 T+ +++ Y+ P+ L +F+A+AFTSRTGITAF+EA+ + PPLDP GE F Sbjct: 54 
STVGALRRYVQPPDP-----ILREFAAVAFTSRTGITAFAEAIHSSGGSPPLDPTGEI-F 107 Query: 477 TISALGKDSELLDESFLVKICENPSRIKVLIPQIATPKGLVDSLGLGQGR-KXXXXXXXX 653 TISALGKD+ELLD+SF+ +CEN +RI+VL+P +ATP L ++LG G+GR K Sbjct: 108 TISALGKDAELLDDSFIKSLCENAARIRVLVPAVATPSALAEALGSGEGRRKVLCPVPVV 167 Query: 654 XXXXXXXXXXDFLNDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRG-----NLDAIVFTS 818 FL DL ++GWIPVRV+AY+TR + ++V+ +DAIVFTS Sbjct: 168 IGLEEPPVVPKFLTDLHRRGWIPVRVDAYETRRSHNGTGKLVEAMAAGAECKVDAIVFTS 227 Query: 819 TGEVEGFLKSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDG 998 T EVEG LKSL+E+GL+WE +R+ P MV AA GPVTA GAE+LGV IDVVSS+FDSFDG Sbjct: 228 TAEVEGLLKSLQEIGLDWETIRRTCPGMVAAAQGPVTAAGAEQLGVGIDVVSSRFDSFDG 287 Query: 999 VIDALDYRW 1025 V+DAL+Y+W Sbjct: 288 VVDALEYKW 296 >gb|EMJ04953.1| hypothetical protein PRUPE_ppa014878mg [Prunus persica] Length = 287 Score = 286 bits (732), Expect = 1e-74 Identities = 147/277 (53%), Positives = 194/277 (70%), Gaps = 3/277 (1%) Frame = +3 Query: 192 IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371 +AFTTP NYA RL+ ++ L G++P+S P++IV+ TP TIS+++ YL P S L+ Sbjct: 10 VAFTTPPNYAARLAHLLALKGFNPISSPTLIVQPTPSTISALKPYLSPPPS------LDL 63 Query: 372 FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551 FSA+AF SRT IT+ S A I P L P G+ F I+ALGKD+EL+D++F+ K+C N + Sbjct: 64 FSAIAFPSRTAITSLSAAAADISHPLLSPHGD-AFIIAALGKDAELMDDNFVHKLCSNTN 122 Query: 552 RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRV 731 R+++L+P ATP GLV++LG G+ R+ DFL DL K W+PVRV Sbjct: 123 RVRILVPPTATPSGLVEALGDGRNRRVLCPVPVVVGLVEPPVVPDFLRDLEAKRWVPVRV 182 Query: 732 NAYKTRWAGPKCAEVVKNR---GNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRM 902 NAY+TRWAGP CA+ V R G LDA+VFTST EVEG LKS KE GL+WE+ +KR P+M Sbjct: 183 NAYETRWAGPGCAKQVVERIEEGALDAMVFTSTAEVEGLLKSFKEFGLDWEIAKKRCPKM 242 Query: 903 VVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 +VAAHGP+TA GA LGV +D+VSS+FDSF GV+DAL Sbjct: 243 LVAAHGPITAAGAHMLGVRVDLVSSQFDSFQGVVDAL 279 >ref|XP_002523533.1| conserved hypothetical protein [Ricinus communis] gi|223537240|gb|EEF38872.1| 
conserved hypothetical protein [Ricinus communis] Length = 295 Score = 272 bits (696), Expect = 1e-70 Identities = 143/298 (47%), Positives = 193/298 (64%) Frame = +3 Query: 120 MVALNLQNPQMNAAVPGISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITP 299 M+A+ + +P A P +AFTTP NYA RLS ++ L +PL CP+II + TP Sbjct: 1 MMAVAMHSPVTTTAKP-------TVAFTTPQNYASRLSHLLTLKSLTPLWCPTIITQPTP 53 Query: 300 QTISSIQNYLLTPNSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFT 479 QT+SS+ +L P+S + SA+ F SRT ITAFS+A+ + P L P + Sbjct: 54 QTLSSLALHL-APHS------ISPISAILFPSRTAITAFSKAICSLATPLLHPS-HDAMI 105 Query: 480 ISALGKDSELLDESFLVKICENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXX 659 I ALGKD+EL+D +FL+ IC + +RI+ L+PQ ATP GLV SLG G GR+ Sbjct: 106 IGALGKDAELIDSAFLLNICSSINRIRALVPQTATPSGLVQSLGAGGGRRVLCLVPKIVG 165 Query: 660 XXXXXXXXDFLNDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGF 839 DFL +L GW+P+RV+AY+TRW GP CAE + LD +VFTS+ EVEG Sbjct: 166 LKEPPVVPDFLRELEAAGWVPIRVDAYETRWLGPTCAEGIVKEEGLDGVVFTSSAEVEGL 225 Query: 840 LKSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 LKSL E +W+M+++R+P +VVAAHGPVTA GAE+LGV++DVVS +F SF+GV+DAL Sbjct: 226 LKSLSEYRWDWKMVKQRWPELVVAAHGPVTAAGAERLGVDVDVVSDRFSSFEGVVDAL 283 >ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa] gi|222866001|gb|EEF03132.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa] Length = 302 Score = 267 bits (683), Expect = 5e-69 Identities = 146/286 (51%), Positives = 192/286 (67%), Gaps = 4/286 (1%) Frame = +3 Query: 171 ISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHP 350 I++ +AFTTP NYA RLS ++ L ++PL CP+I E T QT+SS+ +L +P+S Sbjct: 14 ITTNKPTVAFTTPPNYATRLSHLLTLKSFTPLWCPTITTEPTQQTLSSLALHL-SPHS-- 70 Query: 351 GKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLV 530 L SA+AF SRT ITAFS A + P L P+ E+ F I+ALGKD EL+D +FL+ Sbjct: 71 ----LSLLSAIAFPSRTAITAFSTAALSLTTPLLPPR-EDTFIIAALGKDVELIDSTFLL 125 Query: 531 KIC-ENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAK 707 C ++ S + VL+P IATP GLV LG G+GRK DFL +L 
Sbjct: 126 TFCGDDISWVNVLVPTIATPSGLVQLLGTGRGRKVLCPVPRVVGLEEPPVVPDFLRELEG 185 Query: 708 KGWIPVRVNAYKTRWAGPKCAEVV---KNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEM 878 GW+P+RV+AY+TRW GP C + V G LDA+VFTS+GEVEG LKSL+E G +WEM Sbjct: 186 AGWVPIRVDAYETRWLGPACGKGVVELSEGGLLDAMVFTSSGEVEGLLKSLREFGWDWEM 245 Query: 879 MRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALD 1016 +R+R+P +VVAAHGPVTA GAE+LGV +DVVS +FDSF GV+DA++ Sbjct: 246 VRRRWPHLVVAAHGPVTAAGAERLGVTVDVVSGRFDSFQGVVDAVE 291 >gb|ESW10428.1| hypothetical protein PHAVU_009G208600g [Phaseolus vulgaris] Length = 280 Score = 264 bits (674), Expect = 5e-68 Identities = 140/290 (48%), Positives = 189/290 (65%), Gaps = 3/290 (1%) Frame = +3 Query: 171 ISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHP 350 +S + +AFTTP NYA RLS ++ L ++PL CP+++++ P T++ L+P+S Sbjct: 1 MSLHNPTVAFTTPPNYAARLSNLLSLSAYTPLWCPTLLIQPLPSTLAPF----LSPHS-- 54 Query: 351 GKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLV 530 L FSA+AFTSRT I AF +A T + PPL P+G FT++ALGKD++L+D FL Sbjct: 55 ----LHRFSAIAFTSRTAIQAFLQAATSLSHPPLPPEGST-FTLAALGKDADLIDAQFLS 109 Query: 531 KICENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKK 710 C N +R+ VL+P ATP L +LG G GR FL +L + Sbjct: 110 AFCSNSNRLCVLVPPTATPSALAAALGDGCGRGVLCPVPRVIGVNEPPVVPGFLEELRRG 169 Query: 711 GWIPVRVNAYKTRWAGPKCAEVV---KNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMM 881 W+PVRV AY+TRWAGP CAE + G LDA+VFTST EVEG L+SLK+ GL + + Sbjct: 170 RWVPVRVEAYETRWAGPGCAEGIVRASEEGGLDAVVFTSTAEVEGLLQSLKDFGLGFADL 229 Query: 882 RKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNK 1031 R+R PR+VVAAHGPVTA GA++LGVE+DVVSS+F SFDGVID L+ +++ Sbjct: 230 RRRCPRLVVAAHGPVTAAGAQRLGVEVDVVSSRFGSFDGVIDVLNVTFSR 279 >gb|EOY29632.1| Uncharacterized protein TCM_037120 [Theobroma cacao] Length = 301 Score = 260 bits (665), Expect = 6e-67 Identities = 141/280 (50%), Positives = 182/280 (65%), Gaps = 5/280 (1%) Frame = +3 Query: 192 IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371 + FTTP NYA RLS ++ L G +PL CP+I TP ++S+ L+P+S L Sbjct: 20 
VIFTTPPNYAARLSNLLTLKGHTPLWCPTITTHPTPHSLSTH----LSPHS------LSL 69 Query: 372 FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551 SA+ F SR IT+FS A + KP L G F ++ALGKDSEL++ F+ +IC N Sbjct: 70 LSAITFPSRASITSFSLAALSLPKPLLPSHGPT-FILAALGKDSELINTPFISQICSNLQ 128 Query: 552 RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRV 731 RIKVL+P ATP L SLG G GR+ DFL DL GW+P+RV Sbjct: 129 RIKVLVPPTATPNSLALSLGKGYGRRVLCPVPKVVGLNEPPVVPDFLKDLESGGWVPIRV 188 Query: 732 NAYKTRWAGPKCAEVVKNRGN-----LDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFP 896 +AY+TRW GP CAE V +G ++A+VFTS+GEVEGFLKSL+E G +W M+R+R+ Sbjct: 189 DAYETRWVGPSCAEEVVRKGEEHEEEVNAVVFTSSGEVEGFLKSLREFGWDWGMVRRRWS 248 Query: 897 RMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALD 1016 R+VVAAHGPVTA GA++LGV++DVVSS FDSF GV+DALD Sbjct: 249 RLVVAAHGPVTAVGAKRLGVDVDVVSSNFDSFQGVVDALD 288 >ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda] gi|548853455|gb|ERN11438.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda] Length = 308 Score = 246 bits (629), Expect = 9e-63 Identities = 133/285 (46%), Positives = 183/285 (64%), Gaps = 3/285 (1%) Frame = +3 Query: 186 RLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFL 365 R + +TTP +YA L + ++ H PL P+I V TP T + I+N+L K+ + Sbjct: 28 RHVVYTTPAHYAPSLERRLRAHQAHPLWLPTISVLSTPHTKTLIRNHLQ-------KTLI 80 Query: 366 EDFSALAFTSRTGITAFSEALTPI---EKPPLDPKGENPFTISALGKDSELLDESFLVKI 536 SA+AFTSR I +FSEAL+ I PPL +GE PF + ALG+DSELLD+ F++ + Sbjct: 81 NQSSAIAFTSRAAINSFSEALSEILTLNGPPLSGEGE-PFYLCALGRDSELLDQRFVLSL 139 Query: 537 CENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGW 716 CEN R++V +P + TPK + + LG G R+ DFL L + W Sbjct: 140 CENLDRVRVFVPSVPTPKAMAEELGDGLNREILCLVPLVTGLDEPSVVPDFLGALKDQNW 199 Query: 717 IPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFP 896 P+R+N+Y+TRWAG CAE + + DAIVFTST EV+G +K LK+LG W M+R++ P Sbjct: 200 RPIRLNSYETRWAGLDCAEFLISDEASDAIVFTSTAEVQGLIKGLKKLGFEWVMVREKRP 259 Query: 897 RMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNK 1031 
+VVAAHGPVTA GA+KLGV+ID+VSS+FDSFDGV++AL R+ K Sbjct: 260 GLVVAAHGPVTALGAKKLGVDIDLVSSRFDSFDGVVNALAQRFMK 304 >ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353|gb|ACG46144.1| hypothetical protein [Zea mays] gi|414589847|tpg|DAA40418.1| TPA: hypothetical protein ZEAMMB73_114348 [Zea mays] Length = 297 Score = 229 bits (585), Expect = 1e-57 Identities = 130/288 (45%), Positives = 175/288 (60%), Gaps = 8/288 (2%) Frame = +3 Query: 174 SSRSRLIAFTTPHN-----YAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTP 338 S R +AFTTP Y GRL +++ G P++ P+I V P ++ YLL Sbjct: 10 SLAGRRVAFTTPQTGGGGAYGGRLGALLRQRGAHPVAVPTIAVH--PHDPDRLRPYLLP- 66 Query: 339 NSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDE 518 S L+ F+ALAFTSR+GI+AF+ AL+ +P L PFT++ALG D++LLD Sbjct: 67 ------SALDPFAALAFTSRSGISAFARALSSSHRP-LSHASALPFTVAALGSDADLLDH 119 Query: 519 SFLVKICENP-SRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLN 695 +FL ++C + +R+ VL+P + TP GLV++LG G GR+ DFL Sbjct: 120 AFLSRLCGDAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPVVPDFLA 179 Query: 696 DLAKKGWIPVRVNAYKTRWAGPKCAEVV--KNRGNLDAIVFTSTGEVEGFLKSLKELGLN 869 L GW+ VR AY T WAGP+CAE + + LDA+VFTST EVEG LK L+ +G Sbjct: 180 GLEAAGWVAVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKGLEAVGWT 239 Query: 870 WEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 W + R+P MVVAAHGPVTA GA LGVE+D+VS++F SF GV+DAL Sbjct: 240 WARLAARWPGMVVAAHGPVTAGGARSLGVEVDIVSTRFSSFHGVVDAL 287 >ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779932 [Setaria italica] Length = 299 Score = 227 bits (578), Expect = 7e-57 Identities = 131/285 (45%), Positives = 172/285 (60%), Gaps = 9/285 (3%) Frame = +3 Query: 186 RLIAFTTPH----NYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPG 353 R +AFTTP +Y GRL +++ G P+ P+I V+ P ++ +LL PG Sbjct: 14 RRVAFTTPQTGGASYGGRLGALLRQRGARPVPVPTIAVQ--PHDPDRLRPFLL-----PG 66 Query: 354 KSFLEDFSALAFTSRTGITAFSEALTPIEKP--PLDPKGENPFTISALGKDSELLDESFL 527 L+ F+ALAFTSR+GI+AF+ AL P PL PFT++ALG D++LLD +FL Sbjct: 67 
A--LDPFAALAFTSRSGISAFARALPPSSSHHRPLSDASALPFTVAALGSDADLLDRAFL 124 Query: 528 VKICENP-SRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLA 704 ++C + +R+ VL+P + TP GLV++LG G GR+ DFL L Sbjct: 125 SRLCGDAGTRVAVLVPAVPTPAGLVEALGPGSGRRVLCPVPDVVGLREPPVVPDFLAGLE 184 Query: 705 KKGWIPVRVNAYKTRWAGPKCAEVV--KNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEM 878 GW+ VR AY T WAGP CAE + + DA+VFTST EVEG LK L G W Sbjct: 185 AAGWVAVRAPAYTTSWAGPGCAEALVGADAAAPDAVVFTSTAEVEGLLKGLDAAGWTWAR 244 Query: 879 MRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 +R R+P MVVAAHGPVTA GA LGVE+DVVS++F SF GV+DAL Sbjct: 245 LRARWPGMVVAAHGPVTAAGARSLGVEVDVVSARFSSFHGVVDAL 289 >ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor] gi|241925970|gb|EER99114.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor] Length = 299 Score = 226 bits (577), Expect = 9e-57 Identities = 131/289 (45%), Positives = 171/289 (59%), Gaps = 9/289 (3%) Frame = +3 Query: 174 SSRSRLIAFTTPHN-----YAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTP 338 S R +AFTTP Y GRL +++ G P+ P+I V P ++ +LL Sbjct: 10 SLTGRRVAFTTPQTGGGGAYGGRLGALLRQRGAHPVPVPTIAVH--PHDPDRLRPFLL-- 65 Query: 339 NSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDE 518 PG L+ F+ALAFTSR+GI+AF+ AL+ PL PFT++ALG D++LLD Sbjct: 66 ---PGA--LDPFAALAFTSRSGISAFARALSSSSHHPLADASALPFTVAALGSDADLLDH 120 Query: 519 SFLVKICENPS--RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFL 692 +FL ++C + R+ VL+P + TP GLV++LG G GR+ DFL Sbjct: 121 AFLSRLCGAAAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPVVPDFL 180 Query: 693 NDLAKKGWIPVRVNAYKTRWAGPKCAEVV--KNRGNLDAIVFTSTGEVEGFLKSLKELGL 866 L GW+ VR AY T WAGP+CAE + + LDA+VFTST EVEG LK L+ G Sbjct: 181 AGLEAAGWVAVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKRLESAGW 240 Query: 867 NWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 W + R P MVVAAHGPVTA GA LGVE+DVVS++F SF GV+DAL Sbjct: 241 TWARLTARCPGMVVAAHGPVTAGGARSLGVEVDVVSARFSSFHGVVDAL 289 >gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indica Group] Length = 301 Score = 225 bits 
(573), Expect = 3e-56 Identities = 129/297 (43%), Positives = 170/297 (57%), Gaps = 16/297 (5%) Frame = +3 Query: 171 ISSRSRLIAFTTPHN------YAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLL 332 +S R +AFTTP Y GRL +++ G P+ P+I + I L Sbjct: 7 LSLAGRRVAFTTPQTDAGGGGYGGRLHAILRQRGARPVPVPTIAIRAHDPDI-------L 59 Query: 333 TPNSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEK---------PPLDPKGENPFTIS 485 P PG L+ F+ALAFTSR+GI+AFS AL P P D PFT++ Sbjct: 60 RPFVAPGG--LDAFAALAFTSRSGISAFSRALLPSSSSSPARRPRHPVSDAATALPFTVA 117 Query: 486 ALGKDSELLDESFLVKICENPS-RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXX 662 ALG D++LLD +FL ++C + R+ VL+P + TP GLV++LG G GR+ Sbjct: 118 ALGSDADLLDAAFLSRLCGDAGGRVSVLVPDVPTPAGLVEALGSGSGRRVLCPVPDVVGL 177 Query: 663 XXXXXXXDFLNDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGFL 842 FL+ L GW+ VR AY T WAGP+CAE + + DA+VFTST EVEG L Sbjct: 178 REPPVVPGFLSGLEAAGWVAVRAPAYVTCWAGPRCAEALVDAAAPDAVVFTSTAEVEGLL 237 Query: 843 KSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 K L G +W +R R+PRMVVAAHGPVTA G +LG+E+DVV ++F SF GV+DAL Sbjct: 238 KGLDAAGWSWPRLRARWPRMVVAAHGPVTADGVRRLGIEVDVVGARFSSFHGVLDAL 294 >gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis] Length = 183 Score = 128 bits (322), Expect = 3e-27 Identities = 75/185 (40%), Positives = 106/185 (57%) Frame = +3 Query: 192 IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371 +AFTTP NYAGRLS ++ +G +PLS P+++VE TP+TIS++++YL P+S Sbjct: 17 VAFTTPPNYAGRLSHLLAANGLNPLSSPTLLVEPTPRTISALKSYLPPPHS--------- 67 Query: 372 FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551 +AL FS + +E P L P G+ FTI+ALGKDSELL + +L K +N Sbjct: 68 LNAL----------FSAVASDLECPLLSPFGDREFTIAALGKDSELLYDEYLTKFGKNRD 117 Query: 552 RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRV 731 RI+VL+P +A P GLV SL G+ ++ +FL +L WIPV V Sbjct: 118 RIRVLVPLVAMPSGLVRSLRDGRRQRVLCTVPIIVDLEEPPVVPNFLRELESSRWIPVLV 177 Query: 732 NAYKT 746 Y+T Sbjct: 178 GTYET 182 >ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Selaginella moellendorffii] 
gi|300151328|gb|EFJ17974.1| hypothetical protein SELMODRAFT_420371 [Selaginella moellendorffii] Length = 231 Score = 125 bits (313), Expect = 4e-26 Identities = 83/229 (36%), Positives = 120/229 (52%), Gaps = 1/229 (0%) Frame = +3 Query: 357 SFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKI 536 S L +S +AFTSR+GI + + AL + G + ALGKD+EL+ E L K Sbjct: 13 SALHTYSCIAFTSRSGIASIAHALEEVRL-----SGCAELVVGALGKDAELIQELDLFKE 67 Query: 537 CENPSRIKVLIPQIATPKGLVDSLGLGQGRK-XXXXXXXXXXXXXXXXXXDFLNDLAKKG 713 R+ V++P +ATP LV+ LG G GR+ +F+ L + G Sbjct: 68 HREQQRLTVVVPLVATPDALVEELGDGAGRRLLCPVPYVCGGLSEPDVVPNFVAALQRHG 127 Query: 714 WIPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRF 893 W R++AY T W G + G +DA+VFTST EVEG L +L+ L + + Sbjct: 128 WDVERLDAYATSWTGSASVTPLL-AGAVDALVFTSTAEVEGLLMALQAHHLT---LASLW 183 Query: 894 PRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNKSLD 1040 P V+ A GPVTA GA++LGV++DV+ +F+ F + D L + K LD Sbjct: 184 P-CVLVAFGPVTARGAKQLGVDVDVIGHRFNGFTDLADLLVSHFRKRLD 231 >ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Populus trichocarpa] gi|550336711|gb|ERP59695.1| hypothetical protein POPTR_0006s20320g [Populus trichocarpa] Length = 150 Score = 118 bits (296), Expect = 4e-24 Identities = 72/154 (46%), Positives = 87/154 (56%), Gaps = 2/154 (1%) Frame = +3 Query: 549 SRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVR 728 SR+KVL+P I T G V LG G+ RK DFL +L Sbjct: 12 SRVKVLVPTITTRNG-VHLLGTGRCRKVLCPVPRVVGLEEPPVVPDFLRELE-------- 62 Query: 729 VNAYKTRWAGPKCAEVVK--NRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRM 902 A VV+ + G LDA+VF S+GEVEG LKSLKELG WEMMR+R+P + Sbjct: 63 -------------AAVVERSDEGLLDAMVFASSGEVEGLLKSLKELGWEWEMMRRRWPNL 109 Query: 903 VVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVI 1004 VV AHGPVTA GAE LGV ++VVS +FDSF G + Sbjct: 110 VVVAHGPVTAAGAESLGVNVNVVSERFDSFQGTV 143 >ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii] gi|300170521|gb|EFJ37122.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii] Length 
= 262 Score = 112 bits (281), Expect = 2e-22 Identities = 77/216 (35%), Positives = 112/216 (51%), Gaps = 1/216 (0%) Frame = +3 Query: 396 RTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPSRIKVLIPQ 575 ++GI + + AL + G + ALGKD+EL+ E L K R+ V++P+ Sbjct: 57 QSGIASIAHALGEVRL-----SGCAELVVGALGKDAELIQELDLFKEHREQQRLTVVVPR 111 Query: 576 IATPKGLVDSLGLGQGRK-XXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRVNAYKTRW 752 +ATP LV+ LG G GR+ +F+ L + GW R++AY T W Sbjct: 112 VATPDALVEELGDGAGRRLLCPVPYACGGLSEPDVVPNFVAALQRHGWDVERLDAYATSW 171 Query: 753 AGPKCAEVVKNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRMVVAAHGPVTA 932 G + G +DA+VFTST EVEG L +L L + +P V+ A GPVTA Sbjct: 172 TGSASVTPLL-AGAVDALVFTSTAEVEGLLMALHAHHLT---IASLWP-CVLVAFGPVTA 226 Query: 933 CGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNKSLD 1040 GA++LGV++DVV +F+SF + D L + K LD Sbjct: 227 RGAKRLGVDVDVVGHRFNSFTDLADLLVSHFRKRLD 262 >ref|XP_002309320.1| predicted protein [Populus trichocarpa] gi|224099845|ref|XP_002334435.1| predicted protein [Populus trichocarpa] Length = 74 Score = 100 bits (249), Expect = 1e-18 Identities = 46/67 (68%), Positives = 55/67 (82%) Frame = +3 Query: 804 IVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKF 983 +VF S+GEVEG LKSLKELG WEMMR+R+P +VV AHGPVTA GAE LGV ++VVS +F Sbjct: 1 MVFASSGEVEGLLKSLKELGWEWEMMRRRWPNLVVVAHGPVTAAGAESLGVNVNVVSERF 60 Query: 984 DSFDGVI 1004 DSF G + Sbjct: 61 DSFQGTV 67 >ref|YP_323579.1| uroporphyrinogen-III synthase [Anabaena variabilis ATCC 29413] gi|499639080|ref|WP_011319814.1| uroporphyrinogen III synthase [Anabaena variabilis] gi|75703008|gb|ABA22684.1| uroporphyrinogen-III synthase [Anabaena variabilis ATCC 29413] Length = 276 Score = 100 bits (248), Expect = 1e-18 Identities = 86/280 (30%), Positives = 123/280 (43%), Gaps = 6/280 (2%) Frame = +3 Query: 192 IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371 I T P NYA RLS I G P+ P+I P S + + S + + Sbjct: 18 ILVTAPRNYASRLSAQIICKGGLPILMPTIETCYLPN-FSQLDAVI---------SCINE 67 Query: 372 
FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551 F +AFTSR GI AF E L ++ + + ALGKD ++L F Sbjct: 68 FDWIAFTSRNGIIAFFERLHNLD---ISINKLQNCQLCALGKDIDVLLSLF--------G 116 Query: 552 RIKVLIPQIATPKGLVDSLGLGQG---RKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIP 722 R+ LIP ++P G+V G +K +F+ DL K G Sbjct: 117 RVD-LIPDESSPAGIVAKFSQIHGISRQKILVPVPEVIGIPEPNIVPNFIKDLEKLGMQV 175 Query: 723 VRVNAYKTRWAGPKCAEVVKN---RGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRF 893 +RV Y T+ V N +G +D I F+ST E+E FLK + F Sbjct: 176 IRVPTYITQSLDKNIYSVEINLIQQGLIDVIAFSSTAEIESFLKMFNS--------KNEF 227 Query: 894 PRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 VVA GP TA A+KLG+++ +VS+ F SF+G ++A+ Sbjct: 228 QHCVVACFGPYTAANAQKLGLDVSLVSTDFSSFEGFVEAI 267 >ref|WP_016872683.1| hypothetical protein [Chlorogloeopsis fritschii] Length = 313 Score = 95.1 bits (235), Expect = 4e-17 Identities = 85/286 (29%), Positives = 128/286 (44%), Gaps = 9/286 (3%) Frame = +3 Query: 183 SRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSI---IVEITPQTISSIQNYLLTPNSHPG 353 S+ I T P NYA RLS+ + G P+ P+I ++E Q ++Q Sbjct: 38 SKRILVTAPRNYAARLSEQLINQGALPILMPTIETCVLENFAQLDIALQK---------- 87 Query: 354 KSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVK 533 ++ F +AFTSR GI AF + L E L+ + +SA+GKD+E L +F V+ Sbjct: 88 ---IDTFDWIAFTSRNGIDAFFQRL---ESLGLNHRVLKNCRLSAIGKDAERL-AAFGVE 140 Query: 534 ICENPSRIKVLIPQIATPKGLVDSLGLG---QGRKXXXXXXXXXXXXXXXXXXDFLNDLA 704 + LIPQ +P+G++ L QG+K +F+ L Sbjct: 141 VD--------LIPQQPSPQGIIAELAQIPNIQGKKILVPVPEVVGVPEPDVVPNFVAGLK 192 Query: 705 KKGWIPVRVNAYKTRWAGPKCAEVVKN---RGNLDAIVFTSTGEVEGFLKSLKELGLNWE 875 G RV Y TR EV N +G +D I F+ST EV FL+ Sbjct: 193 NLGMSVTRVPTYLTRCLDKSFYEVELNLIRQGKVDVIAFSSTAEVASFLQMFTA------ 246 Query: 876 MMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 + + + V+A GP TA A KLGV + +++ + SF G +A+ Sbjct: 247 --KADYQQCVIACFGPYTAANANKLGVNVSIIAQDYSSFAGFAEAI 290 >ref|NP_484331.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7120] gi|499303689|ref|WP_010994464.1| uroporphyrinogen III synthase [Nostoc sp. 
PCC 7120] gi|17135265|dbj|BAB77811.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7120] Length = 276 Score = 91.7 bits (226), Expect = 5e-16 Identities = 83/280 (29%), Positives = 121/280 (43%), Gaps = 6/280 (2%) Frame = +3 Query: 192 IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371 I T P NYA RLS I G P+ P+I S + + S + + Sbjct: 18 ILVTAPRNYASRLSAQIICKGGLPILMPTIETCYL-SNFSKLDAVI---------SSINE 67 Query: 372 FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551 F +AFTSR GI AF E L ++ + + ALGKD ++L F Sbjct: 68 FDWIAFTSRNGIIAFFERLHNLD---ISITKLQNCQLCALGKDIDILLSLF--------G 116 Query: 552 RIKVLIPQIATPKGLVDSLGLGQG---RKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIP 722 ++ LIP ++P G+V G +K +F+ DL + G Sbjct: 117 KVD-LIPDESSPAGIVAEFSQICGIREQKILVPVPEVIGIPEPNIVPNFIKDLEELGMQV 175 Query: 723 VRVNAYKTRWAGPKCAEVVKN---RGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRF 893 +RV AY T+ V N +G +D I F+ST E+E FL + F Sbjct: 176 IRVPAYITQSLDKDIYSVEINLIQQGLIDIIAFSSTAEIESFLAMFNS--------KSEF 227 Query: 894 PRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013 VVA GP TA AE+LG+ + +VS+ F SF+G ++A+ Sbjct: 228 QHCVVACFGPYTAANAEQLGLNVSIVSTDFSSFEGFVEAI 267
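
Note on reading the per-hit statistics: each alignment in the report above opens with a line of the form "Score = 336 bits (861), Expect = 1e-89 Identities = 182/299 (60%), ...". The following is a minimal Python sketch, not part of the BLAST output, for pulling the bit score, E-value and identity fraction out of such lines. The function name hit_stats and the regular expression are illustrative assumptions; the pattern is written only against the legacy text layout seen in this report, and it tolerates the statistics being wrapped across lines.

```python
import re

# Assumed pattern for the per-alignment statistics of a legacy BLAST text
# report, e.g. "Score = 336 bits (861), Expect = 1e-89 Identities = 182/299".
# \s* is used between tokens so that wrapped or flattened lines still match.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\(\d+\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def hit_stats(report_text):
    """Return one dict per alignment statistics block found in report_text."""
    hits = []
    for m in STATS.finditer(report_text):
        evalue = m["evalue"]
        # Legacy BLAST may print very small E-values as "e-100" (no mantissa).
        if evalue.startswith("e"):
            evalue = "1" + evalue
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "evalue": float(evalue),
            "identity": ident / length,  # fraction of identical positions
        })
    return hits


if __name__ == "__main__":
    sample = ("Score = 336 bits (861), Expect = 1e-89 "
              "Identities = 182/299 (60%), Positives = 217/299 (72%)")
    for hit in hit_stats(sample):
        print(hit["bits"], hit["evalue"], round(hit["identity"], 3))
```

Because the hits are returned in report order, the first entry corresponds to the top hit in the summary table. A dedicated BLAST parser is likely a better choice when the original, unflattened report is available.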