BLASTX nr result

ID: Catharanthus23_contig00005440 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00005440
         (1076 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267...   336   1e-89
gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea]       295   2e-77
gb|EMJ04953.1| hypothetical protein PRUPE_ppa014878mg [Prunus pe...   286   1e-74
ref|XP_002523533.1| conserved hypothetical protein [Ricinus comm...   272   1e-70
ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Popu...   267   5e-69
gb|ESW10428.1| hypothetical protein PHAVU_009G208600g [Phaseolus...   264   5e-68
gb|EOY29632.1| Uncharacterized protein TCM_037120 [Theobroma cacao]   260   6e-67
ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [A...   246   9e-63
ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353...   229   1e-57
ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779...   227   7e-57
ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [S...   226   9e-57
gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indi...   225   3e-56
gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis]     128   3e-27
ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Sela...   125   4e-26
ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Popu...   118   4e-24
ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Sela...   112   2e-22
ref|XP_002309320.1| predicted protein [Populus trichocarpa] gi|2...   100   1e-18
ref|YP_323579.1| uroporphyrinogen-III synthase [Anabaena variabi...   100   1e-18
ref|WP_016872683.1| hypothetical protein [Chlorogloeopsis fritsc...    95   4e-17
ref|NP_484331.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7...    92   5e-16

>ref|XP_004233274.1| PREDICTED: uncharacterized protein LOC101267674 [Solanum
            lycopersicum]
          Length = 312

 Score =  336 bits (861), Expect = 1e-89
 Identities = 182/299 (60%), Positives = 217/299 (72%), Gaps = 8/299 (2%)
 Frame = +3

Query: 165  PGISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNS 344
            P  S R+ +IAFTTP NYA RLS++I L GW+PL CP++IVE T QTISSI +YL   N 
Sbjct: 14   PENSRRNCVIAFTTPQNYAPRLSELIHLKGWTPLWCPTVIVESTEQTISSIHHYL---NP 70

Query: 345  HPG----KSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELL 512
              G     SFLE+FSALAFTSRTGITAFS+AL+    PPL P GE   TI+ALG D+ELL
Sbjct: 71   QAGIDEPNSFLEEFSALAFTSRTGITAFSQALSMNPTPPLTPNGEI-LTIAALGNDAELL 129

Query: 513  DESFLVKICENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFL 692
            D  F+ K+CENP RI+VL+P +ATP GLV++LGLGQGRK                   FL
Sbjct: 130  DRDFIRKMCENPERIRVLVPSVATPSGLVEALGLGQGRKVLCPVPLVIGLNEPPVVPKFL 189

Query: 693  NDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRGN----LDAIVFTSTGEVEGFLKSLKEL 860
            +DL+K+GWIP+R++AY+TRWAG  CA  V  +       DAIVFTSTGEVEG LKSL+E 
Sbjct: 190  DDLSKRGWIPLRLDAYETRWAGATCAVDVVAKSEEECGFDAIVFTSTGEVEGLLKSLEEF 249

Query: 861  GLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNKSL 1037
            GL+W M+R+R PRMVVAAHGPVTA GAE LGV IDVVSS F SFDGV+DAL ++W KSL
Sbjct: 250  GLDWSMVRRRCPRMVVAAHGPVTAAGAESLGVGIDVVSSNFGSFDGVVDALAHKW-KSL 307


>gb|EPS70810.1| hypothetical protein M569_03949 [Genlisea aurea]
          Length = 299

 Score =  295 bits (756), Expect = 2e-77
 Identities = 161/309 (52%), Positives = 214/309 (69%), Gaps = 7/309 (2%)
 Frame = +3

Query: 120  MVALNLQNPQMNAAVPGISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITP 299
            M+ LN+Q     A       ++RLIAFTTP NYAG+LS++I++ GW+PL CP+I VE T 
Sbjct: 1    MILLNIQQYPPPA-------KARLIAFTTPENYAGKLSRLIQVKGWTPLWCPTIAVESTA 53

Query: 300  QTISSIQNYLLTPNSHPGKSFLEDFSALAFTSRTGITAFSEAL-TPIEKPPLDPKGENPF 476
             T+ +++ Y+  P+       L +F+A+AFTSRTGITAF+EA+ +    PPLDP GE  F
Sbjct: 54   STVGALRRYVQPPDP-----ILREFAAVAFTSRTGITAFAEAIHSSGGSPPLDPTGEI-F 107

Query: 477  TISALGKDSELLDESFLVKICENPSRIKVLIPQIATPKGLVDSLGLGQGR-KXXXXXXXX 653
            TISALGKD+ELLD+SF+  +CEN +RI+VL+P +ATP  L ++LG G+GR K        
Sbjct: 108  TISALGKDAELLDDSFIKSLCENAARIRVLVPAVATPSALAEALGSGEGRRKVLCPVPVV 167

Query: 654  XXXXXXXXXXDFLNDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRG-----NLDAIVFTS 818
                       FL DL ++GWIPVRV+AY+TR +     ++V+         +DAIVFTS
Sbjct: 168  IGLEEPPVVPKFLTDLHRRGWIPVRVDAYETRRSHNGTGKLVEAMAAGAECKVDAIVFTS 227

Query: 819  TGEVEGFLKSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDG 998
            T EVEG LKSL+E+GL+WE +R+  P MV AA GPVTA GAE+LGV IDVVSS+FDSFDG
Sbjct: 228  TAEVEGLLKSLQEIGLDWETIRRTCPGMVAAAQGPVTAAGAEQLGVGIDVVSSRFDSFDG 287

Query: 999  VIDALDYRW 1025
            V+DAL+Y+W
Sbjct: 288  VVDALEYKW 296


>gb|EMJ04953.1| hypothetical protein PRUPE_ppa014878mg [Prunus persica]
          Length = 287

 Score =  286 bits (732), Expect = 1e-74
 Identities = 147/277 (53%), Positives = 194/277 (70%), Gaps = 3/277 (1%)
 Frame = +3

Query: 192  IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371
            +AFTTP NYA RL+ ++ L G++P+S P++IV+ TP TIS+++ YL  P S      L+ 
Sbjct: 10   VAFTTPPNYAARLAHLLALKGFNPISSPTLIVQPTPSTISALKPYLSPPPS------LDL 63

Query: 372  FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551
            FSA+AF SRT IT+ S A   I  P L P G+  F I+ALGKD+EL+D++F+ K+C N +
Sbjct: 64   FSAIAFPSRTAITSLSAAAADISHPLLSPHGD-AFIIAALGKDAELMDDNFVHKLCSNTN 122

Query: 552  RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRV 731
            R+++L+P  ATP GLV++LG G+ R+                  DFL DL  K W+PVRV
Sbjct: 123  RVRILVPPTATPSGLVEALGDGRNRRVLCPVPVVVGLVEPPVVPDFLRDLEAKRWVPVRV 182

Query: 732  NAYKTRWAGPKCAEVVKNR---GNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRM 902
            NAY+TRWAGP CA+ V  R   G LDA+VFTST EVEG LKS KE GL+WE+ +KR P+M
Sbjct: 183  NAYETRWAGPGCAKQVVERIEEGALDAMVFTSTAEVEGLLKSFKEFGLDWEIAKKRCPKM 242

Query: 903  VVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
            +VAAHGP+TA GA  LGV +D+VSS+FDSF GV+DAL
Sbjct: 243  LVAAHGPITAAGAHMLGVRVDLVSSQFDSFQGVVDAL 279


>ref|XP_002523533.1| conserved hypothetical protein [Ricinus communis]
            gi|223537240|gb|EEF38872.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 295

 Score =  272 bits (696), Expect = 1e-70
 Identities = 143/298 (47%), Positives = 193/298 (64%)
 Frame = +3

Query: 120  MVALNLQNPQMNAAVPGISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITP 299
            M+A+ + +P    A P        +AFTTP NYA RLS ++ L   +PL CP+II + TP
Sbjct: 1    MMAVAMHSPVTTTAKP-------TVAFTTPQNYASRLSHLLTLKSLTPLWCPTIITQPTP 53

Query: 300  QTISSIQNYLLTPNSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFT 479
            QT+SS+  +L  P+S      +   SA+ F SRT ITAFS+A+  +  P L P   +   
Sbjct: 54   QTLSSLALHL-APHS------ISPISAILFPSRTAITAFSKAICSLATPLLHPS-HDAMI 105

Query: 480  ISALGKDSELLDESFLVKICENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXX 659
            I ALGKD+EL+D +FL+ IC + +RI+ L+PQ ATP GLV SLG G GR+          
Sbjct: 106  IGALGKDAELIDSAFLLNICSSINRIRALVPQTATPSGLVQSLGAGGGRRVLCLVPKIVG 165

Query: 660  XXXXXXXXDFLNDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGF 839
                    DFL +L   GW+P+RV+AY+TRW GP CAE +     LD +VFTS+ EVEG 
Sbjct: 166  LKEPPVVPDFLRELEAAGWVPIRVDAYETRWLGPTCAEGIVKEEGLDGVVFTSSAEVEGL 225

Query: 840  LKSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
            LKSL E   +W+M+++R+P +VVAAHGPVTA GAE+LGV++DVVS +F SF+GV+DAL
Sbjct: 226  LKSLSEYRWDWKMVKQRWPELVVAAHGPVTAAGAERLGVDVDVVSDRFSSFEGVVDAL 283


>ref|XP_002324567.1| hypothetical protein POPTR_0018s12090g [Populus trichocarpa]
            gi|222866001|gb|EEF03132.1| hypothetical protein
            POPTR_0018s12090g [Populus trichocarpa]
          Length = 302

 Score =  267 bits (683), Expect = 5e-69
 Identities = 146/286 (51%), Positives = 192/286 (67%), Gaps = 4/286 (1%)
 Frame = +3

Query: 171  ISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHP 350
            I++    +AFTTP NYA RLS ++ L  ++PL CP+I  E T QT+SS+  +L +P+S  
Sbjct: 14   ITTNKPTVAFTTPPNYATRLSHLLTLKSFTPLWCPTITTEPTQQTLSSLALHL-SPHS-- 70

Query: 351  GKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLV 530
                L   SA+AF SRT ITAFS A   +  P L P+ E+ F I+ALGKD EL+D +FL+
Sbjct: 71   ----LSLLSAIAFPSRTAITAFSTAALSLTTPLLPPR-EDTFIIAALGKDVELIDSTFLL 125

Query: 531  KIC-ENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAK 707
              C ++ S + VL+P IATP GLV  LG G+GRK                  DFL +L  
Sbjct: 126  TFCGDDISWVNVLVPTIATPSGLVQLLGTGRGRKVLCPVPRVVGLEEPPVVPDFLRELEG 185

Query: 708  KGWIPVRVNAYKTRWAGPKCAEVV---KNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEM 878
             GW+P+RV+AY+TRW GP C + V      G LDA+VFTS+GEVEG LKSL+E G +WEM
Sbjct: 186  AGWVPIRVDAYETRWLGPACGKGVVELSEGGLLDAMVFTSSGEVEGLLKSLREFGWDWEM 245

Query: 879  MRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALD 1016
            +R+R+P +VVAAHGPVTA GAE+LGV +DVVS +FDSF GV+DA++
Sbjct: 246  VRRRWPHLVVAAHGPVTAAGAERLGVTVDVVSGRFDSFQGVVDAVE 291


>gb|ESW10428.1| hypothetical protein PHAVU_009G208600g [Phaseolus vulgaris]
          Length = 280

 Score =  264 bits (674), Expect = 5e-68
 Identities = 140/290 (48%), Positives = 189/290 (65%), Gaps = 3/290 (1%)
 Frame = +3

Query: 171  ISSRSRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHP 350
            +S  +  +AFTTP NYA RLS ++ L  ++PL CP+++++  P T++      L+P+S  
Sbjct: 1    MSLHNPTVAFTTPPNYAARLSNLLSLSAYTPLWCPTLLIQPLPSTLAPF----LSPHS-- 54

Query: 351  GKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLV 530
                L  FSA+AFTSRT I AF +A T +  PPL P+G   FT++ALGKD++L+D  FL 
Sbjct: 55   ----LHRFSAIAFTSRTAIQAFLQAATSLSHPPLPPEGST-FTLAALGKDADLIDAQFLS 109

Query: 531  KICENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKK 710
              C N +R+ VL+P  ATP  L  +LG G GR                    FL +L + 
Sbjct: 110  AFCSNSNRLCVLVPPTATPSALAAALGDGCGRGVLCPVPRVIGVNEPPVVPGFLEELRRG 169

Query: 711  GWIPVRVNAYKTRWAGPKCAEVV---KNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMM 881
             W+PVRV AY+TRWAGP CAE +      G LDA+VFTST EVEG L+SLK+ GL +  +
Sbjct: 170  RWVPVRVEAYETRWAGPGCAEGIVRASEEGGLDAVVFTSTAEVEGLLQSLKDFGLGFADL 229

Query: 882  RKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNK 1031
            R+R PR+VVAAHGPVTA GA++LGVE+DVVSS+F SFDGVID L+  +++
Sbjct: 230  RRRCPRLVVAAHGPVTAAGAQRLGVEVDVVSSRFGSFDGVIDVLNVTFSR 279


>gb|EOY29632.1| Uncharacterized protein TCM_037120 [Theobroma cacao]
          Length = 301

 Score =  260 bits (665), Expect = 6e-67
 Identities = 141/280 (50%), Positives = 182/280 (65%), Gaps = 5/280 (1%)
 Frame = +3

Query: 192  IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371
            + FTTP NYA RLS ++ L G +PL CP+I    TP ++S+     L+P+S      L  
Sbjct: 20   VIFTTPPNYAARLSNLLTLKGHTPLWCPTITTHPTPHSLSTH----LSPHS------LSL 69

Query: 372  FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551
             SA+ F SR  IT+FS A   + KP L   G   F ++ALGKDSEL++  F+ +IC N  
Sbjct: 70   LSAITFPSRASITSFSLAALSLPKPLLPSHGPT-FILAALGKDSELINTPFISQICSNLQ 128

Query: 552  RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRV 731
            RIKVL+P  ATP  L  SLG G GR+                  DFL DL   GW+P+RV
Sbjct: 129  RIKVLVPPTATPNSLALSLGKGYGRRVLCPVPKVVGLNEPPVVPDFLKDLESGGWVPIRV 188

Query: 732  NAYKTRWAGPKCAEVVKNRGN-----LDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFP 896
            +AY+TRW GP CAE V  +G      ++A+VFTS+GEVEGFLKSL+E G +W M+R+R+ 
Sbjct: 189  DAYETRWVGPSCAEEVVRKGEEHEEEVNAVVFTSSGEVEGFLKSLREFGWDWGMVRRRWS 248

Query: 897  RMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALD 1016
            R+VVAAHGPVTA GA++LGV++DVVSS FDSF GV+DALD
Sbjct: 249  RLVVAAHGPVTAVGAKRLGVDVDVVSSNFDSFQGVVDALD 288


>ref|XP_006849857.1| hypothetical protein AMTR_s00022p00059330 [Amborella trichopoda]
            gi|548853455|gb|ERN11438.1| hypothetical protein
            AMTR_s00022p00059330 [Amborella trichopoda]
          Length = 308

 Score =  246 bits (629), Expect = 9e-63
 Identities = 133/285 (46%), Positives = 183/285 (64%), Gaps = 3/285 (1%)
 Frame = +3

Query: 186  RLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFL 365
            R + +TTP +YA  L + ++ H   PL  P+I V  TP T + I+N+L        K+ +
Sbjct: 28   RHVVYTTPAHYAPSLERRLRAHQAHPLWLPTISVLSTPHTKTLIRNHLQ-------KTLI 80

Query: 366  EDFSALAFTSRTGITAFSEALTPI---EKPPLDPKGENPFTISALGKDSELLDESFLVKI 536
               SA+AFTSR  I +FSEAL+ I     PPL  +GE PF + ALG+DSELLD+ F++ +
Sbjct: 81   NQSSAIAFTSRAAINSFSEALSEILTLNGPPLSGEGE-PFYLCALGRDSELLDQRFVLSL 139

Query: 537  CENPSRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGW 716
            CEN  R++V +P + TPK + + LG G  R+                  DFL  L  + W
Sbjct: 140  CENLDRVRVFVPSVPTPKAMAEELGDGLNREILCLVPLVTGLDEPSVVPDFLGALKDQNW 199

Query: 717  IPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFP 896
             P+R+N+Y+TRWAG  CAE + +    DAIVFTST EV+G +K LK+LG  W M+R++ P
Sbjct: 200  RPIRLNSYETRWAGLDCAEFLISDEASDAIVFTSTAEVQGLIKGLKKLGFEWVMVREKRP 259

Query: 897  RMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNK 1031
             +VVAAHGPVTA GA+KLGV+ID+VSS+FDSFDGV++AL  R+ K
Sbjct: 260  GLVVAAHGPVTALGAKKLGVDIDLVSSRFDSFDGVVNALAQRFMK 304


>ref|NP_001145235.1| hypothetical protein [Zea mays] gi|195653353|gb|ACG46144.1|
            hypothetical protein [Zea mays]
            gi|414589847|tpg|DAA40418.1| TPA: hypothetical protein
            ZEAMMB73_114348 [Zea mays]
          Length = 297

 Score =  229 bits (585), Expect = 1e-57
 Identities = 130/288 (45%), Positives = 175/288 (60%), Gaps = 8/288 (2%)
 Frame = +3

Query: 174  SSRSRLIAFTTPHN-----YAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTP 338
            S   R +AFTTP       Y GRL  +++  G  P++ P+I V   P     ++ YLL  
Sbjct: 10   SLAGRRVAFTTPQTGGGGAYGGRLGALLRQRGAHPVAVPTIAVH--PHDPDRLRPYLLP- 66

Query: 339  NSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDE 518
                  S L+ F+ALAFTSR+GI+AF+ AL+   +P L      PFT++ALG D++LLD 
Sbjct: 67   ------SALDPFAALAFTSRSGISAFARALSSSHRP-LSHASALPFTVAALGSDADLLDH 119

Query: 519  SFLVKICENP-SRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLN 695
            +FL ++C +  +R+ VL+P + TP GLV++LG G GR+                  DFL 
Sbjct: 120  AFLSRLCGDAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPVVPDFLA 179

Query: 696  DLAKKGWIPVRVNAYKTRWAGPKCAEVV--KNRGNLDAIVFTSTGEVEGFLKSLKELGLN 869
             L   GW+ VR  AY T WAGP+CAE +   +   LDA+VFTST EVEG LK L+ +G  
Sbjct: 180  GLEAAGWVAVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKGLEAVGWT 239

Query: 870  WEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
            W  +  R+P MVVAAHGPVTA GA  LGVE+D+VS++F SF GV+DAL
Sbjct: 240  WARLAARWPGMVVAAHGPVTAGGARSLGVEVDIVSTRFSSFHGVVDAL 287


>ref|XP_004957231.1| PREDICTED: uncharacterized protein LOC101779932 [Setaria italica]
          Length = 299

 Score =  227 bits (578), Expect = 7e-57
 Identities = 131/285 (45%), Positives = 172/285 (60%), Gaps = 9/285 (3%)
 Frame = +3

Query: 186  RLIAFTTPH----NYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPG 353
            R +AFTTP     +Y GRL  +++  G  P+  P+I V+  P     ++ +LL     PG
Sbjct: 14   RRVAFTTPQTGGASYGGRLGALLRQRGARPVPVPTIAVQ--PHDPDRLRPFLL-----PG 66

Query: 354  KSFLEDFSALAFTSRTGITAFSEALTPIEKP--PLDPKGENPFTISALGKDSELLDESFL 527
               L+ F+ALAFTSR+GI+AF+ AL P      PL      PFT++ALG D++LLD +FL
Sbjct: 67   A--LDPFAALAFTSRSGISAFARALPPSSSHHRPLSDASALPFTVAALGSDADLLDRAFL 124

Query: 528  VKICENP-SRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLA 704
             ++C +  +R+ VL+P + TP GLV++LG G GR+                  DFL  L 
Sbjct: 125  SRLCGDAGTRVAVLVPAVPTPAGLVEALGPGSGRRVLCPVPDVVGLREPPVVPDFLAGLE 184

Query: 705  KKGWIPVRVNAYKTRWAGPKCAEVV--KNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEM 878
              GW+ VR  AY T WAGP CAE +   +    DA+VFTST EVEG LK L   G  W  
Sbjct: 185  AAGWVAVRAPAYTTSWAGPGCAEALVGADAAAPDAVVFTSTAEVEGLLKGLDAAGWTWAR 244

Query: 879  MRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
            +R R+P MVVAAHGPVTA GA  LGVE+DVVS++F SF GV+DAL
Sbjct: 245  LRARWPGMVVAAHGPVTAAGARSLGVEVDVVSARFSSFHGVVDAL 289


>ref|XP_002462593.1| hypothetical protein SORBIDRAFT_02g028710 [Sorghum bicolor]
            gi|241925970|gb|EER99114.1| hypothetical protein
            SORBIDRAFT_02g028710 [Sorghum bicolor]
          Length = 299

 Score =  226 bits (577), Expect = 9e-57
 Identities = 131/289 (45%), Positives = 171/289 (59%), Gaps = 9/289 (3%)
 Frame = +3

Query: 174  SSRSRLIAFTTPHN-----YAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTP 338
            S   R +AFTTP       Y GRL  +++  G  P+  P+I V   P     ++ +LL  
Sbjct: 10   SLTGRRVAFTTPQTGGGGAYGGRLGALLRQRGAHPVPVPTIAVH--PHDPDRLRPFLL-- 65

Query: 339  NSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDE 518
               PG   L+ F+ALAFTSR+GI+AF+ AL+     PL      PFT++ALG D++LLD 
Sbjct: 66   ---PGA--LDPFAALAFTSRSGISAFARALSSSSHHPLADASALPFTVAALGSDADLLDH 120

Query: 519  SFLVKICENPS--RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFL 692
            +FL ++C   +  R+ VL+P + TP GLV++LG G GR+                  DFL
Sbjct: 121  AFLSRLCGAAAGTRVSVLVPDVPTPAGLVEALGRGSGRRVLCPVPDVVGLREPPVVPDFL 180

Query: 693  NDLAKKGWIPVRVNAYKTRWAGPKCAEVV--KNRGNLDAIVFTSTGEVEGFLKSLKELGL 866
              L   GW+ VR  AY T WAGP+CAE +   +   LDA+VFTST EVEG LK L+  G 
Sbjct: 181  AGLEAAGWVAVRAPAYTTCWAGPRCAEALVDPDAAPLDAVVFTSTAEVEGLLKRLESAGW 240

Query: 867  NWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
             W  +  R P MVVAAHGPVTA GA  LGVE+DVVS++F SF GV+DAL
Sbjct: 241  TWARLTARCPGMVVAAHGPVTAGGARSLGVEVDVVSARFSSFHGVVDAL 289


>gb|EEC84811.1| hypothetical protein OsI_31881 [Oryza sativa Indica Group]
          Length = 301

 Score =  225 bits (573), Expect = 3e-56
 Identities = 129/297 (43%), Positives = 170/297 (57%), Gaps = 16/297 (5%)
 Frame = +3

Query: 171  ISSRSRLIAFTTPHN------YAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLL 332
            +S   R +AFTTP        Y GRL  +++  G  P+  P+I +      I       L
Sbjct: 7    LSLAGRRVAFTTPQTDAGGGGYGGRLHAILRQRGARPVPVPTIAIRAHDPDI-------L 59

Query: 333  TPNSHPGKSFLEDFSALAFTSRTGITAFSEALTPIEK---------PPLDPKGENPFTIS 485
             P   PG   L+ F+ALAFTSR+GI+AFS AL P            P  D     PFT++
Sbjct: 60   RPFVAPGG--LDAFAALAFTSRSGISAFSRALLPSSSSSPARRPRHPVSDAATALPFTVA 117

Query: 486  ALGKDSELLDESFLVKICENPS-RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXX 662
            ALG D++LLD +FL ++C +   R+ VL+P + TP GLV++LG G GR+           
Sbjct: 118  ALGSDADLLDAAFLSRLCGDAGGRVSVLVPDVPTPAGLVEALGSGSGRRVLCPVPDVVGL 177

Query: 663  XXXXXXXDFLNDLAKKGWIPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGFL 842
                    FL+ L   GW+ VR  AY T WAGP+CAE + +    DA+VFTST EVEG L
Sbjct: 178  REPPVVPGFLSGLEAAGWVAVRAPAYVTCWAGPRCAEALVDAAAPDAVVFTSTAEVEGLL 237

Query: 843  KSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
            K L   G +W  +R R+PRMVVAAHGPVTA G  +LG+E+DVV ++F SF GV+DAL
Sbjct: 238  KGLDAAGWSWPRLRARWPRMVVAAHGPVTADGVRRLGIEVDVVGARFSSFHGVLDAL 294


>gb|EXC12523.1| hypothetical protein L484_012337 [Morus notabilis]
          Length = 183

 Score =  128 bits (322), Expect = 3e-27
 Identities = 75/185 (40%), Positives = 106/185 (57%)
 Frame = +3

Query: 192 IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371
           +AFTTP NYAGRLS ++  +G +PLS P+++VE TP+TIS++++YL  P+S         
Sbjct: 17  VAFTTPPNYAGRLSHLLAANGLNPLSSPTLLVEPTPRTISALKSYLPPPHS--------- 67

Query: 372 FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551
            +AL          FS   + +E P L P G+  FTI+ALGKDSELL + +L K  +N  
Sbjct: 68  LNAL----------FSAVASDLECPLLSPFGDREFTIAALGKDSELLYDEYLTKFGKNRD 117

Query: 552 RIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRV 731
           RI+VL+P +A P GLV SL  G+ ++                  +FL +L    WIPV V
Sbjct: 118 RIRVLVPLVAMPSGLVRSLRDGRRQRVLCTVPIIVDLEEPPVVPNFLRELESSRWIPVLV 177

Query: 732 NAYKT 746
             Y+T
Sbjct: 178 GTYET 182


>ref|XP_002980789.1| hypothetical protein SELMODRAFT_420371 [Selaginella moellendorffii]
            gi|300151328|gb|EFJ17974.1| hypothetical protein
            SELMODRAFT_420371 [Selaginella moellendorffii]
          Length = 231

 Score =  125 bits (313), Expect = 4e-26
 Identities = 83/229 (36%), Positives = 120/229 (52%), Gaps = 1/229 (0%)
 Frame = +3

Query: 357  SFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKI 536
            S L  +S +AFTSR+GI + + AL  +        G     + ALGKD+EL+ E  L K 
Sbjct: 13   SALHTYSCIAFTSRSGIASIAHALEEVRL-----SGCAELVVGALGKDAELIQELDLFKE 67

Query: 537  CENPSRIKVLIPQIATPKGLVDSLGLGQGRK-XXXXXXXXXXXXXXXXXXDFLNDLAKKG 713
                 R+ V++P +ATP  LV+ LG G GR+                   +F+  L + G
Sbjct: 68   HREQQRLTVVVPLVATPDALVEELGDGAGRRLLCPVPYVCGGLSEPDVVPNFVAALQRHG 127

Query: 714  WIPVRVNAYKTRWAGPKCAEVVKNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRF 893
            W   R++AY T W G      +   G +DA+VFTST EVEG L +L+   L    +   +
Sbjct: 128  WDVERLDAYATSWTGSASVTPLL-AGAVDALVFTSTAEVEGLLMALQAHHLT---LASLW 183

Query: 894  PRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNKSLD 1040
            P  V+ A GPVTA GA++LGV++DV+  +F+ F  + D L   + K LD
Sbjct: 184  P-CVLVAFGPVTARGAKQLGVDVDVIGHRFNGFTDLADLLVSHFRKRLD 231


>ref|XP_006381898.1| hypothetical protein POPTR_0006s20320g [Populus trichocarpa]
            gi|550336711|gb|ERP59695.1| hypothetical protein
            POPTR_0006s20320g [Populus trichocarpa]
          Length = 150

 Score =  118 bits (296), Expect = 4e-24
 Identities = 72/154 (46%), Positives = 87/154 (56%), Gaps = 2/154 (1%)
 Frame = +3

Query: 549  SRIKVLIPQIATPKGLVDSLGLGQGRKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVR 728
            SR+KVL+P I T  G V  LG G+ RK                  DFL +L         
Sbjct: 12   SRVKVLVPTITTRNG-VHLLGTGRCRKVLCPVPRVVGLEEPPVVPDFLRELE-------- 62

Query: 729  VNAYKTRWAGPKCAEVVK--NRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRM 902
                         A VV+  + G LDA+VF S+GEVEG LKSLKELG  WEMMR+R+P +
Sbjct: 63   -------------AAVVERSDEGLLDAMVFASSGEVEGLLKSLKELGWEWEMMRRRWPNL 109

Query: 903  VVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVI 1004
            VV AHGPVTA GAE LGV ++VVS +FDSF G +
Sbjct: 110  VVVAHGPVTAAGAESLGVNVNVVSERFDSFQGTV 143


>ref|XP_002961862.1| hypothetical protein SELMODRAFT_403240 [Selaginella moellendorffii]
            gi|300170521|gb|EFJ37122.1| hypothetical protein
            SELMODRAFT_403240 [Selaginella moellendorffii]
          Length = 262

 Score =  112 bits (281), Expect = 2e-22
 Identities = 77/216 (35%), Positives = 112/216 (51%), Gaps = 1/216 (0%)
 Frame = +3

Query: 396  RTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPSRIKVLIPQ 575
            ++GI + + AL  +        G     + ALGKD+EL+ E  L K      R+ V++P+
Sbjct: 57   QSGIASIAHALGEVRL-----SGCAELVVGALGKDAELIQELDLFKEHREQQRLTVVVPR 111

Query: 576  IATPKGLVDSLGLGQGRK-XXXXXXXXXXXXXXXXXXDFLNDLAKKGWIPVRVNAYKTRW 752
            +ATP  LV+ LG G GR+                   +F+  L + GW   R++AY T W
Sbjct: 112  VATPDALVEELGDGAGRRLLCPVPYACGGLSEPDVVPNFVAALQRHGWDVERLDAYATSW 171

Query: 753  AGPKCAEVVKNRGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRMVVAAHGPVTA 932
             G      +   G +DA+VFTST EVEG L +L    L    +   +P  V+ A GPVTA
Sbjct: 172  TGSASVTPLL-AGAVDALVFTSTAEVEGLLMALHAHHLT---IASLWP-CVLVAFGPVTA 226

Query: 933  CGAEKLGVEIDVVSSKFDSFDGVIDALDYRWNKSLD 1040
             GA++LGV++DVV  +F+SF  + D L   + K LD
Sbjct: 227  RGAKRLGVDVDVVGHRFNSFTDLADLLVSHFRKRLD 262


>ref|XP_002309320.1| predicted protein [Populus trichocarpa]
            gi|224099845|ref|XP_002334435.1| predicted protein
            [Populus trichocarpa]
          Length = 74

 Score =  100 bits (249), Expect = 1e-18
 Identities = 46/67 (68%), Positives = 55/67 (82%)
 Frame = +3

Query: 804  IVFTSTGEVEGFLKSLKELGLNWEMMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKF 983
            +VF S+GEVEG LKSLKELG  WEMMR+R+P +VV AHGPVTA GAE LGV ++VVS +F
Sbjct: 1    MVFASSGEVEGLLKSLKELGWEWEMMRRRWPNLVVVAHGPVTAAGAESLGVNVNVVSERF 60

Query: 984  DSFDGVI 1004
            DSF G +
Sbjct: 61   DSFQGTV 67


>ref|YP_323579.1| uroporphyrinogen-III synthase [Anabaena variabilis ATCC 29413]
            gi|499639080|ref|WP_011319814.1| uroporphyrinogen III
            synthase [Anabaena variabilis] gi|75703008|gb|ABA22684.1|
            uroporphyrinogen-III synthase [Anabaena variabilis ATCC
            29413]
          Length = 276

 Score =  100 bits (248), Expect = 1e-18
 Identities = 86/280 (30%), Positives = 123/280 (43%), Gaps = 6/280 (2%)
 Frame = +3

Query: 192  IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371
            I  T P NYA RLS  I   G  P+  P+I     P   S +   +         S + +
Sbjct: 18   ILVTAPRNYASRLSAQIICKGGLPILMPTIETCYLPN-FSQLDAVI---------SCINE 67

Query: 372  FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551
            F  +AFTSR GI AF E L  ++   +         + ALGKD ++L   F         
Sbjct: 68   FDWIAFTSRNGIIAFFERLHNLD---ISINKLQNCQLCALGKDIDVLLSLF--------G 116

Query: 552  RIKVLIPQIATPKGLVDSLGLGQG---RKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIP 722
            R+  LIP  ++P G+V       G   +K                  +F+ DL K G   
Sbjct: 117  RVD-LIPDESSPAGIVAKFSQIHGISRQKILVPVPEVIGIPEPNIVPNFIKDLEKLGMQV 175

Query: 723  VRVNAYKTRWAGPKCAEVVKN---RGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRF 893
            +RV  Y T+        V  N   +G +D I F+ST E+E FLK            +  F
Sbjct: 176  IRVPTYITQSLDKNIYSVEINLIQQGLIDVIAFSSTAEIESFLKMFNS--------KNEF 227

Query: 894  PRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
               VVA  GP TA  A+KLG+++ +VS+ F SF+G ++A+
Sbjct: 228  QHCVVACFGPYTAANAQKLGLDVSLVSTDFSSFEGFVEAI 267


>ref|WP_016872683.1| hypothetical protein [Chlorogloeopsis fritschii]
          Length = 313

 Score = 95.1 bits (235), Expect = 4e-17
 Identities = 85/286 (29%), Positives = 128/286 (44%), Gaps = 9/286 (3%)
 Frame = +3

Query: 183  SRLIAFTTPHNYAGRLSQVIKLHGWSPLSCPSI---IVEITPQTISSIQNYLLTPNSHPG 353
            S+ I  T P NYA RLS+ +   G  P+  P+I   ++E   Q   ++Q           
Sbjct: 38   SKRILVTAPRNYAARLSEQLINQGALPILMPTIETCVLENFAQLDIALQK---------- 87

Query: 354  KSFLEDFSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVK 533
               ++ F  +AFTSR GI AF + L   E   L+ +      +SA+GKD+E L  +F V+
Sbjct: 88   ---IDTFDWIAFTSRNGIDAFFQRL---ESLGLNHRVLKNCRLSAIGKDAERL-AAFGVE 140

Query: 534  ICENPSRIKVLIPQIATPKGLVDSLGLG---QGRKXXXXXXXXXXXXXXXXXXDFLNDLA 704
            +         LIPQ  +P+G++  L      QG+K                  +F+  L 
Sbjct: 141  VD--------LIPQQPSPQGIIAELAQIPNIQGKKILVPVPEVVGVPEPDVVPNFVAGLK 192

Query: 705  KKGWIPVRVNAYKTRWAGPKCAEVVKN---RGNLDAIVFTSTGEVEGFLKSLKELGLNWE 875
              G    RV  Y TR       EV  N   +G +D I F+ST EV  FL+          
Sbjct: 193  NLGMSVTRVPTYLTRCLDKSFYEVELNLIRQGKVDVIAFSSTAEVASFLQMFTA------ 246

Query: 876  MMRKRFPRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
              +  + + V+A  GP TA  A KLGV + +++  + SF G  +A+
Sbjct: 247  --KADYQQCVIACFGPYTAANANKLGVNVSIIAQDYSSFAGFAEAI 290


>ref|NP_484331.1| uroporphyrinogen-III synthase [Nostoc sp. PCC 7120]
            gi|499303689|ref|WP_010994464.1| uroporphyrinogen III
            synthase [Nostoc sp. PCC 7120]
            gi|17135265|dbj|BAB77811.1| uroporphyrinogen-III synthase
            [Nostoc sp. PCC 7120]
          Length = 276

 Score = 91.7 bits (226), Expect = 5e-16
 Identities = 83/280 (29%), Positives = 121/280 (43%), Gaps = 6/280 (2%)
 Frame = +3

Query: 192  IAFTTPHNYAGRLSQVIKLHGWSPLSCPSIIVEITPQTISSIQNYLLTPNSHPGKSFLED 371
            I  T P NYA RLS  I   G  P+  P+I         S +   +         S + +
Sbjct: 18   ILVTAPRNYASRLSAQIICKGGLPILMPTIETCYL-SNFSKLDAVI---------SSINE 67

Query: 372  FSALAFTSRTGITAFSEALTPIEKPPLDPKGENPFTISALGKDSELLDESFLVKICENPS 551
            F  +AFTSR GI AF E L  ++   +         + ALGKD ++L   F         
Sbjct: 68   FDWIAFTSRNGIIAFFERLHNLD---ISITKLQNCQLCALGKDIDILLSLF--------G 116

Query: 552  RIKVLIPQIATPKGLVDSLGLGQG---RKXXXXXXXXXXXXXXXXXXDFLNDLAKKGWIP 722
            ++  LIP  ++P G+V       G   +K                  +F+ DL + G   
Sbjct: 117  KVD-LIPDESSPAGIVAEFSQICGIREQKILVPVPEVIGIPEPNIVPNFIKDLEELGMQV 175

Query: 723  VRVNAYKTRWAGPKCAEVVKN---RGNLDAIVFTSTGEVEGFLKSLKELGLNWEMMRKRF 893
            +RV AY T+        V  N   +G +D I F+ST E+E FL             +  F
Sbjct: 176  IRVPAYITQSLDKDIYSVEINLIQQGLIDIIAFSSTAEIESFLAMFNS--------KSEF 227

Query: 894  PRMVVAAHGPVTACGAEKLGVEIDVVSSKFDSFDGVIDAL 1013
               VVA  GP TA  AE+LG+ + +VS+ F SF+G ++A+
Sbjct: 228  QHCVVACFGPYTAANAEQLGLNVSIVSTDFSSFEGFVEAI 267

