Frontiers in Clinical Drug Research - Anti Infectives: Volume 3 -  - E-Book

Frontiers in Clinical Drug Research - Anti Infectives: Volume 3 E-Book

0,0
32,64 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.
Mehr erfahren.
Beschreibung

Frontiers in Clinical Drug Research – Anti infectives is an eBook series that brings updated reviews to readers interested in learning about advances in the development of pharmaceutical agents for the treatment of infectious diseases. The scope of the eBook series covers a range of topics including the chemistry, pharmacology, molecular biology and biochemistry of natural and synthetic drugs employed in the treatment of infectious diseases. Reviews in this series also include research on multi drug resistance and pre-clinical / clinical findings on novel antibiotics, vaccines, antifungal agents and antitubercular agents. Frontiers in Clinical Drug Research – Anti infectives is a valuable resource for pharmaceutical scientists and postgraduate students seeking updated and critically important information for developing clinical trials and devising research plans in the field of anti infective drug discovery and epidemiology.
The third volume of this series features reviews that cover a variety of topics including:
-Geomic mining and metabolomic techniques for developing antimcrobials
-Probiotic use in complementary antiretroviral therapy
-Anti-HIV pharmaceuticals
-Phytochemicals used for antimicrobial purposes
- Antimicrobial photodynamic therapy (APDT)

Sie lesen das E-Book in den Legimi-Apps auf:

Android
iOS
von Legimi
zertifizierten E-Readern

Seitenzahl: 458

Veröffentlichungsjahr: 2017

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Table of Contents
BENTHAM SCIENCE PUBLISHERS LTD.
End User License Agreement (for non-institutional, personal use)
Usage Rules:
Disclaimer:
Limitation of Liability:
General:
PREFACE
List of Contributors
Modern Approaches to Genome Mining for the Development of New Anti-infectives: In Silico Gene Prediction and Experimental Metabolomics
INTRODUCTION
Silent Biosynthetic Gene Clusters
What is Genome Mining?
Genome Mining Approaches and the Awakening of Silent Gene Clusters
Homologous vs. Heterologous Expression
Bioinformatics Tools and Databases for Secondary Metabolite Discovery
ClustScan
ClusterFinder
AntiSMASH
PRISM
Pep2Path
NRPquest
Genome-to-Natural Products (GNP)
The Motif Density Method (MDM)
Additional Bioinformatics Platforms
Databases for Genome Mining
Integrated Microbial Genomes-Atlas of Biosynthetic Clusters (IMG-ABC)
StreptomeDB
DoBISCUIT
Clustermine360
Norine
Protein Data Bank (PDB)
Experimental Metabolomics for the Identification of Secondary Metabolites
Mass Spectrometry (MS)
MS/MS Glycogenomics and Peptidogenomics
Proteomining
MALDI-based Imaging Mass Spectrometry (IMS)
Experimental Strategies to Awaken the Homologous Expression of Silent Gene Clusters
The OSMAC Approach to Awaken Silent Clusters
Awakening of Silent Clusters by Interspecies Co-culturing
Identification of Novel Secondary Metabolite Biosynthetic Gene Clusters
Predicted Biosynthetic Potential of Actinobacteria
CONCLUDING REMARKS
CONFLICT OF INTEREST
ACKNOWLEDGEMENTS
REFERENCES
A Novel Complementary Approach Using New Probiotic Product for the Improvement of HIV Therapy
Introduction
HIV and Immunodeficiency, Opportunistic Infections and AIDS Associated Diseases, Antiretroviral Therapy
Cell and Tissue Latent Reservoirs, Eradication of Virus
Microbiome and Mucosal Immunity, How it is Destroyed by HIV/AIDS
Therapeutical Potential of Probiotics and/or their Products in HIV Patients
CONCLUDING REMARKS
Abbreviations
CONFLICT OF INTEREST
ACKNOWLEDGEMENTS
REFERENCES
Anti-HIV Agents: The Way Forward for the Complete Eradication of the Virus
INTRODUCTION
ANTIRETROVIRAL AGENTS
Pre-exposure Prophylaxis (PrEP)
Early Use of Antiretroviral Therapy
Post-exposure Prophylaxis (PEP)
VIRUCIDES AS A PREVENTION TOOL FROM THE SPREAD OF HIV
TARGETTING HIV PROVIRUS
STEM CELL TRANSPLANTATION
ANTI-HIV GENE THERAPY
ERADICATION OF HIV THROUGH EFFECTIVE VACCINES
MONOCLONAL ANTIBODIES
THE ROLE OF COMPLEMNETARY AND ALTERNATIVE MEDICINE IN THE ERADICATION OF HIV
CONCLUSION AND FUTURE DIRECTIONS
CONFLICT OF INTEREST
ACKNOWLEDGEMENTS
References
Essential Oils, Polyphenols and Glycosides: Secondary Plant Metabolites against Human Pathogenic Microbes
ESSENTIAL OILS
In vitro Anti-microbial Activities
Anti-viral properties
Anti-bacterial Properties
Anti-fungal Properties
In vivo Anti-microbial Activities of EOs
PHENOLIC COMPOUNDS
Anti-microbial Activity of Simple Phenols
Anti-bacterial Activity
Anti-fungal Activity
Anti-viral Activity
Anti-microbial Activity of Polyphenols
Epigallocatechin-3-gallate (EGCG)
Anti-bacterial Activity
Anti-fungal Activity
Anti-viral Activity
Anthocyanins, Anthocyanidins and Proanthocyanidins
Anti-bacterial Activity
Anti-fungal Activity
Anti-viral Activity
Achillea spp. and Hypericum spp. Phenolic Compounds and their Relevant Activities
GLYCOSIDES GLUCOSINOLATES
RECENT ADVANCES ON BIOACTIVE DELIVERY: NANO-ENCAPSULATION AND MICROENCAPSULATION
CONCLUDING REMARKS
CONFLICT OF INTEREST
ACKNOWLEDGEMENTS
REFERENCES
Photosensitizers: An Effective Alternative Approach to Microbial Pathogen
INTRODUCTION
Antimicrobial Photodynamic Therapy
History of Photodynamic Therapy
Components of Photosensitization
Light Source
Photosensitizers
Commonly Used Photosensitizers in APDT
Cationic Phthalocyanines
Nonporphyrin Based Photosensitizers
Anthraquinones
Phenothiazinium Dyes
Xanthenes
Curcuminoids
Mechanism of Action
Photodynamic Inactivation of Microbial Cells
Pre-clinical and Clinical Application of Antimicrobial Photodynamic Therapy
In Vitro Studies of Photodynamic Inhibition of Fungi
Eradication of viruses by photodynamic therapy
Photodynamic Effect on Protozoans
Perspectives and Future Directions
CONFLICT OF INTEREST
ACKNOWLEDGEMENTS
REFERENCES

Frontiers in Clinical Drug Research-Anti Infectives

(Volume 3)

Edited by

Atta-ur-Rahman, FRS

Honorary Life Fellow
Kings College
University of Cambridge
Cambridge
UK

BENTHAM SCIENCE PUBLISHERS LTD.

End User License Agreement (for non-institutional, personal use)

This is an agreement between you and Bentham Science Publishers Ltd. Please read this License Agreement carefully before using the ebook/echapter/ejournal (“Work”). Your use of the Work constitutes your agreement to the terms and conditions set forth in this License Agreement. If you do not agree to these terms and conditions then you should not use the Work.

Bentham Science Publishers agrees to grant you a non-exclusive, non-transferable limited license to use the Work subject to and in accordance with the following terms and conditions. This License Agreement is for non-library, personal use only. For a library / institutional / multi user license in respect of the Work, please contact: [email protected].

Usage Rules:

All rights reserved: The Work is the subject of copyright and Bentham Science Publishers either owns the Work (and the copyright in it) or is licensed to distribute the Work. You shall not copy, reproduce, modify, remove, delete, augment, add to, publish, transmit, sell, resell, create derivative works from, or in any way exploit the Work or make the Work available for others to do any of the same, in any form or by any means, in whole or in part, in each case without the prior written permission of Bentham Science Publishers, unless stated otherwise in this License Agreement.You may download a copy of the Work on one occasion to one personal computer (including tablet, laptop, desktop, or other such devices). You may make one back-up copy of the Work to avoid losing it. The following DRM (Digital Rights Management) policy may also be applicable to the Work at Bentham Science Publishers’ election, acting in its sole discretion:25 ‘copy’ commands can be executed every 7 days in respect of the Work. The text selected for copying cannot extend to more than a single page. Each time a text ‘copy’ command is executed, irrespective of whether the text selection is made from within one page or from separate pages, it will be considered as a separate / individual ‘copy’ command.25 pages only from the Work can be printed every 7 days.

3. The unauthorised use or distribution of copyrighted or other proprietary content is illegal and could subject you to liability for substantial money damages. You will be liable for any damage resulting from your misuse of the Work or any violation of this License Agreement, including any infringement by you of copyrights or proprietary rights.

Disclaimer:

Bentham Science Publishers does not guarantee that the information in the Work is error-free, or warrant that it will meet your requirements or that access to the Work will be uninterrupted or error-free. The Work is provided "as is" without warranty of any kind, either express or implied or statutory, including, without limitation, implied warranties of merchantability and fitness for a particular purpose. The entire risk as to the results and performance of the Work is assumed by you. No responsibility is assumed by Bentham Science Publishers, its staff, editors and/or authors for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products instruction, advertisements or ideas contained in the Work.

Limitation of Liability:

In no event will Bentham Science Publishers, its staff, editors and/or authors, be liable for any damages, including, without limitation, special, incidental and/or consequential damages and/or damages for lost data and/or profits arising out of (whether directly or indirectly) the use or inability to use the Work. The entire liability of Bentham Science Publishers shall be limited to the amount actually paid by you for the Work.

General:

Any dispute or claim arising out of or in connection with this License Agreement or the Work (including non-contractual disputes or claims) will be governed by and construed in accordance with the laws of the U.A.E. as applied in the Emirate of Dubai. Each party agrees that the courts of the Emirate of Dubai shall have exclusive jurisdiction to settle any dispute or claim arising out of or in connection with this License Agreement or the Work (including non-contractual disputes or claims).Your rights under this License Agreement will automatically terminate without notice and without the need for a court order if at any point you breach any terms of this License Agreement. In no event will any delay or failure by Bentham Science Publishers in enforcing your compliance with this License Agreement constitute a waiver of any of its rights.You acknowledge that you have read this License Agreement, and agree to be bound by its terms and conditions. To the extent that any other terms and conditions presented on any website of Bentham Science Publishers conflict with, or are inconsistent with, the terms and conditions set out in this License Agreement, you acknowledge that the terms and conditions set out in this License Agreement shall prevail.

Bentham Science Publishers Ltd. Executive Suite Y - 2 PO Box 7917, Saif Zone Sharjah, U.A.E. Email: [email protected]

PREFACE

The third volume of Frontiers in Clinical Drug Research – Anti Infectives comprises five chapters that cover genomic mining for anti-infectives, HIV treatments and photosensitizers for anti-microbial activity.

In the first chapter, Sanchez and colleagues review the research in genomic data mining to predict gene clusters that are responsible for coding for secondary metabolites that exhibit antimicrobial activity. They give information about the tools in metabololomics used for the purpose and also give examples of links established through predictive methods. The authors also provide information about research techniques used in metabolomics adding value to their work for readers.

In the second chapter, Constantin V. Sobol discusses recent developments concerning a new probiotic prophylactic for HIV treatment. This probiotic stimulates the growth of microflora that increase the concentration of antibodies in the mucosa, thereby boosting the immune system. Continuing with the theme of HIV/AIDS treatments, Al-Jabri et al., have contributed a review on the status of HIV medications that are geared towards eliminating the virus from the body. This review is a reminder to readers that the hope for finding a cure for AIDS, while difficult, is still alive. Readers will find the list of drugs covered in this review useful for keeping their knowledge updated on current anti-HIV medicines.

In chapter 4, Sampaio et al. provide a review of natural products (essential oils, glycosides, polyphenols and other secondary metabolites) that can be used to treat microbial infections in vivo. In the last chapter, Bakthavatchalu and Noel present an interesting review of the use of photosensitizers for treating bacterial infections. Light based treatments (Antimicrobial Photodynamic Therapy, APDT) are a good way to combat drug resistant pathogens.

I would like to acknowledge the efforts of all the contributors for their outstanding contributions. I am also thankful to the team of Bentham Science Publishers, especially Dr. Faryal Sami and Mr. Shehzad Naqvi led by Mr. Mahmood Alam, Director Bentham Science Publishers for their efforts.

Prof. Atta-ur-Rahman, FRS Honorary Life Fellow Kings College University of Cambridge Cambridge UK

List of Contributors

Alfredo AiresCentre for Research and Technology of Agro-Environmental and Biological Sciences (CITAB), Vila Real, Portugal Department of Agronomy, UTAD, Quinta dos Prados, 5001-801 Vila Real, PortugalAli A. Al-JabriDivision of Immunology, Department of Microbiology and Immunology, College of Medicine and Health Sciences, SQU, OmanAmélia M. SilvaDepartment of Biology and Environment, University of Trás-os-Montes e Alto Douro (UTAD), Quinta dos Prados, 5001-801 Vila Real, Portugal Centre for Research and Technology of Agro-Environmental and Biological Sciences (CITAB), Vila Real, PortugalAna C. SampaioDepartment of Biology and Environment, University of Trás-os-Montes e Alto Douro (UTAD), Quinta dos Prados, 5001-801 Vila Real, Portugal Centre for Research and Technology of Agro-Environmental and Biological Sciences (CITAB), Vila Real, PortugalConstantin V. SobolSechenov Institute of Physiology and Biochemistry, Russian Academy of Sciences, Saint-Petersburg, RussiaElena Martinez-KlimovaInstituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, México D.F. 04510, MexicoEliana B. SoutoDepartment of Pharmaceutical Technology, Faculty of Pharmacy, University of Coimbra (FFUC), Pólo das Ciências da Saúde, Azinhaga de Santa Comba, 3000-548 Coimbra, Portugal REQUIMTE/LAQV, Group of Pharmaceutical Technology, FFUC, Coimbra, PortugalElias A. SaidDivision of Immunology, Department of Microbiology and Immunology, College of Medicine and Health Sciences, SQU, OmanGahamanyi NoelFaculty of Science and Technology, Department of Biomedical Laboratory Sciences, Catholic University of Rwanda, P.O.Box 49 Butare/Huye, RwandaMohammed S. Al-BalushiDivision of Immunology, Department of Microbiology and Immunology, College of Medicine and Health Sciences, SQU, OmanSara Centeno-LeijaCatedrático CONACYT, Laboratorio de Bioingeniería, Universidad de Colima, Km. 9 Carretera Coquimatlán-Colima, C.P. 28400 Coquimatlán, Colima, MéxicoSasirekha BakthavatchaluDepartment of Microbiology, Acharya Bangalore B School, Off Magadi road, Bangalore-560 091, IndiaSergio SánchezInstituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, México D.F. 04510, MexicoSidgi S. HassonDivision of Immunology, Department of Microbiology and Immunology, College of Medicine and Health Sciences, SQU, Oman

Modern Approaches to Genome Mining for the Development of New Anti-infectives: In Silico Gene Prediction and Experimental Metabolomics

INTRODUCTION

More than half of the known natural products that have antimicrobial, antiviral or antitumor activity originate from only five cultivated bacterial groups: filamentous Actinomycetes, Myxobacteria, Cyanobacteria, as well as members of the genera Pseudomonas and Bacillus [1]. Actinomycetes are Gram-positive mycelial bacteria found mainly in the soil, but are also present in symbiotic association with terrestrial and aquatic invertebrates [2]. Actinomycetes produce metabolites as they undergo the morphological and physiological differentiation processes that are part of their life cycle [2].

The secondary metabolites that bacteria produce include aminoglycosides, polyketides, as well as small proteinaceous and peptide structures such as bacteriocins, oligopeptides and lipopeptides. These secondary metabolites may have bactericidal, immune suppression and tumor suppression properties and can be useful for human and veterinary medicine. Lipopeptides and polyketides have linear, cyclic or branched structures. Lipopeptides are generated by non-ribosomal peptide synthases (NRPSs) whilst polyketides are generated by polyketide synthases (PKSs) [3, 4].

The function that these metabolites have in their natural environment is not always known, but they are thought to provide a competitive advantage to the producing organism since many of these possess potent antibiotic activity [2]. It has also been suggested that antibiotics act as signaling molecules facilitating intra- or interspecies interactions within microbial communities [5].

Most of the antibiotics clinically used are microbial natural products or their derivatives [6]. In fact, of the 18,000 currently known bioactive compounds, 10,000 were described from the genus Streptomyces (Actinobacteria) [7]. Actinobacteria still are one of the most important producers of natural products that are currently applied as antibiotics, immunosuppresants, anticancer drugs, Anthelmintics and antifungals [8, 9].

The threat of multi-drug resistant pathogens puts at grave risk the advances of modern medicine [6, 10, 11] and yet, new antibiotics emerging in the markets are few. Drug discovery is expensive and the return on investment is difficult to predict. New products in the market are poorly sold because they are not prescribed in the hope to slow down development of resistance [6, 8].

Silent Biosynthetic Gene Clusters

With the onset of the genomic era, it became evident that Actinomycetes contain a largely untapped and unexplored potential for the production of secondary metabolites [2, 12]. Analyses of genome sequences have been revealing that each genome contains clusters to synthesize 20 or more secondary metabolites [13], which increases the chances of discovering novel bioactive natural products. Genome mining bioinformatics software detects biosynthetic gene clusters encoded in the genome, but bioinformatics programs alone will not lead to the discovery of new metabolites, since many of the secondary metabolism gene clusters are silent under laboratory conditions [14].

Secondary metabolite biosynthetic gene clusters remain silent until the required signals occur, which may be environmental or physiological [15]. However, it has been proposed that the majority of secondary metabolism gene clusters in Streptomyces are not silent, but are expressed at very low levels under laboratory conditions, so the transcription of these gene clusters is not sufficient to produce detectable amounts of novel secondary metabolites [16].

What is Genome Mining?

Genome mining consists on using genetic information to assess the potential of microorganisms to produce novel compounds [17]. Such analysis has to be followed by extensive experimental research [2] involving proteomics and metabolomics to confirm that the predicted gene cluster produces the target secondary metabolite [7]. Genome mining as a natural product discovery strategy is based on connecting an unknown structure of a natural product with its corresponding biosynthetic genes by applied biosynthetic knowledge. As proposed by Nett [18] genome mining involves “basic in silico analyses” to aid in the proposal of putative genes and putative products, as well as “the emerging chemical or genetic methods that are applied to trace the metabolic products of the (putative genes)”. New methods are necessary that allow linking conclusively a gene cluster and a natural product [19]. Several in silico and experiment-guided approaches have been developed for this purpose.

The first step in a genome mining approach is to identify the putative biosynthetic gene clusters in the genome sequence [17]. In the second step, once putative clusters have been identified, it is necessary to predict the biosynthetic products resulting from the enzymes encoded in the cluster [17].

Genome mining consists not only of the in silico determination of a gene cluster, but also in the activation of a cryptic gene cluster [20]. In fact, genome mining is typically accompanied by proteome and/or metabolome analyses to accurately link a metabolite to its biosynthetic gene cluster [17, 21]. Such a connection may help to ascertain the novelty of a compound, guide fractionation/identification and allow heterologous expression in a suitable host [19].

A “gene cluster” is a set of co-localized and co-regulated genes, whose products are functionally connected [22]. A secondary metabolite biosynthetic gene cluster normally does not contain more than 20 genes. Identifying the gene clusters that are likely to encode new molecules is a key priority for genome mining. Thus, it is necessary to compare the predicted to known gene clusters and to predict the structure of the putative product, as summarized in Fig. (2), which is extremely challenging [23].

Ultimately, a genome mining approach should be more efficient to find bioactive secondary metabolites that are novel so as to avoid re-discovering the same compounds over and over again. As summarized in Fig. (1), traditionally, the isolation of natural products with bioactivity relied on screening extracts to detect bioactivity, followed by separation and characterization of the compounds by assay-guided fractionation [24].

Fig. (1)) The traditional approach compared to a genome mining approach for the discovery of novel compounds with bioactivity. The main difference between the traditional approach and a genome mining approach is that at a genome mining approach provides information about the genes involved in the biosynthesis of a metabolite. Figure based on [13].
Fig. (2)) Summarized steps involved in a genome mining approach. The greatest challenge of a genome mining approach is to link a produced metabolite with the genes involved in the biosynthesis of the metabolite. To propose the biosynthetic pathway of a metabolite, it is necessary to carry out the analysis of a genome sequence aided by bioinformatics platforms for the prediction of secondary metabolite gene clusters. Experimental work is required to confirm the computational prediction of the genes involved in the production of the metabolite and the actual structure of the metabolite.

Genome Mining Approaches and the Awakening of Silent Gene Clusters

Similarly to Choi et al. [13], for Gomez-Escribano and Bibb [25] the activation of the expression of silent gene clusters or the heterologous expression in a suitable organism are approaches to genome mining. For [25] genome mining can be defined as “the use of bioinformatics, molecular genetics and natural product analytical chemistry to access the metabolic product of a gene cluster found in the genome of an organism”.

Bachmann et al. in 2014 [26] reported that genome mining now includes the bioinformatic prediction of clusters, the control of gene expression and the identification of new metabolites. The reason a genome mining approach includes strategies for the awakening of silent clusters is that, if they are not awakened, it will not be possible to produce the metabolite for its identification and bioactivity tests [27].

In agreement with the view that “genome mining may be defined as the process of technically translating secondary metabolite-encoding gene sequence data into purified molecules in tubes” [26], in the following pages we have laid out a description of in silico biosynthetic gene cluster prediction software platforms and we have complied a list of online databases created to store information on secondary metabolites, biosynthetic genes and predictions. The in silico prediction section is followed by a metabolomics section to describe experimental metabolomics techniques that have the aim of identifying and characterizing secondary metabolites. It has been of particular interest for us to discuss how the data generated by metabolomics studies has been integrated to data generated by bioinformatics platforms to link a metabolite to its gene cluster.

Homologous vs. Heterologous Expression

Genome-mining efforts can be organized into two categories: homologous and heterologous expression. Homologous expression of secondary metabolites tries to elicit the expression of secondary metabolites in the encoding producing organism [26], i.e., a gene or gene cluster is being “homologously” expressed when the organism that encodes the cluster in its genome is expressing the genes in the cluster. When the clusters are silent, strategies must be sought to force the expression of the clusters by the organism that encodes the cluster. Once the cluster has been expressed by the encoding-organism, the product of the cluster can be identified by comparison to non-expressing cells, then purified and analyzed. Several strategies have been reported in the literature to awaken the expression of secondary metabolite gene clusters, such as modification of the growth conditions, co-culturing various strains together or genetic engineering approaches that involve over-expressing a native transcriptional activator within the native strain [27]. In contrast to homologous expression, a gene cluster that has been isolated or amplified from one organism and has been introduced into another, perhaps more suitable organism for its expression is being “heterologously” expressed by that new host.

For a review on heterologous expression of actinomycete genes in Streptomyces coelicolor A3(2), we invite the reader to consult the 2014 work of Gomez-Escribano and Bibb [25], who have reviewed publications involving cloning of genome fragments from diverse Actinomycetes to express them in the host S. coelicolor A3(2). According to [25], the cases when heterologous expression has been favored over homologous expression are normally due to the difficulty to study clusters in the natural hosts. For example:

When deletion of the cluster or gene cannot be achieved to validate a host strain.When it is desired to generate novel unnatural chemical structures by combining genes from a variety of pathways to create a new unnatural pathway.To study the function of particular genes from an “imported” cluster in order to propose the biosynthetic pathway of a metabolite.To optimize the production of a metabolite.

It is possible to modify the genetic content of the cluster by changing the native promoters, eliminating negative regulators or re-coding the codon to generate mutants of an enzyme, as detailed in [25]. The metabolites produced by the wild-type strain and the heterologous host can be compared to find the metabolite encoded in the foreign genes [28].

New approaches are required to identify and characterize the silent gene clusters that appear to be present among all Actinomycetes [19]. Therefore, the aim of this chapter is to review the latest proposed strategies for the computational identification of secondary metabolite biosynthetic gene clusters in the genome, as well as the most recent experimental methodologies that have succeeded at linking novel natural products with their biosynthetic gene cluster. Also, considering that Actinomycetes produce more than 70% of the natural product scaffolds of clinically-used anti-infective compounds [2], a summary of some of the most recent natural products synthesized by Actinobacteria are included.

Bioinformatics Tools and Databases for Secondary Metabolite Discovery

The current excess of genetic information encoding uncharacterized proteins [29] requires a variety of computational tools that have been developed to aid scientists in genome mining, as summarized in Fig. (3) [7].

Fig. (3)) Tools for the mining of genomes. Several programs and databases have been created to analyze genome sequences to predict genes clusters involved in the production of secondary metabolites and their putative metabolic products.

Bioinformatics platforms are essential in the search for secondary metabolite clusters because many organisms cannot be isolated or cultured in a laboratory [2, 30]. The key feature these programs exploit is the high degree of sequence similarity of the catalytic domains from enzymes involved in secondary metabolite biosynthesis, in spite of the immense chemical diversity of secondary metabolites [17]. In most bacteria, a secondary metabolite biosynthetic cluster includes not only the genes involved in the biosynthesis of the secondary metabolite but also regulators, transporters, and genes involved in conferring resistance to the metabolite produced. Computational screening of genes is a way to complement experimental assays where extracts or purified compounds are tested against specific targets with the goal of finding bioactive compounds [17]. Computationally identified biosynthetic clusters can be cloned or synthesized for heterologous expression [30].

The application of bioinformatics tools in the search for polyketide (PK) and non-ribosomal peptide (NRP) pathways requires browsing genetic sequences to pinpoint the location of the putative pathways by comparing with an ortholog of a known protein from the pathway, for which conserved catalytic domains are often used [23]. Once the biosynthetic gene cluster has been located, it is necessary to identify all the genes involved in the biosynthesis of the metabolite, which includes all genes encoding polyketide synthases (PKS), nonribosomal peptide synthases (NRPS), tailoring enzymes, biosynthetic genes, regulatory elements and resistance, which are typically tightly clustered together on the chromosome [23]. The automation of this second step of the process is challenging if some of the genes are located far from the core signature genes or if gene clusters that are located close together are merged into one cluster [23].

Often, NRPS and type I PKS enzymes work predictably, i.e. the order of recruitment for assembly of amino acids for NRPS or carboxylic acids for PKS is the same as the order of the catalytic domains. This insight into the architecture of the domains facilitates prediction of the structures they might produce based exclusively on genomic information [4]. Quoting a recent work by C. N. Boddy: “the predicted connectivity of the individual building blocks selected by the A and AT domains is defined by the order of the A and AT coding regions in the gene cluster” [23].

Several tools have been developed to aid in genome mining of bacterial secondary metabolite gene clusters such as ClustScan, ClusterFinder, antiSMASH, PRISM, Pep2Path, and NRPquest, among several others. Hidden Markov Models are statistical models generated from multiple sequences that are superior to pairwise search methods such as BLAST to detect distantly related homologs [23].

ClustScan

The ClustScan program [31] contains descriptions of 170 natural product clusters. The program was developed to analyze modular clusters. What makes ClustScan unique is that it takes a “top-down” approach to annotate gene clusters, so the cluster is also considered as a whole unit. Then the modules and domains are organized in a hierarchical way, so it is possible to predict the structures of the products [32].

The ClustScan program proceeds to construct the ClustScan database on its own, without any further manual curation. According to Weber [17], by 2014 the ClustScan database contained 57 entries on characterized PKS, 51 entries on NRPS and 62 entries on hybrid PKS/NRPS clusters. It is important to mention that the ClustScan program is a commercial program for the analysis of biosynthetic clusters.

ClusterFinder

ClusterFinder is a hidden Markov model-based probabilistic algorithm developed by Cimermancic et al. [33]. ClusterFinder aims to find clusters that are not well-characterized by converting the nucleotide sequence into a string of contiguous Pfam domains. Each domain is then assigned a probability of being part of a given gene cluster based on how frequently these domains occur in the ClusterFinder datasets [33].

Cimermancic et al. [33] experimentally tested the ClusterFinder predictions of biosynthetic gene clusters and discovered an aryl polyene (APE) carboxylic acid.

AntiSMASH

AntiSMASH is so far the most comprehensive and user-friendly tool for the analysis and identification of secondary metabolite biosynthetic gene clusters in bacteria [17, 21]. The program antiSMASH, now available in version 3.0 [21] at http://antismash.secondarymetabolites.org, has undergone major improvements since its first release in 2011 [34].

AntiSMASH is a powerful prediction tool because (1) it permits a more detailed prediction of the clusters, since it allows BLAST searches of the predicted cluster to find the closest homologues in the database and (2) it allows the analysis of fragmented genomes and metagenomes [4]. AntiSMASH includes rule-based and statistics-based algorithms and offers various modules for pathway analysis [7]. The new version of antiSMASH has integrated the ClusterFinder algorithm developed by Cimermancic et al. [33], which no longer limits results exclusively to the detection of known biosynthetic gene clusters.

AntiSMASH also incorporates the CLUSEAN and NRPS predictor tools [17] and it also takes advantage of the conserved regions within clusters. In particular, it can detect conserved operons responsible for the biosynthesis of specific building blocks [17]. The antiSMASH pipeline involves the following steps [35]:

Genes are predicted from the genome using Glimmer3.Biosynthetic gene clusters are identified using profile Hidden Markov Models in order to search databases for homologous sequences.Biosynthetic clusters are automatically annotated.The core chemical structure of natural products is predicted based on the annotated gene clusters.
Fig. (4)) Screenshot of antiSMASH output. Identification of secondary metabolite biosynthetic gene clusters of S. coelicolor (NCBI accession number GCA_000203835.1) by antiSMASH. Biosynthetic gene clusters are identified, classified and listed according to the type of the potential metabolite.

The predicted secondary clusters that antiSMASH yields can be: NRPS, PKS, hybrid PKS/NRPS, siderophore, bacteriocin, lantibiotic [4]. AntiSMASH includes the ClusterBlast algorithm to quantify similarity between query clusters and close homolog clusters deposited in the NCBI database [23]. AntiSMASH defines clusters as groups of signature genes within 10 kb of each other and extends the cluster 20 kb on each side of the last signature gene for defining the boundaries of clusters to ensure that no important genes are left out from the predicted gene cluster [23]. Two screenshots have been included below of the output obtained after analyzing the S. coelicolor genome sequence (accession number: GCA_ 000203835.1) with the program antiSMASH. Fig. (4) shows a list of all the putative biosynthetic clusters in the genome sequence predicted by antiSMASH while Fig. (5) shows the resulting output of selecting one of the gene clusters.

Fig. (5)) Identification of biosynthetic gene cluster in antiSMASH. The biosynthetic, transport-related and regulatory genes are displayed including genomic information. The predicted core structure is shown according to the logical architecture of assembly.

PRISM

PRediction Informatics for Secondary Metabolomes (PRISM) is an open-source web application for the genomic prediction, as well as bio- and chemo-informatic dereplication of nonribosomal peptides, type I and II polyketide chemical structures including transacting acyltransferase or adenylation domains [36]. The program PRISM, available since 2015 at http://magarveylab.ca/prism/, can identify enzyme domains and regulatory genes associated with natural product biosynthesis and resistance. Also, a combinatorial library of natural product scaffolds are suggested by the identification of each monomer using an algorithm that accounts for all combinations of enzyme substrates consistent with known biosynthetic logic. Moreover, the set of predicted chemical structures is compared to a database of 49,860 known natural products via the Tanimoto coefficient in order to chemo-informatically dereplicate known natural products [36].

The enzyme domains associated with specialized metabolites are identified through a library of 479 hidden Markov models. The hypothetical domains are grouped into putative biosynthetic gene clusters and biosynthetically plausible open reading frame permutations are generated. Fig. (6) shows a screenshot of the predictions by PRISM from the analysis of the genome sequence of an Actinobacteria.

Fig. (6)) Screenshot of PRISM output. Identification of the genes that compose a biosynthetic gene cluster from the genome sequence of an Actinobacteria as determined by PRISM.

Pep2Path

Once antiSMASH has identified possible gene clusters and predicted putative NRPSs, the program NRPSPredictor2, developed by Röttig et al. in 2011 [37], can be used to obtain substrate specificity predictions of NRPSs and possible amino acids had to be compared manually [38]. Pep2Path is a program that aims to bridge the gap between antiSMASH and NRPSPredictor2. The creation of Pep2Path [38] was motivated by the tediousness of manually having to match possible amino acid sequences to substrate specificity predictions. Pep2Path uses mass shift sequence tags detected by tandem mass spectrometry to automatically identify candidate biosynthetic gene clusters of either NonRibosomally synthe-sized Peptides (NRPs) or Ribosomally-synthesized and Post-translationally- modified Peptides (RiPPs) [38].

The inputs for Pep2Path can be either: (1) the MS mass shift sequences or the amino acid search tags (MS mass shift sequences translated into amino acids) or (2) the biosynthetic gene clusters plus substrate specificity predictions generated by antiSMASH and NRPSPredictor2. Pep2Path will then bridge both, producing an output that consists of all the possible matches between the amino acid “problem” sequence and all potential assembly steps that have the highest likelihood of generating peptidic natural products containing the sequence [35, 38]. Pep2Path is freely available at http://pep2path.sourceforge.net/.

NRPquest

The current problem of NRP analysis is that standard de novo sequencing tools were developed for linear peptides and cannot be applied to identify cyclic and branched NRPS peptides [39]. Mohimani et al. [39] presented in 2014 NRPquest, a resource to couple Mass Spectrometry and genome mining for the discovery of nonribosomal peptides. NRPquest performs mutation-tolerant and modification-tolerant searches of spectral datasets against a database of putative NRPs. NRPquest is available at www.cyclo.ucsd.edu, and it first generates a database of putative NRPs extracted from the genome using the nonribosomal code. The putative NRPs are matched to MS/MS spectra in the database to identify the NRP [39].

Genome-to-Natural Products (GNP)

Johnston et al. [40] proposed using LC-MS/MS data of crude extracts to discover natural products. Their proposed Genome-to-Natural Products platform, available at http://magarveylab.ca/gnp, is a tool that can generate natural product predictions from LC-MS/MS data [40]. The Genome-to-Natural Products platform uses hidden Markov models and predicts chemical structures based on genome-guided prediction of biosynthetic gene clusters and identification of modules, domains and substrate specificities [40].

The Motif Density Method (MDM)

The main disadvantage of similarity-based programs for the detection of secondary metabolite gene clusters, such as those described above, is that there is actually a limited number of known tested clusters that can serve as a template. As a result, these programs may overestimate the length of a cluster or may not be able to differentiate between adjacent clusters [22].

The Motif Density Method (MDM) is an entirely different approach proposed by Wolf et al. [22] that consists of determining in silico transcription factor binding site occurrences in promoter regions to predict gene clusters and potential regulators of the clusters. It can also help discover clusters co-regulated by the same transcription factors [22]. Cluster-specific transcription factors that bind in several locations within the gene cluster must be detectable by a common motif in the promoter regions. The program that can identify common motifs is MEME [41]. The Motif Density Method approach is an interesting alternative approach to the similarity-based programs for in silico prediction of secondary metabolite gene clusters. It opens up exciting prospects for the future of genome mining.

Additional Bioinformatics Platforms

There are several other bioinformatics platforms mentioned in genome mining research articles such as:

The bioinformatics platform MicroScope [42] enables visualizing genome synteny in annotated bacterial genomes. Genome synteny (colocalization of genetic loci) may also provide information on gene function [43].The Natural Product Domain Seeker (NaPDoS) [44] uses hidden Markov models to identify NRPSs and PKSs in bacterial genomes [4]. NaPDoS analysis provides excellent identification of the KS and C domains [23].The MIDDAS-M software [45] uses gene expression data to identify and assess cooperatively transcribed gene clusters.Thiofinder is a genome mining tool for thiopeptides [46].PKMiner [47] is a combination of database and program for genome mining. The database contains 42 characterized and 230 uncharacterized type II PKS clusters of Actinomycetes.NRPSpredictor2 [37] is a web server for predicting NRPS adenylation domain specificity. The A domains select the amino acid building blocks. According to [23], in NRP pathways, the substrate selectivity of the A domains can be predicted.BAGEL3 [48] is a program that allows the identification and analysis of bacteriocins as well as ribosomally synthesized peptides and post-translationally modified peptides (RiPPs).NP searcher [49] predicts amino acid composition and connectivity, backbone heterocyclization, tailoring chemistries including dimerization, hetero-cyclization and glycosylation [17].SBSPKS [50] is a tool for predicting the order of substrate addition that takes into account that not all PK and NRP biosynthetic pathways follow co-linearity.

Databases for Genome Mining

The analysis of sequences is time-consuming and generates a large amount of data regarding the specificities of domains and the chemical structures of the products. To incorporate the accumulated information [32], several databases have been developed, among them IMG-ABC, StreptomeDB, DoBISCUIT, ClusterMine360, Norine and Protein Data Bank (PDB).

Integrated Microbial Genomes-Atlas of Biosynthetic Clusters (IMG-ABC)

Hadjithomas et al. presented in 2015 IMG-ABC [30], the largest publicly available database of biosynthetic gene clusters, which permits to screen genomic and metagenomic data for secondary metabolite clusters. IMG-ABC is a platform that integrates powerful search and analysis tools and is available at https://img.jgi.doe.gov/abc. IMG-ABC aims to expand constantly to become an essential bioinformatics tool in the search for secondary metabolites.

StreptomeDB

StreptomeDB, now available in version 2.0 [51] is a database that compiles the bioactive molecules produced by Streptomyces. It can be accessed through http://www.pharmaceutical-bioinformatics.de/streptomedb. StreptomeDB cont-ains, to date, around 4041 molecules from 2584 different Streptomyces strains and substrains. The StreptomeDB database is linked with PubMed and offers the names of the molecules, their structures, source organisms, references, biological activities and synthesis routes. It is the largest database of natural products isolated from Streptomyces [51]. The search inputs of StreptomeDB can be a substance name, a producer or a substructure [17].

The utility of StreptomeDB was recently demonstrated by Ntie-Kang [52] and Jain et al. [53]: Ntie-Kang assessed the compounds in StreptomeDB in silico for their agreement with Lipinski's “Rule of Five” and pharmacokinetic profile. Jain et al. used the database to carry out a similarity search of a new compound isolated from Streptomyces sporoverrucosus by using the substructure search engine of the database.

DoBISCUIT

DoBISCUIT [54] provides careful annotations of secondary metabolite biosynthetic clusters from bacteria. It is available at http://www.bio.nite.go.jp/ pks/. DoBISCUIT is a manually curated database. Even though information on gene clusters is constantly being deposited in International Nucleotide Sequence Database Collection (INSDC) entries (DDBJ/GenBank/EMBL), newer additional information is rarely incorporated into the existing entries. As a result, the accumulated knowledge requires professional expertise to track it down and organize it comprehensively [54].

The core data of DoBISCUIT are INSDC entries obtained from a thorough review of articles available from PubMed. The precise identification of the functional domains of biosynthetic clusters is necessary to assign substrate specificity, to be able to predict the biosynthetic mechanism [54]. To date, DoBISCUIT contains information on 103 secondary metabolite biosynthetic clusters. In addition, DoBISCUIT offers information on the “tailoring enzymes” encoded in the gene clusters [17]. Databases such as DoBISCUIT permit to infer the function of uncharacterized genes by similarity to characterized genes from other gene clusters. Ichikawa et al. [54] annotated genes based on the function of similar genes if they had more than 30% amino acid identity with an experimentally confirmed protein. Hagen et al. in 2014 [55] used DoBISCUIT to inquire into the functionality of a PKS domain involved in borrelidin biosynthesis. Castro et al. [56] have also recently used DoBISCUIT to obtain annotated ansamycin-related gene clusters.

Clustermine360

Clustermine360 [57] contains more than 200 PKS and NRPS gene clusters [17]. The difference between Clustermine360 and DoBISCUIT is that ClusterMine360 allows the public to enter information. Once a user uploads a PKS or NRPS cluster into the database, the sequence is retrieved from NCBI and automatically analyzed with antiSMASH. The database then incorporates the results obtained from antiSMASH as well as bioactivity data obtained from PubChem.

Norine

AntiSMASH-predicted peptides can be compared against the Norine database [58] because it contains the structures of more than 1000 non-ribosomal peptides [4]. Norine includes information on the producers, literature references, biological activities, amino acid sequence and the building blocks of the respective peptides [17]. The Norine database can be used to find peptides matching a specific sequence or containing unusual non-proteinogenic amino acids. It conveniently allows text searches or structure searches generated by an interactive structure editor.

Table 1Overview of bioinformatic tools and databases for secondary metabolite discovery.ProgramType of AnalysisReferenceAnalysis of biosynthetic gene clustersClustScanIdentification and prediction of NRPS/PKS biosynthetic gene clusters[31]ClusterFinderAnalysis and identification of biosynthetic gene clusters[33]antiSMASHAnalysis and identification of secondary metabolite biosynthetic gene clusters[21]PRISMIdentification of NRPS, type I and II PKS biosynthetic gene clusters[36]MDMDetection of genomic clusters based on consideration of cluster-specific regulatory patterns[22]Coupling mass-spectrometric and genomic dataPep2PathConnection of genomic and mass spectrometry data of peptide metabolites[38]NRPquestCoupling of mass spectrometry and genome mining for NRPS discovery[39]GNPPrediction, combinatorialization and identification of PKS and NRPS from biosynthetic assembly lines using LC–MS/MS data of crude extracts[40]Biosynthetic gene cluster and chemical structure databasesIMG-ABCStructural and functional genomic data for the analysis of biosynthetic gene clusters and associated secondary metabolites[30]StreptomeDBThe largest database of natural products isolated from Streptomyces[51]DoBISCUITDatabase of gene clusters for secondary metabolite biosynthesis (manually curated)[54]Clustermine360Library of NRPS and PKS gene clusters[57]NorineDatabase of NRPS from bacteria and fungi[58]Protein Data BankContains peptide-like antibiotics and inhibitors[59]

Table 1 includes a summary of the type of analysis that the bioinformatics platforms and databases mentioned in this chapter are capable of performing and storing.

Protein Data Bank (PDB)

The Protein Data Bank (PDB) [59] is a global repository for three-dimensional structures of biological macromolecules and their complexes. In 2011, a major remediation of the 1300 peptide-like inhibitors and antibiotics in the database was carried out by PDB curators for easier identification. It is possible to cross-reference to the UniProt or Norine databases. The database includes chemical information of peptide-like antibiotics such as their chemical composition, connectivity and structural description as well as biological information such as function, mechanism of action and pharmacological action [59]. The rapid progress in sequencing technology means that these databases will become more and more populated. These growing databases may even help predict which compound is likely to possess useful biological activities [32]. For a very comprehensive review on bioinformatics programs and databases created to mine genomes, the reader is invited to consult the 2016 publication of Weber and Kim [60], who have introduced a web portal available at http://www.secondarymetabo- lites.org that includes a catalog and links to the bioinformatics software for genome mining.

In addition, the Minimum Information about a Biosynthetic Gene cluster (MIBiG) data standard proposed in 2015 by Medema and 153 other co-authors [61] is an effort to gather in a single place the growing information in the literature about biosynthetic gene clusters, proposed pathways and characterized metabolites in one universal format available at http://mibig.secondarymetabolites.org.

Experimental Metabolomics for the Identification of Secondary Metabolites

New approaches are required to identify and characterize the gene clusters that appear to be present among all Actinomycetes. Genome mining-based drug discovery requires methods that allow linking a gene cluster and a natural product conclusively [19].

Mass Spectrometry (MS)

Mass spectrometry is rapidly gaining popularity in natural product research and is increasingly present in the latest publications related to genome mining, as summarized below. It has been suggested that mass spectrometry may even surpass NMR for the initial characterization of natural products [62].

MS/MS Glycogenomics and Peptidogenomics

Our understanding of the biosynthetic machinery for natural products has advanced and now it is becoming possible to connect genomic information with the MS/MS signatures of compounds [62]. The basis of the glycogenomics and peptidogenomics methods consists of detecting the mass spectrometry signature of a secondary metabolite, matching it to known signatures and looking in the genome for gene clusters that have the biosynthetic machinery to produce such type of compound [62]. Glycogenomics and peptidogenomics are two experiment-guided approaches based on tandem mass spectrometry (MS/MS) analysis of secondary metabolites [7].

Glycogenomics allows the characterization of glycosylated natural product chemotypes by connecting glycosyl groups obtained from tandem mass spectrometry (MS/MS) signatures of microbial metabolomes with their corresponding biosynthetic genes [63]. Glycosyl tags from glycosylated natural products can be linked to their corresponding glycosylation genes among all the glycosylation genes found in the genome [7].

As proof-of-principle of the glycogenomics approach, Kersten et al. [63], reported the discovery of arenimycin B from the marine actinobacterium Salinispora arenicola, a natural compound that has antibiotic activity against multi-drug resistant Staphylococcus aureus. MS-genome mining can target secondary metabolite biosynthetic pathways as long as those pathways are expressed, not silent [63].

Peptidogenomics is a method that is also based on matching tandem mass spectrometry structures of chemotypes of peptide natural products to their biosynthetic gene clusters [64]. The short amino acid sequence tags that are part of the peptide in question are reconstructed from the MS spectrum. These “mass shift sequences” can be assessed for their potential to be a novel peptide with the help of tools like iSNAP reported by Ibrahim et al. in 2012 [65] and can be matched to biosynthetic gene clusters predicted by antiSMASH with the help of the software package Pep2Path, described above.

As Kersten et al. [64] demonstrated, the peptidogenomics method can be used to characterize ribosomal and non-ribosomal peptide natural products of previously unidentified composition, including lantipeptides, lasso peptides, linardins, formylated peptides and lipopeptides.

Streptomyces roseosporus is an industrial producer of daptomycin. Liu et al. [66] applied an MS/MS peptidogenomics approach to analyze four NRPS-derived molecules (daptomycin, arylomycin, napsamycin and stenothricin) as well as their corresponding genes in S. roseosporus. A map of the biosynthetic intermediates of these four compounds was created by MS/MS-based networking. Liu et al. [66] highlighted the importance of developing new approaches to characterize in a more global manner the biosynthetic capacity of a microorganism. As opposed to studying one molecule at a time, Liu et al. [66] aimed at visualizing the “molecular network”. The molecular network developed by MS/MS networking refers to the metabolites detectable under given mass spectrometry conditions. MS/MS is a methodology where molecules characterized by mass spectrometry are subjected to fragmentation resulting in mass spectrometry patterns that can be analyzed based on similarity and can be used to visualize structurally similar molecules from one organism as a constellation of nodes. The MS/MS networking technique allowed Liu et al. [66] to find yet unknown analogs of known molecules.

Methods like peptidogenomics and glycogenomics can be applied to a variety of compounds to identify their biosynthetic genes rapidly, but according to Mohimani and Pevzner [35] the key difficulties in peptidogenomics are that:

Many peptidic natural products are non-linear peptides (see NRPquest).Many peptidic natural products are not directly encoded in genomes.Even when they are encoded in a genome, peptidic natural products still have modifications that make it difficult to identify them using standard MS/MS searches.Many peptidic natural products are encoded in the alphabet of hundreds of building blocks.Many peptidic natural products fragment poorly and often feature very few peaks, which cannot be identified in MS/MS databases.

Proteomining

The induction of silent biosynthetic gene clusters under specific conditions will lead to an increase in production and protein expression levels, which is ideal for the proteomining approach [19].

Gubbens et al. presented in 2014 an alternative methodology: proteomining. The aim of proteomining is to link a natural product to its biosynthetic gene cluster, which is still a difficult task [19]. Proteomining is a combination between metabolomics and quantitative proteomics: fluctuating growth conditions ensure differential biosynthesis of secondary metabolites [67]. Therefore, correlations can be established between the abundance of a natural product and changes in the protein pool, which allow the identification of the gene cluster involved in the production of the natural product. In other words, the proteomining approach is based on the reasonable assumption that protein expression levels are directly proportional to the amount of secondary metabolite produced [19].

Quantitative proteomics is based on labeling with stable isotopes, which is a well-established technique that is easy to implement [19] because it is based on inexpensive reagents, it is applicable to any sample and the full protocol doesn't take more than three days to complete [68]. Dimethyl-labeling is more cost effective and yields more data points when compared to MS/MS-based quantification of isobaric labels such as iTRAQ or tandem mass tags. Chemical modification of peptides had to be implemented instead of isotopic labeling because the variation of growth conditions hampers metabolic incorporation of isotopes [19].

The usage of expression levels should allow the detection of all types of clusters specifying natural products simultaneously and allows for accurate determination of cluster members. As proof-of-principle, proteomining was used to identify the gene cluster for a juglomycin-type antibiotic from the new isolate Streptomyces sp. MBT70 [19]. Techniques that help to identify natural products such as MS or NMR help to support the effectiveness of proteomining [19].

MALDI-based Imaging Mass Spectrometry (IMS)

The MALDI-TOF imaging technique is an alternative approach to identify metabolites. As demonstrated by Yang et al. in 2011 [69] MALDI-based imaging mass spectrometry (IMS) can be applied directly to cultures growing on agar to connect the chemotype to the phenotype. Such technique is necessary to study signalling interactions, namely metabolites produced within interspecies co-cultures. The technique permits the analysis of the spatial distribution of the metabolites on solid media [70] to better understand how microbes affect the growth of neighboring organisms because it enables to visualize the metabolites being produced by the colonies and link them to their corresponding mass-to- charge (m/z) ratio readout.

Yang et al. [69] used IMS to discover a novel peptide produced by a Promicromonosporaceae strain that inhibited the motility of B. subtilis but not its growth, as well as the novel hydroxamate siderophore promicroferrioxamine.

Liu et al. in 2011 [71] reported the discovery of three new anti-infective arylomycins from Streptomyces roseosporus also using IMS. The new arylomycins inhibited the growth of S. aureus and Staphylococcus epidermidis. A peptidogenomics approach was followed to identify the biosynthetic cluster responsible for the production of the new arylomycins using NRPS predictor, NP.searcher and the Norine database. Using IMS allowed Liu et al. [71] to observe that prior to the production of daptomycin, a cluster of ions corresponding to the three new arylomycins was produced by S. roseosporus. The production of the cluster of ions correlated well with the decrease in staphylococcal cell growth. The sequence tags generated from MS/MS were queried against genomes to be able to identify the biosynthetic pathway and propose that the new molecules are arylomycins.

Experimental Strategies to Awaken the Homologous Expression of Silent Gene Clusters

According to Choi et al. [13], most of the secondary metabolite biosynthetic gene clusters encoded in the genomes of Actinomycetes appear to be silent under laboratory conditions. If each strain of Actinomycetes contains, on average, 20 or more potential clusters involved in the biosynthesis of secondary metabolites, then most of them are expressed poorly or not at all under laboratory conditions. Not being able to awaken silent gene clusters is a major bottleneck for natural product discovery. The awakening of silent clusters is one of the major challenges of

genome mining. Fortunately, experimental strategies exist to awaken silent gene clusters in a laboratory setting. These experimental strategies include the activation of regulatory genes, ribosome engineering, heterologous expression, the addition of chemical elicitors and co-cultivation, among others [13].

Co-culturing methods and the OSMAC approach are very relevant strategies that have proved useful for both, the awakening of silent gene clusters and improvement of the titer of existing pathways. The latter is a strategy which in turn may be useful to awaken silent genes clusters because silent clusters are expressed very poorly under laboratory conditions. Therefore, it is relevant to know what conditions have been proved useful to increase the expression of secondary gene clusters.

The OSMAC Approach to Awaken Silent Clusters

Many secondary metabolites can be discovered by homologous expression via modification of the growth conditions and studying the compounds that are produced following the principles of analytical chemistry [26]. The composition and concentration of nutrients in the media affect growth rate and also influence complex changes in global gene regulation. This is a reflection of the range of conditions in nature that trigger the production of different antimicrobials [67].

The OSMAC approach “one strain/many compounds” is powerful to investigate the secondary metabolites produced by a microorganism by changing basic growth conditions like media composition, aeration rate, illumination and temperature [16]. The addition of chemical elicitors is not traditionally considered to be part of the OSMAC approach, but it has been observed that the presence of certain compounds in the growth media resulted in the awakening of silent gene clusters because chemical elicitors are signal compounds that stimulate the synthesis of other compounds [13]. Below we summarize a few recent examples from the literature where the implementation of the OSMAC approach and the addition of chemical elicitors has led to the production of either new secondary metabolites or enhanced the production of known secondary metabolites.

Secondary metabolites are produced by secondary pathways that are dispensable in many growth conditions, thus, when bacteria are grown under experimental growth conditions, the dynamics of cell growth show different stages: abundant nutrients allow for exponential growth. Limiting nutrient produces an arrest in growth and entry into stationary phase. Nutrient limitation often results in the activation of the production of secondary metabolites [5]. Small changes in the nutrients Actinobacteria grow on can have an impact on the production of secondary metabolites, even to the extent of facilitating the discovery of novel secondary metabolites.

To carry out the Proteomining approach, the range of conditions that gave good variation in bioactivity were minimal medium supplied with NaOH to pH 9, 25 mM N-acetylglucosamine, 0.8% w/v Bacto peptone, 0.5% w/v yeast extract and 1-2% w/v NaCl [19].

Carbon sources have a major impact on the expression of secondary metabolism biosynthetic genes. Carbon Catabolite Repression (CCR) occurs when the growth media contains mixtures of rapidly and slowly used carbon sources [67]. Glucose has been reported to repress antibiotic production [72]. Pawlik et al. in 2010 [73] reported that S. coelicolor A3(2) produced a yellow pigment associated with the type I PKS gene cluster cpk when grown on rich media without glucose. The production of the pigment did not occur in the presence of glucose, so for industrial fermentations other carbon sources such as starch, lactose or soybean oil are used [67].

Selection of yet unknown bacterial strains from unexplored environments is of great interest for the discovery of new natural products. Rateb et al. reported in 2011 [74] that the metabolic profile of Streptomyces sp. strain 34, now Streptomyces leeuwenhoekii [2], is dependent on the culture media used for its growth. An OSMAC approach testing of eight different media led to the discovery of three new compounds of a rare class of 22-membered macrolactone polyketides, named chaxalactins. Chaxalactins showed strong activity against S. aureus ATCC 25923, Listeria monocytogenes ATCC 19115 and Bacillus subtilis NCTC 2116 [74].

Similarly, Rateb et al. in 2011 [75] discovered that S. leeuwenhoekii