25,99 €
An indispensable look at the next frontier of technological advancement and its impact on our world
Generative AI is rewriting the rulebook with its seemingly endless capabilities, from crafting intricate industrial designs, writing computer code, and producing mesmerizing synthetic voices to composing enchanting music and innovating genetic breakthroughs. In Generative AI in Practice, renowned futurist Bernard Marr offers readers a deep dive into the captivating universe of GenAI. This comprehensive guide introduces you to the basics of this groundbreaking technology and outlines the profound impact that GenAI will have on business and society. Professionals, technophiles, and anyone with an interest in the future will need to understand how GenAI is set to redefine jobs, revolutionize business, and question the foundations everything we do.
In this book, Marr sheds light on the most innovative real-world GenAI applications through practical examples, describing how they are moulding industries like retail, healthcare, education, finance, and beyond. You'll enjoy a captivating discussion of innovations in media and entertainment, seismic shifts in advertising, and the future trajectory of GenAI. You will:
Ideal for professionals, technophiles, and anyone eager to understand the next big thing in technology, Generative AI In Practice will equip readers with insights on how to implement GenAI, how GenAI is different to traditional AI, and a comprehensive list of generative AI tools available today.
Sie lesen das E-Book in den Legimi-Apps auf:
Seitenzahl: 396
Veröffentlichungsjahr: 2024
COVER
TABLE OF CONTENTS
TITLE PAGE
COPYRIGHT
DEDICATION
ABOUT THE AUTHOR
INTRODUCTION
The Incredible Impact of Generative AI
Because Generative AI Is More than Chatbots
What to Expect from This Book
Notes
Part 1: SETTING THE SCENE FOR AN AI REVOLUTION
1 UNVEILING GENERATIVE AI: A NEW FRONTIER
What Is Generative AI? A Quick Explanation
How Does Generative AI Work?
What Can Generative AI Do?
The Impact of Generative AI
Key Takeaways
2 TRACING THE EVOLUTIONARY BLUEPRINT OF GENERATIVE AI
1950s–1990s: The Early Years of AI
The promise of AI takes root
2000s–2010s: Laying the Foundations for Generative AI
MID-2010s–Today: The Era of Generative AI Arrives
Generative AI and Its Link to Other Technological Advances
Key Takeaways
Note
3 REVOLUTIONIZING SOCIETIES AND BUSINESS ECOSYSTEMS
Transforming Sectors and Society
Reshaping Business Models
Innovating Products and Services
Refining Business Processes
Generative AI in Daily Life
Key Takeaways
Notes
4 RISKS AND CHALLENGES TO MANAGE
Ethical and Societal Concerns
AI Bias and Explainability
Legal Concerns
The Environmental Impact of AI
Responsible Ways Forward
Key Takeaways
Notes
5 IMPACT OF GENERATIVE AI ON JOBS
How at Risk Is Your Job?
Consider the Difference Between Lower-Level and Higher-Level Jobs
Jobs at Maximum Risk of Automation
Other jobs that may cease to exist in future (at least, in their lower-level forms)
Jobs at Moderate to Low Risk
Let's Not Forget that New Jobs Will Be Created
Key Takeaways
Notes
Part 2: GENERATIVE AI IN PRACTICE
6 A NEW DAWN IN MEDIA AND ENTERTAINMENT
Generative AI and Journalism
Enhancing Sports Broadcasting and Fan Engagement with Generative AI
Storytelling: Generative AI in Books, Audiobooks, and Podcasts
Generative AI in Film
Generating Music with AI
AI in Art
Key Takeaways
Notes
7 ADVERTISING AND MARKETING: BRIDGING CREATIVITY WITH AI
Overview: How Can Generative AI Be Applied in Advertising and Marketing?
Using Generative AI to Create Ads and Marketing Materials
Personalization of Content with GenAI
AI Influencers and Models
Key Takeaways
Notes
8 REINVENTING CUSTOMER ENGAGEMENT THROUGH INTELLIGENT SYSTEMS
Generative AI in Customer Service
A New Level of Personalization
Delivering More Proactive Interventions through Predictive AI
Incorporating Generative AI into Products
Key Takeaways
Notes
9 RETAIL REIMAGINED: THE RISE OF VIRTUAL TRY-ONS, AI SHOPPING ASSISTANTS, AND MORE
Improved Digital Shopping Experiences
Augmenting Physical Stores
Personalizing the Customer Journey
Other Potential Uses in Retail
Key Takeaways
Notes
10 PERSONALIZED LEARNING: THE FUTURE OF EDUCATION
Rethinking How Learners Learn, and How Teachers Teach
Transforming
What
We Teach
The Need for Formal Policy and Guidance in Schools
Key Takeaways
Notes
11 HEALTHCARE TRANSFORMATION: FROM PERSONALIZED ADVICE TO OPERATIONAL IMPROVEMENTS
Giving Patients Personalized Health Advice
Image Analysis and the Early Detection of Diseases
Providing Personalized and Improved Treatment for Patients
Healthcare Research and the Development of New Drugs
Delivering Administrative and Operational Improvements
Could Generative AI Help to Solve the Healthcare Crisis?
Key Takeaways
Notes
12 VIDEO GAME DESIGN AND TESTING: THE GENERATIVE AI APPROACH
Generative AI in Video Game Development and Testing
Generative AI Will Shape the Future of Video Games
Key Takeaways
Notes
13 THE LEGAL SPHERE: AI-ASSISTED DOCUMENT CREATION AND REVIEW
The Use of Generative AI in Law Firms
Helping Everyday People Digest Legal Documents and More
Key Takeaways
Notes
14 CRAFTING TOMORROW: AI IN DESIGN AND DEVELOPMENT
Generative AI in Drug Discovery
Designing Anything with Generative AI and Generative Design
The Impact of Generative AI on Digital Twins
Aiding Academic Research
Key Takeaways
Notes
15 BANKING AND FINANCIAL SERVICES: AI AS A DISRUPTIVE FORCE
Generative AI Uses in Banking and Financial Services
The Risks of Generative AI
Examples from the Financial Sector
Key Takeaways
Notes
16 CODING AND PROGRAMMING: THE AI REVOLUTION
The Impact of Generative AI on Programming
Adoption and New Coding Tools: Real-World Examples
Key Takeaways
Notes
17 DATA INSIGHTS: HARNESSING THE POWER OF GENERATIVE AI
What Is the Data Problem and How Does Generative AI Help?
Real-World Examples and New Analytics Tools
Key Takeaways
Notes
Part 3: MOVING FORWARD WITH GENERATIVE AI
18 IMPLEMENTING GENERATIVE AI: KEYS FOR SUCCESS
Fostering the Right Culture and Mindset for Success
Cultivating the Right Skills and Talent
Data as Another Key Building Block
Proprietary data will set you apart – but don't overlook external data
Getting the Right Technology in Place
Remember to Take a Strategic Approach
Key Takeaways
Notes
19 GLIMPSES OF THE FUTURE: PREDICTING THE TRAJECTORY OF GENERATIVE AI
Are We Moving Closer to General AI?
Combining Generative AI with Robots
Brain–Computer Interfaces Will Allow for More Intuitive Interactions
Using Generative AI for a Better World
Connect with Me
Notes
APPENDIX: GENAI TOOLS
MULTIMODAL MODELS
Writing/ Text Generation Tools
Education Tools
Image, Art, and Design
Video
Sound, Music, and Voice Generators
Coding Tools
Data Analytics
Synthetic Data
Productivity Tools
Cyber Security
INDEX
END USER LICENSE AGREEMENT
Cover
Title Page
Copyright
Dedication
About the Author
Introduction
Table of Contents
Begin Reading
Appendix: Genai Tools
Index
End User License Agreement
iii
iv
v
ix
xi
xii
xiii
xiv
1
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
97
98
99
100
101
102
103
104
105
106
107
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
143
144
145
146
147
148
149
150
151
152
153
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
BERNARD MARR
This edition first published 2024© 2024 by Bernard Marr
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by law. Advice on how to obtain permission to reuse material from this title is available at http://www.wiley.com/go/permissions.
The right of Bernard Marr to be identified as the author of this work has been asserted in accordance with law.
Registered OfficesJohn Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, USAJohn Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK
For details of our global editorial offices, customer services, and more information about Wiley products visit us at www.wiley.com.
Wiley also publishes its books in a variety of electronic formats and by print-on-demand. Some content that appears in standard print versions of this book may not be available in other formats.
Trademarks: Wiley and the Wiley logo are trademarks or registered trademarks of John Wiley & Sons, Inc. and/or its affiliates in the United States and other countries and may not be used without written permission. All other trademarks are the property of their respective owners. John Wiley & Sons, Inc. is not associated with any product or vendor mentioned in this book.
Limit of Liability/Disclaimer of WarrantyWhile the publisher and authors have used their best efforts in preparing this work, they make no representations or warranties with respect to the accuracy or completeness of the contents of this work and specifically disclaim all warranties, including without limitation any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives, written sales materials, or promotional statements for this work. This work is sold with the understanding that the publisher is not engaged in rendering professional services. The advice and strategies contained herein may not be suitable for your situation. You should consult with a specialist where appropriate. The fact that an organization, website, or product is referred to in this work as a citation and/or potential source of further information does not mean that the publisher and authors endorse the information or services the organization, website, or product may provide or recommendations it may make. Further, readers should be aware that websites listed in this work may have changed or disappeared between when this work was written and when it is read. Neither the publisher nor authors shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.
Library of Congress Cataloging-in-Publication Data:
Names: Marr, Bernard, author.
Title: Generative AI in practice : 100+ amazing ways generative artificial intelligence is changing business and society / Bernard Marr.
Description: Hoboken, NJ : John Wiley & Sons, Inc., 2024.
Identifiers: LCCN 2024000295 (print) | LCCN 2024000296 (ebook) | ISBN 9781394245567 (cloth) | ISBN 9781394254255 (adobe pdf) | ISBN 9781394254248 (epub)
Subjects: LCSH: Artificial intelligence—Economic aspects. | Artificial intelligence—Social aspects.
Classification: LCC HC79.I55 M36528 2024 (print) | LCC HC79.I55 (ebook) | DDC 658/.0563—dc23/eng/20240111
LC record available at https://lccn.loc.gov/2024000295
LC ebook record available at https://lccn.loc.gov/2024000296
Cover Design: WileyCover Image: © Jannatul Koraise/Adobe Stock
To my wife, Claire, my anchor in the shifting seas of life, whose love lights the way through every storm. And to my children, Sophia, James, and Oliver, my brightest stars in the vast universe, who remind me every day that the most profound truths often come from the simplest curiosities.
Bernard Marr is a world-renowned futurist, influencer, and thought leader in the fields of business and technology, with a passion for using technology for the good of humanity. He is a multi-award-winning and internationally bestselling author of over 20 books, writes a regular column for Forbes, and advises and works with many of the world's best-known organizations. Bernard has a combined subscriber base of 4 million people across his social media channels and newsletters and was ranked by LinkedIn as one of the top five business influencers in the world.
Bernard helps organizations and their management teams prepare for future trends and create the strategies to succeed. He has worked with or advised many of the world's best-known organizations, including Amazon, Microsoft, Google, Dell, IBM, Walmart, Shell, Cisco, HSBC, Toyota, Nokia, Vodafone, T-Mobile, the NHS, Walgreens Boots Alliance, the Home Office, the Ministry of Defence, NATO, and the United Nations, among many others.
Connect with Bernard on LinkedIn, X (@bernardmarr), Facebook, Instagram, and YouTube to take part in an ongoing conversation; subscribe to Bernard's podcast; and head to www.bernardmarr.com for more information and hundreds of free articles, whitepapers, and e-books.
If you would like to talk to Bernard about any advisory work, speaking engagements, or influencer services, please contact him via email at [email protected].
It feels like generative AI has come out of nowhere, doesn't it? Perhaps you were happily going through life thinking artificial intelligence was something other people had to worry about. And then boom! Suddenly, everyone's talking about this new thing called “ChatGPT” and how it's going to take over everyone's jobs.
As you've no doubt noticed, there's a lot of hype around generative AI (or GenAI as I call it in this book), and tools like ChatGPT. I expect you've picked up this book because you want to cut through the chatter and understand just what the deal is with GenAI. Is it really worthy of the hype? How might GenAI impact your job? Should your organization be leveraging this technology?
To answer the first question, yes, GenAI really is worthy of all the noise. I believe it's one of the most powerful, transformative technologies that humans have ever had access to. It's right up there with the emergence of the internet. Seriously! We'll cover the second and third questions throughout this book but, spoiler alert, GenAI (probably) won't take your job. That said, it will almost certainly change the way you work. And yes, your organization should absolutely be looking to harness this transformative technology. That's why I wrote this book.
I'm not the only one convinced that GenAI is utterly transformative.
One study led by Harvard found that GenAI tools helped workers at the Boston Consulting Group gain a 40% performance boost.1 We'll see many examples throughout the book of organizations using GenAI to accomplish work more easily, lighten the burden of mundane tasks, and provide a better employee (and customer) experience.
GenAI will be so ubiquitous that Bloomberg predicts the GenAI market will explode from £40 billion in 2022, to a whopping US$1.3 trillion by 2032.2 What's more, according to research by McKinsey, GenAI could add up to US$4.4 trillion in value to the global economy annually – and save up to 70% of workers' time.3 It's no wonder McKinsey refers to GenAI as “the next productivity frontier.”
That's the economic impact, but what about our everyday lives and the way we do business? That, too, will be fundamentally altered by GenAI. We'll talk about the business and societal impact in Chapter 3, but to whet your appetite, GenAI will change how we work, how we shop, how we consume content, how we experience healthcare, how we learn (at school and otherwise), how we play video games, and even potentially how we date. Adam Selipsky, CEO of Amazon Web Services, says GenAI will “pretty much change every application inside of companies, every application that consumers interact with. It is a profound technological change that we're still at the very beginning of.”4 I couldn't agree more.
But wait, back up a minute … US$4.4 trillion in economic benefit? I know what you're thinking. How the heck can chatbots add so much value?
We'll delve into a more detailed definition of GenAI in Chapter 1, but for now, all you need to know is that GenAI is a facet of artificial intelligence that allows computers to create content, including (but not limited to) text. One of the ways in which this technology can be harnessed is through more intuitive, more responsive chatbots – chatbots with advanced language capabilities that can respond to complex queries, and understand context as well as content.
ChatGPT is a chatbot. Developed by OpenAI and available free for public use (although a paid-for, upgraded version is available), it's probably the most advanced chatbot we've ever seen. It can answer almost any question you throw at it. It's brilliant. It's a great example of GenAI in action. However, GenAI is much bigger than ChatGPT. And its uses extend far beyond chatbots.
Therefore, this is not a book about ChatGPT (although ChatGPT will obviously feature in several examples throughout). This book shines a light on the vast array of tasks that GenAI tools can accomplish – from designing video games to interpreting medical scans, and more. Yes, GenAI gives computers the ability to converse and answer questions. But it can do so much more on top of that.
Almost every time I type “GenAI,” I think of the word “genie.” And for good reason. Working with GenAI is a bit like having your own personal genie to help lighten the load, get more done, and generally make life easier. (If only it could grant infinite wishes…)
That said, as we'll see in this book, getting the best out of GenAI requires a balance between human expertise and machine intelligence. This isn't about total automation – it's about humans collaborating with GenAI to do things better. At the end of the day, GenAI is just a tool. An incredible tool. But a tool nonetheless – and it's up to us to use it thoughtfully.
With that in mind, my hope is that GenAI will be used not just as a productivity tool, but as a force for good in our world. As an example, in Chapter 11, we'll see how GenAI can be used to democratize access to healthcare for people around the world. And in Chapter 14, we'll see how GenAI is speeding up the process of discovering life-saving new drugs. True, it can also be used to generate fake content to sway elections or be harnessed by criminals for all sorts of nefarious purposes. But GenAI also has the potential to help solve some of humanity's biggest challenges.
The bulk of this book focuses on practical applications of GenAI across a wide range of sectors. In other words, we'll explore how organizations are already using this technology – and, in the process, help you unearth potential use cases for your own organization. That's the focus of Part 2 of this book, practical examples and takeaways from other organizations that have deployed GenAI.
Then in Part 3, we'll look at keys for success when implementing GenAI in your own business. Plus, you'll find predictions for the future of GenAI.
But before all that, Part 1 sets the scene for this powerful, transformative technology. We'll explore what the technology is, how it works, how it will change our world (and the world of business), and how it will impact jobs. We'll also take an honest look at the risks and challenges that surround GenAI.
Before you tuck into Part 1, it's important to note that GenAI technology is rapidly evolving. And it's getting better and better all the time. Indeed, one of the things that made this book challenging to write is the fact that the technology is advancing so darn quickly. Amazing new use cases and updates to GenAI tools were emerging literally every week. As an example, I'd barely finished writing Part 1 when OpenAI launched a new version of ChatGPT that could not just answer written questions, but also understand speech and visuals, and respond in an artificial voice.
Safe to say, by the time you read this, new tools and new examples will have surfaced that continue to surprise and delight. It's truly an exciting time.
1
. Enterprise workers gain 40 percent performance boost from GPT-4, Harvard study finds;
Venture Beat
;
https://venturebeat.com/ai/enterprise-workers-gain-40-percent-performance-boost-from-gpt-4-harvard-study-finds/
2
. Generative AI to become a $1.3 trillion market by 2032, Research finds;
Bloomberg
;
https://www.bloomberg.com/company/press/generative-ai-to-become-a-1-3-trillion-market-by-2032-research-finds/
3
. The economic potential of generative AI: The next productivity frontier;
McKinsey
;
https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier
4
. AWS CEO: Generative AI will ‘pretty much change’ every application consumers interact with;
YouTube
;
https://www.youtube.com/watch?v=jQ18HJkLJjw
I said in my introduction that Generative AI (GenAI) is one of the most powerful technologies humans have ever had access to. But what exactly is GenAI and how does it work? How did we get to this point? And how, broadly speaking, might GenAI change our world?
In Part 1, we'll answer all those questions and more. We'll define GenAI and how it works, explore the evolution of GenAI until this point, outline the risks associated with GenAI, and take a big-picture look at how GenAI is going to change our world – including the jobs that humans do. This will set the scene nicely before we delve into GenAI use cases across various sectors and industries in Part 2.
Okay, let's go back to basics. What is generative AI (artificial intelligence)? How does it work? And what is the technology capable of? You'll find out in this chapter as we delve under the hood of GenAI, tug at a few wires, and examine what the heck's going on in there.
I really want this chapter to give you an inspiring feel for the many things that GenAI is capable of – but also to drive home the point that GenAI isn't just about ChatGPT (Chat Generative Pre-trained Transformer). Sure, ChatGPT is a prime example of GenAI (and it certainly hoovers up the majority of GenAI's press), but there's a lot more to GenAI than ChatGPT … as you'll find out in this chapter.
Actually, let's start by defining artificial intelligence (AI) in its broadest sense. The term “AI” refers to computer algorithms that can effectively simulate human cognitive processes, like learning, decision-making, and problem-solving.
GenAI is a groundbreaking subset of AI – the cutting edge of the cutting edge – that is able to create new content based on patterns and structures it has learned from existing data. Like any AI, GenAI tools are given enormous amounts of data to learn from (what's known as “training data”). They learn from the training data, and then use the patterns or rules that they've learned to create new content that's similar to, but not exactly the same as, the data they have been trained on.
A good example is DALL-E 2 – the text-to-art platform that allows anyone to generate artworks. Or, of course, there's ChatGPT, the language model that can create text based on conversational text prompts. (You'll find a much bigger list of GenAI tools in the Appendix, by the way.) These systems learn from huge training datasets – ChatGPT, for example, was trained on vast amounts of text from the internet, including web pages, articles, and books.
Text and images are perhaps two of the best-known uses of GenAI so far, but the technology is capable of so much more. With GenAI you can generate product designs, computer code, music, video, voices, and even entire visual worlds. We'll talk more about GenAI's capabilities later in the chapter, but to whet your appetite, imagine being able to create unique video game worlds rendered in real time, or have a book written just for you, or have your favorite celebrity read you today's news. The possibilities are mind-blowing.
The capabilities are already quite impressive, but in the future, GenAI systems will be able to create pretty much anything that humans can. And this, in turn, means GenAI can turn anyone and everyone into an author, musician, computer programmer, filmmaker, or other type of creator.
AI is increasingly part of the world around us, including the search results you get on your phone, your conversations with Alexa, and the movie recommendations that Netflix serves up on an evening. So what makes GenAI different to these “traditional” AI tools? (I realize it sounds strange to refer to “traditional AI” when it's hardly been around that long, but I do so to distinguish between the AI that we're already used to in everyday life and this new evolution of AI systems. One technical term for traditional AI is “discriminative AI.”)
Traditional AI systems also learn from large amounts of data, but they deliver a different output. Traditional AI systems are used to make predictions based on existing data. And we use those predictions to help us make better decisions, at work and in everyday life. This could span anything from listening to new music on Spotify, and viewing recommended products on Amazon, to identifying which of your company's customers are most likely to buy a certain product.
This new wave of GenAI tools goes even further, by creating new content based on existing data. In other words, GenAI isn't just about simulating human cognitive processes like decision-making and problem-solving – it's about simulating human creativity.
To further illustrate the difference, imagine you're playing computer chess. The computer knows all the rules, can predict your moves, and makes its own moves. It's not inventing new chess moves; rather, it's selecting the right move based on existing strategies. That's traditional AI – it's like a master strategist who can make smart decisions within a set of rules. And it does it very well. But GenAI? Well, that could, in theory, come up with new ways of playing chess that we haven't invented yet.
So, traditional AI excels at pattern recognition, while GenAI excels at pattern creation. Which is very cool indeed.
That said, GenAI and traditional AI aren't mutually exclusive. They could be used in tandem to provide even more powerful solutions. For example, a traditional AI could analyze user behavior from your company's website, and then a GenAI could use the analysis to create personalized content for users.
Think of it like learning to draw by looking at many pictures. After looking at many pictures, you try to draw something new on your own. GenAI does something similar: it “looks” at a lot of data (text, visual, or other), learns the patterns within that data, and then tries to create something new that fits those patterns.
So, in very simple terms, GenAI is like an artist or writer that has studied lots of existing works and then tries to create its own work based on what it has learned. This process is powered by complex algorithms that mimic how our brain works in order to learn from data and identify patterns.
That's the super-simple explanation. Let's get a little more technical.
We'll talk more about the evolution of GenAI in Chapter 2, but as a quick primer, GenAI grew out of a field of AI study and practice called “machine learning” – indeed all of the AI we see today is based on machine learning. While traditional computer algorithms are coded by a human to tell a machine exactly how to do a particular job, machine-learning algorithms are able to make decisions based on what they learn from the data. The more data they're fed, the better they get at this process.
Another term you'll need to get to grips with is “neural networks,” as this is the core technology that GenAI is built upon. A neural network is essentially an advanced machine-learning model inspired by the way human brains work. While a less complex machine-learning model may need some human intervention in the process, a neural network has the ability to learn and make decisions by itself, and can even learn from its own errors – rather like the way a human learns through a process of trial and error.
Here's how a neural network works:
Training: neural networks learn through a process called “training.” During training, the neural network is fed a lot of data (which could be text, images, or whatever), and it learns to identify patterns and relationships in the data.
Learning: as the neural network is exposed to more and more data – and we really are talking about vast amounts of data – it gradually gets better at identifying patterns and understanding the underlying rules that govern the data.
Layers: neural networks are organized into layers, and each layer is responsible for identifying different types of patterns. The initial layers might identify simple patterns, and as we move deeper, the layers are able to identify more complex patterns.
Generative models: GenAI often uses specific types of neural networks called “generative models.” In addition to recognizing patterns, generative models are able to generate new data that is similar to (but not exactly the same as) the data they were trained on.
Input and output
:
once the generative model is trained, you can give it an input (e.g., a partial image or a text prompt), and it will generate an output (like a completed image or a piece of text) based on what it learned during training.
Randomness: GenAI introduces a certain amount of randomness in the generation process, which means that it can produce slightly different outputs each time, even when given the same input over and over again.
I just mentioned generative models, the neural networks that enable GenAI to create new content. Here are some examples of generative models used in GenAI applications:
Large language models (LLMs): by gobbling up large amounts of text, LLMs learn the semantic relationships between words and use that data to generate more language. An example of an LLM is GPT-4, created by OpenAI, which powers the ChatGPT tool.
Generative adversarial networks (GANs): these work by pitting two competing algorithms against each other, one tasked with generating data that resembles its training data and the other tasked with trying to tell whether the output is real or generated. This type of model is typically used to create images, sounds, or even video (including deepfakes).
Variational autoencoders: this is a type of model that learns how data is constructed. It's often used to generate synthetic data.
Diffusion models: these work by adding random data (known as “noise”) to the data it's learning about, and then figuring out how to remove it while preserving the original data. This way, the model learns what's important and what can be discarded. Diffusion models are most commonly used in image generation.
Foundation models: this is something of an umbrella term for models that are pre-trained on a broad range of data at an enormous scale and then fine-tuned for specific tasks. These are large-scale, general-purpose models that are the foundation of LLMs and other GenAI tools.
With different models able to do different things, it's clear that GenAI has a wide range of applications. Which brings us to…
In Part 2, we'll delve into how GenAI is already being used in practice (with lots of real-world examples), but for now, I really want to make the point that GenAI is capable of creating all sorts of content.
Of course, ChatGPT gets a lot of attention – and for good reason: it's an impressive tool, and creating text is a major application of GenAI. Which is probably why “ChatGPT” and “generative AI” have almost become synonymous terms. But GenAI is capable of so much more than writing.
Take Coca-Cola's Masterpiece advert, for example. A collaboration between human artists and GenAI, the ad brings many of history's greatest artworks to life on the screen in a way that's never been done before. GenAI has also been used to create a new Beatles song by rebuilding partially recorded lyrics by John Lennon, combined with new material by Paul McCartney.
But GenAI isn't just relevant to creative content. It's also being used to speed up drug discovery, with one UK company recently announcing that it's created the world's first AI-generated immunotherapy cancer treatment.
Then there's “generative design” – an emerging field where GenAI is used to create designs and production processes for new products. In one instance, General Motors used generative tools created by Autodesk to design a new seatbelt bracket that's 40% lighter and 20% stronger than its existing components.
And we can't talk about GenAI without talking about “deepfakes,” which blur the lines between reality and fiction by making it appear as if real people have done or said fake things. GenAI is the technology behind deepfakes, and it's only going to get better at creating fake – but incredibly realistic – content. Deepfake Tom Cruise was one of the earliest and most famous examples (Google it if you haven't seen it – it's uncanny). More insidiously, potential candidates on both sides of the 2024 US presidential elections have starred in deepfakes designed to discredit them.
As if spreading false information wasn't bad enough, there are also outright criminal applications of GenAI, including attempts to extort money by staging hoax kidnappings using cloned voices, and, in one memorable example, fraudulently scamming money by posing as a company CEO.
As the technology has become more sophisticated, GenAI's capabilities have expanded greatly. To summarize, GenAI can be used to generate:
Text
Images
Video
Music and sound
Designs
Data analysis
Computer code
Synthetic data
Research
Video games and virtual worlds
Let's explore each usage in a little more detail.
Alongside ChatGPT, there are other generative text tools like Google's Bard and Meta's Llama. These tools can be used to write anything from essays and articles, to plays, poems, and novels.
Here are some of the incredible things that generative text tools can do for you:
Writing assistants: helping you draft emails, reports, and other written materials efficiently.
Creative writing: assisting authors in coming up with unique story ideas, characters, plotlines, or even drafting entire narratives.
World-building: aiding authors in creating vivid and immersive worlds for their narratives, with tools that help outline histories, geographies, and cultures of fictional settings.
Writing prompts: offering a variety of writing prompts to help writers overcome creative blocks and generate new ideas.
Grammar and style corrections: offering real-time suggestions for improving the grammatical accuracy and stylistic fluency of written content.
Translation assistance: help with translating texts to different languages while retaining the nuances of the original content.
Accessibility: providing features like speech-to-text to help people with disabilities draft texts.
Song writing: creating song lyrics, even in the style of your favorite artist.
Recipe writing: coming up with recipes that suit your parameters (e.g., “a dinner recipe that uses butternut squash, feeds a family of four, and takes no longer than 45 minutes”).
Plain-English explanations and text summaries: helping you understand a complex topic in plain English, or providing a succinct summary of a long piece of text.
Multiple-choice questions: generating quiz or test questions for a variety of topics, to beginner, intermediate, or advanced levels.
Travel itineraries: aiding you with everything from finding flights and accommodations to suggesting popular attractions to visit. Even where to get a good meal, based on your dietary needs and budget.
Many GenAI tools – such as Midjourney or Stable Diffusion – can take a natural language prompt (i.e., regular, conversational language) and use it to generate a picture. Tell it you want an image of a two-headed dog wearing an Elvis costume flying a spaceship into a black hole and watch as it (or something close to it) appears before your eyes.
This can be useful for:
Art creation: assisting artists in generating unique artworks by understanding and mimicking different art styles.
Style transfer: enabling artists to apply the styles of famous artworks to their creations.
Automated sketching: assisting in automatically generating sketches based on the descriptions provided, aiding artists in the initial stages of their work.
While not yet as advanced as text or image generation, tools are beginning to emerge that allow us to create and edit video simply by describing what we want to see.
Typical uses include:
Enhanced editing: automating intricate editing tasks including cut detection, color correction, and subtitle generation, thereby streamlining the post-production process.
Dynamic special effects: allowing real-time rendering of high-quality special effects, motion tracking, and 3D modeling, significantly reducing the time and effort involved in post-production.
3D modeling: facilitating the creation of 3D models and environments for integration into videos, helping to create immersive visual experiences.
Sound design and synthetic voice: enhancing audio experiences through AI-generated music compositions and synthetic voices. Which brings us to…
GenAI tools can create human-like voices (voice synthesis), allowing computers to speak words. They can also create music and sound effects.
In practice, this can be used for:
Harmony suggestions: proposing harmonious chord progressions and melody lines.
Generative music: creating entire musical pieces automatically based on a set of input parameters.
Mood-based creation: offering tools that help composers create music tailored to specific moods or emotions.
Voice synthesis: generating realistic voices that can read text naturally, providing a human-like auditory experience.
Language and accent variability: offering a wide range of language and accent options, helping to create diverse and inclusive auditory experiences.
Customizable voices: providing tools to create customizable voices, allowing users to personalize their auditory experiences.
GenAI offers huge potential for graphic designers and product designers to streamline the design process – from automating aspects of visual design to creating multiple product design variations.
Some of the ways this may prove useful include:
Design suggestions: aiding designers by providing design suggestions.
Color palette suggestions: providing suggestions for color palettes that work well.
Stock image integration: offering seamless integration with stock image libraries, giving designers easy access to a wide range of resources.
Architectural design: assisting architects in generating building designs optimized for specific criteria like materials cost, energy efficiency, and spatial requirements.
Product design: helping designers to come up with optimized and innovative product designs by exploring a vast number of design alternatives in a short time.
Fashion design: assisting fashion designers by proposing new clothing designs based on current trends, historical data, and designer inputs.
With GenAI tools, you don't need to be a data scientist to interrogate data. From generating automated reports to providing useful predictions, there are lots of ways GenAI can help businesses get the most out of data.
In terms of automated reporting, GenAI offers:
Real-time reporting: offering tools that generate reports in real-time, providing up-to-date information for decision-making.
Visual representation: creating visual representations of data, such as graphs and charts, to make reports more understandable and engaging.
Natural language summaries: generating summaries of reports in easy-to-understand language.
And in terms of predictive analysis, GenAI can deliver:
Anomaly detection: identifying unusual patterns in data that may indicate potential issues or opportunities.
Forecasting and customer insights: utilizing historical data to make accurate forecasts about future trends, and identify customer preferences.
Simulation and modeling: creating detailed simulations based on data analysis, allowing organizations to explore potential scenarios and make better decisions.
As well as ChatGPT, tools like Microsoft's GitHub Copilot and Amazon's CodeWhisperer make it easy for anyone to generate computer code with very little technical knowledge.
This can span:
Code generation: creating code based on high-level descriptions or partially completed code.
Code review: assisting in reviewing codes by identifying bugs, security vulnerabilities, and providing suggestions for optimization.
Automated testing: creating tests automatically based on the software's specifications, helping to improve the software's reliability.
GenAI makes it easy to create entirely synthetic data for use in training other AI models. And for everyday businesses, synthetic data can help them overcome the privacy and data security obligations that come with real-world data.
In practice, this may include:
Data augmentation: creating additional data through techniques like image augmentation to enhance the training of AI models.
Privacy preservation: generating synthetic data that preserves the privacy of individuals by not using real personal data.
Balanced datasets: creating balanced datasets that help in reducing biases in AI models, promoting more fair and equitable AI systems.
Diverse data: generating diverse datasets that help researchers explore a wide range of scenarios and conditions.
There are many ways GenAI can aid the research process, with a few notable examples being:
Drug discovery: speeding up the drug discovery process by predicting potential drug candidates through the analysis of vast datasets.
Simulation environments: creating realistic simulation environments based on synthetic data, aiding researchers in conducting experiments and studies.
Rare event modeling: assisting in modeling rare events through synthetic data, helping researchers explore scenarios that are difficult to study with real data.
From virtual reality (VR) environments and immersive video game worlds to the wider concept of the metaverse, we're increasingly interacting with immersive, highly realistic content. Designing such content is a complex task that can be accelerated with the help of GenAI. Here's how:
Procedural generation and design: creating intricate environments, realistic terrains, and adaptive storylines, fostering unique and immersive gameplay experiences.
Character design and animation: generating natural character animations, and crafting consistent character personalities to enhance storytelling and immersion.
Personalized user experiences: tailoring games and immersive environments to individual preferences and behaviors, fostering a more personalized and engaging user journey.
Automated content moderation and security: monitoring and maintaining the safety of immersive environments, using automated systems to detect and manage inappropriate content.
GenAI is clearly capable of amazing things and has the potential to transform business and everyday life. I'll explore examples from specific sectors across Part 2, but for a broader overview, turn to Chapter 3.
The rapid evolution of GenAI is also forcing us to confront some difficult questions, and fast. For example, what happens when we reach a point where it's impossible to tell the difference between what's real content and what is generated by AI? There are no easy answers right now, but we'll do our best to delve into such thorny issues in Chapter 4.
And then there's the question of how all this will affect human jobs, as machines take on increasingly creative work. That's the subject of Chapter 5.
In this chapter, we've learned:
GenAI is a groundbreaking type of AI that is able to create new content based on patterns it has learned from existing data.
While traditional AI simulates human cognitive processes like decision-making, GenAI simulates
human creativity
.
GenAI is made possible by neural networks – an advanced machine-learning model inspired by the way human brains work.
ChatGPT may have made GenAI famous, but GenAI is much bigger than ChatGPT. GenAI can be used to create music, sound, and video, write computer code, design products, create immersive worlds, automate data analysis and reporting, and more.
Before we move on to the impact of GenAI and the issues surrounding it, let's briefly explore the evolution of GenAI, from the early days of computing to now.
We've reached the point where AI is no longer just an executor of tasks or a decision-making tool, but also an inventor. This is an age where AI can be a creative companion, capable of producing original outputs that can inspire, assist, and even amaze us. In the next chapter, we'll look at how this development will shape our world. But to navigate the future of GenAI, we need to understand its past. How did we get to this point? How far back do the roots of GenAI stretch? (You might be surprised to discover just how far.) And how is GenAI's evolution inextricably linked with other fast-developing technologies?
Come with me as I take you on a tour through the history of GenAI, and explore the key milestones that have brought us to this point.
Scientists and researchers first introduced the concept of AI and machine learning in the 1950s, which goes to show that humans have been fascinated by the concept of intelligent machines for a long, long time.