An
introduction to a broad range of topics in deep learning, covering
mathematical and conceptual background, deep learning techniques used in
industry, and research perspectives.“Written by three experts in the field, Deep Learning is the only comprehensive book on the subject.”
—Elon Musk, cochair of OpenAI; cofounder and CEO of Tesla and SpaceX
Deep
learning is a form of machine learning that enables computers to learn
from experience and understand the world in terms of a hierarchy of
concepts. Because the computer gathers knowledge from experience, there
is no need for a human computer operator to formally specify all the
knowledge that the computer needs. The hierarchy of concepts allows the
computer to learn complicated concepts by building them out of simpler
ones; a graph of these hierarchies would be many layers deep. This book
introduces a broad range of topics in deep learning.
The
text offers mathematical and conceptual background, covering relevant
concepts in linear algebra, probability theory and information theory,
numerical computation, and machine learning. It describes deep learning
techniques used by practitioners in industry, including deep feedforward
networks, regularization, optimization algorithms, convolutional
networks, sequence modeling, and practical methodology; and it surveys
such applications as natural language processing, speech recognition,
computer vision, online recommendation systems, bioinformatics, and
videogames. Finally, the book offers research perspectives, covering
such theoretical topics as linear factor models, autoencoders,
representation learning, structured probabilistic models, Monte Carlo
methods, the partition function, approximate inference, and deep
generative models.
Deep Learning
can be used by undergraduate or graduate students planning careers in
either industry or research, and by software engineers who want to begin
using deep learning in their products or platforms. A website offers
supplementary material for both readers and instructors.