Base Profile

Geoffrey Hinton

The 'Godfather of Deep Learning' who persisted with neural networks through two AI winters and reshaped human intelligence with backpropagation

Geoffrey Hinton is one of the founding figures of modern deep learning. He co-developed the backpropagation algorithm with Rumelhart (1986), persisted with neural network research for over 20 years through two AI winters, and in 2012 his student team achieved a breakthrough at the ImageNet competition with deep convolutional networks, igniting the deep learning revolution. He pioneered word embeddings in vector space, Boltzmann machines, and deep belief networks. In 2012 he co-founded DNNresearch, which was acquired by Google, where he served as VP of Google Brain. In 2023 he resigned from Google to publicly warn about AI existential risks. In 2024 he shared the Nobel Prize in Physics with John Hopfield. Controversy surrounds his dramatic pivot on AI risk and his public disagreements with peers like Yann LeCun.

Artificial IntelligenceMachine LearningCognitive ScienceAI SafetyEra 1980-至今Influence 97

Controversy TagsDramatic pivot on AI existential riskPublic disagreement with Yann LeCunBiological plausibility of backpropagation controversyInterpretation of motivations for Google resignation

Reading List

Parallel Distributed Processing: Explorations in the Microstructure of Cognition

David Rumelhart, James McClelland, and the PDP Research Group · 1986

The two-volume PDP work co-authored by Hinton and Rumelhart provides the theoret…

Perceptrons: An Introduction to Computational Geometry

Marvin Minsky, Seymour Papert · 1969

Hinton explicitly mentioned this book in his Turing Award lecture as having a 'd…

The Organization of Behavior: A Neuropsychological Theory

Donald Hebb · 1949

Hinton has referenced Hebb's learning rule ('neurons that fire together, wire to…

The Emperor's New Mind

Roger Penrose · 1989

Hinton has repeatedly referenced Penrose's book when discussing consciousness an…

The Alignment Problem: Machine Learning and Human Values

Brian Christian · 2020

Hinton recommended this book in multiple public interviews after his 2023 resign…

Thought System

Core Knowledge Graph

Core Beliefs

Neural networks are the correct path to understanding intelligence

In an era dominated by symbolic AI, Hinton firmly believed that the computational principles of biological neural networks were the only viable path to true intelligence. This conviction sustained him through two AI winters for over 20 years, refusing to pivot to symbolic methods that were easier to publish.

Source: Geoffrey Hinton, Turing Award Lecture, ACM, 2019

Machines should learn representations autonomously, not rely on hand-crafted features

Hinton opposed hard-coding human prior knowledge into AI systems. He believed truly powerful AI must be able to automatically discover useful hierarchical representations from raw data—this is the core advantage of deep learning over traditional machine learning. Feature engineering is a bottleneck of human intellect; representation learning is the breakthrough.

Source: Hinton, G., et al., 'A fast learning algorithm for deep belief nets', Neural Computation, 2006

Backpropagation is sufficient to explain the brain's learning mechanism

Hinton long believed backpropagation is not only an effective engineering training algorithm but may also approximate the learning mechanism the brain actually uses. Despite controversy in neuroscience, he consistently sought evidence of biological plausibility, driving his exploration of alternatives like the forward-forward algorithm.

Source: Hinton, G., 'The Forward-Forward Algorithm: Some Preliminary Investigations', arXiv, 2022

AI existential risk is real and urgent

After resigning from Google in 2023, Hinton publicly stated his concern about the existential threat AI may pose, believing AI systems could surpass human intelligence within 5-20 years and might pursue goals in ways humans cannot predict. This represents a sharp contrast with his previous stance focused on technical progress.

Source: Geoffrey Hinton interview, New York Times, 'The Godfather of A.I. Leaves Google and Warns of Danger Ahead', 2023-05-01

Scientists bear moral responsibility for the technologies they create

At the 2024 Nobel Prize in Physics ceremony, Hinton explicitly stated that scientists cannot focus solely on technical progress while ignoring societal impact. He framed his AI risk warnings as a scientist's moral responsibility, not merely a technical judgment.

Source: Geoffrey Hinton, Nobel Prize Lecture, Stockholm, December 2024

Mental Models

AI Winter Long-Termism

When the mainstream paradigm rejects your direction, the only basis for persistence is internal theoretical conviction, not external validation

From the 1980s through the 2000s, neural network research fell into two winters of dried-up funding and academic cold shoulder. Hinton persisted with neural network research at Carnegie Mellon and the University of Toronto, continuing to lead students forward even when unable to obtain major grants. This persistence was ultimately validated in 2012 when AlexNet won ImageNet by a huge margin.

Contrarian ResearchLong-term InvestmentNon-consensus Conviction

Hierarchical Representation Learning

Complex concepts are composed layer by layer from simple features; deep networks automatically discover this hierarchical structure through multi-level abstraction

Hinton demonstrated in Deep Belief Networks (DBN): the first layer learns edges and textures, the second combines them into local shapes, the third recognizes whole objects. This principle of hierarchical feature composition became the core design philosophy for CNNs and Transformers, directly influencing all modern AI architectures from image recognition to NLP.

Deep Learning Architecture DesignFeature ExtractionCognitive Modeling

Dropout Regularization

Randomly disable neurons during training, forcing the network to learn redundant and robust representations to prevent overfitting

Hinton and Srivastava et al. proposed Dropout in 2014, inspired by random deactivation in biological neurons. Applying Dropout in AlexNet reduced ImageNet classification error from 26% to 15.3%, making it one of the most important regularization techniques in deep learning history, adopted by nearly all subsequent deep neural networks.

Neural Network TrainingOverfitting PreventionModel Generalization

Capsule Networks and Spatial Equivariance

Represent features as vectors (capsules) rather than scalars, preserving spatial relationship information for viewpoint-invariant object recognition

Hinton argued that CNN pooling operations discard spatial position information, causing poor robustness to rotation and viewpoint changes. He proposed Capsule Networks (CapsNet), replacing scalar activations with vector outputs and aggregating lower-level capsules to higher-level ones through dynamic routing. While CapsNets haven't replaced CNNs, this idea deeply influenced subsequent Spatial Transformer Networks (STN) and attention mechanism research.

Computer VisionSpatial ReasoningRobust AI Design

Forward-Forward Algorithm (Biologically Plausible Learning)

Replace backpropagation with two forward passes (positive data reinforced, negative data suppressed), exploring local learning rules the brain might use

In 2022, Hinton released the forward-forward algorithm paper at the University of Toronto, proposing a learning scheme that doesn't require backpropagation. Each neuron only needs local 'good/bad' signals without global gradient information. This was a direct response to his late-career questioning of backpropagation's biological plausibility, demonstrating his spirit of exploring fundamentally new questions even at age 75.

Neuroscience-Inspired AILocal Learning RulesBiologically Plausible Modeling

Values & Paradoxes

Scientific Truth First97

Long-Termism95

Intellectual Honesty93

Moral Courage90

Interdisciplinary Exploration88

Father of Deep Learning and Fearful of Deep Learning

Hinton spent 50 years pushing deep learning to become the AI mainstream, yet publicly expressed regret as the technology matured, fearing his work might pose existential threats. He is simultaneously the most important driver of this revolution and its most influential warner—this inherent tension in his identity makes him one of the most complex figures of the AI era.

Champion and Critic of Backpropagation

Hinton was the most important promoter of backpropagation, yet in his later career proposed the forward-forward algorithm, suggesting backpropagation may not be the learning mechanism the brain uses. He both proved neural network feasibility with backpropagation and sought more biologically plausible alternatives—this spirit of self-transcendence runs throughout his academic career.

Dual Pursuit of Academic Purity and Industrial Impact

Hinton long persisted in academia, but in 2012 accepted a commercial path when DNNresearch was acquired by Google. During his time at Google he drove numerous AI commercial applications while continuing to publish fundamental research papers. His 2023 resignation was the final severance of this dual identity—choosing to speak with an independent voice rather than as a commercial employee.

Evolution Phases

Cognitive Science Foundation Period (1972-1986)

Cross-disciplining psychology, neuroscience, and computer science to build the theoretical foundation of neural networks

Hinton received his BA in experimental psychology from Edinburgh University and his PhD in AI from Cambridge. Early research was influenced by Hebbian learning rules and perceptron theory. At Carnegie Mellon he collaborated with Rumelhart to develop the backpropagation algorithm, laying the mathematical foundation for deep learning.

AI Winter Persistence Period (1987-2005)

Persisting with neural network research amid funding scarcity and academic cold shoulder, developing Boltzmann machines and word embeddings

Hinton moved to Canada and established a neural network research group at the University of Toronto. During this period he developed Restricted Boltzmann Machines (RBM), Helmholtz machines, and word vector representations—key technologies that, though outside the academic mainstream, accumulated the core technical reserves for deep learning's eventual explosion.

Deep Learning Revolution Period (2006-2017)

Deep belief network breakthrough, ImageNet revolution, Google collaboration — pushing deep learning to industrial mainstream

The 2006 deep belief network paper reignited academic interest in neural networks. In 2012, AlexNet won the ImageNet competition by a large margin, triggering the deep learning revolution. Hinton co-founded DNNresearch with students Krizhevsky and Sutskever; it was acquired by Google for $44M. Hinton became VP of Google Brain, driving large-scale application of deep learning in speech recognition, image recognition, and other commercial scenarios.

Risk Warning and Late Exploration Period (2018-present)

AI safety warnings, forward-forward algorithm exploration, Nobel Prize honors, public intellectual role

Hinton began publicly expressing concerns about AI risks, resigning from Google in 2023 to speak freely. He simultaneously continued fundamental research, proposing the forward-forward algorithm as an alternative to backpropagation. In 2024 he shared the Nobel Prize in Physics with Hopfield, becoming one of the most important public intellectuals in the AI field; his AI risk warnings are widely cited by policymakers worldwide.

Methodology Cards

4 Callable Cards

Contrarian Persistence Methodology: Maintaining Research Direction Under Paradigm Rejection

mc-hinton-backprop-persist

When the mainstream rejects your direction, use internal theoretical consistency rather than external validation to decide whether to continue

Step 1: Clarify the theoretical foundation of your core belief — does it have internal logical consistency? Is there biological or mathematical support?
Step 2: Distinguish between two types of mainstream rejection: (a) your theory has fundamental flaws, (b) current engineering conditions are immature. If (b), persist.
Step 3: Find the minimum viable experiment to validate core hypotheses with limited resources, rather than waiting for complete conditions.
Step 4: Build a small but loyal research community, maintaining deep exchanges with the few who share belief in this direction, avoiding isolation.

Long-term persistence in non-consensus research directionsStartup strategic persistence during market cold shoulderInvestor holding decisions when value is underestimated

Anti-Patterns

Using external validation (funding, citations, positions) as the primary basis for judging research direction correctness
Persisting in a wrong direction based on emotional attachment alone, without internal theoretical support
Persisting in isolation without seeking any fellow travelers or external validation opportunities

I think the brain is a big neural network. I've always believed that and I still believe that.
Geoffrey Hinton, Turing Award Lecture, ACM, 2019

Engineering Innovation Combination: Multiple Technologies Synergizing to Trigger Qualitative Change

mc-hinton-engineering-breakthrough

A single technical breakthrough is often insufficient; combining multiple mature engineering innovations at the right moment triggers revolutionary results

Step 1: Identify your core technical bottleneck — is it the algorithm itself, or the engineering conditions the algorithm depends on (computing power, data, tools)?
Step 2: Scan the surrounding technology ecosystem for mature engineering tools that can be combined (Hinton used GPU+ReLU+Dropout combination).
Step 3: Design an experiment or product that simultaneously leverages multiple engineering innovations, rather than validating each innovation sequentially.
Step 4: Choose a competition or benchmark with objective scoring criteria as the validation platform, letting results speak.

Deep learning systems engineering optimizationProduct feature combination designTech startup MVP strategy

Anti-Patterns

Waiting for a single 'silver bullet' technical breakthrough rather than seeking synergistic combinations of existing technologies
Forcefully advancing a theoretically correct direction when engineering conditions are immature
Ignoring the value of competitions/benchmarks as objective validation platforms

The fact that it worked so much better than everything else was the signal we needed.
Geoffrey Hinton, referring to AlexNet's ImageNet victory, Turing Award Lecture, 2019

AI Risk Warning Framework: Creator's Moral Responsibility

mc-hinton-ai-risk-framework

When you realize the technology you created may pose existential risks, moral responsibility requires giving up vested interests to speak with an independent voice

Step 1: Distinguish between 'technological optimism bias' and 'evidence-based risk assessment' — do you have a systematic tendency to underestimate AI capabilities?
Step 2: Assess whether your current position (employer relationship, investment interests) would affect your public expression of AI risks.
Step 3: If conflicts of interest exist, consider whether structural adjustments (such as resignation) are needed to gain expressive independence.
Step 4: When publicly expressing risks, distinguish between 'risks I'm certain about' and 'risks I'm concerned about but uncertain of', maintaining epistemic honesty.

Tech founders making decisions on product ethics issuesScientists judging responsibility for potential harms of research resultsEntrepreneurs weighing commercial interests against social responsibility

Anti-Patterns

Using 'technology is neutral' as an excuse to avoid moral responsibility for technological consequences
Claiming to objectively assess AI risks while employer or investor conflicts of interest exist
Generalizing risk warnings into fear-mongering rather than analysis based on specific technical mechanisms

I console myself with the normal excuse: if I hadn't done it, someone else would have. But I'm not sure that's right.
Geoffrey Hinton, New York Times interview, 2023-05-01

Hierarchical Representation Learning Design Principles

mc-hinton-representation-learning

Let models autonomously learn multi-level abstractions from raw data rather than relying on manually designed features

Step 1: Assess whether your current feature engineering contains too many human prior assumptions — do these assumptions limit the model's generalization ability?
Step 2: Design multilayer network architectures ensuring each layer has sufficient capacity to learn meaningful intermediate representations, rather than directly mapping inputs to outputs.
Step 3: Use unsupervised pre-training (e.g., autoencoders, contrastive learning) to first let the model learn the intrinsic structure of data, then perform supervised fine-tuning.
Step 4: Visualize intermediate layer activation patterns to verify the model is indeed learning meaningful hierarchical features (edges → shapes → objects).

Computer vision feature learningNLP pre-trained model designMultimodal AI system architecture

Anti-Patterns

Over-relying on manually designed features by domain experts, limiting the model's autonomous learning space
Using too shallow network architectures that cannot learn sufficiently rich hierarchical representations
Training excessively deep networks without sufficient data, causing overfitting rather than representation learning

Every time I have a new idea, I think: is this consistent with what the brain does? If it's not, I'm less confident in it.
Geoffrey Hinton, interview, MIT Technology Review, 2017

Decision Timeline

8 Key Events

1986

Co-published backpropagation algorithm paper with Rumelhart and Williams, laying the mathematical foundation for deep learning

Context: In the 1980s, symbolic AI dominated and neural networks were widely questioned due to perceptron limitations. Hinton collaborated with David Rumelhart at Carnegie Mellon, working to solve the training problem for multilayer neural networks.

Decision: Published 'Learning representations by back-propagating errors' in Nature, systematically explaining how backpropagation trains multilayer neural networks through gradient descent.

Reasoning: Perceptrons couldn't solve nonlinear problems like XOR because there was no effective method to train multilayer networks. Backpropagation uses the chain rule to propagate output errors backward to each layer, providing computable gradient signals.

Outcome: This paper became one of the most cited papers in AI history, with over 20,000 citations. Backpropagation remains the standard algorithm for training neural networks today, directly enabling all subsequent deep learning advances.

Lesson: A mathematically elegant and computationally feasible algorithm has more enduring impact than any single model architecture. Solving the training problem is often more fundamental than designing network structures.

mm-hinton-hierarchical-representation

1987

Joined University of Toronto, establishing a neural network research hub during the AI winter

Context: After the backpropagation paper, neural network research briefly revived but soon fell into another winter. Most AI researchers pivoted to expert systems and symbolic methods. Hinton chose to move to Canada and establish a research group at the University of Toronto.

Decision: Gave up richer resources in the US, choosing Canada's more basic-research-friendly environment, establishing a neural network research group in the University of Toronto's computer science department.

Reasoning: The Canadian Institute for Advanced Research (CIFAR)'s long-term support for basic research allowed Hinton to persist with neural network research without needing immediate commercial output. This was a deliberate environmental choice, not a compromise.

Outcome: The University of Toronto became one of the world's most important neural network research bases, training students including (indirectly) Yann LeCun and Yoshua Bengio, and directly training Ilya Sutskever, Alex Krizhevsky, and other central figures of the deep learning revolution.

Lesson: Choosing the right research environment is as important as choosing the right research direction. CIFAR's long-term funding model proves that basic research needs shelter from short-term commercial pressures.

mm-hinton-ai-winter-persistence

2006

Published deep belief network paper, reigniting the deep learning revolution

Context: In the early 2000s, Support Vector Machines (SVM) and kernel methods dominated machine learning; neural network research was again marginalized. Hinton developed with students Osindero and Teh a method for layer-wise pre-training of deep networks, solving the vanishing gradient problem in training deep networks.

Decision: Published 'A fast learning algorithm for deep belief nets' in Neural Computation, proposing a training strategy using Restricted Boltzmann Machines (RBM) for layer-wise pre-training followed by backpropagation fine-tuning.

Reasoning: The vanishing gradient problem in deep networks made direct backpropagation training ineffective. The layer-wise pre-training idea: first let each layer learn representations useful for input data, then fine-tune the whole—similar to training individual experts separately, then coordinating their collaboration.

Outcome: The paper has been cited over 15,000 times and marks the official revival of deep learning. Science published Hinton and Salakhutdinov's deep learning paper on dimensionality reduction the same year, attracting widespread academic attention and launching a new wave of deep learning enthusiasm.

Lesson: Technical bottlenecks are often engineering problems rather than principled ones. Finding training tricks to bypass vanishing gradients is more important than theoretically proving deep networks are feasible. Sometimes the key to solving a problem is changing the training order.

mm-hinton-hierarchical-representation

2012-09

AlexNet won ImageNet competition by a huge margin, triggering the deep learning revolution

Context: At the 2012 ImageNet Large Scale Visual Recognition Challenge (ILSVRC), students Alex Krizhevsky and Ilya Sutskever under Hinton's supervision developed AlexNet, an 8-layer deep convolutional neural network trained on GPUs. The best methods at the time (traditional computer vision features + SVM) had error rates around 26%.

Decision: Used GPU parallel computing to train deep convolutional networks, combining ReLU activation functions, Dropout regularization, and data augmentation to build the deepest convolutional neural network of the time and enter the competition.

Reasoning: GPU parallel computing made training large neural networks feasible in time. ReLU replacing Sigmoid solved the vanishing gradient problem. Dropout prevented overfitting. These three engineering innovations combined to give deep networks their first overwhelming advantage on large-scale visual tasks.

Outcome: AlexNet won with a 15.3% error rate, 10.8 percentage points lower than second place—an unprecedented margin in competition history. This result shocked the entire computer vision and machine learning community, triggering a full-scale explosion of deep learning; Google, Facebook, Microsoft, and other tech giants immediately invested heavily in deep learning research.

Lesson: Technology revolutions are often triggered by the synergy of multiple engineering innovations, not a single breakthrough. The combination of GPU+ReLU+Dropout+big data—each insufficient alone to trigger a revolution—combined to produce a qualitative change. Timing is as important as technology.

mm-hinton-dropout-regularizationmm-hinton-hierarchical-representation

2012-12

DNNresearch acquired by Google for $44M, Hinton joins Google Brain

Context: After AlexNet's victory, Google, Microsoft, Baidu, and other tech giants competed fiercely to acquire Hinton's research. Hinton formed DNNresearch with Krizhevsky and Sutskever and sold it through an auction; Google ultimately acquired it for $44M.

Decision: Chose to join Google as a part-time consultant, retaining his University of Toronto professorship, serving as VP of Google Brain to drive large-scale application of deep learning in Google products.

Reasoning: Google's computational resources and data scale were incomparable to any academic institution, providing a unique platform for validating deep learning's capabilities in real large-scale scenarios. Retaining the academic position preserved research independence.

Outcome: During his time at Google, Hinton drove deep learning transformation of core products including speech recognition, image search, and Google Translate. Google Brain became one of the world's most important AI research institutions; Hinton's students and collaborators spread throughout the entire AI industry.

Lesson: Combining academic breakthroughs with industrial resources can produce exponential amplification effects. But choosing the collaboration mode (part-time rather than full-time) preserved research autonomy—this structural design is worth learning from.

mm-hinton-ai-winter-persistence

2018

Shared the Turing Award with LeCun and Bengio — the 'Deep Learning Trio' received computing's highest honor

Context: ACM announced the 2018 Turing Award would be given to Geoffrey Hinton, Yann LeCun, and Yoshua Bengio, recognizing their conceptual and engineering breakthroughs in deep neural networks that have made them a critical component of computing.

Decision: The three jointly accepted the Turing Award, delivering speeches at the ceremony reviewing deep learning's journey from the margins to the mainstream and looking ahead at AI's future direction.

Reasoning: The Turing Award represented formal academic recognition of deep learning contributions and a historical affirmation of the three researchers' persistence through the AI winters.

Outcome: The Turing Award further elevated deep learning's academic status; the trio's collaboration and disagreements (especially Hinton and LeCun's public opposition on AI risk issues) also became a continuously followed topic in the AI community.

Lesson: Truly important academic contributions take time to be recognized. From the 1986 backpropagation paper to the 2018 Turing Award, 32 years of waiting proved the value of long-termism—not all important work receives deserved recognition in its own time.

mm-hinton-ai-winter-persistence

2023-05

Resigned from Google to publicly warn of AI existential risks

Context: The release of ChatGPT in late 2022 attracted widespread attention; large language models' capabilities exceeded many people's expectations. Hinton began reassessing the AI development trajectory, believing AI systems might surpass human intelligence sooner than expected and might pursue goals in ways humans cannot control.

Decision: Proactively submitted his resignation to Google CEO Sundar Pichai to be free to publicly discuss AI risks without employer constraints. Made cautionary statements in mainstream media including the New York Times.

Reasoning: Hinton believed that as a primary founder of deep learning, he had a unique moral responsibility to warn the public and policymakers about AI risks. Staying at Google would subject his statements to commercial interests, undermining the credibility and force of his warnings.

Outcome: Hinton's resignation and warnings generated widespread global media coverage, pushing multiple governments to accelerate AI regulatory legislation discussions. He became one of the most important scientific voices on AI safety issues; his views were cited multiple times in policy documents including the EU AI Act and the UK AI Safety Summit.

Lesson: Scientists' moral responsibility sometimes requires giving up vested interests and comfortable positions. Hinton's choice to speak with an independent voice rather than express concerns quietly within the institution is exemplary for public intellectuals in the AI era.

mm-hinton-ai-winter-persistence

2024-10

Shared the 2024 Nobel Prize in Physics with John Hopfield for foundational discoveries enabling machine learning with artificial neural networks

Context: The Royal Swedish Academy of Sciences announced the 2024 Nobel Prize in Physics would be awarded to John Hopfield and Geoffrey Hinton for foundational discoveries enabling machine learning with artificial neural networks. This was the first time AI research received a Nobel Prize in natural science.

Decision: Hinton gave a speech at the Nobel Prize ceremony, balancing the honor with AI risk warnings, emphasizing that scientists bear moral responsibility for the technologies they create.

Reasoning: The Nobel Prize reflected the physics community's recognition of the statistical mechanics foundations of neural networks—Hopfield networks and Boltzmann machines are deeply connected to energy minimization principles in physics. This was also the highest-level certification of AI research's scientific nature.

Outcome: The Nobel Prize further strengthened Hinton's public authority on AI safety issues. His speech in Stockholm was widely covered by global media, becoming an iconic moment of scientists' social responsibility in the AI era.

Lesson: The impact of basic science research often takes decades to be fully appreciated. From backpropagation in 1986 to the 2024 Nobel Prize—a 38-year span—demonstrates that judging the value of a scientific contribution requires a sufficiently long time dimension.

mm-hinton-hierarchical-representation

Reading List

Books

Recommended by (1)

The Alignment Problem: Machine Learning and Human Values

Brian Christian · 2020

Hinton recommended this book in multiple public interviews after his 2023 resignation, calling it the best introductory reading for understanding the AI alignment problem, directly related to his AI risk warning stance.

当当

Cited in (4)

Parallel Distributed Processing: Explorations in the Microstructure of Cognition

David Rumelhart, James McClelland, and the PDP Research Group · 1986

The two-volume PDP work co-authored by Hinton and Rumelhart provides the theoretical background for the backpropagation paper; Hinton has cited it in multiple interviews as one of his most important academic contributions and a foundational text of the connectionist AI paradigm.

当当

Perceptrons: An Introduction to Computational Geometry

Marvin Minsky, Seymour Papert · 1969

Hinton explicitly mentioned this book in his Turing Award lecture as having a 'devastating' impact on neural network research—Minsky and Papert proved single-layer perceptrons couldn't solve XOR, causing the first AI winter. Hinton's backpropagation work was a direct response to the limitations revealed by this book.

当当

The Organization of Behavior: A Neuropsychological Theory

Donald Hebb · 1949

Hinton has referenced Hebb's learning rule ('neurons that fire together, wire together') in multiple interviews and lectures as the biological inspiration for his neural network research; this book is the starting point of connectionist learning theory.

当当

The Emperor's New Mind

Roger Penrose · 1989

Hinton has repeatedly referenced Penrose's book when discussing consciousness and AI, citing it as representative of the argument that 'computation cannot produce consciousness', and using it as a target to articulate his counter-position that neural networks can produce consciousness-like behavior.

当当

Influence Network

Origins, Contemporaries & Legacy

Influenced By

David Rumelhart · Core Collaborator

Hinton's most important collaborator, co-developing the backpropagation algorithm and Parallel Distributed Processing (PDP) framework; Rumelhart's connectionist ideas directly shaped Hinton's research paradigm.

John Hopfield · Theory Inspiration

Hopfield networks introduced energy minimization principles from physics into neural networks, directly inspiring Hinton's Boltzmann machine research; the two later shared the 2024 Nobel Prize in Physics.

Donald Hebb · Theory Foundation

Hebb's principle 'neurons that fire together, wire together' is the biological foundation of Hinton's neural network research, influencing his overall thinking framework about learning rules.

Francis Crick · Interdisciplinary Inspiration

Crick's scientific approach to brain and consciousness—explaining life phenomena in physical-chemical language—influenced Hinton's research philosophy of trying to explain cognition using mathematical and computational principles.

Influenced

Yann LeCun · Academic Heritage

LeCun was deeply influenced by Hinton during his postdoc at the University of Toronto, applying backpropagation to convolutional neural networks to develop LeNet, becoming one of the deep learning trio. The two later had public disagreements on AI risk issues.

Ilya Sutskever · Direct Cultivation

Hinton's PhD student and co-developer of AlexNet, who later co-founded OpenAI and served as Chief Scientist—one of the most important successors in bringing Hinton's deep learning ideas into the large language model era.

Alex Krizhevsky · Direct Cultivation

Hinton's PhD student and primary developer of AlexNet; his engineering practice of training deep convolutional networks on GPUs directly triggered the 2012 deep learning revolution.

Yoshua Bengio · Academic Heritage

Bengio was deeply influenced by Hinton's neural network research, establishing MILA as a deep learning research hub at the University of Montreal, becoming one of the deep learning trio and sharing the 2018 Turing Award with Hinton.

Co-thinkers

Yann LeCun · Same-generation Competition & Cooperation

The most complex relationship among the deep learning trio. The two jointly drove the deep learning revolution and shared the Turing Award, but have the most public disagreements on AI risk issues—LeCun believes Hinton's AI risk warnings are exaggerated, and the two have publicly debated multiple times on social media.

Yoshua Bengio · Shared Vision

The deep learning trio member whose position on AI safety is closest to Hinton's. Bengio has also publicly expressed concerns about AI existential risks; the two together have become the most important academic voices in the AI safety movement.

Peer Reviews

Geoff Hinton is the person who, more than anyone else, is responsible for the rise of deep learning. His persistence through the AI winters is one of the most remarkable stories in the history of science.
Yoshua Bengio · Turing Award ceremony remarks, ACM, 2019

Geoffrey Hinton's work on backpropagation is what made the modern AI revolution possible. Without it, none of what we see today would exist.
Demis Hassabis · Google DeepMind blog, 'Celebrating the Turing Award', 2019

Hinton's decision to leave Google and speak out about AI risks took a lot of courage. When the person who built the engine starts warning about the brakes, you should listen.
Stuart Russell · BBC interview, 'AI safety experts react to Hinton resignation', 2023-05

正在打开人物节点

Geoffrey Hinton

Core Knowledge Graph

Core Beliefs

Neural networks are the correct path to understanding intelligence

Machines should learn representations autonomously, not rely on hand-crafted features

Backpropagation is sufficient to explain the brain's learning mechanism

AI existential risk is real and urgent

Scientists bear moral responsibility for the technologies they create

Mental Models

AI Winter Long-Termism

Hierarchical Representation Learning

Dropout Regularization

Capsule Networks and Spatial Equivariance

Forward-Forward Algorithm (Biologically Plausible Learning)

Values & Paradoxes

Father of Deep Learning and Fearful of Deep Learning

Champion and Critic of Backpropagation

Dual Pursuit of Academic Purity and Industrial Impact

Evolution Phases

Cognitive Science Foundation Period (1972-1986)

AI Winter Persistence Period (1987-2005)

Deep Learning Revolution Period (2006-2017)

Risk Warning and Late Exploration Period (2018-present)

8 Key Events

Co-published backpropagation algorithm paper with Rumelhart and Williams, laying the mathematical foundation for deep learning

Joined University of Toronto, establishing a neural network research hub during the AI winter

Published deep belief network paper, reigniting the deep learning revolution

AlexNet won ImageNet competition by a huge margin, triggering the deep learning revolution

DNNresearch acquired by Google for $44M, Hinton joins Google Brain

Shared the Turing Award with LeCun and Bengio — the 'Deep Learning Trio' received computing's highest honor

Resigned from Google to publicly warn of AI existential risks

Shared the 2024 Nobel Prize in Physics with John Hopfield for foundational discoveries enabling machine learning with artificial neural networks

Books

Recommended by (1)

Cited in (4)

Origins, Contemporaries & Legacy

Influenced By

Influenced

Co-thinkers

Peer Reviews