Representation Learning Is the Core Problem of AI
The capability of AI systems depends largely on how they represent the world. Good representations capture the intrinsic causal structure of data, not just surface statistical patterns. The essence of deep learning is learning hierarchical abstract representations.
Source: Representation Learning: A Review and New Perspectives, Bengio et al., IEEE TPAMI, 2013
AI Needs to Integrate System 2 Thinking
Current deep learning primarily simulates System 1 thinking (fast, intuitive, pattern-matching) but lacks System 2 (slow, logical, planning). True AGI requires integrating both thinking modes to achieve causal reasoning and counterfactual thinking.
Source: From System 1 Deep Learning to System 2 Deep Learning, Bengio, NeurIPS keynote, 2019
AI Safety Is the Most Urgent Scientific Problem of Our Time
As AI systems' capabilities rapidly improve, ensuring their alignment with human values becomes critical. Bengio believes AI safety is not a science fiction problem but an engineering and scientific challenge requiring serious attention, global cooperation, and government regulation.
Source: AI Safety Needs Social Scientists, Bengio et al., Distill, 2019 / Managing AI Risks in an Era of Rapid Progress, Bengio et al., 2023
Open Science Is the Foundation of AI's Healthy Development
AI research results should be published openly, allowing global researchers to advance together. Academic independence and openness are important safeguards against AI being monopolized by a few companies. MILA as an academic institution model is Bengio's institutionalized practice of this belief.
Source: Yoshua Bengio interview, Le Monde, 2023
Causal Reasoning Is the Key to Transcending Statistical Learning
Current deep learning is essentially powerful statistical pattern matching but cannot perform true causal reasoning. Understanding 'why' rather than just 'what' is the necessary path for AI toward genuine intelligence.
Source: A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms, Bengio et al., ICLR 2020
Representation Hierarchy
Decompose complex data into hierarchical abstract representations, with each layer capturing higher-level semantic features.
In deep neural networks, lower layers learn edges and textures, middle layers learn shapes and parts, top layers learn semantic concepts — this hierarchical representation is the core reason deep learning outperforms shallow methods.
Model Architecture DesignFeature EngineeringTransfer LearningRepresentation Learning
System 1/System 2 Integration Framework
In AI system design, distinguish fast intuition modules (System 1) and slow reasoning modules (System 2), and design coordination mechanisms between them.
AlphaGo combined deep neural networks (System 1: intuitive position evaluation) and Monte Carlo Tree Search (System 2: systematic planning), a classic example of successful integration of both thinking modes.
AI Architecture DesignReasoning SystemsCognitive ModelingAGI Research
Causal Disentanglement Representation
Train AI models to learn the causal generative structure of data rather than just correlations, improving generalization and interpretability.
In medical AI, learning causal relationships (e.g., drug→outcome) rather than correlations (e.g., hospital→outcome) avoids systematic errors caused by distribution shift.
Causal ReasoningModel GeneralizationExplainable AIDomain Transfer
Democratic AI Governance Framework
AI development requires multi-stakeholder democratic governance mechanisms rather than being dominated by a few companies or nations, to ensure AI benefits all of humanity.
Bengio actively participated in the 2023 UK AI Safety Summit, pushing to establish an international AI safety regulatory framework and advocating for a global AI scientific assessment mechanism similar to the IPCC.
AI PolicyTechnology GovernanceInternational CooperationAI Safety
Adversarial Generative Thinking
By having two systems compete against each other (generator and discriminator), drive both to co-evolve and reach quality levels unachievable through individual training.
GANs achieve realistic image generation through the adversarial game between generator and discriminator, opening a new era of deep generative models.
Generative ModelsAdversarial TrainingImage GenerationData Augmentation
Deep Learning Theory Foundational Phase
1991-2006
Neural network training difficulty problems and representation learning theory
During the years when deep learning was ignored by the mainstream, Bengio persisted in researching neural network training difficulties (vanishing gradients, local optima), publishing foundational work on the long-term dependency problem in RNNs. In 2003 published a neural language model paper introducing word embeddings, laying foundations for later Word2Vec and Transformers.
Deep Learning Rise Phase
2006-2015
Deep belief networks, GANs, and attention mechanisms
In 2006 co-published the deep belief networks paper with Hinton, marking the beginning of deep learning's revival. In 2014 co-invented GANs with Ian Goodfellow, opening a new era of generative models. Participated in early attention mechanism research, paving the way for Transformer development. MILA grew into a global top deep learning research center during this period.
Post-Turing Award AI Safety Advocacy Phase
2018-至今
AI safety, causal reasoning, and System 2 thinking
After receiving the Turing Award in 2018, Bengio devoted more energy to AI safety and AI governance advocacy. He signed multiple AI safety open letters, participated in government consultations, and pushed to establish international AI regulatory frameworks. Research shifted toward causal representation learning and System 2 thinking integration, exploring AI architectures beyond the current deep learning paradigm.
Existential Risk Research Phase
2023-至今
AI existential risks and global governance
After ChatGPT's explosion in 2023, Bengio's AI safety stance became more pronounced, publicly expressing concern about AI's potential existential risks. He participated in drafting multiple scientific statements on AI risks, actively participated in AI policy discussions at international forums like G7 and UN, becoming a core scientific voice in the global AI safety movement.