Research
My primary research interests lie in the general areas of Machine Learning, Artificial Intelligence, Natural Language Processing, large-scale ML/AI systems, and their applications.
In particular, I'm interested in principles, methodologies, and systems of building AI agents that learn and reason with human-level (or super-human) efficiceny and generality. To this end, we're building general World Models (PAN, LAW, RAP) to enable next-generation machine reasoning beyond large language models
(LLM Reasoners).
We're also working on Panoramic Learning (HDSR)—building AI agents with ALL types of experience, ranging from data instances (NeurIPS), embodied experiences (NeurIPS), structured knowledge (ACL, NeurIPS),
constraints, to rewards (EMNLP), adversaries (NeurIPS), lifelong interplay, etc. Along this line, we've been studying a standardized ML formalism ("Standard Model" of ML) for systematic understanding, unifying, and generalizing a wide range of ML paradigms (e.g., supervised, unsupervised, active, reinforcement, adversarial, meta, lifelong learning).
We're part of the open-source consortium Maitrix.org, aiming at build AI-powered realities with techniques of large language/multi-modal models, agent Models, and world models.
|