Skip to content

MuLabPKU/In-Parameter-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

In-Parameter Learning: Why Lifelong AI Systems Need More Than Longer Context

📄 Paper

ICL vs IPL comparison

Introduction

This is the official repository for our position paper:

In-Parameter Learning: Why Lifelong AI Systems Need More Than Longer Context

AI agents are increasingly expected to interact with changing users, tools, and environments over extended periods of time. However, current systems remain largely static: their parameters are fixed during deployment, while new experiences, domain knowledge, corrections, and behavioral adaptations are handled primarily through external memory management and in-context learning.

In this position paper, we argue that existing in-context mechanisms are insufficient for lifelong AI, and advocate In-Parameter Learning (IPL) as a necessary complement to In-Context Learning (ICL) — referring to mechanisms that allow a deployed model to continually and safely update its parameters during use.

Our central claim is that future lifelong AI systems should be built around a hybrid paradigm: in-context learning for immediate, provisional, and reversible information, and in-parameter learning for durable, cumulative, and generalizable growth.

Publication Trends

Publication trends 2018–2025

Publication trends from 2018 to 2025 (OpenAlex). Long-context modeling and context management have grown rapidly into major LLM research directions, while continual In-Parameter Learning remains an emerging area.

Key Arguments

  • Context windows have hard limits. Even the most conservative estimates of lifetime experience (language, perception, structured data) exceed current 1M-token frontiers by orders of magnitude.
  • Scaling context length faces fundamental obstacles in computation, data, and model architecture.
  • In-Parameter Learning raises the capability ceiling by consolidating newly acquired information into model weights — enabling cumulative growth, better generalization, and reduced inference overhead.
  • ICL and IPL are complementary, not competing: ICL handles transient, reversible context; IPL handles durable, generalizable knowledge.

Authors

Yiding Wang*, Haotong Yang*, Pingzhi Tang*, Shijia Kang*, Cai Zhou, Chenglin Zhu, Fan Jiang, Fanxu Meng, Juntong Wang, Libin Chen, Tianyi Guan, Xiyuan Wang, Xuejie Liu, Yanbo Wang, Yansheng Mao, Yewei Liu, Yi Hu, Yufei Xu, Yuxuan Wang, Zian Li, Ziyuan He, Xing Sun†, Muhan Zhang†‡

¹Peking University   ²Massachusetts Institute of Technology   ³Tencent Youtu Lab

* Equal contribution. † Senior authors. ‡ Correspondence: muhan@pku.edu.cn

About

Position Paper: In-Parameter Learning: Why Lifelong AI Systems Need More Than Longer Context

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors