Effective Continual Learning

BDH shows no catastrophic forgetting during sequential task training at the 100M-parameter scale, maintaining a low, stable combined loss across task switches. In contrast, GPT (a Transformer) degrades after each switch: its loss rises after moving to Task 2 and spikes when returning to Task 1, indicating disrupted retention and catastrophic forgetting.
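
To make the evaluation protocol concrete, the sketch below shows how a "combined loss across task switches" measurement works in general: train sequentially on Task 1, then Task 2, then Task 1 again, and after each phase evaluate a single loss over data from all tasks, so that forgetting of an earlier task surfaces as a rise in that number. This is a minimal illustration, not the paper's setup: the toy model, the synthetic tasks, and the helper names (`make_model`, `make_task_batches`, `combined_loss`) are all hypothetical stand-ins, and the actual BDH and GPT training details are not reproduced here.

```python
# Hedged sketch of a sequential-task forgetting measurement.
# All components here are illustrative, not the BDH/GPT experiment itself.
import torch
import torch.nn as nn

def make_model(dim: int = 32) -> nn.Module:
    # Toy regressor standing in for a far larger model (BDH or GPT).
    return nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, dim))

def make_task_batches(seed: int, dim: int = 32, n: int = 200):
    # Synthetic task: learn a fixed random linear map; each seed is one task.
    g = torch.Generator().manual_seed(seed)
    W = torch.randn(dim, dim, generator=g)
    X = torch.randn(n, dim, generator=g)
    return [(x.unsqueeze(0), (x @ W.T).unsqueeze(0)) for x in X]

def combined_loss(model, tasks, loss_fn) -> float:
    # "Combined loss": average over data from *all* tasks, so forgetting
    # an earlier task shows up as a rise even while the current task improves.
    model.eval()
    with torch.no_grad():
        total = sum(loss_fn(model(x), y).item() for t in tasks for x, y in t)
    return total / sum(len(t) for t in tasks)

model = make_model()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()
task1, task2 = make_task_batches(seed=1), make_task_batches(seed=2)

# Sequential schedule mirroring the description: Task 1 -> Task 2 -> Task 1.
for phase, data in [("Task 1", task1), ("Task 2", task2), ("Task 1 again", task1)]:
    model.train()
    for x, y in data:
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()
    score = combined_loss(model, [task1, task2], loss_fn)
    print(f"after {phase}: combined loss = {score:.4f}")
```

Under this protocol, a flat combined-loss curve across switches is the BDH-like outcome, while a curve that climbs after the switch to Task 2 and spikes on the return to Task 1 is the forgetting pattern attributed to GPT above.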