Marten Lienen, Abdullah Saydemir, Stephan Günnemann
https://openreview.net/forum?id=U8GUmxnzXn
https://arxiv.org/abs/2506.05065
This repository contains the code that produced the results in the paper. Feel free to build upon this or take bits and pieces for your own project with proper attribution, see Citation below.
If you want to run our code, start by setting up the python environment.
We use pixi to easily set up reproducible environments based on conda packages.
Install it with curl -fsSL https://pixi.sh/install.sh | bash and then run
# Clone the repository
git clone https://github.com/martenlienen/unhippo.git
# Change into the repository
cd unhippo
# Install and activate the environment
pixi shellWe use hydra for configuration, so you can overwrite all settings from the command line, e.g. the dataset with data=fsd as above.
Check the config directory to explore the available configuration options.
To train on FSD or SC10, just pass data=fsd or data=sc10 and the script will automatically download the dataset into the data directory and preprocess it for you.
Hydra also let's you submit jobs to a slurm cluster easily. So to recreate, for example, our results on FSD, just run
./train.py -m hydra/launcher=slurm hydra.launcher.qos=<your-qos> hydra.launcher.partition=<your-gpu-partition> experiment=fsdWe decided to leave some of our work-in-progress notebooks in notebooks.
So if you want to see how this method came about, feel free to snoop around in there.
Please be aware, that some of the notebooks likely won't run anymore and we will also not fix them.
They are meant to be an artifact of the research process.
If you build upon this work, please cite our paper as follows.
@inproceedings{lienen2025unhippo,
title = {{{UnHiPPO}}: {{Uncertainty}}-aware {{Initialization}} for {{State Space Models}}},
author = {Lienen, Marten and Saydemir, Abdullah and G{\"u}nnemann, Stephan},
booktitle = {International {{Conference}} on {{Machine Learning}}},
year = {2025},
}