Contributing to BayesFlow#
BayesFlow is a highly open source project. We welcome contributions from the community.
How to contribute#
First, take some time to read this whole guide. Then, follow these steps to submit a contribution to BayesFlow:
1. Open an issue#
Before you start any work, please open an issue on GitHub if one does not already exist. Describe the motivation for the changes and add possible interfaces or use cases. We will determine the feature scope and discuss implementation details with you.
Here is an example of a good issue:
#123: Add support for multi-level priors#
Motivation:#
There is currently a significant scientific push to use multi-level prior distributions for hierarchical models, for instance in [1] and [2]. This feature would allow users to research these models more easily.
Possible Interface:#
The multi-level graph structure could be given implicitly via the argument names of each sampling function. For instance:
import bayesflow as bf
import keras
@bf.distribution
def prior1():
return dict(a=keras.random.normal(), b=keras.random.normal())
@bf.distribution
def prior2(a):
return dict(c=keras.random.normal(a))
@bf.distribution
def prior3(a, b, c):
return dict(d=keras.random.normal(a + b + c))
References:#
[1]: Some paper [2]: Some other paper
2. Set up your development environment#
Once we agree on an approach in the issue you opened, we can move ahead with the implementation.
First, create a development environment with conda, or any other environment manager of your choice.
conda create -n bf python=3.11
conda activate bf
Note: always use a clean environment dedicated for development of BayesFlow to avoid dependency issues.
Most contributors will have to create and clone a fork of BayesFlow using the GitHub interface or the CLI:
gh repo fork bayesflow-org/bayesflow --clone
Then, check out the development branch and install dependencies:
cd bayesflow
git checkout dev
conda install pip
pip install -e .[dev,docs,test]
pre-commit install
Finally, install at least one backend of your choice. At the moment of writing this, to install all three backends on a machine supporting CUDA 12.6, we would use:
pip install -U jax[cuda12]
pip install -U tensorflow[and-cuda]
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu126
You can set an environment variable to choose the default backend. We recommend defaulting to jax:
conda env config vars set KERAS_BACKEND=jax
Note that you will have to re-activate the environment for the changes to take effect.
3. Implement your changes#
In general, we recommend a test-driven development approach:
Write a test for the functionality you want to add.
Write the code to make the test pass.
You can run tests for your installed environment using pytest
:
pytest
Make sure to occasionally also run multi-backend tests for your OS using tox:
tox --parallel auto
See tox.ini
for details on the environment configurations.
Multi-OS tests will automatically be run once you create a pull request.
Note that to be backend-agnostic, your code must not:
Use code from a specific machine learning backend
Use code from the
keras.backend
moduleRely on the specific tensor object type or semantics
Examples of bad code:
# bad: do not use specific backends
import tensorflow as tf
x = tf.zeros(3)
# bad: do not use keras.backend
shape = keras.backend.shape(x) # will error under torch backend
# bad: do not use tensor methods directly
z = x.numpy() # will error under torch backend if device is cuda
Use instead:
# good: use keras instead of specific backends
import keras
x = keras.ops.zeros(3)
# good: use keras.ops, keras.random, etc.
shape = keras.ops.shape(x)
# good: use keras methods instead of direct tensor methods
z = keras.ops.convert_to_numpy(x)
4. Document your changes#
The documentation uses sphinx and relies on numpy style docstrings in classes and functions.
If you haven’t done so earlier, run the following command to install all necessary packages for setting up documentation generation:
pip install .[docs]
The overall structure of the documentation is manually designed, but the API documentation is auto-generated.
New top-level modules (i.e., bayesflow.mynewmodule
) have to be manually added to the list in docsrc/source/api/bayesflow.rst
to be included.
You can re-build the for your local state with:
cd docsrc
make clean && make local
# in case of issues, try `make clean-all`
Note that files ignored by git (i.e., listed in .gitignore
) are not included in the documentation.
We also provide a multi-version documentation. To generate it, run
cd docsrc
make clean && make docs
This will create and cache virtual environments for the build at docsrc/.docs_venvs
.
To remove them, run make clean-all
in the docsrc
directory.
The entry point of the rendered documentation will be at docs/index.html
.
To view the docs in the browser (which ensures correct redirects), run:
cd docsrc
make view-docs
See docsrc/README.md
for more details.
Note that undocumented changes will likely be rejected.
5. Create a pull request#
Once your changes are ready, create a pull request to the dev
branch using the GitHub interface.
Make sure to reference the issue you opened in step 1. If no issue exists, either open one first or follow the
issue guidelines for the pull request description.
Here is an example of a good pull request:
#124: Add support for multi-level priors#
Resolves #123.
Multi-level priors are implemented via a graph structure which is internally created from the argument names of the sampling functions.
Tutorial Notebooks#
New tutorial notebooks are always welcome! You can add your tutorial notebook file to examples/
and add a reference
to the list of notebooks in docsrc/source/examples.rst
.
Re-build the documentation (see above) and your notebook will be included.