TOFU Benchmark
==============

Run the Task of Fictitious Unlearning (TOFU) benchmark to compare
unlearning strategies.

.. literalinclude:: ../../benchmarks/tofu/run.py
   :language: python
   :caption: benchmarks/tofu/run.py

This benchmark:

- Generates synthetic fictitious data
- Runs multiple strategy × selector combinations
- Measures accuracy and forgetting quality
- Saves results to JSON
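
The grid in the list above can be pictured with a minimal sketch. It is not the
code in ``benchmarks/tofu/run.py``: the strategy and selector names, the
``run_one`` placeholder, and the record keys are all assumptions made for
illustration.

.. code-block:: python

   import itertools
   import time

   # Hypothetical names; the real benchmark defines its own strategies and selectors.
   STRATEGIES = ["strategy_a", "strategy_b"]
   SELECTORS = ["full", "selector_x"]


   def run_one(strategy, selector):
       """Placeholder for a single unlearning run; returns a dummy final loss."""
       return 0.0


   def run_grid(strategies, selectors, run=run_one):
       """Run every strategy x selector pair and time each run."""
       records = []
       for strategy, selector in itertools.product(strategies, selectors):
           start = time.perf_counter()
           final_loss = run(strategy, selector)
           records.append(
               {
                   "strategy": strategy,
                   "selector": selector,
                   "time": time.perf_counter() - start,  # wall-clock seconds
                   "final_loss": final_loss,
               }
           )
       return records


   records = run_grid(STRATEGIES, SELECTORS)
   print(len(records))  # one record per combination
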

Running the Benchmark
---------------------

.. code-block:: bash

   python benchmarks/tofu/run.py

Results are saved to ``benchmarks/tofu/results/tofu_results.json``.
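
To inspect the saved file from Python, something like the following should
work. The JSON schema is not documented on this page, so the sketch only loads
the file and pretty-prints whatever it contains; ``load_results`` is a
hypothetical helper, not part of the benchmark.

.. code-block:: python

   import json
   from pathlib import Path

   RESULTS_PATH = Path("benchmarks/tofu/results/tofu_results.json")


   def load_results(path=RESULTS_PATH):
       """Return the parsed JSON, or None if the file does not exist yet."""
       path = Path(path)
       if not path.exists():
           return None
       with path.open(encoding="utf-8") as fh:
           return json.load(fh)


   results = load_results()
   if results is None:
       print(f"No results at {RESULTS_PATH}; run the benchmark first.")
   else:
       # No schema is assumed here: just show the raw structure.
       print(json.dumps(results, indent=2))
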

Interpreting Results
--------------------

The output table has one row per strategy × selector combination, with these columns:

- **Strategy**: The unlearning method used
- **Selector**: The coreset selection method (or "full" for no selection)
- **Time**: Wall-clock time in seconds
- **Final Loss**: The last recorded forget loss (lower = less forgetting, since
  a low value means the model still fits the forget data)
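
One way to read the **Time** column is to compare each selector against the
``"full"`` run of the same strategy. The sketch below assumes the rows are
available as dictionaries with ``strategy``, ``selector``, and ``time`` keys.
That layout, the helper name ``speedup_vs_full``, and the placeholder values
are assumptions for illustration, not benchmark output.

.. code-block:: python

   # Placeholder rows with made-up names and timings (not real results).
   rows = [
       {"strategy": "strategy_a", "selector": "full", "time": 10.0},
       {"strategy": "strategy_a", "selector": "selector_x", "time": 4.0},
       {"strategy": "strategy_b", "selector": "full", "time": 8.0},
       {"strategy": "strategy_b", "selector": "selector_x", "time": 2.0},
   ]


   def speedup_vs_full(rows):
       """Map (strategy, selector) to full-run time divided by the selector's time."""
       full_time = {
           r["strategy"]: r["time"] for r in rows if r["selector"] == "full"
       }
       speedups = {}
       for r in rows:
           base = full_time.get(r["strategy"])
           # Skip the baseline itself, strategies without a baseline, and bad timings.
           if base is None or r["selector"] == "full" or r["time"] <= 0:
               continue
           speedups[(r["strategy"], r["selector"])] = base / r["time"]
       return speedups


   for (strategy, selector), factor in speedup_vs_full(rows).items():
       print(f"{strategy} + {selector}: {factor:.2f}x faster than full")

A speed-up is only meaningful alongside the **Final Loss** column, since a
faster run that forgets less may not be the better trade-off.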