You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We assume that individual confidence scores have already been obtained from dataset specific models (see paper and the associated model repos cited for details). We provide all raw score files used in the data directory, which can be obtained by running:
./download_data.sh
This also includes pre-computed set scores (but not for target k when using the NN model for controlling P(FP ≤ k) ≥ 1 - delta, see below). To generate (or re-generate) the set scores, run the compute_set_scores.py script in the scripts folder. As an example, to generate the NN-based scores for the ChEMBL dataset (using the Message Passing Network base individual confidence scores), run:
For control in probability, see the delta option. Also note that some scores (i.e., NN) are specific to a target k, so the compute_set_scores.py script should be re-run with the desired k specified via the --k argument.
Results are saved as a dict mapping from key (e.g., tuple of (args.set_scores_file, k, args.delta) for conformal methods) to Result object (see src/utils.py).
About
Conformal Prediction Sets with Limited False Positives