You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Configuration. We have released a new Arxiv version to state the experiment settings. For all concepts, the coefficients of Eq.3 are: $\lambda_1=0.1$ and $\lambda_2=0.1$. The regularization coefficients $\lambda$ and epochs are set as follows:
Nudity and unsafe concepts(I2P concepts), $\lambda=1e-1$, with nudity for 3 epochs and unsafe concepts for 2 epochs.
Artistic styles, $\lambda=1e-3$, 1 epoch.
Difficult objects for UCE(e.g., church and garbage truck), $\lambda=1e-3$, 1 epoch.
Easy objects for UCE(e.g., English Springer, golf ball and parachute), $\lambda=1e-1$, 1 epoch.
For other objects where erasing accuracies reach 0 using UCE, RECE's further erasure is not applied.
Red-teaming tools. Due to the open-source timeline, we used our reproduced Ring-A-Bell attack method for all baselines, available in attack_methods/. And we used the P4D attack method reproduced by UnlearnDiff.
Citation
If you find our work helpful, please leave us a star and cite our paper.
@article{gong2024reliable,
title={Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models},
author={Gong, Chao and Chen, Kai and Wei, Zhipeng and Chen, Jingjing and Jiang, Yu-Gang},
journal={arXiv preprint arXiv:2407.12383},
year={2024}
}