1Zhejiang University, 2National University of Singapore, 3Hefei University of Technology, 4Nanyang Technological University, 5Singapore Management University
*Equal Contribution, †Corresponding Author
This repo contains the PyTorch implementation of Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration, which has been accepted at NeurIPS 2024 (Spotlight).
In a previous version of this code, we mistakenly uploaded a version intended for an ablation experiment. It did include the core implementation of UNIKE, but, because of the ablation setting, some of that implementation was never invoked from the main function in easyeditor/models/unike/unike_main.py. Additionally, our code is based on T-Patcher and contained some non-standard naming: we directly modified some of T-Patcher's internal implementation logic without renaming the corresponding functions and methods, which still carried T-Patcher's names. These two issues led to the misunderstanding that we merely invoked the original T-Patcher implementation for editing. We sincerely apologize for this oversight; we have promptly cleaned up the code and list the locations of the core implementations below:
- Intrinsic Knowledge Editing: we follow the implementation of T-Patcher.
- External Knowledge Resorting: please refer to AdapterLayer.
- Knowledge Collaboration: please refer to easyeditor/models/unike/src/models/patch.py#389 and easyeditor/models/unike/src/models/patch.py#136.
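To make these pointers concrete, here is a minimal, hypothetical PyTorch sketch of how the two routes can coexist in a single FFN layer: T-Patcher-style patch neurons appended to a frozen FFN (intrinsic editing) plus a bottleneck adapter branch (external resorting), combined additively. The class name `PatchedFFN`, the attribute names, and the additive combination rule are all illustrative assumptions, not the repo's actual code; the real implementation lives in the files referenced above.

```python
# A minimal, self-contained sketch (not the repo's actual code) of how
# intrinsic patch neurons and an external-knowledge adapter can share one
# FFN layer. All names here are illustrative assumptions; see
# easyeditor/models/unike/src/models/patch.py for the real implementation.
import torch
import torch.nn as nn


class PatchedFFN(nn.Module):
    """Frozen FFN + T-Patcher-style patch neurons (intrinsic knowledge editing)
    + a bottleneck adapter branch (external knowledge resorting), combined
    additively as a stand-in for knowledge collaboration."""

    def __init__(self, ffn: nn.Module, d_model: int, n_patches: int, d_adapter: int):
        super().__init__()
        self.ffn = ffn
        for p in self.ffn.parameters():     # keep the original weights frozen
            p.requires_grad_(False)
        # Intrinsic editing: extra key/value neurons appended to the FFN.
        self.patch_key = nn.Parameter(0.02 * torch.randn(n_patches, d_model))
        self.patch_val = nn.Parameter(torch.zeros(n_patches, d_model))  # zero-init: no effect until trained
        # External resorting: a small bottleneck adapter over the hidden states.
        self.adapter = nn.Sequential(
            nn.Linear(d_model, d_adapter),
            nn.ReLU(),
            nn.Linear(d_adapter, d_model),
        )

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        out = self.ffn(h)                          # frozen base computation
        act = torch.relu(h @ self.patch_key.t())   # patch-neuron activations, (..., n_patches)
        out = out + act @ self.patch_val           # intrinsic-edit contribution
        out = out + self.adapter(h)                # external-knowledge contribution
        return out


if __name__ == "__main__":
    base = nn.Sequential(nn.Linear(768, 3072), nn.GELU(), nn.Linear(3072, 768))
    layer = PatchedFFN(base, d_model=768, n_patches=4, d_adapter=64)
    print(layer(torch.randn(2, 16, 768)).shape)    # torch.Size([2, 16, 768])
```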
First, set up the Python environment:

```bash
pip install -r requirements.txt
```

Then download the necessary data and model checkpoints.
Download MiniGPT4 and Blip2-OPT into the hugging_cache dir, and download the MMEdit data into the data dir, with images saved in the images dir. Please refer to EasyEdit for more details. 【To be continued...】
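For orientation, the layout after these downloads might look roughly like the sketch below; the per-checkpoint subfolder names are assumptions, not a verified listing:

```text
<repo root>/
├── hugging_cache/        # MiniGPT4 and Blip2-OPT checkpoints
├── data/                 # MMEdit data
│   └── images/           # images used by MMEdit
└── hparams/
    └── minigpt4.yaml     # editing hyperparameters (see the run command below)
```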
To run the multimodal editing:

```bash
python run_edit.py --hparams_dir hparams/minigpt4.yaml --task vqa
```
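If you want to script several runs, a small wrapper like the one below can help. The flags mirror the command above; the "caption" task name is an unverified assumption about what run_edit.py accepts, so check its argument parser first.

```python
# Hypothetical sweep over editing tasks via run_edit.py. The flags mirror the
# command above; the "caption" task value is an unverified assumption.
import subprocess

for task in ["vqa", "caption"]:
    subprocess.run(
        ["python", "run_edit.py", "--hparams_dir", "hparams/minigpt4.yaml", "--task", task],
        check=True,
    )
```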
Our project is developed based on the following repositories:

- EasyEdit: An Easy-to-use Knowledge Editing Framework for LLMs.
- Transformer-Patcher: One mistake worth one neuron.
If you find this work useful, please consider citing our paper as follows:
```bibtex
@article{pan2024towards,
  title={Towards unified multimodal editing with enhanced knowledge collaboration},
  author={Pan, Kaihang and Fan, Zhaoyu and Li, Juncheng and Yu, Qifan and Fei, Hao and Tang, Siliang and Hong, Richang and Zhang, Hanwang and Sun, Qianru},
  journal={arXiv preprint arXiv:2409.19872},
  year={2024}
}
```