You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Clone this repository and set up the environment of AndroidWorld; you may still need to install extra packages needed listed in requirements.txt although you have already installed AndroidWorld;
git clone https://github.com/OS-Copilot/OS-Sentinel
cd OS-Sentinel
# install AndroidWorld# requirements.txt contains packages not included by AndroidWorld
pip install -r requirements.txt
pipeline/eval_llm.py is for text-only LLM evaluation;
pipeline/tag.py is for risk tag evaluation of VLM;
pipeline/cons.py is for recorded trajectories via mobile agent instead of our hand-made ones;
Run pipeline/multi_method_consistency.py after result.json is ready.
📋 Citation
@article{sun2025ossentinel,
title={OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows},
author={Qiushi Sun and Mukai Li and Zhoumianze Liu and Zhihui Xie and Fangzhi Xu and Zhangyue Yin and Kanzhi Cheng and Zehao Li and Zichen Ding and Qi Liu and Zhiyong Wu and Zhuosheng Zhang and Ben Kao and Lingpeng Kong},
journal={arXiv preprint arXiv:2510.24411},
year={2025}
}