You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SmellNet: A Large-scale Dataset for Real-world Smell Recognition
SmellNet is the first large-scale open-source dataset that captures how real-world substances smell, digitized using portable gas and chemical sensors. It includes 50 hours of data from 50 substances (nuts, spices, herbs, fruits, and vegetables), totaling over 180,000 time steps of multichannel sensor data, accompanied by chemical composition (GC-MS) and textual descriptions.
SmellNet enables research into:
🧠 Real-time substance classification with supervised learning
🔁 Cross-modal learning with sensor + GC-MS alignment
📈 Time-series modeling using LSTMs and Transformers
📊 Signal preprocessing like first-order temporal difference (FOTD)
Each ingredient has multiple time-series recordings in CSV format, plus paired metadata and chemical information to support multimodal learning tasks.
SmellNet is the first large-scale database that digitizes a diverse range of smells in the natural world. SmellNet enables various AI models to make substance prediction like supervised learning, contrastive learning and more to explore!
🧪 Applications
SmellNet is designed to support machine learning for:
Allergen detection (e.g., peanut traces)
Food and beverage quality control
Digital olfaction and human-AI interaction
Health diagnostics (e.g., stress, hormones, early disease)