Date

2 months ago

Organization

Publish URL

www.kaggle.com

Paper URL

2511.02495

License

Non-Commercial

Tags

Object Recognition

Computer Vision

DetectiumFire is a dataset released in 2025 by Tulane University in collaboration with Aalto University, designed for tasks such as flame detection, visual reasoning, and multimodal generation. The related research paper is titled "...".DetectiumFire: A Comprehensive Multi-modal Dataset Bridging Vision and Language for Fire UnderstandingThe "Flame Scene" track has been included in the NeurIPS 2025 Datasets and Benchmarks Track, aiming to provide a unified training and evaluation resource for computer vision and vision-language models.

This dataset contains over 145,000 high-quality real-world fire images and 25,000 fire-related videos. In addition to real data, it includes 8,000 synthetic fire images generated using a diffusion model, and 12,000 carefully selected preference pairs from the RLHF process to enhance model alignment. It covers both real and synthetic flame and non-flame images and videos, accompanied by flame intensity, environmental information, text descriptions, and human preference annotations. The dataset consists of four parts: real images, real videos, synthetic flame images generated by the diffusion model, and human preference data based on pairwise comparisons. The synthetic images provide YOLO-formatted detection annotations, while the preference data records the human judgments regarding generation quality.

Dataset composition:

Real images
- fire: Realistic flame images and YOLO format annotations
- non_fire: Difficult negative examples that do not contain flames but are easily confused (such as bright light, smoke, sunset).
Real video (real_video)
- fire: Real video footage containing visible flames
- non_fire: Scenes without fire, used for robustness testing.
Synthetic images
- stable_diff_v15/train: Image generation using SFT fine-tuning + YOLO annotation
- dpo_stable_diff_v15/train: DPO fine-tuning generated images + YOLO annotations
Preference data (preference_dataset)
- preference.json: Comparison and interpretation of human preferences for paired generated images, used for RLHF/DPO training.

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Discuss on Discord

Date

2 months ago

Organization

Publish URL

www.kaggle.com

Paper URL

2511.02495

License

Non-Commercial

Dataset composition:

Real images
- fire: Realistic flame images and YOLO format annotations
- non_fire: Difficult negative examples that do not contain flames but are easily confused (such as bright light, smoke, sunset).
Real video (real_video)
- fire: Real video footage containing visible flames
- non_fire: Scenes without fire, used for robustness testing.
Synthetic images
- stable_diff_v15/train: Image generation using SFT fine-tuning + YOLO annotation
- dpo_stable_diff_v15/train: DPO fine-tuning generated images + YOLO annotations
Preference data (preference_dataset)
- preference.json: Comparison and interpretation of human preferences for paired generated images, used for RLHF/DPO training.

Related Datasets

Vehicles OpenImages Vehicle Image Dataset

11 days ago

Mobile Actions Mobile Function Call Dataset

a month ago

VenusBench-GD Cross-Platform Interface Understanding Dataset

a month ago

X-ray Contraband Detection Dataset

a month ago

OpenGU Graph Forgetting Comprehensive Evaluation Dataset

2 months ago

VideoRewardBench Video Reward Model Evaluation Dataset

2 months ago

MUVR Multimodal Uncropped Video Retrieval Benchmark

2 months ago

RubricHub_v1 Multi-Domain Generative Task Dataset

5 days ago

PhysToolBench Physics Tool Task Dataset

2 months ago

1.56 GB56

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

DetectiumFire Multimodal Fire Understanding Dataset

Dataset composition:

Build AI with AI

HyperAI Newsletters

Command Palette

DetectiumFire Multimodal Fire Understanding Dataset

Dataset composition:

Related Datasets

Vehicles OpenImages Vehicle Image Dataset

Mobile Actions Mobile Function Call Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

X-ray Contraband Detection Dataset

OpenGU Graph Forgetting Comprehensive Evaluation Dataset

VideoRewardBench Video Reward Model Evaluation Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

RubricHub_v1 Multi-Domain Generative Task Dataset

PhysToolBench Physics Tool Task Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

DetectiumFire Multimodal Fire Understanding Dataset

Dataset composition:

Related Datasets

Vehicles OpenImages Vehicle Image Dataset

Mobile Actions Mobile Function Call Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

X-ray Contraband Detection Dataset

OpenGU Graph Forgetting Comprehensive Evaluation Dataset

VideoRewardBench Video Reward Model Evaluation Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

RubricHub_v1 Multi-Domain Generative Task Dataset

PhysToolBench Physics Tool Task Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Vehicles OpenImages Vehicle Image Dataset

Mobile Actions Mobile Function Call Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

X-ray Contraband Detection Dataset

OpenGU Graph Forgetting Comprehensive Evaluation Dataset

VideoRewardBench Video Reward Model Evaluation Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

RubricHub_v1 Multi-Domain Generative Task Dataset

PhysToolBench Physics Tool Task Dataset

Related Datasets

Vehicles OpenImages Vehicle Image Dataset

Mobile Actions Mobile Function Call Dataset

VenusBench-GD Cross-Platform Interface Understanding Dataset

X-ray Contraband Detection Dataset

OpenGU Graph Forgetting Comprehensive Evaluation Dataset

VideoRewardBench Video Reward Model Evaluation Dataset

MUVR Multimodal Uncropped Video Retrieval Benchmark

RubricHub_v1 Multi-Domain Generative Task Dataset

PhysToolBench Physics Tool Task Dataset