nielsr HF Staff commited on
Commit
7959831
·
verified ·
1 Parent(s): d9c2f44

Add pipeline tag and improve model card

Browse files

Hi! I'm Niels, part of the community science team at Hugging Face.

This PR improves the model card for FakeReasoning. It adds:
- The `pipeline_tag: image-text-to-text` to the metadata to ensure the model is correctly categorized in the Hugging Face Hub.
- Links to the paper, project page, and the official GitHub repository.
- A sample usage section based on the instructions found in the repository's README.
- The BibTeX citation for researchers to cite the work.

Please review and merge if this looks good!

Files changed (1) hide show
  1. README.md +39 -7
README.md CHANGED
@@ -1,17 +1,49 @@
1
  ---
2
  license: apache-2.0
 
3
  ---
4
- # FakeReasoning Model Card
5
 
6
- ## Model details
7
 
8
- **Model type:** FakeReasoning is a forgery detection and reasoning framework with three key components: 1) a dual-branch visual encoder that integrates CLIP and DINO to capture both high-level semantics and low-level artifacts; 2) a Forgery-Aware Feature Fusion Module that leverages DINO's attention maps and cross-attention mechanisms to guide MLLMs toward forgery-related clues; 3) a Classification Probability Mapper that couples language modeling and forgery detection, enhancing overall performance.
9
 
10
- **Project page:** https://pris-cv.github.io/FakeReasoning/
 
 
11
 
12
- **Paper:** https://arxiv.org/abs/2503.21210
13
 
14
- ## License
 
 
 
15
 
16
- FakeReasoning is licensed under the Apache 2.0 License.
17
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ pipeline_tag: image-text-to-text
4
  ---
 
5
 
6
+ # FakeReasoning
7
 
8
+ FakeReasoning is a forgery detection and reasoning framework designed to accurately detect AI-generated images and provide reliable reasoning over forgery attributes. It formulates detection and explanation as a unified Forgery Detection and Reasoning task (FDR-Task).
9
 
10
+ - **Project Page:** [https://pris-cv.github.io/FakeReasoning/](https://pris-cv.github.io/FakeReasoning/)
11
+ - **Paper:** [Toward Generalizable Forgery Detection and Reasoning](https://huggingface.co/papers/2503.21210)
12
+ - **Code:** [https://github.com/PRIS-CV/FakeReasoning](https://github.com/PRIS-CV/FakeReasoning)
13
 
14
+ ## Model Details
15
 
16
+ FakeReasoning consists of three key components:
17
+ 1. **Dual-branch visual encoder:** Integrates CLIP and DINO to capture both high-level semantics and low-level artifacts.
18
+ 2. **Forgery-Aware Feature Fusion Module:** Leverages DINO's attention maps and cross-attention mechanisms to guide the model toward forgery-related clues.
19
+ 3. **Classification Probability Mapper:** Couples language modeling and forgery detection, enhancing overall performance.
20
 
21
+ The model was trained on the **MMFR-Dataset**, a large-scale dataset containing 120K images across 10 generative models with 378K reasoning annotations.
22
 
23
+ ## Sample Usage
24
+
25
+ To use the model, please follow the installation instructions in the [official repository](https://github.com/PRIS-CV/FakeReasoning). You can then run inference using the following commands:
26
+
27
+ ```bash
28
+ cd LLaVA/forgery_eval
29
+ export DINO_PATH='path_to_dinov2-main'
30
+ export DINO_WEIGHT='path_to_dinov2_vitl14_pretrain.pth'
31
+
32
+ python inference.py \
33
+ --model-path AnnaGao/FakeReasoning \
34
+ --img_path path_to_your_image.png
35
+ ```
36
+
37
+ Note: Inference and evaluation require at least 30 GB of GPU memory on a single GPU.
38
+
39
+ ## Citation
40
+
41
+ ```bibtex
42
+ @article{gao2025fakereasoning,
43
+ title={FakeReasoning: Towards Generalizable Forgery Detection and Reasoning},
44
+ author={Gao, Yueying and Chang, Dongliang and Yu, Bingyao and Qin, Haotian and Chen, Lei and Liang, Kongming and Ma, Zhanyu},
45
+ journal={arXiv preprint arXiv:2503.21210},
46
+ year={2025},
47
+ url={https://arxiv.org/abs/2503.21210}
48
+ }
49
+ ```