🧠 Model

Florence-2-large

by microsoft

--- license: mit license_link: https://huggingface.co/microsoft/Florence-2-large/resolve/main/LICENSE pipeline_tag: image-text-to-text tags: - vision --- **This

🕐 Updated 12/19/2025

🧠 Architecture Explorer

Neural network architecture

1 Input Layer

2 Hidden Layers

3 Attention

4 Output Layer

Learn about Transformers →

About

**This is a continued pretrained version of Florence-2-large model with 4k context length, only 0.1B samples are used for continue pretraining, thus it might not be trained well. In addition, OCR task has been updated with line separator ('\n'). COCO OD AP 39.8** This Hub repository contains a HuggingFace's implementation of Florence-2 model...

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
• Data source: [{"source_platform":"huggingface","source_url":"https://huggingface.co/microsoft/Florence-2-large","fetched_at":"2025-12-19T07:41:01.176Z","adapter_version":"3.2.0"}]

📚 Related Resources

📄 Related Papers

No related papers linked yet. Check the model's official documentation for research papers.

📊 Training Datasets

Training data information not available. Refer to the original model card for details.

🔗 Related Models V6.2

phi-2📦 Sibling phi-4📦 Sibling VibeVoice-1.5B📦 Sibling OmniParser📦 Sibling Phi-3-mini-128k-instruct📦 Sibling

Model Information Summary
Model Name	Florence-2-large
Author	microsoft
Type	image-text-to-text
Downloads	857,145
Likes	1,721
Source	Hugging Face
Last Updated	December 19, 2025

Graph Overview

200 Models

460 Connections

Explore Full Graph →

🚀 What's Next?

📊