@misc{hf_dataset__adrianmele__computer_use_large,
author = {adrianmele},
title = {Computer Use Large Dataset},
year = {2026},
howpublished = {\url{https://huggingface.co/datasets/adrianmele/computer-use-large}},
note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
adrianmele. (2026). Computer Use Large [Dataset]. Free2AITools. https://huggingface.co/datasets/adrianmele/computer-use-large
A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-screen-recording content (intros, outros, talking heads, transitions) and audio has been stripped.
Dataset Summary
Category
Videos
Hours
AutoCAD
10,059
2,149
Blender
11,493
3,624
Excel
8,111
2,002
Photoshop
10,704
2,060
Salesforce
7,807
2,336
VS Code
304
127
Total
48,478
~12,300
Data Fields
Each folder contains a metadata.jsonl file with the following fields per video:
Field
Type
Description
file_name
string
Filename of the video (e.g. abc123.mp4)
category
string
Software category
trimmed_duration
float
Duration of the video in seconds
num_segments
int
Number of contiguous screen recording segments
Data Organization
Videos are stored under data/{category}/ with a metadata.jsonl per folder. Due to HuggingFace's 10,000 file per directory limit, some categories are split across two folders (e.g. blender/ and blender_2/).
from datasets import load_dataset
# Load a specific category
ds = load_dataset("markov-ai/computer-use-large", "blender")
# Load all categories
ds = load_dataset("markov-ai/computer-use-large")
Intended Use
This dataset is designed for training and evaluating computer use agents â models that interact with desktop software through GUI actions (clicking, typing, scrolling). The screen recordings provide demonstrations of real software workflows across diverse applications.