📊

Dataset

Computer Use Large

Name: Computer Use Large
Creator: adrianmele
License: CC-BY-4.0

by adrianmele hf-dataset--adrianmele--computer-use-large

Free2AITools Nexus Index

58.9 Top 100%

S: Semantic 50

A: Authority 62

P: Popularity 56

R: Recency 83

Q: Quality 50

Tech Context

Vital Performance

0 DL / 30D

0.0%

Source →

Data Integrity 58.9 FNI Score

- Size

- Rows

Parquet Format

- Tokens

Dataset Information Summary
Entity Passport
Registry ID	hf-dataset--adrianmele--computer-use-large
License	CC-BY-4.0
Provider	huggingface

📜

Cite this dataset

Academic & Research Attribution

BibTeX

@misc{hf_dataset__adrianmele__computer_use_large,
  author = {adrianmele},
  title = {Computer Use Large Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/adrianmele/computer-use-large}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

adrianmele. (2026). Computer Use Large [Dataset]. Free2AITools. https://huggingface.co/datasets/adrianmele/computer-use-large

🔬Technical Deep Dive

Full Specifications [+]

⚖️ Free2AITools Nexus Index V2.0

Methodology Index Protocol

Semantic (S) 50

Authority (A) 62

Popularity (P) 56

Recency (R) 83

Quality (Q) 50

💬 Index Insight

FNI V2.0 for Computer Use Large: Semantic (S:50), Authority (A:62), Popularity (P:56), Recency (R:83), Quality (Q:50).

Free2AITools Nexus Index

Verification Authority

HuggingFace API GitHub Metadata Arxiv Citation DB System Audit

Unbiased Data Node Refresh: VFS Live

⬇️

Downloads

91,182

👁️ Data Preview

📊

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🔗 Explore Full Dataset ↗

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

Computer Use Large

A large-scale dataset of 48,478 screen recording videos (~12,300 hours) of professional software being used, sourced from the internet. All videos have been trimmed to remove non-screen-recording content (intros, outros, talking heads, transitions) and audio has been stripped.

Dataset Summary

Category	Videos	Hours
AutoCAD	10,059	2,149
Blender	11,493	3,624
Excel	8,111	2,002
Photoshop	10,704	2,060
Salesforce	7,807	2,336
VS Code	304	127
Total	48,478	~12,300

Data Fields

Each folder contains a metadata.jsonl file with the following fields per video:

Field	Type	Description
`file_name`	string	Filename of the video (e.g. `abc123.mp4`)
`category`	string	Software category
`trimmed_duration`	float	Duration of the video in seconds
`num_segments`	int	Number of contiguous screen recording segments

Data Organization

Videos are stored under data/{category}/ with a metadata.jsonl per folder. Due to HuggingFace's 10,000 file per directory limit, some categories are split across two folders (e.g. blender/ and blender_2/).

text

data/
  autocad/        (9,999 videos + metadata.jsonl)
  autocad_2/      (60 videos + metadata.jsonl)
  blender/        (9,999 videos + metadata.jsonl)
  blender_2/      (1,494 videos + metadata.jsonl)
  excel/          (8,111 videos + metadata.jsonl)
  photoshop/      (9,999 videos + metadata.jsonl)
  photoshop_2/    (705 videos + metadata.jsonl)
  salesforce/     (7,807 videos + metadata.jsonl)
  vscode/         (304 videos + metadata.jsonl)

Usage

python

from datasets import load_dataset

# Load a specific category
ds = load_dataset("markov-ai/computer-use-large", "blender")

# Load all categories
ds = load_dataset("markov-ai/computer-use-large")

Intended Use

This dataset is designed for training and evaluating computer use agents — models that interact with desktop software through GUI actions (clicking, typing, scrolling). The screen recordings provide demonstrations of real software workflows across diverse applications.

License

CC-BY-4.0

Social Proof

HuggingFace Hub

91.2KDownloads

Hub Discussions

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-dataset--adrianmele--computer-use-large
slug: adrianmele--computer-use-large
source: huggingface
author: adrianmele
license: CC-BY-4.0
tags: task_categories:video-classification, task_categories:robotics, language:en, license:cc-by-4.0, size_categories:10k<n<100k, region:us, screen-recording, computer-use, software-tutorials, gui, desktop

⚙️ Technical Specs

architecture: null
params billions: null
context length: null
pipeline tag

📊 Engagement & Metrics

downloads: 91,182
stars: 0
forks: null

Data indexed from public sources. Updated daily.