Dataset

hellaswag

Name: hellaswag
Creator: Rowan

Nexus Index

33.6 Top 100%

S: Semantic 50

A: Authority 0

P: Popularity 64

R: Recency 23

Q: Quality 30

Dataset Information Summary
Entity Passport
Registry ID	hf-dataset--rowan--hellaswag
Provider	huggingface

Cite this dataset

Academic & Research Attribution

BibTeX

@misc{hf_dataset__rowan__hellaswag,
  author = {Rowan},
  title = {hellaswag Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/rowan/hellaswag}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

Rowan. (2026). hellaswag [Dataset]. Free2AITools. https://huggingface.co/datasets/rowan/hellaswag

🔬 Technical Deep Dive

⚖️ Nexus Index V2.0

💬 Index Insight

FNI V2.0 for hellaswag: Semantic (S:50), Authority (A:0), Popularity (P:64), Recency (R:23), Quality (Q:30).

Downloads: 303,103

Dataset Specification

Dataset Card for "hellaswag"

Dataset Description

Homepage: https://rowanzellers.com/hellaswag/
Repository: https://github.com/rowanz/hellaswag/
Paper: HellaSwag: Can a Machine Really Finish Your Sentence?
Point of Contact: More Information Needed
Size of downloaded dataset files: 71.49 MB
Size of the generated dataset: 65.32 MB
Total amount of disk used: 136.81 MB

Dataset Summary

HellaSwag is a dataset for commonsense natural language inference (NLI). It was introduced in the paper "HellaSwag: Can a Machine Really Finish Your Sentence?", published at ACL 2019.

Supported Tasks and Leaderboards

More Information Needed

Languages

More Information Needed

Dataset Structure

Data Instances

default

An example of 'train' looks as follows.

This example was too long and was cropped:

{
    "activity_label": "Removing ice from car",
    "ctx": "Then, the man writes over the snow covering the window of a car, and a woman wearing winter clothes smiles. then",
    "ctx_a": "Then, the man writes over the snow covering the window of a car, and a woman wearing winter clothes smiles.",
    "ctx_b": "then",
    "endings": "[\", the man adds wax to the windshield and cuts it.\", \", a person board a ski lift, while two men supporting the head of the per...",
    "ind": 4,
    "label": "3",
    "source_id": "activitynet~v_-1IBHYS3L-Y",
    "split": "train",
    "split_type": "indomain"
}
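
The instance above suggests how the fields relate: `ctx` appears to be `ctx_a` and `ctx_b` joined by a single space, and `label` is a string holding the zero-based index of the correct entry in `endings`. The Python sketch below works under those assumptions. The `endings` values are hypothetical placeholders, because the real list is cropped in the example, and `full_context` and `gold_ending` are illustrative helper names, not part of any library.

```python
from typing import Any, Dict, Optional

# Values copied from the cropped 'train' example, except `endings`:
# the source truncates that list, so hypothetical placeholders stand in.
example: Dict[str, Any] = {
    "ctx_a": "Then, the man writes over the snow covering the window of a car, "
             "and a woman wearing winter clothes smiles.",
    "ctx_b": "then",
    "ctx": "Then, the man writes over the snow covering the window of a car, "
           "and a woman wearing winter clothes smiles. then",
    "endings": ["<ending 0>", "<ending 1>", "<ending 2>", "<ending 3>"],
    "label": "3",
}


def full_context(record: Dict[str, Any]) -> str:
    """Rebuild the context, assuming ctx is ctx_a and ctx_b joined by a space."""
    return f'{record["ctx_a"]} {record["ctx_b"]}'.strip()


def gold_ending(record: Dict[str, Any]) -> Optional[str]:
    """Return the ending that `label` points at, or None if the label is empty."""
    label = record["label"]
    if label == "":
        # Assumption: a record without a gold answer carries an empty label.
        return None
    # `label` is stored as a string, so convert it before indexing.
    return record["endings"][int(label)]


print(full_context(example) == example["ctx"])
print(gold_ending(example))
```

Because `label` is a string rather than an integer, forgetting the `int()` conversion raises a `TypeError` when indexing the list.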

Data Fields

The data fields are the same among all splits.

default

ind: an int32 feature.
activity_label: a string feature.
ctx_a: a string feature.
ctx_b: a string feature.
ctx: a string feature.
endings: a list of string features.
source_id: a string feature.
split: a string feature.
split_type: a string feature.
label: a string feature.
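
As a rough illustration of the field list above, the following Python sketch checks a record against the declared types. The mapping from card types to Python types (int32 to `int`, string to `str`, list of string to `list` of `str`) is an assumption, and `EXPECTED_TYPES` and `schema_errors` are hypothetical names introduced here.

```python
from typing import Any, Dict, List

# Expected Python type per field, following the field list in the card.
EXPECTED_TYPES: Dict[str, type] = {
    "ind": int,
    "activity_label": str,
    "ctx_a": str,
    "ctx_b": str,
    "ctx": str,
    "endings": list,
    "source_id": str,
    "split": str,
    "split_type": str,
    "label": str,
}


def schema_errors(record: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the record conforms."""
    errors: List[str] = []
    for name, expected in EXPECTED_TYPES.items():
        if name not in record:
            errors.append(f"missing field: {name}")
        elif not isinstance(record[name], expected):
            actual = type(record[name]).__name__
            errors.append(f"{name}: expected {expected.__name__}, got {actual}")
    # `endings` must be a list whose items are all strings.
    endings = record.get("endings")
    if isinstance(endings, list) and not all(isinstance(e, str) for e in endings):
        errors.append("endings: every item must be a str")
    return errors
```

A check like this can catch a common slip when handling the data by hand, such as passing `label` as an integer or `endings` as a single serialized string.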

Data Splits

name	train	validation	test
default	39905	10042	10003
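
The split sizes can be totalled and turned into proportions with a few lines of Python. This is only an arithmetic sketch over the numbers in the table; `SPLIT_SIZES` is an illustrative name.

```python
# Row counts copied from the Data Splits table.
SPLIT_SIZES = {"train": 39905, "validation": 10042, "test": 10003}

total = sum(SPLIT_SIZES.values())

# Share of the whole dataset held by each split.
fractions = {name: count / total for name, count in SPLIT_SIZES.items()}

for name, count in SPLIT_SIZES.items():
    print(f"{name:<10} {count:>6} {fractions[name]:6.1%}")
print(f"{'total':<10} {total:>6}")
```

The three splits sum to 59,950 rows, with the train split holding roughly two thirds of them.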

Dataset Creation

Curation Rationale

More Information Needed

Source Data

Initial Data Collection and Normalization

More Information Needed

Who are the source language producers?

More Information Needed

Annotations

Considerations for Using the Data

More Information Needed

Discussion of Biases

More Information Needed

Other Known Limitations

More Information Needed

Additional Information

Dataset Curators

More Information Needed

Licensing Information

MIT https://github.com/rowanz/hellaswag/blob/master/LICENSE

Citation Information

@inproceedings{zellers2019hellaswag,
    title={HellaSwag: Can a Machine Really Finish Your Sentence?},
    author={Zellers, Rowan and Holtzman, Ari and Bisk, Yonatan and Farhadi, Ali and Choi, Yejin},
    booktitle={Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics},
    year={2019}
}

Contributions

Thanks to @albertvillanova, @mariamabarham, @thomwolf, @patrickvonplaten, @lewtun for adding this dataset.

📊 Structured Schema (Zero-Fabrication)

Feature Key	Data Type
`ind`	`int32`
`activity_label`	`string`
`ctx_a`	`string`
`ctx_b`	`string`
`ctx`	`string`
`endings`	`list of string`
`source_id`	`string`
`split`	`string`
`split_type`	`string`
`label`	`string`

Estimated Rows: 59,950

🛡️ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-dataset--rowan--hellaswag
slug: rowan--hellaswag
source: huggingface
author: Rowan
license: MIT
tags: language:en, size_categories:10k<n<100k, format:parquet, modality:text, library:datasets, library:pandas, library:mlcroissant, library:polars, arxiv:1905.07830, region:us

📊 Engagement & Metrics

downloads: 303,103
stars: 169
forks: 0

Data indexed from public sources. Updated daily.
