⚠️

This is a Dataset, not a Model

The following metrics do not apply: FNI Score, Deployment Options, Model Architecture

📊

gsm8k

Name: gsm8k
Creator: openai
License: ["mit"]

FNI 24.2

by openai Dataset

"--- annotations_creators: - crowdsourced language_creators: - crowdsourced language: - en license: - mit multilinguality: - monolingual size_categories: - 1K"

Download Dataset

Best Scenarios

✨ Data Science

Technical Constraints

Generic Use

- Size

- Rows

Parquet Format

1.1K Likes

Graph Overview

263 Entities

273 Connections

Explore Full Graph →

📈 Interest Trend

* Real-time activity index across HuggingFace, GitHub and Research citations.

Capabilities

✅ Data Science

🔬Deep Dive

Expand Details [+]

🛠️ Technical Profile

⚡ Hardware & Scale

Size

Total Rows

Files

🧠 Training & Env

Format

Parquet

Cleaning

Raw

🌐 Cloud & Rights

Source

huggingface

License

["mit"]

👁️ Data Preview

feature	label	split
example_text_1	0	train
example_text_2	1	train
example_text_3	0	test
example_text_4	1	validation
example_text_5	0	train

Showing 5 sample rows. Real-time preview requires login.

🧬 Schema & Configs

Fields

feature: string

label: int64

split: string

Dataset Card

Dataset Card for GSM8K

Dataset Description

- Dataset Summary - Supported Tasks - Languages

Dataset Structure

- Data Instances - Data Fields - Data Splits

Dataset Creation

- Curation Rationale - Source Data - Annotations - Personal and Sensitive Information

Considerations for Using the Data

- Social Impact of Dataset - Discussion of Biases - Other Known Limitations

Additional Information

- Dataset Curators - Licensing Information - Citation Information

Dataset Description

Homepage: https://openai.com/blog/grade-school-math/
Repository: https://github.com/openai/grade-school-math
Paper: https://arxiv.org/abs/2110.14168
Leaderboard: [Needs More Information]
Point of Contact: [Needs More Information]

Dataset Summary

GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning.

These problems take between 2 and 8 steps to solve.
Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the final answer.
A bright middle school student should be able to solve every problem: from the paper, "Problems require no concepts beyond the level of early Algebra, and the vast majority of problems can be solved without explicitly defining a variable."
Solutions are provided in natural language, as opposed to pure math expressions. From the paper: "We believe this is the most generally useful data format, and we expect it to shed light on the properties of large language models’ internal monologues""

Supported Tasks and Leaderboards

This dataset is generally used to test logic and math in language modelling. It has been used for many benchmarks, including the LLM Leaderboard.

Languages

The text in the dataset is in English. The associated BCP-47 code is en.

Dataset Structure

Data Instances

For the main configuration, each instance contains a string for the grade-school level math question and a string for the corresponding answer with multiple steps of reasoning and calculator annotations (explained here).

```python { 'question': '

Dataset Card for GSM8K

Dataset Description

- Dataset Summary - Supported Tasks - Languages

Dataset Structure

- Data Instances - Data Fields - Data Splits

Dataset Creation

- Curation Rationale - Source Data - Annotations - Personal and Sensitive Information

Considerations for Using the Data

- Social Impact of Dataset - Discussion of Biases - Other Known Limitations

Additional Information

- Dataset Curators - Licensing Information - Citation Information

Dataset Description

Homepage: https://openai.com/blog/grade-school-math/
Repository: https://github.com/openai/grade-school-math
Paper: https://arxiv.org/abs/2110.14168
Leaderboard: [Needs More Information]
Point of Contact: [Needs More Information]

Dataset Summary

These problems take between 2 and 8 steps to solve.
Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − ×÷) to reach the final answer.
A bright middle school student should be able to solve every problem: from the paper, "Problems require no concepts beyond the level of early Algebra, and the vast majority of problems can be solved without explicitly defining a variable."
Solutions are provided in natural language, as opposed to pure math expressions. From the paper: "We believe this is the most generally useful data format, and we expect it to shed light on the properties of large language models’ internal monologues""

Supported Tasks and Leaderboards

This dataset is generally used to test logic and math in language modelling. It has been used for many benchmarks, including the LLM Leaderboard.

Languages

The text in the dataset is in English. The associated BCP-47 code is en.

Dataset Structure

Data Instances

python

{
    'question': 'Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May?',
    'answer': 'Natalia sold 48/2 = <<48/2=24>>24 clips in May.\nNatalia sold 48+24 = <<48+24=72>>72 clips altogether in April and May.\n#### 72',
}

For the socratic configuration, each instance contains a string for a grade-school level math question, a string for the corresponding answer with multiple steps of reasoning, calculator annotations (explained here), and Socratic sub-questions.

python

{
    'question': 'Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May?',
    'answer': 'How many clips did Natalia sell in May? <strong> Natalia sold 48/2 = <<48/2=24>>24 clips in May.\nHow many clips did Natalia sell altogether in April and May? </strong> Natalia sold 48+24 = <<48+24=72>>72 clips altogether in April and May.\n#### 72',
}

Data Fields

The data fields are the same among main and socratic configurations and their individual splits.

question: The question string to a grade school math problem.

answer: The full solution string to the question. It contains multiple steps of reasoning with calculator annotations and the final numeric solution.

Data Splits

| name |train|validation| |--------|----:|---------:| |main | 7473| 1319| |socratic| 7473| 1319|

Dataset Creation

Curation Rationale

[Needs More Information]

Source Data

#### Initial Data Collection and Normalization

From the paper, appendix A:

We initially collected a starting set of a thousand problems and natural language solutions by hiring freelance contractors on Upwork (upwork.com). We then worked with Surge AI (surgehq.ai), an NLP data labeling platform, to scale up our data collection. After collecting the full dataset, we asked workers to re-solve all problems, with no workers re-solving problems they originally wrote. We checked whether their final answers agreed with the original solutions, and any problems that produced disagreements were either repaired or discarded. We then performed another round of agreement checks on a smaller subset of problems, finding that 1.7% of problems still produce disagreements among contractors. We estimate this to be the fraction of problems that contain breaking errors or ambiguities. It is possible that a larger percentage of problems contain subtle errors.

#### Who are the source language producers?

[Needs More Information]

Annotations

#### Annotation process

[Needs More Information]

#### Who are the annotators?

Surge AI (surgehq.ai)

Personal and Sensitive Information

[Needs More Information]

Considerations for Using the Data

Social Impact of Dataset

[Needs More Information]

Discussion of Biases

[Needs More Information]

Other Known Limitations

[Needs More Information]

Additional Information

Dataset Curators

[Needs More Information]

Licensing Information

The GSM8K dataset is licensed under the MIT License.

Citation Information

bibtex

@article{cobbe2021gsm8k,
  title={Training Verifiers to Solve Math Word Problems},
  author={Cobbe, Karl and Kosaraju, Vineet and Bavarian, Mohammad and Chen, Mark and Jun, Heewoo and Kaiser, Lukasz and Plappert, Matthias and Tworek, Jerry and Hilton, Jacob and Nakano, Reiichiro and Hesse, Christopher and Schulman, John},
  journal={arXiv preprint arXiv:2110.14168},
  year={2021}
}

Contributions

Thanks to @jon-tow for adding this dataset.

6,765 characters total

Welcome to Free2AI Tools!

Smart Search

FNI Score

You're All Set!

Best Scenarios

Technical Constraints

🕸️ Neural Graph Explorer

📈 Interest Trend

Capabilities

🔬Deep Dive

🛠️ Technical Profile

⚡ Hardware & Scale

🧠 Training & Env

🌐 Cloud & Rights

👁️ Data Preview

🧬 Schema & Configs

Fields

Dataset Card

Dataset Card for GSM8K

Table of Contents

Dataset Description

Dataset Summary

Supported Tasks and Leaderboards

Languages

Dataset Structure

Data Instances

Dataset Card for GSM8K

Table of Contents

Dataset Description

Dataset Summary

Supported Tasks and Leaderboards

Languages

Dataset Structure

Data Instances

Data Fields

Data Splits

Dataset Creation

Curation Rationale

Source Data

Annotations

Personal and Sensitive Information

Considerations for Using the Data

Social Impact of Dataset

Discussion of Biases

Other Known Limitations

Additional Information

Dataset Curators

Licensing Information

Citation Information

Contributions