xquad

FNI 22.4
by google Dataset

annotations_creators: expert-generated
language_creators: expert-generated
language: ar, de, el, en, es, hi, ro, ru, th, tr, vi, zh
license: cc-by-sa-4.0
multilinguality: multilingual
size_categories: unknown
source_datasets: extended|squad
task_categories: question-an...

Best Scenarios

Data Science

Technical Constraints

Generic Use
Size: -
Rows: -
Format: Parquet
Likes: 38

Capabilities

  • Data Science


Technical Profile

Hardware & Scale

Size: -
Total Rows: -
Files: 14

Training & Env

Format: Parquet
Cleaning: Raw

Cloud & Rights

Source: huggingface
License: cc-by-sa-4.0


Schema & Configs

Fields

feature: string
label: int64
split: string

Dataset Card

Dataset Card for "xquad"

Table of Contents

- Dataset Summary
- Supported Tasks and Leaderboards
- Languages
- Data Instances
- Data Fields
- Data Splits
- Curation Rationale
- Source Data
- Annotations
- Personal and Sensitive Information
- Social Impact of Dataset
- Discussion of Biases
- Other Known Limitations
- Dataset Curators
- Licensing Information
- Citation Information
- Contributions

Dataset Description

Dataset Summary

XQuAD (Cross-lingual Question Answering Dataset) is a benchmark dataset for evaluating cross-lingual question answering performance. The dataset consists of a subset of 240 paragraphs and 1190 question-answer pairs from the development set of SQuAD v1.1 (Rajpurkar et al., 2016) together with their professional translations into ten languages: Spanish, German, Greek, Russian, Turkish, Arabic, Vietnamese, Thai, Chinese, and Hindi. Consequently, the dataset is entirely parallel across 11 languages.
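As a minimal sketch of how one language subset might be loaded, the snippet below assumes the Hugging Face `datasets` library is installed, that the Hub id is `google/xquad`, and that each config is named `xquad.<lang>` with a single `validation` split, as the `xquad.ar` entry in this card suggests. The helper names `config_name` and `load_xquad` are hypothetical and not part of any library.

```python
# Language codes as listed in the dataset metadata on this page.
LANGUAGES = ["ar", "de", "el", "en", "es", "hi", "ro", "ru", "th", "tr", "vi", "zh"]


def config_name(lang: str) -> str:
    """Build a config name such as 'xquad.ar' (assumed naming pattern)."""
    if lang not in LANGUAGES:
        raise ValueError(f"unsupported language code: {lang!r}")
    return f"xquad.{lang}"


def load_xquad(lang: str, repo_id: str = "google/xquad"):
    """Load one language subset; needs network access and the `datasets` package."""
    # Imported lazily so config_name() works without the library installed.
    from datasets import load_dataset

    # The repo id, config pattern, and split name are assumptions stated above.
    return load_dataset(repo_id, config_name(lang), split="validation")
```

Under those assumptions, `load_xquad("ar")` would return the Arabic subset. Because the data is parallel, the same call with another language code should yield the translated counterparts of the same questions.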

Supported Tasks and Leaderboards

More Information Needed

Languages

More Information Needed

Dataset Structure

Data Instances

xquad.ar

  • Size of downloaded dataset files: 13.30 MB
  • Size of the generated dataset: 1.72 MB
  • Total amount of disk used: 15.03 MB
An example of 'validation' looks as follows. This example was too long and was cropped:

{ "answers": { "answer_start": [527], "text": ["136"] }
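In SQuAD-style records, each `answer_start` value is a character offset into the passage, so slicing the passage at that offset should reproduce the answer text. The sketch below checks that invariant. It assumes the passage is stored in a `context` field, which the cropped example above does not show; the sample record is invented for illustration, and `answers_are_aligned` is a hypothetical helper name.

```python
def answers_are_aligned(record: dict) -> bool:
    """Return True if every answer text sits at its stated character offset."""
    context = record["context"]  # assumed field name, as in SQuAD-style data
    answers = record["answers"]
    return all(
        context[start:start + len(text)] == text
        for start, text in zip(answers["answer_start"], answers["text"])
    )


# Invented record for illustration only; not taken from the dataset.
sample = {
    "context": "The team scored 136 points in the final game.",
    "answers": {"answer_start": [16], "text": ["136"]},
}

print(answers_are_aligned(sample))
```

A check like this can be useful after any preprocessing that alters the passage text, since trimming or normalizing whitespace would shift the offsets.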
