πŸ“Š
Dataset

Huginn Dataset

by Tomg Group Umd hf-dataset--tomg-group-umd--huginn-dataset
Nexus Index
47.0 Top 0%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%

--- tags: - code - math - reasoning - llm language: - en source_datasets: - HuggingFaceTB/smollm-corpus - jon-tow/starcoderdata-python-edu - ubaada/booksum-complete-cleaned - euirim/goodwiki - togethercomputer/RedPajama-Data-1T - allenai/dolma - bigcode/the-stack-v2-train-smol-ids - bigcode/starcoderdata - m-a-p/Matrix - cerebras/SlimPajama-627B - open-phi/textbooks - open-phi/textbooks_grounded - open-phi/programming_books_llama - nampdn-ai/tiny-strange-textbooks - nampdn-ai/t...

Data Integrity 47 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--tomg-group-umd--huginn-dataset
Provider huggingface
πŸ“œ

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__tomg_group_umd__huginn_dataset,
  author = {Tomg Group Umd},
  title = {Huginn Dataset Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/tomg-group-umd/huginn-dataset}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
Tomg Group Umd. (2026). Huginn Dataset [Dataset]. Free2AITools. https://huggingface.co/datasets/tomg-group-umd/huginn-dataset

πŸ”¬Technical Deep Dive

Full Specifications [+]

βš–οΈ Nexus Index V2.0

47.0
ESTIMATED IMPACT TIER
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

πŸ’¬ Index Insight

FNI V2.0 for Huginn Dataset: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
⬇️
Downloads
498,734
❀️
Likes
6

πŸ‘οΈ Data Preview

πŸ“Š

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

πŸ”— Explore Full Dataset β†—

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

Top Tier

Social Proof

HuggingFace Hub
6Likes
498.7KDownloads
πŸ”„ Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

πŸ†” Identity & Source

id
hf-dataset--tomg-group-umd--huginn-dataset
source
huggingface
author
Tomg Group Umd
tags
task_categories:text-generationsource_datasets:huggingfacetb/smollm-corpussource_datasets:jon-tow/starcoderdata-python-edusource_datasets:ubaada/booksum-complete-cleanedsource_datasets:euirim/goodwikisource_datasets:allenai/dolmasource_datasets:bigcode/starcoderdatasource_datasets:m-a-p/matrixsource_datasets:cerebras/slimpajama-627bsource_datasets:open-phi/textbookssource_datasets:open-phi/textbooks_groundedsource_datasets:open-phi/programming_books_llamasource_datasets:nampdn-ai/tiny-strange-textbookssource_datasets:nampdn-ai/tiny-textbookssource_datasets:nampdn-ai/tiny-code-textbookssource_datasets:nampdn-ai/tiny-orca-textbookssource_datasets:vikp/textbook_quality_programmingsource_datasets:eleutherai/proof-pile-2source_datasets:open-web-math/open-web-mathsource_datasets:biglam/blbooks-parquetsource_datasets:storytracer/loc-pd-bookssource_datasets:gair/mathpilesource_datasets:tomg-group-umd/clrs-text-trainsource_datasets:math-ai/automathtextsource_datasets:bigcode/commitpackftsource_datasets:bigcode/stack-dedup-python-fnssource_datasets:mlabonne/chessllmsource_datasets:waterhorse/chess_datasource_datasets:eleutherai/lichess-puzzlessource_datasets:locutusque/hercules-v5.0source_datasets:nvidia/openmathinstruct-1source_datasets:meta-math/metamathqasource_datasets:nvidia/daring-anteatersource_datasets:nvidia/sft_datablend_v1source_datasets:baai/infinity-instructsource_datasets:nopm/opus_writingstructsource_datasets:xinlai/math-step-dpo-10ksource_datasets:hkust-nlp/gsm8k-fixsource_datasets:huggingfaceh4/no_robotssource_datasets:thudm/longwriter-6ksource_datasets:thudm/webglm-qasource_datasets:bigscience/p3source_datasets:gryphe/opus-writingpromptssource_datasets:internlm/lean-githubsource_datasets:pkuai4m/leanworkbooksource_datasets:ai4m/leandojo-informalizedsource_datasets:casey-martin/oa_cpp_annotate_gensource_datasets:l3lab/ntp-mathlib-instruct-stsource_datasets:ajibawa-2023/maths-collegesource_datasets:ajibawa-2023/maths-grade-schoolsource_datasets:xinyaohu/amps_mathematicasource_datasets:xinyaohu/amps_khansource_datasets:gair-prox/fineweb-prosource_datasets:gair-prox/c4-prosource_datasets:gair-prox/redpajama-prosource_datasets:gair-prox/open-web-math-prosource_datasets:emozilla/pg19source_datasets:mathgenie/mathcode-pilesource_datasets:kingnish/reasoning-base-20ksource_datasets:nvidia/openmathinstruct-2source_datasets:llm360/txt360source_datasets:neuralwork/arxiverlanguage:enlicense:othersize_categories:100mformat:parquetmodality:textlibrary:datasetslibrary:dasklibrary:mlcroissantlibrary:polarsarxiv:2502.05171region:uscodemathreasoningllm

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null

πŸ“Š Engagement & Metrics

likes
6
downloads
498,734

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)