Dataset

Deception Probes Activations

Name: Deception Probes Activations
Creator: xycoord
License: Other

Free2AITools Nexus Index (FNI): 60.2

S: Semantic 50 (query-time baseline, scored live at search)
A: Authority 61
P: Popularity 51
R: Recency 75
Q: Quality 50

Dataset Information Summary
Entity Passport
Registry ID	xycoord/deception-probes-activations
License	Other
Provider	huggingface

Cite this dataset

Academic & Research Attribution

BibTeX

@misc{hf_dataset_xycoord_deception_probes_activations,
  author = {xycoord},
  title = {Deception Probes Activations Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/xycoord/deception-probes-activations}},
  note = {Accessed via Free2AITools.}
}

APA Style

xycoord. (2026). Deception Probes Activations [Dataset]. Free2AITools. https://huggingface.co/datasets/xycoord/deception-probes-activations

💬 Index Insight

FNI V2.0 for Deception Probes Activations: Authority (A:61), Popularity (P:51), Recency (R:75), Quality (Q:50). Semantic (S) is a query-time baseline scored live at search.

Data Sources / Provenance

Sources: HuggingFace API, GitHub Metadata, Arxiv Citation DB

Open data. Updated: live data

⬇️

Downloads: 28,693

🎯 Task Categories

text-classification

Dataset Specification

Deception Probes Activations

Pre-extracted residual-stream activations for training and evaluating deception detection probes on LLMs. Each example contains per-token hidden states from a specific transformer layer, saved in bfloat16 safetensors format.
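
To make the storage format concrete, here is a minimal sketch of how bfloat16 tensors in a safetensors container can be read with only the Python standard library. It builds a tiny file in memory and reads it back. The tensor name `activations`, the 2 x 2 (tokens x hidden size) shape, and the helper names are assumptions for illustration; the dataset's actual tensor names, shapes, and file layout are not stated here. In practice a library loader such as `safetensors.torch.load_file` would normally be used instead.

```python
import json
import struct


def bf16_bytes_to_floats(raw: bytes) -> list[float]:
    """Decode little-endian bfloat16 values into Python floats."""
    count = len(raw) // 2
    halves = struct.unpack(f"<{count}H", raw)
    # bfloat16 is the upper 16 bits of an IEEE-754 float32, so shifting
    # left by 16 bits and reinterpreting recovers the value exactly.
    return [struct.unpack("<f", struct.pack("<I", h << 16))[0] for h in halves]


def read_safetensors(blob: bytes) -> dict[str, dict]:
    """Parse a safetensors byte string whose tensors are all BF16."""
    # Layout: 8-byte little-endian header length, JSON header, raw data.
    (header_len,) = struct.unpack("<Q", blob[:8])
    header = json.loads(blob[8 : 8 + header_len])
    data = blob[8 + header_len :]
    tensors = {}
    for name, info in header.items():
        if name == "__metadata__":  # optional free-form metadata entry
            continue
        if info["dtype"] != "BF16":
            raise ValueError(f"unsupported dtype in this sketch: {info['dtype']}")
        start, end = info["data_offsets"]  # relative to the data section
        tensors[name] = {
            "shape": info["shape"],
            "values": bf16_bytes_to_floats(data[start:end]),
        }
    return tensors


def make_example_blob() -> bytes:
    """Build a hypothetical 2-token x 2-dim activation file in memory."""
    # Bit patterns for [[1.0, -2.0], [0.5, 3.0]] in bfloat16.
    payload = struct.pack("<4H", 0x3F80, 0xC000, 0x3F00, 0x4040)
    header = json.dumps(
        {
            "activations": {  # hypothetical tensor name
                "dtype": "BF16",
                "shape": [2, 2],
                "data_offsets": [0, len(payload)],
            }
        }
    ).encode("utf-8")
    return struct.pack("<Q", len(header)) + header + payload


example = read_safetensors(make_example_blob())
print(example["activations"]["shape"], example["activations"]["values"])
```

The values come back as a flat, row-major list; reshaping to (tokens, hidden size) using the stored shape is left to the caller.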

License

This dataset contains activations derived from multiple sources with different licenses. See the LICENSE file for full details.

Component	Source	License
Apollo Probe Pairs (statements)	[Azaria & Mitchell (2023)](https://arxi


🛡️ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-dataset--xycoord--deception-probes-activations
slug: xycoord--deception-probes-activations
source: huggingface
author: xycoord
license: Other
tags: task_categories:text-classification, language:en, license:other, size_categories:1m<n<10m, format:json, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, arxiv:2304.13734, arxiv:2407.15285, region:us, deception, mechanistic-interpretability, activations, probing, safety, alignment

⚙️ Technical Specs

architecture: null
params billions: null
context length: null
pipeline tag

📊 Engagement & Metrics

downloads: 28,693
stars: null
forks: null

Data indexed from public sources. Updated daily.