🧠

Model

Specter2 Base Mlx

Name: Specter2 Base Mlx
Author: transilluminate

by transilluminate hf-model--transilluminate--specter2-base-mlx

Free2AITools Nexus Index

38.1 Top 100%

S: Semantic 50

A: Authority 0

P: Popularity 6

R: Recency 97

Q: Quality 50

Tech Context

2 Params

512 Ctx

Vital Performance

70 DL / 30D

0.0%

Source →

Audited 38.1 FNI Score

Tiny 2B Params

1k Context

70 Downloads

8G GPU ~3GB Est. VRAM

Dense BERT Architecture

Commercial APACHE License

Model Information Summary
Entity Passport
Registry ID	hf-model--transilluminate--specter2-base-mlx
License	Apache-2.0
Provider	huggingface

💾

Compute Threshold

~2.8GB VRAM

Interactive

Analyze Hardware

Hardware Compatibility Test

▼

* Static estimation for 4-Bit Quantization.

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{hf_model__transilluminate__specter2_base_mlx,
  author = {transilluminate},
  title = {Specter2 Base Mlx Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/transilluminate/specter2-base-mlx}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

transilluminate. (2026). Specter2 Base Mlx [Model]. Free2AITools. https://huggingface.co/transilluminate/specter2-base-mlx

🔬Technical Deep Dive

Full Specifications [+]

Quick Commands

🦙 Ollama Run

ollama run specter2-base-mlx

🤗 HF Download

huggingface-cli download transilluminate/specter2-base-mlx

⚖️ Free2AITools Nexus Index V2.0

Methodology Index Protocol

Semantic (S) 50

Authority (A) 0

Popularity (P) 6

Recency (R) 97

Quality (Q) 50

💬 Index Insight

FNI V2.0 for Specter2 Base Mlx: Semantic (S:50), Authority (A:0), Popularity (P:6), Recency (R:97), Quality (Q:50).

Free2AITools Nexus Index

Verification Authority

HuggingFace API GitHub Metadata Arxiv Citation DB System Audit

Unbiased Data Node Refresh: VFS Live

---

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Technical Deep Dive

specter2-base-mlx

SPECTER2 base (allenai/specter2_base) converted to MLX safetensors format for use with mlx-swift / MLXEmbedders on Apple Silicon.

License: Apache 2.0 (inherited from allenai/specter2_base)

What it is

SPECTER2 is a 12-layer BERT-base model trained by Allen AI on 6M scientific paper citation triplets. It produces 768-dimensional sentence embeddings optimised for scientific text similarity — significantly outperforming general-purpose embedders (all-MiniLM, Apple NL embedding) on academic content.

Input format: "title [SEP] abstract" — feed title and abstract concatenated with a [SEP] token.

Files

File	Description
`model.safetensors`	Float16 weights (~220 MB), keys remapped to MLX BERT convention
`config.json`	MLX-compatible BERT config (vocab 31090, 12L, 768d, 12h)
`tokenizer.json` / `vocab.txt` / etc.	bert-base-uncased tokenizer
`convert.py`	Reproduces this conversion from `allenai/specter2_base`
`test.py`	Validates the converted weights (4 checks, ~10 s on M-series)

Key remapping

PyTorch BERT → MLX BERT key translation (mirrors mlx-examples/bert/convert.py):

text

.layer.                         → .layers.
.self.query.                    → .query_proj.
.self.key.                      → .key_proj.
.self.value.                    → .value_proj.
.attention.output.dense.        → .attention.out_proj.
.attention.output.LayerNorm.    → .ln1.
.output.LayerNorm.              → .ln2.
.intermediate.dense.            → .linear1.
.output.dense.                  → .linear2.
.LayerNorm.                     → .norm.
pooler.dense.                   → pooler.
position_ids                    → dropped (buffer, not a parameter)

Attention mask note

When weights are stored as float16, the attention mask must be cast to float16 before the mx.log() additive-mask conversion or MLX's scaled_dot_product_attention will raise a dtype mismatch:

python

attention_mask = mx.log(attention_mask.astype(mx.float16))

Usage in Swift (mlx-swift MLXEmbedders)

swift

let container = try await loadModelContainer(
    configuration: ModelConfiguration(id: "transilluminate/specter2-base-mlx"),
    progressHandler: { _ in }
)

Input: "Paper Title [SEP] Paper abstract text…" — encode with the bundled bert-base-uncased tokenizer.
Output: CLS token hidden state, shape [1, 768] — take output[:, 0, :].

Reproducing the conversion

bash

uv run python convert.py          # float16 (default)
uv run python convert.py --dtype f32

Requirements: torch, transformers, safetensors, numpy

Validating

bash

uv run python test.py

Expected results:

text

[1] Shape          (1, 768) ✓
[2] Norm           L2 = 21.55 ✓
[3] Distinctness   cosine sim = 0.8726 (< 0.95) ✓
[4] Domain signal  related=0.9302  unrelated=0.8052  Δ=+0.1250 ✓
✓ 4/4 tests passed

⚠️ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

HuggingFace Hub

70Downloads

Hub Discussions

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-model--transilluminate--specter2-base-mlx
slug: transilluminate--specter2-base-mlx
source: huggingface
author: transilluminate
license: Apache-2.0
tags: mlx, safetensors, bert, feature-extraction, sentence-similarity, scientific, en, base_model:allenai/specter2_base, base_model:finetune:allenai/specter2_base, license:apache-2.0, region:us

⚙️ Technical Specs

architecture: bert
params billions: 2
context length: 512
pipeline tag: feature-extraction
vram gb: 2.8
vram is estimated: true
vram formula: VRAM ≈ (params * 0.75) + 0.8GB (KV) + 0.5GB (OS)

📊 Engagement & Metrics

downloads: 70
stars: 0
forks: 0

Data indexed from public sources. Updated daily.