🧠
Model

Specter2 Base Mlx

by transilluminate hf-model--transilluminate--specter2-base-mlx
Nexus Index
38.1 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 6
R: Recency 97
Q: Quality 50
Tech Context
2 Params
4.096K Ctx
Vital Performance
70 DL / 30D
0.0%
Audited 38.1 FNI Score
Tiny 2B Params
4k Context
70 Downloads
8G GPU ~3GB Est. VRAM
Commercial APACHE License
Model Information Summary
Entity Passport
Registry ID hf-model--transilluminate--specter2-base-mlx
License Apache-2.0
Provider huggingface
πŸ’Ύ

Compute Threshold

~2.8GB VRAM

Interactive
Analyze Hardware
β–Ό

* Static estimation for 4-Bit Quantization.

πŸ“œ

Cite this model

Academic & Research Attribution

BibTeX
@misc{hf_model__transilluminate__specter2_base_mlx,
  author = {transilluminate},
  title = {Specter2 Base Mlx Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/transilluminate/specter2-base-mlx}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
transilluminate. (2026). Specter2 Base Mlx [Model]. Free2AITools. https://huggingface.co/transilluminate/specter2-base-mlx

πŸ”¬Technical Deep Dive

Full Specifications [+]

Quick Commands

πŸ¦™ Ollama Run
ollama run specter2-base-mlx
πŸ€— HF Download
huggingface-cli download transilluminate/specter2-base-mlx

βš–οΈ Nexus Index V2.0

38.1
TOP 100% SYSTEM IMPACT
Semantic (S) 50
Authority (A) 0
Popularity (P) 6
Recency (R) 97
Quality (Q) 50

πŸ’¬ Index Insight

FNI V2.0 for Specter2 Base Mlx: Semantic (S:50), Authority (A:0), Popularity (P:6), Recency (R:97), Quality (Q:50).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
---

πŸš€ What's Next?

Technical Deep Dive

specter2-base-mlx

SPECTER2 base (allenai/specter2_base) converted to MLX safetensors format for use with mlx-swift / MLXEmbedders on Apple Silicon.

License: Apache 2.0 (inherited from allenai/specter2_base)


What it is

SPECTER2 is a 12-layer BERT-base model trained by Allen AI on 6M scientific paper citation triplets. It produces 768-dimensional sentence embeddings optimised for scientific text similarity β€” significantly outperforming general-purpose embedders (all-MiniLM, Apple NL embedding) on academic content.

Input format: "title [SEP] abstract" β€” feed title and abstract concatenated with a [SEP] token.

Files

File Description
model.safetensors Float16 weights (~220 MB), keys remapped to MLX BERT convention
config.json MLX-compatible BERT config (vocab 31090, 12L, 768d, 12h)
tokenizer.json / vocab.txt / etc. bert-base-uncased tokenizer
convert.py Reproduces this conversion from allenai/specter2_base
test.py Validates the converted weights (4 checks, ~10 s on M-series)

Key remapping

PyTorch BERT β†’ MLX BERT key translation (mirrors mlx-examples/bert/convert.py):

text
.layer.                         β†’ .layers.
.self.query.                    β†’ .query_proj.
.self.key.                      β†’ .key_proj.
.self.value.                    β†’ .value_proj.
.attention.output.dense.        β†’ .attention.out_proj.
.attention.output.LayerNorm.    β†’ .ln1.
.output.LayerNorm.              β†’ .ln2.
.intermediate.dense.            β†’ .linear1.
.output.dense.                  β†’ .linear2.
.LayerNorm.                     β†’ .norm.
pooler.dense.                   β†’ pooler.
position_ids                    β†’ dropped (buffer, not a parameter)

Attention mask note

When weights are stored as float16, the attention mask must be cast to float16 before the mx.log() additive-mask conversion or MLX's scaled_dot_product_attention will raise a dtype mismatch:

python
attention_mask = mx.log(attention_mask.astype(mx.float16))

Usage in Swift (mlx-swift MLXEmbedders)

swift
let container = try await loadModelContainer(
    configuration: ModelConfiguration(id: "transilluminate/specter2-base-mlx"),
    progressHandler: { _ in }
)

Input: "Paper Title [SEP] Paper abstract text…" β€” encode with the bundled bert-base-uncased tokenizer.
Output: CLS token hidden state, shape [1, 768] β€” take output[:, 0, :].

Reproducing the conversion

bash
uv run python convert.py          # float16 (default)
uv run python convert.py --dtype f32

Requirements: torch, transformers, safetensors, numpy

Validating

bash
uv run python test.py

Expected results:

text
[1] Shape          (1, 768) βœ“
[2] Norm           L2 = 21.55 βœ“
[3] Distinctness   cosine sim = 0.8726 (< 0.95) βœ“
[4] Domain signal  related=0.9302  unrelated=0.8052  Ξ”=+0.1250 βœ“
βœ“ 4/4 tests passed

⚠️ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source β†’

πŸ“ Limitations & Considerations

  • β€’ Benchmark scores may vary based on evaluation methodology and hardware configuration.
  • β€’ VRAM requirements are estimates; actual usage depends on quantization and batch size.
  • β€’ FNI scores are relative rankings and may change as new models are added.
  • ⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

HuggingFace Hub
70Downloads
πŸ”„ Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

πŸ†” Identity & Source

id
hf-model--transilluminate--specter2-base-mlx
slug
transilluminate--specter2-base-mlx
source
huggingface
author
transilluminate
license
Apache-2.0
tags
mlx, safetensors, bert, feature-extraction, sentence-similarity, scientific, en, base_model:allenai/specter2_base, base_model:finetune:allenai/specter2_base, license:apache-2.0, region:us

βš™οΈ Technical Specs

architecture
null
params billions
2
context length
4,096
pipeline tag
feature-extraction
vram gb
2.8
vram is estimated
true
vram formula
VRAM β‰ˆ (params * 0.75) + 0.8GB (KV) + 0.5GB (OS)

πŸ“Š Engagement & Metrics

downloads
70
stars
0
forks
0

Data indexed from public sources. Updated daily.