🧠

Model

Ntv3 650m Pre 8kb

Name: Ntv3 650m Pre 8kb
Author: InstaDeepAI

by InstaDeepAI hf-model--instadeepai--ntv3_650m_pre_8kb

Free2AITools Nexus Index

39.4 Top 100%

S: Semantic 50

A: Authority 0

P: Popularity 14

R: Recency 88

Q: Quality 65

Tech Context

0.65B Params

8.192K Ctx

Vital Performance

231 DL / 30D

0.0%

Source →

Audited 39.4 FNI Score

Tiny 0.65B Params

8k Context

231 Downloads

8G GPU ~2GB Est. VRAM

Dense NTV3PRETRAINED Architecture

Restricted OTHER License

Model Information Summary
Entity Passport
Registry ID	hf-model--instadeepai--ntv3_650m_pre_8kb
License	Other
Provider	huggingface

💾

Compute Threshold

~1.8GB VRAM

Interactive

Analyze Hardware

Hardware Compatibility Test

▼

* Static estimation for 4-Bit Quantization.

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{hf_model__instadeepai__ntv3_650m_pre_8kb,
  author = {InstaDeepAI},
  title = {Ntv3 650m Pre 8kb Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/InstaDeepAI/NTv3_650M_pre_8kb}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

InstaDeepAI. (2026). Ntv3 650m Pre 8kb [Model]. Free2AITools. https://huggingface.co/InstaDeepAI/NTv3_650M_pre_8kb

🔬Technical Deep Dive

Full Specifications [+]

Quick Commands

🦙 Ollama Run

ollama run ntv3_650m_pre_8kb

🤗 HF Download

huggingface-cli download instadeepai/ntv3_650m_pre_8kb

📦 Install Lib

pip install -U transformers

⚖️ Free2AITools Nexus Index V2.0

Methodology Index Protocol

Semantic (S) 50

Authority (A) 0

Popularity (P) 14

Recency (R) 88

Quality (Q) 65

💬 Index Insight

FNI V2.0 for Ntv3 650m Pre 8kb: Semantic (S:50), Authority (A:0), Popularity (P:14), Recency (R:88), Quality (Q:65).

Free2AITools Nexus Index

Verification Authority

HuggingFace API GitHub Metadata Arxiv Citation DB System Audit

Unbiased Data Node Refresh: VFS Live

---

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Technical Deep Dive

ℹ️ Model Info: 8kb context

This model is only pre-trained on 8kbp sequences and intended solely for exploration.

They are NOT the main, recommended NTv3 models for results.

🧬 NTv3: A Foundation Model for Genomics

NTv3 is a series of foundational models designed to understand and generate genomic sequences. It unifies representation learning, functional prediction, and controllable sequence generation within a single, efficient U-Net-like architecture. It also enables the modeling of long-range dependencies, up to 1 Mb of context, at nucleotide resolution. Pretrained on 9 trillion base pairs, NTv3 excels at functional-track prediction and genome annotation across 24 animal and plant species. It can also be fine-tuned into a controllable generative model for genomic sequence design. This repository contains the MLM pre-trained models and weights. For more details, please refer to the [NTv3 paper placeholder].

⚖️ License Summary

The Licensed Models are only available under this License for Non-Commercial Purposes.
You are permitted to reproduce, publish, share and adapt the Output generated by the Licensed Model only for Non-Commercial Purposes and in accordance with this License.
You may not use the Licensed Models or any of its Outputs in connection with:
1. any Commercial Purposes, unless agreed by Us under a separate licence;
2. to train, improve or otherwise influence the functionality or performance of any other third-party derivative model that is commercial or intended for a Commercial Purpose and is similar to the Licensed Models;
3. to create models distilled or derived from the Outputs of the Licensed Models, unless such models are for Non-Commercial Purposes and open-sourced under the same license as the Licensed Models; or
4. in violation of any applicable laws and regulations.

📋 Model Summary

Architecture: U-Net style conv tower → Transformer stack → deconv tower → LM head
Tokenizer: character-level over A T C G N + specials (<unk> <pad> <mask> <cls> <eos> <bos>)
Selective intermediate outputs: use config to save specific layers
Dependencies: needs transformers >= 4.55.0
Input size: input sequence length need to be a multiple of 128
Note: custom code → use trust_remote_code=True

🚀 Quickstart

python

from transformers import AutoTokenizer, AutoModelForMaskedLM

repo = "InstaDeepAI/NTv3_650M_pre_8kb"
tok = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForMaskedLM.from_pretrained(repo, trust_remote_code=True)

batch = tok(["ATCGNATCG", "ACGT"], add_special_tokens=False, padding=True, pad_to_multiple_of=128, return_tensors="pt")
out = model(**batch)

print(out.logits.shape)  # (B, L, V = 11)

🔤 Tokenization

python

enc = tok("ATCGNATCG", add_special_tokens=False)
print(enc["input_ids"])  # char-level IDs

🔍 Getting hidden states and attentions

To get all hidden states and attention weights from all layers:

python

out = model(**batch, output_hidden_states=True, output_attentions=True)

# Access all hidden states (tuple of tensors, one per layer)
hidden_states = out.hidden_states
print(len(hidden_states))  # Number of layers
print(hidden_states[0].shape)  # (B, L, 1536)

# Access all attention weights (tuple of tensors, one per transformer layer)
attentions = out.attentions
print(len(attentions))  # Number of transformer layers
print(attentions[0].shape)  # (B, H = 24, L, L)

# Get final embedding (after deconv tower)
final_emb = out.hidden_states[-1]  # shape (B, L, 1536)

🛠️ Selective intermediate outputs

You can also save specific intermediate outputs with custom keys:

python

from ntv3_huggingface_new import Ntv3PreTrainedConfig

config = Ntv3PreTrainedConfig.from_pretrained(repo)
# Save embeddings from specific transformer layers
config.embeddings_layers_to_save = (1, 2)
# Save attention maps from specific layers/heads
config.attention_maps_to_save = [(1, 0), (2, 1)]  # (layer, head)
# Save embeddings from specific deconv layers
config.deconv_layers_to_save = (1, 2)

model = AutoModelForMaskedLM.from_pretrained(repo, config=config, trust_remote_code=True)
# Access via core's output dict (these are saved in addition to hidden_states/attentions)
core_out = model.core(**batch, output_hidden_states=True, output_attentions=True)
emb_1 = core_out['embeddings_1']  # Transformer layer 1
attn_1_0 = core_out['attention_map_layer_1_number_0']  # Layer 1, head 0
deconv_1 = core_out['embeddings_deconv_1']  # Deconv layer 1

📝 Getting input embeddings

python

emb_layer = model.get_input_embeddings()  # nn.Embedding(V = 11, D = 16)

🎯 Masked LM training

python

import torch
inputs = tok(["ATCGNATCG"], add_special_tokens=False, padding=True, pad_to_multiple_of=128, return_tensors="pt")
labels = inputs["input_ids"].clone(); labels[:] = -100
mask_id = tok.mask_token_id
inputs["input_ids"][0, 2] = mask_id
labels[0, 2] = tok.convert_tokens_to_ids("C")
out = model(**inputs, labels=labels)
print(out.loss.item())

📊 Shapes & config summary

Parameter	Value
Vocab size	11
Token embedding dim	16
Model (hidden) dim	1536
FFN dim	6144
Attention heads	24
Transformer layers	12
Downsample stages	7

⚡ Mixed precision

This model was originally trained with mixed precision (bf16) in JAX and later ported to Torch. During JAX training, all weights maintained full fp32 precision at all times, but certain inferences were performed in bf16 for efficiency. This repo will be loaded with full precision (fp32) inference by default to ensure numerical stability. However, it can be used with mixed precision (bf16) for efficient long range training and inferences. Do note, to support bfloat16 precision, you need to use a GPU with bfloat16 support (e.g. A100, H100, etc.). Also, loading the model with mixed precision would introduce numerical instability, including small differences to the original JAX model. The difference is usually insignificant, but be aware of it when using the model.

To load the model with mixed precision, use the following code:

python

from transformers import AutoTokenizer, AutoModelForMaskedLM

repo = "InstaDeepAI/NTv3_650M_pre_8kb"
tok = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForMaskedLM.from_pretrained(
    repo, trust_remote_code=True,
    stem_compute_dtype='bfloat16',
    down_convolution_compute_dtype='bfloat16',
    transformer_qkvo_compute_dtype='bfloat16',
    transformer_ffn_compute_dtype='bfloat16',
    up_convolution_compute_dtype='bfloat16',
    modulation_compute_dtype='bfloat16',
)

⚠️ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

HuggingFace Hub

231Downloads

Hub Discussions

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-model--instadeepai--ntv3_650m_pre_8kb
slug: instadeepai--ntv3_650m_pre_8kb
source: huggingface
author: InstaDeepAI
license: Other
tags: transformers, safetensors, ntv3, fill-mask, genomics, dna, masked-lm, long-range, custom_code, code, license:other, region:us

⚙️ Technical Specs

architecture: NTv3PreTrained
params billions: 0.65
context length: 8,192
pipeline tag: fill-mask
vram gb: 1.8
vram is estimated: true
vram formula: VRAM ≈ (params * 0.75) + 0.8GB (KV) + 0.5GB (OS)

📊 Engagement & Metrics

downloads: 231
stars: 0
forks: 0

Data indexed from public sources. Updated daily.

Cite this model

🔬Technical Deep Dive

Quick Commands

⚖️ Free2AITools Nexus Index V2.0

💬 Index Insight

Verification Authority

🚀 What's Next?

Find Training Datasets

Compare Benchmarks

Deployment Guide

Technical Deep Dive

ℹ️ Model Info: 8kb context

🧬 NTv3: A Foundation Model for Genomics

⚖️ License Summary

📋 Model Summary

🚀 Quickstart

🔤 Tokenization

🔍 Getting hidden states and attentions

🛠️ Selective intermediate outputs

📝 Getting input embeddings

🎯 Masked LM training

📊 Shapes & config summary

⚡ Mixed precision

⚠️ Incomplete Data

📝 Limitations & Considerations

Social Proof

🛡️ Model Transparency Report

🆔 Identity & Source

⚙️ Technical Specs

📊 Engagement & Metrics