LTX 2.3 Studio
| Entity Passport | |
| Registry ID | hf-space--techfreakworm--ltx2.3-studio |
| Provider | huggingface |
Cite this space
Academic & Research Attribution
@misc{hf_space__techfreakworm__ltx2.3_studio,
author = {techfreakworm},
title = {LTX 2.3 Studio Space},
year = {2026},
howpublished = {\url{https://huggingface.co/spaces/techfreakworm/LTX2.3-Studio}},
note = {Accessed via Free2AITools Knowledge Fortress}
} ๐ฌTechnical Deep Dive
Full Specifications [+]โพ
โ๏ธ Free2AITools Nexus Index V2.0
๐ฌ Index Insight
FNI V2.0 for LTX 2.3 Studio: Semantic (S:50), Authority (A:54), Popularity (P:47), Recency (R:81), Quality (Q:70).
Verification Authority
Environment Profile
LTX 2.3 Studio
A single-process Gradio app that wraps LTX-2.3 โ Lightricks' open 22B video generation model โ under one focused UI. Six modes (text ยท image ยท audio ยท lipsync ยท keyframe ยท style) sharing the same ComfyUI All-In-One workflow. Runs locally on Apple Silicon (MPS) or NVIDIA (CUDA), deploys to Hugging Face Spaces (ZeroGPU).
โ Live demo: https://huggingface.co/spaces/techfreakworm/LTX2.3-Studio
What's inside
Six modes wired through the same ComfyUI All-In-One workflow. Each mode exposes only the inputs it actually consumes โ the form stays short and focused.
| Mode | Inputs | Output | Notes |
|---|---|---|---|
| Text โ Video | Prompt (+ optional audio prompt) | mp4 (+ optional wav) | The core mode. Camera-control LoRAs auto-applied by keyword. |
| Audio โ Video | Prompt + audio track | mp4 with the input audio preserved | Conditions motion on the audio waveform. |
| Image โ Video | Image + prompt | mp4 (+ optional audio) | Image-conditioned generation. |
| Lipsync | Image + audio | mp4 with audio | Viseme-aligned mouth motion. |
| Keyframe | First + last frames + prompt | mp4 | Latent interpolation between two anchors. |
| Style Transfer | Source video + style image | mp4 | IC-LoRA restyle; motion preserved from source. |
Every mode carries Fast / Balanced / Quality presets (steps ร 1, ร 1.5, ร 3). A per-mode ZeroGPU duration estimator adapts the call timeout to the requested workload.
Quick start (local)
Requires Python 3.11, ~80 GB free disk for the weight set, and ~24 GB VRAM (CUDA) or ~32 GB unified memory (Apple Silicon).
git clone --recurse-submodules https://github.com/techfreakworm/ltx2.3-AIO-generator
cd ltx2.3-AIO-generator
bash setup.sh # creates .venv, installs ComfyUI + pinned custom nodes + app deps
source .venv/bin/activate
python app.py # http://127.0.0.1:7860
The first run resolves model weights into your HF cache (~/.cache/huggingface/hub/) and symlinks them into comfyui/models/<comfy_type>/. Subsequent starts skip the download. Expect ~70 GB of weights pulled on a cold first run.
Apple Silicon notes. PYTORCH_ENABLE_MPS_FALLBACK=1 is set automatically so the few MPS-unsupported ops fall back to CPU. ComfyUI's VRAM autodetect picks the right tier; override with LTX23_AIO_VRAM=lowvram|normalvram|highvram if you need to force one.
LAN access (phone / tablet on the same WiFi): python app.py binds 0.0.0.0:7860. Visit http://<your-LAN-IP>:7860 from another device. On macOS, allow inbound for python in System Settings โ Network โ Firewall if the connection refuses.
Quick start (HF Spaces)
This repo is a Gradio Space. The Pro tier provides ZeroGPU (A10G) access and the per-call duration budget needed for the Balanced and Quality presets.
git remote add space https://huggingface.co/spaces//LTX2.3-Studio
git push space master:main # local branch is master; HF Space deploys from main
โ The refspec
master:mainmatters. The local default branch ismaster(GitHub convention); the HF Space deploys frommain. A baregit push space mastercreates an orphan remote branch that does NOT trigger a deploy.
The Space's preload_from_hub directive (see the YAML at the top of this file) bakes ~111 GB of weights into the build image. app.py:_bootstrap() then:
- Clones ComfyUI + pinned custom nodes into
~/comfyuion cold start (ZeroGPU container freezes preserve them across calls) - Mirrors the read-only preload cache into
~/hf-cache-rw/โ works around the build-user-vs-runtime-user permissions trap (preloaded files are root-owned; we run as uid 1000 and can't write to them, so any lazy download to the cache would fail withPermission denied) - Stages seed input files into
comfyui/input/so workflow loaders don't error before any user upload arrives
Subsequent requests hit warm cache โ no network traffic on inference 2+.
ZeroGPU duration estimator. Each generate call carries a dynamic @spaces.GPU(duration=N) calculated from mode, preset, and frame count. Clamped at [60, 900] s. On timeout ("GPU task aborted"), the handler auto-retries once at 2ร duration.
Architecture
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
browser โโโถโ app.py โ Gradio Blocks โ
โ header ยท drawer ยท 6 mode tabs โ
โโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ backend.py โ
โ ComfyUILibraryBackend โ
โ @spaces.GPU(duration=callable) โ
โ calls PromptExecutor directly โ
โโโโโโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโ
โ
โโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโดโโโโโโโฌโโโโโโโโโโโโโโโโโโโ
โผ โผ โผ โผ โผ
modes.py models.py workflow.py ui.py tools/
per-mode walk + ensure load + patch per-mode form extract_modes.py
parameterize from HF cache API-format JSON builders (regen workflows/)
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ comfyui/ โ
โ submodule (local) โ
โ runtime clone at ~/comfyui โ
โ on HF Spaces โ
โ โ
โ โโโ custom_nodes/ (pinned SHAs)โ
โ โโโ models/ โ HF cache symlinksโ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
One backend, one process. The @spaces.GPU decorator is the only divergence between local and Spaces runtime. ComfyUI manages VRAM via its tiered presets โ no empty_cache() sprinkling needed elsewhere.
Workflow as data. Each of the six modes is a user-exported API-format JSON in workflows/. The mode handler patches a deep-copied template (modes.parameterize_fn) and hands it to ComfyUI's PromptExecutor. Updating the master workflow is a three-step ritual: edit in the ComfyUI editor โ export โ python tools/extract_modes.py --master ... --out workflows.
Project layout
.
โโโ app.py # Gradio Blocks entry, _bootstrap, _on_generate, mode tabs
โโโ backend.py # ComfyUILibraryBackend, @spaces.GPU, duration estimator
โโโ modes.py # MODE_REGISTRY + per-mode parameterize_fn + node-id constants
โโโ models.py # MODEL_REGISTRY, walk_workflow_for_models, ensure_models
โโโ ui.py # render_status, _render_idle, mode-form layout primitives
โโโ workflow.py # load_template, set_input helpers
โโโ workflows/ # API-format mode JSONs (do not hand-edit)
โ โโโ t2v.json
โ โโโ i2v.json
โ โโโ a2v.json
โ โโโ lipsync.json
โ โโโ keyframe.json
โ โโโ style.json
โโโ assets/seed_inputs/ # placeholder image / audio / video for cold-start staging
โโโ tools/
โ โโโ extract_modes.py # regenerate workflows/ from a master ComfyUI export
โโโ docs/
โ โโโ future_improvements.md
โ โโโ superpowers/{specs,plans}/ # spec + implementation plans per feature
โโโ tests/ # L1 + L3 in CI; L2 with --comfy-real; L4 GPU smoke
โโโ README.md # this file (HF Space YAML + project intro)
โโโ CLAUDE.md # project facts + gotchas (what & why)
โโโ AGENTS.md # tool-agnostic agent rulebook
โโโ SKILLS.md # process / debugging / deployment (how)
โโโ requirements.txt # pinned deps
โโโ pyproject.toml # ruff + pytest config (py311)
โโโ setup.sh # venv + ComfyUI + custom nodes bootstrap
โโโ comfyui/ # git submodule (local) / runtime clone target (Spaces)
Tech stack
- Gradio 5.50 โ UI shell, native components,
gr.Progress(track_tqdm=True) - ComfyUI โ library-mode
PromptExecutor(pinned commit; submodule locally, runtime-cloned on Spaces) - LTX-2.3 22B by Lightricks โ primary diffusion transformer (BF16 weights via Kijai/LTX2.3_comfy)
- Gemma 3 12B by Google โ multimodal text encoder (requires the full 5-shard model โ text-only checkpoints crash on meta-tensor allocation in SDPA)
- Custom nodes (pinned SHAs in
app.CUSTOM_NODES_PINNED):- Lightricks/ComfyUI-LTXVideo โ LTX sampler / decoder nodes
- kijai/ComfyUI-KJNodes โ utility nodes
- rgthree/rgthree-comfy โ Power-Lora-Loader
- Kosinkadink/ComfyUI-VideoHelperSuite โ video I/O
- pythongosssss/ComfyUI-Custom-Scripts โ string / dict helpers
- city96/ComfyUI-GGUF โ GGUF transformer loader
- Fannovel16/comfyui_controlnet_aux โ DWPose for Lipsync/Style preprocessors
- evanspearman/ComfyMath โ math nodes for the workflow's keyframe path
- Smirnov75/ComfyUI-mxToolkit โ utility nodes
- DoctorDiffusion/ComfyUI-MediaMixer โ
FinalFrameSelector
- HF Spaces ZeroGPU (A10G) โ
@spaces.GPU(duration=โฆ)for queue-priority signalling and per-call timeout
Design
Theme: Topaz Cinema Slate โ slate substrate #1A1F26, warm amber accent #E0A458 used sparingly, IBM Plex Sans throughout. Defined as _TOPAZ_THEME + _CUSTOM_CSS in app.py.
Layout: hamburger drawer. Pinned 220 px sidebar at โฅ1024 px (mode buttons + model status + settings); below 1024 px it slides in as a fixed overlay via the .aio-shell.drawer-open class. The header carries a live mode tag (T2V/A2V/I2V/LIPSYNC/KEY/STYLE) updated by JS without a server round-trip.
Spec, plan, and design rationale live under docs/superpowers/specs/ and docs/superpowers/plans/.
Notes on running
- First inference is slow. Cold-start workflow validation + model load on the active node graph takes ~30 โ 90 s. Subsequent calls within the same session reuse loaded models.
- VRAM tier is auto-detected; override with
LTX23_AIO_VRAM=lowvram|normalvram|highvram. - ZeroGPU duration cap. The per-call estimator clamps to
[60, 900] s. If a generation aborts with"GPU task aborted", the handler retries once at 2ร duration. The duration field is the queue-priority signal, not a billing cap. - Output directory. Local:
comfyui/output/LTX2.3/. Spaces:~/comfyui/output/LTX2.3/. Both are whitelisted viaallowed_paths=on launch (Gradio 5 file-access policy). - Local LAN testing. Bound to
0.0.0.0:7860. macOS firewall: allow inbound forpythonif a connection from your phone refuses.
License
MIT for the AIO app code (see LICENSE).
- ComfyUI is GPL-3.0.
- LTX-2.3 and Lightricks-published LoRAs / auxiliaries retain Lightricks' open-source licensing โ see the individual model cards on Hugging Face.
- Gemma 3 weights are subject to Google's Gemma Terms of Use.
- Each pinned custom node retains its own license; see the linked repositories.
Credits
- LTX-2.3 by Lightricks
- ComfyUI by comfyanonymous
- Gemma 3 by Google DeepMind
- All-In-One ComfyUI workflow that this app wraps โ by Danielle Falco (FutuTek)
- Workflow nodes by Lightricks, kijai, rgthree, Kosinkadink, pythongosssss, city96, Fannovel16, evanspearman, Smirnov75, DoctorDiffusion
Built by @techfreakworm โ drop a โฅ on the Space if it's useful, and follow there for what's next.
AI Summary: Based on Hugging Face metadata. Not a recommendation.
๐ก๏ธ Space Transparency Report
Technical metadata sourced from upstream repositories.
๐ Identity & Source
- id
- hf-space--techfreakworm--ltx2.3-studio
- slug
- techfreakworm--ltx2.3-studio
- source
- huggingface
- author
- techfreakworm
- license
- tags
- gradio, region:us
โ๏ธ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
- gradio
๐ Engagement & Metrics
- downloads
- 0
- stars
- 0
- forks
- null
Data indexed from public sources. Updated daily.