ComfyUI-TangoFlux

ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". These nodes, adapted from the official implementations, generates high-quality 44.1kHz audio up to 30 seconds using just a text promptproduction.

Installation

Navigate to your ComfyUI's custom_nodes directory:

cd ComfyUI/custom_nodes

Clone this repository:

git clone https://github.com/LucipherDev/ComfyUI-TangoFlux

Install requirements:

cd ComfyUI-TangoFlux
python install.py

Or Install via ComfyUI Manager

Check out some demos from the official demo page

Example Workflow

Usage

Models can be downloaded using the install.py script

Manual Download:

Download TangoFlux from here into models/tangoflux
Download text encoders from here into models/text_encoders/google-flan-t5-large

(Include Everything as shown in the screenshot above. Do Not Rename Anything)

The nodes can be found in "TangoFlux" category as TangoFluxLoader, TangoFluxSampler, TangoFluxVAEDecodeAndPlay.

If you are on low VRAM, try enabling offload_model_to_cpu in TangoFluxSampler.

The audio output of the TangoFluxVAEDecodeAndPlay can be used as audio input for theComfyUI-VideoHelperSuite VideoCombine node. (This will not sync audio to the video)

TeaCache can speedup TangoFlux 2x without much audio quality degradation, in a training-free manner.

📈 Inference Latency Comparisons on a Single A800

TangoFlux TeaCache (0.25) TeaCache (0.4)

~4.08 s ~2.42 s ~1.95 s

Citation

@misc{hung2024tangofluxsuperfastfaithful,
      title={TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization}, 
      author={Chia-Yu Hung and Navonil Majumder and Zhifeng Kong and Ambuj Mehrish and Rafael Valle and Bryan Catanzaro and Soujanya Poria},
      year={2024},
      eprint={2412.21037},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2412.21037}, 
}

@article{liu2024timestep,
  title={Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model},
  author={Liu, Feng and Zhang, Shiwei and Wang, Xiaofeng and Wei, Yujie and Qiu, Haonan and Zhao, Yuzhong and Zhang, Yingya and Ye, Qixiang and Wan, Fang},
  journal={arXiv preprint arXiv:2411.19108},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
example_workflows		example_workflows
tangoflux		tangoflux
web/js		web/js
LICENSE		LICENSE
README.md		README.md
STABILITY_AI_COMMUNITY_LICENSE		STABILITY_AI_COMMUNITY_LICENSE
__init__.py		__init__.py
install.py		install.py
nodes.py		nodes.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
server.py		server.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI-TangoFlux

Installation

Or Install via ComfyUI Manager

Check out some demos from the official demo page

Example Workflow

Usage

📈 Inference Latency Comparisons on a Single A800

Citation

About

Languages

License

LucipherDev/ComfyUI-TangoFlux

Folders and files

Latest commit

History

Repository files navigation

ComfyUI-TangoFlux

Installation

Or Install via ComfyUI Manager

Check out some demos from the official demo page

Example Workflow

Usage

📈 Inference Latency Comparisons on a Single A800

Citation

About

Resources

License

Stars

Watchers

Forks

Languages