Audio Dataset Manager
Audio Dataset Manager is a comprehensive toolkit designed to streamline the preparation of audio datasets for TTS (Text-to-Speech) training and voice cloning projects.
Born from the need to efficiently process dozens of hours of audiobook data for voice cloning, this tool bridges the gap between raw audio files and training-ready datasets.
Diffusers in ComfyUI
Diffusers in ComfyUI is a custom node that integrates the Hugging Face Diffusers pipeline directly into ComfyUI. It is available through ComfyUI Manager and the Comfy Registry.
Supports txt2img, img2img, inpainting, LoRA, and BLoRA workflows across SD 1.5 and SDXL models.
Decimator
Decimator was originally developed to let museums quickly optimize their digital collections for presentation in virtual environments (mostly XR). It has grown steadily over time, and I recently set out to rewrite the code entirely. The project started in 2018, when my coding skills were still young and I was not very good at architecture. That has changed a lot since then, so a new version will hopefully come along soon!