{"id":225,"date":"2026-03-09T17:08:05","date_gmt":"2026-03-09T17:08:05","guid":{"rendered":"https:\/\/julie.your-ai-buddy.net\/?p=225"},"modified":"2026-03-09T19:43:31","modified_gmt":"2026-03-09T19:43:31","slug":"ltx-2-3-audi-image-to-video-workflow","status":"publish","type":"post","link":"https:\/\/julie.your-ai-buddy.net\/index.php\/2026\/03\/09\/ltx-2-3-audi-image-to-video-workflow\/","title":{"rendered":"LTX-2.3 Audio\/Image-to-Video Workflow"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">Ran across this LTX-2.3 ai2v workflow (voice cloning\/talking head) on <a href=\"https:\/\/www.youtube.com\/watch?v=frP39jsAXEA&amp;t=1s\" target=\"_blank\" rel=\"noreferrer noopener\">YouTube<\/a>. Decided to try it out. The first problem occurred when ComfyUI was saying that some nodes were missing in the workflow, but Manager did not show any missing nodes. <\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"774\" height=\"644\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/mising_nodes.png\" alt=\"\" class=\"wp-image-250\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/mising_nodes.png 774w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/mising_nodes-300x250.png 300w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/mising_nodes-768x639.png 768w\" sizes=\"auto, (max-width: 774px) 100vw, 774px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Asked ChatGPT about this. It gave some asinine explanation why Manager was not showing the missing nodes, but did provide <a href=\"http:\/\/git clone https:\/\/github.com\/AICoderTudou\/ComfyUI-TD-Qwen3TTS.git\">a link to where to get them from<\/a>. Did<strong> git clone<\/strong> and then <strong>pip install -r requirements.txt<\/strong> for this package. With all nodes in place, tried to run it. No good. Got this error message:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/vivaldi_JKn5UR9DNe-1024x465.png\" alt=\"\" class=\"wp-image-238\"\/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Translated into English:<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em><strong>You must enter a reference text (ref_text), which should contain the actual spoken content from the reference audio. If you cannot provide text, please enable &#8216;x_vector_only_mode&#8217;.<\/strong><\/em><\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img decoding=\"async\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/vivaldi_kCoGFj6ND1.png\" alt=\"\" class=\"wp-image-243\" style=\"width:607px;height:auto\"\/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">That is, besides the reference audio, it wanted the text of that reference audio in field<strong> ref_text<\/strong>. But since this field was disabled in this workflow and I did not feel like figuring out why it was disabled, I just enabled switch <strong>x_vector_only_mode<\/strong>. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Ran it again and got an error again, but a different one this time.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"668\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/custom_voice-1024x668.png\" alt=\"\" class=\"wp-image-249\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/custom_voice-1024x668.png 1024w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/custom_voice-300x196.png 300w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/custom_voice-768x501.png 768w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/custom_voice.png 1133w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Since this did not tell much, asked ChatGPT again. It said that I had a wrong model here:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"567\" height=\"242\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/model_loader_custom_voice.png\" alt=\"\" class=\"wp-image-255\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/model_loader_custom_voice.png 567w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/model_loader_custom_voice-300x128.png 300w\" sizes=\"auto, (max-width: 567px) 100vw, 567px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Asked ChatGPT which of the available models should I use:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"377\" height=\"250\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/available_models.png\" alt=\"\" class=\"wp-image-248\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/available_models.png 377w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/available_models-300x199.png 300w\" sizes=\"auto, (max-width: 377px) 100vw, 377px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">It said to use VoiceDesign.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">No problem. Took about 20 minutes to download it, but, unfortunately, it was no good either:<br><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"671\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/voice_design-1024x671.png\" alt=\"\" class=\"wp-image-253\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/voice_design-1024x671.png 1024w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/voice_design-300x197.png 300w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/voice_design-768x503.png 768w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/03\/voice_design.png 1129w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Back to ChatGPT. This time it dumped a shitload of technical gobbledygook, from which I only more of less understood the summary:<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><em><strong>&#8220;Your current TD Qwen3TTS node stack does not provide voice cloning through <code>TDQwen3TTSVoiceClone<\/code>, regardless of whether you load <code>CustomVoice<\/code> or <code>VoiceDesign<\/code>. &#8220;<\/strong><\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Whatever that may mean. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">So, this is it for the active stage. Back to the drawing board trying to figure out what it needs.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Ran across this LTX-2.3 ai2v workflow (voice cloning\/talking head) on YouTube. Decided to try it out. The first problem occurred when ComfyUI was saying that some nodes were missing in the workflow, but Manager did not show any missing nodes. Asked ChatGPT about this. It gave some asinine explanation why Manager was not showing the &#8230; <a title=\"LTX-2.3 Audio\/Image-to-Video Workflow\" class=\"read-more\" href=\"https:\/\/julie.your-ai-buddy.net\/index.php\/2026\/03\/09\/ltx-2-3-audi-image-to-video-workflow\/\" aria-label=\"Read more about LTX-2.3 Audio\/Image-to-Video Workflow\">Read more<\/a><\/p>\n","protected":false},"author":9,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-225","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts\/225","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/users\/9"}],"replies":[{"embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/comments?post=225"}],"version-history":[{"count":19,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts\/225\/revisions"}],"predecessor-version":[{"id":267,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts\/225\/revisions\/267"}],"wp:attachment":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/media?parent=225"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/categories?post=225"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/tags?post=225"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}