{"id":18,"date":"2026-01-24T19:32:36","date_gmt":"2026-01-24T19:32:36","guid":{"rendered":"https:\/\/julie.your-ai-buddy.net\/?p=18"},"modified":"2026-01-27T20:25:58","modified_gmt":"2026-01-27T20:25:58","slug":"adventures-in-ltx-2","status":"publish","type":"post","link":"https:\/\/julie.your-ai-buddy.net\/index.php\/2026\/01\/24\/adventures-in-ltx-2\/","title":{"rendered":"First Steps in LTX-2"},"content":{"rendered":"\n<h3 class=\"wp-block-heading\"><strong>Welcome to Rosie&#8217;s<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Updated my ComfyUI and discovers a few new templates. My attention caught new image generation platform LTX-2. Decided to try it out. After some fiddling got some results. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">This is the clip generated by the <a href=\"https:\/\/u.pcloud.link\/publink\/show?code=XZ2c4K5ZUQR8iiUqbD5GCtwOoeAQBy4aDY9k\" target=\"_blank\" rel=\"noreferrer noopener\">default image-to-video template<\/a>. Click the image to play the clip in a new page.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><a href=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/welcome2rosies.mp4\" target=\"_blank\" rel=\" noreferrer noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"755\" height=\"1024\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/welcom2rosies-755x1024.png\" alt=\"\" class=\"wp-image-56\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/welcom2rosies-755x1024.png 755w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/welcom2rosies-221x300.png 221w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/welcom2rosies-768x1042.png 768w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/welcom2rosies.png 896w\" sizes=\"auto, (max-width: 755px) 100vw, 755px\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">The source image was generated in the same ComfyUI using a <a href=\"https:\/\/u.pcloud.link\/publink\/show?code=XZHc4K5ZwxM9F3qrBuL35I7PfvvjbjVJperk\" target=\"_blank\" rel=\"noreferrer noopener\">simple custom template<\/a> with <a href=\"https:\/\/civitai.com\/models\/928145\/jedpointreal\" target=\"_blank\" rel=\"noreferrer noopener\">JedPointReal checkpoint<\/a>. The clip took about 4 minutes to generate. Clip length: 4 seconds.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Next Song<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">These results got me pretty much encouraged and I tried making another clip, this time trying to make the character sing. Well, singing was a problem, so I went back to talking. Here is the resulting clip. Click the image to play in in a new page. The source image for the i2v LTX-2 workflow was also generated using ComfyUI default <a href=\"https:\/\/u.pcloud.link\/publink\/show?code=XZ4OQK5ZOarpacGB0ipuCc1oHFNigmCfip9V\" target=\"_blank\" rel=\"noreferrer noopener\"><strong>HiDream I1 Full<\/strong> workflow<\/a> and its default fp8 checkpoint. Resolution: 720&#215;1280. Generation time: about 10 minutes.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/next-song.mp4\" target=\"_blank\" rel=\" noreferrer noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/next-song.png\" alt=\"\" class=\"wp-image-74\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/next-song.png 1024w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/next-song-300x300.png 300w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/next-song-150x150.png 150w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/next-song-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">While the voice was generated along with the clip by the <a href=\"https:\/\/u.pcloud.link\/publink\/show?code=XZ0OQK5Zmbc784RPsdj6hB064AoDCR4mMsJk\" target=\"_blank\" rel=\"noreferrer noopener\">default LTX-2 i2v workflow<\/a>, the guitar strumming was added in using ffmpeg. Well, this one, while not being out of hands bad, is not something to write home about. Here, there is no mistaking that it was AI-generated. And it also has some quality issues, such as, e.g., changing the facial features and making the skin sort of rubbery and too wrinkly. I don&#8217;t know, perhaps fiddling with the prompts could improve the quality, but I am just giving this a fast go over. In any case, I think it&#8217;s good enough for some purposes.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Taste of Cousins<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Next, I decided to check out how long generated clips could be. I tried a couple of custom workflows using GGUF checkpoints, but they all would OOM on anything longer than 5 seconds. I did not start investigating the reasons why, because I got an idea to try using the CPU as the main processing device instead of the GPU by starting ComfyUI with &#8211;novram parameter. It turned out to be the right direction and after several unsuccessful attempts I managed to generate this 10 second clip using the <a href=\"https:\/\/u.pcloud.link\/publink\/show?code=XZ5o4K5ZiuSSymUKrip9cxWeN5eE2uwcOJ8V\" target=\"_blank\" rel=\"noreferrer noopener\">default LTX-2 i2v workflow<\/a>. Click the image to watch the clip in a new page.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/ate-cousins.mp4\" target=\"_blank\" rel=\" noreferrer noopener\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/frame_0002-1.png\" alt=\"\" class=\"wp-image-46\" srcset=\"https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/frame_0002-1.png 1024w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/frame_0002-1-300x300.png 300w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/frame_0002-1-150x150.png 150w, https:\/\/julie.your-ai-buddy.net\/wp-content\/uploads\/2026\/01\/frame_0002-1-768x768.png 768w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Not perfect, but good enough for memes and YouTube shorts not pretending to be realistic.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Using &#8211;novram parameter turned out to be good enough to allow using the standard Gemma <a href=\"https:\/\/huggingface.co\/Comfy-Org\/ltx-2\/resolve\/main\/split_files\/text_encoders\/gemma_3_12B_it.safetensors\" target=\"_blank\" rel=\"noreferrer noopener\">gemma_3_12B_it.safetensors<\/a> text encoder, the heaviest checkpoint (<a href=\"https:\/\/huggingface.co\/Lightricks\/LTX-2\/resolve\/main\/ltx-2-19b-dev.safetensors\" target=\"_blank\" rel=\"noreferrer noopener\">ltx-2-19b-dev.safetensors<\/a>), and even string up two LoRAs at each stage. Practically all the memory use was in RAM (up to 75-80% of the total 128 GB), but VRAM was also getting used, up to about 50% of the 12 GB of the dedicated GPU at the refiner stage. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Welcome to Rosie&#8217;s Updated my ComfyUI and discovers a few new templates. My attention caught new image generation platform LTX-2. Decided to try it out. After some fiddling got some results. This is the clip generated by the default image-to-video template. Click the image to play the clip in a new page. The source image &#8230; <a title=\"First Steps in LTX-2\" class=\"read-more\" href=\"https:\/\/julie.your-ai-buddy.net\/index.php\/2026\/01\/24\/adventures-in-ltx-2\/\" aria-label=\"Read more about First Steps in LTX-2\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3,4],"tags":[],"class_list":["post-18","post","type-post","status-publish","format-standard","hentry","category-comfyui","category-ltx-2-i2v"],"_links":{"self":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts\/18","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/comments?post=18"}],"version-history":[{"count":47,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts\/18\/revisions"}],"predecessor-version":[{"id":116,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/posts\/18\/revisions\/116"}],"wp:attachment":[{"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/media?parent=18"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/categories?post=18"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/julie.your-ai-buddy.net\/index.php\/wp-json\/wp\/v2\/tags?post=18"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}