Andreas Welsch
Chief AI Strategist, Intelligence Briefing
How generating images to discard them impacts energy use, emissions, and sustainability.
Copyright: intelligencebriefing.substack.com – “Understanding Generative AI’s Impact On Sustainability”
As if getting our “triple-foam double-shot oat milk latte with caramel drizzle on top” wasn’t the biggest problem on our mind, we have a new first-world problem: AI-assisted image & video generation.
Let me explain…
Improvements And Limitations Of Prompts And Models
The images or videos you create using leading Generative AI tools rarely match your expectations on the first attempt. At least not in my case. Tools like Midjourney and DALL-E 3 (image generation) or Runway and Pika Labs (video generation) are powerful, but they’re not mind readers. And most of us are not robot whisperers either.
Sure, one approach for generating better output is improving your prompting skills. There are several style guides and examples publicly available. They’re a great start to give you some help and some inspiration. The one that I occasionally use for Midjourney is https://midlibrary.io. It provides a great library of styles.
Thank you for reading this post, don't forget to subscribe to our AI NAVIGATOR!
On the other hand, vendors need to further mature the models underlying these products — and they will. Generating a 3- or 4-second-long video based on a text prompt is already an impressive technological achievement. But there’s more room for growth: 60-second long scenes, consistency between scenes, and entire AI-generated movies. You get the idea.
Recent updates to image generation tools like Midjourney (v5, v5.2, and v6) have advanced the tool significantly within the span of a few short months. For example, getting the model to generate more text more accurately has been a huge step forward. But it’s not 100% reliable, yet. That’s why for every usable image you generate, you have a handful of images that you discard. And that’s the problem.[…]
Read more: www.intelligencebriefing.substack.com
Andreas Welsch
Chief AI Strategist, Intelligence Briefing
Andreas Welsch is an internationally recognized AI leader in the software industry with over 21 years of experience. Andreas has led regional business development teams for AI, built and led an AI Center of Excellence, and currently leads product marketing and go-to-market strategy for AI at SAP, the world’s leading business application provider. He has successfully managed stakeholder relationships with business leaders and technology teams across Fortune 500 companies in more than 80 innovation projects, and helped create an AI mindset across organizations.
Andreas is best known as the creator of the Intelligence Briefing series on LinkedIn and the popular “What’s the BUZZ?” live stream and podcast. He is a frequent keynote speaker and guest on expert panels and podcasts.
Industry focus: High Tech
Previous awards by SwissCognitive:
How generating images to discard them impacts energy use, emissions, and sustainability.
Copyright: intelligencebriefing.substack.com – “Understanding Generative AI’s Impact On Sustainability”
As if getting our “triple-foam double-shot oat milk latte with caramel drizzle on top” wasn’t the biggest problem on our mind, we have a new first-world problem: AI-assisted image & video generation.
Let me explain…
Improvements And Limitations Of Prompts And Models
The images or videos you create using leading Generative AI tools rarely match your expectations on the first attempt. At least not in my case. Tools like Midjourney and DALL-E 3 (image generation) or Runway and Pika Labs (video generation) are powerful, but they’re not mind readers. And most of us are not robot whisperers either.
Sure, one approach for generating better output is improving your prompting skills. There are several style guides and examples publicly available. They’re a great start to give you some help and some inspiration. The one that I occasionally use for Midjourney is https://midlibrary.io. It provides a great library of styles.
Thank you for reading this post, don't forget to subscribe to our AI NAVIGATOR!
On the other hand, vendors need to further mature the models underlying these products — and they will. Generating a 3- or 4-second-long video based on a text prompt is already an impressive technological achievement. But there’s more room for growth: 60-second long scenes, consistency between scenes, and entire AI-generated movies. You get the idea.
Recent updates to image generation tools like Midjourney (v5, v5.2, and v6) have advanced the tool significantly within the span of a few short months. For example, getting the model to generate more text more accurately has been a huge step forward. But it’s not 100% reliable, yet. That’s why for every usable image you generate, you have a handful of images that you discard. And that’s the problem.[…]
Read more: www.intelligencebriefing.substack.com
Share this: