Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Anthropic throttles Claude rate limits, devs call foul

Why Dispo’s co-founder made the leap from social media to steelmaking

Bell and Cohere partner to sell AI tools to governments, businesses

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Industry Applications

How Getty Images Built a Generative AI Model Without Scraping the Web

By Advanced AI EditorApril 18, 2025No Comments7 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


(Source: Andrey Suslov/Shutterstock)

Generative AI can conjure up just about any image, but it rarely tells you where that image came from or who deserves credit. The recent controversy surrounding Studio Ghibli and OpenAI offered a glimpse of what’s at stake, as AI-generated images mimicking the studio’s distinctive animation style went viral, despite having no connection to Hayao Miyazaki and no authorization to imitate his work.

In an AI-saturated world where AI models are often trained on scraped and unlicensed content, Getty Images is offering a different kind of tool: an image generation model custom-built entirely on licensed, human-created content, with a royalty system that ensures contributors are paid for their work. 

To learn more about how this works in practice, AIwire spoke with Andrea Gagliano, Getty Images’ head of AI/ML. Her team oversees the company’s search and generative AI efforts, rooted in the Creative side of Getty Images, comprising the images, illustrations, and videos used in advertising and marketing campaigns. Unlike the company’s Editorial content, which covers celebrities, politics, and current events, the Creative library provides a foundation that’s free of copyright concerns, drawn entirely from licensed contributor content. 

Getty Images has built strict safeguards into its AI generator: it will not generate known likenesses or recognizable trademarks, ensuring the content is safe for commercial use. Gagliano says customers need visuals they can use freely without worrying about legal risk. The goal is to support creativity on both ends: empowering users to push boundaries while continuing to invest in the artists who make it all possible. 

“We really think that it can elevate creativity and allow our customers, creatives, and artists to be more conceptual or to push the boundaries in terms of creativity, but we want to harness that power while also making sure that we do so in a way that protects creators and is done in a commercially safe way,” Gagliano said. 

Rather than training their model on public data scraped from the internet, Getty relies entirely on its licensed creative library of about 200 million images, and contributors are compensated through a revenue-share model that rewards them for the life of the product. Gagliano says the content is “licensed from photographers and contributors, and it gives compensation back to those contributors on a recurring basis, so not just a one-time fee, but as a percentage of revenue here into eternity, based on how much that generative tool makes.”

AIwire tested Getty Images’ AI generation model to create this image of an artist and his AI assistant. The user interface was highly intuitive, and the built-in prompt builder was effective and simple to use. We were also able to fine-tune the image using a special tool to highlight areas we wanted to refine using additional prompting.

Unlike many generative tools, Getty Images’ model offers something concrete: legal assurance and commercial usability. Generated visuals come with automatic legal protection of up to $50,000 per image, and the company offers uncapped indemnification as part of its enterprise solutions, along with perpetual and worldwide usage rights, and no limits on print runs or digital impressions. Additionally, user outputs are never added to Getty’s searchable creative library, and prompt safeguards are in place to prevent the generation of known brands, logos, or celebrity likenesses. “Safe for commercial use” isn’t just a claim but a foundation of the tool. 

Promising a truly copyright-free image is not an easy task. To ensure that standard, Getty Images’ generative model wasn’t adapted from any existing foundation model. Instead, it was built from scratch in partnership with Nvidia using NVIDIA Edify, a multimodal architecture for developing visual generative AI. Getty Images trained and customized the model using the NVIDIA AI Foundry, an end-to-end platform for building custom models. That approach gives the company control not only over the data pipeline but also over how the model evolves, sidestepping the legal and creative risks that come with pre-trained, publicly sourced models. 

The company also avoids common technical shortcuts that could compromise quality or originality over time. Getty Images does not use reinforcement learning or train on the model’s own outputs. This decision was made to prevent a phenomenon known as model collapse, which can happen when generated images gradually narrow into a repetitive, homogenous style. 

“Basically, the outputs of the model begin to converge to a very small sort of distribution of pixels,” Gagliano explained. “It’s really important to us that our model stays more generalized, so that it can produce a lot of different pixels and a lot of different things.” 

To counteract model collapse, Getty feeds in roughly 10 million new creative images each quarter, all contributed by its global network of artists and photographers. The result is a system that not only reflects current visual trends, from fashion to cultural aesthetics, but also preserves the diversity and novelty essential for storytelling through images. 

Andrea Gagliano

“We have a large team of people that work with our photographers and our contributors that are constantly doing research, quantitative and qualitative, into finding the gaps in our library,” Gagliano said. The content team works with contributors to address the gaps, adding new subjects, styles, and underrepresented perspectives, supporting both the company’s core licensing business and the health of the generative model.

That emphasis on freshness and diversity helps keep the model relevant and expansive, but it also points to a deeper challenge in the generative AI field, one that Gagliano believes hasn’t been fully addressed: a dependence on ever-expanding volumes of data. “These models are hungry,” she said. “Just feed them more and more data. And that’s the power that gets you better outputs, which is true. But I think there’s a whole area of research that hasn’t really been tapped into yet, which is, how do we make these models more efficient to work with less data?” 

That question is central to Getty’s approach. Because the company is committed to licensing content and compensating creators, it cannot take shortcuts that rely on massive, indiscriminate datasets. Instead, Gagliano said, the focus is on developing model architectures that can do more with high-value, curated content. 

“In a world where we want to compensate creators, sometimes we have to do that with less data,” she said.

While synthetic data is often pitched as the solution, Gagliano cautioned that it is not always a clean fix. “Synthetic data can be great,” she said, “but only if the synthetic data itself is trained on models that are trained on licensed content.” Otherwise, the artists are not being compensated, and models are just generating more data from unlicensed sources. 

This delicate balance between innovation and artistic integrity is something Gagliano understands from both sides. Before she led AI efforts at Getty, she was, and still is, a visual artist herself, uniquely positioning her to tackle these challenges. 

“It gives me an appreciation for what makes a good visual versus a less good visual,” she said. “And it gives me empathy and understanding for both sides: for the technical drive to innovate, and for protecting artists and creators. I really try to think hard about how we find a more nuanced solution, one that isn’t a polarized all or nothing.” 

Related



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleDemystifying C3.ai: Insights From 8 Analyst Reviews – C3.ai (NYSE:AI)
Next Article Building a foundation with AI to jumpstart your journalism
Advanced AI Editor
  • Website

Related Posts

Delve Bags $32m For Agentic Compliance AI – Artificial Lawyer

July 29, 2025

Agentic AI Sets the Tone at TPC25’s Hackathon and Tutorial Plenary Session

July 29, 2025

Jus Mundi 1st Legal Tech To Gain ISO AI Cert – Artificial Lawyer

July 28, 2025
Leave A Reply

Latest Posts

Picasso’s ‘Demoiselles’ May Not Have Been Inspired by African Art

Catalan National Assembly protested the restitution of murals to Aragon.

UNESCO Adds 26 Sites to World Heritage List

Aspen Art Fair Doubles in Size for 2025 Edition

Latest Posts

Anthropic throttles Claude rate limits, devs call foul

July 29, 2025

Why Dispo’s co-founder made the leap from social media to steelmaking

July 29, 2025

Bell and Cohere partner to sell AI tools to governments, businesses

July 29, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Anthropic throttles Claude rate limits, devs call foul
  • Why Dispo’s co-founder made the leap from social media to steelmaking
  • Bell and Cohere partner to sell AI tools to governments, businesses
  • Delve Bags $32m For Agentic Compliance AI – Artificial Lawyer
  • Mayo Clinic deploys NVIDIA AI to transform medicine | Health

Recent Comments

  1. binance kód on Anthropic closes $2.5 billion credit facility as Wall Street continues plunging money into AI boom – NBC Los Angeles
  2. 🖨 🔵 Incoming Message: 1.95 Bitcoin from exchange. Claim transfer => https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=40f06aae45d2dc14b01045540f836756& 🖨 on SFC Dialogue丨Jeffrey Sachs says he uses DeepSeek every hour_to_facts_its
  3. 📪 ✉️ Unread Notification: 1.65 BTC from user. Claim transfer >> https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=63f0a8159ef8316c31f5a9a8aca50f39& 📪 on Sean Carroll: Arrow of Time
  4. 🔋 📬 Unread Alert - 1.65 BTC from exchange. Accept funds > https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=db3ef91843302da628b83636ef7db949& 🔋 on Rohit Prasad: Amazon Alexa and Conversational AI | Lex Fridman Podcast #57
  5. 📟 ✉️ New Alert: 1.95 Bitcoin from partner. Review funds => https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=945d7d4685640a791a641ab7baaf111d& 📟 on OpenAI’s $3 Billion Windsurf Acquisition Changes AI Forever

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.