Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

DeepSeek-R1: Hype cools as India seeks practical GenAI solutions

Security experts warn against selling Nvidia AI chips to China

OpenAI’s Sam Altman sees AI bubble forming as industry spending surges

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Alibaba Cloud (Qwen)

Qwen-Image Edit gives Photoshop a run for its money with AI-powered text-to-image edits that work in seconds

By Advanced AI EditorAugust 20, 2025No Comments8 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now

Adobe Photoshop is among the most recognizable pieces of software ever created, used by more than 90% of the world’s creative professionals, according to Photutorial.

So the fact that a new open source AI model — Qwen-Image Edit, released yesterday by Chinese e-commerce giant Alibaba’s Qwen Team of AI researchers — is now able to accomplish a huge number of Photoshop-like editing jobs with text inputs alone, is a notable achievement.

Built on the 20-billion-parameter Qwen-Image foundation model released earlier this month, Qwen-Image-Edit extends the system’s unique strengths in text rendering to cover a wide spectrum of editing tasks, from subtle appearance changes to broader semantic transformations.

Simply upload a starting image — I tried one of myself from VentureBeat’s last annual Transform conference in San Francisco — and then type instructions of what you want to change, and Qwen-Image-Edit will return a new image with those edits applied.

AI Scaling Hits Its Limits

Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:

Turning energy into a strategic advantage

Architecting efficient inference for real throughput gains

Unlocking competitive ROI with sustainable AI systems

Secure your spot to stay ahead: https://bit.ly/4mwGngO

Input image example:

Photo credit: Michael O’Donnell Photography

Output image example with prompt: “Make the man wearing a tuxedo.”

The model is available now across several platforms, including Qwen Chat, Hugging Face, ModelScope, GitHub, and through the Alibaba Cloud application programming interface (API), the latter which allows any third-party developer or enterprise to integrate this new model into their own applications and workflows.

I created my examples above on Qwen Chat, the Qwen Team’s rival to OpenAI’s ChatGPT, however, it should be noted for any aspiring users that generations are limited to about 8 free jobs (input/outputs) per 12 hour period before it resets. Paying users can have access to more jobs.

With support for both English and Chinese inputs, and a dual focus on both semantic meaning and visual fidelity, Qwen-Image-Edit aims to lower barriers to professional-grade visual content creation.

And given that the model is available as an open source code under an Apache 2.0 license, it’s safe for enterprises to take, download and set up for free on their own hardware or virtual clouds/machines, potentially resulting in a huge cost savings from proprietary software like Photoshop.

As Junyang Lin, a Qwen Team researcher wrote on X, “it can remove a strand of hair, very delicate image modification.”

The team’s announcement echoes this sentiment, presenting Qwen-Image-Edit not as an entirely new system, but as a natural extension of Qwen-Image that applies its unique text rendering and dual-encoding approach directly to editing tasks.

Dual encodings allow for edits preserving style and content of original image

Qwen-Image-Edit builds on the foundation established by Qwen-Image, which was introduced earlier this year as a large-scale model specializing in both image generation and text rendering.

Qwen-Image’s technical report highlighted its ability to handle complex tasks like paragraph-level text rendering, Chinese and English characters, and multi-line layouts with accuracy.

The report also emphasized a dual-encoding mechanism, feeding images simultaneously into Qwen2.5-VL for semantic control and a variational autoencoder (VAE) for reconstructive detail. This approach allows edits that remain faithful to both the intent of the prompt and the look of the original image.

Those same architectural choices underpin Qwen-Image-Edit. By leveraging dual encodings, the model can adjust at two levels: semantic edits that change the meaning or structure of a scene, and appearance edits that introduce or remove elements while keeping the rest untouched.

Semantic editing includes creating new intellectual property, rotating objects 90 or 180 degrees to reveal different views, or transforming an input into another style such as Studio Ghibli-inspired art. These edits typically modify many pixels but preserve the underlying identity of objects.

Here’s an example of semantic editing from Shridhar Athinarayanan, an engineer at AI applications platform Replicate, who used a Replicate-hosted implementation or “inference” of Qwen to reskin a photo of Manhattan to look like a toy Lego set.

Appearance editing focuses on precise, local changes. In these cases, most of the image remains unchanged while specific objects are altered. Demonstrations include adding a signboard that generates a reflection in water, removing stray hair strands from a portrait, and changing the color of a single letter in a text image.

One good example of appearance editing with Qwen-Image Edit comes from AnswerAI co-founder and CEO Thomas Hill who posted a side-by-side on X showing his wife in her wedding dress below an archway and another with the same archway covered with graffiti:

Combined with Qwen’s established strength in rendering Chinese and English text, the editing-focused system is positioned as a flexible tool for creators who need more than simple generative imagery.

The dual control over semantic scope and appearance fidelity means the same tool can serve very different needs, from creative IP development to production-level photo retouching.

Adding or removing text to images

Another standout capability is bilingual text editing. Qwen-Image-Edit allows users to add, remove, or modify text in both Chinese and English while preserving font, size, and style.

This expands on Qwen-Image’s reputation for strong text rendering, particularly in challenging scenarios like intricate Chinese characters.

In practice, this allows for accurate editing of posters, signs, T-shirts, or calligraphy artworks where small text details matter, as seen in another example from Replicate below.

One demonstration involved correcting errors in a piece of generated Chinese calligraphy through a step-by-step chained editing process.

Users could highlight incorrect regions, instruct the system to fix them, and then further refine details until the correct characters were rendered. This iterative approach shows how the model can be applied to high-stakes editing tasks where precision is essential.

Applications and use cases

The Qwen team has highlighted a range of potential applications:

Creative design and IP expansion, such as generating mascot-based emoji packs.

Advertising and content creation, where logos, signage, and text-heavy visuals can be customized.

Virtual avatars and art, with style transfer supporting unique character representations.

Photography and personal use, including background adjustments, clothing changes, and object removal.

Cultural preservation, demonstrated through correcting classical calligraphy works.

By bridging fine-grained editing with broader creative transformations, Qwen-Image-Edit caters to professionals who need control while remaining approachable for casual experimentation.

Benchmarking and performance

According to the Qwen team, evaluations across public benchmarks indicate that Qwen-Image-Edit delivers state-of-the-art performance in image editing.

This follows from the broader Qwen-Image technical evaluations, where the base model achieved leading results in both general image generation and text rendering tasks.

While specific editing benchmark figures were not detailed in the release, Qwen-Image itself ranked highly in independent evaluations such as AI Arena, where human raters compared outputs across models from different providers.

API pricing and availability

Through Alibaba Cloud Model Studio, developers can access Qwen-Image-Edit as an API. Pricing is set at $0.045 per image, with a free quota of 100 images valid for 180 days after activation.

The service is initially available in the Singapore region, with a rate limit of five requests per second and up to two concurrent tasks per account.

To use the API, developers must obtain a Model Studio API key and can call the model via HTTP or through the DashScope SDK in Python or Java.

Images can be submitted as URLs or in Base64 format, with supported resolutions ranging from 512 to 4,096 pixels and file sizes up to 10 MB. Output images are hosted on Alibaba Cloud Object Storage with links valid for 24 hours, requiring users to download and save results promptly.

What’s next for Qwen?

Qwen positions Image-Edit as a step toward lowering barriers for visual content creation. By making precise, style-consistent editing more accessible, the model could support applications from design studios to casual users refining personal projects.

The system also signals a broader trend in AI development: moving beyond single-purpose generation toward tools that integrate editing, correction, and refinement.

With both semantic flexibility and appearance-level precision, Qwen-Image-Edit reflects this shift, blending the generative strengths of large models with the reliability required for professional editing.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleVodafone Idea, IBM Launch AI Innovation Hub for Telecom Transformation
Next Article WPP, Stability AI Form Strategic Alliance 03/06/2025
Advanced AI Editor
  • Website

Related Posts

Alibaba AI Momentum Builds As Qwen Coder Gains Market Share – Alibaba Gr Hldgs (NYSE:BABA), Baidu (NASDAQ:BIDU)

August 19, 2025

Alibaba’s AI coding model Qwen 3 Coder soars in popularity, challenging Claude Sonnet 4

August 19, 2025

Alibaba’s Qwen3 Coder Appears To Take Market Share From Anthropic & Google As Per OpenRouter Data

August 17, 2025

Comments are closed.

Latest Posts

Barbara Hepworth Sculpture Will Remain in UK After £3.8 M. Raised

After 12-Year Hiatus, Egypt’s Alexandria Biennale Will Return

Ai Weiwei Visits Ukraine’s Front Line Ahead of Kyiv Installation

Maren Hassinger to Receive Her Largest Retrospective to Date Next Year

Latest Posts

DeepSeek-R1: Hype cools as India seeks practical GenAI solutions

August 20, 2025

Security experts warn against selling Nvidia AI chips to China

August 20, 2025

OpenAI’s Sam Altman sees AI bubble forming as industry spending surges

August 20, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • DeepSeek-R1: Hype cools as India seeks practical GenAI solutions
  • Security experts warn against selling Nvidia AI chips to China
  • OpenAI’s Sam Altman sees AI bubble forming as industry spending surges
  • IBM Announces Registrations For Its Global Entrance Test
  • Uber uses GPT-5 to balance speed and quality

Recent Comments

  1. wifofeFoste on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  2. Jimmyjaito on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  3. BeriyEnent on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  4. Jimmyjaito on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  5. SamuelCoatt on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.