AI Upscaling Models

Real-ESRGAN and Clarity Upscaler take two different approaches to AI super-resolution. Here's what you need to know to choose the right one.

Real-ESRGAN
nightmareai/real-esrgan on Replicate

A neural-network super-resolution model and the faster of the two upscalers. It works well on photos, illustrations, anime, and screenshots, and typically finishes in about 15–30 seconds.

  • Scale: 2×, 4×, 8×
  • Processing time: ~15–30 seconds
  • Face enhance: optional GFPGAN
  • Best for: photos, screenshots, anime, illustrations
  • Output: clean and natural, with few visible artifacts
  • Open source (BSD-3-Clause)

Side by Side

Full specification comparison

Specification         | Real-ESRGAN             | Clarity Upscaler
----------------------|-------------------------|---------------------------------------
Architecture          | RRDB + GAN              | Stable Diffusion 1.5 (tiled)
Scale factors         | 2×, 4×, 8×              | 2×, 4×
Processing speed      | ~15–30 s                | ~45–90 s
Output style          | Clean, natural          | Highly detailed, AI-enhanced
Face enhancement      | Yes (optional GFPGAN)   | No separate option (built in)
Adjustable parameters | Scale, face_enhance     | Scale, creativity, resemblance, prompt
Best image types      | Photos, screenshots, anime | Portraits, nature, art
Hallucination risk    | Very low                | Moderate (at high creativity)
Open source           | Yes (BSD-3-Clause)      | Yes (AGPL-3.0)
Hosting               | Replicate API           | Replicate API
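
To make the scale-factor row concrete, here is a small arithmetic sketch of how output size grows with scale. The function name and the 1280×720 sample input are illustrative choices, not part of either model's API.

```python
def upscaled_size(width: int, height: int, scale: int) -> tuple[int, int, float]:
    """Return output width, height, and megapixels for an integer scale factor."""
    if scale < 1:
        raise ValueError("scale must be a positive integer")
    out_w, out_h = width * scale, height * scale
    return out_w, out_h, out_w * out_h / 1_000_000


# Example input: a 1280x720 screenshot at each scale factor from the table.
for scale in (2, 4, 8):
    w, h, megapixels = upscaled_size(1280, 720, scale)
    print(f"{scale}x: {w}x{h} ({megapixels:.1f} MP)")
```

Pixel count grows with the square of the scale factor, so an 8× result holds 16 times as many pixels as a 2× result from the same source. For this sample, that is about 59.0 MP versus 3.7 MP.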

What is Real-ESRGAN?

Real-ESRGAN is an improved version of the original ESRGAN (Enhanced Super-Resolution Generative Adversarial Networks) model, developed by Xintao Wang and colleagues. It's trained on synthetically degraded images that mimic real-world damage (compression artifacts, blur, noise, low resolution), making it far more practical than earlier SR models that only handled simple bicubic downsampling.

The generator is a stack of Residual-in-Residual Dense Blocks (RRDB), trained against a discriminator that learns to tell real high-resolution images from generated ones. The result is usually convincingly sharp, without heavy over-sharpening and with little invented texture.
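
The wiring of an RRDB can be sketched in plain Python. This is a toy illustration under stated assumptions, not the Real-ESRGAN implementation: each convolution is replaced by a hypothetical stand-in callable that maps a list of channel values to another list, and the residual scaling of 0.2 and the five-layer dense block follow the published ESRGAN design.

```python
from typing import Callable, List, Sequence

Vector = List[float]
Layer = Callable[[Vector], Vector]  # stand-in for a convolution (assumption)


def leaky_relu(values: Vector, slope: float = 0.2) -> Vector:
    return [v if v >= 0 else slope * v for v in values]


def dense_block(x: Vector, layers: Sequence[Layer], beta: float = 0.2) -> Vector:
    """Dense block: each layer sees the input plus every earlier layer's output."""
    features = [x]
    for layer in layers[:-1]:
        stacked = [v for f in features for v in f]  # channel concatenation
        features.append(leaky_relu(layer(stacked)))
    stacked = [v for f in features for v in f]
    out = layers[-1](stacked)  # last layer must return len(x) values
    return [xi + beta * oi for xi, oi in zip(x, out)]  # inner residual


def rrdb(x: Vector, blocks: Sequence[Sequence[Layer]], beta: float = 0.2) -> Vector:
    """Residual-in-residual: chain dense blocks, then add a scaled outer skip."""
    out = x
    for layers in blocks:
        out = dense_block(out, layers, beta)
    return [xi + beta * oi for xi, oi in zip(x, out)]  # outer residual


def averaging_layer(width: int) -> Layer:
    """Hypothetical stand-in layer: emits the input mean on `width` channels."""
    return lambda values: [sum(values) / len(values)] * width


# Four growth layers (3 channels each) plus one layer back to 2 channels.
block = [averaging_layer(3)] * 4 + [averaging_layer(2)]
print(rrdb([1.0, -2.0], [block, block, block]))
```

The two levels of skip connection are the point of the sketch: each dense block adds a scaled correction to its own input, and the whole stack adds a scaled correction to the block's original input.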

When to choose Real-ESRGAN: you need speed, you need 8× scale, or you're upscaling screenshots, illustrations, logos, anime, or text-heavy images where hallucinated detail would look wrong.
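
A minimal sketch of how a Real-ESRGAN request might be assembled for Replicate. The input keys `image`, `scale`, and `face_enhance` are assumed from the parameter names listed on this page, and `version_ref` is a placeholder you would copy from the model page yourself (for example `nightmareai/real-esrgan:` followed by a version ID); check the model's current schema before relying on any of this.

```python
ALLOWED_SCALES = (2, 4, 8)  # scale factors listed in this comparison


def real_esrgan_input(image_url: str, scale: int = 4, face_enhance: bool = False) -> dict:
    """Build the input payload; key names are assumptions, not a confirmed schema."""
    if scale not in ALLOWED_SCALES:
        raise ValueError(f"scale must be one of {ALLOWED_SCALES}")
    return {"image": image_url, "scale": scale, "face_enhance": face_enhance}


def run_real_esrgan(version_ref: str, image_url: str, scale: int = 4,
                    face_enhance: bool = False):
    """Send the request with the third-party `replicate` client.

    Requires `pip install replicate` and a REPLICATE_API_TOKEN environment
    variable. `version_ref` is supplied by the caller.
    """
    import replicate  # imported lazily so the payload helper works without it

    return replicate.run(version_ref, input=real_esrgan_input(image_url, scale, face_enhance))


print(real_esrgan_input("https://example.com/screenshot.png", scale=8))
```

Keeping payload construction separate from the network call makes it easy to validate the scale factor locally before spending an API request.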

What is Clarity Upscaler?

Clarity Upscaler (by philz1337x) is a tile-based upscaler built on Stable Diffusion 1.5 with the Juggernaut Reborn checkpoint plus custom LoRAs for detail enhancement. Unlike GAN-based upscalers, which produce their output in a single forward pass, it runs an iterative diffusion process that generates new pixel content for each tile based on what the model "thinks" should be there.

The creativity parameter controls how aggressively the AI adds new detail (0.3 = subtle, 0.9 = aggressive regeneration). The resemblance parameter controls how faithfully the output matches the original composition. At resemblance 1.6, the output is very close to the original but sharper. At 0.3, it's a creative reinterpretation.
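
The two ends of that range can be captured as presets. This is a sketch only: the preset names and the way the quoted values are paired are illustrative assumptions, and the payload keys (`image`, `prompt`, `scale_factor`) are assumed rather than taken from a confirmed schema.

```python
# Hypothetical presets built from the values quoted above.
PRESETS = {
    "faithful": {"creativity": 0.3, "resemblance": 1.6},  # subtle, close to the source
    "creative": {"creativity": 0.9, "resemblance": 0.3},  # aggressive reinterpretation
}


def clarity_input(image_url: str, preset: str = "faithful",
                  prompt: str = "", scale: int = 2) -> dict:
    """Build a Clarity Upscaler payload; key names are assumptions."""
    if preset not in PRESETS:
        raise ValueError(f"unknown preset {preset!r}; choose from {sorted(PRESETS)}")
    if scale not in (2, 4):  # scale factors listed for Clarity Upscaler on this page
        raise ValueError("scale must be 2 or 4")
    return {"image": image_url, "prompt": prompt, "scale_factor": scale, **PRESETS[preset]}


print(clarity_input("https://example.com/portrait.jpg", preset="faithful"))
```

Starting from the faithful preset and raising creativity in small steps is a cautious way to find the point where added detail stops looking like the original subject.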

When to choose Clarity Upscaler: maximum output quality is the goal, especially for portraits, landscape photography, studio product shots, or any image where rich texture makes a visible difference.
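
The selection guidance on this page can be condensed into a small helper. The function, the category labels, and the fallback to Real-ESRGAN for unlisted image types are all illustrative assumptions, not rules from either project.

```python
REAL_ESRGAN_TYPES = {"screenshot", "illustration", "logo", "anime", "text"}
CLARITY_TYPES = {"portrait", "landscape", "product", "nature", "art"}


def choose_upscaler(image_type: str, scale: int = 4, speed_matters: bool = False) -> str:
    """Pick a model name from the rules of thumb described on this page."""
    if scale == 8 or speed_matters:
        return "real-esrgan"  # only Real-ESRGAN lists 8x, and it is the faster model
    kind = image_type.lower()
    if kind in REAL_ESRGAN_TYPES:
        return "real-esrgan"  # hallucinated detail would look wrong here
    if kind in CLARITY_TYPES:
        return "clarity-upscaler"  # added texture is a visible benefit
    return "real-esrgan"  # assumed default: the lower hallucination risk


print(choose_upscaler("portrait"))
print(choose_upscaler("portrait", scale=8))
print(choose_upscaler("screenshot"))
```

Speed and scale are checked first because they are hard constraints, while image type is a quality preference.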

Try both models for free

10 free upscales per day · No signup · Compare results yourself
