Series

29 articles

Part 1: From GANs to Generative AI — Why and How the Migration Happened Series
AIGenerative AIGeminiVirtual Try-OnComputer VisionGoogle Cloud

Part 1: From GANs to Generative AI — Why and How the Migration Happened

Google proved that generative AI can do virtual try-on. Could the same approach replace our entire GAN pipeline? This is the story of migrating from PASTA-GAN++ to Gemini and Vertex AI — simplifying a multi-stage GPU pipeline into a single API call.

Read more →
Part 2: Nano Banana Virtual Try-On — 16 Test Cases and What They Revealed Series
AIGenerative AIGeminiVirtual Try-OnComputer Vision

Part 2: Nano Banana Virtual Try-On — 16 Test Cases and What They Revealed

Systematic testing of Gemini's image generation for virtual try-on across three phases: noisy inputs, clean images, and high-resolution action poses. The key finding: resolution matters more than any preprocessing pipeline.

Read more →
Part 3: The 3-Engine Showdown — PASTA-GAN++ vs Nano Banana vs Vertex AI VTO Series
AIGenerative AIGeminiVertex AIVirtual Try-OnComputer VisionGoogle Cloud

Part 3: The 3-Engine Showdown — PASTA-GAN++ vs Nano Banana vs Vertex AI VTO

A head-to-head comparison of three generations of virtual try-on technology across 12 test cases. The results reveal not just incremental improvement, but a generational leap — especially in body diversity, where GANs fundamentally failed.

Read more →
Part 3: Building a Production Browser Game with Next.js and Phaser 3 Series
Next.jsPhaser 3TypeScriptGame DevEducation

Part 3: Building a Production Browser Game with Next.js and Phaser 3

How the game-core architecture separates framework-agnostic logic from rendering, why Phaser 3 handles the board while Next.js handles everything else, and the engineering decisions behind authentication, PWA support, and i18n.

Read more →
Part 2: Recreating Economic Cycles with Real Japanese News Series
Next.jsPhaser 3TypeScriptGame DevEducation

Part 2: Recreating Economic Cycles with Real Japanese News

How I built a Python data pipeline to transform 70 years of Japanese economic history into game-ready event data — scraping Wikipedia, modeling macro indicators, and simulating stock prices that feel authentic.

Read more →
Part 1: Teaching Kids to Invest — Why We Chose a Game Series
Next.jsPhaser 3TypeScriptGame DevEducation

Part 1: Teaching Kids to Invest — Why We Chose a Game

Japan mandated financial education in 2022, but most adults barely understand investing themselves. This is the story of how a decade-long quest to teach children about markets through a board game evolved from paper prototypes to a production browser game.

Read more →
Part 1: From Photo Booths to Virtual Try-On — The 20-Year Quest Series
AIMachine LearningPythonGANComputer Vision

Part 1: From Photo Booths to Virtual Try-On — The 20-Year Quest

The origin story of META FIT: how a decades-old vision of seeing yourself in clothes before buying evolved from hardware kiosks to GAN-powered virtual try-on, plus a comprehensive survey of 15+ VTON research models.

Read more →
Part 3: Inside PF-AFN — The Try-On Engine in Code Series
AIMachine LearningPythonGANComputer Vision

Part 3: Inside PF-AFN — The Try-On Engine in Code

A code-level walkthrough of the Parser-Free Appearance Flow Network: Feature Pyramid encoding, CUDA-accelerated correlation kernels, optical flow warping, and the ResUnet generator that composites garments onto people.

Read more →
Part 2: Understanding GANs — The Engine Behind Virtual Try-On Series
AIMachine LearningPythonGANComputer Vision

Part 2: Understanding GANs — The Engine Behind Virtual Try-On

A deep dive into Generative Adversarial Networks: how the generator-discriminator dynamic works, why GANs dominated image generation before diffusion models, and how they power virtual try-on systems.

Read more →