Jingchao's Website

24 Mar 2026

Introduction

23 Mar 2026

Introduction

22 Mar 2026

TL;DR: I captured per-port InfiniBand traffic at 10-millisecond resolution during FSDP fine-tuning of Qwen-7B (dense) ...

18 Mar 2026

If you deploy GPU VMs on Azure using VMSS Uniform mode, three settings can make the difference between a successful a...

17 Mar 2026

We implement a ring allreduce algorithm from scratch in Python, run it on 16 NVIDIA H100 GPUs across 2 nodes with Inf...

14 Mar 2026

Hands-on experiments on an NVIDIA H100 GPU reveal why the KV cache, not the model weights, dominates GPU memory during infe...

Jingchao Zhang, PhD