CV

Zhantong Qiu

studyztp@gmail.com
Ithaca, New York, US

Summary

Computer architecture researcher focusing on computer system performance evaluation, simulation methodology, and hardware-software co-design for emerging applications. I build novel frameworks for fast, accurate, and portable computer system performance evaluation using sampled simulation.

Education

  • Computer Science
    2026-06-01
    University of California, Davis
  • Electrical and Computer Engineering
    2026-06-01
    Cornell University
  • Computer Science and Engineering
    2023-06-01
    University of California, Davis

Work Experience

  • Graduate Researcher
    2023-06-01 -
    DArchR Lab, UC Davis. Advisor: Jason Lowe-Power. Working on Nugget (LLVM IR sampling framework), LoopPoint validation, and gem5 contributions.
  • Visiting Student
    2025-08-01 -
    Computer Systems Laboratory, Cornell University. Advisor: Christopher Batten. Working on agile robotic hardware-software co-design and STM32G4 MCU modeling in gem5.
  • Undergraduate Researcher
    2022-06-01 - 2023-06-01
    DArchR Lab, UC Davis. Advisor: Jason Lowe-Power. Implemented SimPoint and LoopPoint support in gem5.

Skills

Programming Languages

  • C/C++
  • Python
  • Bash
  • CUDA
  • Assembly
  • LaTeX

Hardware Description Languages

  • Chisel

System Evaluation Tools

  • gem5
  • QEMU

Profiling & Instrumentation

  • LLVM passes
  • DynamoRIO
  • Valgrind
  • Linux perf
  • PAPI

Linux Tools

  • cpuset
  • CRIU
  • cgroups
  • Docker

Compilers

  • GCC/G++
  • GFortran
  • Clang/LLVM

Publications

  • Nugget: Portable Program Snippets
    2026
    HPCA 2026
    Introduces an LLVM IR-based sampling framework that turns long-running workloads into portable program snippets which can be analyzed once, then reused across binaries, ISAs, and microarchitectures. Achieves up to 100x+ speedup versus functional simulation on SPEC CPU2017, NPB, and LSMS workloads.
  • Accelerating the Simulation of Parallel Workloads using Loop-Bounded Checkpoints
    2025
    ACM TACO (Under Review)
    Proposed LoopPoint, a synchronization-agnostic loop-based sampling methodology that enables fast, accurate simulation of multi-threaded workloads via loop-bounded checkpoints, achieving up to 801x speedup on SPEC CPU2017 with ~2.3% average runtime error.

Presentations

  • LoopPoint Tools: Sampled Simulation of Complex Multi-threaded Workloads using Sniper and gem5
    2023
    HPCA 2023
    Montreal, Canada
    Tutorial on LoopPoint sampling methodology
  • gem5 Workshop
    2023
    ISCA 2023
    Orlando, FL, USA
    Workshop on gem5 simulator
  • gem5 Bootcamp 2024
    2024
    gem5 Bootcamp
    Virtual
    Bootcamp on gem5 simulation framework

Teaching

  • ECS 154B: Computer Architecture
    2024
    University of California, Davis
    Role: Teaching Assistant
    Led weekly discussion sections and office hours; created assignment material on Chisel-based CPU model (DINO CPU).

Portfolio

  • Computer Systems Seminar at UC Davis
    2024
    Founder
    Organized weekly speaker series with 12+ talks from academia and industry (ongoing).
  • gem5 Repository Contributions
    2022
    Open source
    Author of 50+ commits to the gem5 simulator, including full-system sampling support and related simulation features.
  • CXL Shared Memory Filesystem Optimization
    2024
    Course project
    Investigated application needs in a CXL shared-memory system and improved the FAMFS framework with efficient allocation/deallocation and zero-copy operations.
  • CUDA Microbenchmarks
    2024
    Course project
    Developed configurable microbenchmarks to measure shared-memory latency and memory-scaling behavior on GPUs.

Languages

  • English
    Native speaker
  • Cantonese
    Native speaker
  • Mandarin
    Professional working proficiency

Interests

  • Computer Architecture
    Performance Evaluation, Simulation, Hardware-Software Co-design
  • Robotic Applications
    Tiny Robots, HPC Workloads