Yuang Shi

Last updated: .

Yuang Shi (施宇昂)

I'm a third year Ph.D. Candidate at National University of Singapore advised by Prof. Wei Tsang Ooi, where I work on Networking and Multimedia Systems, specifically on 3D Media Streaming. I received my M.Comp Degree in 2022 from National University of Singapore and B. Eng. Degree in 2021 from Sichuan University in Sichuan.

I am honored to receive ACM MMSys Best Paper Award (2025, Press) and ACM SIGCOMM EMS Best Paper Award (2024, Press).

I am honored to recieved The France Eiffel Excellence Scholarship (Acceptance Rate < 5%), which will support my academic visit at INP Toulouse, France, from 2025/09 to 2026/09.

I am a part-time cat person and a full-time dog person.

Email / Google Scholar / Github

News

[2025/05] - Recieved The France Eiffel Excellence Scholarship (Acceptance Rate < 5%).
[2025/05] - Invited talk at New Jersey Institute of Technology, NJ. Invited by Prof. Jacob Chakareski.
[2025/04] - Our paper "LTS: A DASH Streaming System for Dynamic Multi-Layer 3D Gaussian Splatting Scenes" received the Best Paper Award at MMSys 2025! See the Press from NUS.
[2024/11] - Our paper "LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming" is accepted to 3DV 2025.
[2024/08] - Our paper received the Best Paper Award at the SIGCOMM 2024 Workshop on Emerging Multimedia Systems (EMS 2024)! See the Press from NUS.

Research

As a PhD student, I mainly focus on 3D Media Streaming, including compression, networking, and quality evaluation.

Some works are highlighted. Authors marked with ^* are interns or students whom I mentored when the work was carried out.

	LTS: A DASH Streaming System for Dynamic Multi-Layer 3D Gaussian Splatting Scenes Best Paper Award. Yuan-Chun Sun, Yuang Shi, Cheng-Tse Lee, Mufeng Zhu, Wei Tsang Ooi, Yao Liu, Chun-Ying Huang, Cheng-Hsin Hsu The 16th ACM Multimedia Systems Conference (MMSys'25). ACM, 2025. Paper / Code / Press from NUS We develop, implement, and evaluate the very first DASH-based dynamic 3DGS streaming system, named Tile, Segment, and Layer Adaptive streaming (LTS). LTS is built on our previous work LapisGS, and achieves superior performance in both live streaming and on-demand streaming.
	LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming Yuang Shi, Simone Gasparini, Géraldine Morin, Wei Tsang Ooi, The 12th International Conference on 3D Vision (3DV'25). 2025. Paper / Code / Project Page We introduce LapisGS, a layered progressive 3D Gaussian Splatting (3DGS), which offers a progressive representation supporting a continuous rendering quality adapted for bandwidth-aware streaming.
	Sketch and Patch: Efficient 3D Gaussian Representation for Man-Made Scenes Yuang Shi, Géraldine Morin, Simone Gasparini, Chenggang Yang^, Wei Tsang Ooi MMSys Workshop. The 17th International Workshop on Immersive Mixed and Virtual Environment Systems (MMVE'25).* ACM, 2025. Rejected by MMSys'25 with 3/3 accepts from reviewrs, but I still love MMSys. Slides@MMVE / Paper / ArXiv (Full Version) Inspired by traditional painting techniques, we propose a novel hybrid representation for 3DGS that categorizes Gaussians into (i) Sketch Gaussians, which define scene boundaries, and (ii) Patch Gaussians, which cover smooth regions. This hybrid categorization is conducive to efficient compression, and progressive and scalable streaming.
	GSVC: Efficient Video Representation and Compression Through 2D Gaussian Splatting Longan Wang^, Yuang Shi, Wei Tsang Ooi MMSys Workshop. The 35th edition of the Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV'25).* ACM, 2025. Paper / ArXiv / Project Page We propose GSVC, an approach to learning a set of 2D Gaussian splats that can effectively represent and compress video frames. Experiment results show that GSVC achieves good rate-distortion trade-offs, comparable to state-of-the-art video codecs such as AV1 and HEVC, and a rendering speed of 1500 fps for a 1920x1080 video.
	Multi-frame Bitrate Allocation of Dynamic 3D Gaussian Splatting Streaming Over Dynamic Networks Best Paper Award. Yuan-Chun Sun, Yuang Shi, Wei Tsang Ooi, Chun-Ying Huang, Cheng-Hsin Hsu SIGCOMM Workshop. The 2024 SIGCOMM Workshop on Emerging Multimedia Systems (EMS'24). ACM, 2024. Paper / Press from NUS We proposed two algorithms, MGA and MGAA, to allocate bitrate across multiple 3DGS scenes for streaming over dynamic networks.
	QV4: QoE-based Viewpoint-Aware V-PCC-encoded Volumetric Video Streaming Yuang Shi, Bennett Clement^, Wei Tsang Ooi The 15th ACM Multimedia Systems Conference (MMSys'24).* ACM, 2024. Paper / Code We present QV4, a Quality-of-Experience (QoE) based streaming system for viewpoint-aware V-PCC-encoded volumetric video. QV4 achieves average compression ratio of 610 while keeping satisfactory quality, which is around 80x better than other SOTA streaming systems.
	Volumetric Video Compression Through Neural-based Representation Yuang Shi, Ruoyu Zhao^, Simone Gasparini, Geraldine Morin, Wei Tsang Ooi MMSys Workshop. The 16th International Workshop on Immersive Mixed and Virtual Environment Systems (MMVE'24).* ACM, 2024. Paper / Code We represent 3D dynamic content as a sequence of NeRFs, converting the explicit representation to neural representation. We then compress the neural representation based on the insight of significant similarity between successive NeRFs.
	Quality Assessment and Modeling for MPEG V-PCC Volumetric Video Yuang Shi, Sam Cox, Wei Tsang Ooi MMSys Workshop. The 16th International Workshop on Immersive Mixed and Virtual Environment Systems (MMVE'24).ACM, 2024. Paper / Dataset We propose a QoE model to predict the subjective quality with respect to the compression level of geometry and texture, quantifying the impact of geometry and texture compression on perceptual quality.
	Perceptual Impact of Facial Quality in MPEG V-PCC-encoded Volumetric Videos Yuang Shi, Wei Tsang Ooi MMSys Workshop. The 16th International Workshop on Immersive Mixed and Virtual Environment Systems (MMVE'24). ACM, 2024. Paper / Dataset We investigated the influence of rendering face quality of the avatars on users' viewing experience in MPEG V-PCC-encoded volumetric videos, and revealed the significant role of facial quality in influencing users' overall perceptual quality in volumetric videos.
	Enabling Low Bit-Rate MPEG V-PCC-encoded Volumetric Video Streaming with 3D Sub-sampling Yuang Shi, Pranav Venkatram^, Yifan Ding, Wei Tsang Ooi The 14th ACM Multimedia Systems Conference (MMSys'23).* ACM, 2023. Paper / We show that it is possible to improve the quality of V-PCC encoded point clouds at low bit-rate by exploiting redundant information among the points in the 3D domain.

When I was a Master student, my dissertation is about human activity recognition in-the-wild.

Shape-Based Conditional Neural Field for Wrist-Worn Change-Point Detection
Yuang Shi, Varsha Suresh, Wei Tsang Ooi
The 2022 IEEE International Conference on Pervasive Computing and Communications Workshops. IEEE, 2022.
Paper / Code

ShapeCNF is a simple, fast, and accurate change-point detection method which uses shape-based features to model the patterns and a conditional neural field to model the temporal correlations among the time regions.

During my undergraduate, I spent most of my time on medical image analysis.

Uncertainty-weighted and Relation-driven Consistency Training for Semi-supervised Head-and-Neck Tumor Segmentation
Yuang Shi, et al.
Knowledge-based Systems (2023): 110598.
Paper /

We propose a consistency training framework for semi-supervised NPC segmentation, which includes an Uncertainty-weighted Prediction Consistency Training (UPCT) strategy and a Relation-driven Consistency Training (RCT) strategy.

ASMFS: Adaptive-Similarity-based Multi-modality Feature Selection for Classification of Alzheimer's Disease
Yuang Shi, et al.
Pattern Recognition 126 (2022): 108566.
Paper / Code

ASMFS is a novel multi-modal feature selection method for classification of Alzheimer's Disease, which performs adaptive similarity learning and feature selection simultaneously.

Teaching

[2025/01-2025/05] - Graduate Tutor & Teaching Assistant, CS2106 Introduction to Operating Systems.
- Over 600 students enrolled.
- Conducted weekly tutorials.
[2024/08-2024/12] - Graduate Tutor & Teaching Assistant, CS3244 Machine Learning.
- Over 100 students enrolled.
- Conducted weekly tutorials; Graded assignments; Mentored course projects; Made Midterm and Final Exam.
[2024/01-2024/05] - Graduate Tutor & Teaching Assistant, CS3244 Machine Learning.
- Over 100 students enrolled.
- Conducted weekly tutorials; Graded assignments; Mentored course projects; Made Midterm and Final Exam.
[2023/01-2023/05] - Graduate Tutor & Teaching Assistant, CS3244 Machine Learning.
- Over 100 students enrolled.
- Conducted weekly tutorials; Graded assignments; Mentored course projects; Made Midterm and Final Exam.
[2022/08-2022/12] - Graduate Tutor & Teaching Assistant, CS3244 Machine Learning.
- Over 200 students enrolled.
- Conducted weekly tutorials; Graded assignments; Mentored course projects; Made Midterm and Final Exam.

Mentored Students (Selected)

I really appreciate the opportunity to work with the following talent students (I only list the selected ones who I have worked with for a long time, and I have made non-negligible contributions to their research projects 😃.):

Jia Yi Lee --> Startup.
- FYP 2025. Efficient Video Representation with 3D Gaussian Splatting.
Longan Wang --> PhD @ NUS.
- Undergraduate NGNE 2025. Gaussian-based Video Compression.
- Co-author of an MMSys'25 paper.
Chenggang Yang --> Tencent.
- Master Dissertation 2024. 3D Line Segment Representation on 3D Gaussian Splatting.
- Co-author of an MMSys'25 paper.
Sherwin Poh --> Apple.
- FYP 2024. Real-time Point Cloud Super-Resolution for Volumetric Video Streaming.
- Contributor of VVTk, an open-sourced toolkit for volumetric video streaming.
Chi Sern Ng --> Tiktok.
- FYP 2024. Scalable Approaches to Volumetric Video Playback.
- Contributor of VVTk, an open-sourced toolkit for volumetric video streaming.
Joseph Wang Guanlin --> RA @ NUS.
- FYP 2024, UROP 2023. Improving the Performance of V-PCC through Segmentation With Meta's SAM.
- Contributor of VVTk, an open-sourced toolkit for volumetric video streaming.
Bennett Clement --> HoYoVerse.
- FYP 2023. Viewpoint-aware VPCC-Encoded Volumetric Video Streaming.
- Co-author of an MMSys'24 paper.
Ruoyu Zhao.
- Undergraduate Research Intern 2023. Neural-based Representation for Volumetric Video Compression.
- Co-author of an MMSys'24 paper.
Shixin Ji --> PhD @ Brown University.
- Undergraduate NGNE 2023. Human Activity Recognition In-the-Wild Using Wearable Devices.

*FYP (Undergraduate Final Year Project), UROP (Undergraduate Research Opportunities Programme), NGNE (Non-Graduating Non-Exchange Programme).

Academic Activity amd Volunteer

[2025/05] - Invited talk at New Jersey Institute of Technology, NJ. Invited by Prof. Jacob Chakareski.
[2024/05] - Invited talk at Sichuan University, Sichuan. Invited by Prof. Yan Wang.
[2024/04] - Invited talk at Université de Toulouse, Toulouse INP-ENSEEIHT, IRIT, Toulouse. Invited by Prof. Geraldine Morin and Prof. Simone Gasparini.
[2023/12] - Invited talk at National Tsing Hua University, HsinChu. Invited by Prof. Cheng-Hsin Hsu.
[2022/11] - Student Volunteer. The 21st IEEE International Symposium on Mixed and Augmented Reality, 2022, Singapore.

Peer Reviewer

IEEE Transactions on Visualization and Computer Graphics (TVCG).
IEEE Multimedia.
Medical Image Analysis (MIA).
IEEE Transactions on Cognitive and Developmental Systems (TCDS).
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM).
ACM Multimedia (ACM MM) 2024.

This template comes from Jon Barron's public academic website. ❤️