PaMO: Parallel Mesh Optimization for Intersection-Free Low-Poly Modeling on the GPU

Abstract

Reducing the triangle count in complex 3D models is a basic geometry preprocessing step in graphics pipelines such as efficient rendering and interactive editing. However, most existing mesh simplification methods exhibit a few issues. Firstly, they often lead to self-intersections during decimation, a major issue for applications such as 3D printing and soft-body simulation. Second, to perform simplification on a mesh in the wild, one would first need to perform re-meshing, which often suffers from surface shifts and losses of sharp features. Finally, existing re-meshing and simplification methods can take minutes when processing large-scale meshes, limiting their applications in practice. To address the challenges, we introduce a novel GPU-based mesh optimization approach containing three key components: (1) a parallel re-meshing algorithm to turn meshes in the wild into watertight, manifold, and intersection-free ones, and reduce the prevalence of poorly shaped triangles; (2) a robust parallel simplification algorithm with intersection-free guarantees; (3) an optimization-based safe projection algorithm to realign the simplified mesh with the input, eliminating the surface shift introduced by re-meshing and recovering the original sharp features. The algorithm demonstrates remarkable efficiency, simplifying a 2-million-face mesh to 20k triangles in 3 seconds on RTX4090. We evaluated the approach on the Thingi10K dataset and showcased its exceptional performance in geometry preservation and speed.

Method

Pipeline overview. Our method performs remeshing, mesh simplification, and safe projection sequentially.
All stages are parallelized on the GPU.

Parallel Remeshing

Our first stage converts arbitrary inputs into watertight, manifold, and intersection-free meshes for downstream processing. We begin by computing a narrow-band unsigned distance field (UDF) around the input mesh using a voxel grid hierarchy. This step removes unnecessary voxel–triangle pairs and enables efficient GPU computation.

Narrow-band UDF on a voxel hierarchy. — Efficient narrow-band UDF computation with a voxel grid hierarchy.

Next, an ε-band conversion safely transforms the UDF into an SDF for iso-surface extraction by defining SDF(x) = UDF(x) − ε, where ε is the band width. We then extract the mesh from the SDF using a DualMC-based algorithm. In this step, self-intersections from quad division are avoided, and a smoothing function is applied to reduce numerical errors.
We also provide a standalone CUDA implementation for this stage.

DualMC improvements — Our method produces meshes with significantly fewer skinny triangles compared to other iso-surface extraction algorithms.

Parallel Mesh Simplification

This stage reduces the number of triangles efficiently while preserving manifoldness and preventing self-intersections. We propose a parallel simplification method based on an iterative edge-collapse algorithm. The edge cost combines QEM with edge-length and skinny-triangle penalties, balancing output quality with the need to avoid poorly shaped elements. To parallelize the process, the mesh is divided into independent regions. In each iteration, all regions perform edge collapses simultaneously on the GPU, since a single collapse only affects its local region.

Independent collapse regions — The yellow and pink areas mark two independent regions (R1 and R2) associated with E1 and E2, respectively.

After each iteration, a parallel self-intersection detector checks for intersections. Any invalid collapses are reverted, ensuring that the mesh remains manifold and intersection-free throughout the process.

Parallel Safe Projection

In the final stage, we safely project the simplified mesh back toward the original input, strictly avoiding self-intersections while recovering sharp features and preserving mesh quality. We optimize an energy function that balances fidelity to the input surface (Chamfer/Hausdorff distance) with elastic and bending terms that preserve triangle quality and shape. Inspired by IPC, we employ a barrier-based, collision-aware solver with continuous collision detection to guarantee intersection-free projection. This process restores sharp features, reduces surface error, and maintains high mesh quality.

Sharp feature restoration — Sharp features are recovered through projection while avoiding intersections.

Result

Our method produces high-quality output meshes while achieving fast runtime, as illustrated in the teaser.

Comparison with Baselines

Our method guarantees intersection-free output meshes, whereas baseline models cannot avoid producing self-intersecting triangles (marked in red).

The original shape is preserved better in our output mesh, with fewer spike artifacts.

BibTeX

 @inproceedings{oh2025pamo,
  title={PaMO: Parallel Mesh Optimization for Intersection-Free Low-Poly Modeling on the GPU},
  author={Oh, Seonghun and Yuan, Xiaodi and Wei, Xinyue and Shi, Ruoxi and Xiang, Fanbo and Liu, Minghua and Su, Hao},
  booktitle={Computer Graphics Forum},
  pages={e70267},
  year={2025},
  organization={Wiley Online Library}
}