MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based DiTs
Weiyu Li ⋅ Antoine Toisoul ⋅ Tom Monnier ⋅ Roman Shapovalov ⋅ Rakesh Ranjan ⋅ Ping Tan ⋅ Andrea Vedaldi
Abstract
We present MeshFlow, a new method for compressing and generating artist-like 3D meshes. Current mesh generators often adopt Auto-Regressive (AR) next-token prediction, a natural choice given the discrete nature of mesh connectivity, which, however, scales poorly due to the inference cost being quadratic in mesh size. AR methods also require discretizing the vertex coordinates, which introduces quantization errors and can cause vertex collapse. To address these challenges, we introduce a Variational Autoencoder (VAE) that, supervised with a contrastive loss, represents both continuous vertex positions and discrete connectivity in a continuous latent space.This latent space is significantly more compact than prior token-based mesh representations. We then build a 3D generator based on a Rectified-Flow transformer, which generates all mesh vertices and edges in parallel. This model samples meshes $26\times$ faster than the fastest AR generator while also achieving state-of-the-art accuracy across standard mesh-generation metrics.
Successful Page Load