European Conference on Computer Vision 2024

CadVLM

Bridging Language and Vision in the Generation of Parametric CAD Sketches

For the CAD autocompletion task, our multimodal CadVLM model (b) receives partial CAD entities as both image and text input (a) and generates the remaining sketch entities as output (c). The completed sketch, with optional predicted constraints, can then be used in CAD software (d) to form 3D shapes (e). More details about the primitive values in the sketch text are given in the Appendix.

Abstract

Parametric Computer-Aided Design (CAD) is central to contemporary mechanical design, yet precise parametric sketch modeling remains challenging, and the field lacks practical evaluation metrics suited to mechanical design. We harness the capabilities of pre-trained foundation models, renowned for their success in natural language processing and computer vision, to develop generative models specifically for CAD. These models can understand complex geometries and reason about designs, a crucial advancement for CAD technology. In this paper, we propose CadVLM, an end-to-end vision-language model for CAD generation. Our approach adapts pre-trained foundation models to manipulate engineering sketches effectively, integrating both sketch primitive sequences and sketch images. Extensive experiments demonstrate superior performance on multiple CAD sketch generation tasks, including CAD autocompletion, CAD autoconstraint, and image-conditional generation. To our knowledge, this is the first successful application of a multimodal Large Language Model (LLM) to parametric CAD generation, a pioneering step in the field of computer-aided mechanical design.
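The dual-input pipeline shown in the figure above can be illustrated with a small sketch. The following is a minimal, hypothetical Python example of how partial sketch primitives might be serialized to a token sequence for the language branch and rasterized to an image for the vision branch; the primitive schema, token format, and function names (serialize_primitives, rasterize_primitives) are illustrative assumptions, not CadVLM's actual encoding, which is described in the paper's Appendix.

```python
# Hypothetical pre-processing for CAD autocompletion with a vision-language
# model. The primitive schema and token format below are illustrative
# assumptions, not CadVLM's actual encoding (see the paper's Appendix).
from PIL import Image, ImageDraw

# A partial sketch: each primitive is a type tag plus quantized parameters.
partial_sketch = [
    {"type": "line",   "params": (8, 8, 56, 8)},    # x1, y1, x2, y2
    {"type": "line",   "params": (56, 8, 56, 40)},
    {"type": "circle", "params": (32, 24, 10)},     # cx, cy, radius
]

def serialize_primitives(primitives):
    """Flatten primitives into a text sequence for the language branch."""
    tokens = []
    for p in primitives:
        tokens.append(p["type"])
        tokens.extend(str(v) for v in p["params"])
    return " ".join(tokens)

def rasterize_primitives(primitives, size=64):
    """Render the same primitives into an image for the vision branch."""
    img = Image.new("L", (size, size), color=255)
    draw = ImageDraw.Draw(img)
    for p in primitives:
        if p["type"] == "line":
            draw.line(p["params"], fill=0)
        elif p["type"] == "circle":
            cx, cy, r = p["params"]
            draw.ellipse((cx - r, cy - r, cx + r, cy + r), outline=0)
    return img

text_input = serialize_primitives(partial_sketch)   # -> language branch
image_input = rasterize_primitives(partial_sketch)  # -> vision branch
print(text_input)  # line 8 8 56 8 line 56 8 56 40 circle 32 24 10
# A VLM conditioned on (image_input, text_input) would then generate
# tokens for the remaining entities, parsed back into sketch primitives.
```

In this setup, the same partial sketch reaches the model through two complementary views: the text sequence preserves exact parameter values, while the raster image conveys overall geometry, which is the intuition behind combining both modalities.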
