Feasibility study: 3D Environment Reconstruction Using Foundation Stereo Without Task-Specific Training
The topic evaluates the feasibility of using FoundationStereo, a foundational deep learning model, for qualitative 3D reconstruction of industrial environments without task-specific training. It explores the model's ability to generate dense depth maps and reconstruct 3D scenes from stereo image pairs captured in real settings.
-
FoundationStereo model overview: FoundationStereo predicts dense disparity maps from rectified stereo image pairs using transformer architectures, enabling 3D reconstruction through depth maps and point clouds without environment-specific tuning or retraining.
-
Performance in industrial environment: The model successfully reconstructs the overall 3D geometry of most objects and structures in an industrial scene, providing clear and consistent point clouds for components with sufficient thickness. Thin objects around 1–2 mm thickness pose challenges due to minimal disparity differences, limiting reliable reconstruction.
-
Implications and limitations: FoundationStereo demonstrates strong generalization and rapid applicability for robotic perception tasks like scene understanding and navigation. Its main limitation is accurately reconstructing very thin objects, but it remains effective for typical industrial components without requiring labeled datasets or task-specific training.
CANDIDATE PROFILE
Suitable candidates have:
- A bachelor's degree in engineering or bachelor of Electromechanical Engineering Technology (specialization: Robotics & Automation);
- A bachelor of Computer Science/Artificial Intelligence;
- A bachelor in Artificial Intelligence;
- A bachelor in Applied Computer Science with specialization Machine Learning/Vision.
PRACTICAL DETAILS
The internship lasts 3 to 6 months at Flanders Make in Leuven, Belgium, with specific eligibility criteria for interns and thesis students.
File attachments
Flanders Make
Interested?
Call KATRIEN GEEBELEN
at the number: 011 790 590