Machine Learning & Computer Vision Consultant

I am an applied research scientist with 20+ years of experience in computer vision, deep learning, and ML systems. I've worked across academia and industry at Google, Momenta, ENSCO, and the University of Maryland (UMD), taking solutions from conception to deployment.

📍 Böblingen, Germany
🗣️ English, Catalan, Spanish, German (A1)
🎓 Ph.D., University of Maryland

About

I'm Xavier Gibert-Serra, Ph.D., a consultant specializing in machine learning and computer vision. Previously, I was a Staff R&D Engineer at Momenta Europe working on autonomous driving perception, and before that a Machine Learning Software Engineer at Google Maps and Google X Robotics. Earlier, at ENSCO, I led vision R&D for railway inspection systems. I earned my Ph.D. in Electrical & Computer Engineering at the University of Maryland, advised by Rama Chellappa.

Skills

Computer Vision, Deep Learning, ML Research & Deployment, 3D Perception, Sensor Fusion, Time Series Analysis, Python, C++, CUDA, PyTorch, TensorFlow, OpenCV, ROS, MLOps, Leadership & Mentoring

Experience

May 2023 – Jul 2025

Staff R&D Engineer — Momenta Europe GmbH

Developed, trained, and deployed perception module updates for EU and US autonomous driving customers. Focused on 3D object detection, multi-sensor fusion, tracking, prediction, and data mining.

Sep 2015 – Apr 2023

Machine Learning Software Engineer — Google

Google Maps: Designed large-scale vision pipelines for extracting structured information from Street View using detection, segmentation, OCR, and bundle adjustment.

X Robotics: Developed real-time pose estimation and tracking algorithms for robotics applications using geometric techniques.

Sep 2011 – Sep 2015

Faculty Research Assistant — University of Maryland

Managed a federally funded project for railway defect detection. Built GPU-accelerated anomaly detection algorithms, distributed processing pipelines, and multi-modal medical image registration.

Apr 2004 – Apr 2013

Senior Scientist — ENSCO, Inc.

ENSCO Rail: Led the Image Processing Group. Developed real-time algorithms for optical rail profile analysis and crack detection. Managed R&D and productization of the RailScan family of systems.

Team ENSCO — DARPA Grand Challenge: Built an obstacle detector using stereo cameras, enabling a robotic vehicle to autonomously drive 91 miles in desert terrain and finish sixth.

Sep 2001 – Dec 2003

Graduate Research Assistant — UMD LAMP Lab

Developed frameworks for feature extraction from multimedia streams, OCR evaluation, and classification using multiple modalities.

Selected Projects

Google Maps Street View Extraction

Built ML pipelines for extracting street names, numbers, and traffic signs using detection, OCR, and semantic segmentation at global scale.

Rail Joint Bar Crack Detection

Deployed real-time system for crack detection on moving trains using line-scan cameras, integrated into production inspection fleets.

Autonomous Driving Perception (Momenta)

Developed 3D object detection, fusion, and prediction modules for L2+/L3 autonomous driving stacks in Europe and the US.

DARPA Grand Challenge Obstacle Detector

Designed stereo-based obstacle detector enabling Team ENSCO’s vehicle to autonomously travel 91 miles and finish sixth overall.

Automatic Inspection of Railway Components

Designed a deep learning system for automatic inspection of railway components, covering crack detection, tie grading, and detection of missing or broken rail anchors.

CardioViewer: Multimodality Cardiac Display

Designed a multimodality cardiac display and analysis tool for the University of Maryland School of Medicine.

Selected Publications

Adapting Style and Content for Attended Text Sequence Recognition. WACV, 2020.
Deep Multitask Learning for Railway Track Inspection. IEEE ITS, 2017.
Sequential Score Adaptation with EVT for Railway Inspection. ICCV Workshop, 2015.
Material Classification and Semantic Segmentation of Railway Track Images. ICIP, 2015.
Discrete Shearlet Transform on GPU for Anomaly Detection. EURASIP JASP, 2014.
CardioViewer: A novel modular software tool for integrating cardiac electrophysiology voltage measurements and PET/SPECT data. IEEE NSS/MIC, 2014.

Contact

Interested in collaborating? Connect on LinkedIn.

🔗 LinkedIn
🐙 GitHub
📄 Google Scholar