Machine Learning & Computer Vision Consultant

I am an applied research scientist with 20+ years of experience in computer vision, deep learning, and ML systems. I've worked across academia and industry at Google, Momenta, ENSCO, and UMD, delivering impactful solutions from conception to deployment.

  • 📍 Böblingen, Germany
  • 🗣️ English, Catalan, Spanish, German (A1)
  • 🎓 Ph.D., University of Maryland

About

I'm Xavier Gibert-Serra, Ph.D., a consultant specializing in machine learning and computer vision. Previously, I was a Staff R&D Engineer at Momenta Europe working on autonomous driving perception, and a Machine Learning Software Engineer at Google Maps and Google X Robotics. Earlier, at ENSCO I led vision R&D for railway inspection systems. I earned my Ph.D. in Electrical & Computer Engineering at the University of Maryland, advised by Rama Chellappa.

Skills

Computer Vision, Deep Learning, ML Research & Deployment, 3D Perception, Sensor Fusion, Time Series Analysis, Python, C++, CUDA, PyTorch, TensorFlow, OpenCV, ROS, MLOps, Leadership & Mentoring

Experience

  1. May 2023 – Jul 2025

    Staff R&D Engineer — Momenta Europe GmbH

    Developed, trained, and deployed perception module updates for EU and US autonomous driving customers. Focused on 3D object detection, multi-sensor fusion, tracking, prediction, and data mining.

  2. Sep 2015 – Apr 2023

    Machine Learning Software Engineer — Google

    Google Maps: Designed large-scale vision pipelines for extracting structured information from Street View using detection, segmentation, OCR, and bundle adjustment.

    X Robotics: Developed real-time pose estimation and tracking algorithms for robotics applications using geometric techniques.

  3. Sep 2011 – Sep 2015

    Faculty Research Assistant — University of Maryland

    Managed a federally funded project for railway defect detection. Built GPU-accelerated anomaly detection algorithms, distributed processing pipelines, and multi-modal medical image registration.

  4. Apr 2004 – Apr 2013

    Senior Scientist — ENSCO, Inc.

    ENSCO Rail: Led the Image Processing Group. Developed real-time algorithms for optical rail profile analysis and crack detection. Managed R&D and productization of the RailScan family of systems.

    Team ENSCO — DARPA Grand Challenge: Built an obstacle detector using stereo cameras, enabling a robotic vehicle to autonomously drive 91 miles in desert terrain and finish sixth.

  5. Sep 2001 – Dec 2003

    Graduate Research Assistant — UMD LAMP Lab

    Developed frameworks for feature extraction from multimedia streams, OCR evaluation, and classification using multiple modalities.

Selected Projects

Google Maps Street View Extraction

Built ML pipelines for extracting street names, numbers, and traffic signs using detection, OCR, and semantic segmentation at global scale.

ENSCO JBIS project

Rail Joint Bar Crack Detection

Deployed a real-time system for crack detection on moving trains using line-scan cameras, integrated into production inspection fleets.

Autonomous Driving Perception (Momenta)

Developed 3D object detection, fusion, and prediction modules for L2+/L3 autonomous driving stacks in Europe and the US.

Dexter 2005 DARPA Grand Challenge

DARPA Grand Challenge Obstacle Detector

Designed a stereo-based obstacle detector enabling Team ENSCO’s vehicle to autonomously travel 91 miles and finish sixth overall.

Automatic Inspection of Railway Components

Designed a deep-learning system for automatic inspection of railway components, covering crack detection, tie grading, and detection of missing or broken rail anchors.

UMD CardioViewer

Designed a multimodality cardiac display and analysis tool for the University of Maryland School of Medicine.

Selected Publications

  • Adapting Style and Content for Attended Text Sequence Recognition. WACV, 2020. Paper
  • Deep Multitask Learning for Railway Track Inspection. IEEE ITS, 2017. Paper
  • Sequential Score Adaptation with EVT for Railway Inspection. ICCV Workshop, 2015. Paper, Code
  • Material Classification and Semantic Segmentation of Railway Track Images. ICIP, 2015. Paper
  • Discrete Shearlet Transform on GPU for Anomaly Detection. EURASIP JASP, 2014. Paper, Code
  • CardioViewer: A novel modular software tool for integrating cardiac electrophysiology voltage measurements and PET/SPECT data. IEEE NSS/MIC, 2014. Paper

Contact

Interested in collaborating? Email me or connect on LinkedIn.