My Resume

Expertise

2020 - 2026

AI Researcher / PhD Researcher
Universidad Politecnica de Madrid

Designed and evaluated deep-learning systems for computer vision and video understanding, including face recognition, sports-video summarization, and multimodal video analysis. Built end-to-end ML pipelines with CNNs, Vision Transformers, CLIP, and vision-language models, delivered applied R&D with Nokia and Airbus, and published at CVPR Workshops, IEEE Access, Scientific Reports, and AVSS.

2017 - 2018

Junior Programmer
Indra Sistemas S.A.

Developed automation scripts for Voice-over-IP communication systems in air-traffic-management environments. Configured and tested telecommunications equipment, routers, and switches, collaborating with engineering teams under Scrum methodology.

2016 - 2017

Data Coder & Trainee Programmer
DEYDE Calidad de Datos S.L.

Processed, validated, and maintained structured databases of municipalities, streets, and postal-address records. Contributed to rule-based expert systems for postal-address correction and coding, supporting data-quality and software-support workflows.

Education

2020 - 2025

Ph.D. in Communication Technologies and Systems
Universidad Politecnica de Madrid, UPM

Thesis: Automatic Sports Video Summarization with Identity-Aware Highlight Selection. Grade: Cum Laude.

2018 - 2020

M.S. in Telecommunication Engineering
Universidad Politecnica de Madrid, UPM

Thesis: Automatic Highlight Detection in Martial Arts Tricking Videos. Grade: 10/10, with Honors (Matrícula de Honor).

2013 - 2018

B.S. in Telecommunication Systems Engineering
Universidad de Alcala de Henares, UAH

Thesis: Acoustic Bird Classification Using MFCC Feature Extraction. Grade: 9/10.

Skills

Python

PyTorch

TensorFlow / Keras

HuggingFace

Docker & Kubernetes

OpenCV

CLIP & Vision-Language Models

Git & Linux

AWS / SageMaker

Languages

Spanish (Native)

English (C1 Certified)

French (Basic)

My Portfolio

All Research Academy Projects Courses

CVPR 2025

ViMoCLIP: Video Motion Cues for Animal Action Recognition

ViMoCLIP

A novel approach that augments static CLIP representations with video motion cues for improved animal action recognition. Published at IEEE/CVF CVPR Workshops 2025.

Tech: CLIP, Vision Transformers, PyTorch, Optical Flow

Read Paper Website Code

IEEE Access 2025

Text-Guided Sports Highlights with CLIP

Sports Video Summarization

CLIP-based framework for automatic video summarization of soccer matches. Uses multimodal (text + image) neural networks for highlight detection.

Tech: CLIP, Multimodal Networks, Video Processing

Read Paper Code

Scientific Reports 2024

Vision Transformers vs CNNs for Face Recognition

Face Recognition Comparison

Comprehensive comparison between Vision Transformers and CNNs for face recognition tasks. Published in Nature Scientific Reports.

Tech: ViT, CNNs, TensorFlow, Face Recognition

Read Paper Code

MTAP 2023

Automatic Highlight Detection in Martial Arts Tricking

Tricking Highlight Detection

Deep learning system for automatic highlight detection in martial arts tricking videos using 2D/3D CNNs, recurrent networks and Transformers.

Tech: CNNs, LSTMs, Transformers, Video Analysis

Read Paper

IEEE AVSS 2022

UPM-GTI-Face: Face Detection Dataset

UPM-GTI-Face Dataset

Dataset for evaluating the impact of distance and face masks on face detection and recognition systems. Presented at IEEE AVSS in Madrid.

Tech: Face Detection, Dataset Creation, CNNs

Read Paper

Ph.D. Thesis 2025

Automatic Sports Video Summarization with Identity-Aware Highlight Selection

Ph.D. Thesis

Doctoral dissertation presenting novel deep learning methods for automatic sports video summarization. Combines identity-aware techniques with highlight detection for personalized content generation.

University: Universidad Politecnica de Madrid (UPM)

Read Thesis

Master's Thesis 2020

Automatic Highlight Detection in Martial Arts Tricking Videos

Master's Thesis (TFM)

Development of a deep learning strategy to automatically detect highlights in martial arts tricking videos. Grade: Distinction (10/10).

University: Universidad Politecnica de Madrid (UPM)

Read Thesis

Bachelor's Thesis 2018

Acoustic Bird Classification Using MFCC Feature Extraction

Bachelor's Thesis (TFG)

Acoustic classification of bird species using sound feature extraction through MFCC (Mel-Frequency Cepstral Coefficients) parameters. Grade: Excellent (9/10).

University: Universidad de Alcala de Henares (UAH)

Read Thesis

AI Infrastructure

Olympus: OpenClaw Agent System

Personal Multi-Agent AI Workspace

Personal AI workspace built on OpenClaw and running 24/7 on a dedicated Mac Mini. It orchestrates specialized agents for coordination, planning, and coding, with persistent memory, Telegram-based control, task tracking, and GitHub-backed automation.

Tech: OpenClaw, Claude Sonnet/Opus, Telegram Bots, FastAPI, SQLite, Python, GitHub

Code

AI Agent

Paper Copilot: Research Paper Summarizer

Paper Copilot

Local AI agent that reads a research paper (PDF) and produces a structured, Notion-ready markdown summary including metadata, methodology, key results, reference analysis charts, and extracted figures.

Tech: LangChain, Ollama, Streamlit, Python

Code

AI Agent

Job Finder: AI Job Monitoring Agent

Job Finder

Local-first AI agent that crawls 16+ public job sources, normalizes postings, and scores relevance using hybrid ranking (rule-based, semantic embeddings, and LLM fit). Includes a Streamlit dashboard for review.

Tech: LangGraph, Ollama, FAISS, SQLite, Streamlit

Code

Computer Vision

RoboMaster Tank Detection

RoboMaster Detection

Designed a university course assignment using RoboMaster tanks. Implemented localization networks in TensorFlow, object detection in PyTorch, and YOLOv8 for real-time detection.

Tech: TensorFlow, PyTorch, YOLO, Python SDK

Website

Web App

Breakfast-Order Management App

Breakfast Order System

Web app for organizing breakfast orders for a 20-person research group. Auto-calculates optimal drink/food combinations and tracks shared expenses with a debt management module.

Tech: Python, Streamlit, Linux, Reverse Proxy

Website Code

Generative AI

Story & Image Generation

Generative Stories

System that auto-generates fictional stories and portraits of research group members using Llama 3.3 for narratives and Stable Diffusion 3.5 Large for images. Runs as a daily Linux service.

Tech: Llama 3.3, Stable Diffusion 3.5, Python, Linux

Website Code

Hackathon

INDESIAhack: Weather Detection for Ferrovial

Adverse Weather Detection

Led a team to build a Model of Experts (MoE) combining CLIP, Microsoft Azure, and ChatGPT API to assess road visibility from traffic cameras worldwide. Ran on AWS SageMaker.

Tech: CLIP, Azure, AWS SageMaker, ChatGPT API, MoE

Workshop

Custom Components with TensorFlow

TensorFlow Workshop

Organized workshops teaching custom TensorFlow components: loss functions, activations, initializers, regularizers, metrics, layers, models and training loops. Explored library internals and graph management.

Tech: TensorFlow, Keras, Python

Website

Web Development

Personal Website

This Website!

Self-taught HTML, CSS, and JavaScript to build this personal portfolio. Learned web development from scratch, version control with Git, and deployment on GitHub Pages.

Tech: HTML, CSS, JavaScript, Git, GitHub Pages

View on GitHub

DevOps

Kubernetes Cluster with 17 GPU Nodes

GPU Kubernetes Cluster

Built a Kubernetes cluster integrating 17 GPU-equipped computers. Configured NVIDIA support, native authentication, NFS storage, custom Docker profiles, and a minimalist JupyterHub interface for distributed neural network training.

Tech: Kubernetes, Docker, JupyterHub, NFS, NVIDIA

Website

Hardware

Multi-GPU Workstation Assembly

Deep Learning Workstations

Selected components and assembled multiple high-performance multi-GPU workstations for the research group. Handled system administration: OS installation, user management, package control, and driver updates.

Tech: NVIDIA GPUs, Linux, System Administration

DeepLearning.AI · 2025

How Transformer LLMs Work

A guided tour of the transformer architecture behind modern LLMs by Jay Alammar and Maarten Grootendorst: tokenization, embeddings, self-attention and transformer blocks, plus recent advances such as KV cache, multi-query / grouped-query attention, and Mixture of Experts.

Tech: Hugging Face, Transformers, LLMs, Self-Attention

View Course

DeepLearning.AI · 2025

Attention in Transformers: Concepts and Code in PyTorch

Attention in Transformers

Josh Starmer's deep dive into the attention mechanism that powers LLMs: Query/Key/Value matrices, self-attention and masked self-attention, cross-attention, and multi-head attention — combining mathematical intuition with hands-on PyTorch implementations.

Tech: PyTorch, Attention, Transformers

View Course

DeepLearning.AI & AWS · 2024

Generative AI with Large Language Models

AWS & DeepLearning.AI course covering the full generative-AI project lifecycle: transformer architecture, model pre-training and scaling laws, instruction fine-tuning, parameter-efficient methods (LoRA, soft prompts), evaluation, and reinforcement learning from human feedback (RLHF) for deployment.

Tech: LLMs, Transformers, Fine-tuning, LoRA, RLHF

View Course Certificate

DeepLearning.AI · 2024

How Diffusion Models Work

A hands-on short course by Sharon Zhou that builds diffusion models from scratch — the sampling process, noise-prediction neural networks, training, conditional and personalized generation, and techniques to speed up sampling — going beyond pre-built APIs.

Tech: Diffusion Models, PyTorch, Generative AI

View Course

DeepLearning.AI · 2023

Deep Learning Specialization

Andrew Ng's five-course specialization spanning neural networks and deep learning, network tuning and optimization, structuring ML projects, convolutional neural networks, and sequence models (RNNs, LSTMs, Transformers), with applications across computer vision, NLP, and speech.

Tech: Python, TensorFlow, CNNs, RNNs/LSTMs

View Course Certificate

My Hobbies

Info Chess Books Acrobatics Boxing

🚧 🛠 🚀

This Section is Under Construction!

I'm working on adding more content to this website.

♟

Chess

Chess always lingered at the edges of my life—a game my friends loved and one I had tried as a child, yet that never truly caught on. That changed during a three-month international research stay in Montreal in 2024, when, with little else to do, I downloaded a chess app and quickly became hooked. It has since grown into a genuine passion: I play regularly with friends and online, have been taking lessons for several months, and have watched my rating climb from around 400 to a peak of 1500—and still rising. I love how it challenges the mind and rewards patience and strategy. And to share that passion with you, I have put together the puzzles below—each a little harder than the last, so feel free to test yourself.

♟ Chess Puzzle

📚

Books

Ever since I was a child I have loved to read, especially fantasy and science-fiction novels—though my all-time favourite will always be Patrick Rothfuss's The Name of the Wind. These days I read less fiction and more technical books, on subjects like Linux, deep learning, and chess. Soon I plan to add small cards here, much like those in my portfolio, featuring books I have read along with my recommendations.

🤸

Acrobatics

My fascination with acrobatics began with parkour videos from Yamakasi, the French collective that pioneered the discipline. Inspired, I ran up a large tree in my garden and threw myself into a backflip—a dozen attempts later I still had not landed it cleanly, but I walked away unhurt and certain I could learn. That led me to a couple of years of artistic gymnastics, before my restless, sport-hungry nature pulled me toward football, archery, tennis, bike trials, and more. The spark returned after a year living in California: back home I started flipping again on any patch of grass I could find, and soon met the friends who introduced me to martial arts tricking—a passion I pursued for ten years, achieving goals I never thought possible when I began. I hope to share some videos of these acrobatics here soon.

🥊

Boxing

Boxing is another sport I had tried briefly years ago—a couple of years of boxing and kickboxing here in Madrid before, as always, moving on to something new. After a decade of martial arts tricking, I eventually stepped away from it, since its toll on the joints makes it hard to sustain over time. A year ago I returned to boxing and have loved it ever since: it is far more technical than it looks from the outside, and the people I have met through it—far from aggressive—are some of the most humble and welcoming I have known. I enjoy every part of it, from training to sparring, and I hope to share some clips of my sparring sessions here soon.

Who am I ?

An AI Researcher / Research Engineer with a PhD, based in Madrid

Personal Info

My Expertise

Computer Vision & Video Understanding

Multimodal & Generative AI

ML Engineering & Infrastructure

My Resume

Expertise

2020 - 2026

2017 - 2018

2016 - 2017

Education

2020 - 2025

2018 - 2020

2013 - 2018

Skills

Python

PyTorch

TensorFlow / Keras

HuggingFace

Docker & Kubernetes

OpenCV

CLIP & Vision-Language Models

Git & Linux

AWS / SageMaker

Languages

Spanish (Native)

English (C1 Certified)

French (Basic)

My Portfolio

CVPR 2025

ViMoCLIP

IEEE Access 2025

Sports Video Summarization

Scientific Reports 2024

Face Recognition Comparison

MTAP 2023

Tricking Highlight Detection

IEEE AVSS 2022

UPM-GTI-Face Dataset

Ph.D. Thesis 2025

Ph.D. Thesis

Master's Thesis 2020

Master's Thesis (TFM)

Bachelor's Thesis 2018

Bachelor's Thesis (TFG)

AI Infrastructure

Personal Multi-Agent AI Workspace

AI Agent

Paper Copilot

AI Agent

Job Finder

Computer Vision

RoboMaster Detection

Web App

Breakfast Order System

Generative AI

Generative Stories

Hackathon

Adverse Weather Detection

Workshop

TensorFlow Workshop

Web Development

This Website!

DevOps

GPU Kubernetes Cluster

Hardware

Deep Learning Workstations

DeepLearning.AI · 2025

How Transformer LLMs Work

DeepLearning.AI · 2025

Attention in Transformers

DeepLearning.AI & AWS · 2024

Generative AI with Large Language Models

DeepLearning.AI · 2024

How Diffusion Models Work

DeepLearning.AI · 2023

Deep Learning Specialization

My Hobbies

Phone :
+ (34) 618-382-472

Address :
Madrid, Spain

Email :
marcosrodrigo5@hotmail.com