Summary
Research scientist and engineer specializing in energy-based models, multi-agent systems, and production ML infrastructure. Currently leading the design of a foundation model for market panel data at Kuona, and previously built four production AI systems in under a year — a multi-agent analytical platform (ExpertAI), a knowledge graph engine (Grafa), a NL-to-SQL engine (KQuery), and Kubernetes infrastructure — each deployed at enterprise scale.
Education
M.S. Data Science, Center for Data Science
GPA: 3.88 / 4.0
Fulbright–García Robles Scholar (2022).
B.Eng. Computer Engineering — School of Engineering
GPA: 9.18 / 10
B.Sc. Data Science — Institute of Research in Applied Math & Systems (IIMAS)
GPA: 9.44 / 10
Specialization Certificate in Finance — School of Accounting & Administration
GPA: 9.57 / 10
Honors: UNAM PAPIIT Research Scholarship Grantee — IN100719 (2020), IA104720 (2020–2021)
Experience
Foundation Models for Panel Data (USA)
- Leading the design of a JEPA-style energy-based foundation model for market panel data with structural inductive biases, supporting multi-horizon forecasting, intervention simulation, and probing of unobservable market drivers via synthetic supervision.
- Developed mechanisms to model latent, unobservable market drivers (e.g., price elasticity, cannibalization) via partial supervision from synthetic markets, allowing causal probing without direct observation in real data.
- Designed training objectives emphasizing structural inductive biases and compact representations, embedding key economic effects directly into the model and reducing reliance on brittle downstream estimation pipelines.
- Energy-based latent-variable architecture with JEPA-style training objective, trained on real market data, synthetic markets with known unobservable drivers, and distilled predictions from 4+ years of Kuona's production forecasting models. Modular adapters per client without full retraining — eliminating the 1–12 hour per-client cycle of prior approaches.
ExpertAI (USA)
- Conceived and shipped ExpertAI end-to-end as sole architect — multi-agent backend (Django, Python), React frontend with WebSocket real-time communication, RabbitMQ/Kombu messaging system, cloud deployment (AWS EKS) — from concept to production in under a year.
- Multi-agent orchestration with compartmentalized context per subagent/tool, two-layer plan validation (formal PDDL gates + soft LLM-as-a-judge evaluators with scoped rule sets), and checkpoint-based resilience. Deployed and in use by global retailers and manufacturers.
- Designed a formal tool-planning framework where agents generate and execute multi-step plans verified via PDDL-style preconditions and effects, ensuring logical validity and dependency satisfaction before execution.
- Implemented deterministic plan reuse and semantic grounding mechanisms, enabling consistent execution, controlled plan adaptation, and resolution of ambiguous user inputs into unambiguous, executable specifications.
- Built Grafa: knowledge graph engine giving LLMs structured, relationship-aware memory over heterogeneous documents. Multi-tenant isolation with federated retrieval and hook-based extensibility at every pipeline stage.
- Built KQuery: NL-to-SQL engine across three database backends. Plans and verifies the full query hierarchy upfront — deciding intermediate schemas and SQL implementation details before execution — then runs all queries in parallel as soon as their dependencies are satisfied. Generation rules stored as data in the knowledge graph, scoped by platform, company, or user and updatable without code changes.
- Built KNER (Kuona Named Entity Recognition): multi-agent entity resolution pipeline that resolves free-text query segments against company-specific structured databases — products, geographies, promotions, brands, and business concepts — within a conversational, human-in-the-loop framework. Implemented as nested LangGraph state machines with four specialized LLM agents selected for cost/latency/capability tradeoffs.
- Designed UserAI, an autonomous evaluation harness that drives multi-turn agent sessions end-to-end — enabling CI-integrated regression detection of failure modes invisible in single-turn testing.
- Built an evaluation system that represents the agent as a directed graph of components, attributes failures to specific nodes via automated triage, ranks components by downstream harm for prioritized improvement, and provides deterministic replay for counterfactual experiments.
- Designed bypass auditing for agentic systems: when agents override validation gates, decisions are explicitly recorded — what was bypassed, why, and downstream consequences — making safety exceptions analyzable rather than invisible.
USA
- Built vision-LLM quality monitoring systems and automated data acquisition pipelines over CommonCrawl, producing a ~700 GB high-quality dataset and training a custom SDXL ControlNet for conditional generation.
Mexico
- Designed and deployed large-scale forecasting systems across thousands of market-correlated time series.
- Design, implementation, testing and deployment of a Machine Learning Pipeline that generates weekly demand forecasts for all of Heineken Mexico's product catalog in each of its distribution centers. Rewrote the entire system from scratch to improve results and performance for this process spanning 500+ servers, training approximately 3,500 recurrent neural networks weekly.
- Redesign and standardization of one of Kuona's main products, the Perfect Order demand forecasting system.
- Design and implementation of a new time series forecasting library featuring transparent multi-horizon predictions, automatic hyperparameter search, and native support for ensemble models.
- Design and implementation of a custom task orchestration system for Kuona's Machine Learning Pipelines with native support for heterogeneous distributed computing, arbitrary horizontal scalability, and easy management for remote tasks by abstracting the compute resources behind a UNIX-like process control interface.
- Regular meetings with clients to identify data requirements in order to guarantee consistent and precise forecasting. Evaluation of alternative datasources originating within clients' organizations.
Skills & Technologies
Programming Languages
Python
Julia
C/C++
C#
Java
R
CUDA
Elixir
VHDL
ML & Data
PyTorch
TensorFlow
Flux.jl
Spark
LangGraph
Databases
SQL
Neo4j
Redis
MongoDB
Systems & Infrastructure
Multi-Agent Systems
Knowledge Graphs
Kubernetes
EKS
AWS (Advanced)
Azure
Databricks
Prometheus
Products & Systems
Multi-agent analytical platform for autonomous analysis and ML model development over tabular data. PDDL-validated planning, hierarchical rule system (platform/company/user scopes), checkpoint-based resilience, LangGraph orchestration. Full-stack: Django backend, React frontend with WebSocket real-time communication, RabbitMQ/Kombu messaging, AWS EKS deployment. Used by global retailers and manufacturers.
Knowledge graph engine providing LLMs with structured, relationship-aware memory over heterogeneous documents. Multi-tenant isolation with federated retrieval and hook-based extensibility at every pipeline stage.
Natural-language-to-SQL engine across three database backends. Plans and verifies the full query hierarchy upfront — intermediate schemas and SQL implementation decided before execution — then runs all queries in parallel as dependencies resolve. Generation rules stored as data in the knowledge graph, scoped by platform, company, or user. Entity resolution cleanly separated from SQL generation.
Multi-agent, LLM-orchestrated named entity resolution pipeline for natural language business analytics queries. Resolves free-text segments against company-specific structured databases — products, geographies, promotions, brands, and business concepts — within a conversational, human-in-the-loop framework. Implemented as nested LangGraph state machines with four specialized LLM agents, deployed as a sub-graph within ExpertAI. Read more
Autonomous evaluation harness that drives multi-turn agent sessions end-to-end — sending queries, selecting disambiguation options, assessing goal satisfaction, and deciding when to continue or stop. CI-integrated regression detection for failure modes invisible in single-turn testing.
Energy-based latent-variable model with JEPA-style training objective. Learns compact representations of unobservable market dynamics via synthetic supervision and structured probing. Modular per-client adapters eliminate hours-long retraining cycles.
Academic Projects
Led a multidisciplinary group that built a Julia language replacement for the AIMPAC software suite for describing the quantum structure of molecules. Parallel, GPU-ready replacement for the original Fortran code. Collaboration between UNAM's IIMAS and School of Chemistry. View project
Parallel implementation for supply chain optimization on Nvidia GPUs using CUDA and Julia. Binary particle swarm optimization algorithm with conservation-of-flow constraints and LP-based cost evaluation. View project
Improved upon previous results in predicting future academic collaborations using topological data. Created a reduced feature vector via SVD on the collaboration network's adjacency matrix, improving on Hasan et al. (2006). View project
GRU-based recurrent model to detect spans of toxic text in social network posts for the 2021 SEMEVAL International Workshop on Semantic Evaluation. Outperformed baseline results, placing above 50 other participating teams.
U-Net Convolutional Neural Network for identifying pneumonia indicators from COVID-19 in radiology images. Transfer learning and domain-specific augmentation achieved strong results from only ~60 annotated images. View project
Publications & Preprints
Agarwal, V., Manasson, J., Garrido Czacki, M., & Sucholutsky, I. (2025)
ICLR 2025 Re-Align Workshop (Poster)
Proposed and built a pipeline for extracting hierarchical latent representations from olfactory bulb imagery; repurposed medically finetuned SAM2 for generalization, selected encoder-level features, and applied optimal transport to align and compare latent geometry across subjects.
Awards & Honors
1st Place — Energy-Based Modeling Competition (NYU, Yann LeCun; 1st of 53). Compact ~20K-parameter physics-informed model outperformed ~3M-parameter ViT approaches via structural inductive biases.
2025
Fulbright–García Robles Scholar
2022
UNAM PAPIIT Research Scholarship — Project IA104720 (MCMC Methods for Large-Scale Linear Systems)
2020–2021
1st Place — UNAM School of Engineering VLSI Design Competition
2020
UNAM PAPIIT Research Scholarship — Project IN100719 (Predictive Models Applied to Graphs and Text)
2020
2nd Place — First UNAM Impulse to Innovation Contest
2018
Telmex Foundation Scholarship for Academic Excellence
2017
UNAM Data Science B.Sc. Alumni Association — Founding Member
UNAM Data Science B.Sc. Academic Council — First Class Student Representative
Certifications
- Neural Networks and Deep Learning — Deeplearning.ai
- Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization — Deeplearning.ai
- Structuring Machine Learning Projects — Deeplearning.ai
- Sequence Models — Deeplearning.ai
- Natural Language Processing with Classification and Vector Spaces — Deeplearning.ai
- Natural Language Processing with Probabilistic Models — Deeplearning.ai
- Build Basic Generative Adversarial Networks (GANs) — Deeplearning.ai
Languages
Spanish
Native
English
Fluent
Japanese
Intermediate
Chinese
Beginner