Summary

Research scientist and engineer specializing in energy-based models, multi-agent systems, and production ML infrastructure. Currently leading the design of a foundation model for market panel data at Kuona, and previously built four production AI systems in under a year — a multi-agent analytical platform (ExpertAI), a knowledge graph engine (Grafa), a NL-to-SQL engine (KQuery), and Kubernetes infrastructure — each deployed at enterprise scale.

Education

New York University August 2023 – May 2025
M.S. Data Science, Center for Data Science
GPA: 3.88 / 4.0

Fulbright–García Robles Scholar (2022).

National Autonomous University of Mexico (UNAM) August 2016 – June 2022
B.Eng. Computer Engineering — School of Engineering GPA: 9.18 / 10
B.Sc. Data Science — Institute of Research in Applied Math & Systems (IIMAS) GPA: 9.44 / 10
Specialization Certificate in Finance — School of Accounting & Administration GPA: 9.57 / 10
Honors: UNAM PAPIIT Research Scholarship Grantee — IN100719 (2020), IA104720 (2020–2021)

Experience

Kuona — Principal Research Scientist 2025 – Present
Foundation Models for Panel Data (USA)
  • Leading the design of a JEPA-style energy-based foundation model for market panel data with structural inductive biases, supporting multi-horizon forecasting, intervention simulation, and probing of unobservable market drivers via synthetic supervision.
  • Developed mechanisms to model latent, unobservable market drivers (e.g., price elasticity, cannibalization) via partial supervision from synthetic markets, allowing causal probing without direct observation in real data.
  • Designed training objectives emphasizing structural inductive biases and compact representations, embedding key economic effects directly into the model and reducing reliance on brittle downstream estimation pipelines.
  • Energy-based latent-variable architecture with JEPA-style training objective, trained on real market data, synthetic markets with known unobservable drivers, and distilled predictions from 4+ years of Kuona's production forecasting models. Modular adapters per client without full retraining — eliminating the 1–12 hour per-client cycle of prior approaches.
Kuona — Product Owner & AI Lead 2024 – Present
ExpertAI (USA)
  • Conceived and shipped ExpertAI end-to-end as sole architect — multi-agent backend (Django, Python), React frontend with WebSocket real-time communication, RabbitMQ/Kombu messaging system, cloud deployment (AWS EKS) — from concept to production in under a year.
  • Multi-agent orchestration with compartmentalized context per subagent/tool, two-layer plan validation (formal PDDL gates + soft LLM-as-a-judge evaluators with scoped rule sets), and checkpoint-based resilience. Deployed and in use by global retailers and manufacturers.
  • Designed a formal tool-planning framework where agents generate and execute multi-step plans verified via PDDL-style preconditions and effects, ensuring logical validity and dependency satisfaction before execution.
  • Implemented deterministic plan reuse and semantic grounding mechanisms, enabling consistent execution, controlled plan adaptation, and resolution of ambiguous user inputs into unambiguous, executable specifications.
  • Built Grafa: knowledge graph engine giving LLMs structured, relationship-aware memory over heterogeneous documents. Multi-tenant isolation with federated retrieval and hook-based extensibility at every pipeline stage.
  • Built KQuery: NL-to-SQL engine across three database backends. Plans and verifies the full query hierarchy upfront — deciding intermediate schemas and SQL implementation details before execution — then runs all queries in parallel as soon as their dependencies are satisfied. Generation rules stored as data in the knowledge graph, scoped by platform, company, or user and updatable without code changes.
  • Built KNER (Kuona Named Entity Recognition): multi-agent entity resolution pipeline that resolves free-text query segments against company-specific structured databases — products, geographies, promotions, brands, and business concepts — within a conversational, human-in-the-loop framework. Implemented as nested LangGraph state machines with four specialized LLM agents selected for cost/latency/capability tradeoffs.
  • Designed UserAI, an autonomous evaluation harness that drives multi-turn agent sessions end-to-end — enabling CI-integrated regression detection of failure modes invisible in single-turn testing.
  • Built an evaluation system that represents the agent as a directed graph of components, attributes failures to specific nodes via automated triage, ranks components by downstream harm for prioritized improvement, and provides deterministic replay for counterfactual experiments.
  • Designed bypass auditing for agentic systems: when agents override validation gates, decisions are explicitly recorded — what was bypassed, why, and downstream consequences — making safety exceptions analyzable rather than invisible.
Sketchpro.ai — Research Scientist Intern June – August 2024
USA
  • Built vision-LLM quality monitoring systems and automated data acquisition pipelines over CommonCrawl, producing a ~700 GB high-quality dataset and training a custom SDXL ControlNet for conditional generation.
Kuona — Data Scientist / Machine Learning Engineer February 2022 – 2024
Mexico
  • Designed and deployed large-scale forecasting systems across thousands of market-correlated time series.
  • Design, implementation, testing and deployment of a Machine Learning Pipeline that generates weekly demand forecasts for all of Heineken Mexico's product catalog in each of its distribution centers. Rewrote the entire system from scratch to improve results and performance for this process spanning 500+ servers, training approximately 3,500 recurrent neural networks weekly.
  • Redesign and standardization of one of Kuona's main products, the Perfect Order demand forecasting system.
  • Design and implementation of a new time series forecasting library featuring transparent multi-horizon predictions, automatic hyperparameter search, and native support for ensemble models.
  • Design and implementation of a custom task orchestration system for Kuona's Machine Learning Pipelines with native support for heterogeneous distributed computing, arbitrary horizontal scalability, and easy management for remote tasks by abstracting the compute resources behind a UNIX-like process control interface.
  • Regular meetings with clients to identify data requirements in order to guarantee consistent and precise forecasting. Evaluation of alternative datasources originating within clients' organizations.

Skills & Technologies

Programming Languages
Python Julia C/C++ C# Java R CUDA Elixir VHDL
ML & Data
PyTorch TensorFlow Flux.jl Spark LangGraph
Databases
SQL Neo4j Redis MongoDB
Systems & Infrastructure
Multi-Agent Systems Knowledge Graphs Kubernetes EKS AWS (Advanced) Azure Databricks Prometheus
Frontend
React

Products & Systems

ExpertAI

Agentic Platform • 2024–Present

Multi-agent analytical platform for autonomous analysis and ML model development over tabular data. PDDL-validated planning, hierarchical rule system (platform/company/user scopes), checkpoint-based resilience, LangGraph orchestration. Full-stack: Django backend, React frontend with WebSocket real-time communication, RabbitMQ/Kombu messaging, AWS EKS deployment. Used by global retailers and manufacturers.

Grafa

Knowledge Graph Engine • 2024

Knowledge graph engine providing LLMs with structured, relationship-aware memory over heterogeneous documents. Multi-tenant isolation with federated retrieval and hook-based extensibility at every pipeline stage.

KQuery

NL-to-SQL Engine • 2024

Natural-language-to-SQL engine across three database backends. Plans and verifies the full query hierarchy upfront — intermediate schemas and SQL implementation decided before execution — then runs all queries in parallel as dependencies resolve. Generation rules stored as data in the knowledge graph, scoped by platform, company, or user. Entity resolution cleanly separated from SQL generation.

KNER

Entity Resolution Pipeline • 2024

Multi-agent, LLM-orchestrated named entity resolution pipeline for natural language business analytics queries. Resolves free-text segments against company-specific structured databases — products, geographies, promotions, brands, and business concepts — within a conversational, human-in-the-loop framework. Implemented as nested LangGraph state machines with four specialized LLM agents, deployed as a sub-graph within ExpertAI. Read more

UserAI

Evaluation Harness • 2025

Autonomous evaluation harness that drives multi-turn agent sessions end-to-end — sending queries, selecting disambiguation options, assessing goal satisfaction, and deciding when to continue or stop. CI-integrated regression detection for failure modes invisible in single-turn testing.

Foundation Model for Market Panel Data

Research • 2025–Present

Energy-based latent-variable model with JEPA-style training objective. Learns compact representations of unobservable market dynamics via synthetic supervision and structured probing. Modular per-client adapters eliminate hours-long retraining cycles.

Academic Projects

AIMPAC Refactoring

Research • 2021–2022

Led a multidisciplinary group that built a Julia language replacement for the AIMPAC software suite for describing the quantum structure of molecules. Parallel, GPU-ready replacement for the original Fortran code. Collaboration between UNAM's IIMAS and School of Chemistry. View project

PSO Supply Chain Optimization

Research • 2021

Parallel implementation for supply chain optimization on Nvidia GPUs using CUDA and Julia. Binary particle swarm optimization algorithm with conservation-of-flow constraints and LP-based cost evaluation. View project

Academic Collaboration Prediction

Research • 2020–2021

Improved upon previous results in predicting future academic collaborations using topological data. Created a reduced feature vector via SVD on the collaboration network's adjacency matrix, improving on Hasan et al. (2006). View project

Detection of Toxic Language

Class Project • 2020

GRU-based recurrent model to detect spans of toxic text in social network posts for the 2021 SEMEVAL International Workshop on Semantic Evaluation. Outperformed baseline results, placing above 50 other participating teams.

COVID-19 Lung CT Segmentation ConvNet

Class Project • 2020

U-Net Convolutional Neural Network for identifying pneumonia indicators from COVID-19 in radiology images. Transfer learning and domain-specific augmentation achieved strong results from only ~60 annotated images. View project

Publications & Preprints

Agarwal, V., Manasson, J., Garrido Czacki, M., & Sucholutsky, I. (2025)
ICLR 2025 Re-Align Workshop (Poster)
Proposed and built a pipeline for extracting hierarchical latent representations from olfactory bulb imagery; repurposed medically finetuned SAM2 for generalization, selected encoder-level features, and applied optimal transport to align and compare latent geometry across subjects.

Awards & Honors

1st Place — Energy-Based Modeling Competition (NYU, Yann LeCun; 1st of 53). Compact ~20K-parameter physics-informed model outperformed ~3M-parameter ViT approaches via structural inductive biases. 2025
Fulbright–García Robles Scholar 2022
UNAM PAPIIT Research Scholarship — Project IA104720 (MCMC Methods for Large-Scale Linear Systems) 2020–2021
1st Place — UNAM School of Engineering VLSI Design Competition 2020
UNAM PAPIIT Research Scholarship — Project IN100719 (Predictive Models Applied to Graphs and Text) 2020
2nd Place — First UNAM Impulse to Innovation Contest 2018
Telmex Foundation Scholarship for Academic Excellence 2017
UNAM Data Science B.Sc. Alumni Association — Founding Member
UNAM Data Science B.Sc. Academic Council — First Class Student Representative

Certifications

  • Neural Networks and Deep Learning — Deeplearning.ai
  • Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization — Deeplearning.ai
  • Structuring Machine Learning Projects — Deeplearning.ai
  • Sequence Models — Deeplearning.ai
  • Natural Language Processing with Classification and Vector Spaces — Deeplearning.ai
  • Natural Language Processing with Probabilistic Models — Deeplearning.ai
  • Build Basic Generative Adversarial Networks (GANs) — Deeplearning.ai

Languages

Spanish Native
English Fluent
Japanese Intermediate
Chinese Beginner