Skip to main content

Roberto Villafuerte

Data Architect · Privacy & Data Governance · HealthTech

Download PDF (opens in new tab)

Summary

Lead Data Architect with a security and privacy-by-design mindset, building scalable data foundations for HealthTech and analytics-driven products. Focused on data modeling, governance, and secure data handling — quality, integrity, access control, auditability — to deliver actionable insights without exposing sensitive information. Background includes Master's-level training in Trustworthy ML, Big Data Platforms, and Software Architecture at the University of Helsinki, plus hands-on delivery of production data and ML pipelines.

Experience

Lead Data Architect

SaliHub · Hybrid, Quito

Feb 2026 – Present
  • Leading the design and governance of SaliHub's data architecture to support a scalable preventive health ecosystem
  • Architected the founding data model and core database schema from scratch, translating complex LOPD privacy requirements and business logic into a secure, multi-tenant architecture
  • Established Privacy-by-Design and defined the organization's data governance framework, implementing strict access controls and audit trails to handle sensitive health data (PII/PHI)
  • Building scalable data infrastructure with robust ETL pipelines using Python and SQL to transform raw health signals into longitudinal user insights and real-time dashboards
  • Partnering with Product and Engineering to translate business goals into measurable data capabilities

AI Engineer

COMPUMAX · Part-time, Remote

Dec 2025 – Feb 2026
  • Built and deployed a production-grade RAG pipeline to automate support workflows with high reliability
  • Designed end-to-end backend services in Python from prototype to deployment
  • Implemented retrieval quality evaluation and continuous improvement workflows
  • Added monitoring and observability practices to support safe releases and maintenance

Machine Learning Engineer Intern

Datalysis Group · Remote

Feb 2025 – Jun 2025
  • Designed and implemented an end-to-end sales forecasting system using Snowflake, Mage AI, and Python — from raw transactional data to automated daily predictions for branch managers
  • Built data cleaning, feature engineering, model training, and evaluation workflows following CRISP-DM principles
  • Improved reliability with validation checks, schema conventions, and documentation for reproducibility

Education

B.Eng. Computer Science

Universidad San Francisco de Quito (USFQ)

2022 – 2026

Focus: software engineering, algorithms, databases, applied machine learning.

Exchange Studies — Data Science (Master's-level)

University of Helsinki

Aug 2025 – Jan 2026

Coursework: Big Data Platforms, Computer Vision, Engineering of ML Systems, Trustworthy ML, Software Architecture.

Certifications

Technical AI Safety Course

BlueDot Impact

Jan 2026

General Data Protection Regulation (GDPR)

Packt

Dec 2025

IBM Machine Learning Specialization

Coursera

Awards

2nd Place — IEEEXtreme 18.0 (Ecuador)

Global 24-hour algorithmic programming competition organized by IEEE

Oct 2024

Languages

English

Native / Bilingual

Spanish

Native / Bilingual

Skills

Data Architecture & Governance

Data Architecture Data Modeling Data Governance Privacy-by-Design Database Security Access Control

Data Engineering

PostgreSQL SQL Python ETL / Data Pipelines Snowflake Mage AI

AI / ML Systems

Generative AI Machine Learning RAG MLOps LLM Evaluation Forecasting

Infrastructure & Engineering

AWS Docker CI/CD GitHub Actions Git