Roberto Villafuerte
Data Architect · Privacy & Data Governance · HealthTech
Summary
Lead Data Architect with a security and privacy-by-design mindset, building scalable data foundations for HealthTech and analytics-driven products. Focused on data modeling, governance, and secure data handling — quality, integrity, access control, auditability — to deliver actionable insights without exposing sensitive information. Background includes Master's-level training in Trustworthy ML, Big Data Platforms, and Software Architecture at the University of Helsinki, plus hands-on delivery of production data and ML pipelines.
Experience
Lead Data Architect
SaliHub · Hybrid, Quito
- Leading the design and governance of SaliHub's data architecture to support a scalable preventive health ecosystem
- Architected the founding data model and core database schema from scratch, translating complex LOPD privacy requirements and business logic into a secure, multi-tenant architecture
- Established Privacy-by-Design and defined the organization's data governance framework, implementing strict access controls and audit trails to handle sensitive health data (PII/PHI)
- Building scalable data infrastructure with robust ETL pipelines using Python and SQL to transform raw health signals into longitudinal user insights and real-time dashboards
- Partnering with Product and Engineering to translate business goals into measurable data capabilities
AI Engineer
COMPUMAX · Part-time, Remote
- Built and deployed a production-grade RAG pipeline to automate support workflows with high reliability
- Designed end-to-end backend services in Python from prototype to deployment
- Implemented retrieval quality evaluation and continuous improvement workflows
- Added monitoring and observability practices to support safe releases and maintenance
Machine Learning Engineer Intern
Datalysis Group · Remote
- Designed and implemented an end-to-end sales forecasting system using Snowflake, Mage AI, and Python — from raw transactional data to automated daily predictions for branch managers
- Built data cleaning, feature engineering, model training, and evaluation workflows following CRISP-DM principles
- Improved reliability with validation checks, schema conventions, and documentation for reproducibility
Education
B.Eng. Computer Science
Universidad San Francisco de Quito (USFQ)
Focus: software engineering, algorithms, databases, applied machine learning.
Exchange Studies — Data Science (Master's-level)
University of Helsinki
Coursework: Big Data Platforms, Computer Vision, Engineering of ML Systems, Trustworthy ML, Software Architecture.
Certifications
Technical AI Safety Course
BlueDot Impact
General Data Protection Regulation (GDPR)
Packt
IBM Machine Learning Specialization
Coursera
Awards
2nd Place — IEEEXtreme 18.0 (Ecuador)
Global 24-hour algorithmic programming competition organized by IEEE
Languages
English
Native / BilingualSpanish
Native / Bilingual