HELLO, I'M PEDRO

Data Scientist & Statistician

Data Worker who transforms complex data into clear, actionable insights through visualization, modeling, and evidence-based storytelling. I am passionate about using data to understand people, systems, and real-world challenges. My goal is to make insights accessible, meaningful, and useful for better decision-making.

Skills

Business & Metrics
  • Business Problem Interpretation, Metric Design
  • Survey Design, Program Impact Evaluation
ML / Modeling
  • Predictive Modeling, Machine Learning, Generative AI
  • Multivariate Statistical Methods
Languages
  • Python, R, SQL, C, C++
Data & Ops
  • Data Pipelines, Automated Reporting
  • Azure, AWS, Docker
  • SPSS, SAS, Stata
Visualization & Geo
  • Analytical Storytelling, Data Visualization
  • Power BI, Google Data Studio
  • QGIS, ArcGIS
Tools
  • VSCode, Positron, Colab, JupyterLab
  • Microsoft Office, OpenOffice

01. Sobre Mi

Data Scientist and Statistician with 12+ years of experience leveraging large datasets to generate clear, actionable insights that support strategic decision-making.

Strong expertise in statistical sampling, multivariate analysis, machine learning, predictive modeling, big data, and data visualization, with advanced command of modern analytical tools and the adaptability to work effectively across industries and analytical environments.

02. Portfolio

Bogota Road Safety preview

Data Science & Viz: Bogotá Road Casualty Yearbook (2017–2022)

Designed indicators and ggplot2 visuals for Bogotá’s 2017–2022 road casualty yearbooks, enabling Vision Zero actions.

Road Safety Data Analysis Public Health Policy R ggplot2
ClusterCarac preview

ClusterCarac: Clusters & Classifications

R package that surfaces what defines each cluster or category; retro-inspired characterization preview.

Social studies Market research R FactoClass EDA Characterization
FactoClass preview

FactoClass: letting data reveal their own structure

Early Statistics work: factorial + clustering pipeline (Ward/K-means) inspired by Lebart, guided by Campo Elías Pardo.

Statistics DataScience Multivariate Analysis Clustering RStats Applied Statistics