π Hola, Iβm a Data Scientist | Analyst π§ π
π Nairobi, Kenya | π§ bonsoul24@gmail.com | π +254 700 015600
π About Me
From data to decisions β powering progress through intelligent insights.
Iβm a results-oriented Data Scientist passionate about transforming data into tools for innovation and sustainable impact. With hands-on experience across public health, logistics, and research, I specialize in machine learning, data visualization, and statistical modeling. Whether itβs predicting outbreaks or improving business decisions, I build data solutions that empower people and organizations alike.
π§ Skills
π§° Technical
Python Β· R Β· SQL Β· C++ Β· Power BI Β· Tableau Β· TensorFlow Β· PyTorch Β· Scikit-learn Β· Keras Β· PostgreSQL Β· Azure AI Studio Β· Kobo Collect
π Analytical
Statistical Modeling Β· Forecasting Β· Hypothesis Testing Β· Data Visualization Β· A/B Testing Β· Time Series Β· Epidemiological Analytics Β· Data Modelling
π€ Soft Skills
Problem Solving Β· Critical Thinking Β· Business Acumen Β· Communication Β· Teamwork Β· Leadership Β· Presentation Β· Data Storytelling
π§ͺ Projects and Case Studies
1. Statistical Modeling & Health Analytics
Cholera Surveillance & Forecasting (Kenya)
Conducted time series analysis on cholera case data across Kenyan counties, integrating environmental and demographic variables. Built an interactive dashboard and a seasonal ARIMA model in R to forecast future outbreaks, improving preparedness and reducing regional response time by up to 30%.
HIV/AIDS Transmission Risk Modeling
Investigated HIV/AIDS transmission patterns using national health datasets. Employed logistic regression and chi-square tests to analyze the relationship between socio-demographic factors (e.g., education levels, marital status) and infection risk, revealing statistically significant disparities across counties. Results informed targeted community-based interventions.
Quality of Pediatric Clinical Assessment β Nyeri, Kenya
This study aimed to assess the quality of clinical assessment provided to sick children aged 2β59 months in primary health facilities in Nyeri County. It examined both the clinical care delivered and caregiver satisfaction, while exploring the relationship between the two. Using structured facility assessments and caregiver interviews, the study applied descriptive statistics, cross-tabulations, and logistic regression to evaluate the quality of care and identify key factors influencing caregiver perceptions. The insights generated informed recommendations for enhancing pediatric service delivery and strengthening caregiver engagement in rural healthcare settings.
2. Machine Learning Projects
Malaria Detection Using CNNs
Built a deep learning model using Convolutional Neural Networks (CNNs) to classify infected vs. uninfected blood cells from microscope images. Applied image augmentation and performance tuning to achieve high accuracy for early disease diagnosis.
Cotton Disease Detection (PyTorch)
Developed a classification model using PyTorch to detect plant diseases from leaf images. Applied transfer learning with ResNet, improving accuracy by over 20% after optimizing preprocessing and hyperparameters.
Content-Based Recommendation System
Built a recommendation engine using Python to suggest products based on user behavior and preferences. Integrated cosine similarity and TF-IDF vectorization to personalize suggestions for e-commerce users.
π 3. Data Analysis & Dashboarding
NGO Reporting System & Power BI Dashboard
Designed and implemented a comprehensive PostgreSQL database to centralize program monitoring data for a local NGO. Built end-to-end ETL pipelines and connected the database to Power BI to automate reporting. Developed a dynamic dashboard to track KPIs, visualize program reach, and generate donor-ready impact reports β significantly improving reporting efficiency and transparency.
Customer Churn Analysis (Telco)
Analyzed telecom customer behavior using Pandas, NumPy, and Seaborn to identify churn drivers. Trained and tested models (Logistic Regression, Decision Trees) achieving 85% accuracy. Developed a real-time Power BI dashboard to monitor churn trends, enabling targeted customer retention strategies that reduced churn by 15%.
Sales Performance Dashboard
Built a dynamic sales dashboard in Power BI that tracked revenue, conversion rates, and sales rep performance across regions. Integrated data from Excel, SQL, and CRM systems. Delivered actionable insights that led to a 15% increase in regional sales performance within 3 months.
πΌ Experience
π¬ Research Analyst β Motiri Consultants π Nairobi | Mar 2025 β Present
Led modeling and analysis for cholera and HIV/AIDS surveillance.
Built time series models and visual dashboards for outbreak forecasting (β preparedness by 30%).
π» Freelance Data Scientist β Upwork π Remote | Jan 2024 β Feb 2025
Built ML models (classification, recommendation) with Python & Scikit-learn.
Developed automated pipelines using PostgreSQL and deployed dashboards in R.
π€ ML Intern β Technohacks Education π Remote | Jun 2024 β Sept 2024
Preprocessed and structured data for TensorFlow/Keras models.
Tuned parameters and boosted model accuracy by 10%.
π Data Analyst β Samburu Awareness Action Program π Samburu | Apr 2023 β Jan 2024
Designed centralized data systems with Kobo Collect + SQL.
Improved program outcomes by 25% via data-informed strategies.
π Supply Chain Analyst Intern β Sendy Logistics π Nairobi | May 2022 β Sept 2022
Built Excel forecasts reducing excess inventory by 8%.
Maintained Tableau dashboards for real-time insights.
π Education
BSc. Statistics, Computing & IT - Cooperative University of Kenya
ALX Data Science Program - Python Β· SQL Β· Data Viz Β· ML Β· Power BI
Udemy Certifications - Machine Learning, Analytics, Storytelling, Data Engineering
ποΈ Leadership & Volunteering
Global Citizens Challenge (Team Lead): Led a diverse team in global sustainability challenges.
Menβs Book Club (Moderator): Managed sessions, facilitated critical discussions, promoted learning.