Arani Bosire (AB)

My Portfolio

👋 Hola, I’m a Data Scientist | Analyst 🧠📊

📍 Nairobi, Kenya | 📧 bonsoul24@gmail.com | 📞 +254 700 015600

🌟 About Me

From data to decisions: powering progress through intelligent insights.

I’m a results-oriented Data Scientist passionate about transforming data into tools for innovation and sustainable impact. With hands-on experience across public health, logistics, and research, I specialize in machine learning, data visualization, and statistical modeling. Whether it’s predicting outbreaks or improving business decisions, I build data solutions that empower people and organizations alike.

🧠 Skills

🧰 Technical

Python · R · SQL · C++ · Power BI · Tableau · TensorFlow · PyTorch · Scikit-learn · Keras · PostgreSQL · Azure AI Studio · Kobo Collect

📊 Analytical

Statistical Modeling · Forecasting · Hypothesis Testing · Data Visualization · A/B Testing · Time Series · Epidemiological Analytics · Data Modelling

🤝 Soft Skills

Problem Solving · Critical Thinking · Business Acumen · Communication · Teamwork · Leadership · Presentation · Data Storytelling

🧪 Projects and Case Studies

1. Statistical Modeling & Health Analytics

Cholera Surveillance & Forecasting (Kenya)

Conducted time series analysis on cholera case data across Kenyan counties, integrating environmental and demographic variables. Built an interactive dashboard and a seasonal ARIMA model in R to forecast future outbreaks, improving preparedness and reducing regional response time by up to 30%.

HIV/AIDS Transmission Risk Modeling

Investigated HIV/AIDS transmission patterns using national health datasets. Employed logistic regression and chi-square tests to analyze the relationship between socio-demographic factors (e.g., education levels, marital status) and infection risk, revealing statistically significant disparities across counties. Results informed targeted community-based interventions.
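To illustrate the chi-square test of independence mentioned above, here is a minimal sketch in plain Python. The function name `chi_square_statistic` and the counts in `toy_table` are hypothetical and chosen only for illustration; they are not the project's data or code.

```python
def chi_square_statistic(table):
    """Return (statistic, degrees_of_freedom) for a contingency table.

    `table` is a list of rows of observed counts, e.g. rows = groups,
    columns = outcome categories.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(col_totals) - 1)
    return statistic, dof


# Hypothetical 2x2 table: rows are two demographic groups,
# columns are (positive, negative) test results.
toy_table = [[30, 70], [20, 80]]
stat, dof = chi_square_statistic(toy_table)
print(f"chi-square = {stat:.3f}, degrees of freedom = {dof}")
```

The statistic is then compared against a chi-square critical value for the given degrees of freedom (about 3.84 at the 5% level with one degree of freedom) to decide whether the association is statistically significant.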

Quality of Pediatric Clinical Assessment – Nyeri, Kenya

This study aimed to assess the quality of clinical assessment provided to sick children aged 2–59 months in primary health facilities in Nyeri County. It examined both the clinical care delivered and caregiver satisfaction, while exploring the relationship between the two. Using structured facility assessments and caregiver interviews, the study applied descriptive statistics, cross-tabulations, and logistic regression to evaluate the quality of care and identify key factors influencing caregiver perceptions. The insights generated informed recommendations for enhancing pediatric service delivery and strengthening caregiver engagement in rural healthcare settings.

2. Machine Learning Projects

Malaria Detection Using CNNs

Built a deep learning model using Convolutional Neural Networks (CNNs) to classify infected vs. uninfected blood cells from microscope images. Applied image augmentation and performance tuning to achieve high accuracy for early disease diagnosis.

Cotton Disease Detection (PyTorch)

Developed a classification model using PyTorch to detect plant diseases from leaf images. Applied transfer learning with ResNet, improving accuracy by over 20% after optimizing preprocessing and hyperparameters.

Content-Based Recommendation System

Built a recommendation engine using Python to suggest products based on user behavior and preferences. Integrated cosine similarity and TF-IDF vectorization to personalize suggestions for e-commerce users.
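The idea behind combining TF-IDF vectorization with cosine similarity can be sketched in plain Python as follows. This is an illustrative toy under stated assumptions, not the project's implementation: the helper names (`tfidf_vectors`, `cosine_similarity`, `recommend`), the smoothed IDF formula, and the four-item `catalog` are all hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(documents):
    """Turn each text into a sparse {term: weight} TF-IDF vector."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Number of documents in which each term appears at least once.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            # Term frequency times a smoothed inverse document frequency.
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(query_index, documents, top_n=2):
    """Return indices of the items most similar to documents[query_index]."""
    vectors = tfidf_vectors(documents)
    scores = [
        (cosine_similarity(vectors[query_index], vec), idx)
        for idx, vec in enumerate(vectors)
        if idx != query_index
    ]
    scores.sort(reverse=True)
    return [idx for _, idx in scores[:top_n]]


# Hypothetical product descriptions.
catalog = [
    "red cotton shirt",
    "blue cotton shirt",
    "wireless bluetooth headphones",
    "noise cancelling wireless headphones",
]
print(recommend(0, catalog, top_n=1))
```

Items that share distinctive terms with the query item score higher, so a shirt is matched with the other shirt rather than with headphones.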

📊 3. Data Analysis & Dashboarding

NGO Reporting System & Power BI Dashboard

Designed and implemented a comprehensive PostgreSQL database to centralize program monitoring data for a local NGO. Built end-to-end ETL pipelines and connected the database to Power BI to automate reporting. Developed a dynamic dashboard to track KPIs, visualize program reach, and generate donor-ready impact reports, significantly improving reporting efficiency and transparency.

Customer Churn Analysis (Telco)

Analyzed telecom customer behavior using Pandas, NumPy, and Seaborn to identify churn drivers. Trained and tested models (Logistic Regression, Decision Trees) achieving 85% accuracy. Developed a real-time Power BI dashboard to monitor churn trends, enabling targeted customer retention strategies that reduced churn by 15%.
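As a rough illustration of the logistic regression step, the sketch below fits a tiny classifier with batch gradient descent in plain Python. Everything in it is an assumption made for illustration: the two features (tenure in years, number of support calls), the six toy customers, and the function names. It does not reproduce the project's data, pipeline, or reported accuracy.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic_regression(features, labels, learning_rate=0.1, epochs=2000):
    """Fit weights and bias by batch gradient descent on the log-loss."""
    n_samples = len(features)
    n_features = len(features[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(features, labels):
            prediction = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            error = prediction - y
            for j, xi in enumerate(x):
                grad_w[j] += error * xi
            grad_b += error
        weights = [w - learning_rate * g / n_samples for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / n_samples
    return weights, bias


def predict(weights, bias, x):
    """Return 1 (churn) if the predicted probability is at least 0.5, else 0."""
    return 1 if sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias) >= 0.5 else 0


def accuracy(weights, bias, features, labels):
    correct = sum(predict(weights, bias, x) == y for x, y in zip(features, labels))
    return correct / len(labels)


# Hypothetical customers: [tenure in years, support calls]; label 1 = churned.
X = [[0.5, 5], [1.0, 4], [0.8, 6], [4.0, 1], [5.0, 0], [3.5, 1]]
y = [1, 1, 1, 0, 0, 0]

weights, bias = train_logistic_regression(X, y)
print("training accuracy:", accuracy(weights, bias, X, y))
```

In practice a library such as Scikit-learn would handle the fitting, and accuracy would be measured on a held-out test set rather than on the training data as in this toy.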

Sales Performance Dashboard

Built a dynamic sales dashboard in Power BI that tracked revenue, conversion rates, and sales rep performance across regions. Integrated data from Excel, SQL, and CRM systems. Delivered actionable insights that led to a 15% increase in regional sales performance within 3 months.

💼 Experience

🔬 Research Analyst – Motiri Consultants 📍 Nairobi | Mar 2025 – Present

Led modeling and analysis for cholera and HIV/AIDS surveillance.

Built time series models and visual dashboards for outbreak forecasting (↑ preparedness by 30%).

💻 Freelance Data Scientist – Upwork 🌍 Remote | Jan 2024 – Feb 2025

Built ML models (classification, recommendation) with Python & Scikit-learn.

Developed automated pipelines using PostgreSQL and deployed dashboards in R.

🤖 ML Intern – Technohacks Education 🌍 Remote | Jun 2024 – Sept 2024

Preprocessed and structured data for TensorFlow/Keras models.

Tuned parameters and boosted model accuracy by 10%.

📈 Data Analyst – Samburu Awareness Action Program 📍 Samburu | Apr 2023 – Jan 2024

Designed centralized data systems with Kobo Collect + SQL.

Improved program outcomes by 25% via data-informed strategies.

🚚 Supply Chain Analyst Intern – Sendy Logistics 📍 Nairobi | May 2022 – Sept 2022

Built Excel forecasts reducing excess inventory by 8%.

Maintained Tableau dashboards for real-time insights.

📚 Education

BSc. Statistics, Computing & IT - Cooperative University of Kenya

ALX Data Science Program - Python · SQL · Data Viz · ML · Power BI

Udemy Certifications - Machine Learning, Analytics, Storytelling, Data Engineering

🎖️ Leadership & Volunteering

Global Citizens Challenge (Team Lead): Led a diverse team in global sustainability challenges.

Men’s Book Club (Moderator): Managed sessions, facilitated critical discussions, promoted learning.