View on GitHub

Arani Bosire(AB) - DATA SCIENTIST

My Portfolio

Data Scientist

đź“Ť Nairobi, Kenya

I have over three years of experience working with data, helping organizations make informed decisions by extracting actionable insights from raw data and building data products(dashboards,predictive models, recomendation models as well).I have a diverse background across various roles, allowing me to adapt and contribute effectively. My experience spans machine learning, data modeling, analysis, dashboards, and visualization—skills that align well with this role’s demands.As an experienced communicator and collaborator, I thrive in team environments. Beyond work, I’m passionate about farming, hiking, and reading classic literature.

đź’» Skilled: Machine & Deep Learning, Statistical & Data Analysis, Data Modelling & Visualization

Technical : Python, R, C++,SQL

Education

Bachelor of Statistics and Information Technology | ALX - Data Science | Udemy

Work Experience

Data Scientist | Upwork Machine Learning Models, Python, SQL Jan 2023 – Present
Data Verifier Lead | Selistar Africa Data Verification, Quality Assurance, Team Training Oct 2024 – Dec 2024
Machine Learning Intern | Technohacks TensorFlow, Keras, PyTorch, Azure AI Studio June 2024 – Sept 2024
Data Analyst |MEAL Samburu Awareness Action Program SQL, Kobo Collect, Power BI Sept 2023 – Jan 2024
Supply chain Analyst Intern | Inventory Management | Sendy Logistics Excel, Tableau, Report Writing May 2022 – Sept.2022

Projects

Malaria Detection Using Deep Learning

Developed an automated approach for malaria detection by analyzing microscopic blood cell images through deep learning techniques. Leveraged a data-driven strategy to identify parasitized cells, significantly enhancing diagnostic capabilities for early malaria detection.Implemented Convolutional Neural Networks (CNNs) to classify blood cells as infected or uninfected. The system utilized a labeled dataset of cell images, followed by preprocessing steps like image augmentation and normalization to optimize model performance. The model was evaluated using metrics such as accuracy, precision, recall, and F1-score to ensure robustness.

Churn Prediction Graph

Customer Churn- Machine Learning

Developed a customer churn model for a telecom company using machine learning. Data was cleaned and preprocessed with Pandas and NumPy, and key churn factors like contract length, call duration, and billing methods were explored through Seaborn and Matplotlib. Logistic Regression and Decision Trees were applied, achieving 85% accuracy with Decision Trees. Power BI dashboard to monitor churn in real-time, helping the company implement targeted retention strategies that reduced churn by 15%.

Reward Program Dashboard

I developed a Reward Program Dashboard by cleaning and merging multiple data sources to build a comprehensive dataset, removing duplicates and standardizing formats. I implemented an automated point allocation system that calculates points based on session participation, duration, and mentee engagement with transparent documentation. Finally, I created an interactive Power BI dashboard to visualize key insights, enabling data-driven recommendations that optimized the reward structure and boosted user retention.

KIVA Loans Analysis & Modelling

Analyzed the relationship between poverty levels and microloan distribution in Kenya using Kiva microfinance data. Utilized statistical modeling and geospatial mapping in R to assess loan effectiveness across counties. Identified disparities in loan distribution, highlighting gaps in financial access for rural communities. Provided insights on strategic microfinance allocations to enhance poverty alleviation efforts.