Prosper Ablordeppey


Data Science || Web Development

Projects and works


Published Software Packages

The package implements two iterative techniques called T3Clus and 3Fkmeans, aimed at simultaneously clustering objects and a factorial dimensionality reduction of variables and occasions on three-mode datasets developed by \(\href{https://doi.org/10.1007/s00357-007-0006-x}{\text{Vichi et. al}}\) in 2007. Also, we provide a convex combination of these two simultaneous procedures called CT3Clus and based on a hyperparameter \(\alpha\) (\(\alpha \in [0,1]\), with 3FKMeans for \(\alpha=0\) and T3Clus for \(\alpha=1\)) was also developed in \(\href{https://doi.org/10.1007/s00357-007-0006-x}{\text{Vichi et. al}}\). Furthermore, we implemented the traditional tandem procedures of T3Clus (TWCFTA) and 3FKMeans (TWFCTA) for sequential clustering-factorial decomposition (TWCFTA), and vice-versa (TWFCTA) proposed by \(\href{https://doi.org/10.1007/978-3-642-79999-0_1}{\text{P. Arabie and L. Hubert}}\) in 1996.

source codes publication
Python (Github) PyPI
R (Github) R-CRAN

Data Analytics

Thesis Summaries

In this study, we conducted a case study on improving the National Health Insurance Scheme (NHIS) coverage in the Sunyani municipality. We used the Contingent Valuation Method (CVM), a technique for estimating the Willingness To Pay (WTP) for non-market goods, to determine individuals' willingness to pay for an extension of NHIS coverage. We also employed Binary Logit Regression analysis to identify factors that influence individuals' decisions to pay for an improved NHIS, and included a social interaction term in the model to measure the impact of other individuals' views on an individual's decision. Our research results showed that a majority of NHIS holders who purchase additional drugs when visiting a health facility were willing to pay for an extension of NHIS coverage. The most costly medical conditions for which respondents usually purchased additional drugs and were willing to pay an additional premium for inclusion in NHIS coverage were rehabilitation conditions (such as vision, hearing, and dental issues), cancer, and heart-related surgeries. On average, respondents were willing to pay no more than GH₵30.00 per year to cover these conditions. We recommended that policymakers or planners consider income, gender, and level of education when making NHIS coverage improvement decisions, as these variables were found to be the most significant in our model.

In this work, we developed and implemented five models, TWCFTA, TWFCTA, T3Clus, 3FKMeans, and CT3Clus, in the simuclustfactor packages in Python and R for the analysis of three-mode datasets. These models were proposed by \(\href{https://doi.org/10.1007/s00357-007-0006-x}{\text{Vichi et al.}}\) in 2007 as an alternative to traditional tandem models for simultaneously clustering objects and reducing the dimensionality of variables and occasions in three-mode datasets. T3Clus and 3Fkmeans are iterative techniques that apply the Tucker2 algorithm and K-means algorithm sequentially, while CT3Clus is a convex combination of these two methods with a hyperparameter alpha \((0 \leq \alpha \leq 1)\) that allows for the interpolation between the two. In contrast, tandem analysis only involves the sequential application of clustering and factorial methodologies. We tested these simultaneous models on synthetic and real datasets and found that they produced more well-separated clusters with higher cohesion within clusters, and were better at recovering object-cluster, variable/occasion-factor associations compared to the tandem models. The best results in terms of well-separated clusters with high cohesion within the clusters were obtained when the factorial analysis was prioritized, minimizing the within-cluster deviation of the component scores (i.e. WSS in the reduced space).

About


Masters
  • University of Aveiro

    (Aveiro, Portugal)

    MSc. Mathematics & Applications

    August 2021 - Sept. 2022

    core courses :

    • Advanced Algorithms
    • Mathematical Techniques For Big Data
    • Exploratory Data Analytics
    • Seminar In Mathematics & Applications
  • University of L'Aquila

    (L'Aquila, Italy)

    MSc. Mathematical Engineering

    July 2020 - July 2021

    core courses :

    • Data Analytics
    • Machine Learning & Neural Networks
    • Cryptography
    • Differential Equations
    • Numerical Methods
    • Optimization
    • Parallel Computing
Undergraduate
  • University of Energy and Natural Resources

    (Sunyani, Ghana)

    BSc. Actuarial Science

    August 2019 - September 2020

    core courses:

    • Computing
    • Descriptive & Inferential Statistics 
    • Credibility & Loss Theory
    • Insurance & Risk Management
    • Stochastic Processes
    • Differential Equations
    • Multivariate Calculus
    • Vectors & Mechanics

    supervisor:

High School
  • Achimota Senior High

    (Accra, Ghana)

    General Science

    August 2011 - September 2014

    core courses:

    • Physics
    • Biology
    • Chemistry
    • Mathematics
Python Software Foundation
  • Python Software Foundation (PSF)

    (Sunyani, Ghana)

    Team Lead

    June 2019 - Date

    core activities:

    • Web2py and Django.
    • TKinter desktop app dev.
    • Data Analytics (Numpy, Pandas, ScikitLearn)
    • Neural Nets (Tensorflow, Keras)
    • Front-end development (HTML, CSS, Javascript, Bootstrap)
National Service
  • University of Energy & Natural Resources

    (Sunyani, Ghana)

    Teaching Assistant

    September 2019 - September 2020

    core activities:

    • Provide general assistance and support for my lecturer.
    • Assist in assessments of students, invigilation of quizzes and mid-semester exams.
Attachment
  • New Fountain International School

    (Tema, Ghana)

    Pupils Tutor

    2014 - 2015

    core activities:

    • Class teacher of primary 3.
Conferences & Presentations
  • IFCS 2022

    (University of Porto, Portugal)

    17th Conference of the International Federation of Classification Societies

    July 2022

    Contributed session

Machine Learning Workshop
  • Machine Learning in Science

    (University of Lisbon, Portugal)

    Workshop of Machine Learning in Science

    June 2022

    participant

Programming Languages
    • Python,
    • R,
    • Typescript,
    • Visual Basic,
    • Google Script,
    • Javascript,
    • LaTeX,
    • SQL,
    • HTML,
    • CSS
Data Analytics (Python)
    • Numpy,
    • Pandas,
    • Scipy,
    • Tensorly,
    • Scikit Learn,
    • Tensorflow,
    • Keras
Web Development
    • Node,
    • Angular,
    • Web2py,
    • Django,
    • ASP.NET,
    • Web3,
    • SQLite,
    • MySQL
Software Apps
    • SPSS,
    • Tableau,
    • MS Excel,
    • Google Sheet,
    • PowerBI,
    • MS Word,
    • TexStudio,
    • Bootstrap Studio,
    • Visual Studio,
    • Wireframe Sketcher,
    • Open Bullet
    • Github,
    • PyCharm,
    • Anaconda
Computing
    • General Programming,
    • Algorithm Optimization,
    • Webscraping,
    • Selenium,
Interests
    • Full-Stack Web Development, 
    • Clustering, 
    • Classification,
    • Predictive Analytics,
    • Big Data, 
    • Factorial Analysis,
    • Three Mode Tensor Analysis,
    • Machine Learning & AI

certificatES & licences


Coursera
Neural Nets & Deep Learning
Linear Algebra for Machine Learning

LinkedIn
Power BI Essential Training
Python for Data Visualization
AI & Business Strategy: Case Studies

Conferences & Seminars
Machine Learning in Science
17' IFCS Abstract Participation

Others
Ocean 101