I'm a data professional with a wide range of experience in different types of modelling and software engineering practices. I originally specialised in recommender systems, but I have more recently followed my interest in other topics, including geospatial data and computer vision
I'm currently based in Edinburgh, UK 🇬🇧
I'm working with genetic algorithms and machine learning to create tools that enhance the design of large-scale linear infrastructure projects - most noteably electricity transmission lines, whose fast development is crucial for the transition to renewable energy.
I worked with a novel retinal image dataset to develop deep learning models for the early detection of Age-related Macular Degeneration, Glaucoma and other retinal diseases.
I worked on building a product recommendation system from the ground up. My work spanned a range of ML areas, from EDA and model development, to CI/CD and productionising model workflows.
Led a team of 4 in the design and execution of data science and data engineering work, focusing on development of existing and new recommendations models - whilst still writing production code. Achievements:
Responsible for developing in-house recommendation algorithms and ensured they delivered good results in production. Achievements:
Worked on a variety of different projects as one of the first members of a new Data Science team, and gained exposure to a number of different business areas and modelling techniques, including:
Kubrick provided 4 months of core training in data problem solving along with the key technologies needed to succeed as a data scientist:
I have volunteered on multiple DataDives - which help charities extract value from their data using data science techniques. I also helped to organise several of these events as a Data Ambassador - by framing key analytical questions, cleaning and documenting data, and liaising with charity representatives to enable volunteers to develop good solutions.
Helped teach data science & machine learning foundations to a group of beginners. I taught two sessions, on Exploratory Data Analysis and Clustering.
• Python
• ML libraries - sklearn, tensorflow, pytorch
• SQL - Redshift, Bigquery, MSSQL, Teradata, dbt
• Cloud Platforms - AWS & Google Cloud
• Git
• Docker
• CI/CD - Gitlab, Jenkins, Github Actions
• ML Workflows - Metaflow & AWS Sagemaker
• Terraform
• Airflow