Projects

A showcase of my technical projects and explorations. Each project represents a learning journey and problem-solving experience.

Automatic Music Transposition

September 2023 - December 2023

A collaborative project focused on developing a specialized application that leverages transformer-based machine learning models and advanced waveform analysis for automatic music transcription from audio files. The system converts complex audio recordings into structured musical notation with high accuracy.

The application implements the MT3 framework (TensorFlow) and Score Transformer to transcribe user-selected audio files into MusicXML format with the option to export as MIDI. The user interface and file handling components were built using the JUCE framework in C++, providing a seamless experience across different operating systems.

Technologies: Python (TensorFlow, NumPy, Librosa), C++ (JUCE), MusicXML, MIDI, Transformer Neural Networks, Audio Signal Processing

Note: The source code repository is not publicly available due to copyright concerns related to the project's implementation.

View Project Poster

Automatic Enhancement of Fashion Imagery

January - April 2024

An advanced image processing application that automatically enhances fashion product images, particularly from platforms like Grailed.com. The system implements a comprehensive enhancement pipeline with adaptive processing capabilities that analyze each image to determine optimal parameters based on lighting conditions, contrast levels, noise presence, and detail complexity.

Key features include high-quality Lanczos upscaling (4x by default), detail enhancement through high-pass filtering and unsharp masking, color enhancement with LAB color space processing, contrast improvement using CLAHE (Contrast Limited Adaptive Histogram Equalization), shadow and highlight recovery for better dynamic range, noise reduction using non-local means denoising, and background removal using the U2Net deep learning model. The application includes both a user-friendly graphical interface and command-line functionality for batch processing with parallel execution capabilities.

The enhancement pipeline consists of multiple stages: image analysis for parameter optimization, pre-processing with white balance correction, resolution enhancement via Lanczos algorithm, color/contrast enhancement in LAB color space, detail enhancement through frequency domain manipulation, background processing with edge feathering, and post-processing with auto-cropping based on content analysis. The system employs adaptive configuration that tailors processing parameters to each image's unique characteristics.

Technologies: Python (100%), OpenCV, NumPy, Pillow, rembg (U2Net), Adaptive Image Processing, LAB Color Space Manipulation, Frequency Domain Filtering, Non-local Means Denoising, CLAHE, Parallel Processing

View on GitHub

View Paper

Automoni - Automated System Monitoring

October 2024 - December 2024

A specialized tool for automatic monitoring of Grailed marketplace listings with push notifications to iOS devices. Automoni was designed to track new product listings and price changes on the Grailed fashion marketplace, alerting users to potential deals in real-time.

The system features sophisticated web scraping that monitors Grailed listings without triggering anti-bot measures, delivering instant iOS notifications when items matching user-defined criteria appear. This provides fashion enthusiasts with a competitive edge in securing limited-edition or highly sought-after items.

Technologies: Go (52.7%), Python (47.3%), Web Scraping, iOS Notifications, Automated Monitoring

View on GitHub

Interactive Data Dashboards

February 2025

A practice project focused on creating interactive dashboards from API calls. This project demonstrates my ability to work with Go for backend services while creating functional, interactive dashboards that consume API data.

The dashboards transform raw API data into intuitive visualizations with real-time filtering, drill-down capabilities, and customizable views that help users explore data patterns and make data-driven decisions.

Technologies: Go (96.5%), Shell (3.5%), Frontend/Backend Architecture

View on GitHub

VCT Scoreboard Matrix

September 2024 - Ongoing

A computer vision project focused on extracting game data from VALORANT Champions Tour (VCT) esports broadcasts (Barracks UI) using OCR and object detection techniques. The system automatically identifies and captures UI elements from the game's esports broadcast interface to build a comprehensive database of match statistics.

The project employs sophisticated image processing algorithms to detect weapon icons, player statistics, and team compositions in real-time. By automating the data extraction process, it creates a structured dataset that enables advanced analytics of professional VALORANT matches without manual data entry.

Technologies: Python (99.5%), Jupyter Notebook (0.5%), Computer Vision, OCR, Object Detection, Image Processing

View on GitHub