I've been working on an ML model/system side project for a few weeks. 95% of the time is spent manipulating data. 95% of the performance gain is optimizing I/O when manipulating large datasets.

The model itself, took maybe two hours of trial and error.