Started watching the Machine Learning series from the MIT Open Courseware offerings. Incredible source of free coursework. This is from last fall and is part of MIT infamous “course 6” Introduction to Computational Thinking and Data Science. I avoided course 6 when I was at MIT Sloan School – too theoretical. It is far more relevant today.
I have looked at a good 50 books on machine learning and see few that seriously deal with Testing Driven development and Machine learning. Given the vary nature of machine learning, the lack of TDD seems to be a problem waiting to happen. Having grown up in the world of credit risk modeling, I know the level of scrutiny we had at every level of model development deployment and monitoring. More recently I dived into full-stack web development with a rather simple web app that is backed by 1400 tests in a continuous integration environment. It is a challenge keeping the environment and test suite in sync at times, but the foundation principles of creating single responsibility classes and overtly spelling out acceptance criteria at the most basic level (unit tests) is a critical discipline.