Data science projects consist of code, data and algorithmic ideas. If these three factors play together there is a tendency that your projects ends up being a total mess.
This article describes csvmodel, a validation tool for small to medium size csv files
Is there really anything like that? Rather some really nice ideas.