Data Science Software resources. While it's important to code your own implementations to gain a deeper understanding of the algorithm, we are most likely going to use publicly available implementations that have been heavily optimized and tested. Please aggregate software resources you've found from github or other websites with a description. Please make a new post for each software resources so that potential users can ask questions.
There are thousands of papers in data science (a few dozen are submitted to arXiv daily) so lets try not to overload. Please only post papers that are particularly meaningful, provide good summaries of the field, you wish to discuss or are relevant to your project.