Collaboration How to make your software team much more effective Programming is the closest we have to magic in the real world. Just like magic, you can conjure rich and powerful creations with the power of your own mind, given the right tools and training. Just as powerful magic could lay waste to an
Data Science Make it work, make it right, make it fast I first heard this phrase from Gael Varoquax (one of the core scikit-learn committers) in a talk[1], and I loved the super-concentrated wisdom of it: Make it work. First of all, get it to compile, get it to run, make sure it spits
Collaboration How to reproduce your analysis But first, an exciting announcement! I'm currently hiring for two Machine Learning Engineering roles, both doing interesting work with high-energy startups led by great people on good missions. Scroll to the bottom of this article for more details! How to reproduce your analysis “But
Data Science How to Cheat at Data Science (with help from Centaurs and the Wizard of Oz) What does it take to run a high-performing data science team? In this 15-minute video, I talk about two tools that I’ve found invaluable for doing just that: Wizard-of-ozzing is a technique for product discovery that can be very helpful for evaluating potential
Data Science What makes a great data scientist and data science team? (Podcast) I was recently honoured to be a guest on The Data Pubcast - a podcast about making data accessible to everyone, hosted by the incredibly talented Nick Latocha and Andy Crossley. In the episode we discuss: the spectrum of what "data science" means and
Programming How to write good software faster (we spend 90% of our time debugging) If we spend the majority of our programming time and effort on debugging, we should focus our efforts on speeding up our debugging (rather than trying to write code faster).
Data Science How to know if your recommendations algorithm is actually doing a good job I led the team that built Channel 4’s recommender system for All 4 in 2016. It started out as a straightforward project. But after getting lost in a rabbit hole trying to devise a score for ‘provocativeness’ and ‘serendipity’, I learned the single most important lesson about data science.
Software Why are my emails being sent to spam - A guide for startups using GSuite “This email to an investor was marked as spam! They only noticed it by accident. Why is this happening?”
Software The Difference: Throughput vs Latency Latency and Throughput are important concepts for data scientists. How are they distinct? And why is this distinction important, not just for technical systems, but also for team performance?
Software Technical debt isn't like financial debt I hope you weren’t hoping for a blanket answer to the question of exactly when to pay off technical debt. If you were, the answer is Tuesday. Almost all interesting decisions are about risks and trade-offs. This is a framework for making our intuitions about technical debt more rigorous.