Being Glue. Testing for "Not Being Worse". Hyper-Scale Cloud Growth. Fast-Response Object Detection. [DSR #186]

❤️ Want to support this project? Forward this email to three friends!

🚀 Forwarded this from a friend? Sign up to the Data Science Roundup here.

Tristan’s note: No Roundup next week! I’ll be on a (long-overdue) vacation with my family. If you’re located in the Denver area and have any favorite hikes to recommend, shoot me a note!

This week's best data science articles

Being Glue

This talk has become hugely influential: it was passed around inside both Google and Squarespace and is now making its way through the wider software engineering world. Here’s the author, Tanya Reilly, on its genesis:

Being Glue originated as a comment on an internal Google+ post when I worked at Google. I’d used the expression “glue work” in passing, and someone asked what I meant by it. The reply became a standalone post and then an internal document (which as far as I know is still being circulated).

“Glue work” is the term Tanya coined for the work that is critical on medium-to-large technical teams but often goes under-appreciated (and under-promoted): communication, planning, project management, documentation… Recognizing the value of this work is critical both for the success of teams and for the career paths of the people who do it (disproportionately often women).

This effect is not isolated to software engineering. Data is now a technical field, and as we start to figure out the (still in flux!) career paths for our own roles this will be an increasingly important topic.

Highly recommended.


Susie Lu


Early prototypes of reviziting the receipt, one piece of a larger question I want explore: how can viz be integrated into everyday experiences?

3:07 PM - 4 May 2019

StitchFix: Suffering from a Non-inferiority Complex?

What if you don’t need version B to be better than version A?

This is an amazing post that I’m surprised I’ve never seen written before. It goes fairly deep into the math, but you don’t need to follow it there—the most important part is building the intuition.

Most A/B tests are in the service of conversion optimization: making your website push users to achieve some quantifiable goal more effectively. We therefore want to set up a statistical test to conclude that the new version is superior to the old version. But there are many instances where what you want to do is prove that the new version is no worse than the old version. This is not covered in the standard “Implement Optimizely and go to town” playbook.

If you’ve ever been involved in A/B testing, you’ll likely have run across these scenarios. I have, often. This is the best post I’ve ever seen outlining how to effectively construct a test for them.
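To build that intuition, here is a minimal sketch of one common way to frame a "not worse" test for conversion rates: a one-sided z-test against a tolerance margin. This is my own illustration, not necessarily the construction the StitchFix post uses. The function name, the unpooled standard error, and the example numbers are all assumptions.

```python
from math import sqrt
from statistics import NormalDist


def non_inferiority_test(conv_a, n_a, conv_b, n_b, margin, alpha=0.05):
    """One-sided z-test that B is no worse than A by more than `margin`.

    H0: p_b - p_a <= -margin   (B is worse by at least the margin)
    H1: p_b - p_a >  -margin   (B is non-inferior)

    Assumption: a Wald-style test with an unpooled standard error.
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    diff = p_b - p_a

    # Standard error of the difference between two independent proportions.
    se = sqrt(p_a * (1 - p_a) / n_a + p_b * (1 - p_b) / n_b)
    if se == 0:
        raise ValueError("standard error is zero; rates are degenerate")

    # Shift the difference by the margin, then test whether it is above zero.
    z = (diff + margin) / se
    p_value = 1.0 - NormalDist().cdf(z)

    return {
        "diff": diff,
        "z": z,
        "p_value": p_value,
        "non_inferior": p_value < alpha,
    }


# Hypothetical numbers: A converts 1000/10000, B converts 990/10000,
# and we decide up front that a drop of up to 1 percentage point is tolerable.
result = non_inferiority_test(1000, 10_000, 990, 10_000, margin=0.01)
print(result)
```

The key design choice is the margin: it has to be picked before the test runs, and a tighter margin needs a larger sample to reach the same conclusion.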


Adversarial Examples Are Not Bugs, They Are Features

This is fascinating. Adversarial examples (images that have been modified specifically to trick an algorithm, yet that a human cannot tell apart from the original) have always felt interesting to me. Their existence, and the ease with which they can be generated, always seemed to point to something worthwhile. Turns out, that instinct was right. From the paper:

We demonstrate that adversarial examples can be directly attributed to the presence of non-robust features: features derived from patterns in the data distribution that are highly predictive, yet brittle and incomprehensible to humans.
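To give a feel for how easily such examples can be generated, here is a toy sketch of a sign-of-the-gradient perturbation against a tiny linear classifier. This is not the paper's method or experiment. The weights, the input, and the helper names are made up for illustration.

```python
def sign(v):
    """Return -1, 0, or 1 depending on the sign of v."""
    return (v > 0) - (v < 0)


def score(w, b, x):
    """Linear score w.x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def predict(w, b, x):
    """Class 1 if the score is positive, else class 0."""
    return 1 if score(w, b, x) > 0 else 0


def adversarial_example(w, x, label, eps):
    """Nudge every feature by at most eps in the direction that hurts `label`.

    For a linear model the gradient of the score with respect to x is w,
    so the sign of the gradient is simply sign(w).
    """
    push = -1 if label == 1 else 1
    return [xi + push * eps * sign(wi) for wi, xi in zip(w, x)]


# Hypothetical model and input.
w = [0.5, -0.5, 0.5, -0.5]
b = 0.0
x = [0.6, 0.4, 0.6, 0.4]

x_adv = adversarial_example(w, x, predict(w, b, x), eps=0.15)
print(predict(w, b, x), predict(w, b, x_adv))
```

Each feature moves by no more than `eps`, yet the predicted class flips, because many small nudges that all point the "wrong" way add up across features.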


Which cloud computing giant is growing the fastest?

AWS continues to have more of the market than Azure and GCP combined, but the others are growing fast. The cloud you choose has major implications for your available toolset—fewer companies are open to going multi-cloud these days.


Rapid, Dynamic Obstacle Avoidance with an Event-based Camera


You are probably not working on drones, but this is pretty cool nonetheless :)

Thanks to our sponsors!

Fishtown Analytics: Analytics Consulting for Startups

At Fishtown Analytics, we work with venture-funded startups to build analytics teams. Whether you’re looking to get analytics off the ground after your Series A or need support scaling, let’s chat.


Stitch: Simple, Powerful ETL Built for Developers

Developers shouldn’t have to write ETL scripts. Consolidate your data in minutes. No API maintenance, scripting, cron jobs, or JSON wrangling required.


By Tristan Handy

The internet's most useful data science articles. Curated with ❤️ by Tristan Handy.

