Issue #200 🥳! Streamlit. Etsy's Engineering Career Ladder. RFCs in Analytics? [DSR #200]
Wow! Four years and 200 issues. 🥳 While the Data Science Roundup started as more of a marketing effort for my fledgling company, I continue to do it for a completely different reason: you, dear reader, hold me accountable.
The data ecosystem moves very quickly, and in my role as the CEO of Fishtown Analytics it's critical that I have broad visibility into the entire space. But reading hundreds of headlines and dozens of articles every single week would be all too easy to deprioritize behind closing the next customer or building the next feature. It's the 8,000 of you who make sure I put in the work. So thanks.
The contents of this newsletter are the things I find strategically valuable for me to know; they are the things that inform my model of the world. If you find them valuable, and if they impact your model as well, then so much the better.
Thanks for all of the support over the last four years. Here's to another four.
Tristan
This week's best data science articles

Introducing Streamlit, an app framework for ML engineers
In my experience, every nontrivial machine learning project is eventually stitched together with bug-ridden and unmaintainable internal tools. These tools, often a patchwork of Jupyter Notebooks and Flask apps, are difficult to deploy, require reasoning about client-server architecture, and don't integrate well with machine learning constructs like Tensorflow GPU sessions.
This new open core company is founded by a who's who of ML from GoogleX and Zoox and follows the "productizing internal tooling we built" playbook (which often produces fantastic results). The linked post is from the team; here's the TechCrunch post about the launch.
My thoughts: Streamlit is not actually solving a data science problem, it's solving a web development problem that data science teams have. This is an under-invested area; I could imagine lots of shitty internal tools being wiped away in favor of this.
towardsdatascience.com

Fantastic. From Julia Evans. All data teams should have this printed.
Engineering Career Development at Etsy
Oh wow. You've likely seen a career ladder before: roughly, a set of stages that employees are expected to progress through as they develop their careers. Most companies' engineering-focused career ladders are... uninspired... shall we say. And companies that are world-class at developing engineering talent don't tend to share theirs.
Which is what makes this particular release unique. I haven't seen a company of Etsy's caliber release their internal ladder publicly. It will be a tremendous resource for other companies attempting to build high-performance technical teams.
Share this with your CTO and VP Data.
codeascraft.com
The Power of "Yes, if": Iterating on our RFC Process
As anyone who's read the Roundup for any length of time knows, I believe that data analysts should work more like software engineers. And while we're already watching that change percolate through the industry, there are plenty of practices that haven't made the leap. Doing RFCs (Requests for Comment) is one of these: I've never seen a data team do an RFC in the way that a software engineering team would.
This is an excellent piece on how Squarespace improved their RFC process. As I read it I couldn't help but think about the times when it would / would not be a good fit for data projects. I think there is plenty of potential applicability.
engineering.squarespace.com
Best Practices for Data Modeling
...[A]lways think about how to build a better product for users: think about users' needs and experience and try to build the data model that will best serve those considerations.
Solid overview article on the topic. If you're new to analytics engineering this is a great way to get exposed.
Google AI Blog: Improving Quantum Computation with Classical Machine Learning
Chances are you're not working in quantum computing given that there are, what, fewer than a thousand people on the planet who are. That said, I can't help but be interested in quantum computing given the potentially massive implications and the level of progress in recent years. This post outlines how one Google team designed a novel ML-based algorithm for their quantum control systems (which run on classical computers).
There are widespread rumors that Google has achieved quantum supremacy. If this is a topic that you've been interested in in the past but haven't caught up on recently, now is a good time to dive back in. Lots happening.
Thanks to our sponsors!
dbt: Your Entire Analytics Engineering Workflow
Analytics engineering is the data transformation work that happens between loading data into your warehouse and analyzing it. dbt allows anyone comfortable with SQL to own that workflow.
getdbt.com
Stitch: Simple, Powerful ETL Built for Developers
Developers shouldn't have to write ETL scripts. Consolidate your data in minutes. No API maintenance, scripting, cron jobs, or JSON wrangling required.
The internet's most useful data science articles. Curated with ❤️ by Tristan Handy.