Machine Learning Up to Date #25

The interface between k8s and Allegro Trains [Source]

Here's ML UTD #25 from the LifeWithData blog! We help you separate the signal from the noise in today's hectic front lines of software engineering and machine learning.

LifeWithData strives to deliver curated machine learning & software engineering updates that point the reader to key developments without superfluous details. This enables frequent, concise updates across the industry without information overload.



Stop Using Kubernetes for ML-Ops; Instead use Kubernetes

The interface between k8s and Allegro Trains [Source]
We often think of 'deployment' as packaging software into an artifact and moving it to an environment to run on. For Machine Learning it can be better to think of deployment as “the action of bringing resources into effective action” (one of Oxford’s definitions of 'deployment'). There are a range of patterns for using ML models to make business decisions. Deploying machine learning models can mean different things, depending on the context. Understanding the key prediction-making patterns can help to decide on which tools apply to your use case.
... keep reading
The Rundown

It’s All Just Wiggly Air: Building Infrastructure to Support Audio Research

A sneak peek at the Klio documentation [Source]
**TL;DR** We just open sourced [Klio](https://venturebeat.com/2020/10/13/spotify-open-sources-klio-a-framework-for-ai-audio-research/) — our framework for building smarter data pipelines for audio and other media processing. Based on Python and Apache Beam, [Klio](https://klio.io/) helps our teams process Spotify’s massive catalog of music and podcasts, faster and more efficiently. We think Klio’s [ease of use](https://docs.klio.io/) — and its ability to let anyone leverage modern cloud infrastructure and tooling — has the potential to unlock new possibilities in media and ML research everywhere, from big tech companies to universities and libraries. But now we’re getting ahead of ourselves. What exactly is Klio and what does it do? Let’s start with the problem of audio itself.
... keep reading

Safely Rolling Out ML Models To Production

A diagram of what the author calls the "ML Orchestra" [Source]
Replacing or publishing a new version to production touches upon the core decision making logic of your business process. With AI adoption rising, the necessity to automatically publish and update models is becoming a common and frequent task, which makes it a top concern for data science teams. In this article, I will review what makes the rollout of a new version so sensitive, what precautions are required, and how to leverage monitoring to optimize your Continuous Integration (CI) pipeline, as well as your Continuous Deployment (CD) one to safely achieve your goals.
... keep reading
The Rundown

Noah’s Ark Lab Multi-Agent RL Simulation for Autonomous Driving

A snapshot of the simulation [Source]
The [CoRL 2020 Best System Paper Award](https://www.youtube.com/watch?v=KNVTASwezyw) was presented today to Huawei Noah’s Ark Lab, Shanghai Jiao Tong University and University College London for their paper _SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving._ The CoRL 2020 Award Committee praised the work as “a thorough and well-thought-out system with strong potential impact for the Autonomous Driving community.” The paper introduces SMARTS (Scalable Multi-Agent RL Training School), a realistic multi-agent simulation platform for autonomous driving. SMARTS supports the training, accumulation, and use of diverse road user behaviour models to help reinforcement learning (RL) researchers examine realistic road interaction scenarios. It has been open-sourced.
... keep reading
The Rundown

MLflow and PyTorch — Where Cutting Edge AI meets MLOps

Matching up MLFlow and PyTorch features [Source]
PyTorch has continued to evolve rapidly since the introduction of PyTorch 1.0, which brought an accelerated workflow from research to production. Looking at the momentum in research, as shown on [paperswithcode.com/trends](http://paperswithcode.com/trends), we can see that the research community has embraced PyTorch as its tool of choice. Conversely, the enterprise and production users of the world such as Lyft Level 5, Microsoft, AstraZeneca and many others, are realizing real product value through deploying PyTorch at scale. MLOps and end-to-end lifecycle management of machine learning is an important bridge between these communities, and the combination of state of the art AI developed on PyTorch and MLOps is a powerful force that will bring the impact of cutting edge research to products. Today, we are announcing a number of technical contributions to enable end-to-end support for MLflow usage with PyTorch, including support for: autologging via PyTorch Lightning; TorchServe integration through a new deployment plug-in; and a sample end-to-end workflow targeting HuggingFace Transformers. This work, done in collaboration between members of the technical teams at Databricks/MLflow core maintainers and the PyTorch core development team within Facebook, is just the beginning. We expect more technical contributions in the coming months.
... keep reading

Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences

Demonstration of emergent cooperation in the Infinite Prisoner's Dilemma environment [Source]
Multi-agent reinforcement learning (MARL) has shown recent success in increasingly complex fixed-team zero-sum environments. However, the real world is not zero-sum nor does it have fixed teams; humans face numerous social dilemmas and must learn when to cooperate and when to compete. To successfully deploy agents into the human world, it may be important that they be able to understand and help in our conflicts. Unfortunately, selfish MARL agents typically fail when faced with social dilemmas. In this work, we show evidence of emergent direct reciprocity, indirect reciprocity and reputation, and team formation when training agents with randomized uncertain social preferences (RUSP), a novel environment augmentation that expands the distribution of environments agents play in. RUSP is generic and scalable; it can be applied to any multi-agent environment without changing the original underlying game dynamics or objectives. In particular, we show that with RUSP these behaviors can emerge and lead to higher social welfare equilibria in both classic abstract social dilemmas like Iterated Prisoner's Dilemma as well in more complex inter-temporal environments.
... keep reading