![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5ff892cf8d549ea29acb5f82_Untitled%20(1).jpg)
Google’s JAX: Flexible, High-Performance Machine Learning
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5ff49626c70c7d5ff4ff3573_Serving%20Layer%20(2)%20(1).png)
Server-side batching: Scaling inference throughput in machine learning
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5feb63c47aaf9683e588e251_Screen%20Shot%202020-12-29%20at%2012.16.03%20PM.png)
Caleb Kaiser
December 2020
How we served 1,000 models on GPUs for $0.47
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fe21c60b3c0003acd04fbf6_python-logo-master-v3-TM-flattened.png)
Caleb Kaiser
December 2020
Designing a machine learning platform for both data scientists and engineers
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fdbc8412acc064912da1924_metaflow.jpg)
Caleb Kaiser
December 2020
Netflix's Metaflow: Reproducible machine learning pipelines
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fd91686e634f985cb949e74_TFServing.png)
Caleb Kaiser
December 2020
How to serve batch predictions with TensorFlow Serving
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fd7987fc68e703c91119d74_Untitled%20Document%20(3).png)
Caleb Kaiser
December 2020
How to build a pipeline to retrain and deploy models
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce64b2e323b571d4cc33a6_hugging-face_1f917.png)
Caleb Kaiser
December 2020
How to deploy Transformer models for language tasks
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fc94d548b1f23e8a287f176_cortex_gcp.jpg)
Caleb Kaiser
December 2020
Cortex 0.24: Announcing multi-cloud support
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce9038a8cea52cce32a39c_Cortex_Architecture.png)
Caleb Kaiser
December 2020
How we scale machine learning model deployment on Kubernetes
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce91cd687ac05a5e01997c_Arch_AWS-Lambda_64.png)
Caleb Kaiser
December 2020
Why we built a serverless machine learning platform—instead of using AWS Lambda
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce9484a6040288dce659d6_fastapi.jpg)
Caleb Kaiser
November 2020
Why we don’t deploy machine learning models with Flask
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5faebbe616320960d46d2f89_jupyter%20%2B%20Cortex.jpg)
Caleb Kaiser
November 2020
How to deploy machine learning models from a notebook to production
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce8c8a7ae3138bfe35e70b_yaml.png)
Caleb Kaiser
November 2020
Why we use YAML—not notebooks—for machine learning
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce8fb7f6eeec37e0ba2857_MLOps.png)
Caleb Kaiser
November 2020
5 Lessons Learned Building an Open Source MLOps Platform
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fa1d1fb853b6c3dc5052a20_Untitled.jpg)
Caleb Kaiser
November 2020
Machine learning doesn't have to be expensive
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce9095793568ad17b60b46_a%3Ab_test.png)
Caleb Kaiser
November 2020
A/B testing machine learning models in production
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5f9af24ca40758f0190b1342_tensorflowserving.jpg)
How to deploy 1,000 models on one CPU with TensorFlow Serving
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5fce90cd2ce88119e3f0292d_spot_instances.png)
How to reduce the cost of machine learning inference
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5f875111b391a4fb0c6f402d_Untitled.png)
How to deploy PyTorch Lightning models to production
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5f80ae1c2d3851431ec6d70d_CXHFONNX.png)
Improve NLP inference throughput 40x with ONNX and Hugging Face
![](https://uploads-ssl.webflow.com/5f6030edfd63364a668b1265/5f738e8452c9c5fc64c062d5_poachers.png)
Catching poachers with machine learning