ML Pipeline

Many AI systems are not just a single ML model running a prediction service, but instead involves a pipeline of multiple steps.

What are ML pipelines? How do build monitoring systems for that?

Speech recognition example

Screen Shot 2021-10-25 at 3.05.53 PM

User profile example

Screen Shot 2021-10-25 at 3.04.38 PM

Sum up: In ML pipelines, the cascading effects of differet ML components can be complex to keep track on.

It’s useful to brainstorm metrics to monitor that can detect changes, including concept drift or data drift (or both) at multiple stages of the pipeline.

Metrics to monitor

Monitor

Note: the principle from last section to brainstorm whatever that could go wrong and use them as metrics still applies here but to multiple components of the pipeline.

How quickly does data change?