MLOps Flashcards

(70 cards)

2
Q

What does MLOps stand for?

A

Machine Learning Operations: the way teams build, deploy, monitor, and update machine-learning models in real life

MLOps encompasses everything around the training of machine-learning models, not just the training itself.

3
Q

What is the real ML life cycle in MLOps?

A
  • Define business goal
  • Get data
  • Train models
  • Test models
  • Deploy model
  • Monitor model
  • Retrain / fix
  • Repeat

MLOps is about running this loop properly.

4
Q

Why do we need MLOps?

A

To manage:
* Lots of data
* Lots of models
* Lots of users
* Lots of updates

Without MLOps, things can break very fast.

5
Q

What are some problems MLOps tries to stop?

A
  • Model suddenly becomes inaccurate
  • Data changes silently
  • Wrong model version gets deployed
  • No one knows which model is in production
  • Retraining is manual and slow

These issues highlight the importance of MLOps in maintaining model reliability.

6
Q

What must you have before training in MLOps?

A
  • A clear business objective
  • Success metrics
  • Acceptance criteria

No MLOps magic can happen without these foundational elements.

7
Q

What does model training & experiments include?

A
  • Running experiments
  • Tuning hyperparameters
  • Comparing models
  • Tracking results
  • Storing models and versions

This phase is crucial for developing effective machine-learning models.

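Experiment tracking can be sketched in a few lines of Python. This is a toy in-memory log (real teams use a dedicated tracking tool); the model names, hyperparameters, and metric values are invented for illustration:

```python
# Minimal sketch of experiment tracking: each run records the model,
# its hyperparameters, and the resulting metric so runs can be compared.
experiment_log = []

def log_run(model_name, hyperparams, metric):
    """Record one training run with its settings and result."""
    run = {
        "run_id": len(experiment_log) + 1,
        "model": model_name,
        "hyperparams": hyperparams,
        "metric": metric,
    }
    experiment_log.append(run)
    return run

log_run("logistic_regression", {"C": 1.0}, 0.81)
log_run("logistic_regression", {"C": 0.1}, 0.84)
log_run("random_forest", {"n_estimators": 100}, 0.88)

# Pick the best run by its tracked metric.
best = max(experiment_log, key=lambda r: r["metric"])
```

The point of the card: without such a record, "comparing models" and "tracking results" are impossible once the notebook is closed.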
8
Q

What is an important point regarding a trained model?

A

A trained model is not a product

You still need code around it, APIs, pipelines, scalability, and automation.

9
Q

What does deployment in MLOps refer to?

A

Putting the model into the real system (web app, backend, mobile app, etc.)

Deployment is a critical step in making the model usable.

10
Q

What should you monitor after release of a model?

A
  • Prediction quality
  • Error rates
  • Data changes
  • Performance (speed, cost, failures)

Monitoring is a huge aspect of MLOps.

11
Q

What is a very important idea regarding retraining and updating?

A

Releasing a model is NOT the end

If monitoring shows data changed or accuracy dropped, you must go back to training.

12
Q

What is the most important idea in MLOps?

A

It is a continuous cycle

The cycle includes training, deploying, monitoring, improving, and redeploying.

13
Q

Who does MLOps involve?

A
  • Data scientists
  • Software engineers
  • Platform / cloud engineers
  • Product people

MLOps is a cross-functional discipline connecting ML work, software engineering, and operations.

14
Q

How does MLOps compare to DevOps?

A

MLOps = build and operate ML systems

DevOps = build and operate software; MLOps applies the same ideas to systems that also involve data and models.

15
Q

What is the ultra-short summary of MLOps?

A

MLOps is the discipline that makes machine-learning models usable and reliable in production

It covers training, testing, packaging, deployment, monitoring, retraining, teamwork, and tooling.

16
Q

What is an ML pipeline?

A

The full chain that turns raw data into a running prediction system

Not just a model.

17
Q

What are the first two steps in an ML pipeline?

A
  • Business understanding
  • Data collection

Business understanding involves defining the problem and success metrics.

18
Q

What is the purpose of data cleaning in an ML pipeline?

A

Fix missing values, errors, broken rows, weird formats

Ensures the data is usable for modeling.

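A minimal sketch of data cleaning on invented toy rows; the rules (fill missing ages with a default, drop unparsable rows, normalize city formatting) are illustrative choices, not a fixed recipe:

```python
# Toy dataset: rows as dicts, with a missing value, a broken row,
# and a weirdly formatted entry.
raw_rows = [
    {"age": "34", "city": "berlin"},
    {"age": None, "city": "Paris"},      # missing value
    {"age": "abc", "city": "Madrid"},    # broken row
    {"age": "28", "city": " LONDON "},   # weird format
]

def clean(rows, default_age=0):
    cleaned = []
    for row in rows:
        age = row["age"]
        if age is None:
            age = default_age            # fill the missing value
        elif not str(age).isdigit():
            continue                     # drop the unfixable row
        cleaned.append({
            "age": int(age),
            "city": row["city"].strip().title(),  # normalize the format
        })
    return cleaned

rows = clean(raw_rows)
```

After cleaning, every surviving row has a numeric age and a consistently formatted city, which is what "usable for modeling" means in practice.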
19
Q

What does feature engineering involve?

A

Turn raw data into useful inputs

Essential for improving model performance.

20
Q

What is data augmentation used for?

A

Create extra data (mainly for images / sound / text)

Helps improve model robustness.

21
Q

What is the role of data annotation in supervised learning?

A

Add labels

Necessary for training supervised models.

22
Q

What is the purpose of feature selection?

A

Keep only the useful inputs

Reduces dimensionality and improves model efficiency.

23
Q

What does feature scaling achieve?

A

Put numbers on comparable ranges

Important for algorithms sensitive to the scale of data.

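One common way to put numbers on comparable ranges is standardization (z-score scaling): subtract the mean, divide by the standard deviation. A sketch with invented toy features:

```python
import statistics

def standardize(values):
    """Rescale values to mean 0 and standard deviation 1."""
    mean = statistics.mean(values)
    std = statistics.pstdev(values)   # population standard deviation
    return [(v - mean) / std for v in values]

ages = [20, 30, 40]                   # small numeric range
incomes = [20000, 50000, 80000]       # large numeric range

scaled_ages = standardize(ages)
scaled_incomes = standardize(incomes)
# After scaling, both features live on the same comparable range,
# so neither dominates a distance- or gradient-based algorithm.
```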
24
Q

What is involved in the modeling step?

A

Train the model

This is where the actual learning occurs.

25
Q

What is the purpose of evaluation in an ML pipeline?

A

Check how good it is (metrics, validation, tests)

Ensures the model meets performance criteria.

26
Q

What does deployment refer to in an ML pipeline?

A

Put the system into real use

Involves making the model accessible for predictions.

27
Q

What is the focus of the monitoring step?

A

Watch it in production: is it still accurate?

Ensures ongoing performance and reliability.

28
Q

What is the final output of an ML pipeline?

A

A real prediction system that the business actually uses

This is the ultimate goal of the pipeline.

29
Q

True or false: You deploy a model in an ML pipeline.

A

FALSE

You deploy a pipeline, which includes data cleaning, feature processing, scaling, transformations, the model, and prediction logic.

30
Q

Who primarily builds ML pipelines?

A
* MLOps engineers
* Software engineers
* Data scientists

It’s system architecture work.

31
Q

What is the difference between a Jupyter notebook and an ML pipeline?

A

A Jupyter notebook = experiment; an ML pipeline = production system

This highlights the transition from experimentation to deployment.

32
Q

What are the key components of an ML pipeline?

A
* Business goal
* Data → cleaning → features
* Train model
* Evaluate
* Deploy
* Monitor
* Improve

This summarizes the entire process from start to finish.
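The components above can be sketched as a chain of steps behind a single `predict` call, which is why you deploy the whole pipeline rather than just the model. The step functions here are toy stand-ins for real cleaning, feature, and model code:

```python
def clean_step(x):
    """Drop missing values (stand-in for real cleaning)."""
    return [v for v in x if v is not None]

def feature_step(x):
    """Toy feature transform."""
    return [v * 2 for v in x]

def model_step(x):
    """Toy 'model': a threshold decision."""
    return sum(x) > 10

class Pipeline:
    """Runs raw input through every step in order."""
    def __init__(self, steps):
        self.steps = steps
    def predict(self, x):
        for step in self.steps:
            x = step(x)
        return x

pipeline = Pipeline([clean_step, feature_step, model_step])
result = pipeline.predict([1, None, 2, 3])  # raw input in, prediction out
```

Note that the caller never touches the model directly: raw data goes into the front of the chain, and the same preprocessing runs in production as in training.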
33
Q

What are the four main steps in the process described?

A
* Modelling
* Evaluation
* Deployment
* Monitoring

These steps outline the workflow for developing and maintaining a machine-learning model.

34
Q

In the Modelling step, what must you document?

A
* Algorithm used
* Features used
* Hyperparameters tried
* Error limits accepted

Documentation is crucial for reproducibility in model development.

35
Q

What is the goal of the Evaluation step?

A

To determine if the model can predict unknown data well

This is assessed by running the model on a test set and computing performance metrics.

36
Q

True or false: A trained model is enough for deployment.

A

FALSE

Deployment requires the model, preprocessing, feature steps, scaling, and prediction code.

37
Q

What are the main goals of the Deployment step?

A
* Scalability
* Stability
* Automation

This involves turning the model into a real product and often includes continuous delivery.

38
Q

What two aspects should be monitored after deployment?

A
* Software performance
* Model performance

Monitoring ensures that both the software and the model continue to function effectively in real-world conditions.

39
Q

What should you do if model performance drops?

A
* Go back to earlier pipeline steps
* Retrain with new data
* Adjust model/hyperparameters
* Evaluate again
* Redeploy

This process ensures that the model remains valid and effective over time.
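The "performance dropped, go back" decision can be sketched as a simple trigger. The 5-point accuracy threshold here is an invented illustrative choice, not a standard value:

```python
def needs_retraining(deploy_accuracy, live_accuracy, max_drop=0.05):
    """Return True when live accuracy has fallen more than max_drop
    below the accuracy the model had at deployment time."""
    return (deploy_accuracy - live_accuracy) > max_drop

needs_retraining(0.90, 0.88)   # small dip: keep the model
needs_retraining(0.90, 0.80)   # large drop: go back and retrain
```

In a mature setup (Level 4 on the maturity scale below), a check like this runs automatically and kicks off the retraining pipeline.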
40
Q

Fill in the blank: You deploy the ________, not just the model.

A

pipeline

This emphasizes the importance of deploying the entire workflow for machine learning.

41
Q

What is the output of the Modelling step?

A
* Model type
* Features
* Hyperparameters
* Learned weights
* Training data info (metadata)

This output is essential for understanding the model's structure and performance.

42
Q

What does monitoring model performance involve?

A

Comparing predicted values vs real outcomes

This helps determine if the model is still valid in changing conditions.
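A sketch of that comparison, with invented predictions and outcomes:

```python
def live_accuracy(predictions, actual_outcomes):
    """Fraction of production predictions that matched reality."""
    correct = sum(p == a for p, a in zip(predictions, actual_outcomes))
    return correct / len(predictions)

preds = [1, 0, 1, 1, 0]   # what the model predicted in production
reals = [1, 0, 0, 1, 0]   # what actually happened
acc = live_accuracy(preds, reals)   # 4 of 5 correct
```

The practical difficulty is getting `reals`: real outcomes often arrive with a delay (e.g. a loan defaults months after the prediction), so monitoring has to wait for them.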
43
Q

What does MLOps maturity refer to?

A

How automated, reliable, and scalable your ML system is

MLOps maturity indicates the effectiveness of managing machine-learning operations.

44
Q

What is Level 0 in MLOps maturity?

A
* No MLOps
* Everything is manual
* Train manually
* Test manually
* Deploy manually
* No monitoring

Models are treated with uncertainty: 'We trained something… we hope it still works.'

45
Q

What characterizes Level 1 in MLOps maturity?

A
* DevOps only (no real MLOps)
* Software has automation (builds, tests, releases)
* Models treated like files
* Little or no feedback on model performance

Code is professional, but ML processes remain messy.

46
Q

What advancements are present in Level 2 of MLOps maturity?

A
* Automated model training
* Model performance tracking
* Better model management

Deployment is still mostly manual, but you can determine which model is better and which experiment worked.

47
Q

What features define Level 3 in MLOps maturity?

A
* Automated deployment
* Proper train/test/production environments
* A/B testing between models
* Full traceability

The system can safely decide which model to deploy.

48
Q

What does Level 4 in MLOps maturity entail?

A
* Continuous monitoring
* Production performance feeds back automatically
* Automatic retraining triggers
* Models compared against live metrics
* Zero-downtime releases

The system improves itself autonomously.

49
Q

What does the summary table say about MLOps maturity levels?

A
* Level 0: everything manual
* Level 1: software automated, ML not
* Level 2: training & tracking automated
* Level 3: deployment automated
* Level 4: monitoring + retraining automated

This table provides an easy reference for understanding MLOps maturity.

50
Q

What roles are necessary to move up the MLOps maturity levels?

A
* Data scientists
* Data engineers
* Software engineers
* MLOps/platform people

Teams must collaborate and stop being siloed to improve MLOps maturity.

51
Q

What changes occur as you mature in MLOps?

A
* Manual runs → automated pipelines
* Manual releases → CI/CD
* No tracking → full version control
* No monitoring → performance feedback
* Manual retraining → triggered retraining

These changes enhance the efficiency and reliability of ML operations.

52
Q

True or false: A successful ML project is only about technology.

A

FALSE

Management support, access to data, easy deployment paths, and time for maintenance are also crucial.

53
Q

What should you always ask regarding MLOps projects?

A

'What MLOps level is this project actually at?'

This question helps identify the current maturity level and plan for improvements.

54
Q

What is the ultra-short summary of MLOps maturity?

A

MLOps maturity = how automated and reliable your ML lifecycle is

It emphasizes the transition from manual experiments to self-monitoring, self-retraining systems.

55
Q

What does CRISP-DM stand for?

A

Cross-Industry Standard Process for Data Mining

CRISP-DM is a checklist for data projects that prevents jumping straight to modeling.

56
Q

List the six steps of the CRISP-DM workflow.

A
* Business understanding
* Data understanding
* Data preparation
* Modelling
* Evaluation
* Deployment

These steps guide the execution of a data/ML project.

57
Q

What is the first step in the CRISP-DM workflow?

A

Business understanding

This step involves defining the problem and success criteria.

58
Q

What does the Data understanding step involve?

A
* Assessing available data
* Evaluating data quality
* Exploring data with visualizations

Engaging with business stakeholders to clarify data meanings is also important.

59
Q

What is typically the longest step in the CRISP-DM process?

A

Data preparation

This step includes cleaning data, merging files, fixing errors, and feature engineering.

60
Q

What are the key tasks in the Modelling step?

A
* Splitting data into train/validation/test
* Trying different models
* Tuning hyperparameters

This step focuses on developing the predictive model.
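The train/validation/test split can be sketched in plain Python; the 60/20/20 ratios and the fixed seed are illustrative choices:

```python
import random

def split_data(rows, train_frac=0.6, val_frac=0.2, seed=42):
    """Shuffle once, then slice into train / validation / test sets.
    Train is for fitting, validation for tuning hyperparameters,
    and test stays untouched until the final evaluation."""
    rows = rows[:]                        # don't mutate the caller's list
    random.Random(seed).shuffle(rows)
    n = len(rows)
    n_train = int(n * train_frac)
    n_val = int(n * val_frac)
    train = rows[:n_train]
    val = rows[n_train:n_train + n_val]
    test = rows[n_train + n_val:]
    return train, val, test

data = list(range(100))                   # toy stand-in for real rows
train, val, test = split_data(data)
```

Fixing the seed makes the split reproducible, which matters for the documentation requirements in the Modelling card above.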
61
Q

What is evaluated in the Evaluation step?

A
* Model performance on unseen data
* Achievement of business targets

This step determines if the model meets the defined success criteria.

62
Q

What does the Deployment step entail?

A
* Deploying the entire pipeline
* Documenting the process
* Planning for monitoring and maintenance

This step ensures the model is operational and maintained.

63
Q

True or false: In the CRISP-DM workflow, you move in a straight line from one step to the next.

A

FALSE

The process involves going back and forth between steps as needed.

64
Q

What is the main advantage of cloud computing for ML projects?

A

Allows renting computers (including GPUs) instead of owning them

This makes machine learning more affordable.

65
Q

List the three service types in cloud computing.

A
* IaaS – Infrastructure
* PaaS – Platform
* SaaS – Software

These services provide varying levels of control and tools for ML projects.

66
Q

Name three major cloud platforms mentioned.

A
* Google Cloud Platform (GCP)
* Amazon Web Services (AWS)
* Microsoft Azure

These platforms offer various tools and services for machine learning.

67
Q

What is a container in the context of cloud ML systems?

A

An environment where the app, model, and libraries run together

This ensures reliable deployment.

68
Q

What is a reality check regarding cloud platforms?

A
* They are powerful
* Often complex
* May be overkill for small projects

Simpler platforms are often better for learning or demos.

69
Q

Summarize the CRISP-DM workflow in one line.

A

A simple project workflow that helps plan, organize, and iterate properly

It is essential for managing data and ML projects effectively.

70
Q

Summarize the role of cloud computing in ML.

A

Enables running ML on rented machines and GPUs, making it accessible and scalable

This has transformed the affordability of machine learning.