This class was created by Brainscape user Mahsa Zamanifard. Visit their profile to learn more about the creator.

Decks in this class (22)

External: Dealing with Skewness
1 what are the effects of skewed ...,
2 what are the ways of dealing wi...,
What to do with skewness in targe...
4  cards
External: Dealing With Outliers
5 should the outlier detection be...,
Does outlier treatment come first...,
What are 2 automatic outlier finder
3  cards
Outlier Identification and Removal
When can we use std of sample as ...,
What are the cut off values for o...,
How can we compute cut off for ou...
6  cards
How to Mark and Remove Missing Data
What is the indicator for missing...,
Can we count missing values as a ...,
What is statistical imputation p92
14  cards
What Is Feature Selection
How do statistical based feature ...,
How many main types of feature se...,
What are the types of supervised ...
15  cards
How to Select Categorical Input Features: Encoding and K-best
Does pandas try to map some str i...,
Does the ordinalencoder in scikit...,
What is the difference between or...
5  cards
How to Select Numerical Input Features
What is an f test p1571tan f stat...,
What is scikitlearn s implementat...,
How can we use anova test in k be...
3  cards
How to Select Features for Numerical Output
What is the scikit learn s implem...,
Is the score given by scikit lear...,
How can we use mutual information...
7  cards
How to Use RFE for Feature Selection
What are the two important config...,
Is the performance of the rfe str...,
What does rfe is a wrapper type f...
9  cards
How to Use Feature Importance
What are the 3 main types of more...,
In which models can we use coeffi...,
What attribute do we use to get c...
13  cards
How to Scale Numerical Data
Which type of algorithms benefit ...,
What are the two most popular tec...,
What does normalization do p230sc...
14  cards
How to Scale Data With Outliers
What s robust scaling formula p248,
What is the mean and std of input...,
What are the parameters of robust...
3  cards
How to Encode Categorical Data
What are the two most popular tec...,
What is discretization p259,
What is the difference between no...
15  cards
How to Make Distributions More Gaussian
When are the transformations for ...,
Why is it better to have gaussian...,
What are power transformers p273
7  cards
How to Change Numerical Data Distributions
What are the causes of highly ske...,
Does standard distribution for th...,
What does a quantile transform do...
8  cards
How to Transform Numerical to Categorical Data: Suitable For Highly Skewed or Non-Standard Distribution
What do discretization transforms...,
Which library do we use for chang...,
What are 3 common methods we can ...
13  cards
How to Derive New Input Variables: Polynomial Feature Transform
Typically what degrees are used f...,
What is an example of creating a ...,
What does a squared or cubed vers...
12  cards
How to Transform the Target in Regression
Which class in scikit learn is fo...,
What are two ways we can scale th...,
How can we manually transform the...
6  cards
How to Save and Load Data Transforms: how to save a model and data preparation object to file for later use
What does make_blobs function do ...,
Does make_blobs have a random_sta...,
How do we save a model and its sc...
4  cards
What is Dimensionality Reduction
What is dimensionality p355,
What is the curse of dimensionali...,
External q what is degree of free...
14  cards
How to perform LDA, PCA, SVD
What is latent dirichlet allocati...,
What does the lda model do to sep...,
Why is it better to standardize d...
12  cards
SHAP values-Kaggle
What do shap values show footnote...,
Sum shap values for all features,
What do different types of shap e...
4  cards

More about
Data Prep

  • Class purpose General learning

Learn faster with Brainscape on your web, iPhone, or Android device. Study Mahsa Zamanifard's Data Prep flashcards now!

How studying works.

Brainscape's adaptive web mobile flashcards system will drill you on your weaknesses, using a pattern guaranteed to help you learn more in less time.

Add your own flashcards.

Either request "Edit" access from the author, or make a copy of the class to edit as your own. And you can always create a totally new class of your own too!

What's Brainscape anyway?

Brainscape is a digital flashcards platform where you can find, create, share, and study any subject on the planet.

We use an adaptive study algorithm that is proven to help you learn faster and remember longer....

Looking for something else?

1. Data Science (Python)
  • 9 decks
  • 648 flashcards
  • 247 learners
Decks: 1 Introduction To Python 1, 2 Introduction To Python Ii, 3 Symbols In Data Science Part 1 Math Py, And more!
Data Processing at Scale
  • 28 decks
  • 603 flashcards
  • 44 learners
Decks: Week 1, Week 15, Week 1 Relational Models And Relational, And more!
Data Engineering
  • 18 decks
  • 265 flashcards
  • 174 learners
Decks: Hadoop, Hive, Big Data, And more!
Data Management & Business Intelligence
  • 30 decks
  • 570 flashcards
  • 74 learners
Decks: Rob Coronell Chapter 1, Rob Coronell Chapter 2, Rob Coronell Chapter 31 32, And more!
Make Flashcards