This class was created by Brainscape user Mahsa Zamanifard.

Decks in this class (22)

External: Dealing with Skewness
What are the effects of skewed da...,
What are the ways of dealing with...,
What to do with skewness in targe...
4  cards
External: Dealing With Outliers
Should the outlier detection be d...,
Does outlier treatment come first...,
What are 2 automatic outlier find...
3  cards
Outlier Identification and Removal
When can we use std of sample as ...,
What are the cut off values for o...,
How can we compute cut off for ou...
6  cards
How to Mark and Remove Missing Data
What is the indicator for missing...,
Can we count missing values as a ...,
What is statistical imputation
14  cards
What Is Feature Selection
How do statistical based feature ...,
How many main types of feature se...,
What are the types of supervised ...
15  cards
How to Select Categorical Input Features: Encoding and K-best
Does pandas try to map some str i...,
Does the OrdinalEncoder in scikit...,
What is the difference between or...
5  cards
How to Select Numerical Input Features
What is an F-test,
What is scikit-learn's implementat...,
How can we use ANOVA test in k be...
3  cards
How to Select Features for Numerical Output
What is scikit-learn's implem...,
Is the score given by scikit lear...,
How can we use mutual information...
7  cards
How to Use RFE for Feature Selection
What are the two important config...,
Is the performance of the RFE str...,
What does RFE is a wrapper type f...
9  cards
How to Use Feature Importance
What are the 3 main types of more...,
In which models can we use coeffi...,
What attribute do we use to get c...
13  cards
How to Scale Numerical Data
Which type of algorithms benefit ...,
What are the two most popular tec...,
What does normalization do
14  cards
How to Scale Data With Outliers
What's the robust scaling formula,
What is the mean and std of input...,
What are the parameters of robust...
3  cards
How to Encode Categorical Data
What are the two most popular tec...,
What is discretization,
What is the difference between no...
15  cards
How to Make Distributions More Gaussian
When are the transformations for ...,
Why is it better to have gaussian...,
What are power transformers
7  cards
How to Change Numerical Data Distributions
What are the causes of highly ske...,
Does standard distribution for th...,
What does a quantile transform do...
8  cards
How to Transform Numerical to Categorical Data: Suitable For Highly Skewed or Non-Standard Distribution
What do discretization transforms...,
Which library do we use for chang...,
What are 3 common methods we can ...
13  cards
How to Derive New Input Variables: Polynomial Feature Transform
Typically what degrees are used f...,
What is an example of creating a ...,
What does a squared or cubed vers...
12  cards
How to Transform the Target in Regression
Which class in scikit learn is fo...,
What are two ways we can scale th...,
How can we manually transform the...
6  cards
How to Save and Load Data Transforms: how to save a model and data preparation object to file for later use
What does make_blobs function do ...,
Does make_blobs have a random_stat...,
How do we save a model and its sc...
4  cards
What is Dimensionality Reduction
What is dimensionality,
What is the curse of dimensionali...,
External q what is degree of free...
14  cards
How to perform LDA, PCA, SVD
What is latent dirichlet allocati...,
What does the LDA model do to sep...,
Why is it better to standardize d...
12  cards
SHAP values-Kaggle
What do SHAP values show,
Sum SHAP values for all features,
What do different types of SHAP e...
4  cards

More about
Data Prep

  • Class purpose: General learning
