Filling Gaps Flashcards by Jacquelyn Garcia

What does CTR stand for?

click-through rate

How well did you know this?

Not at all

Perfectly

What is the A group in A/B test?

the control group

How well did you know this?

Not at all

Perfectly

What is the B group in A/B test?

the treatment group

How well did you know this?

Not at all

Perfectly

How would we compute conversion rates in an A/B test?

the number of times A or B was chosen divided by number of users shown either A or B

How well did you know this?

Not at all

Perfectly

What would be the null hypothesis and alternate hypothesis for an A/B test with click through rates?

H_0: p_a = p_b, no difference
H_1: p_a != p_b, there is a difference

How well did you know this?

Not at all

Perfectly

after identifying the null hypothesis for an a/b test, what should we do?

compute the pooled proportion, compute the standard error and then use that to compute z-score

How well did you know this?

Not at all

Perfectly

what is the equivalent of 5% significance level in z values?

the critical z values for two tailed tests are += 1.96

How well did you know this?

Not at all

Perfectly

what can we conclude if our z score was larger than 2?

we’d reject h_0 and conclude B performs better

How well did you know this?

Not at all

Perfectly

When would we use a t-test?

use it if we’re comparing averages

How well did you know this?

Not at all

Perfectly

what is an example of a metric that tells us we need to use a t-test?

average time spent, purchase amount

How well did you know this?

Not at all

Perfectly

what is an example of a metric that tells us we need to use a two-proportion z-test?

click-through rate, conversion rate

How well did you know this?

Not at all

Perfectly

what is an example of a metric that tells us we need to use a mann-whitney U test?

non-normal data or small samples

How well did you know this?

Not at all

Perfectly

When would we use a mann-whitney u test?

non-parametric alternative

How well did you know this?

Not at all

Perfectly

What is another time we would use a two-proportion z-test?

when the data are binary/categorical like clicked(1) or did not click(0)

How well did you know this?

Not at all

Perfectly

What does a two-sample t-test measure?

whether the means differ significantly

How well did you know this?

Not at all

Perfectly

What is an example that a two-sample t-test would be good for?

comparing the average time on a site for version A vs. version B

How well did you know this?

Not at all

Perfectly

What is the chi-squared test of independence?

tests whether two categorical variables are related

How well did you know this?

Not at all

Perfectly

what is an example that we would use chi-squared test of independence?

does gender affect click behavior?

How well did you know this?

Not at all

Perfectly

what is the chi-squared goodness-of-fit test?

tests whether one categorical variable follows an expected distribution

How well did you know this?

Not at all

Perfectly

what is an example of when we would use a chi-squared goodness-of-fit test?

are clicks evenly distributed across 3 button colors?

How well did you know this?

Not at all

Perfectly

What is an easy way to tell if we need to use chi-squared test?

are these two things connected?

How well did you know this?

Not at all

Perfectly

what is a true positive?

you said yes and you were right

How well did you know this?

Not at all

Perfectly

what is a false positive?

you said yes but were wrong

How well did you know this?

Not at all

Perfectly

what is a false negative?

you said no but were wrong

How well did you know this?

Not at all

Perfectly

what is a true negative?

you said no and were right

what is accuracy?

how often the model is right

what is precision?

of all the things the model said "yes" to, how many were actually yes?

what is recall?

of all the real yes things, how many did the model catch?

what is f1 score?

the balance between precision and recall

what is specificity?

how many real "no" things did it correctly say no to?

What is an example of a true positive?

sick person correctly diagnosed as sick

what is an example of a false positive?

healthy person told they're sick

what is an example of a false negative?

sick person told they're healthy

what is an example of a true negative?

healthy person correctly told they're healthy

what is a real world example or precision?

when the doctor says someone is sick, how often is it true?

what is a real world example or recall?

of all sick people, how many did the doctor find?

what is a real world example or accuracy?

how often did the doctor get it right overall?

what is a taxonomy a fancy word for?

organized categories

why are taxonomies important?

they group similar things, find information faster, compare things fairly

what is hierarchy?

from big to small

what is a node?

one level in that chain

what is a parent?

the category above

what is a child?

the category below

what are siblings?

categories on the same level

how are taxonomies like maps?

they tell you where something belongs and how it connects to other things

what is a trunk?

the main idea

what are the branches?

big groups

what are leaves?

specific items

what is a category?

a named group of similar things

what is labeling?

assigning items to categories

what is data labeling?

giving names or tags to examples so the model can learn from it later

why do we label data?

because computers don't know what they're looking at so we turn raw data into something meaningful

if we add a label to a photo of an apple, what are we using it for?

image recognition

if we add a label of "positive sentiment" to a text example, what are we using it for?

sentiment analysis

if we add a label of "bark" to an audio clip, what are we using it for?

sound detection

if we are adding a label of "car" to a video of traffic, what are we using it for?

self-driving cars

how do we do a classification label?

pick one label for the whole thing

how do we create a bounding box label?

draw a box around an object in an image

how do we create a segmentation label?

color every pixel of an object

how do we create an entity tagging label?

mark key words in text

how do we create sentiment labeling?

mark how something feels

what is a data annotator?

the person labeling the data

what is a data annotation?

the act of marking or highlighting the data

what is data governance?

knowing who manages the data, where it comes from, and how it's used

what is data quality?

making sure the data is accurate, timely, complete, and consistent so its fit for purpose for whoever needs it.

what is data modeling?

it gives structure to the data and represents how we represent real world entities and relations in a way that's scalable

what is data lifecycle management?

focuses on how data is created, maintained, improved, and retired over time

What is the consistency property of a good taxonomy?

same logic at every level

What is the completeness property of a good taxonomy?

all relevant categories are represented

What is the single inheritence property of a good taxonomy?

each child has one parent

What is the proper granularity property of a good taxonomy?

levels are consistent detail

What is the no cycles property of a good taxonomy?

should form a tree, not loops

Filling Gaps Flashcards

(72 cards)