Biostats & Epi for PM Flashcards

Question 1

Q

Fetal Death Rate Equation

Answer

A

total number of fetal deaths in a given time period/(total number of live births + fetal deaths during the same period of time) x 1000

(dividing by live births alone gives the fetal death ratio)

Question 2

Q

Infant Mortality Rate Equation

Answer

A

total number of deaths of infants (<1 y/o) in a time period/total number of live births during the same period x 1000

Question 3

Q

Maternal Mortality Rate Equation

Answer

A

deaths due to pregnancy related illness in a given time period/total number of live births during the same period of time x 100,000

Question 4

Q

Neonatal Mortality Rate Equation

Answer

A

total number of deaths of neonates (<28 days old) in a given time period/total number of live births during the same period of time x 1000

Question 5

Q

Perinatal Mortality Rate Equation

Answer

A

(neonatal deaths + fetal deaths in a given time period)/(total number of live births + fetal deaths during the same time period) x 1000
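
As a rough illustration of the rate equations on the cards above, the sketch below computes a few of them. The helper name `rate_per` and every count are made up for illustration; they are not real vital statistics.

```python
def rate_per(numerator, denominator, multiplier=1000):
    """Return events per `multiplier` units of the denominator."""
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return numerator / denominator * multiplier

# Hypothetical one-year counts (illustrative only)
live_births = 10_000
fetal_deaths = 60
neonatal_deaths = 40   # deaths at < 28 days old
infant_deaths = 55     # deaths at < 1 year old

infant_mortality = rate_per(infant_deaths, live_births)
neonatal_mortality = rate_per(neonatal_deaths, live_births)
# Perinatal rate: fetal deaths appear in both numerator and denominator
perinatal_mortality = rate_per(neonatal_deaths + fetal_deaths,
                               live_births + fetal_deaths)

print(infant_mortality, neonatal_mortality, perinatal_mortality)
```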

Question 6

Q

ecological fallacy definition

Answer

A

an association at the population level is not necessarily true at the individual level

Question 7

Q

studies with ecological fallacy

Answer

A

ecological (aggregate, population-level) studies

Question 8

Q

vital statistics recorded (4)

Answer

A

birth, death, marriage, divorce

Question 9

Q

length bias definition

Answer

A

when screening makes a disease appear less aggressive (better survival) because slowly progressing cases are more likely to be detected than rapidly progressing ones

Question 10

Q

non differential bias is the same as

Answer

A

random error

Question 11

Q

lead time bias definition

Answer

A

appearance that early diagnosis of a disease prolongs survival

Question 12

Q

Hawthorne effect definition

Answer

A

individual behavior changes when a person knows they are being observed

Question 13

Q

regression to the mean definition

Answer

A

the further a value is from the mean, the more likely future recordings are closer to the mean

Question 14

Q

Neyman bias definition

Answer

A

selective survival bias

cases in a study have different exposures than the ones that die

Question 15

Q

When does stratification reduce confounding?

Answer

A

analysis stage

Question 16

Q

3 ways to reduce confounding during the design stage

Answer

A

randomization
restriction
matching

Question 17

Q

3 ways to reduce confounding during the analysis stage

Answer

A

standardization
stratification
statistical modeling

Question 18

Q

Bayes theorem equation

Answer

A

(prevalence)(sensitivity) / [(prevalence)(sensitivity) + (1 - prevalence)(1 - specificity)]
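
A minimal sketch of this calculation, which gives the positive predictive value of a test. The function name `ppv` and the input values are assumptions chosen for illustration.

```python
def ppv(prevalence, sensitivity, specificity):
    """Positive predictive value from Bayes' theorem."""
    true_positive = prevalence * sensitivity
    false_positive = (1 - prevalence) * (1 - specificity)
    return true_positive / (true_positive + false_positive)

# Same hypothetical test (90% sensitive, 90% specific) at two prevalences
ppv_common = ppv(prevalence=0.10, sensitivity=0.90, specificity=0.90)
ppv_rare = ppv(prevalence=0.01, sensitivity=0.90, specificity=0.90)
print(ppv_common, ppv_rare)
```

With the test held fixed, the value is lower when the disease is rarer, which matches the prevalence card later in the deck.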

Question 19

Q

incidence density definition

Answer

A

number of new cases of a disease per summation of time that each person is at risk of a disease in a specified time and place

Question 20

Q

incidence density equation

Answer

A

new cases/sum of person-time
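
A short sketch of the equation, assuming follow-up is recorded in years per person. The function name and the follow-up values are hypothetical.

```python
def incidence_density(new_cases, person_time):
    """New cases divided by the summed time at risk."""
    total_time = sum(person_time)
    if total_time <= 0:
        raise ValueError("total person-time must be positive")
    return new_cases / total_time

# Hypothetical cohort: years at risk contributed by five people
years_at_risk = [2.0, 5.0, 3.5, 1.5, 4.0]   # sums to 16 person-years
density = incidence_density(new_cases=2, person_time=years_at_risk)
print(density, "cases per person-year")
```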

Question 21

Q

central limit theorem definition

Answer

A

when a large number of mutually independent random variables are averaged, the distribution of the sample mean approaches a normal distribution (rule of thumb: n > 30)

Question 22

Q

IQ mean and SD

Answer

A

100 +/- 15

Question 23

Q

z-score definition

Answer

A

how many standard deviations are between an observed value and the mean

Question 24

Q

z-score equation

Answer

A

(observed value - mean) / standard deviation
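
A quick worked example, using the IQ mean and SD from the earlier card (100 and 15). The function name and the observed value of 130 are chosen for illustration.

```python
def z_score(observed, mean, std_dev):
    """Number of standard deviations between a value and the mean."""
    return (observed - mean) / std_dev

z = z_score(observed=130, mean=100, std_dev=15)
print(z)
```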

Question 25

Q

rule of addition equation

Answer

A

P(event 1 or event 2) = P(event 1) + P(event 2) - P(event 1 and event 2)
used for non-mutually exclusive events
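
A small sketch of the rule with a deck-of-cards example (an illustration that is not part of the original card). Exact fractions are used so the arithmetic is easy to check.

```python
from fractions import Fraction

def prob_either(p1, p2, p_both):
    """Addition rule for events that are not mutually exclusive."""
    return p1 + p2 - p_both

# Drawing one card from a standard 52-card deck
p_heart = Fraction(13, 52)
p_king = Fraction(4, 52)
p_king_of_hearts = Fraction(1, 52)   # the overlap between the two events

p_heart_or_king = prob_either(p_heart, p_king, p_king_of_hearts)
print(p_heart_or_king)
```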

Question 26

Q

standardized mortality ratio equation

Answer

A

observed # of deaths/expected # of deaths x 100
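
A brief sketch of the ratio. The function name and both death counts are hypothetical.

```python
def smr(observed_deaths, expected_deaths):
    """Observed over expected deaths, expressed as a percentage."""
    if expected_deaths <= 0:
        raise ValueError("expected deaths must be positive")
    return observed_deaths / expected_deaths * 100

# Hypothetical cohort: 30 deaths observed where 24 were expected
ratio = smr(observed_deaths=30, expected_deaths=24)
print(ratio)   # values above 100 mean more deaths than expected
```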

Question 27

Q

direct adjustment

Answer

A

applying a study population's stratum-specific (e.g. age-specific) rates to a standard population so that rates can be compared across populations

Question 28

Q

null hypothesis definition

Answer

A

there is no difference between the variables being tested

Question 29

Q

type 1 error definition

Answer

A

when a null hypothesis is rejected when it is actually true (ex. false-positives)

Question 30

Q

type 2 error definition

Answer

A

when a false null hypothesis is not rejected (ex. false negatives)

Question 31

Q

confidence interval equation

Answer

A

mean +/- 1.96(std dev/sq root N)
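
A minimal sketch of this interval, assuming a 95% level (the 1.96 multiplier). The function name and the sample values are made up.

```python
import math

def confidence_interval_95(mean, std_dev, n):
    """95% confidence interval for a mean."""
    standard_error = std_dev / math.sqrt(n)
    margin = 1.96 * standard_error
    return mean - margin, mean + margin

# Hypothetical sample: mean 100, SD 15, 36 observations
low, high = confidence_interval_95(mean=100, std_dev=15, n=36)
print(low, high)
```

Here the standard error is 15/6 = 2.5, so the margin is 1.96 x 2.5 = 4.9 on either side of the mean.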

Question 32

Q

as prevalence increases, PPV _____ and NPV ____

Answer

A

increases, decreases

Question 33

Q

power equation

Answer

A

1 - beta = 1 - the probability of failing to reject the null when the null is false (i.e. the probability of correctly rejecting a false null)

Question 34

Q

3 ways to increase power

Answer

A

increase sample size
decrease beta
increase alpha (a less strict threshold for rejecting H0)

Question 35

Q

NNT equation

Answer

A

1/ARR = 1/(risk in control group - risk in treated group)

Question 36

Q

NNH Equation

Answer

A

1/absolute risk increase
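
A rough sketch of NNT and NNH. It assumes the absolute risk reduction is taken as control risk minus treated risk, and the absolute risk increase as exposed risk minus unexposed risk. Function names and risks are hypothetical.

```python
def number_needed_to_treat(control_risk, treated_risk):
    """1 / absolute risk reduction."""
    arr = control_risk - treated_risk
    if arr <= 0:
        raise ValueError("treatment does not reduce risk")
    return 1 / arr

def number_needed_to_harm(exposed_risk, unexposed_risk):
    """1 / absolute risk increase."""
    ari = exposed_risk - unexposed_risk
    if ari <= 0:
        raise ValueError("exposure does not increase risk")
    return 1 / ari

nnt = number_needed_to_treat(control_risk=0.20, treated_risk=0.15)
nnh = number_needed_to_harm(exposed_risk=0.12, unexposed_risk=0.10)
print(nnt, nnh)
```

In practice NNT is usually rounded up to the next whole person; the sketch leaves the raw value so the arithmetic stays visible.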

Question 37

Q

9 components to determine causality

Answer

A

consistency of association
strength of association
specificity
temporal factors
coherence of explanation
biological plausibility
experimental evidence from a controlled trial
dose-response relationship
analogy

Question 38

Q

Standard error equation

Answer

A

std dev/sq root n

Question 39

Q

internal validity definition

Answer

A

how well a study represents the true association within a study

Question 40

Q

external validity definition

Answer

A

how well the results of a study are generalizable to a different population

Question 41

Q

degrees of freedom equation

Answer

A

(rows-1)(columns-1)

Question 42

Q

chi squared equation

Answer

A

sum of (observed - expected)^2/expected

expected = (row total)(column total)/grand total
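
A sketch of the statistic for a contingency table, using the degrees-of-freedom formula from the previous card. The function name and the 2x2 counts are hypothetical.

```python
def chi_squared(table):
    """Chi-squared statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count = (row total)(column total) / grand total
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df

# Hypothetical 2x2 table of counts
statistic, df = chi_squared([[20, 30], [30, 20]])
print(statistic, df)
```

Every expected count here is 25, so each cell contributes 25/25 = 1 to the statistic.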

Question 43

Q

Kappa equation

Answer

A

(observed agreement - chance agreement)/(total number - chance agreement)

observed: agreed true + agreed false
cell agreement due to chance = (row total)(column total)/(total number)
chance agreement = TT chance + FF chance
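
A minimal sketch of kappa for two raters and a true/false rating, working in counts as the card does. The function name, parameter names, and counts are assumptions for illustration.

```python
def kappa(both_true, rater1_only_true, rater2_only_true, both_false):
    """Cohen's kappa from the four cells of a 2x2 agreement table."""
    total = both_true + rater1_only_true + rater2_only_true + both_false
    observed = both_true + both_false

    rater1_true = both_true + rater1_only_true
    rater2_true = both_true + rater2_only_true
    rater1_false = total - rater1_true
    rater2_false = total - rater2_true

    # Chance agreement per cell = (row total)(column total) / total
    chance_tt = rater1_true * rater2_true / total
    chance_ff = rater1_false * rater2_false / total
    chance = chance_tt + chance_ff

    return (observed - chance) / (total - chance)

# Hypothetical ratings of 100 cases
k = kappa(both_true=40, rater1_only_true=10, rater2_only_true=5, both_false=45)
print(k)
```

With these counts, observed agreement is 85 and chance agreement is 22.5 + 27.5 = 50, so kappa is 35/50.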

Question 44

Q

F test

Answer

A

part of ANOVA

Question 45

Q

confounder definition

Answer

A

3rd variable associated with the exposure and the outcome

obscures the relationship between the exposure and outcome

Question 46

Q

effect modifier definition

Answer

A

changes the relationship between exposures and outcomes

Question 47

Q

intervening variable definition

Answer

A

a mechanism by which a causal variable leads to an outcome

Question 48

Q

necessary cause definition

Answer

A

required for disease to occur but may not invariably lead to disease

Question 49

Q

sufficient cause definition

Answer

A

invariably leads to a disease

Question 50

Q

coefficient of determination definition

Answer

A

the proportion of variation of a dependent variable that can be explained by an independent variable

Question 51

Q

3 examples of time-series analysis

Answer

A

cohort studies
epidemic studies
longitudinal data

Question 52

Q

McNemar’s Test definition

Answer

A

chi-sq test for non-independent variables, allows you to analyze matched pairs or calculate before and after in the same variable

Question 53

Q

Mann-Whitney U test definition

Answer

A

tests the difference in medians between two groups; the nonparametric version of the t-test

Question 54

Q

attributable risk equation

Answer

A

a/(a+b) - c/(c+d)

Question 55

Q

relative risk equation

Answer

A

(a/(a+b))/(c/(c+d))

Question 56

Q

OR equation

Answer

A

(a/c)/(b/d) = ad/bc
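
A sketch of the three 2x2 measures on the cards above. It assumes the usual cell layout, which the cards do not state: a = exposed with disease, b = exposed without disease, c = unexposed with disease, d = unexposed without disease. The counts are hypothetical.

```python
def two_by_two_measures(a, b, c, d):
    """Attributable risk, relative risk, and odds ratio from a 2x2 table."""
    risk_exposed = a / (a + b)
    risk_unexposed = c / (c + d)
    return {
        "attributable_risk": risk_exposed - risk_unexposed,
        "relative_risk": risk_exposed / risk_unexposed,
        "odds_ratio": (a / c) / (b / d),   # equivalent to ad/bc
    }

# Hypothetical cohort: 100 exposed and 100 unexposed people
measures = two_by_two_measures(a=30, b=70, c=10, d=90)
print(measures)
```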

Question 57

Q

25th percentile calculation

Question 58

Q

sign test definition

Answer

A

nonparametric test that compares dichotomous differences in data from matched, otherwise identical pairs; ignores the magnitude of the difference

Question 59

Q

Nonparametric version of t-test

Answer

A

mann-whitney U test

wilcoxon rank-sum test

Question 60

Q

Nonparametric version of paired t-test

Answer

A

Wilcoxon signed rank test

sign test

Question 61

Q

Nonparametric version of ANOVA

Answer

A

Kruskal-wallis test

Question 62

Q

Nonparametric version of Pearson correlation

Answer

A

spearman correlation

chi-sq

Question 63

Q

regular categorical variable example

Answer

A

group names, M/F

Question 64

Q

ordinal variable definition

Answer

A

group names with an order, ex. cancer stage

Answer 64

A

measurements, ex. height/weight

Answer 65

A

counts, ex. number of crashes at an intersection

Answer 66

A

continuous variable with no true zero

Answer 67

A

continuous variable with a true 0

Answer 68

A

average squared distance from the mean

Answer 69

A

square root of variance

Answer 70

A

mean > median

tail goes to the right

Answer 71

A

mean < median

tail goes to the left

Answer 72

A

geometric mean = e^(mean of the logs)

Answer 73

A

ratio of std dev to the mean x 100

SD/mean x 100
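
A short sketch of this ratio. It assumes the sample standard deviation (n - 1 denominator), which the card does not specify. The function name and data are made up.

```python
import statistics

def coefficient_of_variation(values):
    """Standard deviation as a percentage of the mean."""
    return statistics.stdev(values) / statistics.mean(values) * 100

# Hypothetical measurements: mean 10, sample SD 2
cv = coefficient_of_variation([8, 10, 12])
print(cv)
```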

Answer 74

A

1. compare relative data spread for 2 variables

2. evaluate precision of the measurement of a single variable

Answer 75

A

number of standard deviations a value is away from the mean

Answer 76

A

50th percentile

Answer 77

A

84th percentile

Answer 78

A

97.5th percentile

Answer 79

A

z = (obs value - known sample mean) / population std dev

Answer 80

A

simple random sample
stratified random sample
cluster random sample
systematic random sample

Answer 81

A

distribution of sample means is approximately normal if the sample size is large enough (N~=30)

Answer 82

A

AKA standard error

std dev/sq root N

Answer 83

A

sample mean +/- 2(pop sd/sq root sample size)

Answer 84

A

H0: mu1 = mu0
HA: mu1 ≠ mu0

Answer 85

A

H0: mu1 >= mu0
HA: mu1 < mu0
OR
H0: mu1 <= mu0
HA: mu1 > mu0

Answer 86

A

calculate test statistic
identify probability distribution of the test statistic
calculate p-value from test statistic based on probability distribution

Answer 87

A

select a smaller alpha

Answer 88

A

increases, decreases

Answer 89

A

increased, decrease

Answer 90

A

t-test
Wilcoxon rank sum (NP)
mann-whitney U test
median test

Answer 91

A

ANOVA

kruskal-wallis (NP)

Answer 92

A

paired t-test
Wilcoxon signed rank (NP)
sign test

Answer 93

A

chi squared
fisher’s exact test
paired–McNemar’s chi squared

Answer 94

A

Pearson’s
spearman’s (NP)
linear regression

Answer 95

A

logistic regression

Answer 96

A

to convert values to rank–then analyze rank
with small sample sizes
with ordinal outcomes

Answer 97

A

use: compare continuous outcome between 2 groups when the data is symmetric or n>15
outcome: t-statistic –> p-value

Answer 98

A

use: compare continuous outcome between 2 groups when the data is skewed, small n, or ordinal data
output: rank overall –> compare sums of ranks between 2 groups

Answer 99

A

overall median across entire sample

asks whether each value is > or < median and compares via a 2x2 table and chi-squared

Answer 100

A

compare continuous outcomes in pairs

looks at mean difference of pairs then asks is it different y/n by one-sample t test

Answer 101

A

continuous outcomes in pairs when there are few pairs or data is skewed

Answer 102

A

continuous outcomes in pairs when you don’t have numbers, only relationships

Answer 103

A

comparing continuous outcomes between >2 groups

Answer 104

A

comparing continuous outcomes between >2 groups when you have skewed sample, small n, ordinal data
compares sums of ranks across groups

Answer 105

A

small sample size for categorical outcome/categorical predictor (any cell <5)

Answer 106

A

chi-sq for matched or paired proportions (ex. matched case-control)

Answer 107

A

the amount of variability accounted for by the line of best fit

Answer 108

A

sq root of r^2

Answer 109

A

continuous outcome w continuous predictor

Answer 110

A

MSfitted/MSerror with p-1, n-p DFs

Answer 111

A

When 2 or more predictor variables are highly correlated

Answer 112

A

increases standard error of beta estimates

can lead to confusion/misleading results

Answer 113

A

used to compare means between groups while controlling for other variables (covariates) that may be unbalanced between groups

Answer 114

A

categorical outcome/continuous predictor

betas are estimated from maximum likelihood–model gives the probability of the outcome

Answer 115

A

Council of State and Territorial Epidemiologists

Answer 116

A

the proportion of those who have a disease that are accurately identified as having it (SNOUT)

Answer 117

A

the proportion of those without a disease that are accurately identified as NOT having it (SPIN)

Answer 118

A

P(event 1 and event 2) = P(1) x P(2)

Answer 119

A

determine the probability of 2 independent events

can also use to test for independence

Answer 120

A

P(1 or 2) = P(1) + P(2)

Answer 121

A

P(1 or 2) = P(1) + P(2) - P(1 and 2)

Answer 122

A

percentage of total variation in study estimates that is due to heterogeneity between studies rather than chance (for meta-analysis)
If >50% –> heterogeneous

Answer 123

A

log rank test

Answer 124

A

hazard ratios

Answer 125

A

a group of people become ill after being exposed to a point-source contaminant

Answer 126

A

a common source continuously affects those who come into contact with it

Answer 127

A

infection is transmitted from one person to another

Answer 128

A

when a common source outbreak is complicated by person-to-person spread

Answer 129

A

mean differences

Answer 130

A

false negative error rate

Answer 131

A

false positive error rate

Answer 132

A

fever >100
cough +/- sore throat
if flu swab + ok

Answer 133

A

home interviews and PEs
