STA 2023 Exam 1

8 September 2022

question
Statistics
answer
the science of collecting, analyzing, interpreting, and presenting data
question
Categorical
answer
data that is described using words or categories; qualitative
question
Quantitative
answer
data that is described using numbers; can be averaged
question
Graphical
answer
categorical and quantitative
question
Categorical
answer
bar charts (bars do not touch) and pie charts
question
Quantitative
answer
histograms (bars usually touch), stem plots, box plots, and dot plots
question
Numerically
answer
center and spread
question
Center
answer
mean, median, mode
question
Mean
answer
the 'average'; the balancing point of the data; the deviations of the data points from the mean always add up to zero
question
Median
answer
the middle value of the ordered data; the 50th percentile; falls in the (n+1)/2 position; has 50% of the observations on either side of it; robust, meaning it is not very sensitive to outliers
question
Mode
answer
the most frequently occurring value; the measure of center that represents the most common observation or class of observations
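A minimal sketch of the three measures of center, using Python's standard `statistics` module. The data values are made up for illustration; the variable names are not from the course.

```python
from statistics import mean, median, mode

data = [2, 4, 4, 5, 7, 9, 11]  # made-up values, already in order

center_mean = mean(data)      # the balancing point
center_median = median(data)  # the middle ordered value
center_mode = mode(data)      # the most frequently occurring value

# Deviations from the mean cancel out, so they add up to zero.
deviation_sum = sum(x - center_mean for x in data)

# 1-based position of the median in the ordered list: (n + 1) / 2.
median_position = (len(data) + 1) / 2

print(center_mean, center_median, center_mode, deviation_sum, median_position)
```

With an even number of observations, (n+1)/2 lands halfway between two positions, and `median` averages the two middle values.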
question
Spread
answer
range, variance, standard deviation, IQR
question
Range
answer
max - min; the measure of spread that is affected most by outliers
question
Variance
answer
represents the 'typical' squared distance from the mean; a measure of spread around the mean, but in squared units, so its units are not the same as those of the data points
question
Standard deviation
answer
represents the 'typical' distance from the mean, in units of data, measure of spread that is smaller for distributions where the points are clustered around the middle, cannot be negative
question
IQR
answer
interquartile range, Q3-Q1 where Q3= 75th percentile (the median of the 'top' half) and Q1= 25th percentile (the median of the 'bottom' half), gives the spread of the central (middle) 50% of the data set
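A short sketch of the four measures of spread on made-up data. Two assumptions that the cards do not state: the variance and standard deviation are the sample versions (dividing by n - 1), and for an odd number of observations the median is left out of both halves when finding Q1 and Q3.

```python
from statistics import median, variance, stdev

data = sorted([2, 4, 4, 5, 7, 9, 11])  # made-up values

data_range = max(data) - min(data)  # max - min
sample_variance = variance(data)    # in squared units
sample_sd = stdev(data)             # back in the units of the data

# Q1 and Q3 as the medians of the bottom and top halves.
half = len(data) // 2
lower_half = data[:half]
upper_half = data[-half:]  # for odd n, the median itself is in neither half
q1 = median(lower_half)
q3 = median(upper_half)
iqr = q3 - q1              # spread of the middle 50% of the data

print(data_range, sample_variance, sample_sd, q1, q3, iqr)
```

Software packages use several slightly different quartile conventions, so Q1 and Q3 from another tool may differ a little from these.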
question
Outliers
answer
any observations that are significantly far away from the rest of the data points
question
Skewed right
answer
mean > median > mode
question
Normal
answer
bell curve, mean=median=mode
question
Skewed left
answer
mean < median < mode
question
Bimodal
answer
mean=median (when the two peaks are symmetric)
question
Bell shaped with an outlier
answer
mean>median when the outlier is on the high side (mean<median when it is on the low side)
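A small sketch of why an outlier separates the mean from the median. The numbers are invented: a symmetric set, then the same set with one high outlier added.

```python
from statistics import mean, median

symmetric = [10, 11, 12, 13, 14]  # made-up symmetric data
with_outlier = symmetric + [100]  # same data plus one high outlier

mean_before, median_before = mean(symmetric), median(symmetric)
mean_after, median_after = mean(with_outlier), median(with_outlier)

# The mean is pulled toward the outlier; the median barely moves.
print(mean_before, median_before)
print(mean_after, median_after)
```

The same pull explains the skewed shapes: a long right tail drags the mean above the median, and a long left tail drags it below.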
question
Rectangular/uniform
answer
mean=median
question
Discrete
answer
countable
question
Continuous
answer
measurable
question
Residuals
answer
vertical distance between the points and the line, sum to 0, error=actual-predicted
question
Least square regression line
answer
the line that minimizes the sum of the squared residuals
question
b
answer
slope, as x increases by 1, y is predicted to increase/decrease by b
question
a
answer
y intercept, when x=0, y is predicted to be a
question
r
answer
correlation coefficient; has the same sign as the slope; measures the strength and direction of the linear relationship; between -1 and 1; not affected by the units of x and y
question
r^2
answer
coefficient of determination; the percent of the variation in y that is explained by x; between 0% and 100%; 1-r^2 = the fraction of the variation in y that is NOT explained by x; a larger r^2 means a better fit
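A sketch of the regression quantities in the cards above, computed by hand for the line y = a + bx. The data points are made up, and the helper names (`sxy`, `sxx`, `syy`) are just labels chosen for this example.

```python
from math import sqrt

x = [1, 2, 3, 4, 5]  # made-up data
y = [2, 4, 5, 4, 5]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sums of squares and cross products around the means.
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
sxx = sum((xi - x_bar) ** 2 for xi in x)
syy = sum((yi - y_bar) ** 2 for yi in y)

b = sxy / sxx              # slope
a = y_bar - b * x_bar      # y intercept
r = sxy / sqrt(sxx * syy)  # correlation coefficient
r_squared = r ** 2         # coefficient of determination

# Residual = actual - predicted; residuals of the fitted line sum to 0.
predicted = [a + b * xi for xi in x]
residuals = [yi - pi for yi, pi in zip(y, predicted)]

print(a, b, r, r_squared, sum(residuals))
```

Here r and b are both positive, which matches the rule that r has the same sign as the slope.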
question
Probabilities
answer
determined by the proportion of times the event(s) will occur in a long series of independent trials (law of large numbers)
question
Complement rule
answer
P(not A) = 1-P(A)
question
At least 1
answer
P(at least 1) = 1 - P(none)
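A worked example of the "at least 1" shortcut, assuming four independent rolls of a fair die (a scenario invented for illustration). Exact fractions are used so that nothing is rounded.

```python
from fractions import Fraction

p_six = Fraction(1, 6)  # chance of a six on one roll
rolls = 4               # made-up number of independent rolls

# P(no sixes) multiplies the per-roll chance of "not six".
p_none = (1 - p_six) ** rolls

# Complement rule: P(at least one six) = 1 - P(no sixes).
p_at_least_one = 1 - p_none

print(p_none, p_at_least_one, float(p_at_least_one))
```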
question
Additive rule
answer
P(A or B) = P(A) + P(B) - P(A and B)
question
Additive rule for disjoint events
answer
P(A or B) = P(A) + P(B)
question
Multiplicative Rule
answer
P(A and B) = P(A) x P(B) for independent events; in general, P(A and B) = P(A) x P(B|A)
question
Conditional probabilities
answer
P(A|B) = P(A and B)/P(B) or P(B|A) = P(A and B)/P(A)
question
Disjoint
answer
mutually exclusive, cannot occur together
question
Independent
answer
occurrence of one does not affect the probability of the other
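A sketch that checks the probability rules above on one roll of a fair die. The events A (an even number) and B (a number greater than 3) are chosen only for illustration.

```python
from fractions import Fraction

outcomes = {1, 2, 3, 4, 5, 6}  # one roll of a fair die
A = {2, 4, 6}                  # even number
B = {4, 5, 6}                  # greater than 3


def prob(event):
    """Probability of an event when all outcomes are equally likely."""
    return Fraction(len(event), len(outcomes))


p_a, p_b = prob(A), prob(B)
p_a_and_b = prob(A & B)

p_not_a = 1 - p_a                 # complement rule
p_a_or_b = p_a + p_b - p_a_and_b  # additive rule
p_a_given_b = p_a_and_b / p_b     # conditional probability

disjoint = len(A & B) == 0             # can A and B occur together?
independent = p_a_and_b == p_a * p_b   # does the multiplicative rule hold?

print(p_not_a, p_a_or_b, p_a_given_b, disjoint, independent)
```

Here P(A|B) differs from P(A), so knowing B changes the chance of A and the two events are not independent.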
question
Exact
answer
binompdf(n,p,x), the probability of exactly x successes in n trials
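For reference, a Python sketch of what the calculator function computes. The function below borrows the calculator's name and argument order for readability; it is an illustration, not the calculator's own code.

```python
from math import comb


def binompdf(n, p, x):
    """P(X = x) for X ~ B(n, p): exactly x successes in n trials."""
    return comb(n, x) * p ** x * (1 - p) ** (n - x)


# Made-up example: exactly 3 heads in 10 fair coin flips.
p_exactly_3 = binompdf(10, 0.5, 3)

# The probabilities of all possible counts add up to 1.
total = sum(binompdf(10, 0.5, k) for k in range(11))

print(p_exactly_3, total)
```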
question
Sensitivity
answer
the probability that the condition is correctly determined to exist in a subject who has it (true positive rate)
question
Specificity
answer
the probability that the condition is correctly determined to not exist in a subject who does not have it (true negative rate)
question
False positive
answer
the condition is incorrectly determined to exist in the subject
question
False negative
answer
the condition is incorrectly determined to not exist in the subject
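A sketch of how the four screening-test outcomes above turn into rates. The counts are hypothetical and were chosen only to keep the arithmetic easy.

```python
# Hypothetical screening results (invented counts).
true_positive = 90    # has the condition, test says yes
false_negative = 10   # has the condition, test says no
true_negative = 950   # no condition, test says no
false_positive = 50   # no condition, test says yes

with_condition = true_positive + false_negative
without_condition = true_negative + false_positive

sensitivity = true_positive / with_condition     # P(test + | condition)
specificity = true_negative / without_condition  # P(test - | no condition)
false_negative_rate = 1 - sensitivity
false_positive_rate = 1 - specificity

print(sensitivity, specificity, false_negative_rate, false_positive_rate)
```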
question
Discrete RV's
answer
binomial and non binomial, RV's which assume a finite number of outcomes, countable, probability of an exact value can be computed
question
Binomial RV's
answer
collection of yes/no (binary) outcomes
question
Non binomial RV's
answer
many types (distinguishing specific type is not required)
question
Continuous RV's
answer
uniformly distributed and normally distributed; RV's which assume an infinite number of outcomes; measurable; the probability of any exact value is 0, so only probabilities over an interval are computed
question
Uniformly distributed RV's
answer
outcomes are equally likely
question
Normally distributed RV's
answer
should be told the population is normal, z score can be used
question
Random variable
answer
a variable that represents the numerical outcome(s) of a random phenomenon
question
Binomial distribution
answer
each observation is binary (success/failure); the probability of success p is constant; the number of observations n is fixed; the n observations are independent; X~B(n,p)
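A sketch of the mean and SD of a binomial RV, using the standard formulas mean = np and SD = sqrt(np(1-p)). These formulas are not written out in the cards, and the values of n and p are made up. The shortcuts are checked against the definition by summing over every possible count.

```python
from math import comb, sqrt

n, p = 20, 0.25  # made-up binomial setting, X ~ B(n, p)

# Shortcut formulas for a binomial RV.
binomial_mean = n * p
binomial_sd = sqrt(n * p * (1 - p))

# Cross-check from the definition, weighting each count by its probability.
pmf = [comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)]
mean_from_pmf = sum(k * pk for k, pk in enumerate(pmf))
var_from_pmf = sum((k - mean_from_pmf) ** 2 * pk for k, pk in enumerate(pmf))

print(binomial_mean, binomial_sd, mean_from_pmf, sqrt(var_from_pmf))
```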
question
Binomial
answer
two outcomes, mean and SD
question
Non binomial
answer
three or more outcomes, mean
question
Expected
answer
=mean=average
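A sketch of the expected value of a discrete RV, computed as the sum of each value times its probability. The three-outcome distribution is invented for illustration.

```python
# Made-up discrete distribution with three outcomes.
values = [0, 1, 2]
probabilities = [0.2, 0.5, 0.3]

# A valid distribution has probabilities that add up to 1.
total_probability = sum(probabilities)

# Expected value = mean = sum of value x probability.
expected_value = sum(v * p for v, p in zip(values, probabilities))

print(total_probability, expected_value)
```

The expected value need not be one of the possible outcomes; it is the long-run average over many repetitions.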
question
More than x times
answer
doesn't include x
question
Less than x times
answer
doesn't include x
question
At least x times
answer
includes x
question
At most x times
answer
includes x
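A sketch of how the four phrases above change which counts are added up for a binomial RV. The setting (n = 5, p = 0.5, x = 2) is made up, and `binom_pmf` is a helper defined here for the example.

```python
from math import comb


def binom_pmf(n, p, k):
    """P(X = k) for X ~ B(n, p)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)


n, p, x = 5, 0.5, 2  # made-up setting

more_than = sum(binom_pmf(n, p, k) for k in range(x + 1, n + 1))  # X > x
less_than = sum(binom_pmf(n, p, k) for k in range(0, x))          # X < x
at_least = sum(binom_pmf(n, p, k) for k in range(x, n + 1))       # X >= x
at_most = sum(binom_pmf(n, p, k) for k in range(0, x + 1))        # X <= x

print(more_than, less_than, at_least, at_most)
```

"More than" and "at most" are complements of each other, as are "less than" and "at least", so each pair adds up to 1.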
question
Probability
answer
for a uniformly distributed RV, =(UV-LV)/(Max-Min), where LV and UV are the lower and upper values of the interval
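A sketch of the formula above, assuming it is meant for a uniformly distributed RV and that LV and UV are the lower and upper values of an interval inside [Min, Max]. The function name and the example numbers are made up.

```python
def uniform_probability(lv, uv, minimum, maximum):
    """P(LV < X < UV) for X uniform on [minimum, maximum]."""
    if not (minimum <= lv <= uv <= maximum):
        raise ValueError("expected minimum <= lv <= uv <= maximum")
    return (uv - lv) / (maximum - minimum)


# Made-up example: X uniform between 0 and 10, interval from 2 to 5.
p_between = uniform_probability(2, 5, 0, 10)

# An exact value has zero width, so its probability is 0.
p_exact = uniform_probability(4, 4, 0, 10)

print(p_between, p_exact)
```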
question
Value
answer
given a value, find the probability: normalcdf(LV,UV,mean,SD); for z use 0 as the mean and 1 as the SD
question
%
answer
given a percent (area), find the value: invNorm(area to the left,mean,SD); for z use 0 as the mean and 1 as the SD
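A sketch of the two calculator functions for normal RVs, written with Python's `statistics.NormalDist`. The wrappers borrow the calculator's argument order and are illustrations only; the example numbers (mean 100, SD 15) are made up.

```python
from statistics import NormalDist


def normalcdf(lv, uv, mean=0.0, sd=1.0):
    """Area under the normal curve between LV and UV."""
    dist = NormalDist(mean, sd)
    return dist.cdf(uv) - dist.cdf(lv)


def inv_norm(area_left, mean=0.0, sd=1.0):
    """Value that has the given area to its left."""
    return NormalDist(mean, sd).inv_cdf(area_left)


# z scale: mean 0 and SD 1.
p_within_one_sd = normalcdf(-1, 1)
z_975 = inv_norm(0.975)

# Made-up example on the original scale: mean 100, SD 15.
z_score = (130 - 100) / 15  # z = (value - mean) / SD
p_same_from_x = normalcdf(100, 130, 100, 15)
p_same_from_z = normalcdf(0, z_score)

print(p_within_one_sd, z_975, p_same_from_x, p_same_from_z)
```

The last two probabilities agree, which is why converting to a z score and using mean 0 and SD 1 gives the same answer as working on the original scale.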