Probability and Distribution | Bài giảng số 8 chương 4 học phần Applied statistics | Trường Đại học Quốc tế, Đại học Quốc gia Thành phố Hồ Chí Minh

Whenever a simple random sample is selected and the value of the sample mean is used to estimate the value of the population mean µ, we cannot expect the sample mean to exactly equal the population mean. The practical reason we are interested in the sampling distribution of x is that it can be used to provide probability information about the difference between the sample mean and the population mean. Tài liệu giúp bạn tham khảo, ôn tập và đạt kết quả cao. Mời bạn đón xem.

APPLIED STATISTICS
COURSE CODE: ENEE1006IU
Lecture 8:
Chapter 4: Probability and Distribuon
(3 credits: 2 is for lecture, 1 is for lab-work)
Instructor: TRAN THANH TU Email:
tu@hcmiu.edu.vn
tu@hcmiu.edu.vn 1
tu@hcmiu.edu.vn 2
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•The sampled populaon is the populaon from which the sample is drawn.
•The target populaon is the populaon we want to make inferences about
•A frame is a list of the elements that the sample will be selected from.
The sampling problems
Selecng a sample
Point esmaon
Introducon to sampling distribuons
Sampling distribuon of
Sampling distribuon of
Properes of point esmators
tu@hcmiu.edu.vn 3
Other sampling methods
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
Numerical characteriscs of a populaon are called parameters.
Sampling problems:
-Sampling errors
-Lack of sample representaveness
-Diculty in esmaon of sample size
-Lack of knowledge about the sampling process
-Lack of resources
-Lack of cooperaon
-Lack of exisng appropriate sampling frames for larger populaon
tu@hcmiu.edu.vn 4
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Selecng a sample:
Sampling from a Finite Populaon: A simple random sample of size n from a
nite populaon of size N is a sample selected such that each possible sample of
size n has the same probability of being selected.
One procedure for selecng a simple random sample from a nite populaon is
to use a table of random numbers to choose the elements for the sample one at
a me in such a way that, at each step, each of the elements remaining in the
populaon has the same probability of being selected.
Sampling n elements in this way will sasfy the denion of a simple random
sample from a nite populaon.
tu@hcmiu.edu.vn 5
sampling without replacement
sampling with replacement
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
we cannot develop a list of all the elements that could
•Selecng a sample: be produced, the populaon is considered innite
Sampling from an innite Populaon: A random sample of size n from an
innite populaon is a sample selected such that the following condions are
sased:
1. Each element selected comes from the same populaon.
2. Each element is selected independently.
tu@hcmiu.edu.vn 6
Care and judgment must be exercised in implemenng the selecon process for
obtaining a random sample from an innite populaon.
prevent selecon bias
7
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Point esmaon:
To esmate the value of a populaon parameter, we compute a corresponding
characterisc of the sample, referred to as a sample stasc.
•By making the preceding computaons, we perform the stascal procedure
called point esmaon.
•We refer to:
- the sample mean as the point esmator of the populaon mean µ
- the sample standard deviaon s as the point esmator of the populaon standard
deviaon , and
tu@hcmiu.edu.vn 8
- the sample proporon as the point esmator of the populaon proporon p. The
numerical value obtained for , s, or is called the point esmate.
tu@hcmiu.edu.vn
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Introducon to sampling distribuons:
•If we consider the process of selecng a simple random sample as an experiment,
the sample mean is the numerical descripon of the outcome of the experiment.
the sample mean is a random variable
has a mean or expected value, a standard deviaon, and a probability
distribuon.
tu@hcmiu.edu.vn 9
•Because the various possible values of are the result of dierent simple random
samples, the probability distribuon of is called the sampling distribuon of .
•Knowledge of this sampling distribuon and its properes will enable us to make
probability statements about how close the sample mean is to the populaon
mean µ.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribuon of : the probability distribuon of all possible values
of the sample mean .
The sampling distribuon of has an expected value or mean, a standard
tu@hcmiu.edu.vn 10
Innite (populaon is large, some how can’t be idened)
tu@hcmiu.edu.vn 11
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribuon of : the probability distribuon of all possible values
of the sample mean .
Form of the Sampling Distribuon of :
-Populaon has a normal distribuon: the sampling distribuon of is normally
distributed for any sample size.
- Populaon does not have a normal distribuon: central limit theorem is
helpful in idenfying the shape of the sampling distribuon of
tu@hcmiu.edu.vn 12
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribuon of : the probability distribuon of all possible values
of the sample mean .
Praccal Value of the Sampling Distribuon of
tu@hcmiu.edu.vn 13
Whenever a simple random sample is selected and the value of the sample
mean is used to esmate the value of the populaon mean µ, we cannot
expect the sample mean to exactly equal the populaon mean.
The praccal reason we are interested in the sampling distribuon of is that
it can be used to provide probability informaon about the dierence
between the sample mean and the populaon mean.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribuon of : the probability distribuon of all possible values
of the sample mean .
Relaonship Between the Sample Size and the Sampling Distribuon of
As the sample size is increased, the standard error of the mean decreases.
tu@hcmiu.edu.vn 14
As a result, the larger sample size provides a higher probability that the sample
mean is within a specied distance of the populaon mean.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribuon of point esmator of the populaon proporon p.
tu@hcmiu.edu.vn 15
The sampling distribuon of
tu@hcmiu.edu.vn 16
values of the sample proporon . Expected Value of :
Standard Deviaon of
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribuon of point esmator of the populaon proporon p.
Form of the Sampling Distribuon of : The sampling distribuon of p can be
approximated by a normal distribuon whenever: np ≥ 5 and n(1 − p) ≥ 5
tu@hcmiu.edu.vn 17
Praccal Value of the Sampling Distribuon of : can be used to provide
probability informaon about the dierence between the sample proporon and
the populaon proporon.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Properes of point esmators: three properes of good point esmators:
unbiased, eciency, and consistency.
Unbiased: The sample stasc is an unbiased esmator of the populaon
parameter
Eciency: The point esmator with the smaller standard error is said to have
greater relave eciency than the other.
tu@hcmiu.edu.vn 18
Consistency: point esmator is consistent if the values of the point esmator
tend to become closer to the populaon parameter as the sample size becomes
larger. In other words, a large sample size tends to provide a beer point
esmate than a small sample size.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Properes of point esmators: three properes of good point esmators:
unbiased, eciency, and consistency.
tu@hcmiu.edu.vn 19
Unbiased:
tu@hcmiu.edu.vn 20
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Properes of point esmators: three properes of good point esmators:
unbiased, eciency, and consistency.
| 1/31

Preview text:

APPLIED STATISTICS COURSE CODE: ENEE1006IU Lecture 8:
Chapter 4: Probability and Distribution
(3 credits: 2 is for lecture, 1 is for lab-work)
Instructor: TRAN THANH TU Email: tttu@hcmiu.edu.vn tttu@hcmiu.edu.vn 1
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•The sampled population is the population from which the sample is drawn.
•The target population is the population we want to make inferences about
•A frame is a list of the elements that the sample will be selected from. The sampling problems Selecting a sample Point estimation
Introduction to sampling distributions Sampling distribution of Sampling distribution of
Properties of point estimators tttu@hcmiu.edu.vn 2 Other sampling methods
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
Numerical characteristics of a population are called parameters. Sampling problems: -Sampling errors
-Lack of sample representativeness
-Difficulty in estimation of sample size
-Lack of knowledge about the sampling process -Lack of resources -Lack of cooperation
-Lack of existing appropriate sampling frames for larger population tttu@hcmiu.edu.vn 3
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS •Selecting a sample:
Sampling from a Finite Population: A simple random sample of size n from a
finite population of size N is a sample selected such that each possible sample of
size n has the same probability of being selected.
One procedure for selecting a simple random sample from a finite population is
to use a table of random numbers to choose the elements for the sample one at
a time in such a way that, at each step, each of the elements remaining in the
population has the same probability of being selected.
Sampling n elements in this way will satisfy the definition of a simple random
sample from a finite population. tttu@hcmiu.edu.vn 4 sampling without replacement sampling with replacement
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
we cannot develop a list of all the elements that could •Selecting a sample:
be produced, the population is considered infinite
Sampling from an infinite Population: A random sample of size n from an
infinite population is a sample selected such that the following conditions are satisfied:
1. Each element selected comes from the same population.
2. Each element is selected independently. tttu@hcmiu.edu.vn 5
Care and judgment must be exercised in implementing the selection process for
obtaining a random sample from an infinite population. prevent selection bias tttu@hcmiu.edu.vn 6
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS •Point estimation:
To estimate the value of a population parameter, we compute a corresponding
characteristic of the sample, referred to as a sample statistic.
•By making the preceding computations, we perform the statistical procedure called point estimation. •We refer to:
- the sample mean as the point estimator of the population mean µ
- the sample standard deviation s as the point estimator of the population standard deviation , and 7
- the sample proportion as the point estimator of the population proportion p. The
numerical value obtained for , s, or is called the point estimate. tttu@hcmiu.edu.vn
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Introduction to sampling distributions:
•If we consider the process of selecting a simple random sample as an experiment,
the sample mean is the numerical description of the outcome of the experiment.
the sample mean is a random variable
has a mean or expected value, a standard deviation, and a probability distribution. tttu@hcmiu.edu.vn 8
•Because the various possible values of are the result of different simple random
samples, the probability distribution of is called the sampling distribution of .
•Knowledge of this sampling distribution and its properties will enable us to make
probability statements about how close the sample mean is to the population mean µ.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribution of : the probability distribution of all possible values of the sample mean .
The sampling distribution of has an expected value or mean, a standard tttu@hcmiu.edu.vn 9
Infinite (population is large, some how can’t be identified) tttu@hcmiu.edu.vn 10
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribution of : the probability distribution of all possible values of the sample mean .
Form of the Sampling Distribution of :
-Population has a normal distribution: the sampling distribution of is normally
distributed for any sample size.
- Population does not have a normal distribution: central limit theorem is
helpful in identifying the shape of the sampling distribution of tttu@hcmiu.edu.vn 11
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribution of : the probability distribution of all possible values of the sample mean .
Practical Value of the Sampling Distribution of tttu@hcmiu.edu.vn 12
Whenever a simple random sample is selected and the value of the sample
mean is used to estimate the value of the population mean µ, we cannot
expect the sample mean to exactly equal the population mean.
The practical reason we are interested in the sampling distribution of is that
it can be used to provide probability information about the difference
between the sample mean and the population mean.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribution of : the probability distribution of all possible values of the sample mean .
Relationship Between the Sample Size and the Sampling Distribution of
As the sample size is increased, the standard error of the mean decreases. tttu@hcmiu.edu.vn 13
As a result, the larger sample size provides a higher probability that the sample
mean is within a specified distance of the population mean.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribution of point estimator of the population proportion p. tttu@hcmiu.edu.vn 14 The sampling distribution of tttu@hcmiu.edu.vn 15
values of the sample proportion . Expected Value of : Standard Deviation of
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Sampling distribution of point estimator of the population proportion p.
Form of the Sampling Distribution of : The sampling distribution of p can be
approximated by a normal distribution whenever: np ≥ 5 and n(1 − p) ≥ 5 tttu@hcmiu.edu.vn 16
Practical Value of the Sampling Distribution of : can be used to provide
probability information about the difference between the sample proportion and the population proportion.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Properties of point estimators: three properties of good point estimators:
unbiased, efficiency, and consistency.
Unbiased: The sample statistic is an unbiased estimator of the population parameter
Efficiency: The point estimator with the smaller standard error is said to have
greater relative efficiency than the other. tttu@hcmiu.edu.vn 17
Consistency: point estimator is consistent if the values of the point estimator
tend to become closer to the population parameter as the sample size becomes
larger. In other words, a large sample size tends to provide a better point
estimate than a small sample size.
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Properties of point estimators: three properties of good point estimators:
unbiased, efficiency, and consistency. tttu@hcmiu.edu.vn 18 Unbiased: tttu@hcmiu.edu.vn 19
4.4. SAMPLING AND SAMPLING DISTRIBUTIONS
•Properties of point estimators: three properties of good point estimators:
unbiased, efficiency, and consistency. tttu@hcmiu.edu.vn 20