Although statisticians should clearly define the population they are dealing with, they may not be able to enumerate it exactly. Before drawing a sample, the investigator should define the population from which it is to come. To guard against the possibility that a random sample under-represents some group purely by chance, the sampling may be stratified. This means that a framework is laid down initially, and the patients or objects of the study in a random sample are then allotted to the compartments of the framework. Another use of random number tables is to randomise the allocation of treatments to patients in a clinical trial.
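The two ideas above, stratified sampling and random allocation of treatments, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the patient records, the age bands used as strata, and the function names stratified_sample and allocate_treatments are all hypothetical, and a seeded pseudo-random generator stands in for a table of random numbers. The sketch uses the common variant in which a separate random sample is drawn inside each compartment of the framework.

```python
import random
from collections import defaultdict


def stratified_sample(units, stratum_of, per_stratum, rng):
    """Draw a simple random sample of up to `per_stratum` units from each stratum."""
    strata = defaultdict(list)
    for unit in units:
        # Allot every unit to its compartment of the framework.
        strata[stratum_of(unit)].append(unit)
    sample = {}
    for name, members in strata.items():
        k = min(per_stratum, len(members))
        sample[name] = rng.sample(members, k)  # sampling without replacement
    return sample


def allocate_treatments(patients, treatments, rng):
    """Shuffle the patients, then deal treatments out in turn.

    Group sizes therefore differ by at most one.
    """
    shuffled = list(patients)
    rng.shuffle(shuffled)
    return {p: treatments[i % len(treatments)] for i, p in enumerate(shuffled)}


rng = random.Random(2024)  # fixed seed so the sketch is repeatable

# Hypothetical population: 60 patient records, each tagged with an age band.
bands = ("under 40", "40-64", "65+")
population = [{"id": i, "age_band": bands[i % 3]} for i in range(60)]

sample = stratified_sample(population, lambda p: p["age_band"], per_stratum=5, rng=rng)
chosen_ids = [p["id"] for members in sample.values() for p in members]
allocation = allocate_treatments(chosen_ids, ["treatment", "control"], rng)

for band, members in sample.items():
    print(band, sorted(p["id"] for p in members))
print(allocation)
```

Because each stratum is sampled separately, no age band can be missed by chance, which is the point of stratifying. The allocation step is a simple shuffle-and-deal scheme; real trials often use more elaborate designs such as blocked randomisation.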

It is important to realise that patients in a randomised trial are not a random sample from the population of people with the disease in question but rather a highly selected set of eligible and willing patients. The more members of a population that are included in a sample, the better the chance that the sample will accurately represent the population, provided a random process is used to construct the sample.

It is also important to realise that we do not have to take repeated samples in order to estimate the standard error; there is sufficient information within a single sample. A standard deviation is a sample estimate of the population parameter; that is, it is an estimate of the variability of the observations. If the purpose is to describe the outcome of a study, for example to estimate the prevalence of a disease or the mean height of a group, then one should use a standard error (or, better, a confidence interval; see chapter 4). A mnemonic: e for estimate and e for error.
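As a minimal sketch of how a single sample supplies both quantities, the Python below computes the standard deviation, the standard error of the mean (the standard deviation divided by the square root of the sample size), and an approximate 95% confidence interval. The heights are made-up values for illustration, the function name describe_sample is hypothetical, and the multiplier 1.96 assumes a large-sample normal approximation; for a sample this small, a t-distribution multiplier would give a somewhat wider interval.

```python
import math
import statistics


def describe_sample(values, z=1.96):
    """Summarise one sample: mean, SD, standard error, approximate 95% CI."""
    n = len(values)
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample SD (n - 1 in the denominator)
    se = sd / math.sqrt(n)         # standard error of the mean
    return {
        "n": n,
        "mean": mean,
        "sd": sd,
        "se": se,
        "ci": (mean - z * se, mean + z * se),
    }


# Made-up heights in centimetres, for illustration only.
heights_cm = [162.0, 175.5, 168.2, 181.0, 159.4, 170.3, 177.8, 165.1, 172.6, 169.9]

summary = describe_sample(heights_cm)
print(f"mean = {summary['mean']:.1f} cm")
print(f"SD   = {summary['sd']:.1f} cm  (variability of the observations)")
print(f"SE   = {summary['se']:.1f} cm  (precision of the estimated mean)")
print(f"approx. 95% CI = {summary['ci'][0]:.1f} to {summary['ci'][1]:.1f} cm")
```

The standard deviation describes how much individual heights vary, while the standard error describes how precisely the mean has been estimated; the latter shrinks as the sample grows, the former does not.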

In statistics the term "population" has a slightly different meaning from the one given to it in ordinary speech. The word "random" does not describe the sample as such but the way in which it is selected.

The study of statistics revolves around the study of data sets. This lesson describes two kinds of data sets: populations and samples. Example: the population may be "all people living in the US." By definition, a sample data set contains a part, or a subset, of a population. A good sample reflects the characteristics of the population, so the sample findings can be generalized to the population. Remember that estimates of population parameters are often based on sample statistics.

