# Study Skills

What is qualitative data

non-numerical data that describe qualities or characteristics, gathered through methods such as observation.

Types of qualitative data

nominal, ordinal, (binary)

What is nominal

Variables with NO order or ranking. E.g. race or gender

What is ordinal

Can be quantitative or qualitative. Variables with an order and ranking. E.g. a ranking of 1-5, or a ranking from outstanding to disappointing

How to collect qualitative

interviews, written statements and documents

What is quantitative

measures of values or counts, expressed as numbers

Types of quantitative data

continuous data or discrete data

How to collect quantitative data

surveys, observation, experiments and interviews

What is continuous data

When the values in the set can take on ANY value within an interval. The interval can be finite or infinite.

What is discrete data

values are distinct and separate

Difference between discrete and continuous

continuous data can be measured, whereas discrete data can be counted

Example of continuous data

can be any number, even a decimal, within an interval. E.g. in [0,70] you can have 2.5. Height of a child

Examples of discrete data

the numbers you can get on a die (you cannot get 2.5). Number of languages spoken. Ordinal data are discrete

Advantages of using old data for research

useful, low effort, saves time, ethical considerations have already been dealt with

Disadvantages of using old data for research

no control over how the data was collected, might not align with research aims, data needs scrutiny before use

Disadvantage of continuous data

observer error can be reduced but CAN'T be ELIMINATED. Measurements are limited by the precision of the instruments

When to use categorical nominal

when the data CANNOT be put into a meaningful order

When to use categorical ordinal

when the data CAN be put into a meaningful order
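
The nominal/ordinal split above can be sketched in a few lines of Python. This is only an illustration: the eye colours, the ratings, and the four-point scale are made-up examples, not data from these notes.

```python
from collections import Counter

# Nominal: labels have no order, so counting (and the mode) is all that makes sense.
eye_colour = ["brown", "blue", "brown", "green", "brown"]
most_common_colour = Counter(eye_colour).most_common(1)[0][0]

# Ordinal: labels have a meaningful order (this particular scale is hypothetical).
scale = ["disappointing", "adequate", "good", "outstanding"]
rank = {label: position for position, label in enumerate(scale)}
ratings = ["good", "outstanding", "adequate", "good", "disappointing"]

# Because an order exists, the ratings can be sorted and a median picked.
sorted_ratings = sorted(ratings, key=lambda label: rank[label])
median_rating = sorted_ratings[len(sorted_ratings) // 2]

print(most_common_colour)
print(median_rating)
```

Sorting or taking a median of the eye colours would be meaningless, which is the practical test for "nominal".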

The significance of p value

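p lower than the significance level (usually 0.05) = significant = evidence that the data do NOT come from the same population

what is p value

the probability of getting a difference at least as large as the one observed if the samples being compared come from the same population, which you don't want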

What is p hacking

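1) when you keep collecting data and re-testing until p<0.05, then stop. 2) data manipulation. 3) excessive stats (running many tests)

How does collecting too much data cause p-hacking

it becomes biased: if you keep adding data and checking, p will sooner or later dip below 0.05 by chance, so the "significant" result is no longer trustworthy

how do too many stats cause p-hacking

the more tests you run, the more likely it is that a significant result happens by chance alone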
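
A quick way to see why many tests inflate chance findings: if every test uses a 0.05 threshold and the tests are independent (an assumption made here for simplicity), the chance of at least one false positive is 1 - (1 - 0.05)^k for k tests. The function name below is made up for this sketch.

```python
def familywise_error_rate(num_tests: int, alpha: float = 0.05) -> float:
    """Chance of at least one false positive across independent tests."""
    return 1 - (1 - alpha) ** num_tests


for k in (1, 5, 20):
    # Prints roughly 0.05, 0.226 and 0.642.
    print(k, round(familywise_error_rate(k), 3))
```

With 20 tests, a "significant" result somewhere is more likely than not, even if nothing real is going on.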

The types of error when comparing data

type 1 = false positive, type 2 = false negative

What is a type 1 error when collecting data

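WRONGLY accepting a relationship that is not really there (rejecting a null hypothesis that is true). "guilty until proven innocent"

What is a type 2 error when collecting data

FAILING to accept a relationship that really is there (keeping a null hypothesis that is false). "innocent until proven guilty"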
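
The two error types can be laid out as a small decision table. The function and argument names are invented for this sketch; in real research you never know `relationship_exists` for certain, which is why the errors matter.

```python
def classify_outcome(relationship_exists: bool, test_says_significant: bool) -> str:
    """Label the result of a test against the (normally unknown) truth."""
    if test_says_significant and not relationship_exists:
        return "type 1 error (false positive)"
    if not test_says_significant and relationship_exists:
        return "type 2 error (false negative)"
    return "correct decision"


print(classify_outcome(relationship_exists=False, test_says_significant=True))
print(classify_outcome(relationship_exists=True, test_says_significant=False))
```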

when to use chi-squared

for categorical data

what are the conditions for chi-squared

must be actual data (raw, proportional, ratio), simple random sample, categorical, contingency table with no expected count <5

what kind of data is NOT accepted as "actual" data for chi-squared

percentages
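
A minimal sketch of the chi-squared calculation on a contingency table of raw counts, written with plain Python so every step is visible. The 2x2 table is made-up data and the function name is an assumption of this sketch. Expected counts are (row total x column total) / grand total, and the statistic sums (observed - expected)^2 / expected over all cells.

```python
def chi_squared(table):
    """Return (statistic, degrees of freedom, expected counts) for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]
    statistic = sum(
        (obs - exp) ** 2 / exp
        for obs_row, exp_row in zip(table, expected)
        for obs, exp in zip(obs_row, exp_row)
    )
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, dof, expected


observed = [[20, 30], [30, 20]]  # hypothetical counts, NOT percentages
statistic, dof, expected = chi_squared(observed)

# Condition check: every expected count should be at least 5.
smallest_expected = min(min(row) for row in expected)
print(statistic, dof, smallest_expected)
```

For 1 degree of freedom the 0.05 critical value is 3.841, so a statistic above that would count as significant.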

Types of test for continuous data

a parametric test or a non-parametric test

what makes it suitable to use a parametric test for continuous data

normally distributed data, independent samples from 2 populations, the 2 populations have similar variance, no outliers

what makes it suitable to use a non-parametric test for continuous data

any continuous data, because the test works on the ranks of the data values rather than the values themselves

types of PARAMETRIC test (for data that is normally distributed)

t-test, ANOVA, regression

what is a t-test

a test comparing 2 samples of the SAME continuous variable

which hypothesis should you disprove

null, p<0.05 to have a significant difference

what does the t-test do

compares the mean and dispersion of 2 samples so you can establish if the data came from the same population
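
As a rough sketch of how the mean and dispersion are combined, here is the classic equal-variance (Student's) two-sample t statistic using only the standard library. The function name and the two samples are made up for illustration; turning t into a p value needs t-distribution tables or a stats package, which is left out here.

```python
import math
import statistics


def students_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom), assuming similar variances."""
    n_a, n_b = len(sample_a), len(sample_b)
    var_a = statistics.variance(sample_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(sample_b)
    dof = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / dof
    standard_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / standard_error
    return t, dof


t, dof = students_t([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])
print(t, dof)
```

The bigger the gap between the means relative to the spread, the larger |t| becomes and the smaller the p value.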

what are the criteria for a t-test

continuous, 2 groups, independent, random samples, no outliers, normal distribution, similar variance

what happens when the criteria for a t-test are NOT met

use a log-transformation, use a less sensitive test

what is a log transformation

it addresses skewed data by decreasing the variability. This brings the data closer to the normal distribution
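
A small sketch of the effect, assuming made-up right-skewed data (all values must be positive for a log to work). Skewness is computed here as the average cubed z-score, which is one common definition; the helper name is an assumption of this sketch.

```python
import math
import statistics


def skewness(values):
    """Population skewness: the mean of the cubed z-scores."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((x - mean) / sd) ** 3 for x in values) / len(values)


raw = [1, 2, 2, 3, 3, 4, 5, 8, 20, 60]  # hypothetical, long right tail
logged = [math.log(x) for x in raw]

print(round(skewness(raw), 2), round(skewness(logged), 2))
print(round(statistics.pvariance(raw), 2), round(statistics.pvariance(logged), 2))
```

Both the skewness and the variance drop after the transformation, because the log pulls the large values in much more than the small ones.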

what are the less sensitive tests when the criteria for the t-test AREN'T met

Welch's test (parametric), Mann-Whitney U test or Kruskal-Wallis test (both ordinal)
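
For reference, a sketch of Welch's version of the t statistic, which drops the similar-variance requirement by keeping each sample's variance separate and adjusting the degrees of freedom (the Welch-Satterthwaite formula). The function name and data are made up for this example.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Return (t statistic, approximate degrees of freedom) without pooling variances."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Variance of each sample mean.
    se2_a = statistics.variance(sample_a) / n_a
    se2_b = statistics.variance(sample_b) / n_b
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(se2_a + se2_b)
    dof = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, dof


t, dof = welch_t([1, 2, 3, 4, 5], [2, 3, 4, 5, 6])
print(t, dof)
```

When the two samples have equal size and equal variance, as in this example, the result matches the ordinary t-test; the two only differ when the variances or sample sizes are unequal.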

when to use Mann-Whitney U test (ordinal)

when t-test won't work and there are 2 samples
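
A sketch of how the Mann-Whitney U statistic is built from ranks rather than raw values. All names and data here are made up; ties share the average of the ranks they cover. Looking up significance needs U tables or a stats package and is not shown.

```python
def average_ranks(values):
    """Rank values from 1 upwards; tied values share their average rank."""
    ordered = sorted(values)
    rank_of = {}
    for value in set(values):
        first_index = ordered.index(value)  # 0-based position of first occurrence
        count = ordered.count(value)
        rank_of[value] = first_index + (count + 1) / 2
    return [rank_of[value] for value in values]


def mann_whitney_u(sample_a, sample_b):
    """Return the smaller of the two U values."""
    ranks = average_ranks(list(sample_a) + list(sample_b))
    n_a, n_b = len(sample_a), len(sample_b)
    rank_sum_a = sum(ranks[:n_a])
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u_b = n_a * n_b - u_a
    return min(u_a, u_b)


print(mann_whitney_u([1, 2, 3], [4, 5, 6]))  # no overlap at all
print(mann_whitney_u([1, 4, 5], [2, 3, 6]))  # heavily interleaved
```

A U near 0 means the two samples barely overlap; a U near half of n_a x n_b means they are thoroughly mixed.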

when to use Kruskal-Wallis test (ordinal)

when t-test won't work and there are more than 2 samples
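
A sketch of the Kruskal-Wallis H statistic for three or more groups, again working on ranks. Names and data are made up, and the usual correction for ties is left out to keep the example short. With N values in total, H = 12 / (N(N+1)) x sum(R_i^2 / n_i) - 3(N+1), where R_i is the rank sum of group i.

```python
def average_ranks(values):
    """Rank values from 1 upwards; tied values share their average rank."""
    ordered = sorted(values)
    rank_of = {}
    for value in set(values):
        first_index = ordered.index(value)
        count = ordered.count(value)
        rank_of[value] = first_index + (count + 1) / 2
    return [rank_of[value] for value in values]


def kruskal_wallis_h(groups):
    """Return (H statistic, degrees of freedom); no tie correction applied."""
    pooled = [x for group in groups for x in group]
    ranks = average_ranks(pooled)
    total = len(pooled)
    weighted_rank_sums = 0.0
    start = 0
    for group in groups:
        rank_sum = sum(ranks[start:start + len(group)])
        weighted_rank_sums += rank_sum ** 2 / len(group)
        start += len(group)
    h = 12 / (total * (total + 1)) * weighted_rank_sums - 3 * (total + 1)
    return h, len(groups) - 1


h, dof = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print(h, dof)
```

H is compared against the chi-squared distribution; for 2 degrees of freedom the 0.05 critical value is 5.991.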

what tests can you do for ordinal data

Mann-Whitney U, Kruskal-Wallis. Likert scales are a common source of ordinal data

When to use Likert scales (ordinal)

when there is a 5-7 point scale; the average of several items is commonly treated as interval data

What is ANOVA

Analysis of variance

Criteria for ANOVA

Observations independent. No outliers. Each sample is normally distributed. Variance is roughly equal
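
A sketch of what "analysis of variance" computes in the one-way case: the variation between group means is compared with the variation within groups, giving an F ratio. The function name and the three groups are made up for illustration; converting F into a p value needs F tables or a stats package.

```python
import statistics


def one_way_anova_f(groups):
    """Return (F statistic, df between groups, df within groups)."""
    all_values = [x for group in groups for x in group]
    grand_mean = statistics.fmean(all_values)
    ss_between = sum(
        len(group) * (statistics.fmean(group) - grand_mean) ** 2 for group in groups
    )
    ss_within = sum(
        (x - statistics.fmean(group)) ** 2 for group in groups for x in group
    )
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f = (ss_between / df_between) / (ss_within / df_within)
    return f, df_between, df_within


f, df_between, df_within = one_way_anova_f([[1, 2, 3], [2, 3, 4], [3, 4, 5]])
print(f, df_between, df_within)
```

A large F means the group means differ by more than the spread inside the groups would explain.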

How to check that each of ANOVA's samples is normally distributed

large sample size (≥2), checked graphically, tested using the Shapiro-Wilk test, test the skewness, test the kurtosis

the p values for each of the tests used to check normal distribution for ANOVA

Shapiro-Wilk :) p=0.150 :? 0.128 :/ 0.32. Skewness :) 0.533 :? 0.662 :/ 1.105

How would you compare two sets of continuous data on the same sample

regression (Pearson product-moment correlation)

Criteria for regression test

continuous
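
A sketch of the Pearson product-moment correlation for two continuous measurements taken on the same subjects. The function name and data are made up; r runs from -1 (perfect negative straight-line relationship) to +1 (perfect positive).

```python
import math
import statistics


def pearson_r(x, y):
    """Pearson correlation between paired lists of equal length."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    co_deviation = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return co_deviation / (spread_x * spread_y)


heights = [1.0, 2.0, 3.0, 4.0]  # hypothetical paired measurements
weights = [2.0, 4.0, 6.0, 8.0]
print(pearson_r(heights, weights))
print(pearson_r(heights, list(reversed(weights))))
```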

