Sample Size Calculator for Logistic Regression

Logistic Regression Sample Size Calculator

Below is a table summarizing key aspects related to sample size for logistic regression.

AspectDescription
Effect SizeA measure of the strength of the relationship between the independent and dependent variables.
Significance Level (α)The probability of rejecting the null hypothesis when it is true, commonly set at 0.05.
Power (1 – β)The probability of correctly rejecting the null hypothesis; commonly set at 0.80 or 0.90.
Number of PredictorsThe number of independent variables included in the model, which influences sample size.
Event RateThe proportion of the outcome occurring in the sample; can affect the required sample size.
Sample Size FormulaSample size (N) can be estimated using formulas such as:
N = (Zα/2 + Zβ)² × (p(1-p)) / d², where:
Zα/2 = critical value for significance level
Zβ = critical value for power
p = estimated event rate
d = margin of error (difference in proportions of the event rate)
Sample Size CalculatorsOnline tools and software (e.g., G*Power, R packages) can help estimate sample sizes.
GuidelinesAs a rule of thumb, at least 10-15 events per predictor variable is often recommended.
Final ConsiderationsConsider potential loss to follow-up or incomplete data when determining the final sample size.

Example Calculation

  • For a logistic regression model with:
    • Two predictors
    • An expected event rate of 20%
    • A significance level of 0.05
    • A power of 0.80

Using a sample size calculator or software, you would input these parameters to find the necessary sample size for your study.

Resources

  • Online calculators and software like G*Power or R’s pwr package can assist in calculating sample size for logistic regression based on the parameters you specify.

This table provides a concise overview of important factors to consider when determining the sample size for logistic regression analyses.

Leave a Comment