Interpreting regression models in clinical outcome
studies

D. F. Hamilton; M. Ghert; A. H. R. W. Simpson

doi:10.1302/2046-3758.49.2000571

Article alerts Social media

Current issue

Editorial

Interpreting regression models in clinical outcome studies

D. F. Hamilton
M. Ghert
A. H. R. W. Simpson

Download PDF

Interpreting regression models in clinical outcome studies

Hamilton DF, Ghert M, Simpson AHRW. Interpreting regression models in clinical outcome studies. Bone Joint Res. 2015;4(9):152-153. doi:10.1302/2046-3758.49.2000571

Hamilton, D. F., et al. “Interpreting regression models in clinical outcome studies.” Bone & Joint Research, vol. 4, no. 9, 2015, pp. 152-153., https://doi.org/10.1302/2046-3758.49.2000571

Hamilton, D. F., Ghert, M., & Simpson, A. H. R. W. (2015). Interpreting regression models in clinical outcome studies. Bone & Joint Research, 4(9), 152-153. https://doi.org/10.1302/2046-3758.49.2000571

Hamilton, D. F., Ghert, M. and Simpson, A. H. R. W. (2015) “Interpreting regression models in clinical outcome studies.” Bone & Joint Research, 4(9), pp. 152-153. Available at: https://doi.org/10.1302/2046-3758.49.2000571

Hamilton, D. F., M. Ghert, and A. H. R. W. Simpson. “Interpreting regression models in clinical outcome studies.” Bone & Joint Research 4, no. 9 (2015): 152-153. https://doi.org/10.1302/2046-3758.49.2000571

Hamilton DF, Ghert M, Simpson AHRW. Interpreting regression models in clinical outcome studies. Bone Joint Res. 2015 Sep 1;4(9):152-153. https://doi.org/10.1302/2046-3758.49.2000571

Copy to clipboard

BibTeX

EndNote

RIS

Measuring the outcome of an intervention is central to the practice of evidence based medicine, and most research papers evaluating patient outcomes now incorporate some form of patient-based metric, such as questionnaires or performance tests. Once an outcome has been defined, researchers typically want to know if any other factors can influence the result. This is typically assessed with regression analysis.

Regression analysis¹ determines the relationship of an independent variable (such as bone mineral density) on a dependent variable (such as ageing) with the statistical assumption that all other variables remain fixed. The calculation of the relationship results in a theoretical straight line, and the correlation co-efficient (r) measures how closely the observed data are to the theoretical straight line that we have calculated.

In such a linear model, we can judge how well the line fits the data (‘goodness of fit’) by calculating the coefficient of determination (or square of the regression line, R²). R²is a measure of the percentage of total variation in the dependant variable that is accounted for by the independent variable. An R² of 1.0 indicates that the data perfectly fit the linear model. Any R² value less than 1.0 indicates that at least some variability in the data cannot be accounted for by the model (e.g., an R²of 0.5 indicates that 50% of the variability in the outcome data cannot be explained by the model).

Given these statistical tools, we can use the regression equation to predict the value of the dependent variable based on the known value of independent variable. Since many variables may contribute to the outcome (dependent variable), further statistical analysis can be achieved with multiple regression analysis. These models are essentially the same as simple regression analysis, except that the multiple regression analysis equation describes the interrelationship of many variables and allows us to evaluate the joint effect of these variables on the outcome variable in question.

Poitras et al² report an interesting study this month that aims to predict length of stay and early clinical function following joint arthroplasty. Multiple linear regression analyses produced an equation based on the timed-up-and-go test, which was associated with length of stay. In addition, models based on the pre-operative WOMAC function sub-score produced the best model for describing early post-operative function (as calculated by the Older American Resources and Services ALD score). As such the authors were able to conclude that the outcomes assessments (timed-up-and-go and WOMAC) were predictive of outcome, and further modelling identified thresholds of the outcome assessment scores that related to better and worse outcomes.

How should we interpret these findings? The authors quite correctly suggest that models such as these could be of value in discharge planning and resource utilisation by targeting the patients that most need intervention and rehabilitation. The reported R² for the models, however, was 0.18. Bearing in mind that R², the coefficient of determination, measures the percentage of the variation in the dependent variable that is explained by variation in the independent variable,³ taking the compliment (100 – R²) we see that 82% of the variation in the outcome parameter assessed is unexplained by the model. The principal problem is that the variance in the population studied can strongly influence R² magnitude. Therefore, there is no guarantee that a high coefficient of determination is indicative of ‘goodness of fit’. Similarly there is no guarantee that a small R² indicates a weak relationship, given that the statistic is largely influenced by variation in the independent variable.⁴

Therefore, there is no rule for interpreting the strength of R² in its application to clinical relevance. Useful high values of R² can be obtained with clinical data sets,⁵ however, a low R²can still provide a useful clinical model with respect to data trends, but may be low in precision. In this study there is an association between the performance tests and length of stay; and, using the equations, we can indeed predict one from the other. The accuracy of this prediction though, needs to be borne in mind when using it as a clinical tool.

Furthermore, it is not rational to compare R² across different samples, which given clinical populations, are likely to differ significantly in the variance of the independent and dependent variables.⁶

In controlled environments, such as biomechanical tests on cadaveric bones, the variance across predictive measurements is likely to be low, and therefore R²values can be expected to lie in the 0.8 range.⁷ In clinical studies, however, R² values vary widely depending on the nature of the analysis. For example, when comparing radiographic parameters or associating surgical technical factors, values of R² are reported in the 0.2 to 0.4 range.^8,9 Whereas, comparing data between separate (but intrinsically similar) outcome assessment questionnaires can yield higher values in excess of 0.7.¹⁰

As such, further validation of the Poitras study² using new datasets and, ideally, confirmatory analysis of the findings using a much larger sample size, would be required before their regression model could be recommended for use clinically. This does not devalue the appropriateness – or indeed ‘worthiness’ – of reporting these findings in the literature, as the important clinical tools typically start as ideas in small datasets. As with all research papers, the reader requires a basic understanding of methodology to evaluate how relevant the results are to wider practice.

Correspondence should be sent to Professor A. H. R. W. Simpson; e-mail: e.vodden@boneandjoint.org.uk

1 Draper NR , SmithHApplied regression analysis. Wiley-Interscience, 1998. Google Scholar

2 Poitras S , WoodKS, SavardJ, DervinGF, BeaulePE. Predicting early clinical function after hip or knee arthroplasty. Bone Joint Res2015;4:145–151. Google Scholar

3 Schroeder LD , SjoquistDL, StephenPEUnderstanding regression analysis: an introductory guide. 1986, Sage Publications; Beverly Hills, California. Google Scholar

4 Filho DBF , SilvaJA, RochaE. What is R2 all about?Leviathan – Cadernos de Pesquisa Política2011;3:60–68. Google Scholar

5 Maempel JF , ClementND, BrenkelIJ, WalmsleyPJ. Validation of a prediction model that allows direct comparison of the Oxford Knee Score and American Knee Society clinical rating system. Bone Joint J2015;97-B:503–509.CrossrefPubMed Google Scholar

6 Kennedy P A guide to econometrics. 2008, Wiley-Blackwell; San Francisco, California:27. Google Scholar

7 Eckstein F , WundererC, BoehmH, et al.Reproducibility and side differences of mechanical tests for determining the structural strength of the proximal femur. J Bone Miner Res2004;19:379–385.CrossrefPubMed Google Scholar

8 Weber M , LechlerP, von KunowF, et al.The validity of a novel radiological method for measuring femoral stem version on anteroposterior radiographs of the hip after total hip arthroplasty. Bone Joint J2015;97-B:306–311.CrossrefPubMed Google Scholar

9 Kuwashima U , OkazakiK, TashiroY, et al.Correction of coronal alignment correlates with reconstruction of joint height in unicompartmental knee arthroplasty. Bone Joint Res2015;4:128–133.CrossrefPubMed Google Scholar

10 Parsons N , GriffinXL, AchtenJ, CostaML. Outcome assessment after hip fracture: is EQ-5D the answer?Bone Joint Res2014;19;3:69–75.CrossrefPubMed Google Scholar

Figure 1

Some description here

Information

Journal

Bone & Joint Research

Volume

4 No.9 | Pages 152 - 153

Section

Editorial

Published

01 September 2015

DOI

Authors

Expand all

D. F. Hamilton

Research Fellow, Department of Trauma and Orthopaedics

University of Edinburgh, FU413 Chancellor’s Building, 49 Little France Crescent, Edinburgh, EH16 4SB, UK.

Search for more articles by this author

M. Ghert

Associate Professor, Department of Surgery Deputy Editor,

The Bone and Joint Journal, 22 Buckingham Street, London, WC2N 6ET, UK.

Search for more articles by this author

A. H. R. W. Simpson

Editor-in-Chief

The Bone and Joint Journal, 22 Buckingham Street, London, WC2N 6ET, UK.

e.vodden@boneandjoint.org.uk

Search for more articles by this author

Keywords

Regression

Outcome

Studies

Submit Letter to the Editor

Share

Share article on social media

Facebook

X (Twitter)

Email

Figures

Metrics

Downloaded 4168 times

References

1 Find in article

Draper NR , SmithHApplied regression analysis. Wiley-Interscience, 1998.

Google Scholar

2 Find in article

Poitras S , WoodKS, SavardJ, DervinGF, BeaulePE. Predicting early clinical function after hip or knee arthroplasty. Bone Joint Res2015;4:145–151.

Google Scholar

3 Find in article

Schroeder LD , SjoquistDL, StephenPEUnderstanding regression analysis: an introductory guide. 1986, Sage Publications; Beverly Hills, California.

Google Scholar

4 Find in article

Filho DBF , SilvaJA, RochaE. What is R2 all about?Leviathan – Cadernos de Pesquisa Política2011;3:60–68.

Google Scholar

5 Find in article

Maempel JF , ClementND, BrenkelIJ, WalmsleyPJ. Validation of a prediction model that allows direct comparison of the Oxford Knee Score and American Knee Society clinical rating system. Bone Joint J2015;97-B:503–509.

Crossref PubMed Google Scholar

6 Find in article

Kennedy P A guide to econometrics. 2008, Wiley-Blackwell; San Francisco, California:27.

Google Scholar

7 Find in article

Eckstein F , WundererC, BoehmH, et al.Reproducibility and side differences of mechanical tests for determining the structural strength of the proximal femur. J Bone Miner Res2004;19:379–385.

Crossref PubMed Google Scholar

8 Find in article

Weber M , LechlerP, von KunowF, et al.The validity of a novel radiological method for measuring femoral stem version on anteroposterior radiographs of the hip after total hip arthroplasty. Bone Joint J2015;97-B:306–311.

Crossref PubMed Google Scholar

9 Find in article

Kuwashima U , OkazakiK, TashiroY, et al.Correction of coronal alignment correlates with reconstruction of joint height in unicompartmental knee arthroplasty. Bone Joint Res2015;4:128–133.

Crossref PubMed Google Scholar

10 Find in article

Parsons N , GriffinXL, AchtenJ, CostaML. Outcome assessment after hip fracture: is EQ-5D the answer?Bone Joint Res2014;19;3:69–75.

Crossref PubMed Google Scholar