Objectives

To compare four first premolar extraction and nonextraction treatment effects on intra-arch width, profile, treatment duration, occlusal outcomes, smile aesthetics and stability.

Materials and Methods

An electronic search of the literature to June 2, 2023 was conducted using health science databases, with additional search of gray literature, unpublished material, and hand searching, for studies reporting nonsurgical patients with fixed appliances regarding sixteen sub-outcomes. Data extraction used customized forms, quality assessed with ROBINS-I (Risk Of Bias In Non-randomized Studies—of Interventions) and Cochrane RoB 2 (risk-of-bias) tool. GRADE (Grading of Recommendations Assessment, Development and Evaluation) assessed certainty of evidence.

Results

Thirty (29 retrospective studies, 1 randomized controlled trial) studies were included. Random-effect meta-analysis (95% CI) demonstrated maxillary (MD: −2.03 mm; [−2.97, −1.09]; P < .0001) and mandibular inter-first molar width decrease (MD: −2.00 mm; [−2.71, −1.30]; P < .00001) with four first premolar extraction; mandibular intercanine width increase (MD: 0.68 mm; [0.36, 0.99]; P < .0001) and shorter treatment duration (MD: 0.36 years; [0.10, 0.62]; P = .007) in the nonextraction group. Narrative synthesis included three and five studies for upper and lower lip-E plane, respectively. For American Board of Orthodontics Objective Grading System and maxillary/mandibular anterior alignment (Little’s irregularity index), each included two studies with inconclusive evidence. There were no eligible studies for UK Peer Assessment Rating (PAR) score. Class I subgroup/sensitivity analyses favored the same results. Prediction interval indicated no significant difference for all outcomes.

Conclusions

Four first premolar extraction results in maxillary and mandibular inter-first molar width decrease and retraction of upper/lower lips. Nonextraction treatment results in mandibular intercanine width increase and shorter treatment duration. There was no significant difference between the two groups regarding maxillary intercanine width, US PAR score, and posttreatment smile esthetics. Further high-quality focused research is recommended.

The longest running debate in orthodontics, spanning more than a century, has been the effects of extraction and nonextraction treatment.1  The main concern with extraction treatment has been the possible deleterious effect on facial profile and the main concern with nonextraction treatment being post-treatment stability.2  Edward Angle’s philosophy of preserving the full complement of teeth argued that extraction of teeth would cause an imbalance in facial harmony and abnormal function due to the change in arch width and form.3  Unlike many of Angle’s disciples, Calvin Case opposed this philosophy and defended the extraction of teeth in treating malocclusion to avoid later relapse.4  However, it was not until the 1940s, when more members of the orthodontic community (including Charles H. Tweed and Raymond Begg) also supported an extraction treatment approach, that it became a generally accepted option.5,6 

Since then, the pendulum has swung between extraction and non-extraction treatment, reporting a peak extraction rate of 76% in 19687  declining to 17.6% in 20058  among University of North Carolina patients. At the University of São Paulo, nonextraction treatment continued with an upward trend from 14.29% (1973–1977) to 54.55% (2003–2007).9 

The orthodontic literature has discussed this conundrum, with conflicting results. Bowman and Johnston10  examined the effects on facial profile and concluded from a sample of 120 patients that extraction treatment had positive results for patients who had initial protrusion relative to the E plane, but it was detrimental for those who had retrusive lips before starting treatment. Boley et al.11  studied profiles of 50 patients and concluded that no difference was found between the two groups as facial profile measurements (Holdaway H-line) were within normal limits. Konstantonis12  attributed change in the soft tissue profile of extraction patients to greater incisor retraction, which could be controlled during treatment planning with less retraction mechanics and more mesial movement of posterior segments. These effects were more pronounced in patients with thin lips or high lip strain.

Little et al.13  concluded that extraction did not guarantee long term stability and Rossouw et al.14  reported no significant difference in stability between extraction and nonextraction groups, with similar amounts of relapse.

The literature has previously reported premolar extraction compared to nonextraction treatment focused on limited outcomes.12,15–23  A recently published scoping review24  outlined the weaknesses of published evidence across the breadth of the current literature but did not include any quantitative evaluation of the available data. This systematic review was, therefore, focused on four first premolar extraction, a broad range of outcomes and quantitative analysis, providing the orthodontist with the evidence required to inform clinical decisions.

The aim of this systematic review was to compare four first premolar extraction and nonextraction treatment effects on arch form, maxillary and mandibular intercanine width and first molar width, profile changes (upper and lower lip prominence to E plane), treatment duration, occlusal outcomes (end treatment UK and US weighted peer assessment rating [PAR] scores, American Board of Orthodontics Objective Grading System [ABO-OGS] score), posttreatment smile aesthetics (aesthetic score, maxillary intercanine width/smile width, visible dentition width/smile width, maxillary intercanine width/visible dentition width) and posttreatment changes of maxillary and mandibular anterior alignment (Little’s irregularity index) to provide orthodontists with the best data available.

This review was prepared in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 statement and the Cochrane Handbook for Systematic Reviews of Interventions (PROSPERO:CRD42021254523).

Eligibility criteria are in Table 1. Electronic databases were searched until June 2, 2023 without restrictions regarding publication year, study design, or language, with additional searching of gray literature, unpublished literature, and hand-searching of reference lists of included and excluded studies comparing premolar extraction to nonextraction treatment for outcomes of interest. Search strategies and publication date range of the search are in Table 2.

Table 1

Eligibility Criteria

Eligibility Criteria
Eligibility Criteria
Table 2.

Search Strategya

Search Strategya
Search Strategya

The articles resulting from the search were added to Zotero (version 6.0.26). Duplicates were identified and removed. Articles were manually checked during screening and further duplicates found and removed. Articles were first checked and excluded by title, with the resultant articles screened by their abstract and then full text articles checked for eligibility.

If there were any difficulties in getting the full text of an article, soft copies were obtained from the University of Dundee Library or The British Library. No contact was made with the authors.

Non-English studies without an English version were translated using Google Translate. Sample size and all reported data were checked and values were revised and recalculated whenever raw data were provided to ensure quality of included data.

Two reviewers undertook study selection and data extraction in duplicate using a customized data extraction form (Appendix 1 and 2) prepared by the third reviewer. When there was disagreement, discussion with the third reviewer reached the final decision.

Due to the ethical challenges in undertaking comparative prospective and randomized clinical trials in this subject area, retrospective studies were included. ROBINS-I tool was used to assess the quality of observational (prospective and retrospective) studies, with each outcome being individually judged. Cochrane RoB 2 tool was used for randomized trials.

Where appropriate, continuous data, with sample size, mean value, and standard deviation were available, RevMan (version 5.4.1) was used for quantitative synthesis and narrative synthesis reported when meta-analysis was not possible. Confidence interval (95%) with mean difference was used with significance level P < .05. I2 statistic for random effects model meta-analysis was calculated using Comprehensive Meta-Analysis software (version 3.0). Prediction interval (95%) with mean difference, used to describe the distribution of true effect sizes, was calculated using an Excel spreadsheet (Microsoft Corp., Redmond, Washington) based on formulas by Borenstein et al.25,26 

Outcome measures and time points of assessment are presented in Table 1. Studies reporting female and male subgroups were combined into a single group using RevMan. Random-effects model meta-analysis was used because of the amount of heterogeneity due to the difference in populations and study design. Heterogeneity was assessed by assessing overlap of the confidence intervals on Forest plots and I2 statistic with threshold for interpretation as described in the Cochrane Handbook.

Subgroup and sensitivity analyses were carried out to deal with possible sources of heterogeneity of including different malocclusion classes together and differences in outcome measures to isolate their influence.

Publication bias was addressed by including unpublished literature. When more than 10 studies pooled together for an outcome in the meta-analysis, publication bias was identified through a funnel plot. GRADE was used to assess certainty of evidence for each outcome.

Study Selection and Characteristics

The search of databases (including gray literature and hand-searching) identified 2652 articles. Removal of duplicates, exclusion by title (Appendix 3), and screening by abstracts (Appendix 4), resulted in a total of 383 articles (Appendix 5).

Thirty (29 retrospective studies27–55 , 1 randomized controlled trial [RCT]56 ) studies were included with some studies including multiple outcomes. Twenty-four27,29–33,35,36,38–44,46–51,54–56  were included in meta-analysis, of which one thesis47  and two non-English articles in Chinese54  and Korean42  were included (Table 3). Narrative synthesis included three studies31,45,53  for UL-E plane (upper lip), five studies31,34,37,45,53  for LL-E plane (lower lip), two studies28,52  for ABO-OGS, and two studies32,33  for maxillary and mandibular anterior alignment (Little’s irregularity index) (Table 4). Figure 1 shows the process of study identification and selection.

Figure 1.

PRISMA Flow Diagram. PRISMA indicates Preferred Reporting Items for Systematic Reviews and Meta-Analyses.

Figure 1.

PRISMA Flow Diagram. PRISMA indicates Preferred Reporting Items for Systematic Reviews and Meta-Analyses.

Close modal
Table 3.

Outcomes Included in Quantitative Synthesis (Meta-Analysis)a

Outcomes Included in Quantitative Synthesis (Meta-Analysis)a
Outcomes Included in Quantitative Synthesis (Meta-Analysis)a
Table 4.

Outcomes Included In Qualitative Synthesis (Narrative Synthesis)a

Outcomes Included In Qualitative Synthesis (Narrative Synthesis)a
Outcomes Included In Qualitative Synthesis (Narrative Synthesis)a

Risk of Bias Within Studies

The included RCT56  was judged as being of high risk of bias (Figure 2). For retrospective studies, 23 were of serious risk28–33,35–41,43,45–50,53–55  and six of moderate risk of bias27,34,42,44,51,52  (Table 5) (Appendix 6).

Figure 2.

Risk of bias- De Almeida et al. (RCT).

Figure 2.

Risk of bias- De Almeida et al. (RCT).

Close modal
Table 5.

ROBINS-I Tool

ROBINS-I Tool
ROBINS-I Tool

Synthesis of Results

Results of meta-analyses, prediction interval, subgroup, and sensitivity analyses are presented in Table 6 with values rounded to two decimal places except for P value.

Table 6.

Synthesis of Resultsa

Synthesis of Resultsa
Synthesis of Resultsa

Arch Width

Intercanine Width (Figure 3).

Figure 3.

Forest plot, maxillary and mandibular intercanine width.

Figure 3.

Forest plot, maxillary and mandibular intercanine width.

Close modal

Nine retrospective studies27,31,39,42,44,47,49,51,54  and one RCT56  reported no statistically significant difference between four first premolar extraction and nonextraction treatment in maxillary intercanine width (MD: 0.02 mm; total 95% CI [−0.38, 0.43]; I2 = 0%; P = .91) with significant increase in mandibular intercanine width (MD: 0.68 mm; 95% CI [0.36, 0.99]; I2 = 0%; P < .0001) in the nonextraction group.

Intermolar Width (Figure 4).

Figure 4.

Forest plot, maxillary and mandibular intermolar width.

Figure 4.

Forest plot, maxillary and mandibular intermolar width.

Close modal

Eight retrospective studies27,31,39,42,44,47,49,51  and one RCT56  reported significant decrease in maxillary (MD: −2.03 mm; total 95% CI [−2.97, −1.09]; I2 = 0%; P < .0001) and mandibular interfirst molar width (MD: −2.00 mm; total 95% CI [−2.71, −1.30]; I2 = 5.32%; P < .00001) with four first premolar extraction.

Profile

Three studies31,45,53  were included for UL- E plane and five studies31,34,37,45,53  for LL- E plane with vote counting indicating retraction of upper and lower lips with four first premolar extraction.

Choi et al.31  compared 15 four first premolar extraction (UL- E plane: −1.61 ± 1.62, LL- E plane: −3.13 ± 1.97) with 17 nonextraction (UL- E plane: −0.07 ± 0.89, LL- E plane: −0.15 ± 0.70) Class I and II female patients and found significant retraction of upper and lower lips in the extraction group.

In an equally divided sample of 20 Class I patients, Freitas et al.34  reported no significant difference between four first premolar extraction (LL- E plane: −0.2 ± 3.7) and nonextraction (LL- E plane: −0.05 ± 1.9) treatment.

Hassan et al.37  reported no significant difference between four first premolar extraction (LL- E plane: −2.15 ± 3.38) and nonextraction (LL- E plane: −0.83 ± 2.75) treatment in a sample of 60 Class I and II Pakistani females.

Konstantonis45  compared 30 four first premolar extraction (UL- E plane: −2.75 ± 1.5, LL- E plane: −3.34 ± 1.75) with 32 nonextraction (UL- E plane: −0.68 ± 1.89, LL- E plane: 0.67 ± 2.24) borderline Class I patients and found significant retraction of upper and lower lips in the extraction group.

Xu et al.53  compared 13 four first premolar extraction (UL- E plane: –1.0 ± 1.9, LL- E plane: –2.6 ± 1.9) with 12 nonextraction (UL- E plane: –0.9 ± 2.4, LL- E plane: –0.4 ± 3.4) borderline Chinese patients of different malocclusion and found significant retraction of lower lip in the extraction group with no difference regarding upper lip retraction.

Treatment Duration (Figure 5)

Figure 5.

Forest plot and sensitivity analysis, treatment duration.

Figure 5.

Forest plot and sensitivity analysis, treatment duration.

Close modal

Eight retrospective studies29,30,32,33,36,46,47,55  and one RCT56  reported shorter treatment duration in the nonextraction group (MD: 0.36 years; total 95% CI [0.10, 0.62]; I2 = 3.18%; P = .007) compared to the four first premolar extraction group.

Occlusal outcomes

PAR Score.

No eligible studies were found for UK-weighted PAR score. Three retrospective studies33,38,41  reported no statistically significant difference between four first premolar extraction and nonextraction treatment with US weighted PAR score. (MD: 0.33; total 95% CI [−0.21, 0.87]; I2 = 0%; P = .23) (Figure 6).

Figure 6.

Forest plot, US-weighted PAR score. PAR indicates peer assessment rating.

Figure 6.

Forest plot, US-weighted PAR score. PAR indicates peer assessment rating.

Close modal

ABO-OGS.

Two retrospective studies were included with inconclusive evidence. Anthopoulou et al.28  compared 25 four first premolar extraction (total score: 27.04 ± 6.30) with 30 nonextraction (total score: 29.07 ± 7.11) Class I borderline patients and found no statistically significant difference between the two groups.

In 40 Class I borderline patients, Vaidya et al.52  reported lower scores for four first premolar extraction group (total score: 22.0 ± 2.29), compared to nonextraction (total score: 26.8 ± 5.18).

Smile Aesthetics (Figure 7)

Figure 7.

Forest plot, smile aesthetics.

Figure 7.

Forest plot, smile aesthetics.

Close modal

Four retrospective studies40,43,48,50  for aesthetic score (MD: −0.09; total 95% CI [−0.24, 0.05]; I2 = 0%; P = .21) and four retrospective studies35,40,43,50  for maxillary intercanine width/smile width (MD: 0.01; total 95% CI [−0.00, 0.02]; I2 = 0%; P = .12), visible dentition width/smile width (MD: −0.00; total 95% CI [−0.01, 0.01]; I2 = 0%; P = .81) and maxillary intercanine width/visible dentition width (MD: 0.00; total 95% CI [−0.02, 0.02]; I2 = 0%; P = .94) reported no statistically significant difference between four first premolar extraction and nonextraction treatment.

Stability

Two retrospective studies were included with inconclusive evidence. Francisconi et al.32  compared 40 four first premolar extraction (maxillary Little index: 0.89 ± 1.48, mandibular Little index: 1.64 ± 1.75) with 44 nonextraction (maxillary Little index: 1.64 ± 1.37, mandibular Little index: 1.36 ± 1.33) patients of different malocclusions and found greater maxillary crowding relapse in the nonextraction group and no significant difference between the two treatment groups for mandibular crowding relapse.

Freitas et al.33  compared 97 four first premolar extraction (maxillary Little index: 1.30 ± 1.75, mandibular Little index: 1.93 ± 2.06) with 58 nonextraction (maxillary Little index: 1.66 ± 1.42, mandibular Little index: 1.40 ± 1.18) patients with no significant difference between the two groups regarding maxillary crowding relapse, and more mandibular crowding relapse in the extraction group.

Prediction Interval

Prediction interval, the range that in 95% of all populations the true effect size will fall within, was wider than 95% confidence intervals, but with no significant difference, for all outcomes.

Subgroup Analysis

Class I subgroup analyses of maxillary and mandibular intercanine width (Figure 3), maxillary and mandibular interfirst molar width (Figure 4) and treatment duration (Figure 5), favored the same results as the main analyses.

Sensitivity Analysis

For arch width, sensitivity analysis excluded three studies27,47,56  in which measurement of intercanine (Figure 8) and intermolar (Figure 9) width was from the most labial aspect of buccal surfaces of teeth instead of canine tips and mesiobuccal cusp tips. Sensitivity analysis excluded two studies30,32  for treatment duration reporting the use of extraoral appliances related to patient compliance (Figure 5), and two studies35,48  for smile aesthetics (Figure 10), where it was not clear whether included data were end of treatment or posttreatment.

Figure 8.

Sensitivity analysis, maxillary and mandibular intercanine width.

Figure 8.

Sensitivity analysis, maxillary and mandibular intercanine width.

Close modal
Figure 9.

Sensitivity analysis, maxillary and mandibular intermolar width.

Figure 9.

Sensitivity analysis, maxillary and mandibular intermolar width.

Close modal
Figure 10.

Sensitivity analysis, smile aesthetics.

Figure 10.

Sensitivity analysis, smile aesthetics.

Close modal

Risk of Bias Across Studies

No funnel plot generated, as no more than 10 studies were included in meta-analyses.

Quality of Evidence

GRADE evidence profile was completed for all outcomes (Table 7). No separate grading was undertaken for subgroup/sensitivity analyses.

Table 7.

GRADE

GRADE
GRADE

This review only included studies with four first premolar extraction compared to nonextraction treatment-to-control heterogeneity, with the inclusion of gray literature, non-English studies and published theses to reduce reporting bias. However, eligible studies were of high level of bias except six studies of moderate risk of bias.

Eligibility criteria were set to reduce confounding variables related to effects due to different treatment approaches. For all outcomes, studies included were limited to nonsurgical patients with fixed appliances with no adjunctive procedures. Studies reporting the use of functional appliances or extraoral appliances were excluded for profile.

Confounding variables were further controlled by exclusion of studies with incomplete data reporting to avoid imputations and studies with more than one error in sample size or treatment data, for greater consistency. However, this may have resulted in smaller sample sizes with a potential source of bias,57  as not all studies provided raw data for recalculation. No study included in quantitative synthesis reported any error in outcomes of interest, and excluded studies are included in supplementary material with reason for their exclusions if needed for future analysis.

Ideally, age of subjects included would have been limited to 13 years of age or older for arch width58  and 15 years of age or younger for profile changes59  to exclude growth effects. Significant increase in maxillary and mandibular intercanine and intermolar width occurs due to growth between 3 and 13 years of age, and there is significant upper and lower lip retraction in relation to the E plane between 15 and 25 years of age. However, it was not possible to include an age threshold criterion based on these limits as raw age data were not provided and only mean age reported, a confounding variable of broad age range, increasing indirectness of the results.

There is conflicting evidence in the literature regarding arch width changes, with meta-analysis showing significant increase in mandibular intercanine width in the nonextraction group and no difference regarding maxillary intercanine width. A possible explanation might be related to greater variability in maxillary arch form, whereas mandibular arch form is more influenced by the soft tissue environment, meaning that the need to generate space for alignment in nonextraction treatment leads to mandibular arch width changes while the mandibular arch form is maintained in extraction treatment. The movement of posterior teeth mesially into narrower areas of the dental arch is the cause for maxillary and mandibular intermolar width decrease in the extraction group.44 

The reported results of profile changes, ABO-OGS and stability should be carefully interpreted because of the imprecision of the results due to small sample sizes. Where two studies were included, no meta-analysis was undertaken, as with two studies and in the presence of heterogeneity, confidence intervals based on normal quantiles are not recommended.60 

For profile changes, the current narrative synthesis indicated retraction of upper and lower lips relative to the E plane with four first premolar extraction, matching the findings of the meta-analysis by Konstantonis et al.12  However, as ethnic differences were reported, indicating potential additional confounding factors for this outcome, no meta-analysis was undertaken.

Four first premolar extraction took an average of 0.36 years longer to complete in comparison to nonextraction treatment. This might be reasonable to assume as it allows for the time to complete space closure in extraction treatment, although it could also be related to more complex cases being treated with extractions. This difference is one of clinical significance for clinicians and patients. The evidence for this finding was graded as low certainty compared with most other outcomes, which were very low certainty.

Conflicting results on occlusal outcomes were reported in this review. No eligible studies were found for UK-weighted PAR score and no significant difference was reported with US-weighted PAR score. However, it should be noted that end treatment data were pooled rather than percentage improvement for ease of comparison with ABO-OGS.

The results of this review on smile aesthetics indicated no difference between the two treatment approaches in four different smile parameters. Furthermore, Işiksal et al.40  stated that inadequate torque expression can affect smile aesthetics regardless of treatment modality.

There was no clear consensus whether four first premolar extraction or nonextraction treatment would provide greater posttreatment stability, as only two studies were included with conflicting results.

In summary, this review found low certainty evidence for a clinically significant difference in treatment duration. However, there are debatable clinical implications of differences found in arch width changes and no differences in occlusal outcomes and smile parameters. The decision whether to extract or not is very situational. Ruellas et al.61  stated that clinicians should be aware of factors such as compliance, tooth-arch discrepancy, cephalometric discrepancy, facial profile, growth, anteroposterior relationships, dental asymmetry, and pathology in decision making.

In the context of existing data, more robust evidence for changes in outcome between extraction and nonextraction treatment approaches is still needed. However, this is an almost impossible aim, as one of the reasons for the lack of RCTs on this topic is ethical, with patient recruitment dilemmas of randomizing these treatments. Alternatively, high-quality observational studies may be most appropriate and a suggested protocol has been made available recently.24 

Limitations

  • Studies included in quantitative synthesis were retrospective in nature except one prospective randomized trial with high level of bias. The limitation due to the decision to include observational studies has been discussed in a Cochrane review62  with little evidence for significant effect estimate differences between observational studies and RCTs. However, it is important to consider the level of heterogeneity in meta-analyses of RCTs or observational studies with control for confounding in observational studies.

  • There is a possible source of bias due to exclusion of studies with incomplete dataset reporting, but exclusion provides greater confidence in results rather than imputation, with the same for studies with more than one error in sample size or treatment data.

  • As malocclusion classes were not studied individually, this baseline characteristic caused an increase in heterogeneity when studies were included for the meta-analysis. Clinical heterogeneity was controlled by limiting the intervention group to four first premolar extraction only and subgroup analyses were carried out where possible.

  • As different ages were pooled on the same estimative, without a subgroup analysis, this baseline characteristic caused an increase in the degree of indirectness of the results.

  • The subjective nature of the aesthetic score, especially since this was assessed by various raters in the different studies, the authors considered this as a possible source of heterogeneity due to the associated observational bias in the meta analysis.17 

  • Retention regimen might be a confounding factor related to stability while poor oral hygiene, poor patient compliance, and experience of the operator are factors related to treatment duration.63 

  • There is an imprecision up to ±0.1 in recalculated data of included studies, which may affect the results.

  • Non-English studies were translated using Google Translate, which might present a possible source of inaccuracy.

  • GRADE does not allow inclusion of multiple study designs per outcome, a recognized limitation of GRADEpro. For outcomes reporting RCT and observational studies, changing study design in certainty assessment did not affect overall certainty.

  • Low level evidence indicates clinically significant, shorter treatment duration in the nonextraction group compared to four first premolar extraction.

  • Low level evidence indicates mandibular intercanine width increase with nonextraction treatment and mandibular interfirst molar width decrease in the four first premolar extraction group.

  • Very low evidence indicates no significant difference in maxillary intercanine width between the extraction and nonextraction groups and decrease of maxillary interfirst molar width with four first premolar extraction.

  • Very low level evidence indicates retraction of upper and lower lips-E plane in the four first premolar extraction group.

  • Very low level evidence indicates no significant difference regarding US PAR score and posttreatment smile aesthetics.

We would like to gratefully acknowledge Dr. Gavin Revie, Research Methodologist and Research Integrity Lead, School of Dentistry, University of Dundee, for his assistance with the meta-analysis.

This paper was based on a thesis submitted in partial fulfillment of the requirements for the degree of Master of Science (Orthodontics) at University of Dundee.

No external funding was received to conduct this research. No conflicts of interest were declared.

Appendices 1–6 are available online.

1. 
Pollock
HC.
The extraction debate of 1911 by Case, Dewey, and Cryer
.
Am J Orthod
.
1964
;
50
(9)
:
656
658
.
2. 
The extraction debate of 1911 by Case, Dewey, and Cryer: discussion of Case: the question of extraction in orthodontia
.
Am J Orthod
.
1964
;
50
(11)
:
843
851
.
3. 
Angle
EH.
Treatment of Malocclusion of the Teeth
.
Angle’s System
. 7th ed.,
S.S. White dental manufacturing Co.
,
Philadelphia
,
1907
.
4. 
Case
CS.
The question of extraction in orthodontia
.
Am J Orthod
.
1964
;
50
(9)
:
660
691
.
5. 
Begg
PR.
Stone age man’s dentition: with reference to anatomically correct occlusion, the etiology of malocclusion, and a technique for its treatment
.
Am J Orthod
.
1954
;
40
(7)
:
517
531
.
6. 
Tweed
CH.
Indications for the extraction of teeth in orthodontic procedure
.
Am J Orthod Oral Surg
.
1944
;
42
:
22
45
.
7. 
Proffit
WR.
Forty-year review of extraction frequencies at a university orthodontic clinic
.
Angle Orthod
.
1994
;
64
(6)
:
407
414
.
8. 
Jackson
TH,
Guez
C,
Lin
FC,
Proffit
WR,
Ko
CC.
Extraction frequencies at a university orthodontic clinic in the 21st century: demographic and diagnostic factors affecting the likelihood of extraction
.
Am J Orthod Dentofacial Orthop
.
2017
;
151
(3)
:
456
462
.
9. 
Janson
G,
Maria
FRT,
Bombonatti
R.
Frequency evaluation of different extraction protocols in orthodontic treatment during 35 years
.
Prog Orthod
.
2014
;
15
(1)
:
51
.
10. 
Bowman
SJ,
Johnston
LE.
The esthetic impact of extraction and nonextraction treatments on Caucasian patients
.
Angle Orthod
.
2000
;
70
(1)
:
3
10
.
11. 
Boley
JC,
Pontier
JP,
Smith
S,
Fulbright
M.
Facial changes in extraction and nonextraction patients
.
Angle Orthod
.
1998
;
68
(6)
:
539
546
.
12. 
Konstantonis
D,
Vasileiou
D,
Papageorgiou
SN,
Eliades
T.
Soft tissue changes following extraction vs. nonextraction orthodontic fixed appliance treatment: a systematic review and meta-analysis
.
Eur J Oral Sci
.
2018
;
126
(3)
:
167
179
.
13. 
Little
RM,
Wallen
TR,
Riedel
RA.
Stability and relapse of mandibular anterior alignment-first premolar extraction cases treated by traditional edgewise orthodontics
.
Am J Orthod
.
1981
;
80
(4)
:
349
365
.
14. 
Rossouw
PE,
Preston
CB,
Lombard
C.
A longitudinal evaluation of extraction versus nonextraction treatment with special reference to the posttreatment irregularity of the lower incisors
.
Semin Orthod
.
1999
;
5
(3)
:
160
170
.
15. 
Almurtadha
RH,
Alhammadi
MS,
Fayed
MMS,
Abou-El-Ezz
A,
Halboub
E.
Changes in soft tissue profile after orthodontic treatment with and without extraction: a systematic review and meta-analysis
.
J Evid-Based Dent Pract
.
2018
;
18
(3)
:
193
202
.
16. 
Burke
SP,
Silveira
AM,
Goldsmith
LJ,
Yancey
JM,
Van Stewart
A,
Scarfe
WC.
A meta-analysis of mandibular intercanine width in treatment and postretention
.
Angle Orthod
.
1998
;
68
(1)
:
53
60
.
17. 
Cheng
HC,
Wang
YC,
Tam
KW,
Yen
MF.
Effects of tooth extraction on smile esthetics and the buccal corridor: a meta-analysis
.
J Dent Sci
.
2016
;
11
(4)
:
387
393
.
18. 
Dai
M Lu,
Xiao
M,
Yu
Z,
Liu
D Xu.
[Effect of extraction and non-extraction treatment on frontal smiling esthetics: a meta-analysis]
.
Shanghai Kou Qiang Yi Xue Shanghai J Stomatol
.
2015
;
24
(4)
:
499
504
.
19. 
Mavreas
D,
Athanasiou
AE.
Factors affecting the duration of orthodontic treatment: a systematic review
.
Eur J Orthod
.
2008
;
30
(4)
:
386
395
.
20. 
Moon
S,
Mohamed
AMA,
He
Y,
Dong
W,
Yaosen
C,
Yang
Y.
Extraction vs. nonextraction on soft-tissue profile change in patients with malocclusion: a systematic review and meta-analysis
.
Biomed Res Int
.
2021
;
2021
:
7751516
.
21. 
Papageorgiou
SN,
Höchli
D,
Eliades
T.
Outcomes of comprehensive fixed appliance orthodontic treatment: a systematic review with meta-analysis and methodological overview
.
Korean J Orthod
.
2017
;
47
(6)
:
401
413
.
22. 
Swidi
AJ,
Griffin
AE,
Buschang
PH.
Mandibular alignment changes after full-fixed orthodontic treatment: a systematic review and meta-analysis
.
Eur J Orthod
.
2019
;
41
(6)
:
609
621
.
23. 
Ze-Ping
L,
Ting-Ting
C,
Gui-Feng
L,
Si-Cong
J,
Shu-Juan
Z.
Meta analysis of soft tissue profile repair in patients treated with and without premolar extraction
.
Chin J Tissue Eng Res
.
2010
;
14
(2)
:
237
.
24. 
Benson
PE,
Alshawy
E,
Fenton
GD,
et al
Extraction vs nonextraction of premolars for orthodontic treatment: a scoping review examining the extent, range, and characteristics of the literature
.
Am J Orthod Dentofacial Orthop
.
2023
;
164
:
368
376
.
25. 
Borenstein
M,
Higgins
JPT,
Hedges
LV,
Rothstein
HR.
Basics of meta-analysis: I2 is not an absolute measure of heterogeneity
.
Res Synth Methods
.
2017
;
8
(1)
:
5
18
.
26. 
Borenstein
M,
Hedges
LV,
Higgins
JPT,
Rothstein
HR.
Introduction to Meta-Analysis
.
Chichester U.K.: John Wiley & Sons
;
2009
.
27. 
Aksu
M,
Kocadereli
I.
Arch width changes in extraction and nonextraction treatment in class I patients
.
Angle Orthod
.
2005
;
75
(6)
:
948
952
.
28. 
Anthopoulou
C,
Konstantonis
D,
Makou
M.
Treatment outcomes after extraction and nonextraction treatment evaluated with the American Board of Orthodontics objective grading system
.
Am J Orthod Dentofacial Orthop
.
2014
;
146
(6)
:
717
723
.
29. 
Beit
P,
Konstantonis
D,
Papagiannis
A,
Eliades
T.
Vertical skeletal changes after extraction and non-extraction treatment in matched class I patients identified by a discriminant analysis: cephalometric appraisal and Procrustes superimposition
.
Prog Orthod
.
2017
;
18
(1)
:
44
.
30. 
Bishara
SE,
Cummins
DM,
Jorgensen
GJ,
Jakobsen
JR.
A computer assisted photogrammetric analysis of soft tissue changes after orthodontic treatment. Part I: Methodology and reliability
.
Am J Orthod Dentofacial Orthop
.
1995
;
107
(6)
:
633
639
.
31. 
Choi
TH,
Kim
SH,
Kim
C,
Kook
YA,
Larson
BE,
Lee
NK.
Changes in maximum lip-closing force after extraction and nonextraction orthodontic treatments
.
Korean J Orthod
.
2020
;
50
(2)
:
120
128
.
32. 
Francisconi
MF,
Janson
G,
Freitas
KM,
et al
Overjet, overbite, and anterior crowding relapses in extraction and nonextraction patients, and their correlations
.
Am J Orthod Dentofacial Orthop
.
2014
;
146
(1)
:
67
72
.
33. 
Freitas
KM,
Janson
G,
Tompson
B,
et al
Posttreatment and physiologic occlusal changes comparison
.
Angle Orthod
.
2013
;
83
(2)
:
239
245
.
34. 
Freitas
BV,
Rodrigues
VP,
Rodrigues
MF,
de Melo
HVF,
Dos Santos
PCF.
Soft tissue facial profile changes after orthodontic treatment with or without tooth extractions in Class I malocclusion patients: a comparative study
.
J Oral Biol Craniofacial Res
.
2019
;
9
(2)
:
172
176
.
35. 
Ghaffar
F,
Fida
M.
Effect of extraction of first four premolars on smile esthetics
.
Eur J Orthod
.
2011
;
33
(6)
:
679
683
.
36. 
Gorucu-Coskuner
H,
Atik,
E,
Kocadereli
I.
Effects of three different orthodontic treatment methods on the stability of mandibular incisor alignment
.
J Clin Pediatr Dent
.
2017
;
41
(6)
:
486
493
.
37. 
Hassan
S,
Shaikh
A,
Fida
M.
Esthetic impact of tooth extraction in Pakistani patients
.
J Ayub Med Coll Abbottabad
2014
;
26
(3)
:
263
268
.
38. 
Holman
JK,
Hans
MG,
Nelson
S,
Powers
MP.
An assessment of extraction versus nonextraction orthodontic treatment using the peer assessment rating (PAR) index
.
Angle Orthod
.
1998
;
68
(6)
:
527
534
.
39. 
Işik
F,
Sayinsu
K,
Nalbantgil
D,
Arun
T.
A comparative study of dental arch widths: extraction and non-extraction treatment
.
Eur J Orthod
.
2005
;
27
(6)
:
585
589
.
40. 
Işiksal
E,
Hazar
S,
Akyalçin
S.
Smile esthetics: perception and comparison of treated and untreated smiles
.
Am J Orthod Dentofacial Orthop
.
2006
;
129
(1)
:
8
16
.
41. 
Janson
G,
Simao
TM,
Barros
SE,
Tompson
BD,
Freitas
K.
A 2-center comparison of maxillary incisor positioning with non-extraction, 2-maxillary premolar and 4-premolar extractions for Class II treatment
.
Orthod WAVES
.
2015
;
74
(4)
:
105
111
.
42. 
Jeon
JY,
Kim
SJ,
Kang
SG,
Park
YG.
Change in arch width in extraction vs nonextraction treatment
.
Korean J Orthod
.
2007
;
37
(1)
:
65
72
.
43. 
Johnson
DK,
Smith
RJ.
Smile esthetics after orthodontic treatment with and without extraction of four first premolars
.
Am J Orthod Dentofacial Orthop
.
1995
;
108
(2)
:
162
167
.
44. 
Kim
E,
Gianelly
AA.
Extraction vs nonextraction: Arch widths and smile esthetics
.
Angle Orthod
.
2003
;
73
(4)
:
354
358
.
45. 
Konstantonis
D.
The impact of extraction vs nonextraction treatment on soft tissue changes in Class I borderline malocclusions
.
Angle Orthod
.
2012
;
82
(2)
:
209
217
.
46. 
Kouli
A,
Papagiannis
A,
Konstantoni
N,
Halazonetis
DJ,
Konstantonis
D.
A geometric morphometric evaluation of hard and soft tissue profile changes in borderline extraction versus non-extraction patients
.
Eur J Orthod
.
2019
;
41
(3)
:
264
272
.
47. 
MacKriel
EA.
Changes in Arch Dimensions After Extraction and Non-Extraction Orthodontic Treatment
[PhD thesis]
.
Bellville, South Africa
:
University of the Western Cape
;
2008
.
48. 
Naik
P,
Manohar
MR,
Shivaprakash
G,
Jabeen
N.
Smile esthetics: evaluation of differential perception among laypersons, dental professionals and orthodontics
.
IOSR J Dent Med Sci
.
2014
;
13
(1)
:
35
43
.
49. 
Oz
AA,
Oz
AZ,
Yazicioğlu
S,
Arici
N,
Ozer
M,
Arici
S.
Comparison of arch width changes following orthodontic treatment with and without extraction using three-dimensional models
.
Niger J Clin Pract
.
2017
;
20
(5)
:
581
586
.
50. 
Prasad
V,
Tandon
P,
Singh
GK,
Nagar
A,
Maurya
RP.
Comparison of smile esthetics after extraction and non-extraction orthodontic treatment
.
IJODR
.
2018
;
4
(4)
:
182
189
.
51. 
Sumit
G,
Ashima
V.
Arch width changes in canine and molar regions after extraction and non-extraction treatment
.
J Pierre Fauchard Acad India Sect
.
2010
;
24
(2)
:
45
53
.
52. 
Vaidya
M,
Jyothikiran
H,
Raghunath
N,
Pai
P.
Extraction versus non-extraction: a retrospective study
.
Int J Orthod Rehabil
.
2018
;
9
(1)
:
23
.
53. 
Xu
TM,
Liu
Y,
Yang
MZ,
Huang
W.
Comparison of extraction versus nonextraction orthodontic treatment outcomes for borderline Chinese patients
.
Am J Orthod Dentofacial Orthop
.
2006
;
129
(5)
:
672
677
.
54. 
Dong
ZY,
Liu
DX,
Wang
TJ,
Liu
L.
[Changes in arch width after extraction and nonextraction treatment]
.
Shanghai Kou Qiang Yi Xue
.
2007
;
16
(4)
:
355
357
[Article in Chinese]
.
55. 
Mahmood
DAD.
Changes in third molar angulations after first premolar extractions during fixed orthodontic treatment
.
Mustansiria Dent J
.
2009
;
6
(4)
:
356
360
.
56. 
De Almeida
MR,
Almeida-Pedrin
RRD.
A prospective clinical trial of arch width changes evaluated through dental casts in consecutively treated non-extraction and extraction cases
.
Dentistry
.
2011
;
01
(02)
.
57. 
Kaji
AH,
Schriger
D,
Green
S.
Looking through the retrospectoscope: reducing bias in emergency medicine chart review studies
.
Ann Emerg Med
.
2014
;
64
(3)
:
292
298
.
58. 
Bishara
SE,
Jakobsen
JR,
Treder
J,
Nowak
A.
Arch width changes from 6 weeks to 45 years of age
.
Am J Orthod Dentofacial Orthop
.
1997
;
111
(4)
:
401
409
.
59. 
Bishara
SE,
Jakobsen
JR,
Hession
TJ,
Treder
JE.
Soft tissue profile changes from 5 to 45 years of age
.
Am J Orthod Dentofacial Orthop
.
1998
;
114
(6)
:
698
706
.
60. 
Friede
T,
Röver
C,
Wandel
S,
Neuenschwander
B.
Meta-analysis of two studies in the presence of heterogeneity with applications in rare diseases
.
Biom J Biom Z
.
2017
;
59
(4)
:
658
671
.
61. 
Ruellas
AC de O,
Ruellas
RM de O,
Romano
FL,
Pithon
MM,
Santos
RL dos.
Tooth extraction in orthodontics: an evaluation of diagnostic elements
.
Dent Press J Orthod
.
2010
;
15
:
134
157
.
62. 
Anglemyer
A,
Horvath
HT,
Bero
L.
Healthcare outcomes assessed with observational study designs compared with those assessed in randomized trials
.
Cochrane Database Syst Rev
.
2014
;
2014
(4)
:
MR000034
.
63. 
Skidmore
KJ,
Brook
KJ,
Thomson
WM,
Harding
WJ.
Factors influencing treatment time in orthodontic patients
.
Am J Orthod Dentofacial Orthop
.
2006
;
129
(2)
:
230
238
.

Author notes

a

 Private Practice, Cairo, Egypt; and former postgraduate student, Orthodontics, School of Dentistry, University of Dundee, Dundee, United Kingdom.

b

 Clinical Lecturer and Honorary Consultant in Orthodontics, School of Dentistry, University of Dundee, Dundee, United Kingdom.

c

 Honorary Professor of Orthodontics, School of Dentistry, University of Dundee, Dundee; and Consultant in Orthodontics, NHS Fife, United Kingdom.

Supplementary data