Data Preparation
This study retrospectively collected data from adult patients diagnosed with Class II malocclusion who attended the Department of Orthodontics, School & Hospital of Stomatology, Wuhan University (WHUSS), between 2018 and 2024. The use of patients’ data and the waiver of informed consent were approved by the Ethics Committee of Wuhan University Stomatological Hospital (project number HGGC-258).
CBCT images were obtained for all patients pre-treatment (T0) and post-treatment (T1) using the same equipment, NewTom VGI evo (NewTom, Verona, Italy), with 512×512 mm field of view, 0.3 mm voxel size, 110 kV, and 15.3 mAs. The images were exported in DICOM format. Inclusion criteria for the study include: (1) patients aged 18 years or older with complete permanent dentition (excluding third molars); (2) a diagnosis of Class II malocclusion necessitating orthodontic treatment with four premolars (first premolars or second premolars) extracted during treatment; (3) complete treatment records and high-quality preoperative and postoperative CBCT images; (4) maxillary and mandibular dental arches crowding ranging from 0 to 8 mm; (5) receipt of treatment in both jaws; (6) restoration of a good occlusal relationship after treatment. Exclusion criteria include: (1) absence of supernumerary or impacted teeth (excluding third molars); (2) no severe bony malocclusion requiring orthognathic surgery; (3) a history of orthodontic treatment; (4) history of traumatic injuries; (5) presence of caries, periapical disease, or periodontal disease in the maxillary and mandibular anterior teeth.
Sample size calculations were performed using G*Power 3.1 (University of Düsseldorf, Düsseldorf, Germany), with a significance level of 5% (alpha), a beta of 0.2, and an effect size of 0.8, resulting in a minimum requirement of 21 patients per group; ultimately, 50 patients were recruited in this study. After conducting CBCT image screening and a detailed review of case records, a total of 25 adult patients treated with fixed appliances and another 25 adult patients treated with clear aligners for Class II extraction of four premolar teeth were selected for this study.
The dataset for the development of the tooth segmentation model originates from two sources: 147 CBCT images with tooth segmentation referenced from the study by Cui et al.[24], and 30 internal datasets. The CBCT data in the internal datasets were obtained from 30 patients undergoing orthodontic treatment, independent of this study. Tooth segmentation labels for these data were performed by four trained orthodontists using 3D Slicer software (3D Slicer 5.6.2)(Fig. 1). Upon preparation, the 147 external datasets were used as the training set, while the 30 internal datasets were randomly divided into validation and test sets in a 1:1 ratio. The methodology of this study is depicted in Fig. 2.
Clinical Information Acquisition
Four premolar teeth were extracted according to the Alveolus Surgery Department's treatment plan before initiating orthodontic treatment. The ABO-DI(American Board of Orthodontics Discrepancy Index)was applied to evaluate the treatment difficulty for the two groups of patients. In the FA group, fixed active self-ligating brackets (Empower metal brackets, American Orthodontics, Washington, USA) were bonded to all teeth, including the second molars. Follow-up appointments were scheduled every 6–8 weeks, during which the archwire was sequentially changed from 0.012 NT to 0.019×0.025 NT for standard alignment and leveling. The main archwire used was 0.019×0.025 SS, and traction hooks were placed on the lateral incisors and cuspids. A power chain (American Orthodontics, Washington, USA) was used to close the extraction gaps. The CA group used Invisalign aligners (Invisalign, Align Technology, California, USA). The aligner attachments’ size, shape, and placement were customized according to the orthodontists' treatment plan and the manufacturer's design. Each aligner was set to move the teeth by 0.25 mm, with patients instructed to wear the aligners for more than 22 hours per day and replace them every 7–10 days. Precision cuts and buttons were designed on the cuspids and first molars, and elastics (3/16 in, 4.5 oz, Ormco, California, USA) were used during the extraction gap closure stage. On average, 65 ± 15 aligners were used for both the maxillary and mandibular. At the end of the first phase of treatment, fine adjustments were made using an average of 18 ± 10 aligners. The cases in the FA group and the CA group were completed by two orthodontists, each with 15 years of extensive clinical experience.
Model Overview
The evaluation protocol for root resorption consists of three main processes: automatic teeth segmentation and modeling of entire dentition, three-dimensional model alignment of teeth, and automatic measurement of root length and volume.
We applied Hierarchical Morphology Guided Network (HMGNet) incorporating a self-attention mechanism [24, 28], which involves the following three steps: First, these CBCT images were randomly cropped to a size of 256*256*256 and standardized to the mean of zero and variance of one to serve as the input into V-net for extract the region of interest for the teeth (ROI). Next, based on the tooth labels, tooth centroids were generated, and a distance transformation algorithm was used to extract the tooth skeletons. The centroids and skeletons of the teeth were extracted, which helps to quickly identify the number of teeth and determine their spatial positions. Subsequently, the CBCT images, tooth labels, and tooth skeletons were randomly cropped to a size of 128*128*128 for training. We introduced a Swin Transformer block with a self-attention mechanism during the multi-task segmentation phase. This modular design can be integrated with existing networks, allowing for more effective utilization of limited medical imaging data and enhancing the generalization ability of the segmentation model. Furthermore, Swin Transformer employs a shifted window attention mechanism that divides the image into multiple small windows and performs attention operations within these windows. This effectively reduces computational complexity and enhances the segmentation efficiency of tooth edge detection. Guided by the tooth skeleton, the Canny edge detection algorithm was combined with the Swin transformer blocks to generate smooth tooth boundaries on 2D CBCT slices, this model generates tooth boundaries and masks, ultimately achieving instance segmentation of individual teeth. Figure 3 illustrates the architecture of our fully automated tooth segmentation model.
To evaluate the model segmentation performance, we used five commonly employed evaluation metrics to quantify the volumetric overlap between the 3D segmentation results and the corresponding ground truth segmentation results, namely accuracy, precision, recall, IOU index, and Dice score.
Figure 3 Network architecture. The CBCT scan results were input into the centroid network and the skeleton network to generate the offset and binary maps, respectively. Then, predicting the center points and skeletons of the teeth. Tooth instance segmentation was guided by the skeleton. The output is the tooth boundaries and tooth masks.
The 3D tooth models of FA and CA groups before and after orthodontic treatment were automatically segmented for the entire dentition, and the segmentation results were exported in STL format. Using 3-matic software (version 18.0, Materialise N.V., Leuven, Belgium), we manually took the T0 tooth model as the reference and three points were selected on the tooth crown to perform point-to-point registration with the T1 tooth model [27]. Additionally, global registration was applied, which is based on the Iterative Closest Point (ICP) algorithm, minimizing registration errors and achieving optimal alignment of models. We automatically segmented the crown and root for each tooth, the segmentation plane was determined by the average height of the regular crown for each tooth which reported in literature [29]. After segmentation, the volume of the root portion was measured automatically, and the maximum vertical height from the segmentation plane to the root was used as the root length. Figure 4 illustrates the tooth alignment, and the automated assessment of OIERR based on crown height.
Statistical Analysis.
Statistical evaluations were conducted using SPSS software (version 27.0, SPSS, Chicago, IL). The normality of the distribution of continuous variables was assessed using the Shapiro-Wilk test. The chi-square test was employed to compare gender distribution, while the t-test was utilized to compare the baseline characteristics of patients in each group, including initial age and duration of treatment, the Mann-Whitney U-test was applied for accessing DI index differences between two groups. The paired samples t-test was performed to show significant changes of root length and volume during treatment of each group. Additionally, the Mann-Whitney U-test was employed to compare root length and volume resorption rates between the FA and CA groups. A difference was considered statistically significant at P < 0.05.