Abstract
Objective. To conduct a meta-analysis of randomized controlled trials (RCTs) to compare knee arthroplasty with patient-specific instrumentation (PSI) with the conventional instrumentation (CI). Methods. RCTs were selected in PubMed and Embase from 2012 to 2018. Key data extracted included malalignment of mechanical axis, blood loss, surgical time, Oxford Knee Score (OKS), Knee Society Score (KSS), length of stay, and complications. Subgroup analysis was also performed regarding different PSI systems and different image processing methods. Results. 29 RCTs with 2487 knees were eligible for the meta-analysis. Results showed that PSI did not improve the alignment of the mechanical axis compared with CI, but MRI-based PSI and Visionaire-specific PSI decrease the risk of malalignment significantly ( and , respectively). PSI reduced operative time () and blood loss () and improve the KSS () compared with CI, but for CT-based PSI, the difference of operative time becomes insignificant. PSI showed no significant difference with CI regarding risk of complication, length of stay in hospital, and functional outcomes of OKS. Conclusion. PSI reduced the blood loss and improved KSS. MRI-based PSI reduced operative time and risk of malalignment of mechanical axis compared with CT-based PSI. Moreover, Visionaire-specific PSI achieves better alignment result of the mechanical axis than other systems.
1. Introduction
Total knee arthroplasty (TKA) is the most important treatment for end-stage osteoarthritis of the knee and has been performed increasingly common in orthopedics surgery. Goals of this procedure include pain relief and alignment correction [1]. Promising outcomes has been reported by this procedure. Nevertheless, mechanical axis malalignment remained a problem, which may result in aseptic loosing, instability, and unexplained pain postoperatively [2, 3]. To achieve better anatomical and functional outcomes, patient-specific instrumentation (PSI) has been introduced.
Compared with conventional instrumentation (CI), PSI uses customized cutting blocks instead of standard blocks. The customized blocks were generated from a preoperative three-dimensional model, which was reconstructed from computed tomography (CT) or magnetic resonance imaging (MRI) [4]. Reported advantages of PSI included improved alignment, higher surgical efficacy, and reduced complication risk [5]. However, the superiority of PSI over CI remained inconclusive.
Previous meta-analyses demonstrated no significant difference between PSI and CI in short-term follow-up results, regarding postoperative functional outcomes [6, 7] and radiographic alignment [8, 9]. Mannan et al. [6] conducted a meta-analysis including five RCTs and three prospective-comparative studies that revealed no significant difference between PSI and conventional instrumentation for postoperative Knee Society Score (KSS), ROM, or Oxford Knee Score (OKS). Thienpont et al. [8] compared the mechanical alignment between PSI and CI, concluding that PSI might improve the accuracy of femoral component alignment and global mechanical alignment, but increased the risk of outliers for the tibial component alignment. However, most meta-analyses included nonrandomized studies and with small size of patients. Besides, they analyzed only a few variables (outliers from radiology only or functional outcomes only, etc.), which is not robust enough to determine the superiority of one over another. Furthermore, more randomized controlled trials (RCTs) have been published [10–15] recently and need to be integrated to update our knowledge (comprehensive comparison between our work and previous reviews can be found in S2 Supplemental file).
We therefore conducted a meta-analysis, including all level one studies that are comparing the PSI and CI methods to treat osteoarthritis, to answer three questions: (1) Intraoperatively, do patient-specific instrumentations reduce the risk of blood loss and shorten the surgical time compared with conventional instrumentation? (2) Postoperatively, does the use of PSI yield improved radiological and functional results while reducing the risk of complication? (3) Integrating the above data with patients’ length of stay in the hospital, does PSI provide financial benefit and achieve better cost-effectiveness results? Finally, we hypothesized that PSI did not improve the alignment of the mechanical axis or functional outcomes compared with CI.
2. Materials and Methods
2.1. Search Strategy
We employed a validated search strategy by using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist. Two databases, PubMed and Embase, from 2012 to 2018 were searched.
The following terms including mesh terms and free terms for PubMed were used in the search strategy: total knee arthroplasty, patient-matched, custom-made guides, patient-specific, and custom-fit. Reference lists from relevant articles were retrieved to identify additional studies. We also complemented our reference by searching gray literature and unpublished literature database.
2.2. Study Selection and Eligible Criterion
2 reviewers independently screened all search terms, abstract, and full text of studies potentially eligible for including. A third senior reviewer was consulted when disagreements occurred.
For the types of participants, studies involving males or females of all ages were included.
The types of interventions and controlled groups were as follows: (1) the comparison between patient-specific instrumentation (PSI) and conventional instrumentation (CI) in the aspect of clinical, functional, or radiographic outcomes; (2) patients who underwent primary total knee replacement; and (3) studies that were published in English. Studies were excluded if they reported the patients with fracture, deformity, or tumor, and if they were animal or cadaveric studies.
For the types of studies, 29 studies included were all randomized clinical trials. No language, publication date, or publication status restrictions were imposed.
The types of outcome measures were operative time, blood loss, malalignment of mechanical axis, and Knee Society Score (KSS).
2.3. Data Extraction
We used a standard form to extract data, and after extracted by one investigator, the data would be verified by the other investigator. The data extracted was strictly confined in the same follow-up time. Data extraction included outliers of the mechanical axis, which was defined as a measurement of >3° on standing full-extremity radiograph in full extension [16], total operative time, blood loss (measured by volume loss or reduction of hemoglobin), and postoperative functional score, including Knee Society Score (KSS) and Oxford Knee Score (OKS). Subgroup analysis was conducted in the KSS-knee and KSS-function, the MRI-based and CT-based operative time and malalignment, and specific system of PSI. Besides, length of stay in hospital and complication rate were also extracted.
2.4. Quality Assessment
The methodological quality of included studies was assessed by 2 independent investigators. The Cochrane Handbook for Systematic Reviews of Interventions was used for assessing all randomized studies. Six items were examined to value the risk of bias: (1) random sequence generation, (2) allocation concealment, (3) blinding of participant and personal, (4) blinding of outcome assessment, (5) incomplete outcome data, and (6) selective reporting. Each item is classified as low risk of bias, high risk of bias, or unclear risk of bias.
2.5. Data Analysis
We used value and to evaluate the statistic heterogeneity, and , indicating high heterogeneity. The fixed effects model was conducted to the homogeneous data when , while the random effects model was performed to the data with high heterogeneity. Revman 5.3 (Version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration) was used for the data analysis. For dichotomous variables, risk ratio and 95% confidence intervals were worked by the Mantel-Haenszel method. Forest plot was used to present our statistic from PSI and CI groups. The Funnel plot was conducted to assess publication bias when variables were extracted from more than 10 studies. For studies that did not report the standard deviation (SD), we calculated it from value, 95% confidence intervals, interquartile range, or standard errors. If this information was not mentioned, we will contact the corresponding author for the missing part.
3. Results
3.1. Search Findings
The results of the search strategy are presented in PRISMA flowchart (Figure 1). A total of 433 studies were identified from the initial search. 29 studies, which are all randomized controlled trials (RCTs), met the final inclusion criteria for the meta-analysis. Study baseline characteristics are summarized in Table 1. Twenty-nine randomized studies representing 2487 TKAs (PSI: 1243, CI: 1244) were included.

3.2. Study Characteristics
All these included studies had level I evidence and were published between 2012 and 2018 (Table 1). The average follow-up time is 10.9 months. The average age of included patients is 68.3 years. For the PSI group, 21 studies acquired 3D models of the patients’ anatomy from magnetic resonance imaging (MRI) while 8 studies chose computed tomography (CT) to acquire images. Signature is the most commonly used PSI system (8/29), followed by Visionaire, Zimmer PSI, and TruMatch (7/29, 5/29, 4/29, respectively). Myknee, Miscellaneous, Materialise NV, and Imprint were also used in the included studies. Most studies are based in Europe (20/29), followed by the United States (4/29), Asia (4/29), and Australia (1/29).
3.3. Quality Assessment and Risk of Bias
All included RCTs were evaluated by the Cochrane Handbook for Systematic Reviews of Interventions. More than 50% of the included studies have a risk of performance bias. Nearly 75% included studies were not mentioned the allocation concealment, thus resulting in a selection bias. All studies were free of incomplete outcome data and selective reporting (Figure 2).

3.4. Operative Time
Fifteen studies [10, 12, 15, 17–28] with 1,404 knees were included for the analysis of operative time. The total standardized mean difference (SMD) was -0.36 (95% CI, -0.67 to -0.04; ) (Figure 3(a)). Substantial heterogeneity was found in the statistical analysis (, ). Subgroup analysis of operative time between MRI-based or CT-based image processing favors the MRI group. Unlike the result from overall effect or from the isolated MRI-based group, the CT-based group showed insignificant difference regarding operative time between PSI and CI.

(a) Operative time

(b) Blood loss
3.5. Blood Loss
Data of blood loss, from 5 studies, were pooled; 2 of the 5 measured the reduction of hemoglobin (g/dL) [17, 22], 2 studies measured the loss volume of 11 blood [12, 29], and one study [24] measured both. In both methods, we found a significant reduction of blood loss. The Std. Mean Difference was -0.48 (95% CI, -0.73 to -0.23; ) and -0.49 (95% CI, -0.92 to -0.05; ) for the hemoglobin loss and blood volume, respectively (Figure 3(b)).
3.6. Malalignment of Mechanical Axis
17 studies [10, 15, 17–23, 25–27, 30–35] involving 1,577 knees reported the number of knees with mechanical axis (hip-knee-ankle, HKA axis) malalignment of >3°. The chi-squared test for heterogeneity was 29.15 (). Numbers of malalignment were similar in both groups (Figure 4) with risk ratio 0.88 (95% CI, 0.74 to 1.04; ). According to the radiographic method and PSI system, the subgroup analysis was assigned. For the radiographic subgroup, the result favors the MRI subgroup as CT shows no significant difference (Figure 4). For the PSI system subgroup, there is no significance difference among each subgroup but Visionaire-specific PSI system is the only system that showed significant difference between PSI and CI (Figure 5).


3.7. Patient-Reported Outcomes
We observed a significant superior outcome of Knee Society Score (KSS) in PSI group compared with CI groups, particularly in KSS-knee group. Patients with the follow-up time of 3 months in 3 studies [12, 15, 24] were included. KSS was analyzed by subgroups of KSS-knee and KSS-function. Meta-analysis was conducted, and the pooled result showed that there is significantly better effect for the PSI group (, 95% CI, -0.33 to -0.02, ). No substantial heterogeneity was found (, ) (Figure 6(a)). For postoperative Oxford Knee Score (OKS), 5 studies [11, 15, 18, 19, 36] were included and no significant difference between PSI and CI groups was found. The total SMD was 0.07 (95% CI, -0.09 to 0.22, ), and there is no heterogeneity found (, ) (Figure 6(a)).

(a) Patient-reported outcomes

(b) Complications

(c) Length of stay
3.8. Postoperative Complication and Length of Stay
Five studies [18, 19, 29, 36, 37] that reported complications were included in meta-analysis (Figure 6(b)) (Table 2) while no significant difference was found (, 95% CI, 0.68 to 1.63; ). Superficial surgical site infection/delayed healing was found the most common complication among 29 studies (, ) (Table 2). Poor range of motion was another common complication observed (, ).
In the analysis of length of stay, 5 studies [12, 17, 21, 27, 29] were chosen. Fixed effects model meta-analysis confirmed no significant difference for either intervention group (Figure 6(c)).
3.9. Publication Bias
Risk of publication bias in the studies that reported on operative time (15 studies) and malalignment (18 studies) was assessed by graphical assessment of funnel plots (Figure 7). Both plots show minimal evidence of publication bias.

(a)

(b)
4. Discussion
Patient-specific instrumentation (PSI) has been introduced to reduce operation time and increase surgical efficacy due to avoidance of intramedullary canal violation. In our work, we did find a shorter operative time performed by PSI than by CI significantly (, ). Thienpont et al. [8] did a meta-analyses including level 1 and level 2 studies, concluding a slight but significant difference for PSI, consistent with our result. Other studies [5, 38] could not reach a significance perhaps their samples’ size was not as large as ours (690 patients for PSI and 714 for CI), thus resulting in the type 2 error.
Compared with the MRI-based group, the CT-based PSI group showed no significance with the CI group, consistent with another study by Noble et al. [28]. In fact, the number of surgical steps of CT-based PSI was not reduced compared with CI, as it did not have a cutting slot or proximal peg holes on the tibial articular surface, which was necessary to use a standard cutting instrument on both the femur and tibia sides [39]. Despite the statistical difference when compared with CI, our subgroup analysis found no significant difference between MRI- and CT-based groups (, ).
Besides operative time, significant reduction of blood loss was also observed in PSI group, regardless of calculating as blood volume () or hemoglobin (Hb) count (). This may also be attributed to the minimally invasive operative way of PSI that diminished dissection. None of the three studies [17, 22, 24] considering Hb reduction reached significance as they were limited by separated sample size. We chose a fixed effects model to analyze the Hb while random effects model for the blood volume due to the heterogeneity we found ( for Hb and for blood volume). The higher heterogeneity for blood volume might be due to the different drained time they chose. When we discuss the effect of different instrumentations on the blood loss, hemoglobin is a more ideal variable than blood volume, for it is directly related to the physiological condition and may be of greater clinical relevance.
Overall, with 1500 patients regardless of image acquisition method, we found no difference between two groups regarding outliers of the mechanical axis. While if we compared the subgroups of MRI-based and CT-based with CI groups, we observed a trend of difference toward MRI-based subgroup () than CT-based subgroup (). Previous meta-analysis [40] and RCT [32] directly comparing MRI with CT modalities also found slightly lower proportion of outliers in the overall alignment of the limb in MRI groups. However, if we take the safety, cost, and convenience into consideration, the question of superiority becomes far beyond the current research. When analyzing the subgroups with different systems, it appeared that the risk of outlier of the mechanical axis was related to specific systems (Figure 5), consistent with the study of Huijbregts et al. [18].
Evidences currently suggested that malalignment was strongly related to postoperative complications [41], such as patellar tracking [42] and kinematics [43]. PSI was produced to improve the limb alignment of the TKA [11]. Yet, our work did not detect any significant difference between PSI and CI regarding postoperative complication as well as rate of outliers. However, postoperative complication is a multifactorial result and whether those complications were alignment-related or instrumentation-related remains unclear. Boonen et al. [36] and Yan et al. [19], for instance, both got a result that patients in the PSI group actually suffered a higher risk of complication compared with CI groups. A possible reason could be that the surgeons enrolled in this study were high-volume knee arthroplasty surgeons with conventional instrumentation, thus producing a better outcome than the PSI as a newer technique.
Patient-reported outcomes (PROs) are the best subjective measurement of functional outcome after joint arthroplasty [44]. Due to our strict inclusion criteria, only Knee Society Score (KSS) and Oxford Knee Score (OKS) were extracted and analyzed. The difference of OKS is insignificant, consistent with previous reviews conducted by Goyal and Tripathy [7] and Mannan et al. [6]. Unlike their studies, all patients included in our analysis were restricted at a 3-month follow-up. Although data from a long-term outcome could not be extracted, existing RCTs suggest that OKS in 2-year [36] and 5-year [11] follow-up is also statistically not different between PSI and CI.
Although we included all level 1 studies, we still detected risk of bias from some included studies. However, we used strict criteria to assess the quality of studies. For example, studies that were designed with only patients in PSI group receiving the MRI or CT examination were considered as high risk of bias, as the blinding from patients was broken. Besides, studies where surgeons performed TKA but were not independent of the trial were also considered as high risk of performance bias.
Our work has several strengths. Firstly, as a meta-analysis with all level one studies of approximately 2500 patients, it has greater statistical power than all the included studies and the previous published meta-analyses. Secondly, to our knowledge, it is the first review to evaluate intraoperative efficacy, postoperative outcomes (radiology, function, and complications), subgroup analysis (different image acquisition methods and PSI systems used), and length of stay which is strongly related to cost-effectiveness analysis, allowing a more comprehensive appraisal of the PSI technology.
Limitation of our analysis still existed. Firstly, the perioperative procedures between two techniques have seldom been evaluated. Time cost of learning PSI technology and processing the image has not been calculated and analyzed. The minor improvement of surgical efficacy might not offset additional perioperative time [45]. As the learning curve for a new technique (e.g., PSI) has always been a matter of debate, we still need more solid evidences to warrant the indication of patient-specific instrumentations.
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Supplementary Materials
Supplementary 1. S1: PRISMA checklist.
Supplementary 2. S2: Supplemental file: comparison of published reviews with our studies.