AbstractPurposeWe performed three-dimensional (3D) dose reconstruction-based pretreatment verification to evaluate gamma analysis acceptance criteria in volumetric modulated arc therapy (VMAT) for prostate cancer.
Materials and MethodsPretreatment verification for 28 VMAT plans for prostate cancer was performed using the COMPASS system with a dolphin detector. The 3D reconstructed dose distribution of the treatment planning system calculation (TC) was compared with that of COMPASS independent calculation (CC) and COMPASS reconstruction from the dolphin detector measurement (CR). Gamma results (gamma failure rate and average gamma value [GFR and γAvg]) and dose-volume histogram (DVH) deviations, 98%, 2% and mean dose-volume difference (DD98%, DD2% and DDmean), were evaluated. Gamma analyses were performed with two acceptance criteria, 2%/2 mm and 3%/3 mm.
ResultsThe GFR in 2%/2 mm criteria were less than 8%, and those in 3%/3 mm criteria were less than 1% for all structures in comparisons between TC, CC, and CR. In the comparison between TC and CR, GFR and γAvg in 2%/2 mm criteria were significantly higher than those in 3%/3 mm criteria. The DVH deviations were within 2%, except for DDmean (%) for rectum and bladder.
ConclusionsThe 3%/3 mm criteria were not strict enough to identify any discrepancies between planned and measured doses, and DVH deviations were less than 2% in most parameters. Therefore, gamma criteria of 2%/2 mm and DVH related parameters could be a useful tool for pretreatment verification for VMAT in prostate cancer.
IntroductionExternal beam radiation therapy is one of the main treatment options for prostate cancer [1]. Intensity-modulated radiotherapy (IMRT), which has been widely used for prostate cancer, can provide conformal target coverage while minimizing the dose to adjacent organs [2]. Volumetric modulated arc therapy (VMAT) is an IMRT technique that delivers rotational cone beams with simultaneous modulation of multileaf collimator (MLC) leaves and dose rates [3]. Compared with IMRT which uses fixed gantry angles, VMAT provides an extra degree of freedom, and can reduce treatment delivery time and monitor units (MUs) while providing equivalent or improved dose distributions [4,5]. However, because of its complexities in treatment planning and treatment delivery systems, VMAT requires a stringent pretreatment verification to check whether the intended doses will be delivered accurately.
Gamma analysis, introduced by Low et al. [6] in 1998, has been predominantly used for pretreatment verification [7-10]. It compares reference and evaluated dose distributions based on a combination of dose difference and distance-to-agreement (DTA) criteria and calculates gamma index (γ) and gamma passing/failure rates. Gamma failure rate (GFR), which is the percentage of points with γ > 1, has been used as a tool to decide whether or not a treatment plan is acceptable. Many studies, including AAPM TG-218, have made efforts to develop recommendations on the choice of acceptance criteria for dose difference/DTA as well as the tolerance and action limits for gamma passing/failure rates [6-10]. However, whether the GFR is sufficient to detect clinically relevant error is unknown, as this measure only focusses on the quantity of error. In addition, recommendations for gamma analysis are not cancer- or structure-specific. Although three-dimensional (3D) dose reconstruction-based pretreatment verification is available, no recommendation regarding its interpretation and incorporation into choosing acceptance criteria for gamma analysis have been made.
In this study, we performed 3D dose reconstruction-based pretreatment verification to evaluate gamma analysis acceptance criteria in VMAT for prostate cancer.
Materials and MethodsPretreatment verifications for 28 VMAT plans for prostate cancer patients treated between May 2017 and April 2018 were performed using the COMPASS system in conjunction with a dolphin detector (IBA Dosimetry GmbH, Schwarzenbruck, Germany). This study was approved by the Institutional Review Board of Wonkwang University Hospital (WKUH 2018-12-004). The informed consent was waived.
1. Simulation and VMAT planEach patient was immobilized with a dual leg positioner in the supine position, and computed tomography (CT) images from the second lumbar vertebra to the proximal half of the femur with a 3-mm slice thickness were obtained. Patients were instructed to void, drink 500 mL of water, and then wait for 30 minutes before the acquisition of CT images as well as before each fraction of treatment. Clinical target volume (CTV) was defined as the surgical bed of prostate and seminal vesicles in adjuvant and salvage radiotherapy. In cases of definitive radiotherapy (n = 5), CTV included prostate gland and seminal vesicles on the planning CT. Planning target volume (PTV) was defined as expansion of CTV with a margin of 5 mm in the other direction, except for the rectum, where a 3-mm margin was used. Organs at risk (OARs), including rectum, bladder, both femur heads, and sigmoid colon were also contoured. The PTV dose of each patient was prescribed as 66 Gy, 70 Gy, and 76 Gy as the adjuvant, salvage, and definitive aim in 2 Gy fractions, respectively.
All VMAT plans were created using two-arc with a 6-MV photon beam and a maximum dose rate of 600 MU/min using a VitalBeam linear accelerator mounted with a millennium 120 MLC system (Varian Medical System, Palo Alto, CA, USA). The first arc beam (clockwise rotation) ranged from 181° to 179°, and the second arc beam (counterclockwise rotation) ranged from 179° to 181°. The collimator angles were 330° and 30° for the first and second arc beam, respectively. The plan objectives were as followings: (1) the prescribed dose was at least 95% of PTV and at least 99% of CTV; (2) a maximum dose (Dmax) of <40 Gy to the sigmoid colon and both femur heads; (3) a mean dose (Dmean) of <18 Gy to the femur; (4) the volume of the rectum irradiated by 68 Gy, 65 Gy, 60 Gy, and 50 Gy (V68Gy, V65Gy, V60Gy, and V50Gy) to <15%, <25%, <35%, and <50%, respectively; and (5) the V62Gy of the bladder was <40%. Optimization was performed using an Eclipse photon optimizer (version 13.7.14), and dose calculations were performed using the Anisotropic Analytic Algorithm (version 13.7.14) with a grid size of 2.5 mm.
2. Pretreatment verification and statisticsThe pretreatment verification was performed using a COMPASS system with a dolphin detector. First, a DICOM dataset consisting of CT data, structure, plan, and dose was exported from the treatment planning system (TPS) to the COMPASS system. Second, an independent calculation was done using a collapsed cone algorithm with a grid size of 2.5 mm (COMPASS dose engine, version 4.0). Third, the measurement was done using the dolphin detector, a two-dimensional array with 1,513 ionization chambers, according to the manufacturer guidelines (COMPASS 4.0 user’s guide): (1) the detector was mounted on the gantry of the VitalBeam; (2) the detector setup procedure was performed, including pre-irradiation (1000 MU at field size of 40 cm × 40 cm), background measurement (100 MU at field size of 10 cm × 10 cm), and subsequent absolute calibration; (3) the detector was connected to the COMPASS system, and the gantry angle sensor was tested; (4) the measurement was performed and assigned to the plan; (5) the measurement was exported; and (6) 3D reconstructed dose distribution from the dolphin detector was calculated with the COMPASS dose engine.
Fig. 1 shows a representative case of pretreatment verification using the COMPASS system with the dolphin detector. The 3D reconstructed dose distribution of the TPS calculation (TC) was compared with that of COMPASS independent calculation (CC) and COMPASS reconstruction from the dolphin detector measurement (CR), and the gamma results and dose-volume histogram (DVH) deviations were evaluated for each structure, PTV, CTV, as well as rectum and bladder. Gamma results, including GFR and average gamma value (γAvg) were automatically calculated according to the equations defined by Low et al. [6]. Global normalization was used, and gamma results with gamma criteria of 2%/2 mm were compared with those with criteria of 3%/3 mm using paired t-tests. DVH deviations regarding clinically relevant errors were evaluated: the DD98% (%), DD2% (%), and DDmean (%) for target volumes; the DD2% (%) and DDmean (%) for rectum and bladder. The DD98% (%) and DD2% (%) were defined as the percentage difference in the dose received by 98% and 2% volumes of structure. The DDmean (%) was defined as the percentage difference in the mean dose of the structure. Correlation between gamma results and DVH deviations was examined, and Pearson correlation coefficient >0.8 (p < 0.05) was considered to be significant. All statistical tests were performed at 5% level of significance using SPSS version 21 (IBM SPSS Statistics, Armonk, NY, USA).
Results1. Gamma resultsThe structure-specific GFR and γAvg are shown in Tables 1 and 2. The GFR in 2%/2 mm criteria ranged up to 7.45%. When TC and CC were compared, the mean GFR in 2%/2 mm criteria for PTV, CTV, rectum, and bladder were 0.18%, 0.10%, 1.41%, and 1.23%, respectively. In comparison between TC and CR, the values were 3.34%, 4.04%, 0.38%, and 2.28%, respectively. The GFR in 3%/3 mm criteria were less than 1% for all structures in all comparisons between TC, CC, and CR. The mean GFRs in 3%/3 mm criteria were less than 0.1%. In comparison between TC and CR, GFR in 2%/2 mm criteria were significantly higher than those in 3%/3 mm criteria for all structures.
The mean γAvg in 2%/2 mm criteria for PTV, CTV, rectum, and bladder were 0.31, 0.36, 0.34, and 0.40, respectively, in comparison between TC and CC. In comparison between TC and CR, these values were 0.46, 0.52, 0.38, and 0.40, respectively. The γAvg in the 2%/2 mm criteria ranged up to 0.66, and were significantly higher than those in the 3%/3 mm criteria (which ranged up to 0.46) for all structures in all comparisons between TC, CC, and CR.
2. DVH deviationsDVH deviations in each structure are shown in Table 3. For PTV and CTV, DD98%, DD2%, and DDmean were less than ±2% in all comparisons between TC, CC, and CR. The mean DD98%, DD2%, and DDmean for PTV were -0.42%, 0.61%, and 0.57% in comparison between TC and CC, and -0.85%, 0.88%, and 0.49% in comparison between TC and CR, respectively.
For rectum and bladder, the DD2% values were less than ±2% in all comparisons between TC, CC, and CR. The mean DD2% for rectum and bladder were 0.43% and 0.75%, respectively, in comparisons between TC and CC. In comparison between TC and CR, these values were 0.81% and 1.10%, respectively. The mean DDmean for rectum and bladder were -1.09% and -1.33%, respectively, in comparison between TC and CC; for comparison between TC and CR, these values were 1.28% and -0.39%, respectively.
3. Correlation analysisThere was no strong correlation between the gamma results and DVH deviations for any structure, except for CTV (Table 4). For CTV, Pearson correlation coefficient was 0.961 (p < 0.001) between the γAvg in the 2%/2 mm criteria and the |DDmean|, and 0.974 (p < 0.001) between the γAvg in the 3%/3 mm criteria and the |DDmean|.
Discussion and ConclusionIn this study, 3D dose reconstruction-based pretreatment verification of VMAT plans for prostate cancer was performed. Both the independent calculation with the collapsed cone algorithm and the measurement with dolphin detector were used to validate the TPS calculation. The independent calculation has been used in many centers as another calculation that may reveal errors in TPS, and is generally less time consuming than measurement based verification [9]. However, independent calculations may include errors due to the known uncertainties of the model-based algorithm [11]. In addition, errors in the actual delivery of VMAT plans are not validated by the independent calculation, and measurement based verification is also necessary.
We found that there was little discrepancy between the planned and independently calculated doses, as well as between the planned and measured doses. The TPS calculation by the Anisotropic Analytic Algorithm showed good agreement with the independent calculation using the collapsed cone algorithm in the COMPASS dose engine. The collapsed cone algorithm uses a 3D convolution superposition model, and is one of the most advanced calculation algorithms [12,13]. Chopra et al. [13] evaluated the accuracy of five dose calculation algorithms, and found that the collapsed cone algorithm along with the Acuros XB and Monte Carlo-based algorithms correlated best with measured data in both heterogeneous phantom and homogeneous phantom. In contrast, the Anisotropic Analytic Algorithm was reported to be less sensitive to tissue heterogeneities than the collapsed cone algorithm [12-14]. Differences between the Anisotropic Analytic Algorithm and the measured values were 3%–5% and 1.5%–3.5% in heterogeneous lung phantom and heterogeneous bone phantom, respectively, despite the excellent agreement between the Anisotropic Analytic Algorithm and the measured values in the homogeneous phantom [13]. However, our results found little discrepancy between the TPS calculation using the Anisotropic Analytic Algorithm and the independent calculation with the collapsed cone algorithm. This little discrepancy may be for several reasons; firstly, the calculated and measured regions of the VMAT plans consisted of relatively homogenous materials as the treatment volume included prostate and seminal vesicles or the surgical bed, excluding the regional lymphatic area. Therefore, in heterogeneous tissues which include air, such as the head and neck, or lung, the comparison between the Anisotropic Analytic Algorithm and collapsed cone algorithm may differ from the results found here. Because of this, further studies are needed in these types of cancer. Secondly, the degree of modulation in VMAT plans for prostate cancers can also be lower than that for other cancers due to the small treatment field size and less irregular shape of the target volume. Although there has been no concern about the cancer- or structure-specific acceptance criteria for gamma analysis, the degree of modulation may affect the quantity and magnitude of errors, and several studies have reported similar results to ours [14-16]. In the study by Vieillevigne et al. [16], which performed pretreatment verification for VMAT plans using three different detectors in 15 cases of prostate cancer, the GFRs were less than 2% and less than 7% for the acceptance criteria of 3%/3 mm and 2%/2 mm in all detectors, respectively. Thirdly, the accuracy of beam modeling in TPS might be improved by using a more recent version of TPS than that used in previous studies.
As the degree of clinically relevant errors may vary depending on the type of cancer, TPS and the delivery system used as well as the dosimetric measurement and analysis tools for pretreatment evaluation, cancer- and institution-specific criteria for pretreatment verification seems to be important. The 3D dose reconstruction-based pretreatment verification provides more thorough information on the quantity, magnitude, and clinical impacts of potential errors (such as under- or over-dosage than the TPS calculation). However, previous studies have mainly focused on the correlation analysis between gamma results and DVH deviations. Similar to our findings, previous studies reported weak or absence of correlation between GFR and DVH deviations in VMAT [17-20] and IMRT [21-24] plans for prostate cancers [17,18,20,23], as well as for other cancers [19-24]. Fundamentally, clinically relevant DVH parameters are extracted from the statistical analyses for the entire dose distribution, despite gamma results evaluating dose differences point by point. Weak or absence of correlation between GFR and DVH deviations is therefore likely inevitable regardless of cancer types, plans, and normalization methods, and additional evaluations for DVH parameters are needed to detect clinically relevant errors.
In the present study, we tried to evaluate gamma results with two acceptance criteria as well as DVH deviation in prostate cancer VMAT plans. Our results showed that the 2%/2 mm criteria was more appropriate than the 3%/3 mm criteria in VMAT plans for prostate cancer, as the 3%/3 mm criteria might not be stringent enough to detect any discrepancies between the planned and measured dose distributions. There are several limitations to the present study. First, plans with intended errors were not evaluated. Second, pretreatment verification was performed with the COMPASS system with the dolphin detector only. There are several commercially available tools for 3D dose reconstruction based pretreatment verification, and gamma results and DVH deviations can be affected by spatial resolution of the detector, as well as by interpolation and the calculation algorithm [10,25]. Third, unlike the IMRT plans that use fixed gantry angles, VMAT plans cannot be evaluated by perpendicular field-by-field measurement, and the composite measurement used here may mask errors due to the summation of errors [10]. Fourth, dose errors in the low dosimetric regions can also be underestimated due to the global normalization, which evaluates dose differences based on the same value (the maximum planned dose) for all points. Fifth, 3D reconstructed dose distribution from the dolphin detector has an inherent limitation because COMPASS dose engine uses collapsed cone algorithm to calculate 3D dose from the measured 2D fluence map.
Although there have been no practical guidelines for the interpretation of DVH deviations, the target volume- or structure-specific DVH deviation may be very useful tools in the dosimetric evaluation for comparison between the planned and measured dose distribution. Our analysis showed that a dose difference in 98%, 2%, and mean volume of PTV, CTV, rectum, and bladder were within ±2% between TC, CC, and CR. Further study for the proper dosimetric objects and appropriate tolerances in analysis of DVH deviations is needed.
In conclusions, the 3%/3 mm criteria were not stringent enough to identify any discrepancies between the planned and measured dose, and DVH deviations were less than 2% in most parameters. Therefore, the gamma criteria of 2%/2 mm and DVH related parameters could be useful tools in the pretreatment verification for VMAT in prostate cancer.
Table 1.
Table 2.
Table 3.Values are presented as mean (95% confidence interval). PTV, planning target volume; CTV, clinical target volume; TC, treatment planning system calculation; CC, COMPASS independent calculation; CR, COMPASS reconstruction from the dolphin detector measurement; DD98% (%), percentage difference in dose received by 98% volume of structure; DD2% (%), percentage difference in dose received by 2% volume of structure; DDmean (%), percentage difference in mean dose of structure. Table 4.
DVH, dose-volume histogram; PTV, planning target volume; CTV, clinical target volume; DD98% (%), percentage difference in dose received by 98% volume of structure; DD2% (%), percentage difference in dose received by 2% volume of structure; DDmean (%), percentage difference in mean dose of structure; GFR, gamma failure rate; γAvg, average gamma value. References1. Bekelman JE, Rumble RB, Freedland SJ. Clinically localized prostate cancer: ASCO Clinical Practice Guideline Endorsement of an AUA/ASTRO/SUO Guideline Summary. J Oncol Pract 2018;14:618–24.
2. Nguyen PL, Gu X, Lipsitz SR, et al. Cost implications of the rapid adoption of newer technologies for treating prostate cancer. J Clin Oncol 2011;29:1517–24.
3. Khan FM, Gibbons JP. Khan's the physics of radiation therapy. 5th ed. Philadelphia, PA: Lippincott Williams & Wilkins; 2014.
4. Hatano K, Tohyama N, Kodama T, Okabe N, Sakai M, Konoeda K. Current status of intensity-modulated radiation therapy for prostate cancer: History, clinical results and future directions. Int J Urol 2019;26:775–84.
5. Cho B. Intensity-modulated radiation therapy: a review with a physics perspective. Radiat Oncol J 2018;36:1–10.
6. Low DA, Harms WB, Mutic S, Purdy JA. A technique for the quantitative evaluation of dose distributions. Med Phys 1998;25:656–61.
7. Muzik J, Soukup M, Alber M. Comparison of fixed-beam IMRT, helical tomotherapy, and IMPT for selected cases. Med Phys 2008;35:1580–92.
8. Ezzell GA, Burmeister JW, Dogan N, et al. IMRT commissioning: multiple institution planning and dosimetry comparisons, a report from AAPM Task Group 119. Med Phys 2009;36:5359–73.
9. Low DA, Moran JM, Dempsey JF, Dong L, Oldham M. Dosimetry tools and techniques for IMRT. Med Phys 2011;38:1313–38.
10. Miften M, Olch A, Mihailidis D, et al. Tolerance limits and methodologies for IMRT measurement-based verification QA: recommendations of AAPM Task Group No. 218. Med Phys 2018;45:e53–e83.
11. Alber M, Mijnheer B, Georg D, et al. Guidelines for the verification of IMRT. Brussels, Belgium: European Society for Therapeutic Radiology and Oncology; 2008.
12. Vangvichith M, Autret D, Tiplica T, Barreau M, Dufreneix S. Comparison of five dose calculation algorithms in a heterogeneous media using design of experiment. Phys Med 2019;61:103–11.
13. Chopra KL, Leo P, Kabat C, et al. Evaluation of dose calculation accuracy of treatment planning systems in the presence of tissue heterogeneities. Ther Radiol Oncol 2018;2:420–7.
14. Kathirvel M, Subramanian S, Clivio A, et al. Critical appraisal of the accuracy of Acuros-XB and Anisotropic Analytical Algorithm compared to measurement and calculations with the compass system in the delivery of RapidArc clinical plans. Radiat Oncol 2013;8:140.
15. Kim JI, Choi CH, Wu HG, Kim JH, Kim K, Park JM. Correlation analysis between 2D and quasi-3D gamma evaluations for both intensity-modulated radiation therapy and volumetric modulated arc therapy. Oncotarget 2017;8:5449–59.
16. Vieillevigne L, Molinier J, Brun T, Ferrand R. Gamma index comparison of three VMAT QA systems and evaluation of their sensitivity to delivery errors. Phys Med 2015;31:720–5.
17. Sdrolia A, Brownsword KM, Marsden JE, Alty KT, Moore CS, Beavis AW. Retrospective review of locally set tolerances for VMAT prostate patient specific QA using the COMPASS system. Phys Med 2015;31:792–7.
18. Cozzolino M, Oliviero C, Califano G, et al. Clinically relevant quality assurance (QA) for prostate RapidArc plans: gamma maps and DVH-based evaluation. Phys Med 2014;30:462–72.
19. Jin X, Yan H, Han C, Zhou Y, Yi J, Xie C. Correlation between gamma index passing rate and clinical dosimetric difference for pre-treatment 2D and 3D volumetric modulated arc therapy dosimetric verification. Br J Radiol 2015;88:20140577.
20. Park JM, Kim JI, Park SY, Oh DH, Kim ST. Reliability of the gamma index analysis as a verification method of volumetric modulated arc therapy plans. Radiat Oncol 2018;13:175.
21. Nelms BE, Zhen H, Tome WA. Per-beam, planar IMRT QA passing rates do not predict clinically relevant patient dose errors. Med Phys 2011;38:1037–44.
22. Zhen H, Nelms BE, Tome WA. Moving from gamma passing rates to patient DVH-based QA metrics in pretreatment dose QA. Med Phys 2011;38:5477–89.
23. Stasi M, Bresciani S, Miranti A, Maggio A, Sapino V, Gabriele P. Pretreatment patient-specific IMRT quality assurance: a correlation study between gamma index and patient clinical dose volume histogram. Med Phys 2012;39:7626–34.
|
|