Abstract
Phillyrin, a well-known bisepoxylignan, has been shown to have many critical pharmacological activities. In this study, a novel strategy for the extensive acquisition and use of data was established based on UHPLC-Q-Exactive mass spectrometry to analyze and identify the in vivo metabolites of phillyrin and to elucidate the in vivo metabolic pathways of phillyrin. Among them, the generation of data sets was mainly due to multichannel data mining methods, such as high extracted ion chromatogram (HEIC), diagnostic product ion (DPI), and neutral loss filtering (NLF). A total of 60 metabolites (including the prototype compound) were identified in positive and negative ion modes based on intuitive and useful data such as the standard’s cleavage rule, accurate molecular mass, and chromatographic retention time. The results showed that a series of biological reactions of phillyrin in vivo mainly included methylation, hydroxylation, hydrogenation, sulfonation, glucuronidation, demethylation, and dehydrogenation and their composite reactions. In summary, this study not only comprehensively explained the in vivo metabolism of phillyrin, but also proposed an effective strategy to quickly analyze and identify the metabolites of natural pharmaceutical ingredients in nature.
1. Introduction
Phillyrin, as an essential bisepoxylignan, is the main component of Forsythia suspensa (Thunb.) Vahl in plants belongs to the family Oleaceae [1–3]. Modern pharmacological research has shown that phillyrin not only has potent biological activity, but also plays a huge role that cannot be ignored in resisting diseases and maintaining human health, such as inhibiting inflammatory response, antiviral, antioxidative stress, and anticell apoptosis [4–8]. Phillyrin has been reported to improve insulin resistance in the body [9, 10], and it can also reduce the weight of obese mice through specific pathways [11]. Besides, phillyrin can decrease the formation and function of osteoclasts and prevent LPS-induced osteolysis in mice [12]. However, there are still many deficiencies in the comprehensive research on phillyrin metabolism in vivo. Therefore, revealing the metabolites and metabolic pathways of phillyrin is of great significance for its further development and utilization.
As we all know, liquid chromatography-mass spectrometry (LC-MS) combines the high separation power of liquid chromatography with the forceful qualitative and quantitative ability of mass spectrometry, which has the advantages of high sensitivity, high selectivity, and rich structural information [13–15]. UHPLC-Q-Exactive MS (ultra high-performance liquid chromatography coupled with hybrid quadrupole-Orbitrap mass spectrometry) can thoroughly and extensively obtain the structural information of compounds by using the functions of positive and negative ion scanning mode, full scanning, and automatic triggering of secondary mass spectrometry scanning. The superior resolution based on Orbitrap technology can quickly achieve the measurement of high-accuracy mass. Therefore, it can identify and confirm small molecule compounds in mixtures, is a powerful tool for analyzing complex compound systems, and has been extensively used in compound identification and screening [16–18]. By obtaining sufficient fragment information and analyzing composition information of a compound online, the molecular weight of the compound can be determined, and the possible molecular structure can be inferred based on the primary, secondary, and even multilevel mass spectral information [19–21]. In consequence, it is imperative to form a comprehensive and authoritative data set in the data processing process. Data mining methods reported in recent years have emerged endlessly, mainly including the following categories: mass defect filter, extracted ion chromatogram, diagnostic product ion (DPI), neutral loss filtering (NLF), and isotope pattern filtering [22–27].
In this paper, we have established a new strategy that combined UHPLC-Q-Exactive MS with multiple data mining analysis methods. Due to its slight detection limit and narrow deviation range, it is feasible to analyze and identify metabolites in rats for oral administration of phillyrin. Based on this study, we further proposed the in vivo metabolic pathways of phillyrin, which made up for the shortcomings of the current insufficient research on phillyrin metabolism and was conducive to revealing the in vivo mechanism of many pharmacological activities of phillyrin.
2. Materials and Methods
Based on the previous research results of our research group, we adopted the experimental methods and sample processing procedures of our predecessors [28, 29].
2.1. Chemicals and Reagents
The reference standard of phillyrin was purchased from Chengdu Must Biotechnology Co. Ltd. (Sichuan, China). After HPLC-UV analysis, the purity of phillyrin was not less than 98%. Acetonitrile, methanol, and formic acid (HPLC grade) used in the mobile phase were provided by Fisher Scientific (Fair Lawn, NJ, USA). C18-low solid-phase extraction (SPE) cartridges (3 L/60 g) for biological sample pretreatment were obtained from Waters (Milford, MA, USA). Ultrapure water was freshly prepared using the Milli-Q Gradient Å 10 water purification system (Millipore, Billerica, MA, USA). Moreover, other reagents and solvents met the requirements of analytical experiments in Beijing Chemical Works (Beijing, China).
2.2. Animals and Drug Administration
Sixteen male SD rats weighing 200–220 g were purchased from Beijing Weitong Lihua Experimental Animals Company (Beijing, China). All animals were kept under specific environmental conditions (temperature 22 ± 1°C, humidity 60 ± 10%, 12-hour day and night change), with free access to food and water for one week. The rats were then randomly divided into Drug Group (for experimental urine, plasma and feces, n = 8) and Control Group (for blank urine, plasma, and feces, n = 8). The reference standard of phillyrin was suspended in 0.5% sodium carboxymethyl cellulose (CMC-Na) solution. The Drug Group was orally administered phillyrin (300 mg/kg), while Control Group was given an equal amount of 0.5% CMC-Na solution by oral gavage. All animals were fasted for 12 hours before the experiment but had free access to water. The experimental protocol has been approved by the institutional Animal Care and Use Committee in Beijing University of Chinese Medicine. All procedures were carried out according to the Guide for the Care and Use of Laboratory Animals of the US National Institutes of Health.
2.3. Sample Collection
2.3.1. Plasma Sample Collection
After oral administration, all rats were placed in metabolic cages. Blood samples (about 0.5 ml) were taken from the infraorbital venous plexus of rats at time points of 0.5, 1, 1.5, 2, and 4 h after administration, and the operation was identical for each rat. Blank and experimental samples were obtained from the Control Group and the Drug Group, respectively. All blood samples were placed in a heparin sodium anticoagulated EP tube for 15 minutes and then centrifuged at 3000 rpm (4°C) for 15 minutes to separate plasma. After that, the plasma from the same group was combined into an aggregate and stored at −80°C until use.
2.3.2. Urine Sample Collection
Urine samples (0–24 h) were collected from each rat using a metabolic cage, and each sample was centrifuged at 3000 rpm (4°C) for 15 min to obtain a supernatant. The urine supernatants of each group of rats were mixed and stored at −80°C until use.
2.3.3. Feces Sample Collection
Fecal samples (0–24 h) were obtained from each rat using a metabolic cage, then dried, and ground to a powder. The fecal powders of each group of rats were mixed. Firstly, 0.5 g of fecal powder was dissolved in 70% methanol, and then it was extracted for 30 min by ultrasonic. Finally, fecal samples of rats in the Control Group and the Drug Group were obtained.
2.4. Biological Sample Preparation
All biological samples were prepared by precipitating and concentrating proteins and solid residues. Firstly, the SPE cartridges were pretreated by activation with methanol (5 ml) and deionized water (5 ml). Afterward, plasma, urine, and feces samples (1 ml) were added to the SPE cartridges, respectively. Then, the SPE cartridges were eluted with deionized water (5 ml) and methanol (3 ml) in that order, and the methanol eluates were collected. Finally, the eluates were dried with nitrogen at room temperature, and the residue was redissolved in 5% acetonitrile solution (100 μl) and centrifuged at 14000 rpm (4°C) for 15 min. The supernatant obtained by the above method was used for instrumental analysis.
2.5. Instruments and Conditions
All biological samples were analyzed using a UHPLC-Q-Exactive mass spectrometer (Thermo Fisher Scientific, MA, USA) equipped with an ESI ion source. Chromatographic separation was performed on a Waters ACQUITY BEH C18 column (2.1 mm × 100 mm × 1.7 μm; Waters Corporation, Milford, MA, USA), and the temperature was set at 25°C. The mobile phase consists of 0.1% formic acid solution (A) and acetonitrile (B), and the linear gradient was set as follows: 0–1 min, 5% B; 1–3 min, 5%–25% B; 3–8 min, 25%–60% B; 8–15 min, 60%–80% B; 15–20 min, 80%–100% B; 20–24 min, 100% B; 24–25 min, 100%–5% B; and 25–30 min, 5% B. The flow rate was 0.3 mL/min and the injection volume was 2 μL.
The optimal operating parameters in negative and positive ion mode were set as follows: electrospray voltage, 4 kV; capillary temperature, 350°C; sheath gas flow rate, 275 kPa; auxiliary gas flow rate, 138 kPa; auxiliary gas temperature, 300°C; and collision energy, 40 eV. Metabolites were detected from m/z 100–1500 using full-scan mass spectrometry with a resolution of 70000. Furthermore, the acquisition mode that relies on the parent ion list- (PIL-) DE was used to obtain the MSn phase of the acquired data set.
2.6. Data Processing
A Thermo Xcalibur 2.1 workstation (Thermo Scientific, Bremen, Germany) was utilized to acquire and process HR-ESI-MS1 and MSn data. Based on the established screening templates of metabolites, by searching the metabolites with specific molecular weight to compare the high-resolution extracted ion chromatograms of the Control Group and the Drug Group, the metabolites related to phillyrin were selected. To obtain as many phillyrin metabolite ions as possible, the peaks detected with intensity over 10,000 for negative ion mode and 50,000 for positive ion mode were selected for further structural characterization. Based on the exact mass of the metabolite and the set elemental composition, the formula predictor could accurately calculate the chemical formula of all parent ions. The type and number of atoms were set as follows: C [0–40], H [0–60], O [0–25], S [0–2], N [0–5], and ring double bond (RDB) equivalent value [0–15]. At the same time, MetWorks (Version 1.3) and Mass Frontier (Version 8.0) software (Thermo Scientific, Waltham, MA, USA) were used as auxiliary tools for mass fragmentation behaviors analysis and structural identification.
3. Results and Discussion
3.1. The Construction of Analysis Strategy
This study constructed an effective new strategy based on UHPLC-Q-Exactive MS for data acquisition combined with multipath data mining to analyze the metabolism of phillyrin in vivo. First of all, the ESI-MSn data set of the samples in both positive and negative ion modes were obtained by using the data-dependent scanning (DDS) acquisition method. Secondly, the common metabolites of phillyrin were determined according to literature reports and HREIC search, thereby establishing the screening templates for phillyrin metabolites. Then, by analyzing the mass fragmentation behaviors of the reference standards, the reasonable DPIs and NLFs of phillyrin were summarized. Afterward, based on the chromatographic retention time, the established DPIs and NLFs, and the corresponding calculated ClogP values, the structure of a series of metabolites of phillyrin could be identified. Finally, the metabolic pathway of phillyrin can be inferred from the above metabolic data. The entire research process of this strategy is shown in Figure 1.

3.2. Establishment of the Screening Templates for Metabolites
According to literature reports and HREICs search, three compounds (phillyrin, phillygenin, and enterolactone) were selected as primary metabolites, which are often found in the metabolites of phillyrin. After data sorting, in negative ion mode, the metabolites screening templates were set as follows: (1) phillyrin template (m/z 533.2017); (2) phillygenin template (m/z 371.1489) and its derivative templates (m/z 329.1020 for detrimethyl, m/z 357.1333 for demethylation, m/z 373.1646 for hydrogenation, and m/z 387.1438 for oxygenation); (3) enterolactone template (m/z 297.1121). Based on this effective method, some nondiscoverable metabolites could also be screened out from complex background noise.
3.3. Analysis of Mass Fragmentation Behavior of Phillyrin
To further explore the ESI-MSn fracture behavior of phillyrin, a comprehensive analysis of the standard solution was performed using UHPLC-Q-Exactive MS. For example, phillyrin in negative ion mode could find [M + HCOOH–H]− ion at m/z 579.2072 in the ESI-MS1 spectrum. Due to the absence of the glucose fragment, its characteristic ion peak was found at m/z 371 of the ESI-MS2 spectrum. On this basis, several series of characteristic product ions were retrieved at m/z 357, m/z 341, m/z 327, and m/z 311 because a series of fragments such as CH2, 2CH2, CH2 + CH2O, and 2CH2O were successively lost. Compounds with the same parent nucleus will have similar cleavage fragments in the ESI-MSn spectrum, so a comprehensive identification of metabolites can be achieved based on regular DPIs and NLFs. For example, the DPI at m/z 371 was diagnosed due to the absence of a glucose moiety. Thus the presence of DPIs at m/z 371 or m/z 371 + X in the ESI-MS2 spectrum of the compound provided comparable information for the identification of metabolites. At the same time, the continuous appearance of 14 Da (CH2) and 30 Da (CH2O) NLFs in the ESI-MSn spectrum of phillyrin also provided significant help for the identification of metabolites. The cleavage pathway of phillyrin in negative ion mode is presented in Figure 2. Moreover, the mass fragmentation behaviors of phillyrin in positive ion mode are shown in Figure 3.


3.4. Identification of Phillyrin Metabolites in Rats
The total ion chromatograms (TICs) of urine, plasma, and feces samples after oral administration of phillyrin in rats were obtained using UHPLC-Q-Exactive mass spectrometry. A total of 60 metabolites were found in both positive and negative ion modes by processing the data collected from the UHPLC-Q-Exactive instrument. Among them, there were 31 metabolites in positive ion mode and 33 metabolites in negative ion mode. In addition, after the literature search and comparison, 21 metabolites have been found and detected by predecessors [30, 31], while the remaining 39 metabolites have been screened and identified through the established strategy. All relevant mass spectral data are summarized in Table 1, and high-resolution extracted ion chromatograms (HREICs) of all metabolites of phillyrin are shown in Figure 4.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

(i)

(j)
3.4.1. Identification of Metabolites Based on Phillyrin
The metabolite M0 producing [M + HCOOH–H]- ion at m/z 579.2072 (C28H35O13, 2.32 ppm) was eluted at 6.60 min. In contrast to the elution time and fragmentation behavior of the phillyrin standard, M0 could be accurately inferred as the phillyrin [32, 33].
The retention times of the metabolites M17 and M22 were 5.76 and 5.96 min, respectively, and they showed the same molecular ion at m/z 519.1498 (C25H27O12, error ≤ ±2.50 ppm). In the ESI-MS2 spectrum, the DPI at m/z 343 was generated due to the loss of 2CH2 by the phillygenin. While m/z 519 was 176 Da larger than the former, it was inferred that m/z 343 was caused by the absence of the glucuronic acid moiety. Therefore, it was speculated that M17 (ClogP, −0.41) and M22 (ClogP, −0.39) were demethylated and carboxylated metabolites of phillyrin, but the position of glucuronic acid group was different.
The metabolite M32 was eluted at 6.58 min and showed [M-H]− ion at m/z 533.1652 (C26H29O12, 2.15 ppm). Similarly, as in the case of M17, the presence of m/z 175 meant the formation of glucuronic acid. While M32 was 14 Da higher than M17, it was speculated that the phillygenin only had one molecule of CH2 removed.
The metabolite M36 with a retention time of 6.76 min showed [M-H]− ion at m/z 547.1809 (C27H31O12, 2.17 ppm). Similar to the above rule, it was concluded that M36 was the carboxylation product of phillyrin.
Metabolite M56 exhibited [M-H]− ion at m/z 507.2225 (C26H35O10, 2.28 ppm). It had a neutral loss of 44 Da, 132 Da, and 44 Da in the ESI-MS2 spectrum, which caused the appearance of m/z 463, m/z 331, and m/z 287. Among them, the 44 Da NLF can be presumed to be CH2O + CH2, while the NLF of 132 Da was formed by the loss of CH2O from the glucosyl group. Therefore, it was concluded that M56 is the dehydroxylation and hydrogenation product of phillyrin.
3.4.2. Identification of Metabolites Based on Phillygenin
The retention time of the metabolite M44 was 7.05 min, which showed [M + H]+ ion at m/z 305.1772 (C19H21N4, 5.95 ppm). According to the DPI at m/z 109 and m/z 93, it could be inferred that two benzene rings of phillygenin were introduced into 2NH2 and CH3, respectively. Interestingly, m/z 205 was 28 Da higher than m/z 177, and m/z 177 was also 28 Da higher than m/z 149, so NLF was assumed to be CH=NH by molecular formula and degree of unsaturation. Therefore, M44 was presumed to be the ammoniated product of phillygenin.
The metabolite M2 eluted at 2.49 min showed [M−H]− ion at m/z 315.0865 (C17H15O6, −8.27 ppm). In the ESI-MS2 spectrum, the DPI at m/z 193 demonstrated that one of the phenyl rings of the phillygenin was introduced into 4OH, and the mother nucleus lost 2CH2O. According to the DPI at m/z 124, 137, and 150, the generation of two double bonds could also be proved.
Metabolite M18 was eluted at 5.92 min and showed [M−H]− ion at m/z 329.1021 (C18H17O6, 3.33 ppm). The presence of DPI at m/z 137 and 163 in the ESI-MS2 spectrum confirmed the presence of dimethoxyphenyl, while the DPI at m/z109 speculated that 2OH was introduced on the other benzene ring. However, the substitution sites of the two other hydroxyl groups could not be determined.
The retention time of the metabolite M39 was 6.79 min, which showed [M−H] ions at m/z 343.1177 (C19H19O6, 3.57 ppm). Similar to the inferred process of M18, after determining the substituents of two benzene rings, it could be concluded that M39 was the product of the loss of 2CH2 by phillygenin.
Metabolites M38 and M54 eluted at 6.79 min and 8.44 min, respectively, which showed the same [M + H]+ ion at m/z 355.1541 (C21H23O5, error ≤ ±1.50 ppm). The DPI at m/z 137, 151 indicated the presence of a bismethoxyphenyl group, while m/z 189 was 112 Da higher than m/z 77 (phenyl), inferring the formation of a double bond in the two five-membered rings. Furthermore, m/z 284 was 30 Da higher than m/z 254, and it was inferred that both were dehydroxyl and dehydrogenated products of phillygenin, except that the position of the double bond was different.
Metabolite M43 was eluted at 6.93 min and showed [M + H]+ ion at m/z 371.1489 (C21H23O6, −1.12 ppm). It was 2 Da lower than the [M + H]+ ion of phillygenin, and combined with DPIs in the ESI-MS2 spectrum, M43 could be identified as the dehydrogenation product of phillygenin.
The metabolite M57 was extracted in HREIC at m/z 373.1281 (C20H21O7, −0.99 ppm) in positive ion mode with a retention time of 9.65 min. The DPI at m/z 373 was 30 Da higher than m/z 343, which meant the loss of CH2O, while m/z 343 was 18 Da higher than m/z 325, indicating that water molecule had been removed. Later, due to the loss of CO NLF, the generation of DPI at m/z 297 was caused. According to this, it could be inferred that after the dehydrogenation and demethylation reaction of the phillygenin, M57 was formed by oxidation. Still, the position of the hydroxyl group could not be determined.
At 6.92 min, the metabolite M42 was eluted, and the [M−H]− ion was shown at m/z 451.1057 (C21H23O9S, 3.42 ppm). The DPI at m/z 371 in the ESI-MS2 spectrum can be inferred as phillygenin, and m/z 451 was 80 Da higher than m/z 371, which was a great help for us to conclude that M42 is a sulfonated product of phillygenin [30].
The retention time of M33 was 6.59 min, which showed [M + H]+ ion at m/z 492.1687 (C24H30O8NS, −1.35 ppm). The DPI at m/z 492 was 137 Da higher than m/z 355, and it was presumed that NLF might be a cysteine-oxygen atom (C3H7NO2S + O). The DPI at m/z 88 (C3H6NO2) was more potent in demonstrating this so that M33 might be the product of the binding of the phillygenin to cysteine.
The metabolite M58 eluted at 9.90 min produced [M−H]− ions at m/z 261.1146 (C15H17O4, 4.15 ppm). From the ESI-MS2 spectrum, the NLF from m/z 262 to m/z 235 might be CH=CH2. Similarly, the DPI at m/z 191 was due to the loss of CH=CH from m/z 217. Moreover, the reason why the m/z 235 was 18 Da higher than the m/z 217 was that one molecule of water was lost. It was presumed that the glucosyl moiety had undergone a reduction reaction, leaving two hydroxyl groups remaining on the ring.
The metabolite M40 with a retention time of 6.87 min showed [M + H]+ ion at m/z 317.0657 (C16H13O7, −1.04 ppm). The DPI at m/z 133 was 56 Da (2CO) higher than m/z 77 (phenyl), and m/z 121 was 44 Da (CO + O) higher than m/z 77. It was presumed that the original five-membered ring portion has C=O formation. According to the DPI at m/z 109, it was speculated that 2OH existed in the benzene ring moiety.
The metabolite M47 was eluted at 7.47 min, which had [M−H]− ion at m/z 357.1332 (C20H21O6, 3.94 ppm). In the ESI-MSn spectrum, the DPIs produced by the cleavage of M47, such as m/z 191, 177, 151, 137, and 123, indicated that the two benzene rings underwent substitution reactions with diverse groups (2OCH3, OH + OCH3), respectively. At the same time, the NLFs were 30 Da (CH2O, from m/z 357 to m/z 327) and 54 Da (C4H6, from m/z 177 to m/z 123), it could be inferred that the two benzene rings were mutually connected through a five-membered ring and a double bond. Besides, it was known from the molecular formula that it was difficult to ascertain the substitution site of one OH.
The metabolite M48 was eluted at 7.54 min, and [M + H]+ ion was generated at m/z 331.0181 (C17H15O7, −0.54 ppm). The DPI at m/z 162 was 69 Da (C4H5O) higher than m/z 93 (hydroxybenzene, C6H5O), thereby confirming that the five-membered ring has undergone dehydrogenation to form a double bond. According to the molecular formula, in addition to the presence of 2OH in both benzene rings, the substitution site of 2OH could not be identified.
The metabolites M8 and M11 with retention times of 4.91 min and 5.08 min, respectively, produced the same [M−H]− ion at m/z 331.1177 (C18H19O6, error ≤ ±5.00 ppm). In their ESI-MSn spectra, the NLFs were 30 Da (from m/z 333 to m/z 303) and 28 Da (from m/z 303 to m/z 275); this indicated that there was only one five-membered ring in the molecular structure. Further, product ions such as m/z 109 and m/z 123 stated the substituent groups (2OH, OH + OCH3) on the two benzene rings.
The metabolite M10 was eluted at 4.96 min and showed [M−H]− ion at m/z 333.1327 (C18H21O6, 4.52 ppm). DPIs generated by the continuous neutral loss of CH2O (30 Da) such as m/z 303 and m/z 273 indicated that both five-membered rings were introduced into H2.
Metabolites M3, M4, and M5 eluted at 3.97 min, 4.27 min, and 4.58 min, respectively, showed the same [M−H]− ion at m/z 347.1126 (C18H19O7, error ≤ ±4.50 ppm). In the ESI-MS2 spectrum, the NLF was 30 Da (from m/z 347 to m/z 317), indicating the presence of a five-membered ring in the structural formula. The DPIs which appeared at m/z 163 and 193 further provided a basis for our inference. According to the DPIs such as m/z 109, 123, and 137, the substitution of two benzene rings could be inferred. Since the substitution sites of the two hydroxyl groups were difficult to judge, M3, M4, and M5 were isomers.
The retention times of the metabolites M14 and M16 were 5.54 min and 5.69 min, respectively, both of which produced [M−H]− ion at m/z 409.0588 (C18H17O9S, error ≤ ±3.50 ppm). It was 42 Da lower than M42, presumably owing to the loss of 3CH2. DPIs at m/z 329 and m/z 80 provided strong evidence for this inference. Thus both M14 and M16 were M42 demethylated products, except that the sulfonic acid groups had different binding positions.
Metabolites M27 and M30 were eluted at 6.31 min and 6.54 min and showed [M + H]+ ions at m/z 357.1332 (C20H21O6, −1.19 ppm) and m/z 357.1332 (C20H21O6, −2.90 ppm), respectively. In the ESI-MS2 spectrum of both, the NLF was 112 Da (C6H8O2, from m/z 189 to m/z 77), so a double bond was introduced in the five-membered ring. However, DPI was generated at m/z 137 in the ESI-MSn spectrum of M30, and it was inferred that the bismethoxyphenyl group was present, and M27 did not have this group, so M27 (ClogP, 1.14) and M30 (ClogP, 1.17) were isomers.
The metabolite M34 with a retention time of 6.65 min showed [M−H]− ions at m/z 359.1490 (C20H23O6, 3.16 ppm). The DPI at m/z 344 in the ESI-MS2 spectrum generated m/z 284 by loss of 2CH2O, and then the DPIs such as m/z 192 and m/z 178 could assume that hydrogenation (+2H2) occurred in both five-membered rings and there was one molecular double bond. Further, based on the DPI at m/z 122, the functional groups on the two benzene rings could be deduced.
The retention times of the metabolites M13 and M19 were 5.30 min and 5.93 min, respectively, and the same [M−H]− ion was produced at m/z 375.1074 (C19H19O8, error ≤ ± 4.00 ppm). In their ESI-MS2 spectra, the NLFs were 28 Da (CO, from m/z 375 to m/z 347) and 60 Da (2CH2O, from m/z 347 to m/z 287), respectively. From this, it was presumed that there was a five-membered ring with a double bond and a dimethoxyphenyl group in the structural formula. Combined with the DPIs such as m/z 137, 109, and the degree of unsaturation, the structural formula could be inferred. However, the positions of the three hydroxyl groups were still unidentifiable, and M13 and M19 were positional isomers.
The metabolite M6 possessed [M + H]+ ion at m/z 393.1543 (C20H25O8, −8.33 ppm) with a retention time of 4.75 min. In the ESI-MS2 spectrum of M6, the DPI at m/z 393 was 93 Da higher than m/z 300, presumably due to the neutral loss of hydroxybenzene. At the same time, the DPI at m/z 300 was 60 Da higher than m/z 240 and 28 Da higher than m/z 272, and it was presumed that NLFs were 2CH2O (60 Da) and 2CH2 (28 Da), respectively. Based on the information obtained, we could conclude that the structural formula of M6 contained hydroxybenzene and dimethoxyphenyl groups, and two of the five-membered rings were hydrogenated to open the ring. The DPIs at m/z 216, 246, and 361 further confirmed the above judgment, and the substitution sites of three other hydroxyl groups could not be determined.
M24, M28, and M41 showed the same [M−H]− ion at m/z 437.0901 (C20H21O9S, error ≤ ±3.00 ppm), and they were eluted at 6.14 min, 6.32 min, and 6.88 min, respectively. Their molecular weight was 14 Da lower than that of M42, so they were inferred to be the demethylation products of M42. In the ESI-MS2 spectrum, the three metabolites had the same DPIs, such as m/z 357, 151, 137, and 80, which proved that their benzene rings had identical substituents (2OCH3, 2OH). Because the substituents on the benzene rings and the sulfo (-HSO3) substitution site were different, M24, M28, and M41 were positional isomers [34].
The retention time of the metabolite M25 showing [M−H]− ion at m/z 453.0850 (C20H21O10S, 2.62 ppm) was 6.16 min. Because its molecular weight was 16 Da higher than M41, it can be speculated that M25 is the oxidation product of M41. The DPIs retrieved from M25’s ESI-MSn spectrum, such as m/z 373, 97, and 80, unexpectedly upheld the above guess.
The metabolite M59 was eluted at 26.48 min and showed [M−H]− ion at m/z 339.2035 (C15H31O8, −3.67 ppm). Based on the molecular formula and the calculated degree of unsaturation, M59 was presumed to be a compound in which an alkane containing 15 carbon atoms was replaced by eight hydroxyl groups. Also, the DPIs and NLFs shown in the ESI-MSn spectrum could prove this conclusion.
The metabolite M52 was detected at 8.34 min, which produced [M + H]+ ion at m/z 373.1646 (C21H25O6, −3.47 ppm). In the ESI-MSn spectrum of M52, through various information, it could be undoubtedly identified as the parent compound of phillygenin [31, 35, 36].
The metabolite M23 produced [M−H]− ion at m/z 417.1179 (C21H21O9, 3.67 ppm), and its retention time was 5.98 min. In the ESI-MS2 spectrum, the NLF was 176 Da (from m/z 417 to m/z 241), and the unsaturated glucuronide group was presumed to be lost, and the DPI at m/z 175 confirmed this. The DPIs at m/z 121 and m/z 103 indicated that the benzene ring was substituted by two methyl groups, so the substitution position of one methyl group could not be determined.
The metabolite M7 was eluted at 4.84 min and showed [M + H]+ ion at m/z 377.1953 (C16H25O10, 1.56 ppm). In the ESI-MS2 spectrum, NLF was 134 Da (from m/z 377 to m/z 243), and it was presumed to be a derivative of glucose (C5H10O4). According to the DPIs at m/z 172 and m/z 72, the substitution position of four hydroxyl groups could be inferred, of which three hydroxyl groups reacted with the benzene ring, and the other hydroxyl group was substituted on the alkane.
3.4.3. Identification of Metabolites Based on Enterolactone
The metabolite M50 was eluted at 7.63 min, and the [M + H]+ ion was displayed at m/z 231.1140 (C16H23O, −0.35 ppm). From the DPIs in the ESI-MS2 spectrum of M50, such as m/z 135, 107, and 93, it could be inferred that a hydroxy group was substituted on the benzene ring. Then it could be speculated that another benzene ring had undergone an additional reaction, which itself had been introduced with four hydrogen atoms.
The metabolite M45 produced [M + H]+ ion at m/z 273.0759 (C15H13O5, −0.70 ppm) with a retention time of 7.36 min. In the ESI-MS2 spectrum, the DPIs at m/z 123 and m/z 95 indicated the substitution of two benzene rings (OH + OCH3, OH). At the same time, the NLF was 28 Da (from m/z 151 to m/z 123 and from m/z 255 to m/z 227), and it was speculated that there were two ketone groups between the two benzene rings.
The metabolite M51 was eluted at 7.83 min and showed [M + H]+ ion at m/z 281.1173 (C18H17O3, −2.07 ppm). In the ESI-MSn spectrum, the DPIs at m/z 263, 235, 133, and 107 fully demonstrated the double bond formation in the two 5-membered ring structures. Besides, the NLF was 16 Da (from m/z 149 to m/z 133), indicating that there was still a hydroxyl group whose substitution site was unknown.
The metabolite M37 with a retention time of 6.77 min showed [M + H]+ ion at m/z 285.0759 (C16H13O5, −1.30 ppm). Based on the DPIs located in the ESI-MS2 spectrum, such as m/z 161 and m/z 85, it can be inferred that three consecutive keto groups were present. Meanwhile, the continuous loss of O atom (16 Da) indicated that there are two hydroxyl groups in the other benzene ring.
The metabolite M20, which was eluted at 5.93 min, had [M + H]+ ion at m/z 295.0964 (C18H15O4, −1.51 ppm). According to the continuous neutral loss of CO (28 Da) (from m/z 277 to m/z 249 and from m/z 249 to m/z 221) and the DPI at m/z 186, it could be presumed that there were two double bonds in the two five-membered ring structure. Furthermore, the continuous loss of water molecules (18 Da) further proved the existence of dihydroxyphenyl.
The retention time of the metabolite M1 was 1.61 min, which showed [M + H]+ ion at m/z 298.0962 (C19H12ON3, −2.81 ppm). In the ESI-MS2 spectrum of M1, the NLFs were 26 Da (from m/z 189 to m/z 163 to m/z 137), and it was presumably caused by the continuous loss of CN. What is more, the substitution site where a hydroxyl group should also exist in the structural formula could not be determined. Therefore, there were three cyano groups and one hydroxyl group in the structural formula.
The metabolite M21 was eluted at 5.94 min and showed [M + H]+ ion at m/z 313.1070 (C18H17O5, −1.73 ppm). According to the DPIs in the ESI-MS2 spectrum, such as m/z 277, 249, 123, 161, and 173, could be presumed the substitution of two benzene rings (OH + OCH3, OH) could be presumed and the existence of a five-membered ring and two double bonds between them. Also, according to the molecular formula, it was known that there was still one hydroxyl substitution site that could not be identified.
Metabolite M53 has a retention time of 8.39 min, which had [M + H]+ ion at m/z 315.1227 (C18H19O5, −9.27 ppm). In the ESI-MS2 spectrum of M53, the DPIs at m/z 93, 109, 167, and 189 could fully clarify the substituents (2OH, OH) on the two benzene rings. At the same time, the NLF was 60 Da (from m/z 315 to m/z 255), which was presumed to be due to the continuous loss of CH2O, so there were two five-membered rings in the structural formula.
The metabolite M9 was eluted at 4.95 min, and it showed [M + H]+ ion at m/z 317.1384 (C18H21O5, 0.16 ppm). Analogous to the above, based on the DPIs in the ESI-MS2 spectrum, such as m/z 123, 149, 151, 163, and 175, the substituents (OH + OCH3) on the two benzene rings were presumed. It was inferred from the degree of unsaturation that the linear alkane between the two benzene rings undergone a dehydrogenation reaction to form a double bond. At the same time, it could be known from the molecular formula that another OH substitution site could not be determined.
The metabolites M15, M26, M29, M31, and M46 were eluted at 5.62, 6.21, 6.38, 6.55, and 7.44 min, respectively, and they all showed the same [M + H]+ ion at m/z 341.1384 (C20H21O5, error ≤ ±1.50 ppm). Among them, in the ESI-MS2 spectra of M15, M26, and M29, DPIs at m/z 137 and m/z 187 indicated the presence of dimethoxyphenyl and methoxyphenyl. At the same time, NLFs were 28 Da (from m/z 291 to m/z 263) and 70 Da (from m/z 341 to m/z 271), which proved that there were a five-membered ring and two double bonds between the two benzene ring structures. Additionally, it was difficult to determine the position of one hydroxyl group. Therefore, it was inferred that the three metabolites were isomers because the substitution positions of methoxy and hydroxyl groups were different. However, in the ESI-MSn spectra of M31 and M46, the substituents (OH + OCH3, OCH3) on the two benzene rings could be presumed based on the DPIs at m/z 123, 151, and 161. Meanwhile, the NLFs were 30 Da (CH2O, from m/z 341 to m/z 311) and 28 Da (CO, from m/z 311 to m/z 283), and the existence of two five-membered rings and the formation of a double bond were further deduced. Because the position of the methoxy group was difficult to determine, M31 and M46 were positional isomers.
The retention time of the metabolite M12 was 5.15 min, and it had [M−H]− ion at m/z 345.0969 (C18H17O7, 4.20 ppm). DPIs in the ESI-MS2 spectrum, such as m/z 109 and 137, revealed the structure of two benzene rings (2OH, 2OH). The neutral loss of 60 Da (2CH2O) resulted in the formation of DPI at m/z 285, indicating that the two five-membered rings remained unchanged. Also, there was still no known substitution site for a hydroxyl group, so M12 was a demethylation and oxidation product of phillygenin.
The metabolite M35 was eluted at 6.69 min and produced [M−H]− ion at m/z 377.0689 (C18H17O7S, 3.69 ppm). In the ESI-MS2 spectrum, the NLF was 81 Da (HSO3), which was presumably due to the neutral loss of the sulfo group. At the same time, the DPIs at m/z 107 and m/z 189 revealed the existence of the substituents on the two benzene rings (2OCH3, 2OH) and the presence of two double bonds between the benzene rings.
The metabolite M49, which was eluted at 7.61 min, showed [M−H]− ion at m/z 271.1329 (C14H23O5, 4.68 ppm). In the ESI-MSn spectrum, the NLF was 60 Da (from m/z 271 to m/z 211), and it was inferred that the two five-membered rings undergo hydrogenation (+H4) to open the ring. The DPIs at m/z 151 and m/z 165 proved that the benzene ring had an addition reaction (H2) and was replaced by 2OCH3. Another OH substitution site was challenging to identify.
The metabolite M55 with a retention time of 8.49 min showed [M + H]+ ion at m/z 273.1486 (C17H21O3, 0.80 ppm). The DPIs in the ESI-MS2 spectrum, such as m/z 179, 137, and 95, clarified the structure of the two benzene rings (OH + OCH3, OH). Based on the established molecular formula, it could be inferred that the fabric between two benzene rings was a linear alkane containing four carbon atoms.
3.5. Proposed Metabolic Pathways of Phillyrin
In our study, a total of 60 metabolites (including the prototype compound) were detected and identified in rats that were orally administered phillyrin. The proposed metabolic pathways of forsythiaside are shown in Figure 5. The main biological reactions of phillyrin in rats included the following types, such as hydrogenation, methylation, hydroxylation, glucuronidation, sulfonation, carbonylation, ammoniation, dehydrogenation, demethylation, and ring cleavage and their composite reactions. Also, some extraordinary products had been discovered, such as carbon-nitrogen unsaturated bonds formed in M1 and M44. Moreover, the metabolites M49, M50, and M59, which were generated by hydrogenation or cleavage of benzene, had never been reported before.

4. Conclusions
After oral administration of phillyrin to rats, metabolites in plasma, urine, and feces of rats were studied comprehensively. Its innovation lay in the use of a new comprehensive strategy to screen and identify 60 in vivo metabolites of phillyrin. Firstly, the combination of UHPLC and Q-Exactive MS overcame many of the shortcomings of traditional triple quadrupole mass spectrometry, and the extremely high resolution could significantly eliminate the interference of the sample matrix. Secondly, the formation of the data set mainly depended on a variety of data mining methods, such as high extracted ion chromatogram (HEIC), diagnostic product ion (DPI), neutral loss filtering (NLF), and isotope pattern filtering (IPF). Finally, accurate qualitative identification of compounds could be achieved based on the precise relative molecular mass of each chromatographic peak in the mass spectrum, the ion fragment information of the secondary mass spectrum, the fragmentation rule of the mass spectrum, and the chromatographic retention time. The results showed that the primary biological reactions of phillyrin in vivo included methylation, hydrogenation, sulfonation, glucuronidation, demethylation, dehydrogenation, ring cleavage, and their composite reactions. Among them, the biological activity of some specific metabolites was unknown. In summary, this study provided vital information to the research field of phillyrin metabolism and had important value significance for studying the mechanism of action of phillyrin and monitoring of drug content.
Data Availability
The data used to support the finding of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors have declared no conflicts of interest.
Acknowledgments
This work was financially supported by Chinese Pharmacopoeia Commission (no. 2018Y007).