Investigation the spatial distribution of polyphenols and metabolomic profiling of apple

W


Introduction
Apple (Malus domestica L.) is a widely cultivated and consumed fruit throughout the world.The known apple varieties are more than 7,500 and only 10% of them are intendedly bred and cultivated to develop a series of desired characteristics, including Fuji (FS), Golden Delicious, Gala (GL), Granny Smith, Jonagold, and so forth [1].Different apple cultivars vary in taste and appearance owing to their distinctively different chemical compositions.Polyphenols are responsible for the unique astringency flavor, susceptive to fruit browning, and also antioxidant activity.
Apple polyphenols can be divided into different structural subclasses, comprising four major groups, including dihydrochalcones (e.g., phloridzin), flavan-3-ols (e.g., catechin, epicatechin, and procyanidins), hydroxycinnamic acids (e.g., chlorogenic acid), and flavonols (e.g., quercetin glycosides) [2].And the composition and proportion of each phenolic group varied in different apple varieties [3].Given the molecular structure of free hydroxyl groups on the extended and conjugated aromatic system, apple polyphenols are of high potential for unpaired electron delocalization and metal chelating.Thus, apple consumption is related to bioactivity on prevention and treatment of several chronic disorders, like cancer, hypertension, hyperlipidemia, etc. [4].In this respect, apple polyphenols have become an important indicator for antioxidant and nutritional properties evaluation as proposed in several literatures [5][6][7].Accompanied by versatile and valuable chemometric techniques [2,8,9], apple polyphenols would be a feasible monitoring target molecules as markers to establish both the quality and origin of apple and apple products, which is crucial and a major concern in order to maintain and improve the standard of consumers life [10].
Apple polyphenol compositions are ultimately depended on the metabolic makeup of an apple fruit.Polyphenols are the main secondary metabolites in apple fruits for self-protection and adversity resistance [11].They are primarily derived from phenylpropanoid and flavonoid pathways [12], in which a series of enzymatic reactions are involved.Specifically, phosphoenolpyruvic acid, the intermediates metabolite of primary metabolism, is catalyzed into L-phenylalanine by dehydratase, phosphate synthase, kinase, mutase, and aminotransferase successively.And L-phenylalanine is then subjected to catalyze reaction of phenylalanine ammonia lyase, chalcone synthase, flavanone 3-hydroxylase, dihydroflavonol 4-reductase, anthocyanidin synthase, and uridine diphosphate (UDP)-glucose flavonoid 3-O-glucosyl transferase to form different kinds of phenolic compounds.Nonetheless, apple fruit metabolism is also notably affected by external factors, like light, temperature, rainfall, cultivation technique etc.Therefore, the metabolism pattern varied in different parts of apple fruit (peel, pulp, and core), as well as in different apple varieties, leading to uneven space distribution of polyphenols in both concentration and composition.So far, several researchers have investigated the differentiation and classification of apple fruit according to the variety and/or geographical origin by using different chemometric techniques [2].Focusing on untargeted metabolic profiles related to polyphenols, liquid chromatography quadrupole-time of flight mass spectrometry (LC-Q-TOF-MS), 1 H NMR has been applied to discriminate their difference on dried apple [8], edible part of mongo [13], sweet potato [9], coffee leaves [10] and tea [14].Moreover, LC-Q-TOF-MS has been demonstrated to be an advanced platform for the quantification and identification of characteristic components.
However, few work has been conducted to systematically illustrate the polyphenol spatial distribution in apple fruit among different varieties [15][16][17].Currently, there is a lack of research on whether the polyphenolic profile of specific parts of the apple fruit can serve as an indicator for distinguishing between apple varieties.This study was aim to characterize and categorize apple fruit from perspective of spatial distribution of polyphenol profile in apple fruit.Seven apple varieties, including FS, Qinguan (QG), Qingping (QP), Jinshuai (JS), GL, Changmiou (CMO), and Huahong (HH) cultivars, were investigated for their polyphenol profile in fruit core, pulp, and peel.The metabolome spatial distribution in mature apple fruits were trying to be disclosed for character and category prediction by utilization of UHPLC-Q-TOF-MS based metabolomics approach.

Apple samples
Apples of 7 varieties were collected in October 2017, which were provided by the national apple breeding research base of China in Xingcheng, Liaoning Province (N40°33' E120°45').The selected seven varieties include FS, QG, QP, JS, CMO, GL, and HH.The apples were packed in plastic atmosphere bags in shock proof cartons, and stored at 4°C, and 85% relative humidity refrigerator until use.For each apple variety, 5 fruits were segmented into three parts of core, pulp, and peel (seed was not considered in present study), which were instantly freezed in liquid nitrogen, and subsequently smashed into powder.The core, pulp, and peel freeze powder were then stored at -80°C for further analysis.

Sample preparation for metabolomic analysis
About 10 g frozen powder of the core, pulp, and peel of each variety of apple was taken, mixed with 1.0 mL 2% NaCl solution and 10 mL 80% methanol solution.The samples were extracted by using ultrasonic under 45 kHz, < 30°C, for 20 min.(Ultrasonic machine, SB 25-12 DTN, Scientz Biotechnology Co.,Ltd, Ningbo, China) .The supernatant was centrifuged (10,000 rpm, 4°C, 10 min) by using centrifuge (5804R, Eppendorf, Hamburg, Germany) before being collected.The residue part was then added 50 µL of 37% HCl and 10 mL of 80% methanol for ultrasonic-assisted extraction and collected the supernatant after centrifugation.The residue was further added 100 µL 5 mol/L NaOH, and 10 mL of 80% methanol for ultrasonic-assisted extraction, and collect the supernatant after centrifugation.Combined the neutral, acidic, and alkaline supernatant extracts, and dried by nitrogen blow at the temperature of less than 30°C.The dry matter was then dissolved in 5 mL methanol and stored at -80°C for further analysis.

Total polyphenol content determination
The Folin-Ciocalteu method was used for total polyphenol concentration determination, based on the method established previously [18] with some modifications.Gallic acid was used as standard and ultrapure water was used as blank.Briefly, 1.0 mL of diluted apple extracts (apple extracts were diluted by distilled water for 100-fold) was mixed with 0.5 mL of Folin-Ciocalteu reagent before adding 3 mL of sodium carbonate solution (10%, weight/volume).Water was added to bring the volume to 10 mL.The absorbance was read at 760 nm after 30 min reaction in dark place.Results were expressed as mg gallic acid equivalent (GAE) per gram sample.Each sample was tested in triplicate.

HPLC-ESI-Q-TOF-MS analysis
Analysis was performed on an ultra-performance liquid chromatography (ACQUITY UPLC ® , Waters Corp., Milford, MA) equipped with an Acquity UPLC ® BEH column C18 1.7 μm 2.1 mm × 100 mm, Waters Corp.) connecting to a quadrupole time of flight mass spectrometer (Xevo ® G2-S QTof, Waters Corp.).The detailed protocol was established in the reference [19].The mobile solutions were water with 0.04% acetic acid (A) and methanol containing 0.04% acetic acid (B).The gradient program was set as follows, from 5-95% B, 11 min; 95% B, 1 min; 95-5% B, 0.1 min; 95% B, 2.9 min.The column was kept at 40°C and the flow rate was 0.3 mL/min.Mass spectra were acquired using electrospray ionization under positive mode over the range of m/z 100-1,700.The desolvation gas temperature was 550°C, the cone gas flow was 50 L/h, and the desolation gas flow was 1,000 L/h.The accurate mass and composition of the precursor were calculated and sequenced using the MassLynx ™ software (Waters Corp., Milford, MA).The stability of the method was tested by performing 6 repeated injections of solutions prepared from authentic reagents catechins and chlorogenic acid every 2 h.The relative standard deviation of the retention times was below 2% and the mass error was below 5 ppm.Quality control (QC) samples were prepared by mixing all the sample extracts, then divided into four QC samples and analyzed using the same method as the experimental samples.The QC samples were run every six samples throughout the analytical run to make sure the repeatability could be assessed.

Data processing and statistical
Chromatographic data acquired from the HPLC coupled with electrospray ionization quadrupole time-offlight tandem mass spectrometry (HPLC-ESI-Q-TOF-MS) (waters Xevo ® G2) were directly imported to Progenesis QI V2.0 data processing platform, which adopts an intuitive workflow approach to perform data analysis.Automatic data processing was available in the form of licensing individual data files, running and reviewing alignment before experiment design, in which two ways of grouping (group by apple varieties and group by peel, pulp, and core of apple fruit) were used.Furthermore, parameter was set for compounds ions peak picking, and data normalization method was established to review normalization.Then we review and edit the various adduct and forms of a compound, and identify compound using progenesis MetaSCOPE ® , combined with database of NCBI, PUBMED filtered by conditions of ANOVA P value ≤ 0.05, variable importance in the projection (VIP) > 1, max fold change ≥ 20.The possible compounds were further analyzed by their identity and expression between conditions.Multivariate statistical analysis was conducted using principal component analysis (PCA), partial least-squares discriminant analysis (PLS-DA), and orthogonal partial least squares discriminant analysis (OPLS-DA) performed on EZinfo 3.0 (Umetrics ® , Umea, Sweden).Regression model was constructed by PLS to examine the relation between metabolite profiling and the score of evaluation.
Chromatographic data acquired from the LC-Q-TOF-MS (Waters Xevo ® G2) were imported to Waters UNIFI ® for metabolites identification and binary comparison.The analysis was based on the natural products database, which was established by collecting the polyphenols and related metabolites information from the recently published literatures [10,14,20,21].The analysis was conducted by matching the imported data file and the data processing method, in which blank, standard, QC, and experimental tests were labeled and set for three replicates.The characteristic components screening and difference constitutes comparison were processed automatically.

Morphological difference between the seven apple cultivars
Apple (Malus domestica) is the mostly consumed, cultivated, and investigated fruit, which is distinct in both the morphology and flavor among different apple varieties.Even though the seven apple varieties were cultivated in the same research base with the same cultivation approach, harvested time, and exactly the same storage method, the appearances of the same maturity of apple fruit were remarkably different from those in different apple varieties (Figure 1A).It has been reported that different apple varieties with distinctive morphology, texture, taste, flesh color, and also their polyphenol content and composition [18].

Identification and quantification of significant phenolic components
Polyphenols are the main secondary metabolites in apple fruit, the content varied in different apple cultivars and in different part of the apple.In present study, there was a large variation in total polyphenol content across the peel, pulp, and core of the seven apple cultivars (Figure 1B), ranging from 368.6 μg/g to 684.0 μg/g fresh weight (FW) in core; from 202.5 μg/g to 423.5 μg/g FW in pulp; and from 401.6 μg/g to 952.9 μg/g FW in apple peel.These results are in line with the total polyphenol content (363.9-2,516.9μg/ g FW) of peel in 145 apple cultivars for two years [22].The difference was proposed to be induced by different sampling strategies, quantification approaches, as well as climate variation, agricultural practices, and location.As comparable to the previous research that provided the polyphenol level in the pulp of 66 apple varieties from the UK [23], 58 apple varieties from China [2], and 20 apples from Jena German [7].Moreover, total polyphenol level was a significant difference among different cultivars.Total polyphenol of apple peel was found in descending order in GL, HH, QP, CMO, JS, FS, and QG, while total polyphenol level of pulp was in the descending order of GL, HH, JS, CMO, QP, FS, QG; and total polyphenol level of core was in descending order of GL, JS, QP, CMO, QG, HH, FS.
In the present work, the distribution of four groups of polyphenols, including phenolic acid, flavanols, dihydrochalcones, and flavonols in apple peel, pulp, and core were separately identified and quantified by using high-performance liquid chromatography-diode array detection (HPLC-DAD).Phenolic acid, including gallic acid, p-coumaric acid, protocatechuic acid, caffeic acid, ferulic acid, neochlorogenic acid, and chlorogenic acid were listed in Table 1.Flavanols including catechin, and epicatechin; two dihydrochalcones including phloretin and phloretin-2'-O-glucuronide, and four flavonols including quercetin, quercetin-3-O-rhamnoside, quercetin-3-O-galactoside, quercetin-3-O-rutinoside level were showed in Table 2.These results are in good agreement with the research that chlorogenic acids were then the main phenolic acids in different apple varieties [3,8,24].

Metabolomics profiling in three parts of apple fruit
Polyphenol composition difference among different apple cultivars is one of the major characteristics for distinguishing varieties, as discussed in previous literature [25].Given that other components of apple Metabolic profiles differences in peel, pulp, and core of 7 apple varieties were evaluated by multivariate statistics analysis.PCA is a widely used chemometric tool for extracting and rationalizing information from any multivariate description of a biological system, and it is used to reveal the internal structure of several variables by a few principal components [26].In the PCA plot (Figure 2) the QC samples grouped together, revealing that the QC samples had similar metabolic profiles and the entire analysis was stable and repeatable.PCA analysis indicated that there are 9 components that explained 80% of the total variance upon 10,259 variables.The samples of peel, pulp, and peel part of apple fruit grouped in 3 separate areas in the plot, among which the metabolic profiles of pulp and core were partially overlapped, indicating that peel had notably different metabolic profile from that of pulp and core, while the metabolic profile of the latter two shared some similarities.
Partial least-square analysis (PLS-DA) was used to generate pairwise comparisons of the metabolite contributions [27].In present work, 7 components established for metabolic profile of peel, pulp, and core were compared to identify the metabolites that were responsible for the observed difference.As much as 88% of predictability (Q 2 ) and strong goodness of fit (96%) on R 2 Y were observed in the PLS-DA models.Thereinto metabolic profiles of peel, pulp, and core were well separated from each other in three quadrants of the rectangular coordinates.Score plots of the first t(1) and second t(2) predictive components are illustrated.On the score plot, the percentage of total variation explained by the component is indicated in parentheses.The colored ellipses include 95% of the multivariate normal distribution of the peel, core, and pulp samples respectively.On the loading plots of PCA, the variables with extreme values in each direction of principal component p(1) and p(2) were focused.On the plot of coefficients versus VIP, the variables with VIP value higher than 10 were focused.On the scores plot scatter (S)-plot, the predictive of P value (loadings) in the range of |P| > 0.1 were focused The OPLS-DA extends a regression of the PCA and uses the class membership to maximize the variation and introduce an orthogonal signal correction filter to separately handle the systematic variation correlated to or uncorrelated to the Y variable.Therefore, it has a better discriminant ability for the samples with larger within-class divergence than PCA, as has been illustrated by Bylesjö [28].In the present study, 5 components were selected for samples from three parts of an apple that were clearly separated in the OPLS-DA score plot.The variance explained R 2 Y indicating 98% of the variation in the dataset could be modeled by the selected component.The variance prediction Q 2 reached 96%, indicating a high goodness of fit.

Metabolomics profiling in 7 different apple varieties
The metabolomics profile of 7 apple varieties (FS, QG, QP, JS, CMO, HH, GL) were comparatively analyzed (Figure 3).In PCA plot, with 80% variance explanation upon 10,259 variables, 9 components were found.Based on the predictive component 1, t(1) and 2, t(2), HH, GL, and QP were distinctively separated from FS and QG.Besides, CMO and JS showed the most overlap.In PLS-DA modeling, 2 predictive components were established and only 17% variance explanation and 6% variance prediction were found.While 7 apple varieties were grouped into 3 parts on predictive component 1.It suggests the different metabolomic profiles of peel, pulp, and core of GL were different from that of HH and QP, and also that of the rest apple varieties.In respect of OPLS-DA model, there were 5 components, explaining 98% variance and possessing 96% variance prediction between the two groups (one with GL, HH, QP, and the other including FS, QG, JS, CMO), indicating the metabolomics profile difference and relationship among the 7 apple varieties.

Study of key metabolite in peel, pulp, and core of apple
The loading plot (Figure 3) indicated the variables responsible for the difference of the metabolomic profiles of peel, pulp, and core of apple fruit, which contained 27 compounds.The variables' importance distribution along the PLS regression coefficient showed in the plot of coefficients versus VIP, among which 11 variables that had VIP > 10, coefficient > 0.00005 were investigated.The important variables distribution along the OPLS regression coefficients were displayed in S-plot, in which 9 variables were located at the extreme value in two direction of predictive component 1 (Table 3).
The significantly different phenolic components between core and pulp were hyperin, there is more hyperin found in core as compared to the pulp.While the melilotoside, isorhamnetin-3-O-rutinoside, flavonoid glycoside, and rutin were significant different phenols found between core and peel, among which melilotoside and flavonoid glycoside were higher in core than that in peel, isorhamnetin-3-Orutinoside and rutin were lower in core than that in peel.quercetin-3-O-rutinoside and naringenin-7-Oglucoside were the phenolic compound found different in peel and pulp, in which Quercetin-3-O-rutinoside were more in pulp and naringenin-7-O-glucoside were in peel.
In respect of metabolomics profile difference attributed to apple varieties, the variables located in the area related to the metabolomics profile difference among seven varietals of apple slices on the basis of the loading plot.While the variables' importance distribution along PLS regression coefficient of CMO, HH, and JS showed that 5 variables had VIP > 1 coefficient > 0.00005.There were 21 important variables distribution along the OPLS regression coefficients as shown in S-plot (Table 4).
The remarkable phenolic compound to differentiate GL and HH was 4-O-caffeoylquinic acid, in which GL possessed higher this specific phenol.Melilotoside, Chlorogenic acid, kaempferol, isorhamnetin were the significant phenols to differentiate the GL and QP, and only kaempferol was found higher in QP than in GL. 4-O-p-coumaroylquinic acid, chlorogenic acid, neochlorogenic acid showed different in GL and FS, in which neochlorogenic acid was higher in FS.Similarly, chlorogenic acid, neochlorogenic acid, 4-O-caffeoylquinic acid, and flavonoid glycoside were significant different in GL and QG, and only flavonoid glycoside was remarkably higher in QG than GL.Moreover, isorhamnetin was the significant phenolic compounds to distinguish GL and CMO.Whereas polyphenol could not necessarily differentiate all the apple varieties, which is closely dependent on the apple varieties.

Discussion
Polyphenols are important characteristic components in apple fruit [29,30].In this study, different phenolic compounds in apple peel, pulp, and core of seven varieties were qualified and quantified.Chlorogenic acid was the most abundant component (21.49-389.68 μg/g FW) in core section of QG, JS, CMO, QP, FS, GL, and HH apple varieties, accounting for 33.94%, 23.89%, 45.46%, 24.26%, 24.14%, 16.84% and 3.08% of total phenolic content respectively.Chlorogenic acid content in pulp was in the range of 25.37-215.06μg/g FW, accounting for 14.86-75.47% of total polyphenol content.While in peel section, chlorogenic acid was fall into the scope of 16.80-234.57μg/g FW, and accounting for 4.51-50.08% of total polyphenols.Besides, neochlorogenic acid, caffeic acid, gallic acid, protocatechuic acid, p-coumaric acid, ferulic acid occupied 0.63-5.00%,0.23-10.97%,0.03-1.52%,0.28-1.83%,0-0.93%, 0.08-1.38% of total polyphenol content in respective part of apple.There are significant varietal differences of phenolic acid (P < 0.05) observed in both peel, pulp, and core.And the levels of phenolic acids were generally higher in the pulp than that in apple peel and core (in fresh weight) of 7 apple varieties.
The metabolic profile variation within peel, pulp, and core were remarkably decreased, while the phenolic dispersion was in a descending order in core, pulp, and peel of apple.It indicates that the core is more likely to determine the difference of apple variety.This result could help instruct the apple variety distinguish by precisely located the sampling position in apple fruit.Moreover, the results revealed that metabolic profile in different part of apple fruit is different, among which polyphenolic metabolites made significant contributions.However, polyphenol is the important secondary metabolites that responding to the external stimuli, such as cold, hot, physical injury, insect, chemicals, etc.Therefore, specific polyphenol accumulation will vary according to several factors including the planting area, climate, soil agronomic method.Polyphenol profile is relatively stable, and just like the finger print for different apple varieties, especially the polyphenol profile in core part of the apple fruit, which could be the superior target for apple varieties distinguish.
Present study investigated the polyphenols spatial distribution in apple fruit, within which the correlation of the polyphenol distribution difference and the apple varieties character difference were found.The characteristic components of apple peel, pulp, and core were extracted and compared across different apple varieties.The result concluded that the phloretin were the dominant component in core, chlorogenic acid was the main phenolic compounds in pulp, and quercetin was mainly found in apple peel.The less varied of polyphenol in core of the apple is likely to be the main factor that determines the difference in apple variety.The work would like to provide an insight to distinguish the apple by the characteristic biomarker distributed in different part of fruit.In future research, it is crucial to develop more precise techniques for identifying biomarkers and establish more comprehensive and structured methodologies.

Figure 1 .
Figure 1.(A) The morphology and cross-section color of 7 apple varieties; (B) total polyphenol distribution in core, pulp, and peel of apple fruit.Data are presented as mean ± SD of three independent experiments, different letters upon error bars indicate significant difference.P < 0.05

Figure 2 .
Figure 2. Multivariate modeling of variations using PCA, PLS-DA, and OPLS-DA of peel, pulp, and core of the 7 apple varieties.Score plots of the first t(1) and second t(2) predictive components are illustrated.On the score plot, the percentage of total variation explained by the component is indicated in parentheses.The colored ellipses include 95% of the multivariate normal distribution of the peel, core, and pulp samples respectively.On the loading plots of PCA, the variables with extreme values in each direction of principal component p(1) and p(2) were focused.On the plot of coefficients versus VIP, the variables with VIP value higher than 10 were focused.On the scores plot scatter (S)-plot, the predictive of P value (loadings) in the range of |P| > 0.1 were focused

Figure 3 .
Figure 3. Diagnostic of multivariate modeling (PLS-DA).The diagnostic plot shows the distances within and orthogonal to the selected score plan for PLS modeling AbbreviationsCMO: Changmiou FS: Fuji FW: fresh weight GL: Gala HH: Huahong HPLC: high-performance liquid chromatography JS: Jinshuai LC-Q-TOF-MS: liquid chromatography quadrupole-time of flight mass spectrometry OPLS-DA: orthogonal partial least squares discriminant analysis PCA: principal component analysis PLS-DA: partial least-squares discriminant analysis QC: quality control QG: Qinguan QP: Qingping

Table 1 .
Flavanol, dihydrochalcone, and flavonol composition and distribution in core, pulp, and peel of apple fruit (ug/g) Data are presented as mean ± SD of three independent experiments, different letters follow indicate significant differences (P < 0.05).LD: lower than detection limit

Table 2 .
Flavanol, dihydrochalcone, and flavonol composition and distribution in core, pulp, and peel of apple fruit (ug/g)

Table 4 .
Significant biomarkers for comparing classifiers between seven apple varieties

Table 4 .
Significant biomarkers for comparing classifiers between seven apple varieties (continued)

Table 4 .
Significant biomarkers for comparing classifiers between seven apple varieties (continued)