Comprehensive two-dimensional gas chromatography–mass spectrometry combined with multivariate data analysis for pattern recognition in Ecuadorian spirits

The current methodology used in quality control of Ecuadorian beverages such as Pájaro azúl, Puro and Pata de vaca is carried out by using conventional gas chromatography; however, it does not allow the fingerprinting of these Ecuadorian spirit beverages and their possible cases of adulteration. In order to overcome this drawback, comprehensive two-dimensional gas chromatography–mass spectrometry (GC × GC–MS) was combined with multivariate data analysis, revealing that compounds like citronellal, citronellol, geraniol, methyl anthranilate, (−)-trans-α-bergamotene, (−)-cis-α-bergamotene and d-limonene can be considered key elements for pattern recognition of these traditional beverages and product adulteration cases. Thus, the two-dimensional chromatographic fingerprints obtained by GC × GC–MS coupled with chemometric analysis, using Principal Component Analysis and Fisher-ratio can be considered as a potential strategy for adulteration recognition, and it may used as a quality assurance system for Ecuadorian traditional spirits.


Introduction
Ancestral and typical liquors have always been an important part of the culture in Ecuador. Beverages such as Pájaro Azúl, Puro and Pata de Vaca are prepared nationwide, and the recipes of these artisanal spirits have remained throughout the centuries. These beverages are distilled liqueurs obtained directly from the raw juice of unrefined sugar cane, whose production process begins with the extraction of the cane juice followed by its fermentation during 96 h at 26 °C approximately [1]. Then, the fermented juice goes through a second distillation, which results in the Puro beverage that contains 70% of alcohol approximately. Afterwards, fruits, herbs and/or animal legs may be added to the Puro, and a third distillation is performed to obtain other beverage variants. For example, chicken legs and some specific herbs are added to create Pájaro azul, while beef legs and other fruits and herbs are used for making of Pata de vaca, in agreement with local references available in databases from Ministerio de Industrias y Productividad (MIPRO) and Ministerio de Agricultura, Ganadería, Acuacultura y Pesca (MAGAP). Therefore, the sensorial characteristics of each beverage are unique and strongly dependent on the raw materials used throughout the whole production process [2,3].
Ecuador produces about 36.500 L of liquor per day, and most of the artisanal production of liquors occur in the province of Bolivar where there are approximately 600 associated producers. From 30 to 40% of these producers also work independently, and almost 900 families obtain their income from these spirits beverage

Open Access
Chemistry Central Journal *Correspondence: gaby867@gmail.com 1 Ikiam-Universidad Regional Amazónica, Km 7 Via Muyuna, Tena, Napo, Ecuador Full list of author information is available at the end of the article commerce. Currently, the determination of the quality of beverages, as well as the concentration limits for congeners and some toxic compounds are determined by the Ecuadorian norm INEN 2014 [4]. The analyses are performed by conventional gas chromatography, which searches for the presence of some target compounds only such as acetaldehyde, methanol, isopropanol, n-propanol, ethyl acetate, iso-butanol, n-butanol, isoamyl alcohol, n-amylic and furfural [4]. Different from some well-known spirits samples (tequila, whisky, rum, cachaça, among others) a great variety of other compounds that are strongly related to the organoleptic and aromatic properties of the Ecuadorian spirit beverages has never been investigated so far. This investigation can be achieved by extending gas chromatographic analysis to the identification of other volatile organic compounds like esters, terpenes, aldehydes and higher alcohols [1,3,5]. In order to avoid counterfeiting (or counterfeits), it is very important to identify the most relevant compounds closely associated to the origin of the spirits beverages in an attempt to discover possible adulterations, because some non-associated producers may perform adulteration of the beverages due to commercial reasons. For instance, authentic Pájaro Azul and Pata de vaca beverages present a blue and pale yellow color respectively, but the addition of colorants or other products with similar colors can alter the resulting beverage. Therefore, an analytical method that is capable of providing a complete characterization of these Ecuadorian spirits, as well as distinguishing among these original varieties from counterfeits has become necessary.
Headspace solid-phase microextraction (HS-SPME) has being recognized as a successful sample preparation procedure for analysis of volatile compounds using gas chromatography, mainly because of its advantages such as experimental simplicity and the absence of solvent, thus having several applications in the quality control in the food industry. To illustrate this, HS-SPME coupled with GC has been successfully used to determine relevant volatile aromas for the quality of cachaça, beers, wines, tequilas and rums [5][6][7][8][9][10].
However, due to the complex variety of the volatiles presented in these beverages, coelutions are generally observed in their corresponding chromatograms. Consequently, a comprehensive two-dimensional gas chromatography-mass spectrometry (GC × GC-MS) is a powerful tool for overcoming this drawback, providing higher detectability as well as higher chromatographic separation efficiency.
In GC × GC, two capillary columns containing preferably orthogonal separation capabilities are connected through a modulator, which concentrates the eluate coming from the end of the first column (1D) and then reinjects this eluate in a narrower band into the head of the second column (2D) [11][12][13][14].
The great advantage of using HS-SPME along with GC × GC-MS is the possibility of an enhanced characterization of the volatile compounds contained in the samples, but the high amount of these extracted compounds that are chromatographed can make the visual discrimination among several samples an extremely difficult task. For example, Cardeal et al. [5], identified the compounds that are formed during the production of cachaça when analyzing several fractions of this distillate using HS-SPME and GC × GC-TOFMS. However, the authors affirmed that the discrimination between the fractions and the identification of their most relevant variables demanded too much time, and that the type of wood or time of fermentation could not be identified.
Chemometric analysis, mainly principal components analysis (PCA) has proved a powerful technique for the extraction of patterns from large multivariate datasets such as chromatographic data, allowing the identification of chemically similar samples as well as the most relevant variables that are responsible for these clusterings. PCA systematically decomposes the data matrix into eigenvectors and eigenvalues that describe the different sources of variation, according to their respective percentages of the total variance occurring in the multivariate data. Thus, when PCA is applied to entire chromatographic datasets, all the peaks that best explain the variability of the samples can be identified and analyzed more comprehensively, contrary to the univariate data analysis approach in which each peak has to be evaluated separately, restricting, thus, data analysis to only some conventional compounds [15]. However, the high complexity of the data coming from the large amount of variables, commonly occurring in chromatographic analysis of biological samples, can make the interpretation of all the compounds responsible for the patterns from PCA a non-trivial task. Therefore, variable selection strategies aiming to reduce the data complexity toward preserving only the most relevant variables that discriminate between the groups of samples are of paramount importance, and the use of the multivariate Fisher-ratio approach may achieve this goal. Herein, successive one-way Anova is performed in each variable of the data while discriminating samples between their corresponding classes, and non-important variables disturbing the discrimination of the samples can be excluded from the data [16][17][18]. The data dimensionality reduction to only the most relevant variables clarifies the interpretation of the role of each compound in the sample, and it can be obtained by PCA [3,10]. Orujo samples were characterized according to the geographical origin of the grapes and the distillation system used for the elaboration of the spirits through the GC-HS-SPME profiles and PCA [7]. HS-SPME was combined with GC × GC-TOFMS in order to characterize Bianco and Giallo Moscatel sparkling wines, using Fisher-ratio and multiway PCA, observing the clear difference between the types of wine due to the higher concentration of terpenes and norisoprenoids in the Giallo type [3]. In another similar study, compounds like 2,3-butanediol, 4-carene, 3-penten-2-one, diethyl succinate, β-santalol, diethyl malonate, dihydro-2(3H)-thiophenone, tetrahydro-2(2H)-pyranone, C9 alcohols, 3-methyl-2(5H)-furanone, ethyl 9-decenoate and nerol, were found in such wines such as Cabernet Sauvignon, Merlot, Chardonnay and Sauvignon Blanc as potential markers of grape variety [10].
Taking into consideration all the aforementioned, the main goal of this study was to develop a reliable analytical method based on HS-SPME GC × GC-QMS aiming for the complete characterization of the Ecuadorian spirits beverages Pájaro azul, Pata de vaca and Puro. In order to achieve this goal, chromatographic data was combined with Fisher-ratio to identify the most relevant and distinguishing compounds among these types of beverages, and (multiway) PCA was used to determine straightforwardly the relations among the profiles of these relevant compounds in each type of beverage to discover potential chemical markers in their qualities.

Chemical and materials Spirit samples
Six different samples of the beverage Puro, six of Pata de vaca and six of Pájaro azúl were taken for the analysis. All the samples were obtained from Guaranda, central state of Ecuador, and these spirit beverages were produced following a traditional artisanal methodology.

Reagents and materials
A series of C8-C22 n-alkanes (Sigma-Aldrich-St. Lois, MO, USA) was used for the determination of the 1D linear temperature programmed retention indices (LTPRI), additionally hexane and heptane were used in order to calculate with high precision minors alkanes. The HS-SPME procedures were performed using a SPME fiber coated with 50/30 µm divinylbenzene/Carboxen on poly(dimethylsiloxano) (DVB/CAR/PDMS) (Sigma-Aldrich). Septum-sealed Pyrex vials of 20.00 mL (Wheaton science Products-Millvine, NJ, USA), volumetric flask of 50.00 mL and magnetic stirrers were also used during the sample preparation procedures (Sigma-Aldrich).

HS-SPME sample preparation
An aliquot of 5.00 mL of the spirit samples was diluted with water in a volumetric flask of 50 mL containing 2.5 g of sodium chloride [1,5]. Then, 10 mL of this solution was transferred into a 20.00 mL septum-sealed Pyrex vials, and the SPME fiber was exposed in the headspace during 20 min, at T = 60 °C and magnetic stirring (600 rpm). For the retention indices determination, samples were spiked with 5 µL of a C8-C22 n-alkanes standard mixture [19]. The extracted compounds were immediately desorbed into the GC injector at 250 °C for 3 min.

Equipment
The analyses were performed on a lab-made GC × GC-QMS prototype based on a QP2010 + GC (Shimadzu Corp, Tokyo, Japan) fitted with a split/splitless injector and equipped with a miniaturized sealed two-stage cryogenic modulator that provided cold (T = − 196 °C) and hot (T = 250 °C) jets that were controlled by solenoid valves (ASCO, Florham Park, NJ-USA) and a 8-bit Duemilanove microcontroller board (Arduino, Ivrea, Italy) [20]. The modulation period was set to 6.0 s. The column set consisted of a 25 m × 0.25 mm × 0.25 µm HP-5 MS (Agilent Technologies-Palo Alto, CA, USA) column (1D) fitted with a 1 m × 0.10 mm × 0.10 µm SupelcoWax 10 column (Sigma-Aldrich), as the second dimension (2D). The oven temperature programming was initially set to T = 35 °C (t = 5 min), then it was raised to 210 °C at 3 °C/min, next to T = 240 °C at 40 °C/min and finally holding for 10 min. The injection port and transfer line were kept at T = 250 °C, using hydrogen as carrier gas at initial flow of 0.6 mL/min. The MS ionization source was set to 200 °C and the mass scan range was set from m/z 40 to 487 Da, at acquisition rate of 20 Hz. The peaks identification was performed using the NIST 2010 (NIST, Gaithersburd-MD, USA) and the FFNSC (Chromaleont, Messina, Italy) spectra libraries combined with the LTPRI inspections. All the analyses were performed in duplicate. The raw two-dimensional chromatograms were generated using the GCImage software (Zoex Corp., Houston, TX, USA).

Multivariate analysis
The raw unfolded GC × GC-Q(TIC)MS chromatograms were firstly converted to .txt files and then imported into Matlab ® R2013b software (MathWorks, Natick-MA, USA). Next, the chromatographic peaks were aligned using the icoshift algorithm [21] and the Fisher-ratio was performed throughout the aligned chromatograms using an in-house routine written in Matlab. The PCA was performed in the mean-centered unfolded chromatograms containing only the selected peaks obtained previously from the Fisher-ratio results, using the software Pls_ Toolbox v. 8.1.1. for Matlab (Eigenvector Research Inc., Wenatchee-WA, USA). The chromatographic loadings extracted from PCA were re-folded to the original twodimensional chromatographic structure for visualization and interpretation.

Results and discussion
The conditions for the extraction of the compounds in the spirits were adapted from a previous research [5], in which the 6.0 s modulation period was suitable for the proper chromatographic separation in 2D without jeopardizing the efficiency in 1D, during the analysis around 150 approximately were detected. However, 100 compounds were identified which are the responsible for the differentiation of the samples. Figure 1 shows the aligned unfolded chromatograms of the samples, whose variance may be mostly attributed to the chemical diversity in the beverages that is the result of the different ingredients used during the preparation of each type of beverage.
Additionally, the profiles of the chromatographed compounds are a consequence of the physicochemical properties of the DVB/CAR/PDMS fiber that selectively extract polar and non-polar compounds. A clear difference between the samples of Puro and Pájaro azul, and between Puro and Pata de vaca can be easily noticed in Fig. 1, while distinguishing Pájaro azul from Pata de vaca is much more difficult due to the highly similar fingerprints between the samples of these beverages. Therefore, the use of multivariate data analysis for pattern recognition was required to achieve this goal, obtaining a better discrimination amongst the types of beverages by firstly selecting only the most relevant compounds for the discrimination, and afterwards computing the Fisher-ratio for each peak, followed by multiway PCA. The compounds were tentatively identified according to the MS library matching and retention index criteria, in which the uncertainty threshold of 3% was considered reasonable ( Table 1).
The two-dimensional structure of the chromatograms was also used to support the identification of homologous compounds. Moreover, the multiway PCA provided two factors that explained 59.85% (PC1) and 20.04% (PC2) of variance in the data, and no outliers were detected. The three different types of beverages could be distinguished in the reduced subspace defined by the PCs (Fig. 2).   The most important compounds related to the discrimination among the samples for each factor were identified in the loadings plots of the model depicted in the Fig. 3. On the one hand, the compounds found in the negative loadings were responsible for the differentiation between the samples, and these were identified in the plot with shades of yellow with a scale from 0 to − 0.04. On the other hand, the compounds in the positive loadings were common among the samples and identified in the plot with shades of blue with a scale from 0 to 0.08.
In general, the compounds identified in all the samples belong to the family of alcohols, aldehydes, ester and acetates. Essentially, compounds 2-methyl-1-propanol, 2-propanol, ethyl acetate, n-Butanal, 3-methyl-1-butanol, 2-methyl-1-butanol, hexanal, 3-hydroxybutanal, furfural, isopenty acetate and heptanal were tentatively identified in the positive loadings from the PC1, which are the most volatile compounds recognized by their negative organoleptic contributions and described as "spicy" or "solventlike" and produce toxic effects [1,8]. Short chain alcohols such as n-butanol, n-pentanol, n-hexanol, n-heptanol and 2-heptanol (which are normally associated with green flavor) were also identified, which may result in prejudicial sensorial characteristics for the beverage when found in higher concentrations [5,[22][23][24]. However, these compounds were identified in higher concentrations in both Pájaro azul and in Pata de vaca than in Puro, according to the loadings plots (Fig. 3a).
Esters compounds such as ethyl heptanoate, ethyl octanoate, ethyl nonanoate, ethyl decanoate, ethyl undecanoate, ethyl dodecanoate, ethyl tetradecanoate, ethyl pentadecanoate, ethyl hexadecanoate, ethyl heptadecanoate and ethyl octadecanoate were identified, which are present in many alcoholic beverages such as tequila, rum and cachaça [7,23,25]. These compounds are usually associated with fruity and pleasant attributes, and they were found in higher concentration in both Pata de vaca and in Pájaro azul than in Puro. The preparation process of Pata de vaca and Pájaro azul beverages supports this result, as chicken legs are commonly used in Pájaro azul, while beef legs are added to Pata de vaca. Therefore, a higher concentration of these compounds can be the result of esterification reactions between some animal saturated fatty acids and the ethanol contained in the beverage, as well as the heating occurring during the distillation and production processes.
The positive loadings in PC1 also show the compounds 2,4-dimethylanisole, estragole, foeniculin, p-propyl anisole, (Z)-anethole and (E)-anethole in higher concentration in Pata de vaca than in Pájaro azul. Despite the fact that both beverages contain the same quantity of anise, Pájaro azul contains a greater number of additional components like fruits, which may result in a dilution of the compounds responsible for the anise flavor. In addition, it is worth considering that the alcohol content in Pata de vaca is 45% approximately, while the alcohol content in Pájaro azul is 40%, and that both beverages are the result of the same distillation cut. In addition, some plants used exclusively during the preparation of Pata de vaca can contribute to their constituents, increasing the concentration of these compounds in this beverage. To illustrate this, compounds such as p-cumenol, methyl salicylate, 4-allylphenyl acetate, propyl octanoate, β-caryophyllene, α-farnesene, β-farnesene, iso-amyl n-decanoate, secbutyl octanoate and α-bisabolol were also present in Pájaro azul and Pata de vaca (higher concentrations), and these compounds are generally found in plant extracts, which are mainly used in Pata de vaca according to its artesian recipe.
On the other hand, the compounds carene, (−)-transα-bergamotene and (−)-cis-α-bergamotene, which are characteristic of some herbs and plants that may be used during the beverage production, were identified only in the positive loadings of Pata de vaca. Carene is particularly characteristic of rosemary that is a herb used in the preparation of Pata de vaca, as well as carrots, which are associated with the (−)-trans-αbergamotene and (−)-cis-α-bergamotene compounds responsible for the yellow color of this beverage, and thus they can be considered origin markers [26,27] (Fig. 3a).
In the negative chromatographic loadings in PC1, compounds such as 1-decanol, and n-dodecanal were found in higher concentration in Puro and Pájaro azul, and they are associated with toxic effects [7,23].
Additionally, the identification of the compounds d-limonene, methyl anthranilate, citronellal, citronellol and geraniol in Pájaro azul in high concentration may indicate markers of origin (Fig. 3b). The typical blue color of this beverage corresponds to the ancestral recipe in which the artisans add leaves of tangerine to provide this color. While d-limonene and methyl anthranilate are characteristic compounds in citrus fruits such as tangerines, oranges and lemons, citronellal, citronellol and geraniol are compounds found mainly in Citronella grass, which is a herb used during its production. The presence of these compounds help to perform the quality control of authentic Pájaro azul, as opposed to counterfeits in which the blue color is due to some colorants in the beverage in order to avoid expenses and raw material consumption, colorants in the beverage that are not reported in sugar cane [28][29][30]. Furthermore, the higher molecular weight of the compounds n-octanol, n-nonanal, 2-undecanol, n-dodecanol, n-tridecanol and n-tetradecanol identified in high concentration in the negative PC2 loadings belong to the Puro as well as, and these compounds are related to the poor aroma quality of this beverage. Diethyl succinate was also identified, which is a secondary compound resulting from fermentation and provides some pleasant flavor [23].
Performing quality control during the distillation process with the purpose of eliminating toxic compounds, but maintaining the compounds associated with the flavors at the same time, proves an interesting approach from the commercial perspective. Specifically, techniques using high chromatographic resolution such as GC × GC, along with (multiway) PCA for the identification and characterization of volatile profiles of these ancestral selected spirits provided a suitable and time-efficient tool in order to assure quality control during their production. These techniques also ensured the presence of their most important constituents, especially those that have a great influence on the chemical and physical characteristics of these beverages. Finally, this suggests that the monitoring of these compounds should be part of the routine protocols to ensure quality and to avoid the addition of any other components in the recipe that may affect the characteristics of the final product.

Conclusions
Comprehensive two-dimensional gas chromatography along with MPCA allowed the discrimination between three Ecuadorians artisan spirits, characterizing the volatile profiles of each them, in order to measure their qualities. MPCA along with Fisher ratio allowed to perform a tentative identification of the most important compounds for the discrimination of the beverages, as well as the detection of the compounds that can considered marker of origin. The monitoring of these compounds may avoid counterfeiting practices, mainly those related to the substitution of the original products that contain the essential components responsible for their organoleptic properties, according to the ancestral recipe. In this study, Pájaro azul and Pata de vaca were found to be significantly different from Puro, but they were very similar to each other to the extent of becoming almost impossible to truly distinguish each other only by simple visual inspection. However, the target analysis of the main compounds such as citronellal, citronellol, geraniol, methyl anthranilate, carene, (−)-trans-α-bergamotene, (−)-cis-α-bergamotene and d-limonene can provide the basic chemical differences between these spirits, since they have low concentrations in these beverages. GC × GC-MS became an alternative to the proper separation and detection of such compounds; As a result, the two-dimensional chromatographic fingerprints obtained by GC × GC-MS coupled with chemometric analysis using MPCA and Fisher Ratio proved valuable tools for the characterization and quality inspection of these spirit beverages.