Keywords
Abstract
Present review focuses on the most recent results (the interim of 2019-2023) in a liquid-phase NMR of lignin and the lignin-derived products and related chemicals. Collected and discussed are the recent applications of 1H, 13C, and 31P NMR spectroscopy together with NMR of less popular nuclei to the structural studies and practical recommendations in this field. Owing to the complexity of lignin and the products of its transformation, their NMR spectra consist of a number of overlapping signals belonging to different structural types. In this regard, comprehensive studies of lignin by means of NMR over the past several decades revealed characteristic functional groups of lignin and lignin-related products together with spectral regions in which they resonate. Quantitative NMR spectra of 1H and 13C together with less popular nuclei like 31P are used to characterize different structural units of lignin (such as guacil, syringil, and p-hydroxyphenyl) providing aromatic and saturated carbons spread over many structural moieties.
The bibliography includes 54 references.
1. Introduction
Lignin is a prospective biomass-derived source for the production of some crucial organic building blocks, typically derived from unsustainable and non-renewable petroleum feedstocks.1 Lignin, a source of plant-based biomass, is a complex cross-linked phenolic polymer, as shown in Fig. 1.2 As a complex polymer, lignin characterization is often challenging due to its random structure and multitudes of different repeating molecular substructures.
Identified structural units of lignin are shown in Fig. 1 and characteristic heteronuclear single quantum coherence (HSQC) NMR spectrum of spruce milled wood lignin is presented in FIg. 2. Aromatic structures present in lignin are as follows: syringyl unit, oxidized syringyl unit, guaiacyl unit, p-hydroxyphenyl unit, ferulic acid, cinnamyl alcohol end group, cinnamaldehyde end group, and p-coumaric acid, while the main linkages of lignin are β-aryl-ether β-O-4, resinol β-β, phenylcoumaran β-5 and spirodienone.
This polymer is crucial to the structural integrity of cell walls in wood and bark and has been extensively studied. As a by-product of paper production, lignin is used for fast energy generation. However, lignin proved to be an exceptional source of organic building blocks, many of which are currently obtained from petroleum feedstocks. Lignin, a natural biopolymer and an abundant by-product, is a particularly promising feedstock for carbon-based materials and a potentially sustainable alternative to phenolic resins, which are typically derived from crude oil. Among different types of biomass, lignin is the most abundant aromatic biopolymer and a potential source for industrial products.
The structure of lignin is highly dependent on its origin. While softwood lignins contain mainly guaiacyl units with negligible amounts of p-hydroxyphenyl groups, hardwood lignins are composed of guaiacyl and syringyl subunits. Grass lignins are constituted by guaiacyl (G), syringyl (S), and p-hydroxyphenyl (H) subunits.
In general, NMR provides direct information on lignin structure and its chemical composition. In particular, data from quantitative monodimensional 1H and 13C NMR spectra together with heteronuclear and multidimensional NMR data provide information on different types of lignin interunit bondings.3 Unfortunately, monodimensional spectra suffer from significant signal overlap, which seriously hampers signal integration. Quantitative HSQC has been used to understand lignin framework, providing direct information on internal linkages and also giving general insights into lignin structure based on the early 1H and 13C NMR (see Refs 4 – 9) together with 31P NMR (see Ref. 10) data on modified lignin.
However, although a part of the lignin chemistry community has used HSQC as a quantitative or semi-quantitative tool in recent years, this is not scientifically rigorous and therefore, acceptable due to the intrinsic nature of the pulse sequence. Quantitative 1H – 13C HSQC measurements have recently greatly facilitated lignin analyses. In some cases, however, long acquisition times were needed for obtaining quantitative HSQC, which are incompatible with the chemical integrity of the (potentially functionalized) lignin sample.
Sette et al.9 compared different methods that were developed for the more time-efficient quantitative HSQC measurements with respect to their usefulness in lignin analysis. As a result, reliable and reproducible results were obtained using both the QQ-HSQC and HSQC0 methods. Indeed, an obstacle to the development of a reliable relationship between lignin structure and upgradeability is the difficulty to quantitatively measure lignin structural features by means of classical HSQC. Amiri et al.11 demonstrated that a modified HSQC method known as HSQC0, can accurately quantify lignin functionalities in extracted lignin. In 2024, Bourmaud et al.12 developed a rapid elevated-temperature 1H – 13C heteronuclear single-quantum coherence zero (HSQC0) method that enables a precise quantification of the kraft lignin structural characteristics even with whole plant cell wall (known as ‘HSQC0 WPCW’). Such analysis illustrates how this analytical method can greatly facilitate the study of kraft lignin structure, which can then be used for the fundamental studies of lignin.
Recent applications of 1H, 13C, and 31P NMR spectroscopy together with NMR of less popular nuclei like 31P, 19F, and 15N to the structural studies of lignin provide a breakthrough and practical guide in this field. The structural NMR studies of lignin are developing on a backdrop of a marked progress in computational NMR of hydrogen 13 – 15 and carbon,16 – 20 the elements which are the main constituents of lignin, together with NMR of phosphorous,21, 22 provided lignin is labelled with 31P isotope to offer lignin analysis by means of 31P NMR.
2. Most recent NMR studies of lignin and lignin-derived products
2.1. 1H and 13C one-dimensional NMR
A representative example of NMR studies of lignin carried out in the last five years is the paper by Levdansky et al.,23 which deals with the NMR study of the soluble organosolv lignins extracted from fir and aspen wood. Fir and aspen lignins were found to be of the G and GS types, respectively. The one-dimensional 1H and 13C NMR spectra of these samples are illustrated in Fig. 3 and Fig. 4, respectively. The authors also found that the β-O-4' linkages in ethanol lignins extracted from fir and aspen were partially ethoxylated and contained the ethoxy groups in the α-position.
Hopa and Fatehi 24 studied sulfoalkylated lignin derivatives obtained by sulfomethylation and sulfobutylation of kraft lignin. The 1H NMR spectra of kraft lignin and its sulfomethylated and sulfobutylated derivatives, shown in Fig. 5, were used to identify those structures. In the spectrum of kraft lignin, two broad peaks at 6.8 – 7.5 and 3.2 – 3.8 ppm were attributed to the protons of aromatic ring and methoxy groups, respectively. In the spectrum of sulfomethylated lignin, these peaks were found to be much weaker than those of kraft lignin due to the cleavage of aromatic ring and methoxy groups during sulfomethylation. The peak at 8.2 ppm (assigned by the authors to proton A) represented the hydrogen of the phenolic hydroxyls. In the spectrum of sulfomethylated lignin, the peak at 3.23 ppm (assigned to proton B) was attributed to the signal of the methylene groups. In the spectrum of sulfobutylated lignin, three peaks at 1.79, 1.58, and 2.63 ppm were assigned to C, D, and E protons, respectively, which were absent in the spectrum of the starting kraft lignin.
In 2023, Zhao et al.25 prepared a novel cationic tall oil lignin (TOL)-starch copolymer by the polymerization of TOL, starch and a cationic monomer of (3-acrylamidopropyl) trimethyl ammonium chloride (APTAC) through the free radical chain extension. Based on the 1H NMR and 1H – 1H Correlated Spectroscopy (COSY) experiments, it was found that TOL and starch were covalently linked by the APTAC to produce corresponding cationic copolymers. One-dimensional and 1H – 1H COSY spectra of the cationic copolymers prepared with a ratio of APTAC to TOL of 1 or 5 in the neutral or basic reaction medium (designated as CR1-pH7, CR1-pH11, CR5-pH7, and CR5-pH11, respectively) together with the pristine starch and TOL are shown in Fig. 6 and Fig. 7 as examples. For the origin and numbering of the fractions, see the original publication. In general, it was shown that the structure of cationic copolymers comprised anhydroglucose unit of starch, the phenolic structure of TOL, and the APTAC monomer. It was also found that the phenolic hydroxyl content of TOL was reduced by the reaction of the APTAC chain with the phenolic OH of TOL, significantly increasing the solubility, charge density, and molecular weight of the cationic copolymers.
In 2023, Wang et al.26 explored the free-radical in situ copolymerization study of lignin, acrylamide, and diallyldimethylammonium chloride by a combination NMR with other methods. It was shown that the presence of acrylamide in that reaction was essential for the polymerization of diallyldimethylammonium chloride with lignin to proceed due to the lower steric hindrance of the former (Fig. 8). The monomer conversion ratio and dynamic rheology of the reaction system indicated that lignin acted as an inhibitor in the corresponding copolymerization reaction.
Mun et al.27 studied the samples of kraft lignin from mixed hardwoods by 1H and 13C NMR. It is well known that the basic aromatic structure of hardwood lignin consists of guaiacyl (G) and syringyl (S) units, both of which can be seen in the 1H NMR spectra of acetylated kraft lignin (Ac-KL) and most clearly in the spectra of acetylated milled wood lignins (Ac-MWL) extracted from acacia (Ac-MWL-aca) and mixed hardwood (Ac-MWL-mhw) (see the marked yellow region in Fig. 9). The intensities of aromatic protons in Ac-KL were 0.52 for S and 0.37 for G units, while for Ac-MWL-mhw, the aromatic proton intensities were accordingly 0.84 for S and 0.57 for G units. Based on the 1H NMR spectral intensities of Ac-KL and Ac-MWLs, the authors detected the decreased number of aromatic protons, suggesting a highly condensed structure of KL. As shown in Table 1, the S/G molar ratio of Ac-KL was almost similar to that of Ac-MWL-mhw.
The 13C NMR spectra of KL and milled wood lignin (MWL) from acacia and mixed hardwood are presented in FIg. 10. The corresponding chemical shifts and signal intensities are listed inTable 2. The 13C NMR spectra of MWL-aca and MWL-mhw in aliphatic and aromatic regions were found to be essentially similar. However, the 13C NMR spectrum of KL showed decreased aliphatic and increased aromatic content. This phenomenon was typical for KL since significant structural changes occurred in the aliphatic and aromatic moieties of lignin during kraft pulping. In addition, demethylation of methoxyl groups and cleavage along Cγ leading to polycondensation resulted in a marked difference between KL and MWL spectra, as was claimed by the authors as one of the main results of their study.
Rönnols et al.28 discussed the prospects for lignin analysis by the low-field benchtop NMR spectroscopy. It is well known that benchtop NMR spectroscopy is an emerging field with an appealing profile for analytical analysis of lignin. The instrumentation of the benchtop NMR spectroscopy offers the possibility to measure NMR spectra in situations where the use of the high-field NMR spectroscopy is too expensive or unavailable. In that paper, the ability to quantify different lignin samples using benchtop NMR was found to be rather promising. For example, the 1H NMR spectra of isolated lignin samples recorded at benchtop low and high magnetic fields showed three major peaks arising from aromatic and oxygenated aliphatic compounds together with that of the residual solvent with the corresponding spectral regions being clearly separated (Fig. 11).
2.2. 1H and 13C two-dimensional NMR
Two-dimensional NMR of lignin is currently used as either a qualitative or a semi-quantiative technique. An example of this is the fundamental study by Levdansky et al.,23 who used HSQC spectra to confirm the presence of the β-O-4', β–β', and β-5' linkages in the organosolv lignin extracted from fir and aspen, as shown in Fig. 12 and Fig. 13. The main structural units of these ethanol lignins are also presented (see Fig. 12).
The same year, Li et al.29 reported a kinetic study of the hydrogenolysis of lignin to monomers in a continuous flow reactor using 2D HSQC NMR spectra (Fig. 14). It was found that lignin end and internal units reacted essentially differently which provided insight into lignin hydrogenolysis pathways. As a result of that study, a continuous lignin upgrading process was developed using soluble lignin in a continuous flow reactor. The advantages of the flow-through system included the continuous production of lignin monomers monitored by 2D HSQC NMR by optimizing reaction parameters such as reaction temperature, pressure, and flow-rate in real time.
The resulting 2D HSQC NMR spectra showed that the major lignin sidechain units (β-O-4, β-5, and β-β) in the Cu-catalyzed alkaline hydrogen peroxide (Cu-AHP) lignin were well resolved in the sidechain fingerprint region. The aromatic signals were similar to those from kraft lignin, except for a small amount of the benzylic oxidation caused by the Cu-AHP pretreatment. The signals of the lignin side-chain units showed no obvious changes before and after the reaction.
In 2019, Liu et al.30 reported a rapid lignocellulose fractionation method applied to common reed based on milling under mild alkaline conditions with using no organic solvents or other reagents. As a result, the lignin-containing cellulose nanofibre and native-like lignins were obtained. Films prepared from the suspension of the lignin-containing cellulose nanofibres showed very interesting properties such as transparency and hydrophobicity. Shown in Fig. 15 are characteristic partial 2D HSQC NMR spectra of reed straw enzyme and alkali-pretreated reed lignins. Overall, the isolated lignin preserved its native structure with high β-ether content. The resulting isolated high-quality lignin was found to be suitable for further upgrading by various lignin depolymerization processes.
Wang et al.31 developed an integrated process based on hydrothermal pretreatment and Kraft delignification to deconstruct lignocellulosic biomass. Hydrothermal pretreatment was found not only to facilitate the production of xylo-oligosaccharides but also to reduce the chemicals dosage of the following delignification. The structural characteristics of lignin obtained from the integrated process were studied by 2D HSQC NMR (Fig. 16).
Zhao et al.32 prepared alkaline and kraft lignins and carried out their comparative characterization by 2D HSQC NMR (Fig. 17 and Fig. 18). Using authenticated reference compounds, the phenylglycerol structures, characteristic of the cleavage of non-phenolic β-aryl ether by soda pulping, in the alkaline lignin were identified and quantified by 2D HSQC NMR. The content of phenylglycerol compounds in alkaline lignin was estimated to be 8 – 14%, which was much higher than that in kraft lignin.
A large number of related 2D HSQC NMR studies have recently appeared devoted to the structural studies of lignin and lignin-related products, as exemplified below being among many other related papers.
Cao et al.33 investigated the microwave-assisted depolymerization of various types of waste lignins elucidating how the structure of lignins recovered from various agricultural and industrial residues affects its downstream catalytic conversion. Three types of lignins, such as bio-enzymatic lignin, organosolv lignin, and Kraft lignin, were fully characterized by HSQC-NMR to obtain a detailed description of their structures, as shown in Fig. 19. The strong dependence of the conversion efficiency on the interunit linkages and functional groups of lignin structures was found. The strong metal – support interaction between CuO and sodium cyanoborohydride NaBH3CN not only facilitated lignin depolymerization via the promoted electron transfer, but also improved the stability of Cu catalysts under hydrothermal conditions.
Chen et al.34 developed an integrated biorefinery strategy towards phenolics, levulinic acid, and furfural during total utilization of lignin and carbohydrates in Eucalyptus grandis. It is well known that the depolymerization of lignin into the well-defined mono-aromatic chemicals suitable for downstream processing is recognized increasingly as an important starting point for the lignin valorization. In that study, the conversion of all three components of Eucalyptus grandis into the corresponding monomeric chemicals was investigated using solid and acidic catalysts in sequence. The 2D HSQC NMR spectra of the lignin oil shown in Fig. 20 confirmed that the main substructures, such as β-aryl ether (β-O-4), resinol (β-β), and phenyl coumaran (β-5) have disappeared after Pd/C treatment, indicating the C – O bonds dissociation in protolignin. New dominant cross peaks corresponding to a propanol moiety were observed in HSQC. The cross signals in the aromatic region corresponding to guaiacyl and syringyl units were also found.
Kim et al.35 carried out a 2D HSQC study of monolignol benzoates incorporation into the lignin of transgenic Populus trichocarpa (Fig. 21). It was found that monolignol benzoates incorporated into the lignin of triple-transgenic Populus trichocarpa, which is downregulated for the three endoplasmic reticulum cytochrome enzymes participating in oxidative radical coupling reactions during lignification in the cell wall. This study has demonstrated how the biosynthesis of monolignol benzoate and monolignol p-hydroxybenzoate in plant is involved in the lignin biosynthetic pathway by providing direct evidence for the monolignol benzoate and monolignol p-hydroxybenzoate structures in the transgenic Populus trichocarpa lignin. In addition, attention was drawn to the convoluted interactions of enzymes in the pathway to produce benzoates.
Li et al.36 used a number of methods to identify the biomass structural determinants defining the properties of the plant-derived renewable carbon fibre. The linkages and composition of lignin from eight sorghum feedstocks were analyzed by 2D HSQC NMR and the linkage profiles of uncondensed β-O-4, condensed β-5, and condensed β-β were calculated based on a number of aromatic rings of syringyl, guaiacyl and ρ-hydroxyphenyl units. At that, the frequencies of β-O-4, β-5, and β-β linkages were determined based on a series of 2D HSQC NMR experiments (Fig. 22 and Fig. 23). Considering the diverse range of lignin characteristics, lignin from the sorghum feedstock samples was found to be a perfect model precursor to elucidate the impact of biomass characteristics on the performance of the resultant carbon fibres. That 2D HSQC NMR study opened a new path to modify cell wall structures for manufacturing the high-quality carbon fibres.
Liu et al.37 reported the method for isolation and characterization of crude lignin of bark using the ethanol-water organosolv treatment. According to the two-dimensional 1H – 13C 2D HSQC NMR and monodimensional 13C NMR spectral data, crude lignin contained mixed polyphenolics, suberin compounds and carbohydrates (Fig. 24). Overall, this study demonstrated the potential to solubilize a high proportion of bark and provided insight into the structure–property relationships of crude lignin as a function of processing conditions.
Rowlandson et al.38 investigated the influence of aromatic structure of lignin on its thermal behaviour. The authors performed a systematic study of the chemical composition of lignins extracted using an identical organosolv isolation method but from different biomass feedstocks: hemp hurds, eucalyptus chips, flax straw, rice husk and pine. Basic results were obtained mainly from the 2D-NMR HSQC spectra shown in Fig. 25 for rice husk lignin, eucalyptus lignin and industrial lignin. It was found how the aromatic structure of lignin could affect the thermal behaviour of the polymer, which correlated with the structure of the resulting carbons. Carbons from lignins with a high content of syringyl units displayed a pronounced foaming behaviour which, upon activation, resulted in a high-surface area material with hierarchical porosity.
It was found that organosolv lignin samples contained syringyl unit, oxidized syringyl unit, guaiacyl unit, p-hydroxyphenyl unit, ferulic acid, cinnamyl alcohol end group, cinnamaldehyde end group, and p-coumaric acid with the following main linkages present: β-aryl-ether β-O-4, resinol β-β, phenylcoumaran β-5 and spirodienone (Fig. 26).
Tokunaga et al.39 explored the nonproductive binding sites of lignin models with carbohydrate-binding module of cellobiohydrolase. In this study, three types of 13C-labeled β-O-4 lignin oligomer models were synthesized and characterized. According to the 2D 1H – 13C HSQC spectra of the 13C-labeled lignin models (Fig. 27), three types of the 13C labels were correctly incorporated in the aromatic rings and β-positions, α-positions and methoxy groups. The Trichoderma reesei carbohydrate-binding module 1 (TrCBM1) binding sites in lignin were analyzed by observing NMR chemical shift perturbations using the synthetic 13C-labeled β-O-4 lignin oligomer models. Obvious chemical shift perturbations were found in signals from the aromatic regions in oligomers bound to TrCBM1, whereas changes in the signals from aliphatic regions and methoxy groups were insignificant. These results suggested that hydrophobic interactions and π – π stacking were dominating factors in the nonproductive binding.
Ghavidel et al.40 studied chemical reactivity and sulfofunctionalization of enzymatically produced lignin. The comparison between six different enzymatically hydrolyzed lignin samples revealed the fundamental role of β-aryl ether and residual carbohydrate on the physicochemical properties and chemical reactivity of the samples used to generate dispersants for the clay suspension. Based on the 2D HSQC spectra of six hydrolyzed lignin samples shown in Fig. 28, it was established that lignin samples with fewer β-O-4 and lignin-carbohydrate bonds (LCC) had a higher content of Ph – OH and carboxylate groups. The higher ratio of these bonds was associated by the authors with the superior contribution of the β-aryl ether and lignin-carbohydrate bonds. The samples with the lower sugar content appeared to be smaller and more porous, which increased the chemical reactivity of the hydrolyzed lignin by making it more accessible to the functionalization reactions.
Islam et al.41 used N-methyl-2-pyrrolidone (NMP) pretreatment of lignocellulose to increase lignin yield and cellulose digestibility. The authors discovered a new N-methyl-2-pyrrolidone pretreatment solvent with excellent lignin solubility and preservation of β-O-4 linkages present in the pristine lignin structure. These results indicated that the formation of 4-(methylamino)butyric acid in the NMP pretreatment system increased the polarity of the solvent system. The increased polarity in turn increased the lignin removal and reduced the amount of lignin condensation. This result was attributed to the higher amount of the β-O-4 interunit linkages in the NMP lignin sample, which was mainly determined by HSQC analysis.
The 1H – 13C 2D HSQC NMR spectra shown in Fig. 29 demonstrated that the major substructures of the studied lignin were S and G units together with interunit β-O-4, β-β and β-5 linkages. The NMR signals were assigned by comparison with those of the benchmark ball mill lignin. The position and colour of the correlations shown represent the corresponding substructures, which are β-O-4 (green), β-5 (brown), and β-β (pink), S unit (grass green), G unit (light orange) and H unit (purple). The semi-quantitative values of major lignin moieties were determined using the aromatic units as an internal standard. For the ball milled lignin, α-position of β-O-4, β-β, and β-5 linkages appeared at δC/δH 72.56/4.82 ppm, δC/δH 85.66/4.60 ppm and δC/δH 86.1/4.1 ppm, respectively. With the NMP pretreatment, the corresponding peaks were observed at the same position with different intensities. The β-O-4 was found to be the predominant linkage of the studied lignin, which determined the integrity and potential of fractionated lignin for its effective valorization to produce aromatic chemicals.
Diaz-Baca et al.42 carried out the polymerization of tall oil lignin, starch and 2-methyl-2-propene-1-sulfonic acid sodium salt to produce flocculants for colloidal systems together with the sulfonated lignin – starch polymer, and described the use of the latter as a flocculant. Based on the advanced 1H, COSY, HSQC, HSQC-TOCSY, and HMBC NMR techniques (Fig. 30 and Fig. 31), it was confirmed that the phenolic substructures of tall oil lignin and the anhydroglucose unit of starch were covalently polymerized by the monomer to give the three-block copolymer.
Karlsson et al.43 studied lignin structure and reactivity in the organosolv process. The aim of this work was to study the reactivity of lignin during a cyclic organosolv extraction process while using physical protection strategies. Synthetic lignins obtained by mimicking the lignin polymerization process were used as reference compounds. The authors used state-of-the-art NMR analysis, which is a powerful tool for elucidating lignin interunit linkages and functionalities. This study revealed interesting fundamental aspects of lignin polymerization, such as identifications of molecular populations with high degrees of structural homogeneity and the emergence of branching points in the lignin structure. Analysis of the organosolv biorefinery lignin by HSQC NMR, as exemplified in Fig. 32, showed an abundance of aryl-ether linkages (β-O-4), confirming that physical protection did indeed occur due to performing the extraction in cyclic mode. A previously proposed intramolecular condensation reaction was substantiated, and new insights into the selectivity of this reaction were introduced, which was supported by the density functional theory (DFT) calculations (Fig. 33 and Fig. 34), where the important role of intramolecular π – π stacking was emphasized.
Rinken et al.44 investigated the role of solvent fractionation of lignin oil in structural profiling during lignin stabilization, and the carbohydrate nature in the H-transfer reductive catalytic fractionation was also evaluated. In particular, the solvent fractionation of the reductive catalytic fractionation lignin oil was studied as a facile method for producing lignin oil fractions for advanced characterization. Solvent fractionation is known to use small volumes of environmentally benign solvents like methanol, acetone, and ethyl acetate to produce multigram lignin fractions containing products in different molecular weight ranges. This feature allows the determination of structural heterogeneity across the entire molecular weight distribution of the reductive catalytic fractionation lignin oil by high-resolution HSQC NMR spectroscopy. Performed study provided a detailed insight into the role of the hydrogenation catalyst in stabilizing the lignin fragments and defining the structural features of the hemicellulose-derived carbohydrates in the lignin oil. The detailed characterization of the fractions by HSQC NMR measurements (Fig. 35) revealed the formation of the reduced β-O-4 linkages presenting a methylene group at the Cα position. It is generally accepted that stabilization processes primarily involve lignin monomer intermediates. The findings presented in this publication indicated that an additional route for the lignin stabilization has been established through the formation of the reduced β-O-4 linkages.
The next two short Sections 2.3 and 2.4 focus on the application of 31P, 19F, and 15N NMR to the structural studies of different types of lignin are by no means a comprehensive discussion of the subject.
2.3. 31P NMR
Due to the complexity of the repeating monomers in lignin, its 1H NMR spectra suffer from significant overlap when used for lignin analysis. However, 31P derivatization has been proven a useful technique for the NMR analysis of lignin. The structure of lignin can be effectively modified by the instantaneous incorporation of phosphorus into kraft lignin, so that 31P NMR could be applied to lignin analysis and structural studies. As an example, see paper by Puyadena et al.45 on the phosphorus-containing lignin intermediates for the polyurethane and acrylic coatings for wood. The 100% natural abundance of the 31P isotope, the large chemical shift range, and the sharp signals provide perspectives of using 31P NMR for the lignin analysis.
One of the basic publications 46 of the last five years demonstrated that 31P NMR spectroscopy is a promising technique for the quantitative analysis of hydroxyl groups of lignin due to its unique characterization capability of the biorefinery process. The proposed protocol describes procedures for the preparation and analysis of phosphitylated lignin samples based on 31P NMR spectra together with the means of quantitative analysis of different types of hydroxyl groups. Compared to the traditional ‘wet-chemical techniques’, the above method of quantitative 31P NMR spectroscopy offered unique advantages in measuring phosphitylated hydroxyl groups from a single 31P NMR spectrum, as illustrated in Fig. 36 and Fig. 37. In addition, this method provided complete quantitative information about hydroxyl groups in small amounts of sample (~ 30 mg) within a relatively short experimental time (up to 2 h).
A similar study was carried out using quantitative 31P NMR of lignins and tannins from organosolv lignin, which was derivatized with TMDP.47 Quantitative 31P NMR spectroscopy was shown to be a rapid and reliable method for the identification of unsubstituted, o-mono-substituted and o-disubstituted phenols, aliphatic hydroxyls and carboxylic acids in lignins and tannins. The methodology consisted of an in situ quantitative 31P labelling of lignin or tannin followed by the acquisition of a quantitative 31P NMR spectrum (Fig. 38). However, it was concluded that TMDP-based 31P NMR is unable to distinguish aryl-glycerol-β-aryl ethers, phenylcumarane and pinoresinols.
Jia et al.48 determined the role of lignin produced in the autohydrolysis process in the enzymatic hydrolysis of biomass. Based on the 31P NMR data, β-O-4 of lignin was reduced, while β-5 and β-β ratios increased with increasing the hydrolysis intensity. At that, the increase in the hydrolysis intensity significantly enhanced the content of condensed and non-condensed phenolic OH groups of lignin. It was also shown by 31P NMR that the cellulase enzyme adsorbs more readily onto lignin with higher phenolic content, and its association with lignin reduces its activity to hydrolyze cellulose microcrystals.
Mun et al.49 carried out 31P NMR analysis of kraft lignin prepared from mixed hardwoods, as illustrated in Fig. 39, which shows the enlarged region (134 – 150 ppm) of phosphitylated hydroxyl groups in syringyl, guaiacyl and p-hydroxyphenyl moieties. A sharp peak at 174 ppm was attributed to the excess amount of unreacted TMDP indicating that all hydroxyls in kraft lignin were completely derivatized. The nature of the signal at 174 ppm was discussed in detail by Meng et al.46
Purushothaman et al.50 performed an extended NMR study of five technical lignins derived from different botanical origins (herbaceous, hardwood, and softwood) and covering three main industrial pulping methods (soda, kraft, and organosolv). The 31P NMR spectra (Fig. 40) confirmed that the base-catalyzed depolymerization residues were more condensed and had higher phenolic hydroxyl content, as compared to all the feed lignins. In general, a large decrease in aliphatic hydroxyl groups was observed in all lignin residues indicating on its depolymerization.
It is well known that phenolic moieties strongly influence the reactivity and physical properties of lignin, and thus accurate quantification of phenolic groups in lignin is a critical analytical chemistry need. Kenny et al.51 noted that 31P NMR is widely considered to be the standard method for lignin analysis, but this approach uses a hazardous and expensive derivatization agent, and the NMR spectroscopy experiments are time-consuming due to long relaxation times. The authors reported a complementary method that enables accurate identification and quantification of phenolic groups in lignin samples using pentafluoropyridine as a derivatizing reagent followed by the 19F NMR analysis. However, in this approach, the reactivity of the fluorinated compound towards aliphatic OH groups present in lignin is not quantitative, and signals arising from aliphatic hydroxy groups overlap with those from the phenolic groups, so that the analysis of the OH groups is in fact neither quantitative nor selective.
It should also be noted that an enhanced applicability and innovation of 31P NMR in the structural studies of lignins deals with the use of ionic liquids for the solvation of lignosulphonates. Indeed, 31P NMR spectroscopy is the most common and most accurate analytical method for the quantitative determination of the hydroxy group contents in technical lignins. However, for lignosulfonates, liquid-state NMR analysis is often limited due to solubility problems in commonly used solvent systems, which may arise from the wide range of lignosulfonates from different wood sources, pulping conditions, and purification procedures used in biorefineries. Finding a suitable solvent system is even more difficult for chemically modified or fractionated lignosulfonates. Wurzer et al.52 proposed a novel and fast approach for the solubilization of genuine, modified, and fractionated lignosulfonates and subsequent quantitative analysis of hydroxy groups by 31P NMR after derivatization with TMDP.
Another application that deserves to be cited is the successful utilization of 31P NMR for studying the oxidation of kraft lignin in view of preparing lignin vitrimers, which are currently frontier materials in lignin chemistry. It is well known that the valorization of lignin into the commercial products by oxidative conversion is a widely studied strategy. However, in many cases, this approach has limited scope for integration into industrial processes.
In general, 31P NMR spectroscopy is used for either the qualitative or the quantitative determination of hydroxylated moieties as well as for determination of the condensation degree and the syringyl/guaiacyl ratio in lignins.
Of some practical interest is somehow also the benchtop NMR spectroscopy, which is used for the low-cost analytical analysis of lignin (Fig. 41). The benchtop NMR instrumentation offers the possibility to measure 31P NMR spectra in situations where the application of the high-field NMR spectroscopy is unavailable or too expensive. The former technique provides the ability to quantify different classes of compounds less accurately but at lower cost. Further examples of the benchtop NMR investigations of different lignins (like kraft softwood and hardwood lignins, organosolv hardwood lignin, and soda lignin) have been reported by Araneda et al.53
2.4. NMR of other nuclei
Examples of this type are very rare because NMR of other nuclei such as 15N or 19F requires isotopic labelling and/or chemical modification of lignin, which is rather expensive and/or time-consuming.
One of those rare examples appeared during the last 5 years, is the NMR study of the molecular interaction of lignin with amino acid residues of the 15N-labelled carbohydrate-binding module.54 The authors reported NMR-based analyses of the binding sites of a carbohydrate-binding module of cellobiohydrolase 1 from a hypercellulase-producing fungus. A unique method was developed to obtain Trichoderma reesei carbohydrate-binding module 1 at the C-terminal domain (TrCBM1). Chemical shift analysis revealed that TrCBM1 was adsorbed onto cellohexaose in a highly specific manner via two subsites, the flat plane surface and the cleft, which were located on the opposite sides of the protein surface. Spectral assignments of 13C/15N-labelled TrCBM1 were performed by a standard sequential assignment procedure. As an illustrative example, the 1H – 15N HSQC spectrum of 15N-labelled TrCBM1 is shown in Fig. 42. The final backbone assignments of TrCBM1 were unambiguously made by the authors, however with the exception of eight residues.
The second example deals with the 19F NMR study of the phenolic hydroxyl groups in lignin modified by pentafluoropyridine, as was reported in 2023 by Kenny et al.51 The authors proposed a complementary method based on 19F NMR spectroscopy that enabled accurate identification and quantification of phenolic groups in lignin samples by using pentafluoropyridine as a derivatizing reagent. The 19F NMR spectrum of the resulting tetrafluoropyridine ether (Fig. 43) showed signals of two sets of identical fluorine atoms, which gave rise to resonances in two distinct regions, namely a downfield (designated as ‘DF’ in Fig. 43) region, from –89 to –95 ppm (fluorine atoms and integration region are marked blue), and an upfield (designated as ‘UF’ in Fig. 43) region, from –155 to –162 ppm (fluorine atoms and integration region are marked red).
However, the use of 19F NMR spectroscopy is still under revision and it is not at the level of 31P NMR for the quantitative and qualitative analyses of hydroxylated moieties of lignins. The former technique is especially suitable for the lignin-derived products such as pyrolysis oils.
3. Conclusion
Recent applications of the one- and two-dimensional 1H and 13C NMR spectroscopy together with NMR of less popular nuclei such as 31P, 19F and 15N to the structural studies of lignin and lignin-derived products in liquid phase have provided a solid breakthrough and a practical guide to their molecular organization. In this respect, 1H and 13C one-dimensional NMR spectra together with different modifications of two-dimensional 1H – 13C HSQC pulse sequences should first be mentioned. Also, NMR of less popular nuclei such as 31P, 19F, and 15N are of considerable interest and are a guiding thread in the structural studies of lignin.
Quantitative NMR spectra of 1H and 13C together with less popular nuclei such as 31P, 19F, and 15N are extensively used to characterize different structural units of lignin (syringyl, oxidized syringyl, guaiacyl, p-hydroxyphenyl, ferulic acid, cinnamyl alcohol end group, cinnamaldehyde end group, and p-coumaric acid, which provide the main lignin linkages — β-aryl-ether β-O-4, resinol β-β, phenylcoumaran β-5 and spirodienone) with aromatic and saturated carbons spread over many structural moieties. The structure of lignin is highly dependent on its origin. While softwood lignins contain mainly guaiacyl units with negligible amounts of p-hydroxyphenyl groups, hardwood lignins are composed of guaiacyl and syringyl subunits. In turn, grass lignins are composed of guaiacyl, syringyl and p-hydroxyphenyl subunits.
Due to the complexity of lignin and the products of its transformation, their NMR spectra consist of a number of overlapping signals belonging to different structural types. In this regard, comprehensive studies of lignin by NMR spectroscopy over the last few decades and especially the last five years have revealed characteristic functional groups of lignin and lignin-related products together with the spectral regions in which they resonate. These data provide a straightforward insight into the chemical structure and stereochemistry of different lignins.
4. List of abbreviations
Ac-KL — acetylated kraft lignin,
Ac-MWL — acetylated milled wood lignins,
Ac-MWL-aca — acetylated milled wood lignins from acacia,
Ac-MWL-mhw — acetylated milled wood lignins from mixed hardwood,
APTAC — (3-acrylamidopropyl) trimethyl ammonium chloride,
BL — bioenzymatic lignin,
COSY — COrrelated SpectroscopY,
Cu-AHP — Cu-catalyzed alkaline hydrogen peroxide lignin,
DF — downfield,
DFT — density functional theory,
DSS — 2,2-dimethyl-2-silapentane-5-sulfonic acid,
G — guaiacyl,
H — p-hydroxyphenyl,
HL — hydrolyzed lignin,
HMBC — heteronuclear multiple bond correlation,
HSQC — heteronuclear single quantum coherence,
HSQC-TOCSY — Heteronuclear Single Quantum Coherence in combination with TOtal Correlation SpectroscopY,
HSQC0 — heteronuclear single-quantum coherence zero,
Int. Std. — internal standard,
KL — kraft lignin,
MWL — milled wood lignin,
OL — organosolv lignin,
pBA — p-hydroxybenzoates,
pCA — p-coumarates,
S — syringyl,
SBL — sulfobutylated lignin,
SML — sulfomethylated lignin,
TMDP — 1-chloro-4,4',5,5'-tetramethyl-1,3-dioxa-2-phospholane,
TOL — Ttall Ooil Llignin,
TrCBM1 — Trichoderma reesei carbohydrate-binding module 1,
UF — upfield.