Category Archives: Electric double layer

Post-publication review: Tournassat and Steefel (2015), part IV

This is the fourth part of the review of “Ionic Transport in Nano-Porous Clays with Consideration of Electrostatic Effects” (Tournassat and Steefel, 2015) (referred to as TS15 in the following). For background and context please check the first part. This part covers the two sections “Constitutive equations for diffusion in bulk, diffuse layer, and interlayer water” and “Relative contributions of concentration, activity coefficient and diffusion potential gradients to total flux”.

“Constitutive equations for diffusion in bulk, diffuse layer, and interlayer water”

This section presents a mathematical formulation of ion diffusion in bentonite,1 based on the material descriptions in the earlier sections. As we have previously noted, these descriptions are fundamentally flawed in several respects. In particular, compacted bentonite is presented as consisting of stacks (called “particles”), where it is supposed to make sense to differ between external and internal interface water. TS15 also mean that compacted bentonite (sometimes?) is supposed to contain a bulk water phase.

As I have commented on in earlier parts, the only reason I can see to provide this nonsensical material description is as an attempt to to motivate a macroscopic, multi-porous model of bentonite. Here, TS15 make this claim quite explicit, as they write

Still it is possible to define three porosity domains, or water domains, that can be handled separately: the bulk water, the diffuse layer water and the interlayer water, the properties for which can be each defined independently.

This is in essence what I have referred to as “the mainstream view” of bentonite. It is basically “possible” to define anything, but the real question is if provided definitions are relevant and useful. And, as we have already discussed in detail, there is no rationale for introducing these “porosity domains” when modeling water saturated, compacted bentonite.2

Here we will first comment on the conceptual aspects of the provided mathematical description. Thereafter, we will delve into the mathematical formulations, as I’m quite convinced that these are not correct. Unfortunately, this latter part will be quite burdened with equations and notation, but for the motivated reader I think it may be worth going through.

Conceptual aspects

TS15 choose the Nernst-Planck description of ion diffusion, and begin by commenting that this is more rigorous than using Fick’s law. I certainly agree with that a general description of ion diffusion in bentonite requires treating electrostatic couplings between the various system components (TOT-layers, ions). I don’t think, however, that putting up a massively complex description of multi-component diffusion in “three porosity domains” is the appropriate starting point for including such couplings. Since we have every reason to believe that e.g. no bulk water phase is present, I mean that this type of treatment only leads us astray from understanding the actual processes involved (we will return to this aspect in later parts of the review).

Also, as the “Fickian” aspect was the focus of the earlier section on diffusion, a reader of TS15 could here understandably get the impression that a Nernst-Planck treatment will “fix” the “issues” addressed there. But, as we have already discussed in some detail, the shortcomings of the traditional sorption-diffusion model are not solved by including multi-component diffusion in a bulk water phase. They are solved by removing the bulk water phase.

Although the above quotation states that the various “porosity domains” can be handled separately, and that their properties can be defined independently, this is not what is done in TS15. Rather, the treatment of any “porosity domain” assumes equilibrium with the corresponding bulk water phase. The entire description in TS15 is thus fully centered around the bulk water phase.

TS15 insist on treating their model quantities as functions of two spatial coordinates (\(x\) and \(y\)), in what they refer to as a “pseudo 2-D Cartesian system” (I don’t fully understand what that means). Diffusive flux is only assumed to take place in the \(x\)-direction, while the “\(y\)”-dimension is used for stacking the different “porosity domains”. The description can be schematically illustrated like this

Here we have for illustrative purposes discretized the various components in \(x\)- and \(y\)-directions. The bulk water domain is colored blue, the “interlayer” domain pink, and the “diffuse layer” domain green. For a given \(x\)-position, the “diffuse layer” and the “interlayer” domains are assumed to always be in equilibrium with the corresponding bulk water phase. TS15 nowhere consider the length scale in the \(y\)-direction (is it therefore the coordinate system is referred to as “pseudo 2-D”?), which in practice makes the model a collection of 1-dimensional domains that are in equilibrium locally. Note that even though diffusion only is accounted for in the \(x\)-direction, transport occurs also in the \(y\)-direction, as a consequence of equilibration between the “porosity domains”.

This description is exactly what we have investigated in the blog post on why multi-porous models cannot be taken seriously. To summarize what was said there, without properly defining the length scales, it makes no sense to “short-cut” the model in the \(y\)-direction (to assume equilibrium for all domains at the same \(x\) is in a sense equivalent to assuming infinitely high mobility of all components in the \(y\)-direction). And even if we assume that such an assumption is valid — which would mean that we consider a thin strip of stacked parallel domains, where the extension in \(y\) is negligible in comparison to the extension in \(x\) — the resulting model has really nothing to do with actual bentonite. As we concluded in the multi-porosity blog post, the only way to make sense of this type of description is as a set of macroscopic continua that are assumed to be locally in equilibrium. How this equilibrium is supposed to be maintained has never been suggested by any proponent of this description. Note that this description (in particular the existence of a bulk water phase in equilibrium) disqualifies the model for describing swelling and swelling pressure.

Incorrect application of the Nernst-Planck framework

While the presented model makes little sense conceptually, TS15 also fail in applying the Nernst-Planck framework. The problem arises, as far as I can see, from that they don’t fully recognize the role of the electric potential.

As we now begin scrutinizing the details of the formulation, we will suppress the variables \(x\) and \(y\) in order to, hopefully, make the equations a little more readable. It should be understood that any quantity is evaluated for some specific value of \(x\), and that all “porosity domains” are supposed to be in equilibrium at the same value of \(x\).

The electro-chemical potential

In most standard thermodynamic text books we learn that the chemical potential governs the equilibrium associated with mass transfer. Just as e.g. pressure and temperature (which govern mechanical and thermal equilibrium, respectively), the chemical potential is defined by a specific derivative of a thermodynamic potential, e.g.3

\begin{equation} \bar{\mu} = \left ( \frac{\partial G}{\partial n} \right )_{T,p} \tag{1} \end{equation}

where \(G\) is the Gibbs free energy, \(n\) number of moles, \(p\) pressure, and \(T\) temperature. The corresponding mass flux is generally written

\begin{equation} j = -\frac{cD}{RT}\nabla \bar{\mu} \tag{2} \end{equation}

where \(c\) is concentration, \(D\) the diffusion coefficient4, and \(RT\) the usual absolute temperature factor. Here, and in the following, we use the symbol \(\nabla\), which denotes the general gradient operator, but since the model is effectively one-dimensional, it can simply be seen as a neat way of writing \(\partial/\partial x\).

For charged species, it is common to refer to the quantity defined in eq. 1 as the electro-chemical potential, and write it as composed of an “ordinary” and a purely “electrical” part

\begin{equation} \bar{\mu} = \mu + zF\Psi \tag{3} \end{equation}

where \(z\) is the charge number of the considered species, \(F\) is the Faraday constant and \(\Psi\) is the electric potential. The “ordinary” chemical potential \(\mu\) (without bar) is, perhaps a bit confusingly, also often referred to as the chemical potential. I will here continue to refer to this part as “ordinary”. The “ordinary” chemical potential is furthermore conventionally expressed in terms of a reference potential (\(\mu^0\)) and an activity \(a\)

\begin{equation} \mu = \mu^0 + RT\ln a \tag{4} \end{equation}

A lot can be said about the decomposition in eq. 3, but it is clear that singling out an electric potential term is useful in e.g. electrochemistry or for describing charged clay. It should, however, be kept in mind that mass transfer is fundamentally governed by gradients in \(\bar{\mu}\); always keeping eqs. 1 and 2 in mind will avoid us from making mistakes, because the mass transfer rate relates to the “total” (i.e. electrochemical) potential, and for charge neutral species the description reduces to gradients in the “ordinary” chemical potential.

Nernst-Planck flux

Combining eqs. 3 and 4 gives

\begin{equation} \bar{\mu} = \mu^0+RT \ln a + zF\Psi \tag{5} \end{equation}

with the corresponding flux (eq. 2)

\begin{equation} j = -cD\nabla \ln a -\frac{cDzF}{RT} \nabla \Psi \end{equation}

Expressing the activity in terms of an activity coefficient, \(a = \gamma c\), the flux can also be written (TS15 are quite fond of including activity coefficients explicitly)

\begin{equation} j = -D\nabla c -cD\nabla \ln \gamma -\frac{cDzF}{RT} \nabla \Psi \tag{6} \end{equation}

Considering an arbitrary set of diffusing charged species (using the index \(i\)), and utilizing that the electric current is zero, lead to an expression for the electric potential gradient

\begin{equation} \nabla \Psi = -\frac{RT}{F}\frac{\sum z_iD_i \left ( \nabla c_i + c_i \nabla \ln \gamma_i \right )}{\sum z_i^2 D_i c_i} \tag{7} \end{equation}

Misunderstanding the electric potential

For the bulk water phase, TS15 indeed provide an expression for the flux that is essentially the same as eq. 6 (their eq. 37), and which they refer to as the Nernst-Planck equation. They claim, however, that the electrochemical potential in this case lack an electric potential term (my emphasis)5,6

In absence of an external electric potential, the electrochemical potential in the bulk water can be expressed as (Ben-Yaakov 1981; Lasaga 1981) \begin{equation} \bar{\mu}_\mathrm{bulk} = \mu^0+RT \ln a_\mathrm{bulk} \end{equation}

But even without an externally applied electric field,7 a zero bulk electric potential cannot be assumed, of course, if the goal is to treat individual ion mobilities; as just shown, the gradient in electric potential that appears in eq. 6 is a result of a corresponding term in the electrochemical potential (eq. 5). Oddly, TS15 seem to treat the electric potential term in the flux as a quantity unrelated to the electrochemical potential, giving it a separate symbol, \(^\mathrm{b}\Psi_\mathrm{diff}\), and writing

\(^\mathrm{b}\Psi_\mathrm{diff}\) is the diffusion potential that arises because of the diffusion of charged species at different rates.

It may be natural for a reader at this point to simply assume that TS15 have missed writing out the term \(zF^\mathrm{b}\Psi_\mathrm{diff}\) when stating the electrochemical potential. But this seems to be a genuine misunderstanding rather than a mistake/typo, because the pattern repeats in the derivation of the flux in the other “porosity domains”.

For e.g. the “diffuse layer”,8 TS15 recognize the presence of an electric potential in the expression for the electrochemical potential, writing it (this is more awkwardly expressed in eq. 42 in TS15)

\begin{equation} \bar{\mu}_\mathrm{DL} = \mu^0 + RT\ln a_\mathrm{DL} + zF\Psi_\mathrm{DL} \tag{8} \end{equation}

where index “DL” refers to quantities in the “diffuse layer”.

However, the corresponding flux expression contains a different potential, labelled \(^\mathrm{DL}\Psi_\mathrm{diff}\) (eq. 46 in TS15)

\begin{equation} j_\mathrm{DL} = -\frac{c_\mathrm{DL}D_\mathrm{DL}}{RT}\nabla \bar{\mu}_\mathrm{DL} -\frac{c_\mathrm{DL}D_\mathrm{DL}zF}{RT} \nabla ^{\mathrm{DL}}\Psi_\mathrm{diff} \tag{9} \end{equation}

TS15 don’t further comment what \(^{\mathrm{DL}}\Psi_\mathrm{diff}\) is supposed to represent, but it must reasonably be understood as “the diffusion potential that arises because of the diffusion of charged species at different rates”, in analogy with what was claimed for the bulk water phase. Note that when eq. 8 is combined with eq. 9, the flux expression contains two different electric potential gradients! (TS15 never address this oddity)

It is thus quite clear that TS15 misunderstand the function of the electric potential in the Nernst-Planck framework. When presenting the expression for the “diffuse layer” flux (eq. 9), they also refer to Appelo and Wersin (2007), who, in turn, express the misconception explicitly9

The gradient of the electrical potential [in the expression for the flux] originates from different transport velocities of ions, which creates charge and an associated potential. This electrical potential may differ from the one used in [the expression for the electro-chemical potential], which comes from a charged surface and is fixed, without inducing electrical current.

I cannot understand this passage in any other way than that Appelo and Wersin (2007) are under the impression that different electric potentials can simultaneously act independently in a given point. And it seems like TS15 are under some similar impression.

This ignorance leads to more errors in the description of the “diffuse layer” in TS15. We should remember that the promoted model requires the “diffuse layer” and bulk water domains to be in equilibrium (for the same coordinate value \(x\)). When TS15 express this condition, i.e. \(\bar{\mu}_\mathrm{DL} = \bar{\mu}_\mathrm{bulk}\), they again leave out the electric potential in the bulk water (eq. 42 in TS 15)

\begin{equation} \mu^0 + RT\ln a_\mathrm{DL} + zF\Psi_\mathrm{DL} = \mu^0 + RT\ln a_\mathrm{bulk}\;\:\;\;\;\;\;\mathrm{(WRONG)} \tag{10} \end{equation}

eq. 10 can be rewritten

\begin{equation} a_\mathrm{DL} = a_\mathrm{bulk}\cdot e^{-\frac{zF}{RT}\Psi_\mathrm{DL}} \;\:\;\;\;\;\;\mathrm{(WRONG)} \tag{11} \end{equation}

TS15 utilize a simplified version of eq. 11, expressed in terms of concentrations rather than activities, by assuming identical activity coefficients in the two domains10

\begin{equation} c_\mathrm{DL} = c_\mathrm{bulk}\cdot e^{-\frac{zF}{RT}\Psi_\mathrm{DL}} \;\:\;\;\;\;\;\mathrm{(WRONG)} \tag{12} \end{equation}

Note that the exponential in eqs. 11 and 12 actually should contain the electric potential difference between “diffuse layer” and bulk (see below).

As TS15 have not included any electric potential in the bulk water phase, they continue by incorrectly substituting \(RT\nabla \ln a_\mathrm{bulk}\) for \(\nabla\bar{\mu}_\mathrm{DL}\) in eq. 9 (i.e. they use the incorrect relation in eq. 10), giving (TS15 eq. 47)

\begin{equation} j_\mathrm{DL} = -c_\mathrm{DL}D_\mathrm{DL}\nabla \ln a_\mathrm{bulk} -\frac{c_\mathrm{DL}D_\mathrm{DL}zF}{RT} \nabla ^{\mathrm{DL}}\Psi_\mathrm{diff} \;\;\;\;\mathrm{(WRONG)} \tag{13} \end{equation}

Note that this additional error “solves” the earlier pointed out problem of having two electric potential gradients.

By utilizing the requirement of zero electric current, eq. 13 gives

\begin{equation} \nabla ^{\mathrm{DL}}\Psi_\mathrm{diff} = -\frac{RT}{F}\frac{\sum z_iD_{\mathrm{DL},i}c_{\mathrm{DL},i} \nabla \ln a_{\mathrm{bulk},i} }{\sum z_i^2 D_{\mathrm{DL},i} c_{\mathrm{DL},i}} \;\:\;\;\;\;\;\mathrm{(WRONG)} \tag{14} \end{equation}

By substituting eq. 12 into this expression, we end up with the formula for the gradient of the mysterious potential \(^{\mathrm{DL}}\Psi_\mathrm{diff}\) (TS15 eq. 48)

\begin{equation} \nabla ^{\mathrm{DL}}\Psi_\mathrm{diff} = \end{equation} \begin{equation} -\frac{RT}{F}\frac{\sum z_iD_{\mathrm{DL},i} e^{-\frac{zF}{RT}\Psi_\mathrm{DL}} \left ( \nabla c_{\mathrm{bulk},i} + c_{\mathrm{bulk},i}\nabla \ln \gamma_{\mathrm{bulk},i} \right )} {\sum z_i^2 D_{\mathrm{DL},i} e^{-\frac{zF}{RT}\Psi_\mathrm{DL}}c_{\mathrm{bulk},i}} \;\mathrm{(WRONG)} \tag{15} \end{equation}

At face value, eq. 15 is a quite weirdly looking equation, as it relates two electric potentials — \(^{\mathrm{DL}}\Psi_\mathrm{diff}\) and \(\Psi_\mathrm{DL}\) — that both are supposed to be associated with the “diffuse layer”. But, as we will see below, there is actually a way to make some sense of eq. 15, by completely reinterpreting what these potentials represent.

A “correct” formulation

Most of the errors pointed out above are corrected by including the electric potential in the bulk water and writing the condition for equilibrium as (compare eq. 10)

\begin{equation} \mu^0 + RT\ln a_\mathrm{DL} + zF\Psi_\mathrm{DL} = \mu^0 + RT\ln a_\mathrm{bulk} + zF\Psi_\mathrm{bulk} \tag{16} \end{equation}

Writing the electric potential difference between “diffuse layer” and bulk water as11

\begin{equation} \Psi^\star \equiv \Psi_\mathrm{DL} – \Psi_\mathrm{bulk} \tag{17} \end{equation}

eq. 16 can be rewritten

\begin{equation} a_\mathrm{DL} = a_\mathrm{bulk}\cdot e^{-\frac{zF}{RT}\Psi^\star} \tag{18} \end{equation}

Note that, when correctly derived, eq. 18 naturally contains the difference in electric potential between “diffuse layer” and bulk.

The flux in the “diffuse layer” is (eq. 6)

\begin{equation} j_\mathrm{DL} = -c_\mathrm{DL} D_\mathrm{DL} \nabla \ln a_\mathrm{DL} – \frac{c_\mathrm{DL} D_\mathrm{DL} zF}{RT} \nabla \Psi_\mathrm{DL} \tag{19} \end{equation}

But if we now plug in eq. 18 in eq. 19 we of course get

\begin{equation} j_\mathrm{DL} = -c_\mathrm{DL} D_\mathrm{DL} \nabla \ln a_\mathrm{bulk} + \frac{c_\mathrm{DL} D_\mathrm{DL} zF}{RT} \nabla \Psi^\star – \frac{c_\mathrm{DL} D_\mathrm{DL} zF}{RT} \nabla \Psi_\mathrm{DL}, \end{equation} which can be simplified to \begin{equation} j_\mathrm{DL} = -c_\mathrm{DL} D_\mathrm{DL} \nabla \ln a_\mathrm{bulk} – \frac{c_\mathrm{DL} D_\mathrm{DL} Fz}{RT} \nabla \Psi_\mathrm{bulk}, \tag{20} \end{equation} and, by identifying the electro-chemical potential in the bulk \begin{equation} j_\mathrm{DL} = -\frac{c_\mathrm{DL} D_\mathrm{DL}}{RT} \left ( RT \nabla \ln a_\mathrm{bulk} + Fz \nabla \Psi_\mathrm{bulk} \right ) = -\frac{c_\mathrm{DL} D_\mathrm{DL}}{RT} \nabla \bar{\mu}_\mathrm{bulk} \end{equation}

This whole “derivation” leads back to the rather trivial result that the flux in the diffuse layer is given by eq. 2, which we could have written down from the start! (because the model assumes \(\bar{\mu}_\mathrm{bulk} = \bar{\mu}_\mathrm{DL}\); eq. 16)

As TS15 have established the expression for the gradient of the electrochemical potential in the bulk water phase (which is implicit in their eq. 40), there should strictly be no need to consider a new expression for the same quantity in any other phase. Rather, they could simply have used the bulk water expression in all “porosity domains”, as a consequence of the assumption that these are all supposed to be in equilibrium. In a sense, this is actually what is done in TS15 — mainly by chance! — by establishing eq. 13 (their eq. 47).

Comparing with eq. 20, we see that the incorrect eq. 13 can be “saved” by reinterpreting \(^{\mathrm{DL}}\Psi_\mathrm{diff}\) as \(\Psi_\mathrm{bulk}\). Similarly, as TS15 assume the bulk electric potential to be zero, eq. 15 can be “saved” by also reinterpreting \(\Psi_\mathrm{DL}\) as \(\Psi^\star\) in that expression.12 I find this quite hilarious: By making several errors in its derivation, eq. 15 is in a sense a correct expression for the electric potential gradient in the bulk water — a potential that TS15 has put identically equal to zero.

But even if the total flux in the “diffuse layer” is correctly given by combining eqs. 12, 13 and 15 (and by completely ignoring what TS15 mean \(\Psi_\mathrm{DL}\) and \(^{\mathrm{DL}}\Psi_\mathrm{diff}\) represent), TS15 continue by defining the separate terms in eq. 13 as contributions from the “concentration gradient”, and the “diffusion potential”. As we will explore next, this interpretation fails miserably.

“Relative contributions of concentration, activity coefficient and diffusion potential gradients to total flux”

According to TS15, the “concentration gradient” and the “diffusion potential” contributions to the “diffuse layer” flux are given by, respectively (TS15 eq. 50 and below)

\begin{equation} j_\mathrm{conc,TS15} = -D_\mathrm{DL}A\nabla c_\mathrm{bulk} \tag{21} \end{equation}

and

\begin{equation} j_\mathrm{E,TS15} = zD_\mathrm{DL}c_\mathrm{bulk}A\frac{\sum z_iD_{\mathrm{DL},i}A_i\left ( \nabla c_{\mathrm{bulk},i} + c_{\mathrm{bulk},i} \nabla \ln \gamma_{\mathrm{bulk},i} \right )} {\sum z_i^2 D_{\mathrm{DL},i} A_i c_{\mathrm{bulk},i}} \tag{22} \end{equation}

Here we use the index “conc” for the “concentration gradient” contribution, and “E” for the “diffusion potential” contribution. \(A\) is referred to as a “DL enrichment factor”, and is essentially defined as the concentration ratio \(c_\mathrm{DL}/c_\mathrm{bulk}\). Using the incorrect relation in eq. 12, TS15 write these as \(A = e^{-\frac{zF}{RT}\Psi_\mathrm{DL}}\), but, as we see from eq. 18, they are really given by13 (we continue assuming identical activity coefficients in the two domains)

\begin{equation} A = e^{-\frac{zF}{RT}\Psi^\star} \tag{23} \end{equation}

TS15 also define a third contribution, related to the gradient of the bulk water activity coefficient. Here we will not further discuss this contribution, as it does not give any additional insight. Moreover, since TS15 anyway derive their model under the unjustified assumption that activity coefficients in the “diffuse layer” and the bulk water are identical, I cannot see the use of including their spatial variation in the description.14 (TS15 spend a couple of pages on activity coefficient models that we will ignore.)

Examples

To explore the various couplings in the presented model, TS15 apply the Nernst-Planck framework in three examples. We can see immediately from the presented graphs that their partitioning of the total flux in “concentration gradient” and “diffusion potential” contributions makes no sense.

“Example 2” imposes constant concentration gradients in the bulk water of NaCl and corresponding \(^{22}\mathrm{Na}^+\) and \(^{36}\mathrm{Cl}^-\) tracers; the NaCl concentration drops from 0.1 M to 0.001 M, and the tracer concentrations drop from 10-9 M to 10-11 M (domain length is 10 mm).

The corresponding sodium and chloride tracer concentrations in the “diffuse layer” look like this15

These profiles make sense: bulk water ionic strength decreases with distance, but so do the tracer concentrations. For the process of accumulating \(^{22}\mathrm{Na}^+\) in the diffuse layer, these two effects oppose each other, resulting in a quite flat profile. We thus expect the corresponding “concentration gradient” contribution to the flux to be quite moderate, and to fall off with distance (as the profile flattens with distances). The corresponding flux graph presented in TS15, however, looks completely different16

This plot makes no sense: The “concentration gradient” contribution is seen to increase quite dramatically with distance, rather than falling off. The value of this contribution is also orders of magnitude too large, given the imposed sodium diffusion coefficient of 1.33⋅10-10 m2/s. Moreover, the “concentration gradient” contribution is “compensated” by an equally nonsensical “diffusion potential” contribution. Note, for instance, that the “diffusion potential” contribution is negative, which implies that the corresponding electric field is supposed to be directed towards higher concentrations. This can certainly not be the case, as the electric potential gradient is caused by the negative ion having higher mobility than the positive ion (chloride diffusivity is set to 2.03⋅10-10 m2/s).

In “example 3”, the tracer concentrations in the bulk is set to a constant value (1⋅10-9 M), while the same concentration gradient as in “example 2” is maintained for the main NaCl electrolyte (from 0.1 M to 0.001 M). We thereby expect the corresponding \(^{22}\mathrm{Na}^+\) concentration in the “diffuse layer” to strongly increase with distance, which is also what is presented in TS15

while the corresponding flux plot looks like this16

This plot is almost comically absurd. According to TS15, the highly skewed concentration profile above is supposed to give no (zero, nil, 0) contribution to the flux (we see from eq. 21 that this is a consequence of that this “contribution” is directly proportional to the concentration gradient in the bulk). Instead, the huge flux is supposed to be caused entirely by an electric field that has the wrong direction! I can’t even really begin to imagine how these two plots have ended up next to each other in a peer-reviewed published article.

Note that the flux associated with a concentration gradient is what we may reasonably call a “Fickian” contribution. If TS15 mean (and they do) that these examples demonstrate how ion diffusion in bentonite works, we can understand the focus on the “Fickian” aspect at the beginning of the article (covered here). But the only reasonable response to these outlandish results is that they demonstrate that the definitions of eqs. 21 and 22 simply make no sense.

The real concentration gradient and electric field contributions

The only reasonable way to define “concentration gradient” and “diffusion potential” contributions to the “diffuse layer” flux is as the two terms in eq. 19, respectively. To rewrite these, we utilize eq. 16 (or 18), giving for the “concentration gradient” contribution (we continue ignoring activity coefficients)

\begin{equation} j_\mathrm{conc, corr} = -c_\mathrm{DL} D_\mathrm{DL} \nabla \ln c_\mathrm{DL} = \end{equation} \begin{equation} -c_\mathrm{DL} D_\mathrm{DL} \nabla \ln c_\mathrm{bulk} + \frac{c_\mathrm{DL} D_\mathrm{DL}zF}{RT} \nabla \Psi^\star = \end{equation} \begin{equation} -D_\mathrm{DL} A \nabla c_\mathrm{bulk} + \frac{A c_\mathrm{bulk} D_\mathrm{DL}zF}{RT} \nabla \Psi^\star = j_\mathrm{conc, TS15} + j^\star \tag{24} \end{equation}

where we have defined

\begin{equation} j^\star \equiv \frac{A c_\mathrm{bulk} D_\mathrm{DL}zF}{RT} \nabla \Psi^\star. \tag{25} \end{equation}

In the same manner, the correct “diffusion potential” contribution is

\begin{equation} j_\mathrm{E, corr} = – \frac{c_\mathrm{DL} D_\mathrm{DL} zF}{RT} \nabla \Psi_\mathrm{DL} = \end{equation} \begin{equation} – \frac{D_\mathrm{DL}Ac_\mathrm{bulk} zF}{RT} \nabla \Psi_\mathrm{bulk} – \frac{D_\mathrm{DL}Ac_\mathrm{bulk} zF}{RT} \nabla \Psi^\star = \end{equation} \begin{equation} D_\mathrm{DL}Ac_\mathrm{bulk} z \frac{\sum z_iD_iA_i\left ( \nabla c_{\mathrm{bulk},i} + \nabla c_{\mathrm{bulk},i} \ln \gamma_{\mathrm{bulk},i} \right )} {\sum z_i^2 D_i A_i c_{\mathrm{bulk},i}} – \frac{D_\mathrm{DL}Ac_\mathrm{bulk} zF}{RT} \nabla \Psi^\star = \end{equation} \begin{equation} j_\mathrm{E, TS15} – j^\star \tag{26} \end{equation}

where we have utilized that \(\nabla \Psi_\mathrm{bulk}\) is actually what is expressed in eq. 15 (where \(\Psi_\mathrm{DL}\) should be replaced by \(\Psi^\star\)).

We note that, to compensate the nonsensical expressions given in TS15, we should add the term \(j^\star\) (eq. 25) to the “concentration concentration” contribution (eq. 21), and subtract the same term from the “diffusion potential” contribution (eq. 22). Making these corrections gives the following components of the tracer fluxes in “example 2”

This is an infinitely more reasonable situation than what is depicted in TS15. Although the sodium flux has a non-negligible contribution from the electric field, the larger contribution is still from the concentration gradient (and none of these are gigantic terms that cancel). The concentration contribution also falls off with distance, in accordance with the shape of the concentration profile.

For chloride, the field contribution to the flux is negligible, i.e. this flux is essentially fully governed by the concentration gradient. The electric field contributions for both ions are also seen to have the correct signs: the electric field is directed from high to low concentration, and mainly functions to boost the sodium transport, in order to “keep up” with the faster chloride ions.

For “example 3” we get the following picture

The corrected \(^{22}\mathrm{Na}^+\) flux is essentially fully due to the concentration gradient, in absolute contrast to what is concluded in TS15, who mean that this flux is completely governed by an electric field in the wrong direction. Also the \(^{36}\mathrm{Cl}^-\) transport is basically solely governed by the concentration gradient, rather than by an incorrectly directed electric field (as stated in TS15). In conclusion, most of the “diffuse layer” diffusion in these examples can actually be classified as “Fickian”.

We may also investigate the electric potential profile in the “diffuse layer” in both of these examples (this is the same in the two cases, as the main electrolyte distribution does not change)

Here we have chosen the reference \(\Psi_\mathrm{DL}(0) = 0\). The total potential drop is only about 1 mV. Such a relatively small drop is reasonable because the denominator in the Nernst-Planck expression for the electric potential gradient (eq. 7) will always be large due to the ever-present counter-ions in the “diffuse layer”. The electric potential gradient — and thus the corresponding electric potential drop — is therefore suppressed. Physically, this means that since many (equally charged) charge carriers are always present, smaller potential differences are required to cancel electric currents caused by differences in mobility (a “diffuse layer” is a quite good conductor).

Even worse problems?

Even though some sense can be made out of the derived expression for the flux in the “diffuse layer” domain — by completely reinterpreting the electric potentials involved — it seems as the overall model is too constrained. Specifically, for an imposed set of concentration profiles in one domain it is not possible, as far as I can see, to simultaneously have zero current in all domains, while also maintaining (Donnan) equilibrium. As this blog post is already quite massive, I will elaborate on this point in the next part of the review.

Summary

Here is an attempt to sum up the main messages of this blog post.

  • Conceptually, the clay model presented in TS15 is exactly what was discussed in the blog post on multi-porous models, and the same issues that are identified there are present here. In particular, no attention is paid to length scales (perhaps that is why TS15 call the coordinate system “pseudo-2D”…), and no mechanism whatsoever is suggested for how the different diffusing domains are supposed to maintain equilibrium.
  • Mathematically (or perhaps physically), the presented Nernst-Planck flux expressions are incorrectly derived. The source of the error, as far as I can see, appears to be a misunderstanding of how electric potentials function.
  • TS15 define “contributions” to the “diffuse layer” flux, claimed to be related to the concentration gradient and the “diffusion potential” (i.e. the electric field), respectively. It is, however, quite obvious that these “contributions” are completely nonsensical: highly skewed concentration profiles are claimed to not have any concentration gradient contributions, and several “diffusion potential” “contributions” have the electric field in the wrong direction. We have shown that these “contributions” can be corrected, where the correction term involves the gradient of the Donnan potential. With these corrections, fluxes in the provided examples must be interpreted completely differently (they’re basically “Fickian”).
  • As far as I can see, the proposed model has even larger problems, related to the imposed Donnan equilibrium. We will address this issue in the next part.

Footnotes

[1] As I have commented in the earlier parts: TS15 are fond of using the general terms “clays” and “clay minerals”, while it is clear that the publication mainly focus on systems with substantial ion exchange capacity and swelling properties. Here we will continue to use the term “bentonite” for these systems, and ignore the frequent references in TS15 to more general terms.

[2] It is of course crucial to include a component that represents compartments where the exchangeable ions reside. This is done in the TS15 model by both the “diffuse layer water” and the “interlayer water” domains. But the distinction made between these domains is based on the flawed “stack” concept.

[3] This equation assumes a single component. The formulation of the Nernst-Planck framework naturally involves several different charged species. When several species are involved, we will indicate this with an index \(i\) in the equations.

[4] In some of their equations, TS15 use (electrical) mobility, \(u\), rather than diffusivity, \(D\). These quantities are related via the Einstein relation \(D = uRT/(F|z|)\). I don’t see the point in involving \(u\), as it typically makes expressions even more cluttered, and since we here ultimately are interested in diffusion coefficients anyway.

[5] In order to not cause too much confusion, and to try to simplify a bit, I use slightly different mathematical notation than what is actually used in the quotation. In particular, I use the notation \(\bar{\mu}\) for the electro-chemical potential, while TS15 don’t use a bar (\(\mu\)). I also try to avoid the index \(i\) as much as possible.

[6] Fun fact: this statement is nowhere found in neither (Ben-Yaakov, 1981) nor (Lasaga, 1981) (at least I can’t find any).

[7] I whined about electrostatics being poorly understood in the bentonite research field in an earlier part of this review, but here is more fuel for my argument. The statement “absence of an [external] potential” has no physical meaning, as we are free to choose the reference point (the absolute value of a potential has no physical meaning). What TS15 must mean in the quote is “the absence of an external electric field”. The electric field relates to the potential as \(E = -\nabla \Psi\). Thus, all gradients of electric potentials that occur in this text are synonymous with electric fields (electric fields drive electric currents).

[8] This post focus almost entirely on the “diffuse layer” domain, but a similar analysis can be made for the “interlayer” domain. This is left as an exercise for the reader.

[9] It should of course also rather read “…which creates a charge separation and an associated potential gradient.”, or simply “…which induces an electric field.” (showing that this part of the sentence is redundant). See also footnote 7.

[10] TS15 write cryptically that equating the activity coefficients (and the reference potentials) in bulk and “diffuse layer” is assumed “by following the [Modified Gouy-Chapman] model”. But I don’t see why this model has to be alluded to here, these assumptions can just be made.

[11] Yes, this is a Donnan potential. We will discuss this more in the next part part of the review.

[12] Again, this is related to Donnan equilibrium between the bulk and “diffuse layer” domains, that we will discuss further in the next part.

[13] This is \(f_D^{-z} \), where \(f_D\) is the Donnan factor.

[14] Rather, I would argue for that the activity coefficients in a “diffuse layer” domain will be quite insensitive to the imposed external (bulk) concentration, for details see Birgersson (2017).

[15] In producing these graphs we have used the Donnan equilibrium framework to calculate the “diffuse layer” concentrations. These are given from eq. 23, where \(\Psi^\star\) is calculated from

\begin{equation} f_D = e^\frac{F\Psi^\star}{RT} = – \frac{q}{2c_\mathrm{bulk}} +
\sqrt{\frac{q^2}{4c_\mathrm{bulk}^2} + 1} \end{equation}

where \(q\) is a measure of the structural charge in the “diffuse layer”, in the examples set to \(q\) = 0.33 M.

[16] Note that I have not included activity coefficient gradients when producing the plots in this section. They may therefore differ slightly from the published plots. This does not in any way influence the conclusions drawn here.

Post-publication review: Tournassat and Steefel (2015), part III


This is the third part of the review of “Ionic Transport in Nano-Porous Clays with Consideration of Electrostatic Effects” (Tournassat and Steefel, 2015) (referred to as TS15 in the following). For background and context please check the first part. In this part, we wrap up our discussion of the section “Clay mineral surfaces and related properties”.1

“Adsorption processes in clays”

The subsection we focus on here, “Adsorption processes in clays”, contains very little descriptions of fundamental properties of bentonite, and is instead almost exclusively devoted to detailed discussions on various models. As an example, already in the first paragraph the text digresses into dealing with the problem of defining “surface species activity” in the “DDL”2 model…

TS15 discuss adsorption separately on “outer basal surfaces”, “interlayer basal surfaces”, and “edge surfaces”. Note that the distinction between “outer” and “interlayer” basal surfaces requires that we view the compacted bentonite as composed of stacks (referred to as “particles” in TS 15). But this idea is just fantasy, as we have discussed in the previous part and in a separate blog post. Moreover, central to the description of adsorption processes in TS15 is the idea of a Stern layer. This concept was briefly introduced in the previous subsection (“Electrostatic properties, high surface area, and anion exclusion”)

The [electrical double layer] can be conceptually subdivided into a Stern layer containing inner- and outer-sphere surface complexes […] and a diffuse layer (DL) containing ions that interact with the surface through long-range electrostatics […].

The next time this concept is brought up is at the beginning of the discussion on adsorption on “outer basal surfaces”

The high specific basal surface area and their electrostatic properties give rise to adsorption processes in the diffuse layer, but also in the Stern layer.

I have written a separate blog post arguing for that the idea of Stern layers on montmorillonite basal surfaces is unjustified. Note that the notion of Stern layers on montmorillonite basal surfaces in the contemporary bentonite literature de facto means that these surfaces are supposed to be full-fledged chemical systems. In particular, the basal surface is supposed to contain localized “sites” that interact generally with ions to form surface complexes and that can involve covalent bonding.

Note further that the Stern layer was originally introduced as a model (or a model component) that extends the Gouy-Chapman description of the electric double layer. TS15, on the other hand, use the term “Stern layer” to refer to an actual physical structural component. And just as in the case of several other “components” that has been introduced in the article (“particles”, “inter-particle water”, “free or bulk water”, “aggregates”…), the existence of a Stern layer is just declared rather than argued for. And just like with the other components, these are not universally adopted. I don’t think it is appropriate to include Stern layers in this way in a review article when established parts of the colloid science community refer to them as an “intellectual cul de sac”.

So in order to even begin to criticize what TS15 actually write about adsorption processes here, one has to accept both the flawed idea of stacks as fundamental structural units and the far from universally accepted idea of Stern layers on montmorillonite basal surfaces. I will therefore refrain from doing that, and simply proclaim that I don’t accept the premises. (I believe I will have reasons to return to the models presented here when reviewing later sections of TS15.)

Additional remarks

But I think it is worth reminding ourselves that at the end of the previous section (covered in part I) we were promised that this section should qualitatively link “fundamental properties of the clay minerals” to the diffusional behavior of compacted bentonite. A reader of TS15 will thus expect this section to contain, in particular, a reasonable description and discussion on how compacted montmorillonite works. Instead a very specific (and flawed) model is imposed on the reader: the first subsection (covered in part II), introduced the fictional stack concept, and gave a confused and irrelevant explanation of anion exclusion; the presently discussed subsection is centered around Stern layers.

If the authors truly did what they claimed, in this section they should have addressed the consequences of montmorillonite TOT-layers being charged — a universally accepted fact — without introducing further assumptions. This would naturally lead to a discussion on osmosis, swelling, swelling pressure and semi-permeable boundary conditions (all simple empirical facts). These topics, in turn, should lead to considerations of e.g. ion mobility and chemical interface equilibrium. Not a single one of these topics are, in any meaningful sense, actually addressed in this section.

Before ending this part of the review, I also would like to focus on what is being said bout “interlayers”. We should keep in mind that TS15 — together with a large part of the contemporary bentonite research community — assume “interlayers” to be something different than simply the space between adjacent basal surfaces: these are supposed to be internal to the fantasy construct of a stack. When discussing adsorption in these presumed compartments they write

The interlayer space can be seen as an extreme case where the diffuse layer vanishes leaving only the Stern layer of the adjacent basal surfaces.

Of everything I’ve read in the bentonite literature, this is the closest I’ve come to see some actual description of what the fundamental difference between an “outer basal surface” and an “interlayer” is supposed to be. But let’s think this through. TS15 have claimed that an electric double layer is composed of a Stern layer and a diffuse layer, and we have vaugley been told that ions in the Stern layer are immobile. The above quotation thus implicitly says that that “interlayer” ions are not mobile, and that diffuse layers are only supposed to exist on “outer basal surfaces” (which, remember, is a fantasy component). But — disregarding that the stack-internal “interlayer” also is a fantasy concept — it is an indisputable experimental fact that has been known for a long time that interlayers provide the only relevant transport mechanism in compacted bentonite.

Thus, either TS15 here provide us with yet another incorrect description of the behavior of compacted bentonite (that “interlayer” ions are immobile) or they are claiming, somewhat contradictorily, that Stern layer ions are mobile after all. But if Stern layer ions diffuse, such a structural component could reasonably not have been singled out in the first place! (The diffuse layer is supposed to have “vanished”.) As with many other issues in TS15, this question is left vague and unanswered.3 The continuation of the text does not make things clearer

For this reason, the interlayer space is often considered to be completely free of anions (Tournassat and Appelo 2011), although this hypothesis is still controversial (Rotenberg et al. 2007c; Birgersson and Karnland 2009).

An interlayer completely devoid of anions certainly play by other rules than an “ordinary” electric double layer. Does this mean that TS15 assume “interlayer” ions to be immobile?4 Anyway, it is an indisputable experimental fact that anions occupy interlayers, and I find it quite bizarre to find myself referenced in connection with the “controversial hypothesis”. The idea of compartments completely devoid of anions is widespread in the contemporary bentonite research community, but no one has ever suggested a mechanism for how such an exclusion is supposed to work; here, it apparently should be related to “Stern layers” in some (unexplained) manner. At the same time, the simplest application of Donnan equilibrium principally explains e.g. the behavior of the steady-state flux in anion tracer through-diffusion tests.

Speaking of controversial, I find it highly problematic that the authors, only the year after the publication of TS15, in a molecular dynamics (MD) study on montmorillonite interlayers,5 conclude

The agreement between [Poisson-Boltzmann] calculations and MD simulation predictions was somewhat worse in the case of the \(\mathrm{Cl^-}\) concentration profiles than in the case of the \(\mathrm{Na^+}\) profiles (Figure 3), perhaps reflecting the poorer statistics for interlayer Cl concentrations or the influence of short-range ion-ion interactions (and possibly ion- water interactions, as noted above) that are not accounted for in the [Poisson-Boltzmann] equation. Nevertheless, reasonable quantitative agreement was found (Table 2).

Here they acknowledge not only that anions do occupy interlayers, but also that the interlayer plays by the same rules as the “ordinary” electric double layer (“Poisson-Boltzmann calculations”). What happened to the “vanishing” diffuse layer, and to “considering” the interlayer to be “completely free of anions”? I find it quite outrageous that they fail to acknowledge these blatantly mixed messages with so much as a single word.

Update (251106): Part IV of this review is found here.

Footnotes

[1] As I have commented in the earlier parts: TS15 are fond of using the very general terms “clays” and “clay minerals”, while it is clear that the publication mainly focus on systems with substantial ion exchange capacity and swelling properties. Here we will continue to use the term “bentonite” for these systems, and ignore the frequent references in TS15 to more general terms.

[2] For some reason, “DDL” is short for (the very generically sounding) “double layer model”. Why not “DLM”?

[3] Spoiler: in later sections describing models, TS15 allow for the possibility of transport in “interlayers”.

[4] Questions like these can often not be answered because so many statements in TS15 are vague and ambiguous. In this discussion we have to refer to statements such as (my emphasis)

  • “The EDL can be subdivided into a Stern layer […] and a diffuse layer […].”
  • “The interlayer can be seen as an extreme case where the diffuse layer vanishes […]”
  • “The interlayer space is often considered to be completely free of anions […]”

I get annoyed by too much of such language in scientific publications.

[5] This study is discussed in a previous blog post, on molecular dynamics simulations of montmorillonite .

Post-publication review: Tournassat and Steefel (2015), part II

This is the second part of the review of “Ionic Transport in Nano-Porous Clays with Consideration of Electrostatic Effects” (Tournassat and Steefel, 2015) (referred to as TS15 in the following). For background and context please check the first part. That part covered the introduction and the section “Classical Fickan Diffusion Theory”. The next section is titled “Clay mineral surfaces and related properties”, and is further partitioned into two subsections. Here we exclusively deal with the first one of these subsections: “Electrostatic properties, high surface area, and anion exclusion”. It only covers three and a half journal pages, but since the article here goes completely off the rails, there is much to comment on.

“Electrostatic properties, high surface area, and anion exclusion”

As stated in the first part, I find it remarkable that the authors use general terms such as “clay minerals” when the actual subject matter is specifically systems with a significant cation exchange capacity, and montmorillonite in particular. I will continue to refer to these systems as “bentonite” in the following, disregarding the constant references to “clay minerals” in TS15.

Stacks

After having established that montmorillonite and illite have structural negative charge, it begins:

Clay mineral particles are made of layer stacks and the space between two adjacent layers is named the interlayer space (Fig. 5).

This is the first mention of clay “particles” in the article, and they are introduced as if this is a most well-established concept in bentonite science (incredibly, it is also the first occurrence of the term “interlayer”). We will refer to “clay mineral particle” constructs as “stacks” in the following. I have written a detailed post on why stacks make little sense, where I demonstrate their geometrical impossibility and show that most references given to support the concept are studies on suspensions that often imply that montmorillonite do not form stacks. Sure enough, this is also the case in TS15

The number of layers per montmorillonite particle depends on the water chemical potential and on the nature and external concentration of the layer charge compensating cation (Banin and Lahav 1968; Shainberg and Otoh 1968; Schramm and Kwak 1982a; Saiyouri et al. 2000)

Banin and Lahav (1968), Shainberg and Otoh (1968), and Schramm and Kwak (1982) all report studies on montmorillonite suspensions. The abstract of Shainberg and Otoh (1968) even states “The breakdown of the tactoids occurred when the equivalent fraction of Na increased from 0.2 to 0.5. Montmorillonite clay saturated with 50% calcium (and less) exists as single platelets.”, and the abstract of Schramm and Kwak (1982) states “Upon exchange of Ca-counterions for Li-, Na-, or K-counterions, a sharp initial decrease in tactoid size was observed over approximately the first 30% of cation exchange.”. These are just different ways of saying that sodium dominated montmorillonite is sol forming.

I want to stress the absurdity of the description given in TS15. A pure fantasy is stated about how compacted bentonite is structured. As “support” for the claim are given references to studies on “dilute suspensions”. It should be clear that the way TOT-layers interact in such suspensions essentially says nothing about how they are organized at high density. But even if we pretend that these results are applicable, the given references say that most of the relevant systems (montmorillonite with about 30% sodium or more) do not form stacks.

Disregarding the references, note also how bizarre the above statement is that the number of layers in a “particle” depends on “the water chemical potential and on the nature and external concentration of the layer charge compensating cation”: stacks are supposed to be fundamental structural units, yet the number of layers in a stack is supposed to depend on the entire water chemistry?! (It makes sense, of course, for stacks in actual suspensions.) Also, for montmorillonite an actual number of layers is nowhere stated in TS15.

TS15 further complicate things by lumping together montmorillonite and illite. In contrast to Na-montmorillonite, illite has by definition a mechanism for keeping adjacent TOT-layers together: its layer charge density is higher and compensated by potassium, which doesn’t hydrate that well, leading to collapsed interlayers. As far as I understand, one characterizing feature of illite is that the collapsed interlayers are manifested as a “10-angstrom peak” in X-ray diffraction measurements.

To treat montmorillonite and illite on equal footing (in a laid-back single sentence) again shows how nonsensical this description is. Stacking in montmorillonite suspensions occurs as a consequence of an increased ion-ion correlation effect when the fraction of e.g. calcium becomes large (> 70-80%). This process requires the ions to be diffusive and is distinctly different from the interlayer collapse in illite.

I actually have a hard time understanding what exactly is meant by the term “illite” here. In clay science it is clear that what is referred to by this name are systems that may have a quite considerable cation exchange capacity.1 Reasonably, such systems contain other types of cations besides potassium2 (as they are exchangeable), and must contain compartments where such ions can diffuse (as they are exchangeable). To increase the complexity, there are also “illite-smectite interstratified clay minerals”, which typically are in “smectite-to-illite” transitional states. For these, it seems reasonable to assume that the remaining smectite layers provide both diffusable interlayer pores and the cation exchange capacity. I don’t know if such “smectite layers” provides the cation exchange capacity in general in systems that researchers call illite. Neither do I understand how researchers can accept and use this, in my view, vague definition of “illite”. Anyway, it is the task of TS15 to sort out what they mean by the term. This is not done, and instead we get the following sentence

Illite particles typically consist of 5 to 20 stacked TOT layers (Sayed Hassan et al. 2006).

This study (Sayed Hassan et al., 2006) concerns one particular material (illite from “the Le Puy ore body”) that has been heavily processed as part of the study.3 I mean that such a specific study cannot be used as a single reference for the general nature of “illite particles”. Moreover, the stated stack size (5 — 20 layers) is nowhere stated in Sayed Hassan et al. (2006)!4

In their laid-back sentence, TS15 also implicitly define “interlayer space” as being internal to stacks. I criticized this way of redefining already established terms in the stack blog post, and TS15 serves as a good illustration of the problem: are we not supposed to be able to use the term “interlayer” without accepting the fantasy concept of stacks? To be clear, “interlayer spaces” in the context of montmorillonite simply means, and must continue to mean, spaces between adjacent TOT basal surfaces. It drives me half mad that the “stack-internal” definition is so common in contemporary bentonite scientific literature that this point seems almost impossible to communicate.

The provided illustration (“Fig 5”) explicitly shows how TS15 differ between “interlayers” that are assumed internal, and “outer basal surfaces” that are assumed external to the stack.

This illustration misrepresents the actual result of assembling a set of TOT-layers, just like any other “stack” picture found in the literature. The figure shows five identical TOT-layers that can be estimated to be smaller than 20 nm in lateral extension (while the text “conveniently” states that they should be 50 — 200 nm). Compared with “realistic” stacks, formed by randomly drawing TOT-layer sizes from an actual distribution, the depicted stack in TS15 looks like this5 (see here for details)

Besides the fact that “realistic” stack units cannot be used to form the structure of compacted bentonite, it should also be clear from this picture that “outer basal surfaces” and “interlayers” (in the sense of being internal to the stack) are not well defined. Note further that in actual compacted systems (above 1.2 g/cm3, say) such “realistic” stacks would be pushed together, something like this

In this picture, why should e.g. the interface between the green and the red stack be defined as an interface between two “outer surfaces” rather than an interlayer? Also, is this interface supposed to change nature and become an “interlayer”, as the water chemical potential or the external ion content changes? Like all other proponents of stack descriptions that I have encountered, TS15 do not in any way explain how “interlayers” and “outer surfaces” are supposed to function fundamentally differently. Similarly, they do not describe how the number of layers in a stack depends on water chemistry, nor do they provide a mechanism for why (sodium dominated) montmorillonite stacks of are supposed to keep together.

I want to emphasize that I do not favor any construction with “realistic” stacks, but only use them to illustrate the absurd consequences of taking a stack description seriously, and to demonstrate that all such descriptions in the bentonite literature are essentially pure fantasies, including the one given in TS15. I’m also quite baffled as to why TS15 (and others) provide such completely nonsensical descriptions, and how these can end up in review articles. I believe a hint is given in this formulation

[T]he number of stacked TOT layers in montmorillonite particles dictates the distribution of water in two distinct types of porosity: the interlayer porosity […] and the inter-particle porosity.

The only way I can make sense of this whole description is as an embarrassing attempt to motivate the introduction of models with several “distinct types of porosity”: the outcome is simply a macroscopic multi-porosity model (which will also be evident in later sections).

I’ve written a detailed blog post on why multi-porosity models cannot be taken seriously. There I point out that basically all authors promoting multi-porosity for some reason attempt to dress it up in terms of microscopic concepts, while the models obviously are macroscopic. Moreover, no one has ever suggested a mechanism for how equilibrium is supposed to be maintained between the different types of “porosities”.

Anion exclusion

After hallucinating about the structure of compacted bentonite, TS15 change gear and begin an “explanation” of anion exclusion. Let’s go through the description in detail.

The negative charge of the clay layers is responsible for the presence of a negative electrostatic potential field at the clay mineral basal surface–water interface.

I cannot really make sense of the term “negative electrostatic potential field”, although I think I understand what the authors are trying to say here. What is true is that the electrostatic potential near a montmorillonite basal surface is lowered compared to a point farther away. But whether or not the value of the potential is negative is irrelevant, as we are free to choose the reference level. If the zero level is chosen at a point very far from the surface, which often is done, it is true that the potential is negative at the surface. But the key principle is that the potential decreases towards the surface.6 A varying electrostatic potential signifies an electric field, which in this case is directed towards the surface (\(E = -d\phi/dx\)).

Furthermore, the electric field is not present merely because of the presence of negative charge, but because this charge is constrained to be positioned in the atomic structure of the clay. Remember that the structural clay charge is compensated by counter-ions, and that the system as a whole is charge neutral. The reason for the presence of an electric field near the surface is due to charge separation. And the reason for the potential decreasing (i.e. the electric field pointing towards the surface) is because it is the negative charge that is unable to be completely freely distributed.7

The concentrations of ions in the vicinity of basal planar surfaces of clay minerals depend on the distance from the surface considered. In a region known as the electrical double layer (EDL), concentrations of cations increase with proximity to the surface, while concentrations of anions decrease.

Having established that the electrostatic potential varies in the vicinity of the surface, it follows trivially that the ion concentrations also vary. I also find it peculiar to label the regions where the concentrations varies as the EDL. An electric double layer is a structure that includes both the surface charge and the counter-ions (hence the word “double”). What is described here should preferably be called a diffuse layer. Note, moreover, that the way an electric double layer here is introduced implies that TS15 consider a single interface, i.e. some variant of the Gouy-Chapman model (this becomes clear below). But this model is not applicable to compacted bentonite.

At infinite distance from the surface, the solution is neutral and is commonly described as bulk or free solution (or water).

Here I think it becomes obvious that the authors try to motivate the presence of bulk water within the clay structure. As described in the blog post on “Anion accessible porosity”, it is only reasonable to assume that diffuse layers merge with a bulk solution in systems that are very sparse — i.e. in suspensions.8 This is how e.g. Schofield (1947) utilized the Gouy-Chapman model to estimate surface area. But how is the solution next to a basal surface in compacted bentonite supposed to merge with a bulk solution? Even if we use the authors’ own fantasy stack constructs, the typical structure of compacted bentonite must be envisioned something like this (I have color coded different stacks to be able to understand where they begin and end).

The regions where basal surfaces of different stacks face each other (labelled A) are way too small in order to merge with a bulk solution (and, as asked earlier, how are these regions even different from “interlayers”?). Furthermore, regions adjacent to external edge-surfaces of these imaginary stack units (B, C) are not at all considered by applying a Gouy-Chapman model. The only way to make “sense” out of the present description is to imagine larger voids in the clay structure, something like this

But even if such voids would exist (in equilibrated water-saturated bentonite under reasonable conditions, they do not) they would only constitute an exotic exception to the typical pore structure. By focusing on this type of possible “anion” exclusion, TS15 completely miss the point.

This spatial distribution of anions and cations gives rise to the anion exclusion process that is observed in diffusion experiments.

Now I’m lost. I don’t understand how ion distributions are supposed to cause a process. I think the authors here allude to Schofield’s approach to estimating surface area in montmorillonite suspensions. As discussed in detail in the blog post on anion-accessible porosity, if the suspension is so dilute that we can consider each clay layer independently, and if we equilibrate it with an external solution, we can measure its salt content, and use the Gouy-Chapman model to e.g. estimate surface area from the amount of excluded salt (as compared with the external solution).

But, as also discussed in the blog post on salt exclusion, the “Schofield type” of exclusion is not what we expect to be dominating in a dense system. Rather, in denser systems (and in Donnan systems generally — no surfaces need to be involved), salt exclusion occurs mainly because of charge separation at interfaces with the external solution. I find it revealing that TS15 so far in the article has not at all mentioned such interfaces.

Moreover, in the above sentence TS15 causally states that the anion exclusion process is “observed in diffusion experiments”, without further clarification. Given that the previous section treated diffusion, a reader would expect to have been introduced to the anion exclusion process and how it is observed in diffusion experiments. But this subsection is the first time in TS15 where the term “anion exclusion” is used! In the section on diffusion, “anion accessible porosity” was briefly mentioned, and I suppose a reader is here presumed to connect the dots. But the presence of an exclusion process certainly does not imply an “anion accessible porosity”! Furthermore, anion exclusion is not necessarily observed in diffusion experiments. A more correct statement is that we observe effects of salt exclusion in experiments where a bentonite sample is contacted with an external solution via a semi-permeable component (which typically is a filter that keeps the clay in place). The effect is most conveniently studied in equilibrium rather than diffusion tests, and salt exclusion is not present in e.g. closed-cell diffusion tests. Note that exclusion effects are always related to an external solution.

As the ionic strength increases, the EDL thickness decreases, with the result that the anion accessible porosity increases as well.

Here it is fully clear that TS15 conflate “anion accessible porosity” and “anion exclusion”. If we consider the “Schofield type” of salt exclusion, it is true that the so-called “exclusion volume” changes with the ionic strength. However, an exclusion volume is not a physical space, but an effective, equivalent quantity. It is derived from the Gouy-Chapman model, which always has anions present everywhere.

Even more importantly, the “Schofield type” of exclusion is not really of interest in dense systems (nor is the Gouy-Chapman model valid in such systems). As discussed above, one must instead consider salt exclusion stemming from charge separation at interfaces with the external solution. For this case it does not even make sense to define an exclusion volume.

I can only interpret this entire paragraph as another fruitless attempt to motivate a multi-porous modeling approach. In this subsection we have so far been told that “two distinct types of porosity” can be defined (they cannot), and we have vaguely been hinted that “bulk or free solution” also is relevant for modelling compacted bentonite. And with the last quoted sentence it is relatively clear that TS15 try to establish that the relative sizes of various “porosities” are controlled by a simple parameter (ionic strength).

The final paragraph of this subsection contain several statements that makes my jaw drop.

An equivalent anion accessible porosity can be estimated from the integration of the anion concentration profile (Fig. 6) from the surface to the bulk water (Sposito 2004)

Here the authors suddenly use the phrase “equivalent”! They are thus obviously aware of that “anion accessible porosity” is a spurious concept?! ?!?! I really don’t know what to say. Their own graph (“Fig. 6”) even show that the Gouy-Chapman model has anions (salt) everywhere! Note that this statement also implies that “the bulk water” is assumed to exist within the clay.

In compacted clay material, the pore sizes may be small as compared to the EDL size. In that case, it is necessary to take into account the EDLs overlap between two neighbouring surfaces.

I think this is a very revealing passage. The conditions of compacted bentonite are treated as an exception: pore sizes “may” be smaller than the EDL, and “in that case” it is necessary to account for overlapping diffuse layers. But for compacted bentonite, this is the only relevant situation to consider! Without “overlapping” diffuse layers there is no swelling and no sealing properties. An entire page has been devoted to discussing a model only relevant for suspensions (Gouy-Chapman), while “compacted clay material” here is commented in two sentences…

Clay mineral particles are, however, often segregated into aggregates delimiting inter-aggregate spaces whose size is usually larger than inter-particle spaces inside the aggregates.

All of a sudden — in the middle of a paragraph — we are introduced to a new structural component! “Aggregates” have not been mentioned earlier in the article and is here introduced without any references. It is my strong opinion that this way of writing is not appropriate for a scientific publication, especially not for a review article. I’m not sure what type of system the authors have in mind here, but “aggregates” are typically not present in actual water saturated bentonite. I have commented more on this in the blog post on stacks.

Conclusion

At the end of the previous section (on diffusion), we were promised that this section should qualitatively link “fundamental properties of the clay minerals” to the diffusional behavior of compacted bentonite. Instead, we are given a fictional description of the structure (conflated with structures of other “clay minerals”), along with a confused explanation of anion exclusion that is irrelevant for such systems. Not a single word is said about the equilibrium that must be considered, namely that at interfaces between bentonite and external solutions. Rather, the idea of “overlapping” diffuse layers — which is the ultimate cause for bentonite swelling — is treated as an exception and only commented on in passing (and nothing is said about how to handle such systems). Although nothing is fully spelled out, I can only interpret this entire part as a (failed) attempt to motivate a multi-porous approach to modeling bentonite. And multi-porosity models cannot be taken seriously.

I admit that scrutinizing studies and pointing out flaws can be fun. However, considering that the descriptions in TS15 are the rule rather than the exception in contemporary bentonite research, I mostly feel weary and resigned. I don’t mean that every clay researcher must agree with me that a homogeneous model is the only reasonable starting point for describing compacted bentonite, and I could only wish that this blog was more influential. But I feel almost dizzy thinking about how this research sector is so hermetically sealed that one can spend entire careers in it without ever having to worry about understanding the nature of swelling and swelling pressure.

Update (250901): Part III of this review is found here.

Footnotes

[1] The Wikipedia article on illite, for example, states that the cation exchange capacity is typically 0.2 — 0.3 eq/kg. Is a significant cation exchange capacity required for classifying something as illite?

[2] E.g. (Poinssot et al., 1999), that TS15 reference as a source on illite, work with sodium exchanged “illite du Puy”, i.e. “Na-illite”.

[3] The material was dispersed by diluting it in alkaline solution and sonicating it. It was thereafter dropped as a suspension on a glass slide and dried.

[4] We may note that the number 5 — 20 TOT-layers in a stack actually showed up when we investigated how this concept is (mis)used in descriptions of bentonite. There it turned out to be a complete misunderstanding of the behavior of suspensions of Ca-montmorillonite.

[5] I am not capable to produce anything reasonable in 3D, but I think a 2D representation still conveys the message.

[6] Perhaps this criticism can be regarded as nitpicking. I have a nagging feeling, though, that electrostatics is quite poorly understood in certain parts of the bentonite research field. Take the phrase “negative electrostatic potential field”, for example. Although it can be understood at face value (a scalar field with negative values), it also appears to mix together stuff related to charges (“negative”), electric fields (“field”), and potentials. It certainly is important to separate these concepts. There are many examples in the clay literature when this is not done. E.g. Madsen and Müller-Vonmoos (1989) mean that two “potential fields” can repel each other (and also misunderstand swelling)

A high negative potential exists directly at the surface of the clay layer. […] When two such negative potential fields overlap, they repel each other, and cause the observed swelling in clay.

Horseman et al. (1996) claim that a potential repels charges:

[…] the net negative electrical potential between closely spaced clay particles repel anions attempting to migrate through the narrow aqueous films of a compact clay […]

And Shackleford and Moore (2013) mean that “overlapping” potentials repel charges

In this case, when the clay is compressed […] to the extent that the electrostatic (diffuse double) layers surrounding the particles overlap, the overlapping negative potentials repel invading anions such that the pore becomes excluded to the anion.

[7] An isolated layer of negative charge of course also has an electric field directed towards it, but this is not the relevant system to consider here. (Such a system will actually have an electric field strength that is independent of the distance to the surface, as long as the layer can be regarded as infinitely extended.)

[8] See also this comment, and this quotation.

Sorption, part V: A case against Stern layers

It should go without saying that modelers and model developers must justify every feature, mechanism, or component that they use. Failing to do so strongly increases the risk of being fooled by overparameterization rather than gaining insight. The bentonite scientific literature is nonetheless full of incorrect or unjustified model assumptions, several of which have been discussed previously on the blog. Examples include assuming the presence of bulk water, assuming “stack” structures, and assuming that diffusive fluxes from separate domains are additive. Here we discuss yet another unjustified common model component: Stern layers on montmorillonite basal surfaces.

In the bentonite literature, a Stern layer essentially means a layer of “specifically” sorbed ions on the basal surface, as e.g. illustrated here, in a a figure very similar to what is found in Leroy et al. (2006). Illustrations like this are ubiquitous in the literature.

If montmorillonite basal surfaces function roughly as uniform planes of charge we expect the counter-ions to form a diffuse layer, as e.g. described by the Gouy-Chapman model. By introducing a Stern layer, however, many bentonite researchers mean that exchangeable cations in general also interact with basal surfaces by forming immobile surface complexes. Such interactions necessarily involve mechanisms more “chemical” than the pure electrostatic interaction with uniform planes of charge, and the typical description postulates localized “sorption sites”, as illustrated above.

This blog post treats three different main arguments against Stern layers, presented in different sections

I want to make clear that this criticism concerns one particular type of surface: the montmorillonite basal surface. Stern layer models are found in many research fields dealing with solid interfaces, and although they have been criticized more generally, here we have no intention of doing so. Likewise, the process of surface complexation is certainly important generally — even in bentonite, e.g. on edge surfaces of montmorillonite particles.1

To better be able to criticize the use of Stern layers on basal surfaces in bentonite modeling, we begin by discussing the origin of Stern’s model.

Origins of the Stern Layer model

The Stern layer concepts were introduced by Otto Stern2 as an extension of the Gouy-Chapman model. Stern’s main concern was metallic electrodes in electrochemical applications. In such systems, the surface (electrode) potential is externally controlled, and can typically be on the order of 1 volt. It is easily seen that the Gouy-Chapman model predicts nonsense for such surface potentials. For e.g. a 1:1 system, the counter-ion concentration at the surface is enhanced by a factor of the order of \(10^{17}\)(!), as seen directly from the Boltzmann distribution \(c^\mathrm{surf} = c^\mathrm{ext}\cdot e^{e\psi^0/kT}\), where \(\psi^0\) is the surface potential, \(e\) is the elementary charge, \(kT\) the thermal energy, and \(c^\mathrm{ext}\) is the concentration far away from the surface. The main problem is that the Gouy-Chapman model does not account for the finite size of ions, and therefore can accumulate an arbitrary amount of charge at the surface. To remedy this flaw, Stern suggested to divide the interface region into a “compact” layer and a “diffuse” part, with the division located an ionic radius from the electrode surface (sometimes referred to as the outer Helmholtz plane).

In the simplest version of Stern’s model the compact layer is free of charges but act as a plate capacitor with a prescribed capacitance (per unit area) \(K_0\). In the original paper Stern shows that, with electrode potential \(\psi^0 = 1\) V and external 1:1 solution concentration \(c^\mathrm{ext} = 1\) M, such a capacitive layer reduces the potential where the diffuse layer begins to \(\psi^1 = 0.08\) V; lowering the external concentration to \(c^\mathrm{ext} = 0.01\) M gives \(\psi^1 = 0.18\) V. For these calculations, Stern uses a value of \(K_0 = 0.29\;\mathrm{F/m^2}\), adopted from measurements on mercury electrodes. This version of the Stern model is essentially a way to take into account that ions cannot get arbitrarily close to the surface.

Stern also presented more elaborate versions of the model that include adsorption in the compact layer (as a Langmuir adsorption model). It is such mechanisms that is universally referred to as a Stern layer in the bentonite scientific literature. Clearly, such versions are substantially more conceptually complex; rather than to just account for a finite ion size at the first molecular layer, we must now also consider additional chemical interactions that typically are different for different types of ions. We also need to have an idea about the adsorption capacity.

Lack of a coherent description of “specific sorption” on montmorillonite basal surfaces

When using Stern layers for describing montmorillonite basal surfaces, a first thing to note is that the surface potential is not independently controlled for these systems. In contrast to metallic electrodes, montmorillonite is characterized by a fixed surface charge and the problem of accumulating unrealistically large amounts of ions at the interface is significantly mitigated. As pointed by e.g. Norrish and Bolt already in the 1950s, even if we put all counter-ions within the first nanometer adjacent to the surface, the corresponding ion concentration is not larger than approximately 3 M. Here is a illustration of the montmorillonite basal surface on the nm scale, with a representative number of monovalent counter-ions (top layer oxygen atoms are red and the counter-ions blue).3

Clearly, there is room to accommodate all ions without running into the problems that was initially addressed by Stern’s model. Of course, solely accounting for the finite size of the ions — as is done in the simplest version of Stern’s model — is always well justified and will in principle improve the description. In particular, a pure diffuse layer model overestimates the capacitance of the surface. As shown in the table below, the introduction of an empty Stern layer “fixes” this problem.

Here \(c^\mathrm{ext}\) is the concentration of the 1:1 salt far away from the surface, \(\psi^1\) and \(c^1\) denote the electrostatic potential and the counter-ion concentration, respectively, at the point where the diffuse layer begins (i.e. at the interface to the compact layer), and \(\psi^0\) is the electrostatic potential at the surface. \(K_\mathrm{Stern}\) denotes the corresponding capacitance as calculated from the Stern model, with the choice \(K_0 = 0.29 \;\mathrm{F/m^2}\). \(K_\mathrm{DL}\) is instead the capacitance as calculated from a pure diffuse layer model (in which case the surface potential has the value of \(\psi^1\)). In the calculations are assumed a montmorillonite surface charge of 0.111 \(\mathrm{C}/\mathrm{m ^2}\).

But to simply account for the finite size of the ions by means of an empty compact layer is not how the term Stern layer is used in the bentonite scientific literature. As mentioned, most bentonite researchers mean that parts of the rather sparse collection of ions on the surface interacts chemically (“specific sorption”, “chemisorption”). The question of whether Stern layers on montmorillonite basal surfaces are well motivated thus reduces to what arguments there are for more elaborate chemical mechanisms being active on these surfaces. And descriptions of specific sorption on basal surfaces are really all over the place.

Deshpande and Marshall (1959, 1961) claim that counter-ions are partitioned between (i) chemisorbed ions, which do not contribute to conductivity or activity, (ii) physisorbed ions in a Stern layer, which do not contribute to activity or D.C. conductivity, and (iii) diffuse layer ions, which contribute fully to activity and conductivity. If I interpret their numbers correctly, they state that about 75% — 80% of the counter-ions in pure K-montmorillonite are immobilized. Note that these authors mean that ions in the Stern layer are “physically” adsorbed, while the surface also has “chemically” adsorbed species. Thus, they use the term Stern layer for certain types of physisorption, while stating that ions also bond covalently to the surface.

Shainberg and Kemper (1966, 1967), on the other hand, model ions as either mobile in a diffuse layer or immobile in a Stern layer. They argue that covalently bound ions are “extremely unlikely”, and mean that ions in the Stern layer form “ion pairs” with the surface, as suggested earlier by Heald et al. (1964). They use this idea as a starting point for analyzing differences in exchange selectivity for different monovalent cations in montmorillonite. They claim that “about 20 to 50% of sodium are specifically adsorbed”.

Note that Shainberg and Kemper, just like Deshpande and Marshall, assume Stern layer ions to be “physically” bonded to the surface (i.e. non-covalently), while having a completely different opinion on the presence of chemisorbed ions. Shainberg and Kemper (1966) provide a picture showing the conceptual difference between an ion in the Stern layer (“unhydrated”) and ions in the diffuse layer (“hydrated”) that looks very similar to this

In this context it may be worth to also mention the work of Low and co-workers. Low argued consistently that swelling pressure is not primarily related to the exchangeable ions — something that I strongly disagree with and that I commented briefly on in a previous blog post.4 Directly related to this view, these authors claim that the exchangeable ions for the most part do not dissociate from the surfaces, and in later papers they refer to such ions as being part of a Stern layer.

To me, all the above descriptions seem like little more than speculation. None of these authors discuss how or why e.g. sodium ions (!) are supposed to from ion pairs with a charge center buried far inside the montmorillonite layer, nor how or why they bond covalently with the basal surface. Nevertheless, both Low as well as Shainberg and Kemper seem to have influenced the writings of Sposito, who, in turn, has had quite a huge impact on contemporary descriptions of bentonite. In e.g. Sposito (1992), which specifically discusses montmorillonite (“smectite”), he writes

Despite the long history of continual investigation of the surface and colloid chemistry of smectites (van Olphen, 1977; Sposito, 1984), the structure of the electrical double layer at smectite surfaces and its influence on the rheological properties of smectite suspensions remain topics of lively controversy. One of the most contentious issues is the partitioning of adsorbed monovalent cations among the three possible surface species on the basal planes of smectite particles, such as montmorillonite (see, e.g., Low, 1981, 1987). […] [A] monovalent cation can be adsorbed on the basal planes by three different mechanisms: inner-sphere surface complexes, in which the cation desolvates and is captured by a ditrigonal cavity; outer-sphere surface complexes, in which the cation remains solvated but still is captured by a ditrigonal cavity and immobilized; and the diffuse-ion swarm, in which the cation is attracted to the basal plane, but remains fully dissociated from the smectite surface (Sposito, 1989a, Chap. 7).

The view conveyed here is that exchangeable ions do not interact with montmorillonite basal surfaces as if these, to a first approximation, are planes of uniform charge. Such interaction is only supposed to govern an outer diffuse layer (called a “diffuse-ion swarm” for unclear reasons), and ions are also supposed to interact with the surface by no less than two other “mechanisms”, related to the “ditrigonal cavities”.

Note that while Sposito acknowledges an ongoing “lively controversy” regarding how to describe montmorillonite basal surfaces, he specifies that this debate is limited to how to distribute the exchangeable ions among “three possible surface species.” But, as we will explore below, there is certainly no consensus within colloid chemistry that exchangeable ions are involved in complexation chemistry on the basal surfaces! (I therefore find this way of formulating the “controversy” quite dishonest, to be honest.) For reasons I can’t get my head around, descriptions of “inner-” and “outer-sphere complexes” on montmorillonite basal surfaces are anyway ubiquitous in modern bentonite literature. Let’s therefore take a closer look at how these are introduced.

“Inner-” and “outer-sphere” surface complexes

A description that hardly enlightens me is given in Sposito (1989)5

[The inner- and outer-sphere complexes] constitute the Stern layer on an adsorbent. […] The diffuse-ion swarm and the outer-sphere surface complex mechanisms of adsorption involve almost exclusively electrostatic bonding, whereas inner-sphere complex mechanisms are likely to involve ionic as well as covalent bonding. Because covalent bonding depends significantly on the particular electron configurations of both the surface group and the complexed ion, it is appropriate to consider inner-sphere surface complexation as the molecular basis of the term specific adsorption. Correspondingly, diffuse-ion screening and outer-sphere surface complexation are the molecular basis for the term nonspecific adsorption. Nonspecific refers to the weak dependence on the detailed electron configurations of the surface functional group and adsorbed ion that is to be expected for the interactions of solvated species.

Here, Sposito means that exchangeable ions bond covalently with the montmorillonite basal surface,6 in agreement with Deshpende and Marschall, and in disagreement with Shainberg and Kemper (we have one “extremely unlikely” and one “likely” for covalent bonding…). In contrast to Deshpende and Marschall, however, Sposito means that these “inner-sphere” complexes are part of the Stern layer. To confuse matters even more, Shainberg and Kemper assume their “unhydrated” construct (which corresponds structurally to an “inner-sphere” complex, see above figure) as being part of the Stern layer, but not part of any covalent bonding. Moreover, Shainberg and Kemper assume their “hydrated” construct (corresponding structurally to an “outer-sphere” complex, see above figure) to be part of the diffuse layer, while Sposito wants his “outer-sphere” complexes to be immobile and part of the Stern layer…

Given the above description (and others) it is hard to understand what the difference is supposed to be between an “outer-sphere” complex and an ion in the “diffuse-ion swarm”, other than that the former is simply claimed to be immobilized; both ions are said to interact with “exclusively electrostatic bonding”,7 both are classified as “nonspecific adsorption”, and both are fully hydrated. In my head, this is simply a recipe for achieving an overparameterized model description.

Sposito’s description also makes implicit statements about the montmorillonite basal surface: it contains “surface functional groups” whose specific electron configuration significantly influence covalent bonding, while being insensitive for the formation of “outer-sphere” complexes and the “diffuse ion-swarm”. In Sposito (1984) he suggests that the “functional groups” are groups of oxygen atoms on the surface of the montmorillonite particle (“ditrigonal cavities”) that qualifies as Lewis bases. The presence of atomic substitutions in the octahedral layer is supposed to enhance this Lewis base character

If isomorphic substitution of \(\mathrm{Al}^{3+}\) by \(\mathrm{Fe}^{2+}\) or \(\mathrm{Mg}^{2+}\) occurs in the octahedral sheet, the resulting excess negative charge can distribute itself principally over the 10 surface oxygen atoms of the four silica tetrahedra that are associated through their apexes with a single octahedron in the layer. This distribution of negative charge enhances the Lewis base character of the ditrigonal cavity and makes it possible to form complexes with cations as well as with dipolar molecules.

Note how completely different this whole description is compared to the original Gouy-Chapman conceptual view. Here is implied that montmorillonite basal planes8 cannot be described as a passive layer of charge, but that it is a fully reactive system, including covalent bonding mechanisms.

Frankly, I dismiss the above description of the montmorillonite surface as a Lewis base as pure speculation. I will gladly admit that I am a physicist rather than a chemist, and perhaps I am missing something obvious, but I really don’t see any argumentation behind this description. I am also under the impression that montmorillonite basal planes are relatively chemically stable — that is why they form in the first place, and that is also one reason for why we are interested in using bentonite for e.g. long-term geological waste storage. Furthermore, a meta-argument for dismissing this description is that in later publications we find statements like this, in Sposito (2004):

The \(\mathrm{Na}^+\) that are counterions for the negative structural charge developed as a result of isomorphic substitutions within the clay mineral layer tend to adsorb as solvated species on the basal plane (a plane of hexagonal rings of oxygen ions known as a siloxane surface) near deficits of negative charge originating in the octahedral sheet from substitution of a bivalent cation for \(\mathrm{Al}^{3+}\). This mode of adsorption occurs as a result of the strong solvating characteristics of Na and the physical impediment to direct contact between Na and the site of negative charge posed by the layer structure itself. The way in which this negative charge is distributed on the siloxane surface is not well known, but if the charge tends to be delocalized there, that would also lend itself to outer-sphere surface complexation.

So, 20 years after the surface chemistry of montmorillonite was described as if it was completely understood (Sposito, 1984), the way the negative charge distributes is now described instead as “not well known”… Furthermore, in contrast to earlier statements, the formation of an “outer-sphere complex” is here associated mainly with the hydration properties of sodium. But if the idea of a “surface functional group” is discarded — or at least downplayed — why should a hydrated ion near the surface be described as a surface complex at all?

We note that Sposito (2004) still seem to imply that the “outer-sphere surface complex” is localized and immobile (“adsorbed near deficits of negative charge”) But the evidence is vast that sodium, and several other ions, are quite mobile even in monohydrates (see below).

Deviations from the Gouy-Chapman model do not imply surface complexation

Authors that promote Stern layers on montmorillonite basal surfaces usually rely on the Gouy-Chapman model for describing the diffuse layer part. Lyklema, writing generally on colloid science, explicitly “defends” such an approach

In the following our discussion will be based on the rather pragmatic, though somewhat artificial, subdivision of the solution side of the double layer into two parts: an inner part, or Stern layer where all complications regarding finite ion size, specific adsorption, discrete charges, surface heterogeneity, etc., reside and an outer, Gouy or diffuse layer, that is by definition ideal, i.e. it obeys Poisson-Boltzmann statistics. This model is due to Stern following older ideas of Helmholtz and has over the decades since its inception rendered excellent services, especially in dealing with experimental systems.

Dzombak and Hudson (1995) express a similar attitude

Bolt and co-workers […] investigated in detail the application of the Gouy–Chapman diffuse-layer theory to ion-exchange processes. Their work demonstrated that consideration of electrostatic sorption alone is not sufficient to explain ion-exchange data and that chemisorption (or “specific” sorption) needs to be included in ion-exchange models.

It is not logically consistent to conclude that deviations from the Gouy-Chapman model implies that specific sorption “needs to be included”.9 On the contrary, introducing specific sorption to compensate for a certain model rather than for surface chemical reasons may, in my mind, be a recipe for an overparameterized disaster. I don’t get reassured by statements like this, also from Dzombak and Hudson (1995)

Surface complexation models can be extended to include diffuse-layer sorption. This approach permits their application in modeling the sorption of ions (such as monovalent electrolyte ions) that exhibit weak specific sorption. The generality of such an extended surface complexation approach together with the mathematical power of modern chemical speciation models offers the potential for accurate physicochemical modeling of ion exchange

Reasonably, a complex system may require complex models, but it is certainly dangerous in a modeling context to rely too heavily on “mathematical power” (I guess “numerical power” is the preferred phrase).10

Note that very different attitudes towards the Stern layer concept is found in the colloid science literature, where e.g. Evans and Wennerström (1999) describe it as an “intellectual cul de sac”.11

One way of dealing with these difficulties is to say that the solution layer closest to a charged surface has properties so different from the bulk that it should be treated as a separate entity. This device was introduced in the 1930s by the German electrochemist Stern and the surface layer is commonly referred to as the Stern layer, whose properties are specified by a number of empirical parameters. It is the opinion of the authors of this book that the Stern layer concept is an intellectual cul de sac for the description of electrostatics in colloidal systems. One reason for this point of view is that from modern spectroscopic measurements we know molecular properties are not dramatically changed for a liquid close to a charged surface.

I find it quite perplexing that so many authors in the bentonite scientific community attribute any deviation from the Gouy-Chapman model solely to surface-related mechanisms. The Gouy-Chapman model treats both ions (point particles) and water (a dielectric continuum) in a very simplified manner, and it is clear that “specific ion” effects are ubiquitous, also in systems that lack surfaces. Addressing differences in e.g. selectivity coefficients without considering ion polarizability and hydration, while postulating the existence of localized sorption “sites”, can, to my mind, only lead to incorrect descriptions.

The Poisson-Boltzmann equation is a mean field approximation

Note also that the Poisson-Boltzmann equation — which underlies the Gouy-Chapman model — is only approximate. It is derived by assuming that the electrostatic potential experienced by any ion is the average potential from all other ions (and surfaces). More accurately, the ion distribution around a given ion deviates from the average, as a direct consequence of the presence of the central ion.

Including these ion-ion correlation effects makes the mathematical description considerably more complex. But with the advent of sufficiently powerful computers and algorithms, the electric double layer has been solved basically “exactly”. The “exact” solution may differ strongly from the Poisson-Boltzmann solution, with increasing concentrations towards the surfaces (and consequently a lowering of interlayer midpoint concentrations), and an explicit attractive electrostatic force between the two halves of an interlayer. Using Monte Carlo simulations, Guldbrand et al. (1984) demonstrated that with divalent counter-ions these effects are so large that the system becomes net attractive at a certain interlayer distance, in qualitative disagreement with the Poisson-Boltzmann solution. This effect, which has been thoroughly studied since the 80s, and which we have discussed in several posts on this blog, is the prevailing explanation e.g. for the limited swelling of Ca-montmorillonite.

The lesson here is that observed deviations from predictions of the Poisson-Boltzmann equation not automatically can be taken as evidence for additional active system components, and certainly not as evidence for specific sorption. Note that limited swelling in divalent montmorillonite is explained by the ions being diffusive, not that they are sorbed and immobilized. I cannot overstate the importance of this insight.

It boggles my mind that the entire research area on ion-ion correlations in colloidal systems seems to have made no significant impact on parts of the bentonite scientific community; I seldom find references to works on ion-ion correlation, and when I do it’s quite confused. E.g. Sposito (1992) means that the formation of “quasicrystals”12 is due to “outer-sphere” complexes

The best known example of a montmorillonite quasicrystal is that comprising stacks of four to seven layers. \(\mathrm{Ca}^{2+}\) ions, solvated by six water molecules (outer-sphere surface complex), serve as molecular “cross-links” to help bind the clay layers together through electrostatic forces.

Sure, the ion-ion correlation effect that prevents Ca-montmorillonite from exfoliating is of electrostatic origin, but it is not related to “cross-links” or surface complexes. Sposito furthermore continues by claiming that “even […] Na-montmorillonite” forms “quasicrystals”. Such claim cannot be supported by ion-ion correlation — on the contrary, ion-ion correlation explains why Ca-montmorillonite forms “quasicrystals”, while Na-montmorillonite does not. It is thus relatively clear that Sposito do not refer to ion-ion correlation in the above statement. At the same time, later in the same publication he cites Kjellander et al. (1988) on going beyond the mean-field treatment of the Poisson-Boltzmann equation, and even claims that the Gouy-Chapman model is “completely inaccurate” for systems containing divalent ions. I can only conclude that this is a quite confused description.

Finite-size effects of water molecules

With focus on the first molecular layers at a solid interface, it is clear that finite-size effects of water molecules — which are not treated in the Gouy-Chapman model — reasonably influences the resulting ion distribution. This influence is manifested both as a steric effect — there can only be a discrete number of water molecules between the ion and the surface — and as an effect of how strongly a certain ion is hydrated.

Water molecules are treated explicitly e.g. in molecular dynamics (MD) simulations of montmorillonite/water interfaces, and here are results from simulating a three water-layer interlayer of Na-montmorillonite, from Hedström and Karnland (2012)13

Sodium is seen to accumulate “between” the water layers; in the above illustration we have also included schematic illustrations of the molecular configurations, as conceived by Shainberg and Kemper (1966) (shown earlier). As stated earlier, Shainberg and Kemper (1966) refer to these as “hydrated” and “unhydrated”, but they are clearly the same type of configurations that e.g. Sposito (1992) and Dzombak and Hudson (1995) call “outer-” and “inner-sphere” complexes.

While the above mentioned authors mean that these “complexes” involve specific interactions between ions and surface, the MD simulation suggests that such structures are mainly a consequence of the finite-size of the molecules and ions. In particular, the MD results do not support the idea that these structures depend critically on a specific, non-electrostatic, ion–surface interaction. Indeed, the simulations explicitly treat also the atoms of the montmorillonite layer, which could make it difficult to judge whether the appearance of “complexes” mainly is related to water–ion or ion–surface interactions. But note that Hedström and Karnland (2012) simulate two different systems: one where the montmorillonite charge is put on specific atoms in the octahedral sheet (Mg for Al substitutions), and one where it is distributed on all Al atoms (as a fraction of the elementary charge). Both systems have essentially an identical atomic configuration in the interlayer, which strongly suggest that no critical ion–surface interaction is involved in forming “outer-” and “inner-sphere complexes” (i.e. they really are not surface complexes). I am not aware of any published simulation where the basal surface is represented as a uniform sheet of charge while water molecules are treated explicitly, but I am convinced that “outer-” and “inner-sphere complexes” would appear also in such a simulation.

Regarding MD simulations of montmorillonite interlayers, you can also simply observe them to convince yourself that the counter-ions are not in any reasonable sense immobilized. These types of simulations are routinely used to calculate (quite significant) interlayer diffusion coefficients, for crying out loud!

Experimental evidence of counter-ion mobility

A final argument for why Stern layers on montmorillonite basal surfaces are unjustified is the vast amount of empirical evidence of counter-ion mobility. We have discussed several diffusion studies in earlier blog posts that show that many ions (Na, Cl, K, Sr, I, Cs, Ca,…) have a significant mobility even in very dense systems, dominated by bi- or monohydrated interlayers. In the previous post, we brought up the following result

This figure shows the resulting concentration profiles in two diffusion experiments where sodium and chloride tracers, respectively, have diffused from an initial planar source for the same amount of time (23.7 h), in samples of pure Na-montmorillonite of dry density 1.8 \(\mathrm{g/cm^3}\), equilibrated with deionized water. This result was used previously to dismiss the ludicrous idea that these two ions are supposed to migrate in separate parts of the pore volume, exposed to completely different mechanisms. In the same vein, this result can be used to dismiss the idea of a Stern layer on basal surfaces.

Sodium, which is universally acknowledged to reside in the interlayers, is here demonstrated to diffuse just fine in bi- and monohydrated interlayers. As chloride, which also resides in the interlayers (despite all talk of “anion-accessible porosity”), behave essentially identical, it is quite far-fetched to assume any significant surface complexation mechanism. And anyone who argues for that these tracers actually do not diffuse in the interlayers should be reminded of the seeming “uphill” diffusion experiment,14 which is performed at even higher density, and where the “uphill” diffusion direction once and for all proves that the transport occurs in interlayers.

Strangely, many authors nowadays seem to promote both Stern layers and interlayer mobility in bentonite. Various simulation codes has been modified for this possibility, and there are several examples of researchers pointing out a possibility of “Stern layer diffusion”. I think these authors should carefully examine their chain of assumptions: Surface complexation in a Stern layer (i.e. sorption) is initially suggested to explain e.g. why breakthrough times in cation through-diffusion tests are relatively long as compared with the steady-state flux (i.e. why “\(D_e\)” can be considerably larger than”\(D_a\)”). With evidence for that the “sorbed” ions actually dominate the mass transfer, the sorption mechanism is not reconsidered, but yet another mechanism is suggested: Stern layer mobility… Reasonably, such an approach is not adequate for developing models; researchers employing it should critically consider the intended purpose of a Stern layer component.

Counter-ion mobility is also related to swelling pressure. Bentonite swelling pressure is difficult to describe generally, and I have written a whole series of blog posts on the subject, but it is clear that measured swelling pressures in e.g. moderately dense Na- and Li-montmorillonites is quite well described by the Poisson-Boltzmann equation. As this set of conditions (not too dense clay, simple monovalent ions) are exactly those for which we expect the Poisson-Boltzmann equation to be adequate, this is a strong indication that all counter-ions contribute to the pressure.15 Also, the limited swelling in e.g. Ca-montmorillonite, as previously discussed, is explained by ion-ion correlation effects where all ions are included in the diffuse layer.

Finally, we can take a look at salt exclusion from compacted bentonite. The magnitude of salt exclusion is directly related to the amount of mobile counter-ions. Thus, if most of the counter-ions were immobilized in a Stern layer, bentonite should show small exclusion effects. In contrast, the empirical results for e.g. chloride exclusion in sodium dominated bentonite indicate, again, that all counter-ions are part of a diffuse layer.

This diagram shows the relative amount of chloride in the bentonite as a function of \(c^\mathrm{ext}/c_\mathrm{IL}\), where \(c^\mathrm{ext}\) is the external salt concentration and \(c_\mathrm{IL}\) is the amount exchangeable cations, expressed as a monovalent interlayer concentration. The experimental data is from Van Loon et al. (2007), which we reevaluated and examined in detail in a previous blog post. The lines are the result from applying the “ideal” Donnan formula with various amounts of the counter-ions assumed diffusive. For details on Donnan theory, see this blog post.

Although the experimental data show considerable scatter, there is nothing in this plot that suggests that a fraction of the counter-ions are immobilized. And the quality of this data is certainly good enough to directly dismiss models that assume that the major part of ions are immobilized in a Stern layer.

Footnotes

[1] I find it quite frustrating that many descriptions in the literature only refer abstractly to “mineral surfaces” rather than specifically addressing montmorillonite. At the same time it is often clear from the context that statements regarding “mineral surfaces” should be understood as applicable to montmorillonite basal surfaces. I would much appreciate if researchers promoting Stern layers on basal surfaces would provide descriptions for specific systems, e.g. pure Na-Ca-montmorillonite.

[2] Otto Stern is a fascinating character in the history of science, most famous for the Stern-Gerlasch experiment, that helped pave the way for quantum mechanics. I highly recommend this lecture by the late Sandip Pakvasa. An example of its contents:

A note on Stern’s style of working: He always had a cigar in one hand, and he left actual work with hands to others, as he did not trust his own manual dexterity! […] He described the beneficial effects of a large wooden hammer that he kept in his lab and used it to threaten the apparatus if it did not behave! (apparently it worked!)

[3] Note that di-valent counter-ions would be even more sparsely distributed than this.

[4] It may be worth discussing my objections against the work of Low and co-workers in more detail in a future blog post.

[5] This is a general discussion on sorption on mineral surfaces, and is cited from the second edition of the book (2008). There is also a “Thired” edition.

[6] This particular description is general (for “adsorbents”), but since Sposito, as well as a large part of the contemporary bentonite scientific community, claim that “inner-sphere” complexes are present on montmorillonite basal surfaces, we can conclude that they mean that covalent bonding occur on such surfaces.

[7] Sure, the full quotation is “almost exclusively electrostatic bonding”, but what is a reader supposed to do with that? Such vague and sloppy scientific writing annoys me.

[8] Again, the discussion is on general “mineral surfaces”, but from other writings it is clear that this is supposed to apply to the montmorillonite basal surface.

[9] I furthermore don’t believe that “Bolt and co-workers” concluded that specific sorption “needs” to be included, but that this rather is an interpretation made by Dzombak and Hudson themselves. Bolt considered and downplayed the Stern Layer already in the mid 50s, and although he indeed has expressed positive attitudes (seriously, these guys just write too much!), he continued to downplay its significance in e.g. Bolt (1979), writing “In conclusion it appears justified to assume that for homoionic clays saturated with common ions, if hydrated, the Stern layer will be an “empty” Stern layer according to the terminology of Grahame (1947).”

[10] Note also that the perspective in this quotation is that specific sorption models can be complemented with diffuse layer features — i.e. the existence of sorption “sites” is assumed a priori. But Dzombak and Hudson (1995) never really discuss the nature of such “sites” on montmorillonite basal surfaces, but rely on Sposito’s speculations about “inner-” and “outer-surface” complexes.

[11] Note that Stern’s original paper actually is from 1924. I also suspect that Stern would object to being labeled an “electrochemist”.

[12] The terminology here is quite messy, and other authors may use other terms such as “tactoids”, see this post for a further discussion.

[13] This study was thoroughly discussed in a blog post on MD simulations and anion exclusion.

[14] Anyone making this argument should also provide a plausible suggestion for where a significant non-interlayer pore structure is located at these extreme densities.

[15] The pressure in these types of calculations can be related to the interlayer midpoint concentration. But this does not mean that not all counter-ions are involved in the process.

Are interlayer cations not attracted to the surfaces?!

Electrostatics can be quite subtle. The following comment on the interlayer ion distribution, in Kjellander et al. (1988), was an eye-opener for me

The ion concentration profile is determined by the net force acting on each ion. The electrostatic potential from the uniform surface charges is constant between the two walls, which means that the forces due to these charges cancel each other completely. Thus, the large counter-ion concentration in the electric double layer near the walls is solely a consequence of the repulsive interactions between the ions.

Interlayer cations are not attracted to the surfaces, but are pushed towards them due to repulsion between the ions themselves! My intuition has been that interlayer counter-ions distribute due to attraction with the surfaces, but the perspective given in the above quotation certainly makes a lot of sense. Here I use the word “perspective” because I don’t fully agree with the statement that the ion distribution is solely a consequence of repulsion. To discuss the issue further, let’s flesh out the reasoning in Kjellander et al. (1988) and draw some pictures.

Here we discuss an idealized model of an interlayer as a dielectric continuum sandwiched between two parallel infinite planes of uniform surface charge density.1 The system is thus symmetric around the axis normal to the surfaces (the model is one-dimensional).

From electrostatics we know that the electric field originating from a plane of uniform surface charge has the same size at any distance from the plane (we discussed this fact in the blog post on electrostatics and swelling pressure). We may draw such electric fields like this

From this result follows that the electric field vanishes between two equally negatively charged surfaces. The electrostatic field configuration for an “empty” interlayer can thus be illustrated like this

This means that the two interlayer surfaces don’t “care” about the counter-ions, in the sense that this part of the electrostatic energy (ion – surfaces) is independent of the counter-ion distribution.

To consider the fate of the counter-ions we continue to explore the axial symmetry. The counter-ion distribution varies only in the direction normal to the surfaces, and we can treat it as a sequence of thin parallel planes of uniform charge. Since the size of the electric field from such planes is independent of distance, the force on a positive test charge (= the electric field) at any position in the interlayer depends only on the difference in total amount of charge on each side of this position, as illustrated here

This, in turn, implies both that the electric field is zero at the mid position, and that the electric field elsewhere is directed towards the closest surface (since symmetry requires equal amount of charge in the two halves of the interlayer2). The counter-ions indeed repel each other towards the surfaces! The charge density must therefore increase towards the surfaces, and we understand that the equilibrium electric field qualitatively must look like this3

However, as far as I see, the “indifference” of the surfaces to the counter-ions is a matter of perspective. Consider e.g. making the interlayer distance very large. In this limit, the system is more naturally conceptualized as two single surfaces. It is then awkward to describe the ion distribution at one surface as caused by repulsion from other ions arbitrarily far away, rather than as caused by attraction to the surface. But for the case most relevant for compacted bentonite — i.e. interlayers, or what is often described as “overlapping” electric double layers — the natural perspective is that counter-ions distribute as a consequence of repulsion among themselves.

This perspective also implies that anions (co-ions) distribute within the interlayer as a consequence of attraction to counter-ions rather than repulsion from the surfaces! (The above figure applies, with all arrows reversed.) This insight should not be confused with the fact that repulsion between anions and surfaces is not really the mechanism behind “anion exclusion”. Rather, the implication here is that anion-surface repulsion can be viewed as not even existing within an interlayer.

A couple of corrections

With this (to me) new perspective in mind, I’d like to correct a few formulations in the blog post on electrostatics and swelling. In that post, I write

[R]ather than contributing to repulsion, electrostatic interactions actually reduce the pressure. This is clearly seen from e.g. the Poisson-Boltzmann solution for two charged surfaces, where the resulting osmotic pressure corresponds to an ideal solution with a concentration corresponding to the value at the midpoint (cf. the quotation from Kjellander et al. (1988) above). But the midpoint concentration — and hence the osmotic pressure — is lowered as compared with the average, because of electrostatic attraction between layers and counter-ions.

But the final sentence should rather be formulated as

But the midpoint concentration — and hence the osmotic pressure — is lowered as compared with the average, because of electrostatic repulsion between the counter-ions.

In the original post, I also write

This plot demonstrates the attractive aspect of electrostatic interactions in these systems. While the NaCl pressure is only slightly reduced, Na-montmorillonite shows strong non-ideal behavior. In the “low” concentration regime (< 2 mol/kgw) we understand the pressure reduction as an effect of counter-ions electrostatically attracted to the clay surfaces.

The last part is better formulated as

In the “low” concentration regime (< 2 mol/kgw) we understand the pressure reduction as an effect of electrostatic repulsion among the counter-ions.

I think the implication here is quite wild: In a sense, electrostatic repulsion reduces swelling pressure!

Footnotes

[1] The treatment in Kjellander et al. (1988) is more advanced, including effects of image charges and ion-ion correlations, but it does not matter for the present discussion.

[2] Actually, the whole distribution is required to be symmetric around the interlayer midpoint.

[3] The quantitative picture is of course achieved from solving the Poisson-Boltzmann equation. The picture may be altered when considering more involved mechanisms, such as image charge interactions or ion-ion correlations; Kjellander et al. (1988) show that the effect of image charges may reduce the ion distribution at very short distances, while the effect of ion-ion correlations is to further increase the accumulation towards the surfaces. Note that neither of these effects involve direct interaction with the surface charge.

Solving the Poisson-Boltzmann equation

To celebrate that I have built myself a tool for solving the Poisson-Boltzmann equation for two parallel charged plates and specified external solution, here is a cosy little animation

The animation shows the anion concentration profile (blue) between the plates as the distance varies, in systems in equilibrium with an external 100 mM 1:1 salt solution. Also plotted is the corresponding internal concentration level as calculated from the ideal Donnan equilibrium formula (orange). The layer charge density in the Poisson-Boltzmann calculation is 0.111 C/m2, and the corresponding cation exchange capacity in the Donnan calculation is 0.891 eq/kg.

As the distance between the plates increases, the Poisson-Boltzmann profile increasingly deviates from the Donnan concentration. At lower density (larger plate distance) it is clear that the Poisson-Boltzmann solution allows for considerably more anions between the plates as compared with the Donnan result. On the other hand, for denser systems, the difference between the two solutions decreases; this is especially true when considering the relative difference — keep in mind that the external concentration is kept constant, at 100 mM.

In fact, in systems relevant for e.g. radioactive waste storage — spanning an effective montmorillonite density range from \(\rho_\mathrm{mmt} =\) 1.60 g/cm3 to \(\rho_\mathrm{mmt} =\) 1.15 g/cm3, say — the difference between the Poisson-Boltzmann and the Donnan results is virtually negligible (it should also be kept in mind that the continuum assumption underlying the Poisson-Boltzmann calculation is not valid in this density range). Here are plotted snapshots of these two limiting cases, together with the Poisson-Boltzmann solution for a single plate (the Gouy-Chapman model)

This figure clearly shows that the Gouy-Chapman model is not at all valid in any relevant system, unless you postulate larger voids in the bentonite. But why would you do that?

Multi-porosity models cannot be taken seriously (Semi-permeability, part II)

“Multi-porosity” models1 — i.e models that account for both a bulk water phase and one, or several, other domains within the clay — have become increasingly popular in bentonite research during the last couple of decades. These are obviously macroscopic, as is clear e.g. from the benchmark simulations described in Alt-Epping et al. (2015), which are specified to be discretized into 2 mm thick cells; each cell is consequently assumed to contain billions and billions individual montmorillonite particles. The macroscopic character is also relatively clear in their description of two numerical tools that have implemented multi-porosity

PHREEQC and CrunchFlowMC have implemented a Donnan approach to describe the electrical potential and species distribution in the EDL. This approach implies a uniform electrical potential \(\varphi^\mathrm{EDL}\) in the EDL and an instantaneous equilibrium distribution of species between the EDL and the free water (i.e., between the micro- and macroporosity, respectively). The assumption of instantaneous equilibrium implies that diffusion between micro- and macroporosity is not considered explicitly and that at all times the chemical potentials, \(\mu_i\), of the species are the same in the two porosities

On an abstract level, we may thus illustrate a multi-porosity approach something like this (here involving two domains)

The model is represented by one continuum for the “free water”/”macroporosity” and one for the “diffuse layer”/”microporosity”,2 which are postulated to be in equilibrium within each macroscopic cell.

But such an equilibrium (Donnan equilibrium) requires a semi-permeable component. I am not aware of any suggestion for such a component in any publication on multi-porosity models. Likewise, the co-existence of diffuse layer and free water domains requires a mechanism that prevents swelling and maintains the pressure difference — also the water chemical potential should of course be the equal in the two “porosities”.3

Note that the questions of what constitutes the semi-permeable component and what prevents swelling have a clear answer in the homogeneous mixture model. This answer also corresponds to an easily identified real-world object: the metal filter (or similar component) separating the sample from the external solution. Multi-porosity models, on the other hand, attribute no particular significance to interfaces between sample and external solutions. Therefore, a candidate for the semi-permeable component has to be — but isn’t — sought elsewhere. Donnan equilibrium calculations are virtually meaningless without identifying this component.

The partitioning between diffuse layer and free water in multi-porosity models is, moreover, assumed to be controlled by water chemistry, usually by means of the Debye length. E.g. Alt-Epping et al. (2015) write

To determine the volume of the microporosity, the surface area of montmorillonite, and the Debye length, \(D_L\), which is the distance from the charged mineral surface to the point where electrical potential decays by a factor of e, needs to be known. The volume of the microporosity can then be calculated as \begin{equation*} \phi^\mathrm{EDL} = A_\mathrm{clay} D_L, \end{equation*} where \(A_\mathrm{clay}\) is the charged surface area of the clay mineral.

I cannot overstate how strange the multi-porosity description is. Leaving the abstract representation, here is an attempt to illustrate the implied clay structure, at the “macropore” scale

The view emerging from the above description is actually even more peculiar, as the “micro” and “macro” volume fractions are supposed to vary with the Debye length. A more general illustration of how the pore structure is supposed to function is shown in this animation (“I” denotes ionic strength)

What on earth could constitute such magic semi-permeable membranes?! (Note that they are also supposed to withstand the inevitable pressure difference.)

Here, the informed reader may object and point out that no researcher promoting multi-porosity has this magic pore structure in mind. Indeed, basically all multi-porosity publications instead vaguely claim that the domain separation occurs on the nanometer scale and present microscopic illustrations, like this (this is a simplified version of what is found in Alt-Epping et al. (2015))

In the remainder of this post I will discuss how the idea of a domain separation on the microscopic scale is even more preposterous than the magic membranes suggested above. We focus on three aspects:

  • The implied structure of the free water domain
  • The arbitrary domain division
  • Donnan equilibrium on the microscopic scale is not really a valid concept

Implied structure of the free water domain

I’m astonished by how little figures of the microscopic scale are explained in many publications. For instance, the illustration above clearly suggests that “free water” is an interface region with exactly the same surface area as the “double layer”. How can that make sense? Also, if the above structure is to be taken seriously it is crucial to specify the extensions of the various water layers. It is clear that the figure shows a microscopic view, as it depicts an actual diffuse layer.4 A diffuse layer width varies, say, in the range 1 – 100 nm,5 but authors seldom reveal if we are looking at a pore 1 nm wide or several hundred nm wide. Often we are not even shown a pore — the water film just ends in a void, as in the above figure.6

The vague nature of these descriptions indicates that they are merely “decorations”, providing a microscopic flavor to what in effect still is a macroscopic model formulation. In practice, most multi-porosity formulations provide some ad hoc mean to calculate the volume of the diffuse layer domain, while the free water porosity is either obtained by subtracting the diffuse layer porosity from total porosity, or by just specifying it. Alt-Epping et al. (2015), for example, simply specifies the “macroporosity”

The total porosity amounts to 47.6 % which is divided into 40.5 % microporosity (EDL) and 7.1 % macroporosity (free water). From the microporosity and the surface area of montmorillonite (Table 7), the Debye length of the EDL calculated from Eq. 11 is 4.97e-10 m.

Clearly, nothing in this description requires or suggests that the “micro” and “macroporosities” are adjacent waterfilms on the nm-scale. On the contrary, such an interpretation becomes quite grotesque, with the “macroporosity” corresponding to half a monolayer of water molecules! An illustration of an actual pore of this kind would look something like this

This interpretation becomes even more bizarre, considering that Alt-Epping et al. (2015) assume advection to occur only in this half-a-monolayer of water, and that the diffusivity is here a factor 1000 larger than in the “microporosity”.

As another example, Appelo and Wersin (2007) model a cylindrical sample of “Opalinus clay” of height 0.5 m and radius 0.1 m, with porosity 0.16, by discretizing the sample volume in 20 sections of width 0.025 m. The void volume of each section is consequently \(V_\mathrm{void} = 0.16\cdot\pi\cdot 0.1^2\cdot 0.025\;\mathrm{m^3} = 1.257\cdot10^{-4}\;\mathrm{m^3}\). Half of this volume (“0.062831853” liter) is specified directly in the input file as the volume of the free water;7 again, nothing suggests that this water should be distributed in thin films on the nm-scale. Yet, Appelo and Wersin (2007) provide a figure, with no length scale, similar in spirit to that above, that look very similar to this

They furthermore write about this figure (“Figure 2”)

It should be noted that the model can zoom in on the nm-scale suggested by Figure 2, but also uses it as the representative form for the cm-scale or larger.

I’m not sure I can make sense of this statement, but it seems that they imply that the illustration can serve both as an actual microscopic representation of two spatially separated domains and as a representation of two abstract continua on the macroscopic scale. But this is not true!

Interpreted macroscopically, the vertical dimension is fictitious, and the two continua are in equilibrium in each paired cell. On a microscopic scale, on the other hand, equilibrium between paired cells cannot be assumed a priori, and it becomes crucial to specify both the vertical and horizontal length scales. As Appelo and Wersin (2007) formulate their model assuming equilibrium between paired cells, it is clear that the above figure must be interpreted macroscopically (the only reference to a vertical length scale is that the “free solution” is located “at infinite distance” from the surface).

We can again work out the implications of anyway interpreting the model microscopically. Each clay cell is specified to contain a surface area of \(A_\mathrm{surf}=10^5\;\mathrm{m^2}\).8 Assuming a planar geometry, the average pore width is given by (\(\phi\) denotes porosity and \(V_\mathrm{cell}\) total cell volume)

\begin{equation} d = 2\cdot \phi \cdot \frac{V_\mathrm{cell}}{A_\mathrm{surf}} = 2\cdot \frac{V_\mathrm{void}}{A_\mathrm{surf}} = 2\cdot \frac{1.26\cdot 10^{-4}\;\mathrm{m^3}}{10^{5}\;\mathrm{m^2}} = 2.51\;\mathrm{nm} \end{equation}

The double layer thickness is furthermore specified to be 0.628 nm.9 A microscopic interpretation of this particular model thus implies that the sample contains a single type of pore (2.51 nm wide) in which the free water is distributed in a thin film of width 1.25 nm — i.e. approximately four molecular layers of water!

Rather than affirming that multi-porosity model formulations are macroscopic at heart, parts of the bentonite research community have instead doubled down on the confusing idea of having free water distributed on the nm-scale. Tournassat and Steefel (2019) suggest dealing with the case of two parallel charged surfaces in terms of a “Dual Continuum” approach, providing a figure similar to this (surface charge is -0.11 C/m2 and external solution is 0.1 M of a 1:1 electrolyte)

Note that here the perpendicular length scale is specified, and that it is clear from the start that the electrostatic potential is non-zero everywhere. Yet, Tournassat and Steefel (2019) mean that it is a good idea to treat this system as if it contained a 0.7 nm wide bulk water slice at the center of the pore. They furthermore express an almost “postmodern” attitude towards modeling, writing

It should be also noted here that this model refinement does not imply necessarily that an electroneutral bulk water is present at the center of the pore in reality. This can be appreciated in Figure 6, which shows that the Poisson–Boltzmann predicts an overlap of the diffuse layers bordering the two neighboring surfaces, while the dual continuum model divides the same system into a bulk and a diffuse layer water volume in order to obtain an average concentration in the pore that is consistent with the Poisson–Boltzmann model prediction. Consequently, the pore space subdivision into free and DL water must be seen as a convenient representation that makes it possible to calculate accurately the average concentrations of ions, but it must not be taken as evidence of the effective presence of bulk water in a nanoporous medium.

I can only interpret this way of writing (“…does not imply necessarily that…”, “…must not be taken as evidence of…”) that they mean that in some cases the bulk phase should be interpreted literally, while in other cases the bulk phase should be interpreted just as some auxiliary component. It is my strong opinion that such an attitude towards modeling only contributes negatively to process understanding (we may e.g. note that later in the article, Tournassat and Steefel (2019) assume this perhaps non-existent bulk water to be solely responsible for advective flow…).

I say it again: no matter how much researchers discuss them in microscopic terms, these models are just macroscopic formulations. Using the terminology of Tournassat and Steefel (2019), they are, at the end of the day, represented as dual continua assumed to be in local equilibrium (in accordance with the first figure of this post). And while researchers put much effort in trying to give these models a microscopic appearance, I am not aware of anyone suggesting a reasonable candidate for what actually could constitute the semi-permeable component necessary for maintaining such an equilibrium.

Arbitrary division between diffuse layer and free water

Another peculiarity in the multi-porosity descriptions showing that they cannot be interpreted microscopically is the arbitrary positioning of the separation between diffuse layer and free water. We saw earlier that Alt-Epping et al. (2015) set this separation at one Debye length from the surface, where the electrostatic potential is claimed to have decayed by a factor of e. What motivates this choice?

Most publications on multi-porosity models define free water as a region where the solution is charge neutral, i.e. where the electrostatic potential is vanishingly small.10 At the point chosen by Alt-Epping et al. (2015), the potential is about 37% of its value at the surface. This cannot be considered vanishingly small under any circumstance, and the region considered as free water is consequently not charge neutral.

The diffuse layer thickness chosen by Appelo and Wersin (2007) instead corresponds to 1.27 Debye lengths. At this position the potential is about 28% of its value at the surface, which neither can be considered vanishingly small. At the mid point of the pore (1.25 nm), the potential is about 8%11 of the value at the surface (corresponding to about 2.5 Debye lengths). I find it hard to accept even this value as vanishingly small.

Note that if the boundary distance used by Appelo and Wersin (2007) (1.27 Debye lengths) was used in the benchmark of Alt-Epping et al. (2015), the diffuse layer volume becomes larger than the total pore volume! In fact, this occurs in all models of this kind for low enough ionic strength, as the Debye length diverges in this limit. Therefore, many multi-porosity model formulations include clunky “if-then-else” clauses,12 where the system is treated conceptually different depending on whether or not the (arbitrarily chosen) diffuse layer domain fills the entire pore volume.13

In the example from Tournassat and Steefel (2019) the extension of the diffuse layer is 1.6 nm, corresponding to about 1.69 Debye lengths. The potential is here about 19% of the surface value (the value in the midpoint is 12% of the surface value). Tournassat and Appelo (2011) uses yet another separation distance — two Debye lengths — based on misusing the concept of exclusion volume in the Gouy-Chapman model.

With these examples, I am not trying to say that a better criterion is needed for the partitioning between diffuse layer and bulk. Rather, these examples show that such a partitioning is quite arbitrary on a microscopic scale. Of course, choosing points where the electrostatic potential is significant makes no sense, but even for points that could be considered having zero potential, what would be the criterion? Is two Debye lengths enough? Or perhaps four? Why?

These examples also demonstrate that researchers ultimately do not have a microscopic view in mind. Rather, the “microscopic” specifications are subject to the macroscopic constraints. Alt-Epping et al. (2015), for example, specifies a priori that the system contains about 15% free water, from which it follows that the diffuse layer thickness must be set to about one Debye length (given the adopted surface area). Likewise, Appelo and Wersin (2007) assume from the start that Opalinus clay contains 50% free water, and set up their model accordingly.14 Tournassat and Steefel (2019) acknowledge their approach to only be a “convenient representation”, and don’t even relate the diffuse layer extension to a specific value of the electrostatic potential.15 Why the free water domain anyway is considered to be positioned in the center of the nanopore is a mystery to me (well, I guess because sometimes this interpretation is supposed to be taken literally…).

Note that none of the free water domains in the considered models are actually charged, even though the electrostatic potential in the microscopic interpretations is implied to be non-zero. This just confirms that such interpretations are not valid, and that the actual model handling is the equilibration of two (or more) macroscopic, abstract, continua. The diffuse layer domain is defined by following some arbitrary procedure that involves microscopic concepts. But just because the diffuse layer domain is quantified by multiplying a surface area by some multiple of the Debye length does not make it a microscopic entity.4

Donnan effect on the microscopic scale?!

Although we have already seen that we cannot interpret multi-porosity models microscopically, we have not yet considered the weirdest description adopted by basically all proponents of these models: they claim to perform Donnan equilibrium calculations between diffuse layer and free water regions on the microscopic scale!

The underlying mechanism for a Donnan effect is the establishment of charge separation, which obviously occur on the scale of the ions, i.e. on the microscopic scale. Indeed, a diffuse layer is the manifestation of this charge separation. Donnan equilibrium can consequently not be established within a diffuse layer region, and discontinuous electrostatic potentials only have meaning in a macroscopic context.

Consider e.g. the interface between bentonite and an external solution in the homogeneous mixture model. Although this model ignores the microscopic scale, it implies charge separation and a continuously varying potential on this scale, as illustrated here

The regions where the potential varies are exactly what we categorize as diffuse layers (exemplified in two ideal microscopic geometries).

The discontinuous potentials encountered in multi-porosity model descriptions (see e.g. the above “Dual Continuum” potential that varies discontinuously on the angstrom scale) can be drawn on paper, but don’t convey any physical meaning.

Here I am not saying that Donnan equilibrium calculations cannot be performed in multi-porosity models. Rather, this is yet another aspect showing that such models only have meaning macroscopically, even though they are persistently presented as if they somehow consider the microscopic scale.

An example of this confusion of scales is found in Alt-Epping et al. (2018), who revisit the benchmark problem of Alt-Epping et al. (2015) using an alternative approach to Donnan equilibrium: rather than directly calculating the equilibrium, they model the clay charge as immobile mono-valent anions, and utilize the Nernst-Planck equations. They present “the conceptual model” in a figure very similar to this one

This illustration simultaneously conveys both a micro- and macroscopic view. For example, a mineral surface is indicated at the bottom, suggesting that we supposedly are looking at an actual interface region, in similarity with the figures we have looked at earlier. Moreover, the figure contains entities that must be interpreted as individual ions, including the immobile “clay-anions”. As in several of the previous examples, no length scale is provided (neither perpendicular to, nor along the “surface”).

On the other hand, the region is divided into cells, similar to the illustration in Appelo and Wersin (2007). These can hardly have any other meaning than to indicate the macroscopic discretization in the adopted transport code (FLOTRAN). Also, as the “Donnan porosity” region contains the “clay-anions” it can certainly not represent a diffuse layer extending from a clay surface; the only way to make sense of such an “immobile-anion” solution is that it represents a macroscopic homogenized clay domain (a homogeneous mixture!).

Furthermore, if the figure is supposed to show the microscopic scale there is no Donnan effect, because there is no charge separation! Taking the depiction of individual ions seriously, the interface region should rather look something like this in equilibrium

This illustrates the fundamental problem with a Donnan effect between microscopic compartments: the effect requires a charge separation, whose extension is the same as the size of the compartments assumed to be in equilibrium.16

Despite the confusion of the illustration in Alt-Epping et al. (2018), it is clear that a macroscopic model is adopted, as in our previous examples. In this case, the model is explicitly 2-dimensional, and the authors utilize the “trick” to make diffusion much faster in the perpendicular direction compared to the direction along the “surface”. This is achieved either by making the perpendicular diffusivity very high, or by making the perpendicular extension small. In any case, a perpendicular length scale must have been specified in the model, even if it is nowhere stated in the article. The same “trick” for emulating Donnan equilibrium is also used by Jenni et al. (2017), who write

In the present model set-up, this approach was implemented as two connected domains in the z dimension: one containing all minerals plus the free porosity (z=1) and the other containing the Donnan porosity, including the immobile anions (CEC, z=2, Fig. 2). Reproducing instantaneous equilibrium between Donnan and free porosities requires a much faster diffusion between the porosity domains than along the porosity domains.

Note that although the perpendicular dimension (\(z\)) here is referred to without unit(!), this representation only makes sense in a macroscopic context.

Jenni et al. (2017) also provide a statement that I think fairly well sums up the multi-porosity modeling endeavor:17

In a Donnan porosity concept, cation exchange can be seen as resulting from Donnan equilibrium between the Donnan porosity and the free porosity, possibly moderated by additional specific sorption. In CrunchflowMC or PhreeqC (Appelo and Wersin, 2007; Steefel, 2009; Tournassat and Appelo, 2011; Alt-Epping et al., 2014; Tournassat and Steefel, 2015), this is implemented by an explicit partitioning function that distributes aqueous species between the two pore compartments. Alternatively, this ion partitioning can be modelled implicitly by diffusion and electrochemical migration (Fick’s first law and Nernst-Planck equations) between the free porosity and the Donnan porosity, the latter containing immobile anions representing the CEC. The resulting ion compositions of the two equilibrated porosities agree with the concentrations predicted by the Donnan equilibrium, which can be shown in case studies (unpublished results, Gimmi and Alt-Epping).

Ultimately, these are models that, using one approach or the other, simply calculates Donnan equilibrium between two abstract, macroscopically defined domains (“porosities”, “continua”). Microscopic interpretations of these models lead — as we have demonstrated — to multiple absurdities and errors. I am not aware of any multi-porosity approach that has provided any kind of suggestion for what constitutes the semi-permeable component required for maintaining the equilibrium they are supposed to describe. Alternatively expressed: what, in the previous figure, prevents the “immobile anions” from occupying the entire clay volume?

The most favorable interpretation I can make of multi-porosity approaches to bentonite modeling is a dynamically varying “macroporosity”, involving magical membranes (shown above). This, in itself, answers why I cannot take multi-porosity models seriously. And then we haven’t yet mentioned the flawed treatment of diffusive flux.

Footnotes

[1] This category has many other names, e.g. “dual porosity” and “dual continuum”, models. Here, I mostly use the term “multi-porosity” to refer to any model of this kind.

[2] These compartments have many names in different publications. The “diffuse layer” domain is also called e.g. “electrical double layer (EDL)”, “diffuse double layer (DDL)”, “microporosity”, or “Donnan porosity”, and the “free water” is also called e.g. “macroporosity”, “bulk water”, “charge-free” (!), or “charge-neutral” porewater. Here I will mostly stick to using the terms “diffuse layer” and “free water”.

[3] This lack of a full description is very much related to the incomplete description of so-called “stacks” — I am not aware of any reasonable suggestion of a mechanism for keeping stacks together.

[4] Note the difference between a diffuse layer and a diffuse layer domain. The former is a structure on the nm-scale; the latter is a macroscopic, abstract model component (a continuum).

[5] The scale of an electric double layer is set by the Debye length, \(\kappa^{-1}\). From the formula for a 1:1 electrolyte, \(\kappa^{-1} = 0.3 \;\mathrm{nm}/\sqrt{I}\), the Debye length is seen to vary between 0.3 nm and 30 nm when ionic strength is varied between 1.0 M to 0.0001 M (\(I\) is the numerical value of the ionic strength expressed in molar units). Independent of the value of the factor used to multiply \(\kappa^{-1}\) in order to estimate the double layer extension, I’d say that the estimation 1 – 100 nm is quite reasonable.

[6] Here, the informed reader may perhaps point out that authors don’t really mean that the free water film has exactly the same geometry as the diffuse layer, and that figures like the one above are more abstract representations of a more complex structure. Figures of more complex pore structures are actually found in many multi-porosity papers. But if it is the case that the free water part is not supposed to be interpreted on the microscopic scale, we are basically back to a magic membrane picture of the structure! Moreover, if the free water is not supposed to be on the microscopic scale, the diffuse layer will always have a negligible volume, and these illustrations don’t provide a mean for calculating the partitioning between “micro” and “macroporosity”.

It seems to me that not specifying the extension of the free water is a way for authors to dodge the question of how it is actually distributed (and, as a consequence, to not state what constitutes the semi-permeable component).

[7] The PHREEQC input files are provided as supplementary material to Appelo and Wersin (2007). Here I consider the input corresponding to figure 3c in the article. The free water is specified with keyword “SOLUTION”.

[8] Keyword “SURFACE” in the PHREEQC input file for figure 3c in the paper.

[9] Using the identifier “-donnan” for the “SURFACE” keyword.

[10] We assume a boundary condition such that the potential is zero in the solution infinitely far away from any clay component.

[11] Assuming exponential decay, which is only strictly true for a single clay layer of low charge.

[12] For example, Tournassat and Steefel (2019) write (\(f_{DL}\) denotes the volume fraction of the diffuse layer):

In PHREEQC and CrunchClay, the volume of the diffuse layer (\(V_{DL}\) in m3), and hence the \(f_{DL}\) value, can be defined as a multiple of the Debye length in order to capture this effect of ionic strength on \(f_{DL}\): \begin{equation*} V_{DL} = \alpha_{DL}\kappa^{-1}S \tag{22} \end{equation*} \begin{equation*} f_{DL} = V_{DL}/V_{pore} \end{equation*} […] it is obvious that \(f_{DL}\) cannot exceed 1. Equation (22) must then be seen as an approximation, the validity of which may be limited to small variations of ionic strength compared to the conditions at which \(f_{DL}\) is determined experimentally. This can be appreciated by looking at the results obtained with a simple model where: \begin{equation*} \alpha_{DL} = 2\;\mathrm{if}\;4\kappa^{-1} \le V_{pore}/S\;\mathrm{and,} \end{equation*} \begin{equation*} f_{DL} = 1 \;\mathrm{otherwise.} \end{equation*}

[13] Some tools (e.g. PHREEQC) allow to put a maximum size limit on the diffuse layer domain, independent of chemical conditions. This is of course only a way for the code to “work” under all conditions.

[14] As icing on the cake, these estimations of free water in bentonite (15%) and Opalinus clay (50%) appear to be based on the incorrect assumption that “anions” only reside in such compartments. In the present context, this handling is particularly confusing, as a main point with multi-porosity models (I assume?) is to evaluate ion concentrations in other types of compartments.

[15] Yet, Tournassat and Steefel (2019) sometimes seem to favor the choice of two Debye lengths (see footnote 12), for unclear reasons.

[16] Donnan equilibrium between microscopic compartments can be studied in molecular dynamics simulations, but they require the considered system to be large enough for the electrostatic potential to reach zero. The semi-permeable component in such simulations is implemented by simply imposing constraints on the atoms making up the clay layer.

[17] I believe the referred unpublished results now are published: Gimmi and Alt-Epping (2018).

Semi-permeability, part I

Descriptions in bentonite literature

What do authors mean when they say that bentonite has semi-permeable properties? Take for example this statement, from Bradbury and Baeyens (2003)1

[…] highly compacted bentonite can function as an efficient semi-permeable membrane (Horseman et al., 1996). This implies that the re-saturation of compacted bentonite involves predominantly the movement of water molecules and not solute molecules.

Judging from the reference to Horseman et al. (1996) — which we look at below — it is relatively clear that Bradbury and Baeyens (2003) allude to the concept of salt exclusion when speaking of “semi-permeability” (although writing “solute molecules”). But a lowered equilibrium salt concentration does not automatically say that salt is less transferable.

A crucial question is what the salt is supposed to permeate. Note that a semi-permeable component is required for defining both swelling pressure and salt exclusion. In case of bentonite, this component is impermeable to the clay particles, while it is fully permeable to ions and water (in a lab setting, it is typically a metal filter). But Bradbury and Baeyens (2003) seem to mean that in the process of transferring aqueous species between an external reservoir and bentonite, salt is somehow effectively hindered to be transferred. This does not make much sense.

Consider e.g. the process mentioned in the quotation, i.e. to saturate a bentonite sample with a salt solution. With unsaturated bentonite, most bets are off regarding Donnan equilibrium, and how salt is transferred depends on the details of the saturation procedure; we only know that the external and internal salt concentrations should comply with the rules for salt exclusion once the process is finalized.

Imagine, for instance, an unsaturated sample containing bentonite pellets on the cm-scale that very quickly is flushed with the saturating solution, as illustrated in this state-of-the-art, cutting-edge animation

The evolution of the salt concentration in the sample will look something like this

Initially, as the saturating solution flushes the sample, the concentration will be similar to that of the external concentration (\(c_\mathrm{ext}\)). As the sample reaches saturation, it contains more salt than what is dictated by Donnan equilibrium (\(c_\mathrm{eq.}\)), and salt will diffuse out.

In a process like this it should be obvious that the bentonite not in any way is effectively impermeable to the salt. Note also that, although this example is somewhat extreme, the equilibrium salt concentration is probably reached “from above” in most processes where the clay is saturated with a saline solution: too much salt initially enters the sample (when a “microstructure” actually exists) and is later expelled.

Also for mass transfer between an external solution and an already saturated sample does it not make sense to speak of “semi-permeability” in the way here discussed. Consider e.g. a bentonite sample initially in equilibrium with an external 0.3 M NaCl solution, where the solution suddenly is switched to 1.0 M. Salt will then start to diffuse into the sample until a new (Donnan) equilibrium state is reached. Simultaneously (a minute amount of) water is transported out of the clay, in order for the sample to adapt to the new equilibrium pressure.2

There is nothing very “semi-permeabilic” going on here — NaCl is obviously free to pass into the clay. That the equilibrium clay concentration in the final state happens to be lower than in the external concentration is irrelevant for how how difficult it is to transfer the salt.

But it seems that many authors somehow equate “semi-permeability” with salt exclusion, and also mean that this “semi-permeability” is caused by reduced mobility for ions within the clay. E.g. Horseman et al. (1996) write (in a section entitled “Clays as semi-permeable membranes”)

[…] the net negative electrical potential between closely spaced clay particles repel anions attempting to migrate through the narrow aqueous films of a compact clay, a phenomenon known as negative adsorption or Donnan exclusion. In order to maintain electrical neutrality in the external solution, cations will tend to remain with their counter-ions and their movement through the clay will also be restricted (Fritz, 1986). The overall effect is that charged chemical species do not move readily through a compact clay and neutral water molecules may be able to pass more freely.

It must be remembered that Donnan exclusion occurs in many systems other than “compact clay”. By instead considering e.g. a ferrocyanide solution, it becomes clear that salt exclusion has nothing to do with how hindered the ions are to move in the system (as long as they move). KCl is, of course, not excluded from a potassium ferrocyanide system because ferrocyanide repels chloride, nor does such interactions imply restricted mobility (repulsion occurs in all salt solutions). Similarly, salt is not excluded from bentonite because of repulsion between anions and surfaces (also, a negative potential does not repel anything — charge does).

In the above quotation it is easy to spot the flaw in the argument by switching roles of anions and cations; you may equally incorrectly say that cations are attracted, and that anions tag along in order to maintain charge neutrality.

The idea that “semi-permeability” (and “anion” exclusion) is caused by mobility restrictions for the ions within the bentonite, while water can “pass more freely” is found in many places in the bentonite literature. E.g. Shackelford and Moore (2013) write (where, again, potentials are described as repelling)

In [the case of bentonite], when the clay is compressed to a sufficiently high density such that the pore spaces between adjacent clay particles are minimized to the extent that the electrostatic (diffuse double) layers surrounding the particles overlap, the overlapping negative potentials repel invading anions such that the pore becomes excluded to the anion. Cations also may be excluded to the extent that electrical neutrality in solution is required (e.g., Robinson and Stokes, 1959).


This phenomenon of anion exclusion also is responsible for the existence of semipermeable membrane behavior, which refers to the ability of a porous medium to restrict the migration of solutes, while allowing passage of the solvent (e.g., Shackelford, 2012).

Chagneau et al. (2015) write

[…] TOT layers bear a negative structural charge that is compensated by cation accumulation and anion depletion near their surfaces in a region known as the electrical double layer (EDL). This property gives clay materials their semipermeable membrane properties: ion transport in the clay material is hindered by electrostatic repulsion of anions from the EDL porosity, while water is freely admitted to the membrane.

and Tournassat and Steefel (2019) write (where, again, we can switch roles of “co-” and “counter-ions”, to spot one of the flaws)

The presence of overlapping diffuse layers in charged nanoporous media is responsible for a partial or total repulsion of co-ions from the porosity. In the presence of a gradient of bulk electrolyte concentration, co-ion migration through the pores is hindered, as well as the migration of their counter-ion counterparts because of the electro-neutrality constraint. This explains the salt-exclusionary properties of these materials. These properties confer these media with a semi-permeable membrane behavior: neutral aqueous species and water are freely admitted through the membrane while ions are not, giving rise to coupled transport processes.

I am quite puzzled by these statements being so commonplace.3 It does not surprise me that all the quotations basically state some version of the incorrect notion that salt exclusion is caused by electrostatic repulsion between anions and surfaces — this is, for some reason, an established “explanation” within the clay literature.4 But all quotations also state (more or less explicitly) that ions (or even “solutes”) are restricted, while water can move freely in the clay. Given that one of the main features of compacted bentonite components is to restrict water transport, with hydraulic conductivities often below 10-13 m/s, I don’t really know what to say.

Furthermore, one of the most investigated areas in bentonite research is the (relatively) high cation transport capacity that can be achieved under the right conditions. In this light, I find it peculiar to claim that bentonite generally impedes ion transport in relation to water transport.

Bentonite as a non-ideal semi-permeable membrane

As far as I see, authors seem to confuse transport between external solutions and clay with processes that occur between two external solutions separated by a bentonite component. Here is an example of the latter set-up

The difference in concentration between the two solutions implies water transport — i.e. osmosis — from the reservoir with lower salt concentration to the reservoir with higher concentration. In this process, the bentonite component as a whole functions as the membrane.

The bentonite component has this function because in this process it is more permeable to water than to salt (which has a driving force to be transported from the high concentration to the low concentration reservoir). This is the sense in which bentonite can be said to be semi-permeable with respect to water/salt. Note:

  • Salt is still transported through the bentonite. Thus, the bentonite component functions fundamentally only as a non-ideal membrane.
  • Zooming in on the bentonite component in the above set-up, we note that the non-ideal semi-permeable functionality emerges from the presence of two ideal semi-permeable components. As discussed above, the ideal semi-permeable components (metal filters) keep the clay particles in place.
  • The non-ideal semi-permeability is a consequence of salt exclusion. But these are certainly not the same thing! Rather, the implication is: Ideal semi-permeable components (impermeable to clay) \(\rightarrow\) Donnan effect \(\rightarrow\) Non-ideal semi-permeable membrane functionality (for salt)
  • The non-ideal functionality means that it is only relevant during non-equilibrium. E.g., a possible (osmotic) pressure increase in the right compartment in the illustration above will only last until the salt has had time to even out in the two reservoirs; left to itself, the above system will eventually end up with identical conditions in the two reservoirs. This is in contrast to the effect of an ideal membrane, where it makes sense to speak of an equilibrium osmotic pressure.
  • None of the above points depend critically on the membrane material being bentonite. The same principal functionality is achieved with any type of Donnan system. One could thus imagine replacing the bentonite and the metal filters with e.g. a ferrocyanide solution and appropriate ideal semi-permeable membranes. I don’t know if this particular system ever has been realized, but e.g. membranes based on polyamide rather than bentonite seems more commonplace in filtration applications (we have now opened the door to the gigantic fields of membrane and filtration technology). From this consideration it follows that “semi-permeability” cannot be attributed to anything bentonite specific (such as “overlapping double layers”, or direct interaction with charged surfaces).
  • I think it is important to remember that, even if bentonite is semi-permeable in the sense discussed, the transfer of any substance across a compacted bentonite sample is significantly reduced (which is why we are interested in using it e.g. for confining waste). This is true for both water and solutes (perhaps with the exception of some cations under certain conditions).

“Semi-permeability” in experiments

Even if bentonite is not semi-permeable in the sense described in many places in the literature, its actual non-ideal semi-preamble functionality must often be considered in compacted clay research. Let’s have look at some relevant cases where a bentonite sample is separated by two external solution reservoirs.

Tracer through-diffusion

The simplest set-up of this kind is the traditional tracer through-diffusion experiment. Quite a lot of such tests have been published, and we have discussed various aspects of this research in earlier blog posts.

The traditional tracer through-diffusion test maintains identical conditions in the two reservoirs (the same chemical compositions and pressures) while adding a trace amount of the diffusing substance to the source reservoir. The induced tracer flux is monitored by measuring the amount of tracer entering the target reservoir.

In this case the chemical potential is identical in the two reservoirs for all components other than the tracer, and no additional transport processes are induced. Yet, it should be kept in mind that both the pressure and the electrostatic potential is different in the bentonite as compared with the reservoirs. The difference in electrostatic potential is the fundamental reason for the distinctly different diffusional behavior of cations and anions observed in these types of tests: as the background concentration is lowered, cation fluxes increase indefinitely (for constant external tracer concentration) while anion fluxes virtually vanish.

Tracer through-diffusion is often quantified using the parameter \(D_e\), defined as the ratio between steady-state flux and the external concentration gradient.5 \(D_e\) is thus a type of ion permeability coefficient, rather than a diffusion coefficient, which it nevertheless often is assumed to be.

Typically we have that \(D_e^\mathrm{cation} > D_e^\mathrm{water} > D_e^\mathrm{anion}\) (where \(D_e^\mathrm{cation}\) in principle may become arbitrary large). This behavior both demonstrates the underlying coupling to electrostatics, and that “charged chemical species” under these conditions hardly can be said to move less readily through the clay as compared with water molecules.

Measuring hydraulic conductivity

A second type of experiment where only a single component is transported across the clay is when the reservoirs contain pure water at different pressures. This is the typical set-up for measuring the so-called hydraulic conductivity of a clay component.6

Even if no other transport processes are induced (there is nothing else present to be transported), the situation is here more complex than for the traditional tracer through-diffusion test. The difference in water chemical potential between the two reservoirs implies a mechanical coupling to the clay, and a corresponding response in density distribution. An inhomogeneous density, in turn, implies the presence of an electric field. Water flow through bentonite is thus fundamentally coupled to both mechanical and electrical processes.

In analogy with \(D_e\), hydraulic conductivity is defined as the ratio between steady-state flow and the external pressure gradient. Consequently, hydraulic conductivity is an effective mass transfer coefficient that don’t directly relate to the fundamental processes in the clay.

An indication that water flow through bentonite is more subtle than what it may seem is the mere observation that the hydraulic conductivity of e.g. pure Na-montmorillonite at a porosity of 0.41 is only 8·10-15 m/s. This system thus contains more than 40% water volume-wise, but has a conductivity below that of unfractioned metamorphic and igneous rocks! At the same time, increasing the porosity by a factor 1.75 (to 0.72), the hydraulic conductivity increases by a factor of 75! (to 6·10-13 m/s7)

Mass transfer in a salt gradient

Let’s now consider the more general case with different chemical compositions in the two reservoirs, as well as a possible pressure difference (to begin with, we assume equal pressures).

Even with identical hydrostatic pressures in the reservoirs, this configuration will induce a pressure response, and consequently a density redistribution, in the bentonite. There will moreover be both an osmotic water flow from the right to the left reservoir, as well as a diffusive solute flux in the opposite direction. This general configuration thus necessarily couples hydraulic, mechanical, electrical, and chemical processes.

This type of configuration is considered e.g. in the study of osmotic effects in geological settings, where a clay or shale formation may act as a membrane.8 But although this configuration is highly relevant for engineered clay barrier systems, I cannot think of very many studies focused on these couplings (perhaps I should look better).

For example, most through-diffusion studies are of the tracer type discussed above, although evaluated parameters are often used in models with more general configurations (e.g. with salt or pressure gradients). Also, I am not aware of any measurements of hydraulic conductivity in case of a salt gradient (but the same hydrostatic pressure), and I am even less aware of such values being compared with those evaluated in conventional tests (discussed previously).

A quite spectacular demonstration that mass transfer may occur very differently in this general configuration is the seeming steady-state uphill diffusion effect: adding an equal concentration of a cation tracer to the reservoirs in a set-up with a maintained difference in background concentration, a tracer concentration difference spontaneously develops. \(D_e\) for the tracer can thus equal infinity,9 or be negative (definitely proving that this parameter is not a diffusion coefficient). I leave it as an exercise to the reader to work out how “semi-permeable” the clay is in this case. Update (240822): The “uphill” diffusion effect is further discussed here.

A process of practical importance for engineered clay barrier systems is hyperfiltration of salts. This process will occur when a sufficient pressure difference is applied over a bentonite sample contacted with saline solutions. Water and salt will then be transferred in the same direction, but, due to exclusion, salt will accumulate on the inlet side. A steady-state concentration profile for such a process may look like this

The local salt concentration at the sample interface on the inlet side may thus be larger than the concentration of the injected solution. This may have consequences e.g. when evaluating hydraulic conductivity using saline solutions.

Hyperfiltration may also influence the way a sample becomes saturated, if saturated with a saline solution. If the region near the inlet is virtually saturated, while regions farther into the sample still are unsaturated, hyperfiltration could occur. In such a scenario the clay could in a sense be said to be semi-permeable (letting through water and filtrating salts), but note that the net effect is to transfer more salt into the sample than what is dictated by Donnan equilibrium with the injected solution (which has concentration \(c_1\), if we stick with the figure above). Salt will then have to diffuse out again, in later stages of the process, before full equilibrium is reached. This is in similarity with the saturation process that we considered earlier.

Footnotes

[1] We have considered this study before, when discussing the empirical evidence for salt in interlayers.

[2] This is more than a thought-experiment; a test just like this was conducted by Karnland et al. (2005). Here is the recorded pressure response of a Na-montmorillonite sample (dry density 1.4 g/cm3) as it is contacted with NaCl solutions of increasing concentration

We have considered this study earlier, as it proves that salt enters interlayers.

[3] As a side note, is the region near the surface supposed to be called “diffuse layer”, “electrical double layer”, or “electrostatic (diffuse double) layer”?

[4] Also Fritz (1986), referenced in the quotation by Horseman et al. (1996), states a version of this “explanation”.

[5] This is not a gradient in the mathematical sense, but is defined as \( \left (c_\mathrm{target} – c_\mathrm{source} \right)/L\), where \(L\) is sample length.

[6] Hydraulic conductivity is often also measured using a saline solution, which is commented on below.

[7] Which still is an a amazingly small hydraulic conductivity, considering the the water content.

[8] The study of Neuzil (2000) also provides clear examples of water moving out of the clay, and salt moving in, in similarity with the process considered above.

[9] Mathematically, the statement “equal infinity” is mostly nonsense, but I am trying to convey that a there is a tracer flux even without any external tracer concentration difference.

Molecular dynamics simulations do not support complete anion exclusion

We have discussed various aspects of “anion exclusion” on this blog. This concept is often used to justify multi-porosity models of compacted bentonite, by reasoning that the exclusion mechanism makes parts of the pore space inaccessible to anions. But we have seen that this reasoning has no theoretical backup: studies making such assumptions usually turn out to refer to conventional electric double layer theory, described e.g. by the Poisson-Boltzmann equation. In the following, we refer to the notion of compartments inaccessible to anions as complete anion exclusion.

In fact, a single, physically reasonable concept underlies basically all descriptions of anion exclusion in the clay literature: charge separation. Although the required mathematics may differ for different systems — may it be using Donnan’s “classical equations”, or the Poisson-Boltzmann equation — the underlying mechanism is the same. In the following we refer to this type of description as traditional theory or Donnan theory. It is important to recognize that traditional theory is incompatible with complete anion exclusion: the Poisson-Boltzmann equation predicts anions everywhere.

In more recent years, however, a different meaning of the term “anion exclusion” has sneaked into the literature. This seems to be related to the dawn of molecular dynamics (MD) simulations of clays. In particular, the study of Rotenberg et al. (2007) — which I think is the first published MD simulation of montmorillonite interlayers in contact with an external compartment — is frequently cited as demonstrating qualitatively different results as compared with the traditional models. E.g. Kosakowski and Berner (2013) write

Very often it is assumed that negatively charged ions are strongly hindered to enter the interlayer space (Kosakowski et al., 2008; Rotenberg et al., 2007), although other authors come to different conclusions (Karnland et al., 2007). Note that we favor the former view with our montmorillonite setup.

Although the terms “assumed” and “conclusions” seem misplaced, it is clear that Kosakowski and Berner (2013) mean that the interlayer space is essentially anion-free, rather than obeying ordinary Donnan equilibrium (the approach used in Karnland et al. (2007)).

A similar citation is found in Tournassat and Steefel (2015)

The interlayer space can be seen as an extreme case where the diffuse layer vanishes leaving only the Stern layer of the adjacent basal surfaces. For this reason, the interlayer space is often considered to be completely free of anions (Tournassat and Appelo 2011), although this hypothesis is still controversial (Rotenberg et al. 2007c; Birgersson and Karnland 2009).

Here Tournassat and Steefel (2015) conceive of the interlayer space as something distinctly different from a diffuse layer,1 and they mean that the MD result stands in contrast to conventional Donnan theory (Birgersson and Karnland, (2009)).

As a third example, Wersin et al. (2016) write

Based upon [results from anion diffusion tests], anion-exclusion models have been formulated, which subdivide the water-filled pore space into interlayer, diffuse (or electric) double layer (DDL) and “free” water porosities (Wersin et al. 2004; Tournassat & Appelo 2011; Appelo 2013). In this formulation, anions are considered to reside in the “free” electrically neutral solution and in the DDL in the external (intergranular) pores, whereas the interlayer (intragranular) space is considered devoid of anions. Support for this model has been given by molecular dynamics simulations (Rotenberg et al. 2007), but this issue remains controversial (Birgersson & Karnland 2009)

The term “anion-exclusion” is here fully transformed to refer to complete exclusion, rather than to the traditional theory from which the term was coined. Note that the picture of bentonite given in this and the previous quotations is basically the contemporary mainstream view, which we discussed in a previous blog post. This description has not emerged from considering MD results that are allegedly in contradiction with traditional Donnan equilibrium theory. Rather, it has resulted from misusing the concept of exclusion-volume. The study of Rotenberg et al. (2007) (Rot07, in the following) supports the contemporary mainstream view only to the extent that it is at odds with the predictions of traditional theory. But is it? Let’s take a look at the relevant MD studies.

Rotenberg et al. (2007)

Rot07 is not primarily a study of the anion equilibrium, but considers more generally the transition of species between an external compartment2 and interlayer pores: water, cations (Na and Cs), and anions (Cl). The study only concerns interlayers with two monolayers of water, in the following referred to as a 2WL system. There is of course nothing wrong with exclusively studying the 2WL system, but this study alone cannot be used to support general model assumptions regarding interlayers (which anyway is commonplace, as we saw above). The meaning of the term “interlayer” in modern clay literature is quite confusing, but there is at least full consensus that it includes also states with three monolayers of water (3WL) (we’ll get back to those). Rot07 furthermore consider only a single external concentration, of 0.52 M.

Here is an illustration of the simulated system:

A cell (outlined with dashed lines) containing two montmorillonite layers (yellow) and six chloride ions (green) is repeated infinitely in all directions (the cell depth in the direction normal to the picture is 20.72 Å). While only chloride ions are indicated in this figure, also cations, water atoms, and montmorillonite atoms are explicitly accounted for in the simulation.

Note that the study neither varies density (interlayer distance) nor external concentration (number of chloride ions) — two variables essential for studying anion equilibrium. I don’t mean this as direct criticism, but it should be recognized when the study is used to support assumptions regarding interlayers in other models.

What I do want to criticize, however, is that Rot07 don’t actually compare with Donnan theory. Instead, they seem to be under the impression that traditional theory predicts complete exclusion in their system. Consider this passage in the introduction

Due to the negative charge of clay layers, anions should be repelled by the external surfaces, and excluded from the interlayers. On the contrary, cations are attracted by the surfaces, and may exchange with the natural interlayer counterions.

Here they associate two different terms with the anions: they are repelled by the “external surfaces” and excluded from “interlayers”. I can only interpret this as meaning that anions are completely excluded from interlayers, especially as the wording “on the contrary” is used when describing cations.3

The study comprises both a “plain” MD simulation of the (presumed) equilibrium state, and separate calculations of free energy profiles. In the “plain” MD simulation, anions do not enter the interlayers, and the calculation of the free energy profile gives a barrier of ~9 kT for chloride to enter the interlayer.

These results motivate the authors to conclude that the “thermal fluctuations do not allow anions to overcome the free energy barrier corresponding to their entrance into the interlayer” and that “anions are excluded from the interlayer: the probability for an anion reaching the interface to enter into the interlayer is very small (of the order of e-9 ~ 10-4)”

It is important to keep in mind that the authors are under the impression that this result and conclusion are in line with the traditional description of anion exclusion.3 When summarizing their findings they write

All the results are in agreement with the common sense on ionic exchange and anion exclusion.

and

The results confirm the generally admitted ideas of ionic exchange and anion exclusion

The problem is that this “common sense” and these “generally admitted ideas” are based on misconceptions of traditional theory (I also think one should be careful with using terms like these in scientific writing). Consequently, the authors erroneously conclude that their results confirm, rather than contrast, traditional theory. This is opposite to how this study is referred to in later publications, as was exemplified above.

The anion exclusion predicted from Donnan theory for the system in Rot07 is estimated as follows. The adopted montmorillonite unit cell (Na0.75Si8Al3.25Mg0.75O20OH4) has structural charge 0.75e, and lateral dimensions 8.97 Å × 5.18 Å. With an interlayer width of 6.1 Å we thus have for the concentration of interlayer charge

\begin{equation} c_{IL} = \frac{0.75/N_A}{8.97\cdot 5.18\cdot 6.1 \mathrm{Å^3}} = 4.39 \;\mathrm{M} \end{equation}

where \(N_A\) is the Avogadro constant. Using this value for \(c_{IL}\) in the expression for internal anion concentration in an ideal 1:1 Donnan system,

\begin{equation} c^\mathrm{int} = \frac{c_{IL}}{2} \left ( \sqrt{1+\frac{4\cdot (c^\mathrm{ext})^2}{c_{IL}^2}} – 1 \right ) \tag{1} \end{equation}

together with \(c^\mathrm{ext}\) = 0.52 M, gives

\begin{equation} c^\mathrm{int} = 0.06 \;\mathrm{M} \end{equation}

This should be the anion interlayer concentration expected from “generally admitted ideas”, and Rot07 should have concluded that their results differ by a factor ~1000 (or more) from traditional theory. This is not to say that the calculations are incorrect (more on that later), but it certainly puts the results in a different light. A discrepancy of this magnitude should reasonably be of interest to investigate further.

Hsiao and Hedström (2015)

Considerably more detailed MD simulations of the 2WL system are provided by Hsiao and Hedström (2015) (Hsi15, hereafter). In contrast to Rot07, Hsi15 specifically focus on the anion equilibrium, and they explicitly compare with both conventional Donnan theory, and the results of Rot07. In these simulations, chloride actually populates the interlayer.

Hsi15 also analyze the convergence behavior, by varying system size and simulation time. This analysis makes it clear both that most of the simulations presented in the paper are properly converged, and that the simulation of Rot07 is not. With external concentration 1.67 M, Hsi15 demonstrate that, during intervals of 20 ns, the interlayer concentration fluctuates between basically zero and 0.13 M (converged value: 0.04 M), in a system with similar size as that of Rot07. Given that the total simulation time of the earlier study is 20 ns, and that it also adopts a considerably lower external concentration, its result of zero chloride concentration in the interlayer is no surprise.

The converged interlayer concentrations in Hsi15 look like this in the direction normal to the basal surfaces (simulation time: 150 ns, layer size: 8 × 4 unit cells, external concentration: 1.67 M)

Note that the simulation contains two interlayer pores (indicated by the dotted lines; cf. the illustration of the simulated system) and that sodium and chloride populate the same central layer, sandwiched by the two water layers (not shown). The nearly identical chloride profiles is a strong confirmation that the simulation is converged.

The chloride interlayer concentrations evaluated in Hsi15 deviate strongly from the predictions of the ideal Donnan formula. With \(c_{IL}\) = 4.23 M (as reported in the article) and \(c^\mathrm{ext}\) = 1.67 M, eq. 1 gives \(c^\mathrm{int}\) = 0.580 M, while the MD results are in the range 0.033 M — 0.045 M, i.e. more than a factor 10 lower (but not a factor 1000).

Hsi15 also calculate the free energy profiles along the coordinate connecting the external compartment and the interlayer, similar to the technique utilized by Rot07 (as far as I understand). For the external concentration of 1.67 M they evaluate a free energy barrier of ~3.84 kT, which corresponds to an interlayer concentration of 0.036 M, and is in good agreement with the directly evaluated concentrations.

Note that Hsi15 — in contrast to Rot07 — conclude significant deviation between the MD results of the 2WL system and ideal traditional theory. Continuing their investigation (again, in contrast to Rot07), Hsi15 found that the contribution from ion hydration to the free energy barrier basically make up for the entire discrepancy with the ideal Donnan formula. Moreover, even though the ideal Donnan formula strongly overestimates the actual values obtained from MD, it still shows the correct dependency on external concentration: when the external concentration is lowered to 0.55 M, the evaluated free energy barrier increases to ~5.16 kT, which corresponds to a reduction of the internal concentration by about a factor of 10. This is in agreement with Donnan theory, which gives for the expected reduction (0.55/1.67)2 ≈ 0.11.

From the results of Hsi15 (and Rot07, for that matter), a relatively clear picture emerges: MD simulated 2WL systems function as Donnan systems. Anions are not completely excluded, and the dependency on external concentration is in line with what we expect from a varying Donnan potential across the interface between interlayer and external compartment (Hsi15 even comment on observing the space-charge region!).

The simulated 2WL system is, however, strongly non-ideal, as a consequence of the ions not being optimally hydrated. Hsi15 remark that the simulations probably overestimate this energy cost, e.g. because atoms are treated as non-polarizable. This warning should certainly be seriously considered before using the results of MD simulated 2WL systems to motivate multi-porosity in compacted bentonite. But, concerning assumptions of complete anion exclusion in interlayers, another system must obviously also be considered: 3WL.

Hedström and Karnland (2012)

MD simulations of anion equilibrium in the 3WL system are presented in Hedström and Karnland (2012) (Hed12, in the following). Hed12 consider three different external concentrations, by including either 12, 6, or 4 pairs of excess ions (Cl + Na+). This study also varies the way the interlayer charge is distributed, by either locating unit charges on specific magnesium atoms in the montmorillonite structure, or by evenly reducing the charge by a minor amount on all the octahedrally coordinated atoms.

Here are the resulting ion concentration profiles across the interlayer, for the simulation containing 12 chloride ions, and evenly distributed interlayer charge (simulation time: 20 ns, layer size: 4 × 4 unit cells)

Chloride mainly resides in the middle of the interlayer also in the 3WL system, but is now separated from sodium, which forms two off-center main layers. The dotted lines indicate the extension of the interlayer.

The main objectives of this study are to simply establish that anions in MD equilibrium simulations do populate interlayers, and to discuss the influence of unavoidable finite-size effects (6 and 12 are, after all, quite far from Avogadro’s number). In doing so, Hed12 demonstrate that the system obeys the principles of Donnan equilibrium, and behaves approximately in accordance with the ideal Donnan formula (eq. 1). The authors acknowledge, however, that full quantitative comparison with Donnan theory would require better convergence of the simulations (the convergence analysis was further developed in Hsi15). If we anyway make such a comparison, it looks like this

#Cl TOTLayer charge#Cl IL\(c^\mathrm{ext}\)\(c^\mathrm{int}\) (Donnan)\(c^\mathrm{int}\) (MD)
12distr.1.81.450.620.42 (67%)
12loc.1.41.500.660.32 (49%)
6distr.0.60.770.200.14 (70%)
6loc.1.30.670.150.30 (197%)
4distr.0.20.540.100.05 (46%)
4loc.0.180.540.100.04 (41%)

The first column lists the total number of chloride ions in the simulations, and the second indicates if the layer charge was distributed on all octahedrally coordinated atoms (“distr.”) or localized on specific atoms (“loc.”) The third column lists the average number of chloride ions found in the interlayer in each simulation. \(c^\mathrm{ext}\) denotes the corresponding average molar concentration in the external compartment. The last two columns lists the corresponding average interlayer concentration as evaluated either from the Donnan formula (eq. 1 with \(c_{IL}\) = 2.77 M, and the listed \(c^\mathrm{ext}\)), or from the simulation itself.

The simulated results are indeed within about a factor of 2 from the predictions of ideal Donnan theory, but they also show a certain variation in systems with the same number of total chloride ions,4 indicating incomplete convergence (compare with the fully converged result of Hsi15). It is also clear from the analysis in Hed12 and Hsi15 that the simulations with the highest number och chloride ions (12) are closer to being fully converged.5 Let’s therefore use the result of those simulations to compare with experimental data.

Comparison with experiments

In an earlier blog post, we looked at the available experimental data on chloride equilibrium concentrations in Na-dominated bentonite. Adding the high concentration chloride equilibrium results from Hed12 and Hsi15 to this data (in terms of \(c^\mathrm{int}/c^\mathrm{ext}\)), gives the following picture6 (the 3WL system corresponds to pure montmorillonite of density ~1300 kg/m3, and the 2WL system corresponds to ~1600 kg/m3, as also verified experimentally).

The x-axis shows montmorillonite effective dry density, and applied external concentrations for each data series are color coded, but also listed in the legend. Note that this plot contains mainly all available information for drawing conclusions regarding anion exclusion in interlayers.7 To me, the conclusions that can be drawn are to a large extent opposite to those that have been drawn:

  • The amount chloride in the simulated 3WL system corresponds roughly to measured values. Consequently, MD simulations do not support models that completely exclude anions from interlayers.
  • The 3WL results instead suggest that interlayers contain the main contribution of chloride. Interlayers must consequently be handled no matter how many additional pore structures a model contains.
  • For systems corresponding to 2WL interlayers, there is a choice: Either,
    1. assume that the discrepancy between simulations and measurements indicates the existence of an additional pore structure, where the majority of chloride resides, or
    2. assume that presently available MD simulations of 2WL systems overestimate “anion” exclusion.8
  • If making choice no. 1. above, keep in mind that the additional pore structure cannot be 3WL interlayers (they are virtually non-existent at 1600 kg/m3), and that it should account for approximately 0% of the pore volume.

Tournassat et al. (2016)

Tournassat et al. (2016) (Tou16, in the following) present more MD simulations of interlayer pores in contact with an external compartment, with a fixed amount of excess ions, at three different interlayer distances: 2WL (external concentration ~0.5 M), 3WL (~0.4 M), and 5WL (~0.3 M).

In the 2WL simulations, no anions enter the interlayers. Tou16 do not reflect on the possibility that 2WL simulations may overestimate exclusion, as suggested by Hsi159, but instead use this result to argue that anions are basically completely excluded from 2WL interlayers. They even imply that the result of Rot07 is more adequate than that of Hsi15

In the case of the 2WL hydrate, no Cl ion entered the interlayer space during the course of the simulation, in agreement with the modeling results of Rotenberg et al. (2007b), but in disagreement with those of Hsiao and Hedström (2015).

But, as discussed, there is no real “disagreement” between the results of Hsi15 and Rot07. To refute the conclusions of Hsi15, Tou16 are required to demonstrate well converged results, and analyze what is supposedly wrong with the simulations of Hsi15. It is, furthermore, glaringly obvious that most of the anion equilibrium results in Tou16 are not converged.

Regarding convergence, the only “analysis” provided is the following passage

The simulations were carried out at the same temperature (350 K) as the simulations of Hsiao and Hedström (2015) and with similar simulation times (50 ns vs. 100-200 ns) and volumes (27 × 104 Å3 vs. 15 × 104 Å3), thus ensuring roughly equally reliable output statistics. The fact that Cl ions did not enter the interlayer space cannot, therefore, be attributed to a lack of convergence in the present simulation, as Hsiao and Hedström have postulated to explain the difference between their results and those of Rotenberg et al. (2007b).

I mean that this is not a suitable procedure in a scientific publication — the authors should of course demonstrate convergence of the simulations actually performed! (Especially after Hsi15 have provided methods for such an analysis.10)

Anyhow, Tou16 completely miss that Hsi15 demonstrate convergence in simulations with external concentration 1.67 M; for the system relevant here (0.55 M), Hsi15 explicitly write that the same level of convergence requires a 10-fold increase of the simulation time (because the interlayer concentration decreases approximately by a factor of 10, as predicted by — Donnan theory). Thus, the simulation time of Tou16 (53 ns) should be compared with 2000 ns, i.e. it is only a few percent of the time required for proper convergence.

Further confirmation that the simulations in Tou16 are not converged is given by the data for the systems where chloride has entered the interlayers. The ion concentration profiles for the 3WL simulation look like this

The extension of the interlayers is indicated by the dotted lines. Each interlayer was given slightly different (average) surface charge density, which is denoted in the figure. One of the conspicuous features of this plot is the huge difference in chloride content between different interlayers: the concentration in the mid-pore (0.035 M) is more than three times that in left pore (0.010 M). This clearly demonstrates that the simulation is not converged (cf. the converged chloride result of Hsi15). Note further that the larger amount of chloride is located in the interlayer with the highest surface charge, and the least amount is located in the interlayer with the smallest surface charge.11 I think it is a bit embarrassing for Clays and Clay Minerals to have used this plot for the cover page.

As the simulation times (53 ns vs. 40 ns), as well as the external concentrations (~0.5 M vs. ~0.4 M), are similar in the 2WL and and 3WL simulations, it follows from the fact that the 3WL system is not converged, that neither is the 2WL system. In fact, the 2WL system is much less converged, given the considerably lower expected interlayer concentration. This conclusion is fully in line with the above consideration of convergence times in Hsi15.

For chloride in the 3WL (and 5WL) system, Tou16 conclude that “reasonable quantitative agreement was found” between MD and traditional theory, without the slightest mentioning of what that implies.12 I find this even more troublesome than the lack of convergence. If the authors mean that MD simulations reveal the true nature of anion equilibrium (as they do when discussing 2WL), they here pull the rug out from under the entire mainstream bentonite view! With the 3WL system containing a main contribution, interlayers can of course not be modeled as anion-free, as we discussed above. Yet, not a word is said about this in Tou16.

In this blog post I have tried to show that available MD simulations do not, in any reasonable sense, support the assumption that anions are completely excluded from interlayers. Frankly, I see this way of referencing MD studies mainly as an “afterthought”, in attempts to justify the misuse of the exclusion-volume concept. In this light, I am not surprised that Hed12 and Hsi15 have not gained reasonable attention, while Tou16 nowadays can be found referenced to support claims that anions do not have access to “interlayers”.13

Footnotes

[1] I should definitely discuss the “Stern layer” in a future blog post. Update (250113): Stern layers are discussed here.

[2] The view of bentonite (“clay”) in Rotenberg et al. (2007) is strongly rooted in a “stack” concept. What I refer to as an “external compartment” in their simulation, they actually conceive of as a part of the bentonite structure, calling it a “micropore”.

[3] That Rotenberg et al. (2007) expresses this view of anion exclusion puzzles me somewhat, since several of the same authors published a study just a few years later where Donnan theory was explored in similar systems: Jardat et al. (2009).

[4] Since the number of chloride ions found in the interlayer is not correlated with how layer charge is distributed, we can conclude that the latter parameter is not important for the process.

[5] The small difference in the two simulations with 4 chloride ions is thus a coincidence.

[6] I am in the process of assessing the experimental data, and hope to be able to better sort out which of these data series are more relevant. So far I have only looked at — and discarded — the study by Muurinen et al. (1988). This study is therefore removed from the plot.

[7] There are of course several other results that indirectly demonstrate the presence of anions in interlayers. Anyway, I think that the bentonite research community, by now, should have managed to produce better concentration data than this (both simulated and measured).

[8] As the cation (sodium) may give a major contribution to the hydration energy barrier (this is not resolved in Hsiao and Hedström (2015)), it may be inappropriate to refer to this part as “anion” exclusion (remember that it is salt that is excluded from bentonite). It may be noted that sodium actually appear to have a hydration barrier in e.g. the Na/Cs exchange process, which has been explored both experimentally and in MD simulations.

[9] Tournassat et al. (2016) even refer to Hsiao and Hedström (2015) as presenting a “hypothesis” that “differences in solvation energy play an important role in inhibiting the entry of Cl in the interlayer space”, rather than addressing their expressed concern that the hydration energy cost may be overestimated.

[10] Ironically, Tournassat et al. (2016) choose to “rely” on the convergence analysis in Hsiao and Hedström (2015), while simultaneously implying that the study is inadequate.

[11] As the interlayers have different surface charge, they are not expected to have identical chloride content. But the chloride content should reasonably decrease with increasing surface charge, and the difference between interlayers should be relatively small.

[12] Here we have to disregard that the “agreement” is not quantitative. It is not even qualitative: the highest chloride content was recorded in the interlayer pore with highest charge, in both the 3WL and the 5WL system.

[13] There are even examples of Hedström and Karnland (2012) being cited to support complete exclusion!

The mechanism for “anion” exclusion

Repulsion between surfaces and anions is not really the point

Many publications dealing with “anion” exclusion in compacted bentonite describe the phenomenon as being primarily due to electrostatic repulsion of anions from the negatively charged clay surfaces. This explanation, which may seem plausible both at a first and a second glance, is actually not that satisfactory. There are two major issues to consider:

  • Although it is popular to use the word “anion” when referring to the phenomenon, it must be remembered that the anions are accompanied by cations, in order to maintain overall charge neutrality; it really is salt that is excluded from the bentonite. This observation shows that the above “explanation” is incomplete: it can be argued with the same logic that salt should accumulate, because the clay surfaces attract the cations of the external salt.
  • Salt exclusion occurs generally in Donnan systems, also in those that lack surfaces. Its principal explanation can consequently not involve the presence of surfaces. For a simpler system, e.g. potassium ferrocyanide, the “explanation” above translates to claiming that exclusion is caused by “anions” being electrostatically repelled by the ferrocyanide ions. In this case it may be easier to spot the shortcoming of such a claim, and to consider also the potassium ions (which attract anions), as well as the role played by the cations of the excluded salt.

What, then, is the primary cause for salt exclusion? Let us continue with using potassium ferrocyanide as an example of a simple Donnan system, and then translate our findings to the case of compacted bentonite.

Ferrocyanide

Consider a potassium ferrocyanide solution separated from a potassium chloride solution by a membrane permeable to all but the ferrocyanide ions. The ionic configuration near the membrane then looks something like this

KCl - Ferrocyanide interface and potential variation

Because potassium ions can pass the membrane, and because they have an entropic driving force to migrate out of the ferrocyanide solution, a (microscopic) region is formed in the external solution next to the membrane, with an excess amount of positive charge. Similarly, a region is formed next to the membrane in the ferrocyanide solution with an excess amount of negative charge. Thus, a region of charge separation exists across the membrane — similar to the depletion zone in a p-n junction — over which the electrostatic potential varies. The electric field (= a varying potential) at the interface acts as to pull back potassium ions towards the ferrocyanide solution. The equilibrium width of the space charge region is set when the diffusive flux is balanced by the flux due to the electric field.

With a qualitative understanding of the electrostatic potential configuration we can now give the most plain answer to what causes “anion” exclusion: it is because of the potential difference across the membrane. Chloride ions behave in the opposite way as compared to potassium, with an entropic driving force to enter the ferrocyanide solution, while being pulled back towards the external solution due to the electric field across the membrane.

Here the mindful reader may perhaps object and point out that the electric field restricting the chloride inflow reasonably originates from the ferrocyanide anions. It thus may seem that “anion” exclusion, after all, is caused by repulsion from other negative charges.

Indeed, electrostatic repulsion of anions requires the “push” of some other negatively charged entity. But note that the potential is constant in the interior of the ferrocyanide solution, and only varies near the membrane. The variation of the potential is caused by separation of charge: chloride is as much “pushed” out of the ferrocyanide solution by the ferrocyanide as it is “pulled” out of it, due to electrostatic attraction, by the excess potassium on the other side. Repulsion between charges of equal sign occurs also in the interior of the ferrocyanide solution (or in any ionic solution), but does not in itself lead to salt exclusion.

Bentonite

The above description can be directly transferred to the case of compacted bentonite. Replacing the potassium ferrocyanide with e.g. K-montmorillonite, salt exclusion occurs mainly because potassium can migrate out of the clay region, while montmorillonite particles cannot. Again, we have charge separation with a resulting varying electrostatic potential across the interface.

Admittedly, the general situation is more complicated in bentonite because of the extension of montmorillonite particles; viewed as “anions”, these are irregularly shaped macromolecules with hundreds or thousands of charge centers.

The ion configuration in a bentonite suspension therefore looks quite different from a corresponding ordinary solution, as the montmorillonite charge obviously is constrained to individual particles. Dilute systems thus have charge separation on the particle scale and show salt exclusion even without charge separation at the interface to the external solution. These types of systems (suspensions) have historically been the subject of most studies on “anion” exclusion, and are usually treated theoretically using the Gouy-Chapman model.

With increasing density, however, the effect of a varying potential between montmorillonite particles diminishes, while the effect of charge separation at the interface increases. For dense systems (> 1.2 g/cm3, say), we may therefore approximate the internal potential as constant and only consider the variation across the interface to the external solution using Donnan’s “classical” framework.1

Here is an illustration of the validity of this approximation:

Internal and external potential in compacted bentonite system

The figure shows the difference between the external (green) and the average internal (orange) potentials in a 1:1 system of density 1.3 g/cm3 and with external concentration 0.1 M, calculated using Donnan’s “classical” equation. Also plotted is the electrostatic potential across the interlayer (blue) as calculated using the Poisson-Boltzmann equation,2 in a similar system (interlayer distance 1 nm). It is clear that the variation of the Poisson-Boltzmann potential from the average is small in comparison with the Donnan potential.

Repulsion between chloride and montmorillonite particles of course occurs everywhere in compacted bentonite, whereas the phenomenon mainly responsible for salt exclusion occurs only near the interfaces. Merely stating electrostatic repulsion as the cause for salt exclusion in compacted bentonite does not suffice, just as in the case of ferrocyanide.

To illustrate that the salt exclusion effect depends critically on exchangeable cations being able to diffuse out of the bentonite, consider the following thought experiment.3 Compacted K-montmorillonite is contacted with a NaCl solution. But rather than having a conventional component separating the solution and the clay, we imagine a membrane that does not allow for the passage of neither potassium nor clay, but that allows for the passage of sodium and chloride. Since potassium is not allowed to diffuse out of the bentonite, no charge separation occurs across the membrane. With no space charge region, the electrostatic potential does not vary and NaCl is not excluded! (to the extent that the Donnan approximation is valid)

NaCl + K-montmorillonite with interface only permeable for Na and Cl

A charge neutral perspective

The explanation for “anion” exclusion that we have explored rests on the formation of a potential difference across the interface region between bentonite and external solution. But remember that it is salt — in our example KCl — that is excluded from the bentonite (or the ferrocyanide solution), and that the cation (K) gains energy by being transferred from the external to the internal solution. The electrical work for transferring a unit of KCl is thus zero (which makes sense since KCl is a charge neutral entity). In this light, it may seem unsatisfactory to offer the potential difference as the sole explanation for salt exclusion.

I therefore think that the following kinematic way of reasoning is very helpful. Instead of considering the mass transfer of Cl across the membrane in terms of oppositely directed “electric” and “diffusive” parts, we lump them together with equal amounts of K transfer, giving two equal but oppositely directed fluxes of KCl. Reasonably, the KCl flux into the ferrocyanide solution is proportional to the external ion concentrations

\begin{equation} j^\mathrm{in}_\mathrm{KCl} = A\cdot c_\mathrm{K}^\mathrm{ext}\cdot c_\mathrm{Cl}^\mathrm{ext} \end{equation}

while the outflux is proportional to the internal ion concentrations

\begin{equation} j^\mathrm{out}_\mathrm{KCl} = -A\cdot c_\mathrm{K}^\mathrm{int}\cdot c_\mathrm{Cl}^\mathrm{int} \end{equation}

\(A\) is a coefficient accounting for the transfer resistance across the interface region. Requiring the sum of these fluxes to be zero gives the following relation

\begin{equation} c_\mathrm{K}^\mathrm{ext}\cdot c_\mathrm{Cl}^\mathrm{ext} = c_\mathrm{K}^\mathrm{int}\cdot c_\mathrm{Cl}^\mathrm{int} \end{equation}

which is the (ideal) Donnan equation.

We can therefore interpret KCl exclusion as an effect of potassium in the clay providing a potential for “out-transfer”, as soon as the chance is given, i.e. when chloride enters from the external solution. From this perspective salt exclusion could maybe be said to be a form of cation “rejection”.

Footnotes

[1] Note also that the Gouy-Chapman model is not valid in the high density limit, although it is applied (or alluded to) in this limit in many publications. But e.g. Schofield (1947) states (about the Gouy-Chapman solution):

[T]he equation is applicable to cases in which the distance between opposing surfaces considerably exceeds the distance between neighboring point charges on the surfaces; for there will then be a range of electrolyte concentrations over which the radius of the ionic atmosphere is less than the former and greater than the latter.

This criterion is not met in compacted bentonite, where instead the interlayer distance is comparable to the distance between neighboring charge centers on the surfaces. Invalid application of the Gouy-Chapman model also seems to underlie the flawed but widespread “anion-accessible porosity” concept.

[2] This calculation uses the equations presented in Engström and Wennerström (1978), and assumes no excess ions and a surface charge density of 0.111 \(\mathrm{C/m^2}\). For real consistency this calculation should really be performed with the boundary condition of 0.1 M external concentration. However, since the purpose of the graph is just to demonstrate the sizes of the two potential variations, and since I have yet to acquire a reasonable tool for performing Poisson-Boltzmann calculations with non-zero external concentration, I disregard this inconsistency. Moreover, the continuum assumption of the Poisson-Boltzmann description is anyway beginning to lose its validity at these interlayer distances. Update (220831): Solutions to the Poisson-Boltzmann equation with non-zero external concentration are presented here.

[3] Perhaps this could be done as a Molecular Dynamics simulation?