Category Archives: Electric double layer

The mechanism for “anion” exclusion

Repulsion between surfaces and anions is not really the point

Many publications dealing with “anion” exclusion in compacted bentonite describe the phenomenon as being primarily due to electrostatic repulsion of anions from the negatively charged clay surfaces. This explanation, which may seem plausible both at a first and a second glance, is actually not that satisfactory. There are two major issues to consider:

Although it is popular to use the word “anion” when referring to the phenomenon, it must be remembered that the anions are accompanied by cations, in order to maintain overall charge neutrality; it really is salt that is excluded from the bentonite. This observation shows that the above “explanation” is incomplete: it can be argued with the same logic that salt should accumulate, because the clay surfaces attract the cations of the external salt.
Salt exclusion occurs generally in Donnan systems, also in those that lack surfaces. Its principal explanation can consequently not involve the presence of surfaces. For a simpler system, e.g. potassium ferrocyanide, the “explanation” above translates to claiming that exclusion is caused by “anions” being electrostatically repelled by the ferrocyanide ions. In this case it may be easier to spot the shortcoming of such a claim, and to consider also the potassium ions (which attract anions), as well as the role played by the cations of the excluded salt.

What, then, is the primary cause for salt exclusion? Let us continue with using potassium ferrocyanide as an example of a simple Donnan system, and then translate our findings to the case of compacted bentonite.

Ferrocyanide

Consider a potassium ferrocyanide solution separated from a potassium chloride solution by a membrane permeable to all but the ferrocyanide ions. The ionic configuration near the membrane then looks something like this

Because potassium ions can pass the membrane, and because they have an entropic driving force to migrate out of the ferrocyanide solution, a (microscopic) region is formed in the external solution next to the membrane, with an excess amount of positive charge. Similarly, a region is formed next to the membrane in the ferrocyanide solution with an excess amount of negative charge. Thus, a region of charge separation exists across the membrane — similar to the depletion zone in a p-n junction — over which the electrostatic potential varies. The electric field (= a varying potential) at the interface acts as to pull back potassium ions towards the ferrocyanide solution. The equilibrium width of the space charge region is set when the diffusive flux is balanced by the flux due to the electric field.

With a qualitative understanding of the electrostatic potential configuration we can now give the most plain answer to what causes “anion” exclusion: it is because of the potential difference across the membrane. Chloride ions behave in the opposite way as compared to potassium, with an entropic driving force to enter the ferrocyanide solution, while being pulled back towards the external solution due to the electric field across the membrane.

Here the mindful reader may perhaps object and point out that the electric field restricting the chloride inflow reasonably originates from the ferrocyanide anions. It thus may seem that “anion” exclusion, after all, is caused by repulsion from other negative charges.

Indeed, electrostatic repulsion of anions requires the “push” of some other negatively charged entity. But note that the potential is constant in the interior of the ferrocyanide solution, and only varies near the membrane. The variation of the potential is caused by separation of charge: chloride is as much “pushed” out of the ferrocyanide solution by the ferrocyanide as it is “pulled” out of it, due to electrostatic attraction, by the excess potassium on the other side. Repulsion between charges of equal sign occurs also in the interior of the ferrocyanide solution (or in any ionic solution), but does not in itself lead to salt exclusion.

Bentonite

The above description can be directly transferred to the case of compacted bentonite. Replacing the potassium ferrocyanide with e.g. K-montmorillonite, salt exclusion occurs mainly because potassium can migrate out of the clay region, while montmorillonite particles cannot. Again, we have charge separation with a resulting varying electrostatic potential across the interface.

Admittedly, the general situation is more complicated in bentonite because of the extension of montmorillonite particles; viewed as “anions”, these are irregularly shaped macromolecules with hundreds or thousands of charge centers.

The ion configuration in a bentonite suspension therefore looks quite different from a corresponding ordinary solution, as the montmorillonite charge obviously is constrained to individual particles. Dilute systems thus have charge separation on the particle scale and show salt exclusion even without charge separation at the interface to the external solution. These types of systems (suspensions) have historically been the subject of most studies on “anion” exclusion, and are usually treated theoretically using the Gouy-Chapman model.

With increasing density, however, the effect of a varying potential between montmorillonite particles diminishes, while the effect of charge separation at the interface increases. For dense systems (> 1.2 g/cm³, say), we may therefore approximate the internal potential as constant and only consider the variation across the interface to the external solution using Donnan’s “classical” framework.¹

Here is an illustration of the validity of this approximation:

Internal and external potential in compacted bentonite system

The figure shows the difference between the external (green) and the average internal (orange) potentials in a 1:1 system of density 1.3 g/cm³ and with external concentration 0.1 M, calculated using Donnan’s “classical” equation. Also plotted is the electrostatic potential across the interlayer (blue) as calculated using the Poisson-Boltzmann equation,² in a similar system (interlayer distance 1 nm). It is clear that the variation of the Poisson-Boltzmann potential from the average is small in comparison with the Donnan potential.

Repulsion between chloride and montmorillonite particles of course occurs everywhere in compacted bentonite, whereas the phenomenon mainly responsible for salt exclusion occurs only near the interfaces. Merely stating electrostatic repulsion as the cause for salt exclusion in compacted bentonite does not suffice, just as in the case of ferrocyanide.

To illustrate that the salt exclusion effect depends critically on exchangeable cations being able to diffuse out of the bentonite, consider the following thought experiment.³ Compacted K-montmorillonite is contacted with a NaCl solution. But rather than having a conventional component separating the solution and the clay, we imagine a membrane that does not allow for the passage of neither potassium nor clay, but that allows for the passage of sodium and chloride. Since potassium is not allowed to diffuse out of the bentonite, no charge separation occurs across the membrane. With no space charge region, the electrostatic potential does not vary and NaCl is not excluded! (to the extent that the Donnan approximation is valid)

NaCl + K-montmorillonite with interface only permeable for Na and Cl

A charge neutral perspective

The explanation for “anion” exclusion that we have explored rests on the formation of a potential difference across the interface region between bentonite and external solution. But remember that it is salt — in our example KCl — that is excluded from the bentonite (or the ferrocyanide solution), and that the cation (K) gains energy by being transferred from the external to the internal solution. The electrical work for transferring a unit of KCl is thus zero (which makes sense since KCl is a charge neutral entity). In this light, it may seem unsatisfactory to offer the potential difference as the sole explanation for salt exclusion.

I therefore think that the following kinematic way of reasoning is very helpful. Instead of considering the mass transfer of Cl across the membrane in terms of oppositely directed “electric” and “diffusive” parts, we lump them together with equal amounts of K transfer, giving two equal but oppositely directed fluxes of KCl. Reasonably, the KCl flux into the ferrocyanide solution is proportional to the external ion concentrations

\begin{equation} j^\mathrm{in}_\mathrm{KCl} = A\cdot c_\mathrm{K}^\mathrm{ext}\cdot c_\mathrm{Cl}^\mathrm{ext} \end{equation}

while the outflux is proportional to the internal ion concentrations

\begin{equation} j^\mathrm{out}_\mathrm{KCl} = -A\cdot c_\mathrm{K}^\mathrm{int}\cdot c_\mathrm{Cl}^\mathrm{int} \end{equation}

\(A\) is a coefficient accounting for the transfer resistance across the interface region. Requiring the sum of these fluxes to be zero gives the following relation

\begin{equation} c_\mathrm{K}^\mathrm{ext}\cdot c_\mathrm{Cl}^\mathrm{ext} = c_\mathrm{K}^\mathrm{int}\cdot c_\mathrm{Cl}^\mathrm{int} \end{equation}

which is the (ideal) Donnan equation.

We can therefore interpret KCl exclusion as an effect of potassium in the clay providing a potential for “out-transfer”, as soon as the chance is given, i.e. when chloride enters from the external solution. From this perspective salt exclusion could maybe be said to be a form of cation “rejection”.

Footnotes

[1] Note also that the Gouy-Chapman model is not valid in the high density limit, although it is applied (or alluded to) in this limit in many publications. But e.g. Schofield (1947) states (about the Gouy-Chapman solution):

[T]he equation is applicable to cases in which the distance between opposing surfaces considerably exceeds the distance between neighboring point charges on the surfaces; for there will then be a range of electrolyte concentrations over which the radius of the ionic atmosphere is less than the former and greater than the latter.

This criterion is not met in compacted bentonite, where instead the interlayer distance is comparable to the distance between neighboring charge centers on the surfaces. Invalid application of the Gouy-Chapman model also seems to underlie the flawed but widespread “anion-accessible porosity” concept.

[2] This calculation uses the equations presented in Engström and Wennerström (1978), and assumes no excess ions and a surface charge density of 0.111 \(\mathrm{C/m^2}\). For real consistency this calculation should really be performed with the boundary condition of 0.1 M external concentration. However, since the purpose of the graph is just to demonstrate the sizes of the two potential variations, and since I have yet to acquire a reasonable tool for performing Poisson-Boltzmann calculations with non-zero external concentration, I disregard this inconsistency. Moreover, the continuum assumption of the Poisson-Boltzmann description is anyway beginning to lose its validity at these interlayer distances. Update (220831): Solutions to the Poisson-Boltzmann equation with non-zero external concentration are presented here.

[3] Perhaps this could be done as a Molecular Dynamics simulation?

Stacks make no sense

Leave a reply

At the atomic level, montmorillonite is built up of so-called TOT-layers: covalently bonded sheets of aluminum (“O”) and silica (“T”) oxide (including the right amount of impurities/defects). In my mind, such TOT-layers make up the fundamental particles of a bentonite sample. Reasonably, since montmorillonite TOT-layers vary extensively in size, and since a single cubic centimeter of bentonite contains about ten million billions (\(10^{16}\)), they are generally configured in some crazily complicated manner.

Stack descriptions in the literature

But the idea that the single TOT-layer is the fundamental building block of bentonite is not shared with many of today’s bentonite researchers. Instead, you find descriptions like e.g. this one, from Bacle et al. (2016)

Clay mineral particles consist of stacks of parallel negatively-charged layers separated by interlayer nanopores. Consequently, compacted smectite contains two major classes of pores: interlayer nanopores (located inside the particles) and larger mesopores (located between the particles).

or this one, from Churakov et al. (2014)

In compacted rocks, montmorillonite (Mt) forms aggregates (particles) with 5–20 TOT layers (Segad et al., 2010). A typical radial size of these particles is of the order of 0.01 to 1 \(\mathrm{\mu m}\). The pore space between Mt particles is referred to as interparticle porosity. Depending on the degree of compaction, the interparticle porosity contributes 10 to 30% of the total water accessible pore space in Mt (Holmboe et al., 2012; Kozaki et al., 2001).

Such statements show that researchers have something more complex in mind than individual TOT-layers when speaking of “particles”: they are some sort of assemblages of TOT-layers. The quotation of Bacle et al. (2016), using both the terms “stacks” and “particles”, even hints at an idea of a hierarchy of fundamental structures. Such a hierarchy is expressed explicitly in e.g. Navarro et al. (2017), who provide a figure with the caption “Schematic particle arrangement in highly compacted Na-bentonite” that looks similar to this one:

“Schematic particle arrangement in highly compacted Na-bentonite” from Navarro et al. (2017)

Here it is clear that they differ between “aggregates” (which I’m not sure is the same thing as “particles”), “stacks”, and individual TOT-layers (which I assume are represented by the line-shaped objects). In the following, however, we will use the term “stack” to refer to any kind of suggested fundamental structure built up from individual TOT-layers.

The one-sentence version of this blog post is:

Stacks make no sense as fundamental building blocks in models of water saturated, compacted bentonite.

The easiest argument against stacks is, in my mind, to simply work out the geometrical consequences. But before doing that we will examine some of the references given to support statements about stacks in compacted systems. Often, no references are given at all, but when they are, they usually turn out to be largely irrelevant for the system under study, or even to support an opposite view.

Inadequate referencing

As an example (of many) of inadequate referencing, we use the statement above from Churakov et al. (2014) as a starting point. I think this is a “good” statement, in the sense that it makes rather precise claims about how compacted bentonite is supposed to be structured, and provides references for some key statements, which makes it easier to criticize.

Churakov et al. (2014) reference Segad et al. (2010) for the statement that montmorillonite forms “particles” with 5 – 20 TOT-layers. In turn, Segad et al. (2010) write:

Clay is normally not a homogeneous lamellar material. It might be better described as a disordered structure of stacks of platelets, sometimes called tactoids — a tactoid typically consists of 5-20 platelets.^19-21

Here the terminology is quite different from the previous quotations: TOT-layers are called “platelets”, and “particles” are called “tactoids”. Still, they use the phrase “stacks of platelets”, so I think we can continue with using “stack” as a sort of common term for what is being discussed.¹ We may also note that here is used the word “clay”, rather than “montmorillonite” (as does Bacle et al. (2016)), but it is clear from the context of the article that it really is montmorillonite/bentonite that is discussed.

Anyhow, Segad et al. (2010) do not give much direct information on the claim we investigate, but provide three new references. Two² of these — Banin (1967) and Shalkevich et al. (2007) — are actually studies on montmorillonite suspensions, i.e. as far away as you can get from compacted bentonite in terms of density; the solid mass fraction in these studies is in the range 1 – 4%.

The average distance between individual TOT-layers in this density limit is comparable with, or even larger than, their typical lateral extension (~100 nm). Therefore, much of the behavior of low density montmorillonite depends critically on details of the interaction between layer edges and various other components, and systems in this density limit behave very differently depending on e.g. ionic strength, cation population, preparation protocol, temperature, time, etc. This complex behavior is also connected with the fact that pure Ca-montmorillonite does not form a sol, while the presence of as little as 10 – 20% sodium makes the system sol forming. The behaviors and structures of montmorillonite suspensions, however, say very little about how the TOT-layers are organized in compacted bentonite.

We have thus propagated from a statement in Churakov et al. (2014), and a similar one in Segad et al. (2010), that montmorillonite in general, in “compacted rocks” forms aggregates of 5 – 20 TOT-layers, to studies which essentially concern different types of materials. Moreover, the actual value of “5 – 20 TOT layers” comes from Banin (1967), who writes

Evidence has accumulated showing that when montmorillonite is adsorbed with Ca, stable tactoids, containing 5 to 20 parallel plates, are formed (1). When the mineral is adsorbed with Na, the tactoids are not stable, and the single plates are separated from each other.

This source consequently claims that the single TOT-layers are the fundamental units, i.e. it provides an argument against any stack concept! (It basically states that pure Ca-montmorillonite does not form a sol.) In the same manner, even though Segad et al. (2010) make the above quoted statement in the beginning of the paper, they only conclude that “tactoids” are formed in pure Ca-montmorillonite.

The swelling and sedimentation behavior of Ca-montmorillonite is a very interesting question, that we do not have all the answers to yet. Still, it is basically irrelevant for making statements about the structure in compacted — sodium dominated³ — bentonite.

Churakov et al. (2014) also give two references for the statement that the “interparticle porosity” in montmorillonite is 10 – 30% of the total porosity: Holmboe et al. (2012) and Kozaki et al. (2001). This is a bizarre way of referencing, as these two studies draw incompatible conclusions, and since Holmboe et al. (2012) — which is the more adequately performed study — state that this type of porosity may be absent:

At dry density \(>1.4 \;\mathrm{g/cm^3}\) , the average interparticle porosity for the [natural Na-dominated bentonite and purified Na-montmorillonite] samples used in this study was found to be \(1.5\pm1.5\%\), i.e. \(\le 3\%\) and significantly lower than previously reported in the literature.

Holmboe et al. (2012) address directly the discrepancy with earlier studies, and suggest that these were not properly analyzed

The apparent discrepancy between the basal spacings reported by Kozaki et al. (1998, 2001) using Kunipia-F washed Na-montmorillonite, and by Muurinen et al. (2004), using a Na-montmorillonite originating from Wyoming Bentonite MX-80, and the corresponding average basal spacings of the [Na-montmorillonite originating from Wyoming bentonite MX-80] samples reported in this study may partly be due to water contents and partly to the fact that only apparent \(\mathrm{d_{001}}\) values using Bragg’s law, without any profile fitting, were reported in their studies.

If Kozaki et al. (2001) should be used to support a claim about “interparticle porosity”, it consequently has to be done in opposition to — not in conjunction with — Holmboe et al. (2012). It would then also be appropriate for authors to provide arguments for why they discard the conclusions of Holmboe et al. (2012).⁴

Stacks in compacted bentonite make no geometrical sense

The literature is full of fancy figures of bentonite structure involving stacks. A typical example is found in Wu et al. (2018), and looks similar to this:

stacks illustration from Wu et al. (2018)

This illustration is part of a figure with the caption “Schematic representation of the different porosities in bentonite and the potential diffusion paths.”⁵ The regular rectangles in this picture illustrate stacks that each seems to contain five TOT-layers (I assume this throughout). Conveniently, these groups of five layers have the same length within each stack, while the length varies somewhat between stacks. This is a quite common feature in figures like this, but it is also common that all stacks are given the same length.

Another feature this illustration has in common with others is that the particles are ordered: we are always shown edges of the TOT-layers. I guess this is partly because a picture of a bunch of stacks seen from “the top” would be less interesting, but it also emphasizes the problem of representing the third dimension: figures like these are in practice figures of straight lines oriented in 2D, and the viewer is implicitly required to imagine a 3D-version of this two-dimensional representation.

A “realistic” stack picture

But, even as a 2D-representation, these figures are not representative of what an actual configuration of stacks of TOT-layers looks like. Individual TOT-layers have a distinct thickness of about 1 nm, but varies widely in the other two dimensions. Ploehn and Liu (2006) analyzed the size distribution of Na-montmorillonite (“Cloisite Na+”) using atomic force microscopy, and found an average aspect ratio of 180 (square-root of basal area divided by thickness). A representative single “TOT-line” drawn to scale is consequently quite different from what is illustrated in in most stack-pictures, and look like this (click on the figure to see it in full size)

Representative TOT-layer drawn to scale with water films

In this figure, we have added “water layers” on each side of the TOT-layer (light red), with the water-to-solid volume ratio of 1⁶. Neatly stacking five such units shows that the rectangles in the Wu et al. (2018)-figure should be transformed like this

actual veiw of stacks in Wu et al. (2018)

But this is still not representative of what an assemblage of five randomly picked TOT-layers would look like, because the size distribution has a substantial variance. According to Ploehn and Liu (2006), the aspect ratio follows approximately a log-normal distribution. If we draw five values from this distribution for the length of five “TOT-lines”, and form assemblages, we end up with structures that look like this:⁷

These are the kind of units that should fill the bentonite illustrations. They are quite irregularly shaped and are certainly not identical (this would be even more pronounced when considering the third dimension, and if the stacks contain more layers).

It is easy to see that it is impossible to construct a dense structure with these building blocks, if they are allowed a random orientation. The resulting structure rather looks something like this

Such a structure evidently has very low density, and are reminiscent of the gel structures suggested in e.g. Shalkevich et al. (2007) (see fig. 7 in that paper). This makes some sense, since the idea of stacks of TOT-layers (“tactoids”) originated from studies of low density structures, as discussed above.

Note that the structure in pictures like that in Wu et al. (2018) has a substantial density only because it is constructed with stacks with an unrealistic shape. But even in these types of pictures is the density not very high: with some rudimentary image analysis we conclude that the density in the above picture is only around 800 kg/m³. Also the figure from Navarro et al. (2017) above gives a density below 1000 kg/m³, although there it is explicitly stated that it is a representation of “highly compacted bentonite”.

The only manner in which the “realistic” building blocks can be made to form a dense structure is to keep them in the same orientation. The resulting structures then look e.g. like this

Dense structrue of color coded realistic stacks

where we have color coded each stack, to remind ourselves that these units are supposed to be fundamental.

Just looking at this structure of a “stack of stacks” should make it clear how flawed the idea is of stacks as fundamental structural units in compacted bentonite (note also how unrepresentative the stack-pictures found in the literature are). One of many questions that immediately arises is e.g. why on earth the tiny gaps between stacks (indicated by arrows) should remain. This brings us to the next argument against stacks as fundamental units for compacted water saturated bentonite:

What is supposed to keep stacks together?

Compacted bentonite of interest e.g. for sealing in radioactive waste repositories exerts swelling pressure of several MPa when in contact with external water. This osmotic pressure is a consequence of the presence of the mobile exchangeable cations in montmorillonite. Each “realistic” unit that we have imagined above is thus required to be at a huge elevated pressure, and the individual TOT-layers have a strong driving force to separate. And, unless a mechanism is provided for why such a separation is impossible, this is of course what we expect to happen! As far as I am aware, such a separation inhibiting mechanism has never been suggested in any publication that promotes the stack concept in compacted bentonite. To get a feel for the absurdity of this issue, let’s take a new look at the figure from Navarro et al. (2017)

The pressure configuration in the structure suggested by Navarro et al. (2017)

Assuming that this system is in equilibrium with an external water reservoir at zero pressure (i.e. atmospheric absolute pressure), the pressure in the compartment labeled “intra-aggregate space” is also close to zero. On the other hand, in the “stacks” located just a few nm away, the pressure is certainly above 10 MPa in many places! A structure like this is obviously not in mechanical equilibrium! (To use the term “obvious” here feels like such an understatement.)

Implications

To sum up what we have discussed so far, the following picture emerges. The bentonite literature is packed with descriptions of compacted water saturated bentonite as built up of stacks as fundamental units. These descriptions are so commonplace that they often are not supported by references. But when they are, it seems that the entire notion is based on misconceptions. In particular, structures identified in low density systems (suspensions, gels) have been carried over, without reflection, to descriptions of compacted bentonite. Moreover, all figures illustrating the stack concept are based on inadequate representations of what an arbitrary assemblage of TOT-layers arranged in this way actually would look like. With a “realistic” representation it quickly becomes obvious that it makes little sense to base a fundamental unit in compacted systems on the stack concept.

My impression is that this flawed stack concept underlies the entire contemporary mainstream view of compacted bentonite, as e.g. expressed by Wu et al. (2018):

A widely accepted view is that the total porosity of bentonite consists of \(\epsilon_ {ip}\) and \(\epsilon_ {il}\) (Tachi and Yotsuji, 2014; Tournassat and Appelo, 2011; Van Loon et al., 2007). \(\epsilon_ {ip}\) is a porosity related to the space between the bentonite particles and/or between the other grains of minerals present in bentonite. It can further be subdivided into \(\epsilon_ {ddl}\) and \(\epsilon_ {free}\). The diffuse double layer, which forms in the transition zone from the mineral surface to the free water space, contains water, cations and a minor amount of anions. The charge at the negative outer surface of the montmorillonite is neutralized by an excess of cations. The free water space contains a charge-balanced aqueous solution of cations and anions. \(\epsilon_ {il}\) represents the space between TOT-layers in montmorillonite particles exhibiting negatively charged surfaces. Due to anion exclusion effect, anions are excluded from the interlayer space, but water and cations are present.

This view can be summarized as:

The fundamental building blocks are stacks of TOT-layers (“particles”, “aggregates”, “tactoids”, “grains”…)
Electric double layers are present only on external surfaces of the stacks.
Far away from external surfaces — in the “inter-particle” or “inter-aggregate” pores — the diffuse layers merge with a bulk water solution
Interlayer pores are defined as being internal to the stacks, and are postulated to be fundamentally different from the external diffuse layers; they play by a different set of rules.

I don’t understand how authors can get away with promoting this conceptual view without supplying reasonable arguments for all of its assumptions⁸ — and with such a complex structure, there are a lot of assumptions.

As already discussed, the geometrical implications of the suggested structure do not hold up to scrutiny. Likewise, there are many arguments against the presence of substantial amounts of bulk water in compacted bentonite, including the pressure consideration above. But let’s also take a look at what is stated about “interlayers” and how these are distinguished from electric double layers (I will use quotation marks in the following, and write “interlayers” when specifically referring to pores defined as internal to stacks).

“Interlayers”

“Interlayers” are often postulated to be completely devoid of anions. We discussed this assumption in more depth in a previous blog post, where we discovered that the only references supplied when making this postulate are based on the Poisson-Boltzmann equation. But this is inadequate, since the Poisson-Boltzmann equation does describe diffuse layers, and predicts anions everywhere.

By requiring anion-free “interlayers”, authors actually claim that the physico-chemistry of “interlayers” is somehow qualitatively different from that of “external surfaces”, although these compartments have the exact same constitution (charged TOT-layer surface + ions + water). But an explanation for why this should be the case is never provided, nor is any argument given for why diffuse layer concepts are not supposed to apply to “interlayers”.⁹ This issue becomes even more absurd given the strong empirical evidence for that anions actually do reside in interlayers.

The treatment of anions is not the only ad hoc description of “interlayers”. It also seems close to mandatory to describe them as having a maximum extension, and as having an extension independently parameterized by sample density. E.g. the influential models for Na-bentonite of Bourg et al. (2006) and Tournassat and Appelo (2011) both rely on the idea that “interlayers” swell out to a certain volume that is smaller than the total pore volume, but that still depends on density.

In e.g. Bourg et al. (2006), the fraction of “interlayer” pores remains essentially constant at ~78%, as density decreases from 1.57 g/cm³ to 1.27 g/cm³, while the “interlayers” transform from having 2 monolayers of water (2WL) to having 3 monolayers (3WL). This is a very strange behavior: “interlayers” are acknowledged as having a swelling potential (2WL expands to 3WL), but do, for some reason, not affect 22% of the pore volume! Although such a behavior strongly deviates from what we expect if “interlayers” are treated with conventional diffuse layer concepts, no mechanism is provided.

In contrast, it should be noted that the established explanation for “tactoid” formation in pure Ca-montmorillonite involves no ad hoc assumptions of this sort, but rests on treating all ions as part of diffuse layers.

Another type of macabre consequence of defining “interlayer” pores as internal to stacks is that a completely homogeneous system is described has having no interlayer pores (because it has no stacks). E.g. Tournassat and Appelo (2011) write (\(n_c\) is the number of TOT-layers in a stack)

[…] the number of stacks in the \(c\)-direction has considerable influence on the interlayer porosity, with interlayer porosity increasing with \(n_c\) and reaching the maximum when \(n_c \approx 25\). The interlayer porosity halves with \(n_c\) when \(n_c\) is smaller than 3, and becomes zero for \(n_c = 1\).¹⁰

It is not acceptable that using the term interlayer should require accepting stacks as fundamental units. But the usage of the term as being internal to stacks is so widespread in the contemporary bentonite literature, that I fear it is difficult to even communicate this issue. Nevertheless, I am certain that e.g. Norrish (1954) does not depend on the existence of stacks when using the term like this:

Fig. 7 shows the relationship between interlayer spacing and water content for Na-montmorillonite. There is good agreement between the experimental points and the theoretical line, showing that interlayer swelling accounts for all, or almost all, of physical swelling.

The stack view obstructs real discovery

A severe consequence of the conceptual view just discussed is that “stacking number” — the (average) number of TOT-layers that stacks are supposed to contain — has been established as fitting parameter in models that are clearly over-parameterized. An example of this is Tournassat and Appelo (2011), who write¹¹

Our predictive model excludes anions from the interlayer space and relates the interlayer porosity to the ionic strength and the montmorillonite bulk dry density. This presentation offers a good fit for measured anion accessible porosities in bentonites over a wide range of conditions and is also in agreement with microscopic observations.

But since anions do reside in interlayers,¹² it would be better if the model didn’t fit: an over-parameterized or conceptually flawed model that fits data provides very little useful information.

A similar more recent example is Wu et al. (2018). In this work, a model based on the stack concept is successfully fitted both to data on \(\mathrm{ReO_4^-}\) diffusion in “GMZ” bentonite and to data on \(\mathrm{Cl^-}\) diffusion in “KWK” bentonite, by varying “stacking number” (among other parameters). Again, as the model assumes anion-free “interlayer” pores, a better outcome would be if it was not able to fit the data. Moreover, this paper focuses mainly on the ability of the model, while not at all emphasizing the fact that about ten (!) times more \(\mathrm{ReO_4^-}\) was measured in “GMZ” as compared with \(\mathrm{Cl^-}\) in “KWK”, at similar conditions in certain cases. The latter observation is quite puzzling and is, in my opinion, certainly worth deeper investigation (and I am fully convinced that it is not explained by differences in “stacking number”).

Footnotes

[1] The terminology in the bentonite literature is really all over the place. You may e.g. also find the term “tactoid” used as Navarro et al. (2017) use “aggregate”, or the term “platelet” used for a stack of TOT-layers…

[2] The third reference is an entire book on clays.

[3] Note that “sodium dominated” in this context means ~20% or more.

[4] It may be noticed that Kozaki et al. (2001) see no X-ray diffraction peaks for low density samples:

The basal spacing of water-saturated montmorillonite was determined by the XRD method. […] It was found that a basal spacing of 1.88 nm, corresponding to the three-water layer hydrate state […] was not observed before the dry density reached 1.0 Mg/m³.

My interpretation of this observation is that the diffraction peak has shifted to even lower reflection angles (in agreement with the observations of Holmboe et al. (2012)), not registered by the equipment. The alternative interpretation must otherwise be that “stacks” suddenly cease to exist below 1.0 g/cm³. (Yet, Kozaki et al. (2001) continues to use a certain d-value in their analysis, also for densities below 1.0 g/cm³.)

[5] I have discussed “diffusion paths” in an earlier blog post. This illustration certainly fits that discussion.

[6] A water-to-solid volume ratio of 1 corresponds basically to interlayers of three monolayers of water (3WL).

[7] To construct these units, I made the additional choice of placing each layer randomly in the horizontal direction, with the constraint that all layers should be confined within the range of the longest one in each unit.

[8] By “get away with” I mean “pass peer-review”, and by “don’t understand” I mean “understand”.

[9] This is reminiscent of how certain authors imply that the interlayer is non-diffusive under so-called crystalline swelling.

[10] A mathematical remark: if the interlayer porosity “halves with \(n_c\)” (what does that mean?) when \(n_c = 2\) (“smaller than 3”), it is impossible to simultaneously have zero interlayer porosity for \(n_c = 1\) (unless the interlayer porosity is zero for any \(n_c\)).

[11] I guess the word “presentation” here really should be “representation”?

[12] Note that one of the authors of this paper also claims in a later paper that anions do populate 3-waterlayer interlayers, in accordance with the Poisson-Boltzmann equation:

The agreement between PB calculations and MD simulation predictions was somewhat worse in the case of the \(\mathrm{Cl^-}\) concentration profiles than in the case of the \(\mathrm{Na^+}\) profiles (Figure 3), perhaps reflecting the poorer statistics for interlayer Cl concentrations […] Nevertheless, reasonable quantitative agreement was found (Table 2).

Sorption part IV: What is K_d?

Leave a reply

Measuring K_d

Researchers traditionally measure sorption on montmorillonite in batch tests, where a small amount of solids is mixed with a tracer-spiked solution (typical solid-to-liquid ratios are \(\sim 1 – 10\) g/l). After equilibration, solids and solution are usually separated by centrifugation and the supernatant is analyzed.

This procedure evidently counts tracer cations that reside in diffuse layers as sorbed. But tracer ions may also sorb due to other mechanisms, in particular due to bonding on specific surface hydroxyl groups, on the edges of individual montmorillonite layers. These different types of “sorption” are in the clay literature usually referred to as “cation exchange” and “surface complexation”, respectively.

The amount of tracer “sorbed” in the ways just described is quantified by the distribution coefficient \(K_d\), defined as

\begin{equation} s = K_d\cdot c_\mathrm{eq} \end{equation}

where \(s\) denotes the amount of tracers “on the solids”, and \(c_\mathrm{eq}\) is the corresponding equilibrium concentration in the aqueous phase. As the amount “on the solids” can be inferred from the amount of tracers that has been removed from the initial solution, we can evaluate \(K_d\) from

\begin{equation} K_d = \frac{\left ( c_\mathrm{init} – c_\mathrm{final} \right ) \cdot V_\mathrm{sol}} {c_\mathrm{final}\cdot m_\mathrm{s}} \end{equation}

where \(c_\mathrm{init}\) is the initial tracer concentration (i.e. before adding the clay), \(c_\mathrm{final}\) is the tracer concentration in the supernatant, \(V_\mathrm{sol}\) is the solution volume, and \(m_s\) is the mass of the solids.

If the purpose of a study is solely to quantify the amount of tracer “on the solids”, it is adequate to define sorption as including both “cation exchange” and “surface complexation”, and to use \(K_d\) as the measure of this sorption. However, if our main concern is to describe transport in compacted bentonite, \(K_d\) is a rather blunt tool, since it quantifies both ions that dominate the transport capacity (“cation exchange”), and ions that are immobile, or at least contribute to an actual delay of diffusive fluxes (“surface complexation”).

A good illustration of this problem is the traditional diffusion-sorption model, which incorrectly assumes that all ions quantified by \(K_d\) are immobilized. In earlier blog posts, we have discussed the consequences of this model assumption, and the empirical evidence against it. A complication when discussing sorption is that researchers often “measure” \(K_d\) by fitting the traditional diffusion-sorption model to data — although the model is not valid for compacted bentonite.

Moreover, when evaluating \(K_d\) in batch tests, or when using this parameter in models, authors assume that the solids are in equilibrium locally with a bulk water phase. But there is no compelling evidence that such a phase exists in compacted water saturated bentonite. On the contrary, several observations strongly suggest that compacted bentonite lacks significant amounts of bulk water. This, in turn, suggests that \(K_d\) actually quantifies the equilibrium between a bentonite sample and an external solution.

Indeed, even in batch tests is the final concentration measured in a solution (the supernatant) separated from the clay (the sediment), as a consequence of the centrifugation, as illustrated here:

This figure also illuminates additional and perhaps more subtle complications when evaluating \(K_d\) from batch tests. Firstly, such values are implicitly assumed independent of “sample” density. There are, however, arguments for that \(K_d\) in general depends on density, as will be explored below. The question is then to what density range we can apply batch test values when modeling compacted systems, or if they can be applied at all. Note that the “sample” that is measured on in a batch test (see figure) has a more or less well-defined density. But sediment densities are, to my knowledge, never investigated in these types of studies.¹

Secondly, it could be questioned if the supernatant have had time to equilibrate with the sediment, i.e whether \(c_\mathrm{final} = c_\mathrm{eq}\). Instead, as far as I know, researchers routinely assume that the equilibrium established prior to centrifugation remains.

In the following, we use the homogeneous mixture model to analyze in more detail the nature of \(K_d\) in compacted bentonite.

K_d in the homogeneous mixture model

As usual when analyzing bentonite with the homogeneous mixture model, we assume an external solution in contact with a homogeneous bentonite domain at a specific density (water-to-solid mass ratio \(w\)). The bentonite and the external solution are separated via a semi-permeable component, which allows for the passage of water and ions, but does not allow for the passage of clay (symbols are explained below):

This model resembles the alternative test set-up for determining \(K_d\) in compacted systems used by Van Loon and Glaus (2008), where the clay is contained in a sample holder, and the tracer is supplied through a filter from an external circulating solution. This approach has the advantages that the state of the clay is controlled throughout the test (which, e.g., allows for investigating how \(K_d\) depends on density), and that the equilibration process is better controlled (avoiding the possible disruptive procedure of centrifugation). The obvious disadvantage is that equilibration — being diffusion controlled — may take a long time.

When applying the homogeneous mixture model in earlier blog posts, we have assumed “simple” ions, which contribute to the ion population of the clay only in terms of the interlayer concentration, \(c^\mathrm{int}\). This concentration quantifies the amount of mobile ions involved in establishing Donnan equilibrium between clay and external solutions. But many “non-simple” ions actually do seem to be immobilized/delayed by also associate with surfaces (\(\mathrm{H}^+\), \(\mathrm{Ni}^{2+}\), \(\mathrm{Zn}^{2+}\), \(\mathrm{Co}^{2+}\), \(\mathrm{P_2O_7^{4-}}, …\)). For a more general description, we therefore extend the homogeneous mixture model with a second contribution to the ion population: \(s^\mathrm{int}\) (ions per unit mass).

Using the traditional terminology, the ions quantified by \(c^\mathrm{int}\) are to be identified as “sorbed by ion exchange”, and those quantified by \(s^\mathrm{int}\) as “sorbed by surface complexation”. But since the ion exchange process does not immobilize ions and primarily should be associated with Donnan equilibrium, we want to avoid referring to them as “sorbed”. Also, with the traditional terminology, all ions in the homogeneous mixture model are described as “sorbed”, which obviously is not very useful.

We therefore introduce different terms, and refer to the ions quantified by \(c^\mathrm{int}\) as aqueous interlayer species, and to the ions quantified by \(s^\mathrm{int}\) as truly sorbed ions. With this terminology, the term “sorption” puts emphasis on ions being immobile.² Moreover, the description now also applies to anions, without having to refer to them as e.g. “sorbed by ion exchange”.

In analogy with the traditional diffusion-sorption model, we assume a linear relation between \(s^\mathrm{int}\) and \(c^\mathrm{int}\)

\begin{equation} s^\mathrm{int} = \Lambda\cdot c^\mathrm{int} \tag{1} \end{equation}

where \(\Lambda\) is a distribution coefficient quantifying the relation between the amount of aqueous species in the interlayer domain and amount of truly sorbed substance.³

The amount of an aqueous species in the homogeneous mixture model is \(V_p\cdot c^\mathrm{int}\), where \(V_p\) is the total pore volume. The total amount of an ion per unit mass is thus \(V_p\cdot c^\mathrm{int}/m_s + s^\mathrm{int}\), where \(m_s\), as before, denotes total solid mass.

To get an expression for \(K_d\) in the homogeneous mixture model, we must associate ions “on the solids” (\(s\)) with the concentration in the external solution. Here we choose the simplest way to do this, and write

\begin{equation} s = \frac{V_p\cdot c^\mathrm{int}}{m_s} + s^\mathrm{int} = K_d\cdot c^\mathrm{ext} \tag{2} \end{equation}

which implies that we define all ions in the bentonite sample to be “on the solids”. To be fully consistent, we should perhaps subtract the contribution expected to be found in the clay if it behaved like a conventional porous system (\(V_p\cdot c^\mathrm{ext}/m_s\)). But, since we are mostly interested in the limit of small \(V_p/m_s\), this contribution can be thought of as becoming arbitrary small, and we therefore don’t bother with including it in the formulas. In any case, this “conventional porewater” contribution would simply give an extra term \(-w/\rho_w\) in the equations we are about to derive, and can be included if desired.

Using eqs. 1 and 2, we get the expression for \(K_d\) in the homogeneous mixture model

\begin{equation} K_d = \frac{w\cdot\Xi }{\rho_w} + \Lambda\cdot \Xi \tag{3} \end{equation}

where we also have used the definition of the ion equilibrium coefficient \(\Xi = c^\mathrm{int}/c^\mathrm{ext}\), and utilized that \(V_p/m_s = w/\rho_w\), where \(\rho_w\) is the density of water.⁴

A full analysis of eq. 3 is a major task, but a few things are immediately clear:

\(K_d\) generally has two contributions: one from Donnan equilibrium (\(w\cdot\Xi/\rho_w\)) and one from true sorption (\(\Lambda\cdot \Xi\)). Using the traditional terminology, these contributions correspond for cations to “sorption by ion exchange” and “sorption by surface complexation”, respectively. But note that eq. 3 is valid also for anions.
For a simple cation (\(\Lambda = 0\)), \(K_d\) merely quantifies the aqueous interlayer concentration.⁵ As we have discussed earlier, \(K_d\) quantifies in this case a type of enhancement of the transport capacity. I think it is unfortunate that a mechanism that dominates the mass transfer capacity traditionally is labeled “sorption”.
For cations with \(\Lambda \neq 0\), \(K_d\) is not a measure of true sorption, because we always expect a significant Donnan contribution. In this case \(K_d\) quantifies a mixture of transport enhancing and transport inhibiting mechanisms. Clearly, it is unsatisfactory to use the term “sorption” for mechanisms that both enhance and reduce transport capacity (at least when the objective is a transport description).
For simple anions, the above expression gives a positive value for \(K_d\). Traditionally, the \(K_d\) concept has not been applied to these types of ions, and e.g. chloride is often described as “non-sorbing”, with \(K_d =0\). Since \(\Xi \rightarrow 0\) as \(w \rightarrow 0\) generally for anions, this result (\(K_d = 0\)) is recovered in this limit.⁶

K_d for simple cations

We end this post by examining expressions for \(K_d\) for simple cations in some specific cases. In the following we consequently assume \(\Lambda = 0\), and this section relies heavily on the ion equilibrium framework in the homogeneous mixture model, with the main relation

\begin{equation} \Xi \equiv \frac{c^\mathrm{int}}{c^\mathrm{ext}} = \Gamma f_D^{-z} \tag{4} \end{equation}

where \(z\) is the charge number of the ion, \(\Gamma \equiv \gamma^\mathrm{ext}/\gamma^\mathrm{int}\) is an activity coefficient ratio, and \(f_D = e^\frac{F\psi^\star}{RT}\) is the so-called Donnan factor, with \(\psi^\star\) (\(<0\)) being the Donnan potential.

Simple cation tracers in a 1:1 system

We assume a bentonite sample at water-to-solid mass ratio \(w\) in equilibrium with an external 1:1 solution (e.g. NaCl) of concentration \(c^\mathrm{bgr}\). The Donnan factor is in this case, in the limit \(c^\mathrm{bgr} \ll c_\mathrm{IL}\)⁷

\begin{equation} f_D = \Gamma_+\frac{c^\mathrm{bgr}}{c_\mathrm{IL}} \end{equation}

where \(\Gamma_+\) is the activity coefficient ratio for the cation of the 1:1 electrolyte, and, as usual

\begin{equation} c_\mathrm{IL} = \frac{CEC\cdot \rho_w}{w\cdot F} \end{equation}

where \(CEC\) is the cation exchange capacity, and \(F\) is the Faraday constant (1 eq/mol). We furthermore assume the presence of a mono-valent cation tracer, which, by definition, does not influence \(f_D\). The ion equilibrium coefficient for this tracer is (from eq. 4)

\begin{equation} \Xi = \Gamma\cdot \Omega_{1:1}\cdot \frac{\rho_w}{w} \end{equation}

where \(\Gamma\) is the activity coefficient ratio for the tracer, and we have defined

\begin{equation} \Omega_{1:1} \equiv \frac{CEC}{F\cdot c^\mathrm{bgr}\cdot\Gamma_+} \end{equation}

\(K_d\) for a simple mono-valent tracer in a 1:1 electrolyte is thus (using eq. 3 with \(\Lambda = 0\))

\begin{equation} K_{d} = \Gamma \cdot \Omega_{1:1} \tag{5} \end{equation} \begin{equation} \text{ (mono-valent simple tracer in 1:1 system)} \end{equation}

For a divalent tracer we instead have

\begin{equation} \Xi = \Gamma \cdot \Omega_{1:1}^2 \cdot \left (\frac{\rho_w}{w} \right )^2 \end{equation}

giving

\begin{equation} K_d = \Gamma \cdot \Omega_{1:1}^2 \cdot \frac{\rho_w} {w} \tag{6} \end{equation} \begin{equation}\text{(di-valent simple tracer in 1:1 system)} \end{equation}

Eqs. 5 and 6 are essentially identical⁸ with the expression for \(K_d\) in a 1:1 system, derived in Glaus et al. (2007), which we used in the analysis of filter influence in cation through-diffusion.

Simple cation tracers in a 2:1 system

In a 2:1 system (e.g \(\mathrm{CaCl_2}\)), the Donnan factor is, in the limit \(c^\mathrm{bgr} \ll c_\mathrm{IL}\)

\begin{equation} f_D = \sqrt{2 \Gamma_{++}\frac{c^\mathrm{bgr}}{c_\mathrm{IL}}} \end{equation}

where index “++” refers to the cation of the 2:1 background electrolyte. Thus, for a mono-valent tracer

\begin{equation} \Xi = \Gamma\cdot \sqrt{\Omega_{2:1}} \cdot \sqrt{\frac{\rho_w}{w}} \end{equation}

where

\begin{equation} \Omega_{2:1} \equiv \frac{CEC}{2F\cdot c^\mathrm{bgr}\cdot\Gamma_{++}} \end{equation}

\(K_d\) for a mono-valent simple tracer in a 2:1 electrolyte is consequently

\begin{equation} K_{d} = \Gamma \cdot \sqrt{\Omega_{2:1}}\cdot\sqrt{\frac{w}{\rho_w}} \tag{7} \end{equation} \begin{equation} \text{(simple mono-valent tracer in 2:1 system)} \end{equation}

For a divalent tracer we instead have

\begin{equation} \Xi = \Gamma \cdot \Omega_{2:1} \cdot \frac{\rho_w}{w} \end{equation}

giving

\begin{equation} K_d = \Gamma \cdot \Omega_{2:1} \tag{8} \end{equation} \begin{equation} \text{(simple di-valent tracer in 2:1 system)} \end{equation}

Density dependence of K_d

Note that \(K_d\) for a mono-valent ion in a 1:1 system does not explicitly depend on density (eq. 5), while \(K_d\) for a di-valent ion diverges as \(w\rightarrow 0\) (eq. 6). In contrast, \(K_d\) in a 2:1 system has no explicit density dependence for di-valent tracers (eq. 8), while \(K_d\) vanishes for a mono-valent tracer in the limit \(w \rightarrow 0\) (eq. 7).

These results imply that we expect \(K_d\) to generally depend on sample density in systems where the charge number of the tracer ions differs from that of the cation of the background electrolyte. It may therefore not be appropriate to use values of \(K_d\) evaluated in batch-type tests for analyzing compacted systems.

Note also that \(K_d\) may have significant density dependence also in cases where the present analysis gives no explicit \(w\)-dependence on \(K_d\). This was demonstrated e.g. by Van Loon and Glaus (2008) for cesium tracers in sodium dominated bentonite. Interpreted in terms of the homogeneous mixture model, their results show that the interlayer activity coefficients vary significantly with density. In particular, the results imply either that the interlayer activity coefficient for cesium becomes small (\(\Gamma_\mathrm{Cs} \gg 1\)), or that the interlayer activity coefficient for sodium becomes large (\(\Gamma_\mathrm{Na} \ll 1\)), in the high density limit.

Footnotes

[1] A sediment density is, reasonably, related to e.g. initial solid-to-water ratio and to the details of the centrifugation procedure.

[2] I am not very happy with this terminology, but we need a way to distinguish this type of sorption from how the term “sorption” is used in the bentonite literature, where it nowadays essentially refers to the process of taking up an ion from a bulk water phase to some other phase. This is the reason for why there are so many quotation marks around the word “sorption” in the text.

[3] I don’t know if this is a valid assumption, but it seems like the natural starting point.

[4] The presence of water density in the formulas reflects the fact that we are using molar units (substance per unit volume), which is natural, as \(K_d\) typically has units of volume per mass. How to associate a density to water in the homogeneous mixture model is a bit subtle, and we don’t focus on that aspect here (it may be the issue of future posts). In the presented formulas \(\rho_w\) can rather be viewed as a unit conversion factor.

[5] When \(\Lambda = 0\), we can rearrange eq. 3 as

\begin{equation} \Xi = \frac{K_d\cdot \rho_w}{w} = \frac{K_d\cdot \rho_d}{\phi} \equiv \kappa \end{equation}

where \(\rho_d\) is dry density, \(\phi\) is porosity, and \(\kappa\) was defined as a scaled, dimensionless version of \(K_d\) by Gimmi and Kosakowski (2011), discussed in a previous blog post. Interpreted using the homogeneous mixture model, \(\kappa\) is thus simply the ion equilibrium coefficient for simple cations.

[6] By including the “conventional porewater” contribution in the definition of \(K_d\), as discussed earlier, we get for these types of anions

\begin{equation} K^\prime_d = \frac{w\cdot \Xi}{\rho_w} – \frac{w}{\rho_w} = \frac{w}{\rho_w} \left ( \Xi – 1 \right) \end{equation}

This is typically a negative quantity, and quantifies anion exclusion, in the Schofield sense of the term. We have, also with this definition, that \(K^\prime_d \rightarrow 0\) as \(w \rightarrow 0\).

[7] We assume \(c^\mathrm{bgr} \ll c_\mathrm{IL}\) in this and all following cases. For compacted bentonite \(c_\mathrm{IL}\) is of the order of several molar, and the derived approximations are thus valid for “typical” background concentrations (\(< 1\) M). Also, for an arbitrary value of \(c^\mathrm{bgr}\), one can in principle always choose a sufficiently low value of \(w\) to satisfy \(c^\mathrm{bgr} \ll c_\mathrm{IL}\).

[8] If the selectivity coefficient is identified with that derived in Birgersson (2017).

Sorption part III: Donnan equilibrium in compacted bentonite

Leave a reply

Consider this basic experiment: contact a water saturated sample of compacted pure Na-montmorillonite, with dry mass 10 g and cation exchange capacity 1 meq/g, with an external solution of 100 ml 0.1 M KCl. Although such an experiment has never been reported¹, I’m convinced that all agree that the outcome would be similar to what is illustrated in this animation.

Potassium diffuses in, and sodium diffuses out of the sample until equilibrium is established. At equilibrium also a minor amount of chloride is found in the sample. The indicated concentration levels are chosen to correspond roughly to results from from similar type of experiments.²

Although results like these are quite unambiguous, the way they are described and modeled in the bentonite³ literature is, in my opinion, quite a mess. You may find one or several of the following terms used to describe the processes

Cation exchange
Sorption/Desorptioṇ
Anion exclusion
Accessible porosity
Surface complexation
Donnan equilibrium
Donnan exclusion
Donnan porosity/volume
Stern layer
Electric double layer
Diffuse double layer
Triple layer
Poisson-Boltzmann
Gouy-Chapman
Ion equilibrium
…

In this blog post I argue for that the primary mechanism at play is Donnan equilibrium, and that most of the above terms can be interpreted in terms of this type of equilibrium, while some of the others do not apply.

Donnan equilibrium: effect vs. model

In the bentonite literature, the term “Donnan” is quite heavily associated with the modeling of anion equilibrium; e.g. the term “Donnan exclusion” is quite common , and you may find statements that researchers use “Donnan porespace models” as models for “anion exclusion”, or a “Donnan approach” to model “anion porosity”.⁴ Sometimes the term “Donnan effect” is used synonymously with “Salt exclusion”. Also when authors acknowledge cations as being part of “Donnan” equilibrium, the term is still used mainly to label a model or an “approach”.

But I would like to push for that “Donnan equilibrium” primarily should be the name of an observable effect, and that it applies equally to both anions and cations. This effect — which was hypothesized by Gibbs already in the 1870s — relies basically only on two things:

An electrolytic system, i.e. the presence of charged aqueous species (ions).
The presence of a semi-permeable component that is permeable to some of the charges, but does not allow for the passage of at least one type of charge.

In equilibrated systems fulfilling these requirements it is — to use Donnan’s own words — “thermodynamically necessary” that the permeant ions distribute unequally across the semi-permeable component. This phenomenon — unequal ion distributions on the different sides of the semi-permeable component — should, in my opinion, be the central meaning of the term “Donnan equilibrium”.

The first publication of Donnan on the effect actually concerned osmotic pressure response, in systems of Congo Red separated from solutions of sodium chloride and sodium hydroxide. The same year (1911) he also published the ionic equilibrium equations for some specific systems.⁵ In particular he considered the equilibrium of NaCl initially separated from NaR, where R is an impermeant anion (e.g. that of Congo Red), leading to the famous relation (“int” denotes the solution containing R)

\begin{equation} c_\mathrm{Na^+}^\mathrm{ext}\cdot c_\mathrm{Cl^-}^\mathrm{ext} = c_\mathrm{Na^+}^\mathrm{int}\cdot c_\mathrm{Cl^-}^\mathrm{int} \tag{1} \end{equation}

Unfortunately, this relation alone (or relations derived from it) is often what the term “Donnan” is associated with in today’s clay research literature, with the implication that systems not obeying it are not Donnan systems. But the above relation assumes ideal conditions and complete ionization of the salts — issues Donnan persistently seems to have grappled with. In a review on the effect he writes

The exact equations can, however, be stated only in terms of the chemical potentials of Willard Gibbs, or of the ion activities or ionic activity-coefficients of G. N. Lewis. Indeed an accurate experimental study of the equilibria produced by ionically semi-permeable membranes may prove to be of value in the investigation of ionic activity coefficients.

It must therefore be understood that, if in the following pages ionic concentrations and not ionic activities are used, this is done in order to present a simple, though only approximate, statement of the fundamental relationships.

The issue of (the degree of) ionization was explicitly addressed in publications following the 1911 article; Donnan & Allmand (1914) motivated their investigations of the \(\mathrm{KCl/K_4Fe(CN)_6}\) system by that “it was deemed advisable to test the relation when using a better defined, non-dialysable anion than that of Congo-red”, and the study of the Na/K equilibrium in Donnan & Garner (1919) used ferrocyanide solutions on both sides of the membrane in an attempt to overcome the difficulty of the “uncertainty as to the manner of ionisation of potassium ferrocyanide” (and thus for the simplified equations to apply).

I mean that since non-ideality and ion association are general issues when treating salt solutions, it does not make much sense to use the term “Donnan equilibrium” only when some particular equation applies; as long as the mechanism for the observed behavior is that some charges diffuse through a semi-permeable component, while some others don’t, the effect should be termed Donnan equilibrium.

Donnan equilibrium in gels, soils and clays

After Donnan’s original publications in 1911, the effect was soon recognized in colloidal systems. Procter & Wilson (1916) used Donnan’s equations to analyze the swelling of gelatin jelly immersed in hydrochloric acid. In this case chloride is the charge compensating ion, allowed to move between the phases, while the immobile charge is positive charges on the gelatin network. Thus, no semi-permeable membrane is necessary for the effect; alternatively one could say that the gel constitutes its own semi-permeable component. The Donnan equilibrium in protein solutions was further and extensively investigated by Loeb.

As far as I am aware, Mattson was first to identify the Donnan effect in “soil” suspensions,⁶ attributing e.g. “negative adsorption” of chloride as a consequence of Donnan equilibrium, and explicitly referencing the works of Procter and Loeb. Mattson describes the suspension in terms of electric double layers with a diffuse “atmosphere of cations” surrounding the “micelle” (the soil particle), and refers to Donnan equilibrium as the distribution of an electrolyte between the “micellar” and the “inter-micellar” solutions. Oddly,⁷ he uses Donnan’s original framework (e.g. eq. 1) to quantify the equilibrium, although the electrostatic potential and the ion concentrations varies significantly in the investigated systems. A more appropriate treatment would thus be to use e.g. the Gouy-Chapman description for the ion distribution near a charged plane surface (which he refers to!).

Instead, Schofield (1947) analyzed Mattson’s data using this approach. He also comments on its (the Gouy-Chapman model) range of validity

… [T]he equation is applicable to cases in which the distance between opposing surfaces considerably exceeds the distance between neighboring point charges on the surfaces; for there will then be a range of electrolyte concentrations over which the radius of the ionic atmosphere is less than the former and greater than the latter. In Mattson’s measurements on bentonite suspension, these distances are roughly 500 A. and 10 A. respectively, so there is an ample margin.

He continues to comment on the validity of Donnan’s original equations

When the distance ratio has narrowed to unity, it is to be expected that the system will conform to the equation of the Donnan membrane equilibrium. This equation fits closely the measurements of Procter on gelatine swollen in dilute hydrochloric acid. […] In a bentonite suspension the charges are so far from being evenly distributed that the Donnan equation is not even approximately obeyed.

From these statements it should be clear that the general behavior (cation exchange, salt exclusion) of ions in bentonite equilibrated with an external solution is due to the Donnan effect.⁸ The appropriate theoretical treatment of this effect differs, however, depending on details of the investigated system. To argue whether or not e.g. the Gouy-Chapman description should be classified as a “Donnan” approach is purely semantic.

It is also clear that in the case of compacted bentonite the distance ratio is narrowed to unity — the typical interlayer distance is 1 nm, which also is the typical distance between structural charges in the montmorillonite particles. It is thus expected that Donnan’s original treatment may work for such systems (adjusted for non-ideality), while the Gouy-Chapman description is not valid.⁹

The message I am trying to convey is neatly presented in Overbeek (1956) — a text I highly recommend for further information. Overbeek distinguishes between “classical” (Donnan’s original) and “new” (accounting for variations in potential etc.) treatments of Donnan equilibrium, and says the following about dense systems

If the particles come very close together the potential drop between [surface and interlayer midpoint] becomes smaller and smaller as illustrated in Fig. 4. This means that the local concentrations of ions are not very variable and that we are again back at the classical Donnan situation, where distribution of ions, osmotic pressure and Donnan potential are simply given by the elementary equations as treated in section 2. It is remarkable that the new treatment of the Donnan effects may deviate strongly from the classical treatment when the colloid concentration is low, but not when it is high.

It thus seems plausible that Donnan equilibrium in compacted bentonite can be treated using Donnan’s original equations. But — as interlayer pores are a quite extreme chemical environment — substantial non-ideal behavior may be expected. Treating such behavior is a large challenge for chemical modeling of compacted bentonite, but can not be avoided, since interlayers dominate the pore structure.

Cation exchange is Donnan equilibration

The term “Donnan” in modern bentonite literature is, as mentioned, quite heavily associated with the fate of anions interacting with bentonite. In contrast, cations are often described as being “sorbed” onto the “solids”. This sorption is usually separated into two categories: cation exchange and surface complexation.

Surface complexation reactions are typically described using “surface sites”, and are usually written something like this (exemplified with sodium sorption)

\begin{equation} \equiv \mathrm{S^-} + \mathrm{Na^{+}(aq)} \leftrightarrow \equiv \mathrm{SNa} \end{equation}

where the “surface site” is labeled \(\equiv \mathrm{S}^-\)

Cation exchange is also typically written in terms of “sites”, but requires the exchange of ions (duh!), like this (here exemplified for calcium/sodium exchange)

\begin{equation} \mathrm{2XNa} + \mathrm{Ca^{2+}(aq)} \leftrightarrow \mathrm{X_2Ca} + 2\mathrm{Na^+(aq)} \tag{2} \end{equation}

where X represents an “exchange site” in the solid phase.

In the clay literature the distinction between “surface complexation” and “ion exchange” reactions is rather blurred. You can e.g. find statements that “the ion exchange model can be seen as a limiting case of the surface complex model…”, and it is not uncommon that ion exchange is modeled by means of a surface complexation model. It also seems rather common that ion exchange is understood to involve surface complexation.

Underlying these modeling approaches and descriptions is the (sometimes implicit) idea that exchanged ions are immobile, which clearly has motivated e.g. the traditional diffusion-sorption model for bentonite and claystone. This model assumes that ion exchange binds cations to the solid, making them immobile, while diffusion occurs solely in a bulk water phase (which, incredibly, is assumed to fill the entire pore volume).

However, the idea that the exchanged ion is immobile does not agree with descriptions in the more general ion exchange literature, which instead acknowledge the process as an aspect of the Donnan effect.

Indeed, already in 1919, Donnan & Garner reported Na/K exchange equilibrium in a system consisting of two ferrocyanide solutions separated by a membrane impermeable to ferrocyanide, and it is fully clear that the particular distribution of cations in such systems is just as “thermodynamically necessary” as the distribution of chloride in the initial work on Congo Red and ferrocyanide.

Applied to clays, it is clear that cation exchange occurs even without postulating specific “sorption sites” or immobilization. On the contrary, ion exchange occurs in Donnan systems precisely because the ions are mobile.

In his book “Ion exchange”,¹⁰ Freidrich Helfferich describes ion exchange as diffusion, and distinguishes it from “chemical” processes

Occasionally, ion exchange has been referred to as a “chemical” process, in contrast to adsorption as a “physical” process. This distinction, though plausible at first glance, is misleading. Usually, in ion exchange as a redistribution of ions by diffusion, chemical factors are less significant than in adsorption where the solute is held by the sorbent by forces which may not be purely electrostatic.

Furthermore, in describing a general ion exchange system, he states the exact characteristics of a Donnan system, with the crucial point that the exchangeable ion is “free”, albeit subject to the constraint of electroneutrality

Ion exchangers owe their characteristic properties to a peculiar feature of their structure. They consist of a framework which is held together by chemical bonds or lattice energy. This framework carries a positive or negative electric surplus charge which is compensated by ions of opposite sign, the so-called counter ions. The counter ions are free to move within the framework and can be replaced by other ions of the same sign. The framework of a cation exchanger may be regarded as a macromolecular or crystalline polyanion, that of an anion exchanger as a polycation.

To give a very simple picture, the ion exchanger may be compared to a sponge with counter ions floating in the pores. When the sponge is immersed in a solution, the counter ions can leave the pores and float out. However, electroneutrality must be preserved, i.e., the electric surplus charge of the sponge must be compensated at any time by a stoichiometrically equivalent number of counter ions within the pores. Hence a counter ion can leave the sponge only when, simultaneously, another counter ion enters and takes over the task of contributing its share to the compensation of the framework charge.

With this “sponge” model at hand, he argues for that the reaction presented in eq. 2 above should be reformulated

[T]he model shows that ion exchange is essentially a statistical redistribution of counter ions between the pore liquid and the external solution, a process in which neither the framework nor the co-ions take part. Therefore Eqs. (1-1) [eq. 2 above] and (1-2) should be rewritten: \begin{equation} 2\overline{\mathrm{Na^+}} + \mathrm{Ca^{2+}} \leftrightarrow \overline{\mathrm{Ca^{2+}}} + 2\mathrm{Na^{+}} \end{equation} \begin{equation} 2\overline{\mathrm{Cl^-}} + \mathrm{SO_4^{2+}} \leftrightarrow \overline{\mathrm{SO_4^{2-}}} + 2\mathrm{Cl^{-}} \end{equation} Quantities with bars refer to the inside of the ion exchanger.

This “statistical redistribution” is of course nothing but the establishment of Donnan equilibrium between the external solution and the exchanger phase (as in the animation above). Naturally, Donnan equilibrium — using either the “classical” or the “new” equations — is at the heart of many analyses of ion exchange systems.

Unfortunately, this has not been the tradition in the compacted bentonite research field, where a “diffuse layer” approach to cation exchange has only been considered in more recent years, and then usually as a supplement to already existing models and tools. We are therefore in the rather uneasy situation that ion exchange in bentonite nowadays often is explained in terms of both a Donnan effect and as specific surface complexation.

Considering the robust evidence for significant ion mobility in interlayer pores, I strongly doubt surface complexation to be relevant for describing ion exchange in bentonite.¹¹ Instead, I believe that not separating these processes obscures the analysis of species that actually do sorb in these systems. In any event, the exact effects of Donnan equilibrium — a mechanism dependent on nothing but that some charges diffuses through the semi-permeable component, while some others don’t — must first and foremost be worked out.

A demonstration of compacted bentonite as a Donnan system

To demonstrate how well the Donnan effect in compacted bentonite is captured by Donnan’s original description, we use the following relation, derived from eq. 1 (i.e we assume only the presence of a 1:1 salt, apart from the impermeable component)

\begin{equation} \frac{c_\mathrm{Cl^-}^\mathrm{int}}{c_\mathrm{Cl^-}^\mathrm{ext}} = -\frac{1}{2}\frac{z}{c_\mathrm{Cl^-}^\mathrm{ext}} + \sqrt{\frac{1}{4} (\frac{z}{c_\mathrm{Cl^-}^\mathrm{ext}})^2+1} \tag{3} \end{equation}

Here \(z\) denotes the concentration of cations compensating impermeable charge. Eq. 3 quantifies anion exclusion, and is seen to depend only on the ratio \(c_\mathrm{Cl^-}^\mathrm{ext}/z\).

This equation is plotted in the diagram below, together with data of chloride exclusion in sodium dominated bentonite (Van Loon et al., 2007) and in potassium ferrocyanide (Donnan & Allmand, 1914)

Anion exclusion in bentonite and ferrocyanide compared with Donnan's ideal formula

I find this plot amazing. Although some points refer to bentonite at density 1900 \(\mathrm{kg/m^3}\) (corresponding to \(z \approx 5\) M), while others refer to a solution of approximately 25 mM \(\mathrm{K_4Fe(CN)_6}\) (\(z \approx 0.1\) M), the anion exclusion behavior is basically identical! Moreover, it fits the ideal “Donnan model” (eq. 3) quite well!

There is of course a lot more to be said about the detailed behavior of these systems, but I think a few things stand out:

It should be obvious that the basic mechanism for anion exclusion is the same in these two systems. This observed similarity thus invalidates the idea that anion exclusion in compacted bentonite is due to an intricate, ionic strength-dependent partitioning of a complex pore structure into parts which either are, or are not, accessible to chloride. In other words, the above plot is another demonstration that the concept of “accessible anion porosity” is nonsense.
The similarity between compacted bentonite and the simpler ferrocyanide system confirms Overbeek’s statement above, that Donnan’s “elementary” equations apply when the colloid concentration (i.e. density) is high enough.
The slope of the curve at small external concentrations directly reflects the amount of exchangeable cations that contributes to the Donnan effect. The similarity between model and experimental data thus confirms that the major part of the cations are mobile, i.e. not adsorbed by surface complexation. The similarity between the bentonite system and the ferrocyanide system also suggests that non-ideal corrections to the theory is better dealt with by means of e.g. activity coefficients, rather than by singling out a quite different mechanism (surface complexation) in one of the systems.

Footnotes

[1] The only equilibrium study of this kind I am aware of, that involves compacted, purified, homo-ionic clay, is Karnland et al. (2011). This study concerns Na/Ca exchange, and does not investigate the associated chloride equilibrium.

[2] I have assumed a K/Na selectivity coefficient of 2, and 95% salt exclusion.

[3] “Bentonite” is used in the following as an abbreviation for bentonite and claystone, or any clay system with significant cation exchange capacity.

[4] This particular publication states that I am one of the researchers using a “Donnan approach” to model “anion porosity”. Let me state for the record that I never have modeled “anion porosity”, or have any intentions to do so.

[5] This article has an English translation.

[6] In my head, a “soil suspension” and a “soil particle” are not very well defined entities. As I understand, Mattson investigated “Sharkey soil” and “Bentonite”. Sharkey soil is reported to have a cation exchange capacity of around 0.3 eq/kg, and the bentonite appear to be of “Wyoming” type. It is thus reasonably clear that Mattson’s “soil” particles are montmorillonite particles.

[7] Mattson and co-workers published a whole series of papers on “The laws of soil colloidal behavior” during the course of over 15 years, and appear to have caused both awe and confusion in the soil science community. I find it a bit amusing that there is a published paper (Kelley, 1943) which in turn reviews and comments on Mattson’s papers. Some statements in this paper include: “It seems to be generally agreed that some of [Mattsons papers] are difficult to understand.” and “The extensive use by [Mattson and co-workers] of terms either coined by them or used in new settings, the frequent contradictions of statement and inconsistencies in definition, and perhaps most important of all, the use by the authors of theoretical reasoning founded, not on experimentally determined data, but on calculations based on purely hypothetical premises, make it difficult to condense these papers into a form suitable for publication without doing injustice to the authors or sacrificing strict accuracy.”

[8] It may be worth noting that the only works referenced by Schofield — apart from a paper on dye adsorption — are Mattson, Procter and Donnan. Remarkably, Gouy is not referenced!

[9] Of course, one can instead solve the Poisson-Boltzmann equation for “overlapping” double layers.

[10] In its introduction is found the following gem: “A spectacular evolution began in 1935 with the discovery by two English chemists, Adams and Holmes, that crushed phonograph records exhibit ion-exchange properties.” Who wouldn’t want to hear more of that story?!

[11] As a further argument for that the concept of immobile exchangeable ions in bentonite is flawed, one can take a look at the spread in reported values for the fraction of such ions. You can basically find any value between \(>99\%\) and \(\sim 0\%\) for the same type of systems. To me, this indicates overparameterization rather than physical significance.

Sorption part II: Letting go of the bulk water

Leave a reply

Disclaimer: The following discussion applies fully to ions that only interact with bentonite by means of being part of an electric double layer. Here such ions are called “simple” ions. Species with more specific chemical interactions will be discussed in separate blog posts.

The “surface diffusion” model is not suitable for compacted bentonite

In the previous post on sorption¹ we derived a correct “surface diffusion” model. The equation describing the concentration evolution in such a model is a real Fick’s second law, meaning that it only contains the actual diffusion coefficient (apart from the concentration itself)

\begin{equation}
\frac{\partial c}{\partial t} = D_\mathrm{sd} \cdot\nabla^2 c \tag{1}
\end{equation}

Note that \(c\) in this equation still denotes the concentration in the presumed bulk water,² while \(D_\mathrm{sd}\) relates to the mobility, on the macroscopic scale, of a diffusing species in a system consisting of both bulk water and surfaces.³

Conceptually, eq. 1 states that there is no sorption in a surface diffusion model, in the sense that species do not get immobilized. Still, the concept of sorption is frequently used in the context of surface diffusion, giving rise to phrases such as “How Mobile Are Sorbed Cations in Clays and Clay Rocks?”. The term “sorption” has evidently shifted from referring to an immobilization process, to only mean the uptake of species from a bulk water domain to some other domain (where the species may or may not be mobile). In turn, the role of the parameter \(K_d\) is completely shifted: in the traditional model it quantifies retardation of the diffusive flux, while in a surface diffusion model it quantifies enhancement of the flux (in a certain sense).

A correct⁴ surface diffusion model resolves several of the inconsistencies experienced when applying the traditional diffusion-sorption model to cation diffusion in bentonite. In particular, the parameter referred to as \(D_e\) may grow indefinitely without violating physics (because it is no longer a real diffusion coefficient), and the insensitivity of \(D_\mathrm{sd}\) to \(K_d\) may be understood because \(D_\mathrm{sd}\) is the real diffusion coefficient (it is not an “apparent” diffusivity, which is expected to be influenced by a varying amount of immobilization).

Still, a surface diffusion model is not a very satisfying description of bentonite, because it assumes the entire pore volume to be bulk water. To me, it seems absurd to base a bentonite model on bulk water, as the most characteristic phenomenon in this material — swelling — relies on it not being in equilibrium with a bulk water solution (at the same pressure). It is also understood that the “surfaces” in a surface diffusion model correspond to montmorillonite interlayer spaces — here defined as the regions where the exchangeable ions reside⁵ — which are known to dominate the pore volume in any relevant system.

Indeed, assuming that diffusion occurs both in bulk water and on surfaces, it is expected that \(D_\mathrm{sd}\) actually should vary significantly with background concentration, because a diffusing ion is then assumed to spend considerably different times in the two domains, depending on the value of \(K_d\).⁶

Using the sodium diffusion data from Tachi and Yotsuji (2014) as an example, \(\rho\cdot K_d\) varies from \(\sim 70\) to \(\sim 1\), when the background concentration (NaCl) is varied from 0.01 M to 0.5 M (at constant dry density \(\rho=800\;\mathrm{kg/m^3}\)). Interpreting this in terms of a surface diffusion model, a tracer is supposed to spend about 1% of the time in the bulk water phase when the background concentration is 0.01 M, and about 41% of the time there when the background concentration is 0.5 M⁷. But the evaluated values of \(D_\mathrm{sd}\) (referred to as “\(D_a\)” in Tachi and Yotsuji (2014)) show a variation less than a factor 2 over the same background concentration range.

Insignificant dependence of \(D_\mathrm{sd}\) on background concentration is found generally in the literature data, as seen here (data sources: 1, 2, 3, 4, 5)

Diffusion coefficients as a function of background concentration for Sr, Cs, and Na.

These plots show the deviation from the average of the macroscopically observed diffusion coefficients (\(D_\mathrm{macr.}\)). These diffusion coefficients are most often reported and interpreted as “\(D_a\)”, but it should be clear from the above discussion that they equally well can be interpreted as \(D_\mathrm{sd}\). The plots thus show the variation of \(D_\mathrm{sd}\), in test series where \(D_\mathrm{sd}\) (reported as “\(D_a\)”) has been evaluated as a function of background concentration.⁸ The variation is seen to be small in all cases, and the data show no systematic dependencies on e.g. type of ion or density (i.e., at this level of accuracy, the variation is to be regarded as scatter).

The fact that \(D_\mathrm{sd}\) basically is independent of background concentration strongly suggests that diffusion only occurs in a single domain, which by necessity must be interlayer pores. This conclusion is also fully in line with the basic observation that interlayer pores dominate in any relevant system.

Diffusion in the homogeneous mixture model

A more conceptually satisfying basis for describing compacted bentonite is thus a model that assigns all pore volume to the surface regions and discards the bulk water domain. I call this the homogeneous mixture model. In its simplest version, diffusive fluxes in the homogeneous mixture model is described by the familiar Fickian expression

\begin{equation} j = -\phi\cdot D_c \cdot \nabla c^\mathrm{int} \tag{2} \end{equation}

where the concentration of the species under consideration, \(c^\mathrm{int}\), is indexed with an “int”, to remind us that it refers to the concentration in interlayer pores. The corresponding diffusion coefficient is labeled \(D_c\). Notice that \(c^\mathrm{int}\) and \(D_c\) refer to macroscopic, averaged quantities; consequently, \(D_c\) should be associated with the empirical quantity \(D_\mathrm{macr.}\) (i.e. what we interpreted as \(D_\mathrm{sd}\) in the previous section, and what many unfortunately interpret as \(D_a\)) — \(D_c\) is not the short scale diffusivity within an interlayer.

For species that only “interact” with the bentonite by means of being part of an electric double layer (“simple” ions), diffusion is the only process that alters concentration, and the continuity equation has the simplest possible form

\begin{equation} \frac{\partial n}{\partial t} + \nabla\cdot j = 0 \end{equation}

Here \(n\) is the total amount of diffusing species per volume porous system, i.e. \(n = \phi c^\mathrm{int}\). Inserting the expression for the flux in the continuity equation we get

\begin{equation} \frac{\partial c^\mathrm{int}}{\partial t} = D_c \cdot \nabla^2 c^\mathrm{int} \tag{3} \end{equation}

Eqs. 2 and 3 describe diffusion, at the Fickian level, in the homogeneous mixture model for “simple” ions. They are identical in form to the Fickian description in a conventional porous system; the only “exotic” aspect of the present description is that it applies to interlayer concentrations (\(c^\mathrm{int}\)), and more work is needed in order to apply it to cases involving external solutions.

But for isolated systems, e.g. closed-cell diffusion tests, eq. 3 can be applied directly. It is also clear that it will reproduce the results of such tests, as the concentration evolution is known to obey an equation of this form (Fick’s second law).

Model comparison

We have now considered three different models — the traditional diffusion-sorption model, the (correct) surface diffusion model, and the homogeneous mixture model — which all can be fitted to closed-cell diffusion data, as exemplified here

three models fitted to diffusion data for Sr from Sato et al. (92)

The experimental data in this plot (from Sato et al. (1992)) represent the typical behavior of simple ions in compacted bentonite. The plot shows the resulting concentration profile in a Na-montmorillonite sample of density 600 \(\mathrm{kg/m^3}\), where an initial planar source of strontium, embedded in the middle of the sample, has diffused for 7 days. Also plotted are the identical results from fitting the three models to the data (the diffusion coefficient and the concentration at 0 mm were used as fitting parameters in all three models).

From the successful fitting of all the models it is clear that bentonite diffusion data alone does not provide much information for discriminating between concepts — any model which provides a governing equation of the form of Fick’s second law will fit the data. Instead, let us describe what a successful fit of each model implies conceptually

The traditional diffusion-sorption model
The entire pore volume is filled with bulk water, in contradiction with the observation that bentonite is dominated by interlayer pores. In the bulk water strontium diffuse at an unphysically high rate. The evolution of the total ion concentration is retarded because most ions sorb onto surface regions (which have zero volume) where they become immobilized.
The “surface diffusion” model
The entire pore volume is filled with bulk water, in contradiction with the observation that bentonite is dominated by interlayer pores. In the bulk water strontium diffuse at a reasonable rate. Most of the strontium, however, is distributed in the surface regions (which have zero volume), where it also diffuse. The overall diffusivity is a complex function of the diffusivities in each separate domain (bulk and surface), and of how the ion distributes between these domains.
The homogeneous mixture model
The entire pore volume consists of interlayers, in line with the observation that bentonite is dominated by such pores. In the interlayers strontium diffuse at a reasonable rate.

From these descriptions it should be obvious that the homogeneous mixture model is the more reasonable one — it is both compatible with simple observations of the pore structure and mathematically considerably less complex as compared with the others.

The following table summarizes the mathematical complexity of the models (\(D_p\), \(D_s\) and \(D_c\) denote single domain diffusivities, \(\rho\) is dry density, and \(\phi\) porosity)

Note that the simplicity of the homogeneous mixture model is achieved by disregarding any bulk water phase: only with bulk water absent is it possible to describe experimental data as pure diffusion in a single domain. This process — pure diffusion in a single domain — is also suggested by the observed insensitivity of diffusivity to background concentration.

These results imply that “sorption” is not a valid concept for simple cations in compacted bentonite, regardless of whether this is supposed to be an immobilization mechanism, or if it is supposed to be a mechanism for uptake of ions from a bulk water to a surface domain. For these types of ions, closed-cell tests measure real (not “apparent”) diffusion coefficients, which should be interpreted as interlayer pore diffusivities (\(D_c\)).

Footnotes

[1] Well, the subject was rather on “sorption” (with quotes), the point being that “sorbed” ions are not immobilized.

[2] Eq. 1 can be transformed to an equation for the “total” concentration by multiplying both sides by \(\left (\phi + \rho\cdot K_d\right)\).

[3] Unfortunately, I called this quantity \(D_\mathrm{macr.}\) in the previous post. As I here compare several different diffusion models, it is important to separate between model parameters and empirical parameters, and the diffusion coefficient in the “surface diffusion” model will henceforth be called \(D_\mathrm{sd}\). \(D_\mathrm{macr.}\) is used to label the empirically observed diffusion parameter. Since the “surface diffusion” model can be successfully fitted to experimental diffusion data, the value of the two parameters will, in the end, be the same. This doesn’t mean that the distinction between the parameters is unimportant. On the contrary, failing to separate between \(D_\mathrm{macr.}\) and the model parameter \(D_\mathrm{a}\) has led large parts of the bentonite research community to assume \(D_\mathrm{a}\) is a measured quantity.

[4] It might seem silly to point out that the model should be “correct”, but the model which actually is referred to as the surface diffusion model in the literature is incorrect, because it assumes that diffusive fluxes in different domains can be added.

[5] There is a common alternative, implicit, and absurd definition of interlayer, based on the stack view, which I intend to discuss in a future blog post. Update (220906): This interlayer definition and stacks are discussed here.

[6] Note that, although \(D_\mathrm{sd}\) is not given simply by a weighted sum of individual domain diffusivities in the surface diffusion model, it is some crazy function of the ion mobilities in the two domains.

[7] With this interpretation, the fraction of bulk water ions is given by \(\frac{\phi}{\phi+\rho K_d}\).

[8] The plot may give the impression that such data is vast, but these are basically all studies found in the bentonite literature, where background concentration has been varied systematically. Several of these use “raw” bentonite (“MX-80”), which contains soluble minerals. Therefore, unless this complication is identified and dealt with (which it isn’t), the background concentration may not reflect the internal chemistry of the samples, i.e. the sample and the external solution may not be in full chemical equilibrium. Also, a majority of the studies concern through-diffusion, where filters are known to interfere at low ionic strength, and consequently increase the uncertainty of the evaluated parameters. The “optimal” tests for investigating the behavior of \(D_\mathrm{macr.}\) with varying background concentrations are closed-cell tests on purified montmorillonite. There are only two such tests reported (Kozaki et al. (2008) and Tachi and Yotsuji (2014)), and both are performed on quite low density samples.

Bentonite swelling pressure, part IV: electrostatics

Leave a reply

Swelling is not due to electrostatic repulsion between montmorillonite particles

Few things confuse me more than how the role of electrostatics in clay swelling is described in the scientific literature. Consider e.g. this statement from Bratko et al. (1986)

The interaction between charged aggregates in solution is generally interpreted in terms of electrostatic repulsion between double layers surrounding the aggregates.

But in the same paper we learn that the main contribution to the force between two charged surfaces in solution is the entropy of mixing of counter-ions, and that electrostatic interactions actually may result in an attractive force between the surfaces.

Nevertheless, I think Bratko et al. (1986) are right: swelling is, for some reason, often “interpreted” in terms of electrostatic repulsion between electric double layers. It is easy to find statements that e.g. the expression for the osmotic pressure in the Gouy-Chapman model describes “the electrostatic force per unit area”, or that lamellar phases are “electrostatically swollen”, with an osmotic pressure “mainly of electrostatic origin”. Segad (2013) writes

The interactions between the negatively charged platelets lead to a repulsive long-ranged electrostatic force promoting swelling.

and Tester et al. (2016) write

The DLVO theory describes the interaction between two colloidal particles as a balance between electrostatic repulsion, in this case between two negatively charged clay layers, and vdW attraction.

Laird (2006) claims that electrostatics cause both repulsion and (strong) attraction between clay layers

A balance between strong electrostatic-attraction and hydration-repulsion forces controls crystalline swelling. The extent of crystalline swelling decreases with increasing layer charge. Double-layer swelling occurs between quasicrystals. An electrostatic repulsion force develops when the positively charged diffuse portions of double layers from two quasicrystals overlap in an aqueous suspension. Layer charge has little or no direct effect on double-layer swelling.

Although many authors reasonably understand the actual mechanisms of double layer repulsion, I think it is very unfortunate that this language is established and contributes to unnecessary confusion.

To gain some intuition for that clay swelling is not primarily due to electrostatic repulsion between montmorillonite particles, let us consider the Poisson-Boltzmann equation. This is, after all, the description usually referred to when authors speak of “electrostatic repulsion” between clay layers. The Poisson-Boltzmann equation may be used to describe the electrostatic potential, and the corresponding counter-ion equilibrium distribution, between two equally charged surfaces, and a typical result looks like this¹

Here we assume two negatively charged parallel surfaces with uniform charge density, and the counter-ions are represented by a continuous charge density. The system is assumed infinitely extended in the x- (in/out of the page) and y- (up/down) directions, and thus rotationally symmetric around the z-axis.

With a lot of equal charges “facing” each other, the illustration may indeed give the impression that there somehow is an electrostatic repulsion between the surfaces. That this is not the case, however, may be seen from the symmetry of the potential. In fact, replacing one of the negatively charged surfaces by a neutral surface at half the distance does not change the solution to the Poisson-Boltzmann equation! A charged and a neutral surface thus experience the same repulsion as two charged surfaces, if only placed at half the distance.²

charged-neutral and charged-chared diffuse layer

With one surface being uncharged, “interpreting” the force as an electrostatic repulsion between the particles makes little sense.

A related way to convince yourself that there is no electrostatic repulsion between the two charged surfaces is to consider the electric field generated by one “half” of the original system. This field vanishes on the outside of the considered “half”-system.

This means that removing a “half”-system would not be “noticed” by the other “half”-system, in the sense that the electric field configuration remains the same (and corresponds to having a neutral particle at half the distance).

It may be helpful to also remember from electrostatics that the electric field outside a plate capacitor vanishes. Thus, configuring two plate capacitors as shown below, there is no electric field between the positively charged surfaces, regardless of how close they are.³

This “plate capacitor” configuration is actually reminiscent of the charge distribution in an interlayer at low water content (where the continuum assumption of the Poisson-Boltzmann equation is not valid).

A squeezed ion cloud

Having established that there is no direct electrostatic repulsion between clay particles, the obvious question is: what is the main cause for the repulsion? What the two configurations above have in common — with either two charged surfaces or one charged and one neutral surface — is that they restrict the counter ions to a certain volume. Hence, there is an entropic driving force for transporting more water into the region between the surfaces, thereby pushing them apart. Nelson (2013) describes this quite well⁴

One may be tempted to say, “Obviously two negatively charged surfaces will repel.” But wait: Each surface, together with its counterion cloud, is an electrically neutral object! Indeed, if we could turn off thermal motion the mobile ions would collapse down to the surfaces, rendering them neutral. Thus the repulsion between like-charged surfaces can only arise as an entropic effect. As the surfaces get closer than twice their Gouy–Chapman length, their diffuse counterion clouds get squeezed; they then resist with an osmotic pressure.

Notice that the presence of this osmotic pressure requires contact with an “external” solution. The existence of a repulsive force between clay layers thus requires that water is available to be transported into the interlayer region. This seems to often be “forgotten” about in many descriptions of clay swelling. But let Kjellander et al. (1988) remind us

The PB pressure between two planar surfaces with equal surface charge equals \(P_\mathrm{ionic} = k_BT\sum_i n_i(0)\), where \(n_i(0)\) is the ion density at the midplane between the surfaces. Due to symmetry there is no electrostatic force between the two halves of the system (the electrostatic fluctuation forces due to ion-ion correlations are neglected). To obtain the net pressure when the system is surrounded by a bulk electrolyte solution, it is necessary to subtract the external pressure calculated in the same approximation; this is given by the ideal gas contribution \(P_\mathrm{bulk} = k_BT \sum_i n_i^\mathrm{bulk}\).

There is no repulsive force of this kind in an isolated, internally equilibrated, clay.

Moreover, the force is usually conceived of as repulsive because the water chemical potential of the surrounding (“external”) solution is typically larger than in the clay. But from an osmotic viewpoint there is nothing fundamentally different going on when the external phase is, say, vapor of low pressure (set e.g. by a saturated salt solution), causing the clay to lose water, i.e to shrink. Thus, if swelling is “interpreted” as electrostatic repulsion between montmorillonite particles, then drying should be “interpreted” as electrostatic attraction between the same particles.

The fact that there is repulsion — in the osmotic sense — between a montmorillonite particle and a neutral surface has huge implications for how to handle interfaces between montmorillonite and other phases in compacted bentonite components. Rather than to simply assume these to contain bulk water (“free water”), there is every reason to believe that the physico-chemical conditions at such interfaces are similar to “ordinary” interlayer pores. Since any type of mineral alteration occurs at such interfaces, there is no escape from understanding interlayer chemistry if a satisfying geochemical description of bentonite is desired.

The role of electrostatics in clay swelling

Although swelling is not primarily due to direct electrostatic repulsion between clay particles, electrostatics is of course essential to consider when calculating the osmotic pressure. And rather than contributing to repulsion, electrostatic interactions actually reduce the pressure. This is clearly seen from e.g. the Poisson-Boltzmann solution for two charged surfaces, where the resulting osmotic pressure corresponds to an ideal solution with a concentration corresponding to the value at the midpoint (cf. the quotation from Kjellander et al. (1988) above). But the midpoint concentration — and hence the osmotic pressure — is lowered as compared with the average, because of electrostatic attraction between layers and counter-ions.

Moreover, a treatment of the electrostatic problem beyond the mean-field (i.e. beyond the Poisson-Boltzmann description) shows that ion-ion correlation cause an explicit attraction between equally charged surfaces (similar to a van der Waals force). In systems with divalent counter ions, this attraction is large enough to prevent swelling beyond a certain limit — a prediction in qualitative agreement with observation. Electrostatics could thus be claimed to contribute to prevent clay swelling.

I think comparison with a simple salt solution can be useful. Nobody (?) would come up with the idea that the primary reason for the osmotic pressure of a NaCl solution is due to electrostatic repulsion between, say, chloride ions. In fact, the electrostatic interactions in such a solution reduce the osmotic pressure compared with a corresponding ideal solution.

Below is plotted the swelling pressure of Na-montmorillonite as a function of the average concentration of counter-ions (data from here). For comparison, the osmotic pressures of a NaCl solution and an ideal solution are also plotted (data from here), as a function of the total amount of ions (i.e. two times the NaCl concentration)⁵

This plot demonstrates the attractive aspect of electrostatic interactions in these systems. While the NaCl pressure is only slightly reduced, Na-montmorillonite shows strong non-ideal behavior. In the “low” concentration regime (\(<2\) mol/kgw) we understand the pressure reduction as an effect of counter-ions electrostatically attracted to the clay surfaces. The dramatic increase of swelling pressure in the high concentration limit is reasonably an effect of hydration of ions and surfaces; it should be kept in mind that an average ion concentration of 3 mol/kgw in Na-montmorillonite roughly corresponds to a water-to-solid-mass ratio of only 0.3, and an average interlayer width below 1 nm.

Even though there seems to be quite some confusion regarding clay swelling pressure in the bentonite literature, the message here is not that everything about it is in reality understood. On the contrary, there are quite a number of behaviors that, as far as I’m aware, lack fully satisfactory explanations. For example, at room temperature the basal spacing in Ca-montmorillonite is never observed to be larger than \(\sim 19\) Å⁶, corresponding to a (dry) density of approximately \(1300 \;\mathrm{kg/m^3}\); yet, this material systematically exerts swelling pressure at considerably lower density (\(\sim 700 \;\mathrm{kg/m^3}\)). But in order to tackle issues like these, it is essential to be clear about the swelling mechanisms that we actually do understand.

Update (221018): A correction to this blog post is discussed here.

Footnotes

[1] This particular calculation uses the formulas presented in Engström and Wennerström (1978), and assumes mono-valent counter-ions at room temperature, a charge density of \(-0.1 \;\mathrm{C/m^2}\), and a surface-surface distance of 2 nm.

[2] Here is only considered the Poisson-Boltzmann pressure. If e.g. van der Waals attraction between the surfaces is included, the resulting forces are not necessarily equal. The point here, however, concerns the repulsion due to the presence of diffuse layers.

[3] Having strictly zero field is of course an ideal result, corresponding to an infinitely extended capacitor.

[4] The quotation is taken from a draft version of this book.

[5] The graph denoted “Ideal solution” is simply the van’t Hoff relation \(\Pi = RT c\), which strictly is only valid in the low concentration limit. It is nevertheless here extended to the whole concentration range. In the same way, the NaCl-curve is simply \(\Pi = \varphi RT c\), where \(\varphi\) is the osmotic coefficient for NaCl. Sorry about that.

[6] Upon cooling, Svensson and Hansen (2010) actually observed a basal spacing of 21.6 Å in pure Ca-montmorillonite.

Swelling pressure, part III: osmosis

1 Reply

An established procedure in clay research is to differ between regions of “crystalline” and “osmotic” swelling. Although this distinction makes sense in many ways, I think it is unfortunate that one of the regions has been named “osmotic”, as it may suggest that bentonite¹ swelling is only partly osmotic, or that it is only osmotic in certain density ranges.

In this post I argue for that bentonite swelling pressure should be understood as an osmotic pressure under all conditions, and discuss the distinction between “crystalline” and “osmotic” swelling in some detail.

Bentonite swelling pressure is an osmotic pressure, under all conditions

A macroscopic definition of osmosis and osmotic pressure cannot depend on specific microscopic aspects. Here we take the description from Atkins’ Physical Chemistry² as a starting point

The phenomenon of osmosis is the spontaneous passage of a pure solvent into a solution separated from it by a semipermeable membrane , a membrane permeable to the solvent but not to the solute. The osmotic pressure , \(\Pi\), is the pressure that must be applied to the solution to stop the influx of solvent.

These definitions are written with simple aqueous solutions in mind,³ but can easily be generalized to include bentonite lab samples. For such a case the role of the “solution” is taken by the bentonite sample, and the “solutes” are the exchangeable cations and other dissolved species, as well as the individual clay particles. The semipermeable membrane in a bentonite set-up is typically filters confining the sample. Note that such filters are impermeable only to the clay particles, while e.g. the exchangeable ions can freely move across them. That the exchangeable ions anyway are located in the sample is because of the electrostatic coupling between them and the clay particles; the filters keep the clay particles in place, and the requirement of charge neutrality forces, in turn, the exchangeable ions to stay in place. Finally, in a bentonite set-up the external water source is in general itself an aqueous solution (often a salt solution). But even if the above description assumes a source of pure solvent it is clear that the mechanism (passage of solvent) is active also if the external source contains several components.

With these remarks it should be clear that water uptake in a laboratory bentonite sample is an osmotic effect and that swelling pressure is an osmotic pressure: swelling pressure is the pressure (difference) that must be applied to prevent further spontaneous inflow of water from the external source.

comparison conventional osmotic pressure and bentonite swelling pressure

Note that the definition of osmotic pressure says nothing about the specific microscopic conditions — it would be rather bizarre if it did. That would imply that the poor lab worker must have knowledge, e.g. of whether a certain interlayer distance is realized in the sample, in order to judge whether or not the measured swelling pressure is an osmotic pressure.

What qualifies swelling pressure as an osmotic pressure is summarized in the relation

\begin{equation} P_s = -\frac{\Delta \mu_w}{v}, \tag{1} \end{equation}

which in earlier blog posts was shown to be generally valid in bentonite. Here \(\Delta \mu_w\) is the difference in water chemical potential between the non-pressurized bentonite and the external solution, and \(v\) is the partial molar volume of water. The presence of \(\Delta \mu_w\) in eq. 1 expresses the “spontaneous” character of the phenomenon: “spontaneous” in this context means movement of water from higher to lower chemical potential. \(\Delta \mu_w\) may have contributions both from entropy and energy, which can be expressed (a bit sloppy) as

\begin{equation} \Delta \mu_w = \Delta h_w – T \Delta s_w, \tag{2} \end{equation}

where \(\Delta h_w\) and \(\Delta s_w\) are the differences in (partial) molar enthalpy and entropy, respectively, and \(T\) is the absolute temperature.

Indeed, \(\Delta h_w\) dominates in very dense bentonite. But the chemical potential having both energetic and entropic contributions is in principle no different from more conventional aqueous solutions, as manifested in osmotic coefficients generally being different from unity.

When only mixing entropy contributes to \(\Delta \mu_w\), and in the limit of a dilute solution, eq. 1 reduces to van ‘t Hoff’s formula \(\Pi = RTc\), where \(c\) is the solute concentration. Thus, rather than defining osmotic pressure, van ‘t Hoff’s formula is a limit of the the general relation expressed in eq. 1.

“Crystalline” vs. “osmotic” swelling

Although a division between “crystalline” and “osmotic” swelling regions can be found in the literature as far back as the 1930s, there doesn’t seem to be fully coherent definitions of these terms.

Some authors use an interlayer spacing range to define the crystalline swelling regions, some emphasize “hydration” of ions or surfaces (or both) as the defining feature. Some associate crystalline swelling with the release of an appreciable amount of heat, and others with that it occurs in discrete steps. There are also examples of authors differing between “limited” and “extensive” crystalline swelling.

Note that any of these definitions complies with swelling pressure being an osmotic pressure of the form discussed above; the release of heat, or effects of “hydration”, is accommodated by a non-zero enthalpy contribution (\(\Delta h_w\)) in eq. 2.

Also the “osmotic” swelling region is defined by some authors in terms of an interlayer spacing range. But in defining this region, many authors allude to some emerging “diffusive” property of the exchangeable cations, and here I really think the definitions become problematic. E.g. Madsen and Müller-Vonmoos (1989) discuss two “phases” of swelling, and write

Unlike innercrystalline swelling, which acts over small distances (up to 1 nm), osmotic swelling, which is based on the repulsion between electric double layers, can act over much larger distances. In sodium montmorillonite it can result in the complete separation of the layers. […] The driving force for the osmotic swelling is the large difference in concentration between the ions electrostatically held close to the clay surface and the ions in the pore water of the rock.

Leaving aside what is exactly meant by the term “pore water”, there are several issues here. Firstly, it appears that the authors have in mind a text book version of osmosis — basically van ‘t Hoff’s formula — when writing that the driving force is due to “differences in concentration”. But the actual driving force is differences in water chemical potential, which only under certain circumstances can be translated to differences in solute concentration. Note that also in the case of “crystalline” swelling is water transported from regions of low to regions of (really) high ion concentration. So, with the same logic you can also claim that the driving force for “crystalline” swelling is “large differences in concentration”.

Secondly, the electric double layer is an example of a system where there is no simple relation between ion concentration differences and transport driving forces — the diffuse layer displays an ion concentration gradient in equilibrium, and very weakly overlapping diffuse layers can be conceived of, where the driving force for in-transport of water is minimal, even though the ion concentration closest to the surfaces is large. To arrive at a van ‘t Hoff-like equation for the osmotic pressure of an overlapping diffuse layer, you first have to solve an electrostatic problem (the Poisson-Boltzmann equation, or something worse). With that analysis made, the (approximate⁴) osmotic pressure can be related to the midpoint concentration in the interlayer space. Madsen and Müller-Vonmoos (1989) present some electrostatic treatment, but, as far as I can see, don’t reflect over the amount of energetics involved in evaluating the osmotic pressure.

Lastly, the way these and many other authors single out the “diffusive” nature of the exchangeable cations when defining “osmotic” swelling implies that they do not consider ions to be diffusive in “crystalline” swelling states. Norrish (1954) states this quite explicitly (writing about the “crystalline” swelling region)

Nor can the interaction of diffuse double layers produce a repulsive force since in this region diffuse double layers are not formed. The repulsive forces of ion hydration and surface adsorption are probably the initial repulsive forces for many other colloids. They can cause surface separations of \(\sim 10\) Å, where the ions could begin to form diffuse double layers.

Even though I cannot find any explicit statements in Norrish (1954), writing like this makes me fear that authors of this era were under the impression that the initial interlayer hydration states consist of actual crystalline (non-liquid) water; I note that e.g. Grim (1953) has a several pages long section entitled “Evidence for the Crystalline State of the Initially Adsorbed Water”. Could it be that the original use of the term “crystalline” swelling was influenced by this belief?

Anyway, nowadays we have vast amount of evidence that interlayer water — at least down to the bihydrate — is liquid-like, and that ions in such states certainly diffuse. It follows that the osmotic pressure in such states has a contribution from mixing entropy.⁵ It should also be pointed out that the prevailing qualitative explanation for limited swelling in Ca-montmorillonite — which often is described as only displaying “crystalline” swelling — is due to ion-ion correlations in a diffusive system (“overlapping” diffuse layers).

Despite the evidence for interlayer diffusivity, it is very common to find descriptions in the bentonite literature that diffuse layers “develop” or “form” as the interlayers distances (or some other presumed pore) becomes large enough. This is usually claimed without giving a mechanism of how such a “development” or “formation” occurs. I genuinely wonder what authors using such descriptions believe the ions are doing when they have not “formed” a diffuse layer…

My message here is not that a division between “crystalline” and “osmotic” swelling should be discarded — for certain issues it makes a lot of sense to make a distinction, especially as the transition between these regions is not fully understood. But I think authors can do a better job in defining what exactly they mean by terms such as “osmotic”, “crystalline”, “diffusive”, etc. I furthermore wish that another name could be established for the “osmotic” swelling region (Norrish (1954) actually used “Region 1” and “Region 2”), although that seems rather unlikely. Until then we have to live with that bentonite swelling is described as “osmotic” only in a certain density range, while — if reasonable definitions are adopted — bentonite swelling pressure actually is an osmotic pressure under all conditions.

Footnotes

[1] In the following I usually mean bentonite when writing “bentonite”, even though the main points of the blog post also apply to claystone with swelling properties.

[2] The quotation is taken from the 8th edition.

[3] Note how this description does not refer to any microscopic concepts, nor to differences in concentrations. There seems to be a whole academic field devoted to sorting out misconceptions about osmosis. For further reading, I can recommend e.g. (Kramer and Mayer, 2012) and (Bowler, 2017).

[4] There may be additional significant activity corrections. I guess a solution of the Gouy-Chapman model could be compared to using the Debye-Hückel equation for a conventional aqueous salt solution.

[5] I am not arguing for that swelling is driven by entropy in these states — the entropy contribution is actually negative. But the entropy reasonably has both a positive (mixing) and a negative (hydration) part.

Anion-accessible porosity – a brief history

2 Replies

Genesis

In the beginning there was the Poisson-Boltzmann equation. Solving it for the case of a salt solution in contact with a negatively charged plane surface (a.k.a. the Gouy-Chapman model) gives the concentration of cations and anions in the solution as a function of the distance to the surface, like this¹

Illustration of Gouy-Chapman concentration profiles

Note:

The suppression of the anion concentration near the surface is often referred to as negative adsorption or anion exclusion. The total amount of excluded anions per unit surface area (indicated in green), usually labeled \(\Gamma^-\), is obtained by integrating the Poisson-Boltzmann equation.
There are, nevertheless, anions everywhere! This model will give zero anion concentration only for an infinitely negative electrostatic potential (or if \(c_0 = 0\), of course).

A clever way to utilize negative adsorption is for estimating the amount of smectite surface area in a soil sample, first suggested by Schofield (1947). This is done by comparing measured values of negative adsorption with the appropriate expression evaluated from the Gouy-Chapman model. When doing the necessary math² for such an analysis you naturally end up with expressions like

\begin{equation} \frac{\Gamma^-}{c_0} \sim \text{const.}\cdot \kappa^{-1} \tag{1} \end{equation}

where \(c_0\) is the external anion concentration (i.e. far from the surface), and \(\kappa^{-1}\) is the Debye length. This equation, having the dimension of length, can be interpreted as the width, \(d_{ex}\), of a region devoid of anions, which gives the same amount of negative adsorption as the full exclusion region, as illustrated here (yellow)

However, note:

This is just an equivalent, fictitious region.
Anions are still everywhere!

Due to its convenience in the analysis, the notion of an equivalent region devoid of anions — often referred to in terms of “volume of exclusion” — became rather popular. At the same time, authors stopped emphasizing that this is a fictitious region. A clear example of such a transition is Edwards and Quirk (1962) who states that \(\Gamma^-/c_0\) “can be regarded as the surface depth from which chloride ions are excluded”, while in Edwards et al. (1965) the same quantity (multiplied by area) is referred to as “the volume from which chloride is excluded”. The latter statement is, strictly speaking, wrong: the actual volume from which anions are excluded is the entire region where the concentration deviates from \(c_0\), and the exclusion is only partial — there are anions everywhere!

Compacted bentonite

But the idea of an actual region devoid of anions seems to have stuck, and I believe that this influenced the interpretation of diffusion in compacted bentonite³ in terms of “effective porosity” or “anion accessible-porosity”. Concepts which, in turn, have motivated the idea that bentonite contains bulk water (“free water”, “pore water”).

The first example of this usage in studies of compacted bentonite, that I know of, is in Muurinen et al. (1988) reporting chloride through-diffusion in bentonite with various densities and background concentrations.

The tracer concentration of the porewater clearly depends on the compaction of bentonite and on the salt concentration of the circulating water. The effective porosity can even be less than one percent when the salt concentration is low and compaction high. Also, the diffusivities strongly depend on the density of bentonite and on the salt concentration.

The low tracer concentration in bentonite in the diffusion tests […] are indicative of ion-exclusion [5]. Ion-exclusion probably decreases the effective size of the pores, which changes the geometric factor, of bentonite and thus the apparent diffusivity. In addition to the geometric factor, the effective diffusivity takes into account the effective pore volume; thus, the dependence is even stronger.

“Effective porosity” has not been defined earlier in the article, so it is difficult to know precisely what the authors mean by the term. But it is relatively clear⁴ from the second paragraph that they explain the measured fluxes as being a result of a physical variation of the pore volume accessible to anions, rather than as a variation of the tracer concentration in a homogeneous system. This is also supported by their writing in the conclusions section: “The decreased pore size and porosity caused by ion-exclusion could at least qualitatively explain the dependence.”

However, the reference they provide (“[5]”) is Soudek et al. (1984), who calculate anion exclusion by means of — the Poisson-Boltzmann equation! (Which predicts anions everywhere.) In fact, Soudek et al. (1984) calculate what they term “Donnan exclusion” in a homogeneous model of “parallel, equally-spaced platelets”. Thus, the reference supplied by Muurinen et al. (1988) is in direct contradiction with their interpretation that the pore size and porosity is decreasing with the salt concentration.

Soudek et al. (1984) even provide an example of how the average chloride concentration between the platelets depends on the separation distance, when in equilibrium with an external solution of 10 mM, and write

Note the extremely strong rejection of the co-ion. At 50 w% clay (\(\sim 25\)Å plate separation) almost 90% of the anions are rejected.

which is completely in line with the observation of Muurinen et al. (1988) that “The effective porosity can even be less than one percent when the salt concentration is low and compaction high”, if only “effective porosity” is replaced by “concentration between the plates”.

It makes me a bit tired to discover that the record could have been set straight over 30 years ago regarding which pores anions can access. Instead the bentonite research community, for the most part, doubled down on the idea that anions only have accesses to parts of the pore volume, or that compacted bentonite contains a significant amount of bulk water.

An explicit description of interpreting “chloride through diffusion porosity” as a specific, limited part of the pore volume is given by Bradbury and Baeyens (2003)

In the interlayer spaces and regions where the individual montmorillonite stacks are in close proximity, double layer overlap will occur and anion exclusion effects will take place. Exclusion will probably be so large that it is highly unlikely that anions can move through these regions (Bolt and de Haan, 1982). However, Cl anions do move relatively readily through compacted bentonite since diffusion rates have been measured in ‘‘through-diffusion’’ tests […]

If the Cl anions cannot move through the interlayer and overlapping double layer regions because of anion exclusion effects, then it is reasonable to propose that the ‘‘free water’’ must provide the diffusion pathways (Fig. 1). Therefore, the hypothesis is that the pore volume associated with the transport of chloride (and other anions) is the ‘‘free water’’ volume, and that this is the porewater in a compacted bentonite.

Here they refer to Bolt and De Haan, (1982)⁵ when arguing for that anions do not have access to interlayers. But the analysis in this reference is based on nothing but — the Poisson-Boltzmann equation! (which predicts anions everywhere)

Another thing to note is the notion of “overlapping” diffuse layers. Studies of negative adsorption to quantify surface area typically look at soil suspensions, with a solid part of a few percent. In such systems it is justified to perform the analysis on a single diffuse layer because the distance between separate montmorillonite particles is large enough. But at higher density there is not enough space between separate clay particles for the ion concentrations to ever reach the “external” value (\(c_0\)) — the diffuse layers “overlap”.⁶

It has been shown that effects of “overlapping” diffuse layers on the resulting negative adsorption is significant already at a a solid content of 6%. When carrying over the anion exclusion analysis to compacted bentonite — with solid content typically above 70%! — it therefore becomes near impossible to believe that the system should contain regions unaffected by the montmorillonite (“free water”). Yet, the argumentation above, apart from being flawed in the way it refers to the Poisson-Boltzmann equation, relies critically on the existence of such regions.

The mindful reader may remark that compacted bentonite, if it mainly contains “overlapping” diffuse layers, perhaps is devoid of anions after all. But the Poisson-Boltzmann equation predicts anions everywhere also for “overlapping” diffuse layers. Actually, the model by Soudek et al. (1984), discussed above, considers this case.

Despite the improbability that montmorillonite particles in compacted bentonite can be spaced so far apart as to allow for bulk water within the system, the idea of anions only having access to “free” water was nevertheless further pursued by Van Loon et al. (2007). They provide a picture similar to this

The idea here (and elsewhere) is that bentonite consists of “stacks” of individual montmorillonite particles (TOT-layers) interlaced with interlayer water.⁷ The space between “stacks” is assumed large enough for diffuse layers to fully develop, and to merge into a bulk solution (“free water”), whose volume depends on the ionic strength, reminiscent of the excluded volume in eq 1.⁸ Anions are postulated to only have access to this “free” water.

But as references for anion exclusion is once again simply given studies based on the Poisson-Boltzmann equation (in particular, Bolt and De Haan, (1982)). But these — as I hope has been made clear by now — predict anions everywhere, and consequently do not support the suggested model. In this case, the mismatch between model and supporting references stands out, as the term “effective porosity” is used interchangeably with the term “Cl-accessible porosity”; if Gouy-Chapman theory in a convoluted way can be used to define an “effective” porosity (having no other meaning than a fictitious, equivalent volume), there is no possibility whatsoever to use it to support the idea of anions having access to only parts of the pore space. Ironically, “anion-accessible porosity” seems to be the most popular term nowadays for describing effects of anion exclusion in compacted bentonite.

The strongest confirmation that the modern-day concept of anion-accessible porosity is simply a misuse of the exclusion-volume concept is given in Tournassat and Appelo (2011). They provide a quite extensive background for the type of anion exclusion they consider, and it is based on the excluded-volume concept discussed above. They even explicitly calculate the excluded-volume (named “total chloride exclusion distance”) only to directly discard it as not suitable

However, this binary representation (absence or presence of chloride, Fig. 3) is not very representative of the system since the EDL is not completely devoid of anions.

Yet, after making this statement that anions are everywhere (in the diffuse layer) they anyway define anion accessible porosity as an effective, fictitious volume!⁹

Interlayers

Apart from treating the diffuse layer incorrectly, Bradbury and Baeyens (2003), Van Loon et al. (2007) and Tournassat and Appelo (2011) all make the additional unjustified assumption that interlayers — which in these studies are defined as distinctly different from diffuse layers¹⁰ — are completely devoid of anions. Bradbury and Bans (2003) cites conventional Poisson-Boltzmann based studies to incorrectly support this claim (see above). Also Van Loon et al. (2007) use Bolt and De Haan, (1982) as a reference¹¹

Due to the very narrow space, the double layers in the interlayers overlap and the electric potential in the truncated layer becomes large leading to a complete exclusion of anions from the interlayer (Bolt and de Haan, 1982; Pusch et al., 1990; Olin, 1994; Wersin et al., 2004). The interlayer water thus contains exclusively cations that compensate the permanent charges located in the octahedral layer of the clay.

Of the other sources cited, Pusch et al. (1990) mention “Donnan exclusion” as the reason preventing anions from having access to interlayers. But this is incorrect – Donnan equilibrium always gives a non-zero anion concentration in the interlayer (as long as the external concentration is non-zero). Wersin et al. (2004) only claim that anions are “excluded” from interlayers, without further explanation or references. (I haven’t managed to read Olin (1994) .)

Tournassat and Appelo (2011) cites Bourg et al. (2003) to support the claim that anions have no access to interlayers

When the dry density is above \(1.8 \;\mathrm{kg/dm^3}\), almost all the porosity resides in the interlayers of Na-montmorillonite. Since anions are excluded from the interlayers, the anion-accessible porosity becomes zero, and anion-diffusion is minimal (Bourg et al., 2003)

But in Bourg et al. (2003) is explicitly stated that anion exclusion from interlayers is only “partial”!

To sum up…

The idea that anions have access only to parts of the pore volume is widespread in today’s compacted bentonite research community. In this blog post I have shown that this idea emerges from misusing the concept of exclusion-volume, and that all references used to support ideas of “complete exclusion” rests on the Poisson-Boltzmann equation. The Poisson-Boltzmann equation, however, predicts anions everywhere! Thus, the concept of an anion-accessible porosity, and the related idea that compacted bentonite contains different “types” of water, have not been provided with any kind of theoretical support.

In contrast, the result that anions have access to the entire pore volume is further supported both by molecular dynamics simulations, as well as by the empirical evidence for salt in interlayers.

Footnotes

[1] This figure is just an illustration, not an actual result. Update (220831): Actual solutions to the Poisson-Boltzmann equation are presented here.

[2] Schofield writes with an enthusiasm seldom seen in modern scientific papers: “I considered that it would be possible to compute the negative adsorption of the repelled ions from the basic assumptions of Gouy’s theory of the diffuse electric double layer, and therefore invited Mr. M. H. Quenouille to tackle the mathematical difficulties involved. Complete solutions have now been obtained for electrolytes in which the ions have valency ratios 1:2, 1:1, and 2:1, and a full account of this work will be submitted for publication shortly.”

[3] “Bentonite” is used in the following as an abbreviation of “Bentonite and claystone”.

[4] I mean that the word “probably” as used here does not belong in a scientific text.

[5] Sciencedirect.com dates this reference to 1979. The book has a second revised edition, however, published in 1982.

[6] I use quotation marks when writing “overlap” because I think this wording gives the wrong impression in compacted clay: with an average distance between montmorillonite particles of around 1 nm, the concept of individual diffuse layers has lost its meaning.

[7] I plan to comment on “stacks” in a future blog post. Update (211027): Stacks make no sense.

[8] The volume is, however, not proportional to the Debye length, but depends exponentially on ionic strength.

[9] The “anion accessible porosity” is defined in this paper as \(\epsilon_{an} = \epsilon_{free} + \epsilon_D\cdot c_D/c_{free}\), where \(\epsilon_{free}\) is the porosity of a presumed bulk water phase in the bentonite, and \(\epsilon_D\) quantifies the volume of an arbitrarily chosen “Donnan volume” which is (Donnan) equilibrated with the “free” solution. \(c_D\) is the anion concentration in this “Donnan volume”, and \(c_{free}\) is the anion concentration in the bulk water.

[10] In this context, “interlayers” are defined as being parts of “stacks”. I really need to write about “stacks”… Update (211027): Stacks make no sense

[11] Bolt and de Haan (and others) are fond of writing that anions in very narrow confinement are “almost completely excluded” or “virtually completely excluded”, indicating that they may neglect anions in these compartments, but also that they are aware of that the equations they use never give exactly zero anion concentration. When working with soil suspensions of only a few percent solids it may be a valid approximation to neglect anions in nm-wide pores. In compacted bentonite it is not.

Swelling pressure, part I

3 Replies

I am puzzled by how bentonite swelling pressure is presented in present day academic works.

In soil science, the thermodynamic description of the phenomenon has been around since at least the 1940s. Still, pure thermodynamic approaches to swelling pressure are not fashionable in modern day research. I think this is a pity, because for many issues this is the preferred approach.
Naturally, the notion of the Electric Double Layer (EDL) is central in many descriptions of bentonite swelling pressure, and EDL models seem to fit the bill very well for e.g. Li- and Na-montmorillonite at intermediate densities (at high density, the resulting pressure becomes increasingly sensitive to variations in model parameters, such as ionic radii). Models based on the EDL concept also give a satisfying qualitative explanation for the limited swelling of Ca- and Mg-montmorillonite, in terms of ion-ion correlation. But common EDL approaches — as far as I’m aware — fail to reproduce the observation that swelling pressure is significantly reduced in montmorillonites with heavier monovalent cations (e.g. K-montmorillonite).
Although the notion of an EDL captures essential parts of the observed swelling pressure behavior, it is nevertheless quite easy to find claims that e.g. DLVO theory has serious flaws , or even that the fundamental mechanism for clay swelling is not associated with the exchangeable cations¹. While many of these alternative descriptions may be easily (or not so easily) rejected, I fear that they have influenced a way of thinking about other aspects of bentonite, in particular the persistent idea of exchangeable cations as non-diffusive.
Modern geotechnical descriptions of bentonite often combine capillary concepts (surface tension, saturation,”suction” etc.) with a diffuse double layer mechanism. This seems conceptually incompatible, as a rigid structure — which is required to withstand attractive capillary forces — can hardly exist in a system that, under certain circumstances, is able to swell indefinitely.
In most geochemical treatments of bentonite, swelling pressure is simply ignored. This is remarkable as most modern reactive-transport models postulate different domains with different “types” of water, making the issue of water equilibrium critical.

Here, I would like to revisit the pure thermodynamic description of swelling pressure, which I think may help in resolving several misconceptions about swelling pressure.

Of course, thermodynamics cannot answer what the microscopic mechanism of swelling is, but puts focus on other — often relevant — aspects of the phenomenon. We thus take as input that, at the same pressure and temperature, the water chemical potential² is lowered in compacted bentonite as compared with pure water, and we ignore the (microscopic) reason for why this is the case. We write the chemical potential in non-pressurized³ bentonite as \begin{equation} \mu_w(w,P_0) = \mu_0 + \Delta \mu(w,P_0) \end{equation}

where \(\mu_0\) is a reference potential of pure bulk water at pressure \(P_0\) (isothermal conditions are assumed, and temperature will be left out of this discussion), and \(w\) is the water-to-solid mass ratio. Note that \(\Delta \mu(w,P_0)\) is a negative quantity.

The chemical potential in a pressurized system is given by integrating \(d\mu_w = v_wdP\), where \(v_w\) is the partial molar volume of water, giving⁴ \begin{equation} \mu_w(w,P) = \mu_0 + \Delta \mu(w,P_0) + v_w\cdot (P-P_0) \end{equation}

In order to define swelling pressure, we require that the bentonite is confined to a certain volume while still having access to externally supplied water, i.e. that it is separated from an external water source by a semi-permeable component. This may sound abstract, but is in fact how any type of swelling pressure test is set up: water is supplied to the sample via e.g. sintered metal filters.

With this boundary condition, a relation between swelling pressure and the chemical potential is easily obtained by invoking the condition that, at equilibrium, the chemical potential is the same everywhere. Assuming an external reservoir of pure water at pressure \(P_0\), its chemical potential is \(\mu_0\), and the equilibrium condition reads \begin{equation} \mu_w(w,P_{eq}) = \mu_0 + \Delta \mu(w,P_0) + v_w\cdot (P_{eq}-P_0) = \mu_0 \end{equation}

where \(P_{eq}\) is the pressure in the bentonite at thermodynamic equilibrium.

Defining the swelling pressure as \(P_s = P_{eq}-P_0\) we get the desired relation⁵ \begin{equation} P_s = -\frac{\Delta \mu(w,P_0)}{v_w} \tag{4} \end{equation}

Alternatively this relation can be expressed in terms of activity (related to the chemical potential as \(\mu = \mu_0 +RT\ln a\)) \begin{equation} P_s = -\frac{RT}{v_w}\ln a (w,P_0) \tag{5} \end{equation}

or, if the activity is expressed in terms of the vapor pressure, \(P_v\), in equilibrium with the sample, \begin{equation} P_s = -\frac{RT}{v_w}\ln \frac{P_v}{P_{v0}} \tag{6} \end{equation}

where \(P_{v0}\) is the corresponding vapor pressure of pure bulk water.

The above relation has been presented in the literature for a long time. But, as far as I am aware, direct interpretation of experimental data using eq. 4 is more scarce. Spostio (72) compares swelling pressures in Na-montmorillonite (reported by Warkentin et al 57) with water activities measured in the materials (reported by Klute and Richards 62) and concludes a “quite satisfactory” agreement of eq. 4 (the highest pressures were on the order of 1 MPa). He moreover comments

Future measurements of \(P_S\) and \(\Delta \mu_w\) for pure clays and soils as a function of water content would do much to help assess the merit of equation (11) [eq. 4 here].

Such “future” measurements were indeed presented by Bucher et al (1989), for “natural” bentonites in a density range including very high pressures (\(\sim 40\) Mpa). For “MX-80” the data looks like this

Here the value of \(v_w\) was set equal to the molar volume of bulk water when applying eq. 6. It is interesting to note that this value, which is necessarily correct in the limit of low density, appears to be valid for densities as large as \(2\;\mathrm{g/cm^3}\).

The clearest demonstration of the validity of eq. 4 is in my opinion the study by Karnland et al. (2005), where swelling pressure and vapor pressure were measured on the same samples. The result for Na-montmorillonite is shown below (again, the value of bulk water molar volume was used for \(v_w\)).

The above plots make it clear that the description underlying eq. 4 (or eq. 5, or eq. 6) is valid for bentonite, at any density. An important consequence of this insight — and something I think is often not emphasized enough — is that swelling pressure depends as much on the external solution as it does on the bentonite.

Measuring the response in swelling pressure to changes in the external solution is therefore a powerful method for exploring the physico-chemical behavior of bentonite. I will return to this point in later blog posts, in particular when discussing the “controversial” issue whether “anions” have access to montmorillonite interlayers.

The animation below summarizes the thermodynamic view of the development of swelling pressure: the external reservoir fixes the value of the water chemical potential, and in order for the bentonite sample to attain this level, its pressure increases.

Footnotes

[1] You can even find a statement saying that clay swelling has been proved to be “due to long-range interaction between particle surfaces and the water” (I don’t agree).

[2] In the following I will simply write “chemical potential”. Here the water chemical potential is the only one involved.

[3] Here “non-pressurized” means being at the reference pressure \(P_0\). In practice \(P_0\) is usually atmospheric absolute pressure.

[4] Here it is assumed that \(v_w\) is independent of pressure. Also, using \(w\) as thermodynamic variable implies that the water chemical potential is measured in units of energy per mass, which requires this volume factor to be the partial specific volume of water. Here we assume that the chemical potential is measured in units energy per mol, but use \(w\) for quantifying the amount of water in the clay, since it is the more commonly used variable in the bentonite world. The amount of moles of water is of course in strict one-to-one correspondence with the water mass.

[5] What is said here is that swelling pressure generally is identified as an osmotic pressure. I will expand on this in a future blog post.

Repulsion between surfaces and anions is not really the point

Ferrocyanide

Bentonite

A charge neutral perspective

Footnotes

Stack descriptions in the literature

Inadequate referencing

Stacks in compacted bentonite make no geometrical sense

A “realistic” stack picture

What is supposed to keep stacks together?

Implications

“Interlayers”

The stack view obstructs real discovery

Footnotes

Measuring Kd

Kd in the homogeneous mixture model

Kd for simple cations

Simple cation tracers in a 1:1 system

Simple cation tracers in a 2:1 system

Density dependence of Kd

Footnotes

Donnan equilibrium: effect vs. model

Donnan equilibrium in gels, soils and clays

Cation exchange is Donnan equilibration

A demonstration of compacted bentonite as a Donnan system

Footnotes

The “surface diffusion” model is not suitable for compacted bentonite

Diffusion in the homogeneous mixture model

Model comparison

Footnotes

Swelling is not due to electrostatic repulsion between montmorillonite particles

A squeezed ion cloud

The role of electrostatics in clay swelling

Footnotes

Bentonite swelling pressure is an osmotic pressure, under all conditions

“Crystalline” vs. “osmotic” swelling

Footnotes

Genesis

Compacted bentonite

Interlayers

To sum up…

Footnotes

Footnotes

Measuring K_d

K_d in the homogeneous mixture model

K_d for simple cations

Density dependence of K_d