Integration and Measure Theory
Darboux sums and the Riemann integral. Jordan content and zero Lebesgue measure. Lebesgue's criterion for Riemann integrability. Introduction to sigma-algebras, Borel sets, outer measures, and Lebesgue measure.
In this chapter we develop the Riemann integral from first principles via Darboux sums, establish a sharp criterion for Riemann integrability (Lebesgue's theorem), and introduce the basic language of measure theory. The goal is twofold: first, to understand exactly which functions can be integrated in the classical Riemann sense and which cannot; and second, to lay the groundwork for a much more flexible theory of integration — the Lebesgue integral — that resolves the defects of the Riemann theory. Along the way we meet Jordan content, sets of Lebesgue measure zero, sigma-algebras, Borel sets, outer measures, Caratheodory's criterion, and the construction of Lebesgue measure on .
Partitions and Darboux Sums
Let be a compact interval. A partition of is a finite set with Given two partitions of , we say is a refinement of iff .
Intuition: A partition chops into finitely many subintervals. A refinement adds more dividing points, so every subinterval of sits inside a subinterval of . Finer partitions give finer approximations of area.
Let be bounded and let be a partition of . For , set The upper Riemann–Darboux sum and the lower Riemann–Darboux sum of over with partition are
Intuition: The upper sum is the area of the smallest "staircase" that fits above the graph of , while the lower sum is the area of the largest staircase fitting below. A finer partition can only make smaller and larger, squeezing them toward the "true" area.
If are partitions of and is bounded, then Moreover, if are any two partitions, .
The Riemann Integral
Let be bounded. We say is Riemann integrable on iff and this common value is called the Riemann integral of over , denoted
Intuition: The upper integral is the best upper estimate for the area under ; the lower integral is the best lower estimate. When they agree, there is a unique sensible notion of area, and we call the common value the Riemann integral. From the lemma above, we always have lower integral upper integral.
A bounded function is Riemann integrable if and only if for every there exists a partition of with
Intuition: Riemann integrability is equivalent to the gap between upper and lower sums being controllable. This "squeeze" criterion is the workhorse for proving integrability in practice.
Basic Properties
Let be Riemann integrable, , and .
(1) Linearity: is Riemann integrable and
(2) Monotonicity: If pointwise, then .
(3) Additivity over intervals: is Riemann integrable on and on , and
(4) Absolute integrability: is Riemann integrable and .
Intuition: These properties are the bedrock of calculus. Linearity and monotonicity follow directly from the definition of Darboux sums; additivity is proved by concatenating partitions on and ; and absolute integrability uses to control oscillations of by oscillations of .
Let be the Dirichlet function if , otherwise. For any partition , every subinterval contains both rationals and irrationals, so and for every . Therefore and for every partition . The upper integral is and the lower integral is ; they do not agree, so is not Riemann integrable.
Integrating on Bounded Subsets and Jordan Content
We extend the Riemann integral to bounded subsets by zero-extension.
Let be bounded, bounded. Let and define If is Riemann integrable on , we define This definition is independent of the choice of containing .
Intuition: To integrate on a general bounded set, we set to be zero outside and integrate over any enclosing interval. The choice of enclosing interval does not matter because extending the partition outside only adds zero contributions.
Let be bounded. Consider the indicator function We say is Jordan measurable (or has Jordan content) iff exists (equivalently, is Riemann integrable on any enclosing interval). In that case, the value is called the Jordan content of .
Intuition: Jordan content is the Riemann theorist's notion of "length" (or "area" in higher dimensions). A set is Jordan measurable exactly when its characteristic function is Riemann integrable, i.e. when upper and lower sums of can be made arbitrarily close. Intervals have Jordan content ; finite sets have Jordan content .
The set does not have Jordan content. Indeed, is the Dirichlet function restricted to , which is not Riemann integrable by the previous example. Heuristically, is "too irregular" for Jordan's theory of length, despite being countable.
A bounded set has Jordan content zero if and only if for every there is a finite collection of intervals with
Intuition: Jordan content zero sets can be covered by a finite collection of intervals of arbitrarily small total length. This finiteness is the crucial limitation: it rules out sets like , which would require countably (not finitely) many intervals.
Sets of Lebesgue Measure Zero
Relaxing "finite cover" to "countable cover" gives a much more permissive notion.
A set has zero Lebesgue measure iff for every there exists a sequence of open intervals , , such that
Intuition: A set has zero Lebesgue measure if it can be covered by a countable collection of intervals of arbitrarily small total length. The difference from Jordan content zero is that we now allow infinitely many intervals, which is far more flexible: countable sets always have Lebesgue measure zero, even when they have no Jordan content.
The set has zero Lebesgue measure. Enumerate and given , for each let Then and . So has zero Lebesgue measure even though it has no Jordan content.
Jordan content zero implies zero Lebesgue measure. If has Jordan content zero, then the finite interval covers in the Jordan definition form (in particular) countable covers, so has zero Lebesgue measure. The converse is false in general, as shows. However, for compact sets the two notions coincide: a compact set has Jordan content zero iff it has zero Lebesgue measure.
A countable union of sets of zero Lebesgue measure has zero Lebesgue measure.
Intuition: This is the first essential advantage of Lebesgue measure over Jordan content: Jordan content is not closed under countable unions. Countable additivity of Lebesgue measure is what will eventually make the Lebesgue integral so much more flexible.
Lebesgue's Criterion for Riemann Integrability
The notion of Lebesgue measure zero has a stunning consequence in Riemann theory.
Let be bounded. Then is Riemann integrable on if and only if the set of points where is discontinuous has zero Lebesgue measure.
Intuition: This is the cleanest possible description of which bounded functions are Riemann integrable. Continuity is not necessary; being almost everywhere continuous — with the "bad" set being negligible in the measure sense — is exactly what's needed. For example, the Dirichlet function is discontinuous on all of , which is a set of Lebesgue measure , so it is not Riemann integrable. A bounded function continuous except on a countable set is Riemann integrable.
Let be a bounded set. Then has Jordan content if and only if its topological boundary has zero Lebesgue measure.
Let be Jordan measurable and bounded. Then is Riemann integrable on if and only if is continuous almost everywhere on the interior of (i.e. the set of points in where is discontinuous has zero Lebesgue measure).
Sigma-Algebras and Measurable Spaces
To develop measure theory properly we need to decide which subsets of a set we are allowed to assign a "measure" to. The right structure is a -algebra.
Let be a set, and write for the power set of (all subsets). A collection is a -algebra iff
(1) and .
(2) If , then .
(3) If for every , then .
If is a -algebra of subsets of , the pair is called a measurable space and sets in are called -measurable (or simply measurable).
Intuition: A -algebra is a collection of sets that is closed under complementation and countable unions. Together with , this closure makes it stable under every countable set-theoretic operation: finite and countable intersections, differences, symmetric differences, limsup/liminf of sequences of sets, and so on. It is exactly the minimal structure needed to consistently assign "sizes" via countable operations.
If is a -algebra and for every , then .
Generation of Sigma-Algebras
Let be an arbitrary family of -algebras of subsets of . Then is a -algebra.
Let . The -algebra generated by , denoted , is the intersection of all -algebras of subsets of that contain :
Intuition: is the smallest -algebra containing . It exists because itself is always a -algebra containing , so the collection being intersected is nonempty, and the intersection is a -algebra by the previous theorem.
Borel Sets on
Let be a topological space. The Borel -algebra on is , the -algebra generated by the open sets. Sets in are called Borel sets.
Intuition: Borel sets are the sets you can build from open sets by applying countable unions, countable intersections, and complements (possibly repeatedly). Every open set is Borel, every closed set is Borel, every countable union of closed sets (an ) is Borel, every countable intersection of open sets (a ) is Borel, and so on. For practical purposes, every set you can "describe reasonably" on is Borel.
On with its Euclidean topology , the Borel -algebra coincides with the -algebra generated by each of the following collections: That is, .
Intuition: Any of these natural collections of intervals generates the Borel -algebra. The essential observation is that open intervals, closed intervals, and half-open intervals can all be built from one another using countable operations (e.g. and ).
Measures
Given a measurable space , a measure on is a function such that
(1) for at least one .
(2) Countable additivity: If is a sequence of pairwise disjoint sets in , then
In this case we call a measure space. If in addition , is a probability measure and is a probability space.
Intuition: A measure is a consistent way of assigning a nonnegative size to each measurable set, respecting the fact that disjoint pieces should have sizes that add. Countable additivity is the key axiom: it handles limit operations correctly. From countable additivity one derives (using any with and the disjoint union ).
Let be a measure space.
(1) Monotonicity: If with , then . Moreover if , .
(2) Countable subadditivity: If for , then .
(3) Continuity from below: If and for every , then
(4) Continuity from above: If and for every , and there exists with , then
Outer Measures and Caratheodory's Construction
The axiomatic definition of measure leaves open the question of existence. A standard route is to first build an outer measure defined on all subsets of , then restrict to a carefully chosen sub--algebra on which the outer measure is actually countably additive.
Given a set , an outer measure on is a function such that
(1) .
(2) Monotonicity: if , then .
(3) Countable subadditivity: if for , then
Intuition: An outer measure is a set function defined on every subset that is monotone and subadditive over countable unions. It is not a measure in general — there is no additivity, only subadditivity. The point is that outer measures are easy to construct, and we can restrict to measurable sets to recover genuine additivity.
Constructing Outer Measures
A collection is a sequential covering class of iff
(1) .
(2) For every , there exists a sequence with such that .
is a sequential covering class of : , and any is covered by , a countable union of sets in .
Let be a sequential covering class of and with . Define, for every , Then is an outer measure on .
Take , , and . The associated outer measure is called the Lebesgue outer measure on .
Intuition: The Lebesgue outer measure of is the infimum of total lengths of countable covers of by half-open intervals. This is exactly the definition of "zero Lebesgue measure" from earlier — that case corresponds to .
Caratheodory's Criterion
Let be an outer measure on . A set is -measurable (or Caratheodory measurable) iff for every ,
Intuition: The Caratheodory criterion says that cleanly "splits" every test set into two pieces whose outer measures add. By countable subadditivity, the inequality is automatic; the content is the reverse inequality. The criterion singles out sets that are nice enough for additivity to hold.
Let be an outer measure on . The collection is a -algebra of subsets of , and the restriction is a measure on .
Intuition: Caratheodory's theorem is the engine that produces measures out of outer measures. Starting from any outer measure, we obtain a -algebra (automatically closed under all countable operations) and a genuine measure on it. The trade-off is that may be smaller than : not every set is Caratheodory measurable.
Lebesgue Measure on
Let be the Lebesgue outer measure on . The -algebra of all -measurable sets is called the Lebesgue -algebra on . The measure is called the Lebesgue measure on .
Intuition: Lebesgue measure is the unique "natural" extension of length from intervals to a large class of subsets of , obtained by combining the covering-class outer measure construction with the Caratheodory restriction. It is countably additive, assigns length to the interval , and contains every Borel set.
Let denote Lebesgue measure on , the Lebesgue -algebra. Then
(1) Extension of length: for all .
(2) Borel sets are measurable: . In particular, all open and closed sets are Lebesgue measurable.
(3) Countable additivity: For any sequence of pairwise disjoint sets in , .
(4) Translation invariance: For any and , the translate is in and .
(5) Null sets: A set has zero Lebesgue measure (in the sense of the earlier definition) iff and .
Intuition: These are the basic features that justify calling "length": it agrees with usual length on intervals, it contains all Borel sets (so in particular every set we can "describe"), it is countably additive, and it is invariant under translation — the geometry of the real line is respected. Countable additivity makes it vastly more flexible than Jordan content, and countable unions of null sets remain null, which is the technical underpinning of the Lebesgue integral.
Why not just use Borel sets? The Lebesgue -algebra is strictly larger than the Borel -algebra; it contains, for instance, every subset of a Lebesgue null Borel set (such subsets need not be Borel). This completeness property is useful because it implies that changing an integrable function on a null set does not affect measurability or integrability.
Lebesgue Measure Agrees with Length
For every , In particular, as well.
Intuition: Covering by itself gives . The nontrivial direction is that every countable cover of by half-open intervals has total length ; this is proved by a compactness argument (reduce to finite covers using Heine–Borel after a small -enlargement to open intervals, then add up lengths along the interval). So the outer measure truly extends length.
Every Borel subset of is Lebesgue measurable, i.e. .
Intuition: Since the Lebesgue -algebra is a -algebra that contains every interval (via the Caratheodory criterion applied to half-open intervals), it must contain the -algebra generated by intervals — which is exactly . Hence open sets, closed sets, 's, 's, and so on, are all Lebesgue measurable.
Approximation by Open and Closed Sets
Although sets in can be extremely irregular, their Lebesgue measure is well-approximated by nicer sets.
A set is Lebesgue measurable if and only if for every there exists an open set with , which happens if and only if for every there exists a closed set with .
Intuition: Measurability is exactly the same as "approximable above by an open set and below by a closed set, to arbitrary precision." This is the technical underpinning of many constructions in integration: we can always replace a measurable set by an open or closed approximation and control the error by the measure of the symmetric difference.
Existence of Non-Measurable Sets
There exists a subset of that is not Lebesgue measurable, i.e. .
Intuition: The explicit construction (due to Vitali) uses the axiom of choice to pick one representative from each coset of inside . The resulting "Vitali set" cannot be assigned a Lebesgue measure consistent with translation invariance and countable additivity. The proof is outside the main development, but the take-away is important: extending "length" to every subset of in a way compatible with countable additivity and translation invariance is impossible. Lebesgue's -algebra is as large as we can hope for without abandoning these properties.
Simple Functions and the Lebesgue Integral
Now that we have the Lebesgue measure on , we can begin to build the Lebesgue integral. The simplest functions to integrate are those that take only finitely many values.
A function is a simple function iff its range is a finite set and, for each , the level set is Lebesgue measurable. Equivalently, with the measurable and pairwise disjoint.
Intuition: Simple functions are the measure-theoretic analogue of step functions, with arbitrary measurable level sets replacing intervals. Their integral is defined by "area = value times measure of level set," summed over level sets. Every measurable function is a monotone limit of simple functions, so Lebesgue integration of general measurable functions is built by extending this definition through limits.
Let be a simple function with pairwise disjoint. The Lebesgue integral of (with respect to Lebesgue measure ) is with the convention .
Intuition: This mirrors the Riemann sum , but instead of partitioning the domain into intervals, we partition the range into level values and weight each by the measure of the preimage. This "horizontal slicing" is what makes Lebesgue integration so flexible: it makes sense whenever the preimages are measurable, regardless of how wildly they are scattered in the domain.
If has , the indicator is simple with range and For instance, .
Let be the Dirichlet function from earlier: if , otherwise. Since is countable, it is Lebesgue measurable with measure zero, so is a simple function. By the definition, So the Dirichlet function, which fails to be Riemann integrable, has Lebesgue integral equal to — matching the intuition that the function is "essentially zero" (differing from zero only on a null set).
Intuition: This example is the cleanest illustration of the superiority of the Lebesgue theory. Changing an integrable function on a null set does not affect its Lebesgue integral; in particular, functions that agree almost everywhere have the same integral. Riemann integration lacks this property because the Riemann integral is too closely tied to the continuity of the function at individual points rather than to its global behaviour on the level sets.
Beyond simple functions. The Lebesgue integral can be extended to a much wider class of functions — the measurable functions — by approximating them as monotone (or dominated) limits of simple functions and taking the limits of the corresponding integrals. This is developed in the standard reference (Terrell, Chapter 17), and yields the full-fledged Lebesgue integral, along with its powerful convergence theorems (monotone convergence, dominated convergence, Fatou's lemma). For the purposes of this course, the key ideas are: (i) what measurable sets are, (ii) that Lebesgue measure genuinely extends length, and (iii) that simple functions can already integrate examples out of reach of the Riemann theory.