PAC Learning #

This file defines the Probably Approximately Correct (PAC) learning model introduced by Valiant [Val84], generalized to an arbitrary label type β and parameterized by a family of distributions 𝒟 on α × β.

A concept class C over domain α with labels in β is a collection of functions α → β. A learning algorithm receives a labeled sample drawn i.i.d. from an unknown joint distribution D on α × β and must produce a hypothesis whose 0-1 error is within ε of the best concept in C, with probability at least 1 - δ.
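
The guarantee described above can be written as a single inequality. This is a sketch in conventional notation rather than the file's literal statement: A stands for the learner, m for the sample size, and err_D is shorthand introduced here for the 0-1 error. Whether the file states the inner bound as strict or non-strict is not specified in this overview.

```latex
\operatorname{err}_D(h) = D\bigl(\{(x, y) : h(x) \neq y\}\bigr),
\qquad
\Pr_{S \sim D^m}\Bigl[\operatorname{err}_D\bigl(A(S)\bigr)
  \le \inf_{c \in C} \operatorname{err}_D(c) + \varepsilon\Bigr] \ge 1 - \delta .
```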

The single definition IsPACLearnerFor captures the realizable, agnostic, and noise-tolerant settings by varying the distribution family 𝒟:

Agnostic [Hau92]: 𝒟 = Set.univ, so the learner must work for all distributions.
Realizable: 𝒟 consists of pushforwards of arbitrary probability measures P on α along the graph x ↦ (x, c x) of some concept c ∈ C, so that optimalError D C = 0.
Noise-tolerant [AL88]: 𝒟 consists of noisy versions of realizable distributions, where each label is corrupted independently with some probability η.
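
As an illustration, the agnostic and realizable families might be written against Mathlib roughly as follows. The names agnosticFamily' and realizableFamily' are hypothetical and do not come from this file, and modelling distributions as Measure (α × β) is an assumption; the file may use a bundled probability-measure type instead. The noise-tolerant family is omitted because its exact noise model is not spelled out here.

```lean
import Mathlib

open MeasureTheory

/-- Hypothetical sketch: the agnostic setting places no restriction on `D`. -/
def agnosticFamily' (α β : Type*) [MeasurableSpace α] [MeasurableSpace β] :
    Set (Measure (α × β)) :=
  Set.univ

/-- Hypothetical sketch: `D` is the pushforward of a probability measure `P` on `α`
along the graph `x ↦ (x, c x)` of some concept `c ∈ C`. -/
def realizableFamily' {α β : Type*} [MeasurableSpace α] [MeasurableSpace β]
    (C : Set (α → β)) : Set (Measure (α × β)) :=
  {D | ∃ c ∈ C, ∃ P : Measure α, IsProbabilityMeasure P ∧ D = P.map (fun x => (x, c x))}
```

Passing a smaller family to IsPACLearnerFor weakens the requirement on the learner, which is why the realizable setting is the easiest of the three.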

The accuracy and confidence parameters ε and δ are elements of the subtype Set.Ioo (0 : ℝ≥0) 1, which bundles the value together with the proof that it lies in the open interval (0, 1), ensuring the learning condition is non-vacuous.
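
A small sketch of how the bundled bounds can be recovered from such a parameter, using only Mathlib's Set.Ioo and the ℝ≥0 notation. The examples are illustrative and are not declarations from this file.

```lean
import Mathlib

open scoped NNReal

-- The underlying value is obtained by coercion; the membership proof `ε.2`
-- unfolds to the conjunction `0 < ε ∧ ε < 1`.
example (ε : Set.Ioo (0 : ℝ≥0) 1) : 0 < (ε : ℝ≥0) := ε.2.1

example (ε : Set.Ioo (0 : ℝ≥0) 1) : (ε : ℝ≥0) < 1 := ε.2.2
```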

All declarations live under the Cslib.MachineLearning.PACLearning namespace so that generic names like error and optimalError do not pollute the parent namespace.

Main definitions #

ConceptClass: a set of functions α → β (classifiers).
LabeledSample: a finite sequence of (point, label) pairs.
Learner: a function from labeled samples to hypotheses.
error: the 0-1 error of a hypothesis under a joint distribution.
optimalError: the infimum of error over a concept class.
IsPACLearnerFor: deterministic (ε, δ)-PAC learner over a distribution family.
IsRPACLearnerFor: randomized variant of IsPACLearnerFor. Universe-polymorphic in the randomness space Ω : Type*.
IsPACLearnable: a concept class is PAC learnable if, for every ε, δ : Set.Ioo (0 : ℝ≥0) 1, there is some sample size m at which IsPACLearnerFor holds.
IsRPACLearnable: randomized variant of IsPACLearnable. Pins the randomness space to Type 0; IsRPACLearnerFor itself remains universe-polymorphic for users who need it.
LearnerModel: the common predicate shape ℕ → ε → δ → C → 𝒟 → Prop abstracting both the deterministic and randomized learners so sample-complexity lemmas can be shared.
sampleComplexity: sample complexity of a generic learner model.
rsampleComplexity: randomized sample complexity, i.e. sampleComplexity IsRPACLearnerFor.
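
To make the first few entries concrete, here is a minimal sketch of how error and optimalError could be defined on top of Mathlib. The primed names, the choice of Measure (α × β) for distributions, and the ℝ≥0∞ codomain are all assumptions made for illustration; the file's actual signatures may differ.

```lean
import Mathlib

open MeasureTheory
open scoped ENNReal

noncomputable section

variable {α β : Type*} [MeasurableSpace α] [MeasurableSpace β]

/-- Hypothetical sketch: the 0-1 error is the mass of misclassified pairs. -/
def error' (D : Measure (α × β)) (h : α → β) : ℝ≥0∞ :=
  D {p | h p.1 ≠ p.2}

/-- Hypothetical sketch: the best error achievable within the class `C`. -/
def optimalError' (D : Measure (α × β)) (C : Set (α → β)) : ℝ≥0∞ :=
  ⨅ c ∈ C, error' D c

end
```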

Binary classification #

When β = Bool, concepts correspond to subsets of α. The section Binary Classification provides:

hypothesisError: the symmetric-difference error P(h ∆ c).
falsePositiveError, falseNegativeError: its decomposition.
hypothesisError_eq_add: the decomposition theorem.
error_map_eq_hypothesisError: bridge between the general error and the binary hypothesisError under a realizable distribution.
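
The decomposition behind hypothesisError_eq_add is the usual split of a symmetric difference into two disjoint pieces. In conventional notation, identifying h and c with subsets of α and assuming both are measurable (the exact hypotheses in the file may differ):

```latex
h \mathbin{\triangle} c = (h \setminus c) \,\sqcup\, (c \setminus h)
\quad\Longrightarrow\quad
P(h \mathbin{\triangle} c)
  = \underbrace{P(h \setminus c)}_{\text{false positives}}
  + \underbrace{P(c \setminus h)}_{\text{false negatives}} .
```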

Main statements #

IsPACLearnerFor.toIsRPACLearnerFor: every deterministic PAC learner is a randomized one (via the trivial randomness space PUnit).
IsPACLearnerFor.antitone_family, .antitone_C: the deterministic PAC learner predicate is antitone in the distribution family and concept class.
IsPACLearnerFor.mono_δ, .mono_ε: the predicate is monotone in the confidence and accuracy parameters: a learner for (ε, δ) is also a learner for any larger ε or δ, because the weaker guarantee still holds.
IsRPACLearnerFor.antitone_family, .mono_δ: analogues for the randomized predicate. (mono_ε and antitone_C are not provided because they change the integrand and would require an extra measurability assumption.)
IsPACLearnable.toIsRPACLearnable: deterministic learnability implies randomized.
IsPACLearnable.antitone_family, .antitone_C, IsRPACLearnable.antitone_family: PAC learnability is antitone in the distribution family and concept class.
sampleComplexity_antitone_δ, _antitone_ε, _mono_family, _mono_C: variation of deterministic sample complexity in confidence, accuracy, distribution family, and concept class (antitone in the numeric parameters, monotone under ⊆ in the set parameters). The randomized analogues rsampleComplexity_antitone_δ and _mono_family are provided.
IsPACLearnable.sampleComplexity_*, IsRPACLearnable.rsampleComplexity_*: the same monotonicity facts phrased with a learnability hypothesis in place of the ad-hoc ∃ m, IsPACLearnerFor m … existence witness, so callers who already know the class is learnable need not thread it through.
hypothesisError_eq_add: total error = false positive + false negative.
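
The direction of antitone_C can be checked by hand. The sketch below assumes the PAC condition has the form described in the introduction (error at most the optimal error plus ε, with probability at least 1 - δ) and writes opt_D and err_D as shorthand for optimalError and error; it outlines the idea and is not the file's proof.

```latex
C' \subseteq C
\;\Longrightarrow\;
\operatorname{opt}_D(C) = \inf_{c \in C} \operatorname{err}_D(c)
  \le \inf_{c \in C'} \operatorname{err}_D(c) = \operatorname{opt}_D(C'),
\\[4pt]
\bigl\{ S : \operatorname{err}_D(A(S)) \le \operatorname{opt}_D(C) + \varepsilon \bigr\}
\subseteq
\bigl\{ S : \operatorname{err}_D(A(S)) \le \operatorname{opt}_D(C') + \varepsilon \bigr\}.
```

The success event for C is contained in the success event for C', so a learner for the larger class at sample size m is also a learner for the smaller class at the same m. Read in terms of sample complexity, this is the monotonicity under ⊆ recorded by sampleComplexity_mono_C.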

References #

Core Definitions #

PAC Learners #

PAC Learnability #

Sample Complexity #

Monotonicity of Sample Complexity #

Binary Classification #