Matching the Paper's Model¶
First, I will focus on what exists in the paper to attempt to structure this code arround.
Equations¶
Below are a list of the equations in the model. Each of these equations are from the origional paper, and the variable explainations were automatically generated by an AI refereancing the paper. I have corrected any equation explainations that I noticed an error in.
Equation 1¶
Bayes' rule for updating beliefs over hypotheses after observing data.
| Variable | Description |
|---|---|
| \(h\) | A hypothesis — a candidate decision rule (defined by a boundary orientation and position) about how to classify observations |
| \(D\) | The observed data collected so far |
| \(p(h)\) | The prior probability of hypothesis \(h\) before seeing any data |
| \(p(D \mid h)\) | The likelihood of observing data \(D\) given that hypothesis \(h\) is true |
| \(p(h \mid D)\) | The posterior belief in hypothesis \(h\) after observing \(D\); its magnitude reflects the relative strength of belief that this hypothesis is correct |
Equation 2¶
The probability that stimulus \(x^t\) belongs to Category A, given a linear decision boundary defined by \(\theta\), \(b\), and \(\sigma\). Because classification is binary, \(P(x^t = B \mid \theta, b, \sigma) = 1 - P(x^t = A \mid \theta, b, \sigma)\).
| Variable | Description |
|---|---|
| \(x^t\) | The stimulus (observation) on trial \(t\), a point in the 2D stimulus space |
| \(x_1^t\) | The value of stimulus \(x^t\) on dimension 1 (e.g., antenna length) on trial \(t\) |
| \(x_2^t\) | The value of stimulus \(x^t\) on dimension 2 (e.g., antenna angle) on trial \(t\) |
| \(A\) | Category A — one of the two binary classification outcomes |
| \(\theta\) | The angle of the decision boundary vector in the stimulus space; controls its orientation (\(\theta = 0\) gives a vertical boundary aligned with one axis) |
| \(b\) | The bias term; the offset of the decision boundary from the center of the stimulus space |
| \(\sigma\) | The determinism of the boundary; controls the sharpness of the sigmoidal response (higher \(\sigma\) = sharper boundary) |
Equation 3¶
Converts a boundary angle \(\theta\) into a normalized distance from the nearest axis-aligned rule, used to define the prior over boundary orientations.
| Variable | Description |
|---|---|
| \(r\) | The relative distance of the decision boundary from the nearest orthogonal (axis-aligned) direction; bounded between 0 and 1, where 0 means the boundary is perfectly axis-aligned |
| \(\theta\) | The angle of the decision boundary in the stimulus space |
| \(\bmod\) | The arithmetic modulo operator |
Equation 4¶
The prior distribution over \(r\), which encodes the learner's bias toward axis-aligned (unidimensional) decision boundaries.
| Variable | Description |
|---|---|
| \(r\) | The relative distance from the nearest axis-aligned boundary (same as Equation 3) |
| \(\alpha\) | The shape parameter of the Beta distribution; acts as an abstract weight controlling preference for unidimensional rules. When \(\alpha \ll 1\), the prior strongly favors axis-aligned rules; when \(\alpha = 1\), all orientations are equally likely |
Equation 5¶
Generates a new hypothesis angle \(\theta\) from a randomly chosen quadrant and a sampled relative distance \(r\).
| Variable | Description |
|---|---|
| \(\theta\) | The newly generated decision boundary angle in the stimulus space |
| \(k\) | An integer between 0 and 3 specifying which quadrant (i.e., which axis-aligned reference direction) the new hypothesis is drawn relative to |
| \(r\) | The relative distance from the nearest orthogonal axis, sampled from \(\text{Beta}(\alpha, \alpha)\) per Equation 4 |
Equation 6¶
The hypothesis-dependent sampling procedure — how a selection learner generates a new stimulus to query, biased toward the region near their current decision boundary.
| Variable | Description |
|---|---|
| \(\mathbf{x}\) | The new stimulus selected by the learner for the next trial |
| \(d_1\) | The coordinate in the direction orthogonal to the current decision boundary; sampled from a Normal distribution centered on the boundary |
| \(d_2\) | The coordinate in the direction parallel to the current decision boundary; sampled uniformly so that selections are spread along the boundary |
| \(b^t\) | The bias term of the current hypothesis on trial \(t\); serves as the mean of the normal distribution, anchoring selections to the current boundary location |
| \(S\) | The sampling spread parameter; controls the average orthogonal distance of new selections from the current boundary. Larger \(S\) = weaker bias (selections further from the boundary); smaller \(S\) = stronger bias (selections tightly clustered on the boundary) |
| \(\mathbf{u}_1\) | Unit vector orthogonal to the current decision boundary |
| \(\mathbf{u}_2\) | Unit vector parallel to the current decision boundary |