Add HOW_TO_MATHS.md: guide to reading the mathematical notation

Explains every symbol, operator, and proof technique used in the paper, assuming no math background beyond basic arithmetic. Covers summation notation, fractions, subscripts/superscripts, Greek letters, set notation, piecewise functions, limits, correlation, variance, mutual information, and the exchange argument proof technique. Includes a worked walkthrough of Theorem 1's proof step-by-step, translating each mathematical statement into plain English. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-29 00:44:52 -04:00
parent b6d42d3237
commit 2108789956
1 changed files with 365 additions and 0 deletions
@@ -0,0 +1,365 @@
+# How to Read the Mathematics in This Paper
+
+This guide explains every piece of mathematical notation used in the paper,
+in the order you will encounter it. No prior math background beyond basic
+arithmetic is assumed.
+
+---
+
+## The Basics
+
+### Variables and Subscripts
+
+A variable is a letter that stands for a number. In this paper:
+
+- $n$ = the total number of tasks
+- $p_i$ = the processing time (how many hours of work) for task $i$
+- $C_i$ = the completion time (when task $i$ is finished) under a given schedule
+- $q_i$ = the priority class of task $i$ (1 = Critical, 4 = Low)
+- $w(q)$ = the weight assigned to priority class $q$
+
+The subscript $i$ is just a label. $p_1$ means "the processing time of
+task 1," $p_2$ means "the processing time of task 2," and so on. When we
+write $p_i$, we mean "the processing time of task $i$, for any $i$."
+
+### Lists of Things
+
+$p_1, p_2, \ldots, p_n$ means "a list of processing times, from task 1
+up to task $n$." The $\ldots$ just means "and so on." If there are 5 tasks,
+this is $p_1, p_2, p_3, p_4, p_5$.
+
+---
+
+## Symbols You Will See
+
+| Symbol | Name | What it means | Example |
+|--------|------|---------------|---------|
+| $=$ | Equals | Left side is the same as right side | $2 + 3 = 5$ |
+| $\ne$ | Not equal | These two things are different | $3 \ne 4$ |
+| $>$ | Greater than | Left is bigger | $5 > 3$ |
+| $<$ | Less than | Left is smaller | $3 < 5$ |
+| $\le$ | Less than or equal to | Left is smaller or the same | $3 \le 5$, $3 \le 3$ |
+| $\ge$ | Greater than or equal to | Left is bigger or the same | $5 \ge 3$, $5 \ge 5$ |
+| $\approx$ | Approximately equal | Close but not exact | $10/3 \approx 3.33$ |
+| $\cdot$ | Multiplication | Same as $\times$ | $3 \cdot 4 = 12$ |
+| $\perp$ | Independent of | No relationship between them | $p_i \perp q_i$ means task size and priority are unrelated |
+| $\forall$ | For all | "This is true for every..." | $S_i = S_j \; \forall \, i, j$ means "slowdown is equal for all tasks" |
+| $\in$ | Is a member of | Belongs to a set | $q_i \in \{1, 2, 3, 4\}$ means priority is one of 1, 2, 3, or 4 |
+| $\blacksquare$ | End of proof | "The proof is complete" | Appears at the end of every proof |
+
+---
+
+## The Big Sigma: Summation ($\sum$)
+
+This is the most important symbol in the paper. It means "add up a list
+of things."
+
+$$\sum_{i=1}^{n} p_i$$
+
+Read this as: "Add up $p_i$ for every $i$ from 1 to $n$."
+
+If there are 3 tasks with processing times $p_1 = 2$, $p_2 = 5$, $p_3 = 3$:
+
+$$\sum_{i=1}^{3} p_i = p_1 + p_2 + p_3 = 2 + 5 + 3 = 10$$
+
+The letter under the $\sum$ ($i = 1$) tells you where to start counting.
+The number on top ($n$ or $3$) tells you where to stop. The expression
+after the $\sum$ ($p_i$) tells you what to add up.
+
+### Variations You Will See
+
+**Summing a product:**
+
+$$\sum_{i=1}^{n} w(q_i) \cdot C_i$$
+
+Add up $w(q_i) \times C_i$ for each task. If task 1 has weight 8 and
+completion time 3, and task 2 has weight 1 and completion time 5:
+
+$$= 8 \cdot 3 + 1 \cdot 5 = 24 + 5 = 29$$
+
+**Summing with a condition:**
+
+$$\sum_{\substack{a \ne b}} p_a \, p_b$$
+
+Add up $p_a \times p_b$ for every pair of tasks $a$ and $b$ where $a$
+and $b$ are different tasks. The condition ($a \ne b$) filters which
+terms to include.
+
+**Summing up to a variable position:**
+
+$$\sum_{j=1}^{k} p_{\sigma(j)}$$
+
+Add up processing times for the first $k$ tasks in the schedule. This is
+how completion time is defined — it is the total work done up to and
+including position $k$.
+
+---
+
+## Fractions
+
+$$\frac{a}{b}$$
+
+This means $a \div b$. The top is the numerator, the bottom is the
+denominator. When you see:
+
+$$\bar{C}(\sigma) = \frac{1}{n} \sum_{k=1}^{n} C_{\sigma(k)}$$
+
+This means: add up all the completion times, then divide by $n$ (the
+number of tasks). This is an **average** — the same operation as
+calculating a mean in everyday life.
+
+### Nested Fractions
+
+$$\bar{C}_w(\sigma) = \frac{\sum_{k=1}^{n} p_{\sigma(k)} \cdot C_{\sigma(k)}}{\sum_{k=1}^{n} p_{\sigma(k)}}$$
+
+The top (numerator) is: add up (processing time $\times$ completion time)
+for each task. The bottom (denominator) is: add up all the processing
+times. This is a **weighted average** — tasks with more work count more
+heavily.
+
+---
+
+## Schedules and Permutations ($\sigma$)
+
+$\sigma$ (the Greek letter "sigma") represents a **schedule** — the order
+in which tasks are done.
+
+If you have 3 tasks and decide to do them in order 2, 3, 1, then:
+- $\sigma(1) = 2$ — the first task you do is task 2
+- $\sigma(2) = 3$ — the second task you do is task 3
+- $\sigma(3) = 1$ — the third task you do is task 1
+
+When you see $p_{\sigma(k)}$, it means "the processing time of whichever
+task is in position $k$ of the schedule."
+
+### Ordering Symbols
+
+- $b \preceq_\sigma a$ means "task $b$ is scheduled at the same time as
+  or before task $a$" — i.e., $b$ comes first (or they are the same task)
+- $a \prec_\sigma b$ means "task $a$ is scheduled strictly before task $b$"
+
+These are just ways of saying "which task comes first in the schedule."
+
+---
+
+## The Bar: Averages ($\bar{C}$)
+
+A bar over a letter means "average." So:
+
+- $\bar{C}$ = average completion time
+- $\bar{C}_w$ = weighted average completion time (the subscript $w$ distinguishes it)
+- $\bar{p}$ = average processing time
+
+---
+
+## The Delta: Change ($\Delta$)
+
+$\Delta$ (capital delta) means "the change in" or "the difference."
+
+- $\Delta_i = C_i - p_i$ — the delay for task $i$ (how much longer it
+  waited beyond its own processing time)
+- $\Delta D = w(q_j) \cdot p_i - w(q_i) \cdot p_j$ — how much the
+  priority-weighted delay cost changes when you swap two tasks
+
+---
+
+## Subscripts and Superscripts
+
+**Subscripts** (below) are labels or indices:
+- $p_i$ = processing time of task $i$
+- $p_{\max}$ = the largest processing time
+- $p_{\min}$ = the smallest processing time
+- $\bar{C}_{\text{SPT}}$ = average completion time under the SPT schedule
+- $\bar{C}_{\text{priority}}$ = average completion time under priority scheduling
+
+**Superscripts** (above) indicate context or power:
+- $p_a^2$ = $p_a \times p_a$ (squared)
+- $\bar{C}^{\text{SPT}}$ = completion time under SPT (same idea as subscript, just different formatting)
+
+---
+
+## Set Notation
+
+$$q_i \in \{1, 2, 3, 4\}$$
+
+The curly braces $\{ \}$ define a **set** — a collection of possible
+values. $\in$ means "is a member of." So this reads: "the priority
+$q_i$ is one of the values 1, 2, 3, or 4."
+
+---
+
+## The Piecewise Function (Cases)
+
+$$w(q) = \begin{cases} 8 & q = 1 \text{ (Critical)} \\ 4 & q = 2 \text{ (High)} \\ 2 & q = 3 \text{ (Medium)} \\ 1 & q = 4 \text{ (Low)} \end{cases}$$
+
+This defines a function that gives different outputs depending on the
+input. Read it as: "If $q$ is 1, then $w$ is 8. If $q$ is 2, then $w$
+is 4." And so on. It is just a lookup table written in math notation.
+
+---
+
+## Limits and Convergence
+
+$$\lim_{T \to \infty} \frac{W(T)}{T} = \mu$$
+
+Read as: "As $T$ gets larger and larger (approaching infinity), the value
+of $W(T) / T$ gets closer and closer to $\mu$." In this paper, it means:
+over a long enough time horizon, the throughput settles to a fixed rate
+$\mu$ regardless of scheduling order.
+
+- $\to$ means "approaches" or "goes toward"
+- $\infty$ means infinity (without bound)
+
+---
+
+## Mutual Information ($I$)
+
+$$I(\sigma_{\text{SPT}}) = 0 \quad \text{when } p_i \perp q_i$$
+
+$I$ here measures **how much knowing one thing tells you about another**.
+$I = 0$ means "knowing a task's position in the SPT schedule tells you
+absolutely nothing about its priority." The schedule and the priority
+system are statistically independent — they contain no information about
+each other.
+
+---
+
+## Functions
+
+$f(\bar{C})$ means "some function of $\bar{C}$." A function takes an
+input and produces an output. When the paper writes:
+
+$$U_{\text{client}} = f\!\left(\bar{C}(\sigma)\right), \quad f' < 0$$
+
+It means: client satisfaction ($U$) depends on the average completion
+time, and $f' < 0$ means the function is **decreasing** — when the
+average goes up, satisfaction goes down. The $'$ (prime) notation refers
+to the derivative, which indicates the direction of change.
+
+---
+
+## Correlation ($\text{Corr}$)
+
+$$\text{Corr}(p_i, q_i) \approx 0$$
+
+Correlation measures whether two quantities move together. A correlation
+of 0 means they are unrelated — knowing the size of a task tells you
+nothing about its priority. A positive correlation means they increase
+together; a negative correlation means one goes up when the other goes
+down.
+
+---
+
+## Variance ($\text{Var}$) and Standard Deviation ($\sigma_p$)
+
+Variance measures how spread out a set of numbers is. If all tasks take
+about the same time, variance is low. If some are 15 minutes and others
+are 40 hours, variance is high.
+
+The **coefficient of variation** $CV = \sigma_p / \bar{p}$ normalizes
+the spread by dividing by the average. Here $\sigma_p$ is the standard
+deviation (the square root of variance) of the processing times, and
+$\bar{p}$ is the mean. A $CV$ above 1 means the spread is wider than
+the average — the data is highly variable.
+
+(Note: $\sigma_p$ uses the same Greek letter as schedule $\sigma$, but
+they are different things. Context tells you which is which — $\sigma_p$
+with a subscript $p$ always means standard deviation of processing times;
+$\sigma$ alone or $\sigma(k)$ always means a schedule.)
+
+---
+
+## How to Read the Proofs
+
+### The Exchange Argument
+
+The most common proof technique in this paper works like this:
+
+1. Assume you have any schedule.
+2. Find two adjacent tasks that are in the "wrong" order.
+3. Swap them and show that the metric improves (or stays the same).
+4. Conclude: since every "wrong" pair can be improved by swapping,
+   the best schedule has no "wrong" pairs — i.e., everything is in
+   the "right" order.
+
+This is like proving that a sorted list is optimal by showing that
+any out-of-order pair can be fixed by swapping, and each swap makes
+things better.
+
+### Reading a Proof Step by Step
+
+Take the simplest proof in the paper (Theorem 1):
+
+> *"Consider any schedule in which two adjacent tasks $i, j$ satisfy*
+> *$p_i > p_j$ with task $i$ scheduled immediately before task $j$."*
+
+Translation: Pick any two neighboring tasks where the bigger one comes
+first.
+
+> *"Let $t$ be the start time of task $i$."*
+
+Translation: Call the clock time when we start working on task $i$ by
+the name $t$. We don't need to know what $t$ actually is.
+
+> *"The change in the sum of completion times is:*
+> *$(2p_i + p_j) - (p_i + 2p_j) = p_i - p_j > 0$"*
+
+Translation: When the bigger task comes first, the total completion time
+is $p_i - p_j$ hours **more** than if the smaller task came first. Since
+$p_i > p_j$, this difference is positive — swapping them makes it better.
+
+> *"Therefore SPT uniquely minimizes $\bar{C}(\sigma)$."* $\blacksquare$
+
+Translation: Since every swap of big-before-small improves the metric,
+the best possible schedule is all tasks sorted from smallest to largest.
+Proof complete.
+
+---
+
+## Quick Reference Card
+
+| When you see... | It means... |
+|----------------|-------------|
+| $\sum_{i=1}^{n}$ | Add up for each task from 1 to $n$ |
+| $\frac{a}{b}$ | $a$ divided by $b$ |
+| $\bar{X}$ | The average of $X$ |
+| $\Delta X$ | The change in $X$ |
+| $\sigma$ | A schedule (the order tasks are done) |
+| $\sigma(k)$ | Which task is in position $k$ |
+| $p_i$ | How long task $i$ takes |
+| $C_i$ | When task $i$ finishes |
+| $q_i$ | Priority class of task $i$ |
+| $w(q_i)$ | How much priority class $q_i$ is weighted |
+| $\le, \ge$ | Less/greater than or equal to |
+| $\ne$ | Not equal to |
+| $\in$ | Is a member of |
+| $\forall$ | For all |
+| $\perp$ | Independent of |
+| $\blacksquare$ | End of proof |
+
+---
+
+## A Note on Reading Math
+
+Mathematical notation is a compression format. The formula:
+
+$$\bar{C}(\sigma) = \frac{1}{n} \sum_{k=1}^{n} C_{\sigma(k)}$$
+
+says in 15 characters what takes a full sentence in English: "The average
+completion time under schedule $\sigma$ equals the sum of all individual
+completion times divided by the number of tasks."
+
+If a formula looks intimidating, break it apart:
+1. Find the main operation (usually $=$ splitting left from right)
+2. Identify the pieces on each side
+3. Replace the symbols with words using the tables above
+4. Read it as a sentence
+
+Every formula in this paper can be understood this way. The math is not
+doing anything more complicated than arithmetic — it is just doing it
+for a general case rather than specific numbers.
+
+---
+
+*This guide accompanies the paper "Unweighted Average Completion Time
+Is Not a Fair Metric for Task Scheduling" (2026-03-28).*