Explains every symbol, operator, and proof technique used in the paper, assuming no math background beyond basic arithmetic. Covers summation notation, fractions, subscripts/superscripts, Greek letters, set notation, piecewise functions, limits, correlation, variance, mutual information, and the exchange argument proof technique. Includes a worked walkthrough of Theorem 1's proof step-by-step, translating each mathematical statement into plain English. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
12 KiB
How to Read the Mathematics in This Paper
This guide explains every piece of mathematical notation used in the paper, in the order you will encounter it. No prior math background beyond basic arithmetic is assumed.
The Basics
Variables and Subscripts
A variable is a letter that stands for a number. In this paper:
n= the total number of tasksp_i= the processing time (how many hours of work) for taskiC_i= the completion time (when taskiis finished) under a given scheduleq_i= the priority class of taski(1 = Critical, 4 = Low)w(q)= the weight assigned to priority classq
The subscript i is just a label. p_1 means "the processing time of
task 1," p_2 means "the processing time of task 2," and so on. When we
write p_i, we mean "the processing time of task i, for any i."
Lists of Things
p_1, p_2, \ldots, p_n means "a list of processing times, from task 1
up to task n." The \ldots just means "and so on." If there are 5 tasks,
this is p_1, p_2, p_3, p_4, p_5.
Symbols You Will See
| Symbol | Name | What it means | Example |
|---|---|---|---|
= |
Equals | Left side is the same as right side | 2 + 3 = 5 |
\ne |
Not equal | These two things are different | 3 \ne 4 |
> |
Greater than | Left is bigger | 5 > 3 |
< |
Less than | Left is smaller | 3 < 5 |
\le |
Less than or equal to | Left is smaller or the same | 3 \le 5, 3 \le 3 |
\ge |
Greater than or equal to | Left is bigger or the same | 5 \ge 3, 5 \ge 5 |
\approx |
Approximately equal | Close but not exact | 10/3 \approx 3.33 |
\cdot |
Multiplication | Same as \times |
3 \cdot 4 = 12 |
\perp |
Independent of | No relationship between them | p_i \perp q_i means task size and priority are unrelated |
\forall |
For all | "This is true for every..." | S_i = S_j \; \forall \, i, j means "slowdown is equal for all tasks" |
\in |
Is a member of | Belongs to a set | q_i \in \{1, 2, 3, 4\} means priority is one of 1, 2, 3, or 4 |
\blacksquare |
End of proof | "The proof is complete" | Appears at the end of every proof |
The Big Sigma: Summation (\sum)
This is the most important symbol in the paper. It means "add up a list of things."
\sum_{i=1}^{n} p_i
Read this as: "Add up p_i for every i from 1 to n."
If there are 3 tasks with processing times p_1 = 2, p_2 = 5, p_3 = 3:
\sum_{i=1}^{3} p_i = p_1 + p_2 + p_3 = 2 + 5 + 3 = 10
The letter under the \sum (i = 1) tells you where to start counting.
The number on top (n or 3) tells you where to stop. The expression
after the \sum (p_i) tells you what to add up.
Variations You Will See
Summing a product:
\sum_{i=1}^{n} w(q_i) \cdot C_i
Add up w(q_i) \times C_i for each task. If task 1 has weight 8 and
completion time 3, and task 2 has weight 1 and completion time 5:
= 8 \cdot 3 + 1 \cdot 5 = 24 + 5 = 29
Summing with a condition:
\sum_{\substack{a \ne b}} p_a \, p_b
Add up p_a \times p_b for every pair of tasks a and b where a
and b are different tasks. The condition (a \ne b) filters which
terms to include.
Summing up to a variable position:
\sum_{j=1}^{k} p_{\sigma(j)}
Add up processing times for the first k tasks in the schedule. This is
how completion time is defined — it is the total work done up to and
including position k.
Fractions
\frac{a}{b}
This means a \div b. The top is the numerator, the bottom is the
denominator. When you see:
\bar{C}(\sigma) = \frac{1}{n} \sum_{k=1}^{n} C_{\sigma(k)}
This means: add up all the completion times, then divide by n (the
number of tasks). This is an average — the same operation as
calculating a mean in everyday life.
Nested Fractions
\bar{C}_w(\sigma) = \frac{\sum_{k=1}^{n} p_{\sigma(k)} \cdot C_{\sigma(k)}}{\sum_{k=1}^{n} p_{\sigma(k)}}
The top (numerator) is: add up (processing time \times completion time)
for each task. The bottom (denominator) is: add up all the processing
times. This is a weighted average — tasks with more work count more
heavily.
Schedules and Permutations (\sigma)
\sigma (the Greek letter "sigma") represents a schedule — the order
in which tasks are done.
If you have 3 tasks and decide to do them in order 2, 3, 1, then:
\sigma(1) = 2— the first task you do is task 2\sigma(2) = 3— the second task you do is task 3\sigma(3) = 1— the third task you do is task 1
When you see p_{\sigma(k)}, it means "the processing time of whichever
task is in position k of the schedule."
Ordering Symbols
b \preceq_\sigma ameans "taskbis scheduled at the same time as or before task $a$" — i.e.,bcomes first (or they are the same task)a \prec_\sigma bmeans "taskais scheduled strictly before task $b$"
These are just ways of saying "which task comes first in the schedule."
The Bar: Averages (\bar{C})
A bar over a letter means "average." So:
\bar{C}= average completion time\bar{C}_w= weighted average completion time (the subscriptwdistinguishes it)\bar{p}= average processing time
The Delta: Change (\Delta)
\Delta (capital delta) means "the change in" or "the difference."
\Delta_i = C_i - p_i— the delay for taski(how much longer it waited beyond its own processing time)\Delta D = w(q_j) \cdot p_i - w(q_i) \cdot p_j— how much the priority-weighted delay cost changes when you swap two tasks
Subscripts and Superscripts
Subscripts (below) are labels or indices:
p_i= processing time of taskip_{\max}= the largest processing timep_{\min}= the smallest processing time\bar{C}_{\text{SPT}}= average completion time under the SPT schedule\bar{C}_{\text{priority}}= average completion time under priority scheduling
Superscripts (above) indicate context or power:
p_a^2=p_a \times p_a(squared)\bar{C}^{\text{SPT}}= completion time under SPT (same idea as subscript, just different formatting)
Set Notation
q_i \in \{1, 2, 3, 4\}
The curly braces \{ \} define a set — a collection of possible
values. \in means "is a member of." So this reads: "the priority
q_i is one of the values 1, 2, 3, or 4."
The Piecewise Function (Cases)
w(q) = \begin{cases} 8 & q = 1 \text{ (Critical)} \\ 4 & q = 2 \text{ (High)} \\ 2 & q = 3 \text{ (Medium)} \\ 1 & q = 4 \text{ (Low)} \end{cases}
This defines a function that gives different outputs depending on the
input. Read it as: "If q is 1, then w is 8. If q is 2, then w
is 4." And so on. It is just a lookup table written in math notation.
Limits and Convergence
\lim_{T \to \infty} \frac{W(T)}{T} = \mu
Read as: "As T gets larger and larger (approaching infinity), the value
of W(T) / T gets closer and closer to \mu." In this paper, it means:
over a long enough time horizon, the throughput settles to a fixed rate
\mu regardless of scheduling order.
\tomeans "approaches" or "goes toward"\inftymeans infinity (without bound)
Mutual Information (I)
I(\sigma_{\text{SPT}}) = 0 \quad \text{when } p_i \perp q_i
I here measures how much knowing one thing tells you about another.
I = 0 means "knowing a task's position in the SPT schedule tells you
absolutely nothing about its priority." The schedule and the priority
system are statistically independent — they contain no information about
each other.
Functions
f(\bar{C}) means "some function of \bar{C}." A function takes an
input and produces an output. When the paper writes:
U_{\text{client}} = f\!\left(\bar{C}(\sigma)\right), \quad f' < 0
It means: client satisfaction (U) depends on the average completion
time, and f' < 0 means the function is decreasing — when the
average goes up, satisfaction goes down. The ' (prime) notation refers
to the derivative, which indicates the direction of change.
Correlation (\text{Corr})
\text{Corr}(p_i, q_i) \approx 0
Correlation measures whether two quantities move together. A correlation of 0 means they are unrelated — knowing the size of a task tells you nothing about its priority. A positive correlation means they increase together; a negative correlation means one goes up when the other goes down.
Variance (\text{Var}) and Standard Deviation (\sigma_p)
Variance measures how spread out a set of numbers is. If all tasks take about the same time, variance is low. If some are 15 minutes and others are 40 hours, variance is high.
The coefficient of variation CV = \sigma_p / \bar{p} normalizes
the spread by dividing by the average. Here \sigma_p is the standard
deviation (the square root of variance) of the processing times, and
\bar{p} is the mean. A CV above 1 means the spread is wider than
the average — the data is highly variable.
(Note: \sigma_p uses the same Greek letter as schedule \sigma, but
they are different things. Context tells you which is which — \sigma_p
with a subscript p always means standard deviation of processing times;
\sigma alone or \sigma(k) always means a schedule.)
How to Read the Proofs
The Exchange Argument
The most common proof technique in this paper works like this:
- Assume you have any schedule.
- Find two adjacent tasks that are in the "wrong" order.
- Swap them and show that the metric improves (or stays the same).
- Conclude: since every "wrong" pair can be improved by swapping, the best schedule has no "wrong" pairs — i.e., everything is in the "right" order.
This is like proving that a sorted list is optimal by showing that any out-of-order pair can be fixed by swapping, and each swap makes things better.
Reading a Proof Step by Step
Take the simplest proof in the paper (Theorem 1):
"Consider any schedule in which two adjacent tasks
i, jsatisfyp_i > p_jwith taskischeduled immediately before taskj."
Translation: Pick any two neighboring tasks where the bigger one comes first.
"Let
tbe the start time of taski."
Translation: Call the clock time when we start working on task i by
the name t. We don't need to know what t actually is.
"The change in the sum of completion times is: $(2p_i + p_j) - (p_i + 2p_j) = p_i - p_j > 0$"
Translation: When the bigger task comes first, the total completion time
is p_i - p_j hours more than if the smaller task came first. Since
p_i > p_j, this difference is positive — swapping them makes it better.
"Therefore SPT uniquely minimizes
\bar{C}(\sigma)."\blacksquare
Translation: Since every swap of big-before-small improves the metric, the best possible schedule is all tasks sorted from smallest to largest. Proof complete.
Quick Reference Card
| When you see... | It means... |
|---|---|
\sum_{i=1}^{n} |
Add up for each task from 1 to n |
\frac{a}{b} |
a divided by b |
\bar{X} |
The average of X |
\Delta X |
The change in X |
\sigma |
A schedule (the order tasks are done) |
\sigma(k) |
Which task is in position k |
p_i |
How long task i takes |
C_i |
When task i finishes |
q_i |
Priority class of task i |
w(q_i) |
How much priority class q_i is weighted |
\le, \ge |
Less/greater than or equal to |
\ne |
Not equal to |
\in |
Is a member of |
\forall |
For all |
\perp |
Independent of |
\blacksquare |
End of proof |
A Note on Reading Math
Mathematical notation is a compression format. The formula:
\bar{C}(\sigma) = \frac{1}{n} \sum_{k=1}^{n} C_{\sigma(k)}
says in 15 characters what takes a full sentence in English: "The average
completion time under schedule \sigma equals the sum of all individual
completion times divided by the number of tasks."
If a formula looks intimidating, break it apart:
- Find the main operation (usually
=splitting left from right) - Identify the pieces on each side
- Replace the symbols with words using the tables above
- Read it as a sentence
Every formula in this paper can be understood this way. The math is not doing anything more complicated than arithmetic — it is just doing it for a general case rather than specific numbers.
This guide accompanies the paper "Unweighted Average Completion Time Is Not a Fair Metric for Task Scheduling" (2026-03-28).