Despite major strides in the treatment of cancer, the development of drug resistance remains a major hurdle. One strategy which has been proposed to address this is the sequential application of drug therapies where resistance to one drug induces sensitivity to another drug, a concept called collateral sensitivity. The optimal timing of drug switching in these situations, however, remains unknown. To study this, we developed a dynamical model of sequential therapy on heterogeneous tumors comprised of resistant and sensitive cells. A pair of drugs (Drug A, Drug B) are utilized and are periodically switched during therapy. Assuming resistant cells to one drug are collaterally sensitive to the opposing drug, we classified cancer cells into two groups, AR and BR, each of which is a subpopulation of cells resistant to the indicated drug and concurrently sensitive to the other, and we subsequently explored the resulting population dynamics. Specifically, based on a system of ordinary differential equations for AR and BR, we determined that the optimal treatment strategy consists of two stages: an initial stage in which a chosen effective drug is utilized until a specific time point, T, and a second stage in which drugs are switched repeatedly, during which each drug is used for a relative duration (i.e. fΔt-long for Drug A and (1 − f)Δt-long for Drug B with 0 ≤ f ≤ 1 and Δt ≥ 0). We prove that the optimal duration of the initial stage, in which the first drug is administered, T, is shorter than the period in which it remains effective in decreasing the total population, contrary to current clinical intuition. We further analyzed the relationship between population makeup, A/B = AR/BR, and the effect of each drug. We determine a critical ratio, which we term (A/B)*, at which the two drugs are equally effective. As the first stage of the optimal strategy is applied, A/B changes monotonically to (A/B)* and then, during the second stage, remains at (A/B)* thereafter. Beyond our analytic results, we explored an individual based stochastic model and presented the distribution of extinction times for the classes of solutions found. Taken together, our results suggest opportunities to improve therapy scheduling in clinical oncology.