Competitive Programming

ICPC 2020 - L. Sweep Stakes

Each cell (i,j) independently contains a mine with probability a_ ij =p_i+q_j. We are given that the total number of mines is exactly t. For each query subset S of at most 500 cells, we must output the conditional dis...

Source sync Apr 19, 2026

Track ICPC

Year 2020

Files TeX, C++, statement assets

Folder competitive_programming/icpc/2020/L-sweep-stakes

ICPC2020TeXC++statement textstatement pdf

This page is built from the copied files in competitive_programming/icpc/2020/L-sweep-stakes. Edit competitive_programming/icpc/2020/L-sweep-stakes/solution.tex to update the written solution and competitive_programming/icpc/2020/L-sweep-stakes/solution.cpp to update the implementation.

The website does not replace those files with hand-maintained HTML. It reads the copied source tree during the build and exposes the exact files below.

Raw TeX Raw C++ Meta JSON

Problem statement

Problem Statement

Copied statement text kept beside the solution archive for direct reference.

Raw text Statement PDF

Problem L
                                         Sweep Stakes
                                    Time limit: 20 seconds
You may have already won! In fact, you did already win! You won your very own island, in the deepest
reaches of the unexplored ocean! Well, mostly unexplored. As it happens, there was a small military
base there before you, and when they packed up and flew out they left behind an assortment of scraps,
munitions, tunnels, . . . and unexploded defensive ordnance. That’s right: You now possess your very
own minefield.
The minefield consists of an m × n grid, with any square of the grid holding 0 or 1 mines. Fortunately,
you were able to recover the engineers’ plans from when they deployed the mines. Unfortunately, the
exact locations of the mines were never written down: the engineers had a preselected independent
probability of deploying a mine in each square. However, you do know how many mines were placed in
total.
You would like to estimate how safe various parts of your island are. Write a program to compute the
probability of mine counts over various subsets of the minefield.

Input

The first line of input contains four integers m, n, t, and q, where m and n (1 ≤ m, n ≤ 500) are the
dimensions of the minefield, t (0 ≤ t ≤ mn) is the total number of mines, and q (0 ≤ q ≤ 500) is
the number of queries. The second line contains m real numbers p1 , p2 , . . . , pm (0 ≤ pi ≤ 0.1 for all
i, with at most six digits after the decimal point specified), and the third line contains n real numbers
q1 , q2 , . . . , qn (0 ≤ qj ≤ 0.1 for all j, with at most six digits after the decimal point specified). The
preselected probability of the engineers placing a mine on square (i, j) is pi + qj . All choices of whether
to place a mine on a given square were made independently, and the value of t is chosen so that the
probability of deploying exactly t mines is at least 10−5 .
Each of the remaining q lines describes a single query. Each of those lines begins with an integer s
(0 ≤ s ≤ 500), followed by s pairs of integers i and j (1 ≤ i ≤ m, 1 ≤ j ≤ n), which are the
coordinates of s distinct squares in the grid.

Output

For each query with s squares, output s + 1 real numbers, which are the probabilities of the s given
squares containing 0, 1, . . . , s mines. Your answer should have an absolute error of at most 10−6 .

 Sample Input 1                                        Sample Output 1
 2 2 1 2                                               0.75 0.25
 0.05 0.05                                             0.5 0.5 0
 0.05 0.05
 1 1 1
 2 2 1 1 2

Sample Input 2
3 4 3 4
0.02 0.04 0.06
0.005 0.07 0.035 0.09
1 3 2
3 1 4 2 4 3 4
4 1 2 2 3 3 1 1 4
8 1 1 1 2 1 3 2 1 2 3 3 1 3 2 3 3

Sample Output 2
0.649469772      0.350530228
0.219607636      0.527423751 0.237646792 0.015321822
0.267615440      0.516222318 0.201611812 0.014550429 0
0.054047935      0.364731941 0.461044157 0.120175967 0 0 0 0 0

Editorial

Rendered from the copied solution.tex file. The original TeX source remains available below.

Raw TeX

Key Observations

Let $X=\sum X_{ij}$ be the total number of mines. The hard part is the conditioning on $X=t$. A standard exponential tilt removes that difficulty.
For any $z>0$, define a new independent model by \[ \Pr_z(X_{ij}=1)=b_{ij}(z)=\frac{a_{ij}z}{1-a_{ij}+a_{ij}z}. \] For every full configuration $\omega$ with exactly $t$ mines, \[ \Pr_z(\omega)=C(z)\,z^t\,\Pr(\omega), \] where $C(z)$ depends only on $z$, not on $\omega$. Therefore the conditional distribution given $X=t$ is identical in the original and tilted models.
We choose $z$ so that $\mathbb E_z[X]=t$. Then the event $X=t$ becomes a central event in the tilted model.
For a query subset $S$, let $Y$ be the number of mines inside $S$ and let $Z$ be the number of mines outside $S$. In the tilted model, $Y$ and $Z$ are independent, so \[ \Pr(Y=r\mid X=t)=\frac{\Pr_z(Y=r)\Pr_z(Z=t-r)}{\Pr_z(X=t)}. \] The inside part $\Pr_z(Y=r)$ is exact by a tiny $O(s^2)$ DP because $s\le 500$.
We only need point probabilities near the mean. So instead of storing the whole distribution of $X$, we store the distribution of $X \bmod M$ for a carefully chosen odd modulus $M$. If $M$ is large enough, the probability mass at the other congruent integers is exponentially tiny.

Algorithm

1. Choose the tilt

Let \[ f(z)=\sum_{i,j}\frac{a_{ij}z}{1-a_{ij}+a_{ij}z}. \] This is strictly increasing from $0$ to $mn$, so binary search finds the unique $z$ with $f(z)=t$. Then compute all tilted probabilities \[ b_{ij}= \frac{a_{ij}z}{1-a_{ij}+a_{ij}z}. \]

2. Choose a modulus

Let \[ \sigma^2=\sum_{i,j} b_{ij}(1-b_{ij}) \] be the tilted variance of the total count. For the largest query size $s_{\max}$, choose the smallest odd $M>s_{\max}$ such that \[ 2\exp\!\left(-\frac{(M-s_{\max})^2}{2(\sigma^2+(M-s_{\max})/3)}\right) < 10^{-11}. \]

This is Bernstein's tail bound. It ensures that any count at distance at least $M-s_{\max}$ from the mean has negligible probability, which is enough for the final $10^{-6}$ error bound.

3. Build the total distribution modulo $M$

Let $D[r]=\Pr_z(X\equiv r \pmod M)$. We build it by a cyclic DP over all cells. If the next cell has probability $p$, then \[ D_{\text{new}}[r]=(1-p)D[r]+pD[r-1] \] with indices modulo $M$. This costs $O(mnM)$.

4. Answer one query

Suppose the query cells are $c_1,\dots,c_s$.

Inside part.

Run the ordinary $O(s^2)$ Poisson-binomial DP on the probabilities $b_{c_1},\dots,b_{c_s}$ to get \[ I[r]=\Pr_z(Y=r). \]

Outside part modulo $M$.

Start from the already computed total residue distribution $D$ and remove the selected cells one by one.

For one cell with probability $p$, if $C$ is the residue distribution before adding that cell and $D$ is the residue distribution after adding it, then \[ D[r]=(1-p)C[r]+pC[r-1]. \] So answering the query requires inverting this linear transform $s$ times.

Because $M$ is odd, the transform is invertible for every $0<p<1$: its eigenvalues are $1-p+p\omega^j$ for the $M$-th roots of unity $\omega^j$, and the only way such a value could be zero is $p=\tfrac12$ and $\omega^j=-1$, impossible for odd $M$.

The inversion itself is done in $O(M)$ by a simple recurrence:

if $p\le \tfrac12$, solve forward using \[ C[r]=\frac{D[r]-pC[r-1]}{1-p}; \]
if $p> \tfrac12$, solve backward using \[ C[r-1]=\frac{D[r]-(1-p)C[r]}{p}. \]
In either direction one unknown initial value remains; it is determined by the final wrap-around equation.

After removing all selected cells we obtain \[ O[r]=\Pr_z(Z\equiv r \pmod M). \]

Final formula.

For every $r=0,\dots,s$ output \[ \frac{I[r]\cdot O[(t-r)\bmod M]}{D[t\bmod M]}. \] Because of the chosen bound on $M$, the residue probabilities differ from the true point probabilities by far less than the allowed error.

Correctness Proof

We prove that the algorithm outputs the correct probabilities.

Lemma 1.

For any $z>0$, conditioning on the total number of mines being $t$ gives the same distribution in the original and tilted models.

Proof.

Fix a full configuration $\omega$ and let $|\omega|$ be its number of mines. In the original model, \[ \Pr(\omega)=\prod_{\omega_{ij}=1} a_{ij}\prod_{\omega_{ij}=0}(1-a_{ij}). \] In the tilted model, \[ \Pr_z(\omega)=\prod_{\omega_{ij}=1}\frac{a_{ij}z}{1-a_{ij}+a_{ij}z} \prod_{\omega_{ij}=0}\frac{1-a_{ij}}{1-a_{ij}+a_{ij}z}. \] Therefore \[ \Pr_z(\omega)= \left(\prod_{i,j}\frac{1}{1-a_{ij}+a_{ij}z}\right) z^{|\omega|}\Pr(\omega). \] If $|\omega|=t$, the multiplicative factor depends only on $z$ and $t$, not on $\omega$. So after normalizing over all configurations with total $t$, both conditional distributions are identical. □

Lemma 2.

For any query subset $S$ and any $r$, \[ \Pr(Y=r\mid X=t)=\frac{\Pr_z(Y=r)\Pr_z(Z=t-r)}{\Pr_z(X=t)}. \]

Proof.

By Lemma 1 we may work in the tilted model. There the cells are independent, so the counts inside and outside the query are independent: \[ \Pr_z(Y=r, Z=t-r)=\Pr_z(Y=r)\Pr_z(Z=t-r). \] Since $X=Y+Z$, conditioning on $X=t$ gives exactly the stated formula. □

Lemma 3.

The one-cell residue update \[ D[r]=(1-p)C[r]+pC[r-1] \] is invertible for odd $M$ and $0<p<1$.

Proof.

This is a circulant linear map on residue vectors of length $M$. Its eigenvalues in the Fourier basis are \[ \lambda_j=1-p+p\omega^j, \] where $\omega$ is a primitive $M$-th root of unity. If $\lambda_j=0$, then $p\omega^j=-(1-p)$. Taking absolute values gives $p=1-p$, so $p=\tfrac12$, and then $\omega^j=-1$. But $-1$ is not an odd-order root of unity. Hence no eigenvalue is zero, so the map is invertible. □

Lemma 4.

For each query, the algorithm computes the exact residue distribution of the outside count $Z \bmod M$.

Proof.

The total residue distribution is obtained by repeatedly applying the one-cell forward update, so it is exact. Removing one selected cell applies the inverse of that exact linear map, which exists by Lemma 3. After removing all selected cells, what remains is exactly the distribution of the sum of all unselected cells modulo $M$, i.e. the residue distribution of $Z$. □

Lemma 5.

For every quantity used in the final answer, replacing a point probability by the corresponding residue probability changes the value by at most the Bernstein tail bound used when choosing $M$.

Proof.

In the tilted model the total count has mean $t$. For a query of size $s$, the outside count $Z$ has mean $t-\mathbb E_z[Y]$, so the requested point $t-r$ differs from that mean by at most $s$. Any other integer congruent to $t-r$ modulo $M$ is therefore at least $M-s$ away from the mean. The residue probability is the sum of the desired point probability and all those other congruent point probabilities, so the aliasing error is bounded by the tail probability outside distance $M-s$. Bernstein's inequality gives exactly the bound used by the algorithm. □

Theorem.

The algorithm outputs every query answer with the required accuracy.

Proof.

By Lemma 2 the exact answer factorizes into the inside point probability, the outside point probability, and the total point probability. The inside part is computed exactly by the small DP. By Lemma 4 the outside and total residue distributions are exact modulo $M$, and by Lemma 5 the replacement of point probabilities by these residue probabilities introduces only a negligible error. Therefore every reported probability is within the required tolerance. □

Complexity Analysis

Let $N=mn$ and let $M$ be the chosen odd modulus.

Solving for the tilt takes $O(N \log \text{precision})$.
Building the total residue distribution takes $O(NM)$.
A query of size $s$ takes $O(s^2+sM)$.
So the whole program runs in \[ O\!\left(NM + \sum_{\text{queries}} (s^2+sM)\right) \] time and \[ O(N+M) \] memory.

With the given constraints, the chosen $M$ stays small (about a few thousand at worst), so this easily fits.

Implementation Notes

The cases $t=0$ and $t=mn$ are deterministic and are handled separately.
The forward inversion is numerically stable when $p\le \tfrac12$, and the backward inversion is numerically stable when $p>\tfrac12$.
After each query, the output vector is renormalized to sum to $1$ in order to remove tiny floating-point drift.

C++ competitive_programming/icpc/2020/L-sweep-stakes/solution.cpp

Exact copied implementation source.

Raw file

#include <bits/stdc++.h>
using namespace std;

namespace {

constexpr double ALIAS_EPS = 1e-11;

struct Query {
    vector<int> cells;
};

double bernstein_tail_bound(double variance, int distance) {
    if (distance <= 0) {
        return 1.0;
    }
    double denom = 2.0 * (variance + distance / 3.0);
    return 2.0 * exp(-(double)distance * distance / denom);
}

vector<double> subset_distribution(const vector<double>& probs) {
    vector<double> dp(probs.size() + 1, 0.0);
    dp[0] = 1.0;
    int used = 0;
    for (double p : probs) {
        for (int k = used; k >= 0; --k) {
            dp[k + 1] += dp[k] * p;
            dp[k] *= 1.0 - p;
        }
        ++used;
    }
    return dp;
}

double solve_tilt(const vector<double>& base, int target) {
    if (target == 0) {
        return 0.0;
    }
    int total = (int)base.size();
    if (target == total) {
        return numeric_limits<double>::infinity();
    }

    auto mean_at = [&](double z) {
        double sum = 0.0;
        for (double a : base) {
            sum += a * z / (1.0 - a + a * z);
        }
        return sum;
    };

    double lo = 0.0;
    double hi = 1.0;
    while (mean_at(hi) < target) {
        hi *= 2.0;
    }
    for (int it = 0; it < 100; ++it) {
        double mid = (lo + hi) * 0.5;
        if (mean_at(mid) < target) {
            lo = mid;
        } else {
            hi = mid;
        }
    }
    return (lo + hi) * 0.5;
}

vector<double> build_mod_distribution(const vector<double>& probs, int mod) {
    vector<double> cur(mod, 0.0), nxt(mod, 0.0);
    cur[0] = 1.0;
    for (double p : probs) {
        double q = 1.0 - p;
        nxt[0] = cur[0] * q + cur[mod - 1] * p;
        for (int r = 1; r < mod; ++r) {
            nxt[r] = cur[r] * q + cur[r - 1] * p;
        }
        cur.swap(nxt);
    }
    return cur;
}

void remove_cell_forward(const vector<double>& src, double p, vector<double>& dst) {
    int mod = (int)src.size();
    double a = 1.0 - p;
    double b = p;

    double u = 0.0;
    double v = 1.0;
    for (int r = 1; r < mod; ++r) {
        u = (src[r] - b * u) / a;
        v = (-b * v) / a;
    }
    double x = (src[0] - b * u) / (a + b * v);

    dst[0] = x;
    for (int r = 1; r < mod; ++r) {
        dst[r] = (src[r] - b * dst[r - 1]) / a;
    }
}

void remove_cell_backward(const vector<double>& src, double p, vector<double>& dst) {
    int mod = (int)src.size();
    double a = 1.0 - p;
    double b = p;

    double u = 0.0;
    double v = 1.0;
    for (int r = mod - 2; r >= 0; --r) {
        u = (src[r + 1] - a * u) / b;
        v = (-a * v) / b;
    }
    double x = (src[0] - a * u) / (b + a * v);

    dst[mod - 1] = x;
    for (int r = mod - 2; r >= 0; --r) {
        dst[r] = (src[r + 1] - a * dst[r + 1]) / b;
    }
}

void remove_cell(const vector<double>& src, double p, vector<double>& dst) {
    if (p < 1e-15) {
        dst = src;
        return;
    }
    if (1.0 - p < 1e-15) {
        int mod = (int)src.size();
        for (int r = 0; r < mod; ++r) {
            dst[r] = src[(r + 1) % mod];
        }
        return;
    }
    if (p <= 0.5) {
        remove_cell_forward(src, p, dst);
    } else {
        remove_cell_backward(src, p, dst);
    }
}

void solve() {
    int m = 0, n = 0, t = 0, q = 0;
    cin >> m >> n >> t >> q;

    vector<double> row(m), col(n);
    for (double& x : row) {
        cin >> x;
    }
    for (double& x : col) {
        cin >> x;
    }

    vector<Query> queries(q);
    int max_s = 0;
    for (int qi = 0; qi < q; ++qi) {
        int s = 0;
        cin >> s;
        max_s = max(max_s, s);
        queries[qi].cells.resize(s);
        for (int i = 0; i < s; ++i) {
            int r = 0, c = 0;
            cin >> r >> c;
            --r;
            --c;
            queries[qi].cells[i] = r * n + c;
        }
    }

    int total_cells = m * n;
    if (t == 0) {
        for (const Query& query : queries) {
            cout << fixed << setprecision(9) << 1.0;
            for (int i = 0; i < (int)query.cells.size(); ++i) {
                cout << ' ' << 0.0;
            }
            cout << '\n';
        }
        return;
    }
    if (t == total_cells) {
        for (const Query& query : queries) {
            int s = (int)query.cells.size();
            for (int i = 0; i < s; ++i) {
                cout << fixed << setprecision(9) << 0.0 << ' ';
            }
            cout << fixed << setprecision(9) << 1.0 << '\n';
        }
        return;
    }

    vector<double> base(total_cells);
    for (int i = 0; i < m; ++i) {
        for (int j = 0; j < n; ++j) {
            base[i * n + j] = row[i] + col[j];
        }
    }

    double z = solve_tilt(base, t);
    vector<double> tilted(total_cells);
    double variance = 0.0;
    for (int idx = 0; idx < total_cells; ++idx) {
        double a = base[idx];
        double p = a * z / (1.0 - a + a * z);
        tilted[idx] = p;
        variance += p * (1.0 - p);
    }

    int mod = max_s + 1;
    if ((mod & 1) == 0) {
        ++mod;
    }
    while (bernstein_tail_bound(variance, mod - max_s) > ALIAS_EPS) {
        mod += 2;
    }

    vector<double> total_mod = build_mod_distribution(tilted, mod);
    double denom = total_mod[t % mod];
    if (denom < 1e-300) {
        denom = 1e-300;
    }

    vector<double> cur = total_mod;
    vector<double> tmp(mod);

    for (const Query& query : queries) {
        vector<double> sel_prob;
        sel_prob.reserve(query.cells.size());
        for (int cell : query.cells) {
            sel_prob.push_back(tilted[cell]);
        }

        vector<double> inside = subset_distribution(sel_prob);
        cur = total_mod;
        for (double p : sel_prob) {
            remove_cell(cur, p, tmp);
            cur.swap(tmp);
        }

        vector<double> answer(query.cells.size() + 1, 0.0);
        double sum_ans = 0.0;
        for (int r = 0; r <= (int)query.cells.size(); ++r) {
            int residue = (t - r) % mod;
            if (residue < 0) {
                residue += mod;
            }
            double val = inside[r] * cur[residue] / denom;
            if (val < 0.0 && val > -1e-12) {
                val = 0.0;
            }
            answer[r] = val;
            sum_ans += val;
        }

        if (sum_ans > 0.0) {
            for (double& x : answer) {
                x = max(0.0, x / sum_ans);
            }
        }

        for (int i = 0; i < (int)answer.size(); ++i) {
            if (i) {
                cout << ' ';
            }
            cout << fixed << setprecision(9) << answer[i];
        }
        cout << '\n';
    }
}

}  // namespace

int main() {
    ios::sync_with_stdio(false);
    cin.tie(nullptr);

    solve();
    return 0;
}

TeX write-up competitive_programming/icpc/2020/L-sweep-stakes/solution.tex

Exact copied write-up source.

Raw file

\documentclass[11pt]{article}
\usepackage[margin=1in]{geometry}
\usepackage[T1]{fontenc}
\usepackage[utf8]{inputenc}
\usepackage{amsmath,amssymb,amsthm}
\usepackage{enumitem}

\title{ICPC World Finals 2020\\L. Sweep Stakes}
\author{}
\date{}

\begin{document}
\maketitle

\section*{Problem Summary}

Each cell $(i,j)$ independently contains a mine with probability $a_{ij}=p_i+q_j$.
We are given that the total number of mines is exactly $t$.
For each query subset $S$ of at most $500$ cells, we must output the conditional distribution of the number of mines in
$S$.

\section*{Key Observations}

\begin{itemize}[leftmargin=*]
    \item Let $X=\sum X_{ij}$ be the total number of mines.
    The hard part is the conditioning on $X=t$.
    A standard exponential tilt removes that difficulty.
    \item For any $z>0$, define a new independent model by
    \[
        \Pr_z(X_{ij}=1)=b_{ij}(z)=\frac{a_{ij}z}{1-a_{ij}+a_{ij}z}.
    \]
    For every full configuration $\omega$ with exactly $t$ mines,
    \[
        \Pr_z(\omega)=C(z)\,z^t\,\Pr(\omega),
    \]
    where $C(z)$ depends only on $z$, not on $\omega$.
    Therefore the conditional distribution given $X=t$ is \emph{identical} in the original and tilted models.
    \item We choose $z$ so that $\mathbb E_z[X]=t$.
    Then the event $X=t$ becomes a central event in the tilted model.
    \item For a query subset $S$, let $Y$ be the number of mines inside $S$ and let $Z$ be the number of mines outside
    $S$.
    In the tilted model, $Y$ and $Z$ are independent, so
    \[
        \Pr(Y=r\mid X=t)=\frac{\Pr_z(Y=r)\Pr_z(Z=t-r)}{\Pr_z(X=t)}.
    \]
    The inside part $\Pr_z(Y=r)$ is exact by a tiny $O(s^2)$ DP because $s\le 500$.
    \item We only need point probabilities near the mean.
    So instead of storing the whole distribution of $X$, we store the distribution of $X \bmod M$ for a carefully
    chosen odd modulus $M$.
    If $M$ is large enough, the probability mass at the other congruent integers is exponentially tiny.
\end{itemize}

\section*{Algorithm}

\subsection*{1. Choose the tilt}

Let
\[
    f(z)=\sum_{i,j}\frac{a_{ij}z}{1-a_{ij}+a_{ij}z}.
\]
This is strictly increasing from $0$ to $mn$, so binary search finds the unique $z$ with $f(z)=t$.
Then compute all tilted probabilities
\[
    b_{ij}= \frac{a_{ij}z}{1-a_{ij}+a_{ij}z}.
\]

\subsection*{2. Choose a modulus}

Let
\[
    \sigma^2=\sum_{i,j} b_{ij}(1-b_{ij})
\]
be the tilted variance of the total count.
For the largest query size $s_{\max}$, choose the smallest odd $M>s_{\max}$ such that
\[
    2\exp\!\left(-\frac{(M-s_{\max})^2}{2(\sigma^2+(M-s_{\max})/3)}\right) < 10^{-11}.
\]

This is Bernstein's tail bound. It ensures that any count at distance at least $M-s_{\max}$ from the mean has
negligible probability, which is enough for the final $10^{-6}$ error bound.

\subsection*{3. Build the total distribution modulo $M$}

Let $D[r]=\Pr_z(X\equiv r \pmod M)$.
We build it by a cyclic DP over all cells.
If the next cell has probability $p$, then
\[
    D_{\text{new}}[r]=(1-p)D[r]+pD[r-1]
\]
with indices modulo $M$.
This costs $O(mnM)$.

\subsection*{4. Answer one query}

Suppose the query cells are $c_1,\dots,c_s$.

\paragraph{Inside part.}
Run the ordinary $O(s^2)$ Poisson-binomial DP on the probabilities $b_{c_1},\dots,b_{c_s}$ to get
\[
    I[r]=\Pr_z(Y=r).
\]

\paragraph{Outside part modulo $M$.}
Start from the already computed total residue distribution $D$ and remove the selected cells one by one.

For one cell with probability $p$, if $C$ is the residue distribution before adding that cell and $D$ is the residue
distribution after adding it, then
\[
    D[r]=(1-p)C[r]+pC[r-1].
\]
So answering the query requires inverting this linear transform $s$ times.

Because $M$ is odd, the transform is invertible for every $0<p<1$:
its eigenvalues are $1-p+p\omega^j$ for the $M$-th roots of unity $\omega^j$, and the only way such a value could be
zero is $p=\tfrac12$ and $\omega^j=-1$, impossible for odd $M$.

The inversion itself is done in $O(M)$ by a simple recurrence:
\begin{itemize}[leftmargin=*]
    \item if $p\le \tfrac12$, solve forward using
    \[
        C[r]=\frac{D[r]-pC[r-1]}{1-p};
    \]
    \item if $p> \tfrac12$, solve backward using
    \[
        C[r-1]=\frac{D[r]-(1-p)C[r]}{p}.
    \]
\end{itemize}
In either direction one unknown initial value remains; it is determined by the final wrap-around equation.

After removing all selected cells we obtain
\[
    O[r]=\Pr_z(Z\equiv r \pmod M).
\]

\paragraph{Final formula.}
For every $r=0,\dots,s$ output
\[
    \frac{I[r]\cdot O[(t-r)\bmod M]}{D[t\bmod M]}.
\]
Because of the chosen bound on $M$, the residue probabilities differ from the true point probabilities by far less than
the allowed error.

\section*{Correctness Proof}

We prove that the algorithm outputs the correct probabilities.

\paragraph{Lemma 1.}
For any $z>0$, conditioning on the total number of mines being $t$ gives the same distribution in the original and
tilted models.

\paragraph{Proof.}
Fix a full configuration $\omega$ and let $|\omega|$ be its number of mines.
In the original model,
\[
    \Pr(\omega)=\prod_{\omega_{ij}=1} a_{ij}\prod_{\omega_{ij}=0}(1-a_{ij}).
\]
In the tilted model,
\[
    \Pr_z(\omega)=\prod_{\omega_{ij}=1}\frac{a_{ij}z}{1-a_{ij}+a_{ij}z}
    \prod_{\omega_{ij}=0}\frac{1-a_{ij}}{1-a_{ij}+a_{ij}z}.
\]
Therefore
\[
    \Pr_z(\omega)=
    \left(\prod_{i,j}\frac{1}{1-a_{ij}+a_{ij}z}\right)
    z^{|\omega|}\Pr(\omega).
\]
If $|\omega|=t$, the multiplicative factor depends only on $z$ and $t$, not on $\omega$.
So after normalizing over all configurations with total $t$, both conditional distributions are identical. \qed

\paragraph{Lemma 2.}
For any query subset $S$ and any $r$,
\[
    \Pr(Y=r\mid X=t)=\frac{\Pr_z(Y=r)\Pr_z(Z=t-r)}{\Pr_z(X=t)}.
\]

\paragraph{Proof.}
By Lemma~1 we may work in the tilted model.
There the cells are independent, so the counts inside and outside the query are independent:
\[
    \Pr_z(Y=r, Z=t-r)=\Pr_z(Y=r)\Pr_z(Z=t-r).
\]
Since $X=Y+Z$, conditioning on $X=t$ gives exactly the stated formula. \qed

\paragraph{Lemma 3.}
The one-cell residue update
\[
    D[r]=(1-p)C[r]+pC[r-1]
\]
is invertible for odd $M$ and $0<p<1$.

\paragraph{Proof.}
This is a circulant linear map on residue vectors of length $M$.
Its eigenvalues in the Fourier basis are
\[
    \lambda_j=1-p+p\omega^j,
\]
where $\omega$ is a primitive $M$-th root of unity.
If $\lambda_j=0$, then $p\omega^j=-(1-p)$.
Taking absolute values gives $p=1-p$, so $p=\tfrac12$, and then $\omega^j=-1$.
But $-1$ is not an odd-order root of unity.
Hence no eigenvalue is zero, so the map is invertible. \qed

\paragraph{Lemma 4.}
For each query, the algorithm computes the exact residue distribution of the outside count $Z \bmod M$.

\paragraph{Proof.}
The total residue distribution is obtained by repeatedly applying the one-cell forward update, so it is exact.
Removing one selected cell applies the inverse of that exact linear map, which exists by Lemma~3.
After removing all selected cells, what remains is exactly the distribution of the sum of all unselected cells modulo
$M$, i.e.\ the residue distribution of $Z$. \qed

\paragraph{Lemma 5.}
For every quantity used in the final answer, replacing a point probability by the corresponding residue probability
changes the value by at most the Bernstein tail bound used when choosing $M$.

\paragraph{Proof.}
In the tilted model the total count has mean $t$.
For a query of size $s$, the outside count $Z$ has mean $t-\mathbb E_z[Y]$, so the requested point $t-r$ differs from
that mean by at most $s$.
Any other integer congruent to $t-r$ modulo $M$ is therefore at least $M-s$ away from the mean.
The residue probability is the sum of the desired point probability and all those other congruent point probabilities,
so the aliasing error is bounded by the tail probability outside distance $M-s$.
Bernstein's inequality gives exactly the bound used by the algorithm. \qed

\paragraph{Theorem.}
The algorithm outputs every query answer with the required accuracy.

\paragraph{Proof.}
By Lemma~2 the exact answer factorizes into the inside point probability, the outside point probability, and the total
point probability.
The inside part is computed exactly by the small DP.
By Lemma~4 the outside and total residue distributions are exact modulo $M$, and by Lemma~5 the replacement of point
probabilities by these residue probabilities introduces only a negligible error.
Therefore every reported probability is within the required tolerance. \qed

\section*{Complexity Analysis}

Let $N=mn$ and let $M$ be the chosen odd modulus.

\begin{itemize}[leftmargin=*]
    \item Solving for the tilt takes $O(N \log \text{precision})$.
    \item Building the total residue distribution takes $O(NM)$.
    \item A query of size $s$ takes $O(s^2+sM)$.
\end{itemize}

So the whole program runs in
\[
    O\!\left(NM + \sum_{\text{queries}} (s^2+sM)\right)
\]
time and
\[
    O(N+M)
\]
memory.

With the given constraints, the chosen $M$ stays small (about a few thousand at worst), so this easily fits.

\section*{Implementation Notes}

\begin{itemize}[leftmargin=*]
    \item The cases $t=0$ and $t=mn$ are deterministic and are handled separately.
    \item The forward inversion is numerically stable when $p\le \tfrac12$, and the backward inversion is numerically
    stable when $p>\tfrac12$.
    \item After each query, the output vector is renormalized to sum to $1$ in order to remove tiny floating-point
    drift.
\end{itemize}

\end{document}

C++ implementation competitive_programming/icpc/2020/L-sweep-stakes/solution.cpp

Exact copied implementation source.

Raw file

#include <bits/stdc++.h>
using namespace std;

namespace {

constexpr double ALIAS_EPS = 1e-11;

struct Query {
    vector<int> cells;
};

double bernstein_tail_bound(double variance, int distance) {
    if (distance <= 0) {
        return 1.0;
    }
    double denom = 2.0 * (variance + distance / 3.0);
    return 2.0 * exp(-(double)distance * distance / denom);
}

vector<double> subset_distribution(const vector<double>& probs) {
    vector<double> dp(probs.size() + 1, 0.0);
    dp[0] = 1.0;
    int used = 0;
    for (double p : probs) {
        for (int k = used; k >= 0; --k) {
            dp[k + 1] += dp[k] * p;
            dp[k] *= 1.0 - p;
        }
        ++used;
    }
    return dp;
}

double solve_tilt(const vector<double>& base, int target) {
    if (target == 0) {
        return 0.0;
    }
    int total = (int)base.size();
    if (target == total) {
        return numeric_limits<double>::infinity();
    }

    auto mean_at = [&](double z) {
        double sum = 0.0;
        for (double a : base) {
            sum += a * z / (1.0 - a + a * z);
        }
        return sum;
    };

    double lo = 0.0;
    double hi = 1.0;
    while (mean_at(hi) < target) {
        hi *= 2.0;
    }
    for (int it = 0; it < 100; ++it) {
        double mid = (lo + hi) * 0.5;
        if (mean_at(mid) < target) {
            lo = mid;
        } else {
            hi = mid;
        }
    }
    return (lo + hi) * 0.5;
}

vector<double> build_mod_distribution(const vector<double>& probs, int mod) {
    vector<double> cur(mod, 0.0), nxt(mod, 0.0);
    cur[0] = 1.0;
    for (double p : probs) {
        double q = 1.0 - p;
        nxt[0] = cur[0] * q + cur[mod - 1] * p;
        for (int r = 1; r < mod; ++r) {
            nxt[r] = cur[r] * q + cur[r - 1] * p;
        }
        cur.swap(nxt);
    }
    return cur;
}

void remove_cell_forward(const vector<double>& src, double p, vector<double>& dst) {
    int mod = (int)src.size();
    double a = 1.0 - p;
    double b = p;

    double u = 0.0;
    double v = 1.0;
    for (int r = 1; r < mod; ++r) {
        u = (src[r] - b * u) / a;
        v = (-b * v) / a;
    }
    double x = (src[0] - b * u) / (a + b * v);

    dst[0] = x;
    for (int r = 1; r < mod; ++r) {
        dst[r] = (src[r] - b * dst[r - 1]) / a;
    }
}

void remove_cell_backward(const vector<double>& src, double p, vector<double>& dst) {
    int mod = (int)src.size();
    double a = 1.0 - p;
    double b = p;

    double u = 0.0;
    double v = 1.0;
    for (int r = mod - 2; r >= 0; --r) {
        u = (src[r + 1] - a * u) / b;
        v = (-a * v) / b;
    }
    double x = (src[0] - a * u) / (b + a * v);

    dst[mod - 1] = x;
    for (int r = mod - 2; r >= 0; --r) {
        dst[r] = (src[r + 1] - a * dst[r + 1]) / b;
    }
}

void remove_cell(const vector<double>& src, double p, vector<double>& dst) {
    if (p < 1e-15) {
        dst = src;
        return;
    }
    if (1.0 - p < 1e-15) {
        int mod = (int)src.size();
        for (int r = 0; r < mod; ++r) {
            dst[r] = src[(r + 1) % mod];
        }
        return;
    }
    if (p <= 0.5) {
        remove_cell_forward(src, p, dst);
    } else {
        remove_cell_backward(src, p, dst);
    }
}

void solve() {
    int m = 0, n = 0, t = 0, q = 0;
    cin >> m >> n >> t >> q;

    vector<double> row(m), col(n);
    for (double& x : row) {
        cin >> x;
    }
    for (double& x : col) {
        cin >> x;
    }

    vector<Query> queries(q);
    int max_s = 0;
    for (int qi = 0; qi < q; ++qi) {
        int s = 0;
        cin >> s;
        max_s = max(max_s, s);
        queries[qi].cells.resize(s);
        for (int i = 0; i < s; ++i) {
            int r = 0, c = 0;
            cin >> r >> c;
            --r;
            --c;
            queries[qi].cells[i] = r * n + c;
        }
    }

    int total_cells = m * n;
    if (t == 0) {
        for (const Query& query : queries) {
            cout << fixed << setprecision(9) << 1.0;
            for (int i = 0; i < (int)query.cells.size(); ++i) {
                cout << ' ' << 0.0;
            }
            cout << '\n';
        }
        return;
    }
    if (t == total_cells) {
        for (const Query& query : queries) {
            int s = (int)query.cells.size();
            for (int i = 0; i < s; ++i) {
                cout << fixed << setprecision(9) << 0.0 << ' ';
            }
            cout << fixed << setprecision(9) << 1.0 << '\n';
        }
        return;
    }

    vector<double> base(total_cells);
    for (int i = 0; i < m; ++i) {
        for (int j = 0; j < n; ++j) {
            base[i * n + j] = row[i] + col[j];
        }
    }

    double z = solve_tilt(base, t);
    vector<double> tilted(total_cells);
    double variance = 0.0;
    for (int idx = 0; idx < total_cells; ++idx) {
        double a = base[idx];
        double p = a * z / (1.0 - a + a * z);
        tilted[idx] = p;
        variance += p * (1.0 - p);
    }

    int mod = max_s + 1;
    if ((mod & 1) == 0) {
        ++mod;
    }
    while (bernstein_tail_bound(variance, mod - max_s) > ALIAS_EPS) {
        mod += 2;
    }

    vector<double> total_mod = build_mod_distribution(tilted, mod);
    double denom = total_mod[t % mod];
    if (denom < 1e-300) {
        denom = 1e-300;
    }

    vector<double> cur = total_mod;
    vector<double> tmp(mod);

    for (const Query& query : queries) {
        vector<double> sel_prob;
        sel_prob.reserve(query.cells.size());
        for (int cell : query.cells) {
            sel_prob.push_back(tilted[cell]);
        }

        vector<double> inside = subset_distribution(sel_prob);
        cur = total_mod;
        for (double p : sel_prob) {
            remove_cell(cur, p, tmp);
            cur.swap(tmp);
        }

        vector<double> answer(query.cells.size() + 1, 0.0);
        double sum_ans = 0.0;
        for (int r = 0; r <= (int)query.cells.size(); ++r) {
            int residue = (t - r) % mod;
            if (residue < 0) {
                residue += mod;
            }
            double val = inside[r] * cur[residue] / denom;
            if (val < 0.0 && val > -1e-12) {
                val = 0.0;
            }
            answer[r] = val;
            sum_ans += val;
        }

        if (sum_ans > 0.0) {
            for (double& x : answer) {
                x = max(0.0, x / sum_ans);
            }
        }

        for (int i = 0; i < (int)answer.size(); ++i) {
            if (i) {
                cout << ' ';
            }
            cout << fixed << setprecision(9) << answer[i];
        }
        cout << '\n';
    }
}

}  // namespace

int main() {
    ios::sync_with_stdio(false);
    cin.tie(nullptr);

    solve();
    return 0;
}

ICPC 2020 - L. Sweep Stakes

Source-first archive entry

Problem Statement

Editorial

Key Observations

Algorithm

1. Choose the tilt

2. Choose a modulus

3. Build the total distribution modulo $M$

4. Answer one query

Correctness Proof

Complexity Analysis

Implementation Notes

Code

Source Files