Modeling contiguity of machine processing in a flow shop environment via a MIP

Question

I'm working on a Mixed-Integer-Programing (MIP) formulation for a flow shop scheduling problem. One of the requirements/wishes is that for each machine $i$, processing should be contiguous, or at least contiguous to a certain extent. What I mean with contiguous processing is visualized below for a single machine:

High contiguity : Low contiguity:

Some important details of this problem:

Not all jobs visit all machines. For each machine $i$, the jobs that visit it are a member of the set $\mathcal{V}_i$.
Operations are not allowed to overlap on a machine.

I would like to influence the contiguity somehow by including it as a part of my cost function. My approach so far was:

For each operation $(i,j)$ of job $j$ on machine $i$, I do have a decision variable $c_{i,j}$ for its completion time. What's more, binary decision variable $x_{i,j,k}$ equals 1 IFF job $j$ precedes job $k$ on machine $i$ (not necessarily directly).
I introduce a continuous variable $\gamma_{i,j,k}$ that captures the difference between the completion time of operation $(i,j)$ and the start of operation $(i,k)$. That is $$\gamma_{i,j,k} = \quad (c_{i,k} - p_{i,k}) - c_{i,j}\quad \forall i; j\neq k\in\mathcal{V}_i\times \mathcal{V}_i,$$ where $p_{i,k}$ is the processing time of operation $(i,k)$ so that the first term denotes the corresponding starting time. Note that $\gamma_{i,j,k}$ is negative when $k$ precedes $j$ on machine $i$.
I introduce a binary variable $b_{i,j,k}$ that equals 1 IFF $\gamma_{i,j,k}\leq 0$. I model this via big-M formulations with sufficiently large constant $Q$: $$Q\cdot\big(1-b_{i,j,k}\big)-\gamma_{i,j,k} \geq \quad 0 \quad \forall i; j\neq k\in\mathcal{V}_i\times \mathcal{V}_i$$ $$Q\cdot\big(1-b_{i,j,k}\big)-\gamma_{i,j,k} \leq Q-\epsilon \quad \forall i; j\neq k\in\mathcal{V}_i\times \mathcal{V}_i$$
I then add the following term to my cost function over which I minimize: $$-\sum_i\sum_{j\neq k\in\mathcal{V}_i\times \mathcal{V}_i} x_{i,j,k}\cdot b_{i,j,k}.$$ By multiplying $b_{i,j,k}$ with $x_{i,j,k}$ I make sure that only those $b_{i,j,k}$ values contribute where job $j$ actually precedes job $k$ on machine $i$. In other words, for each machine $i$, for each pair of jobs $j,k$ ($j$ precedes $k$ on machine $i$), I look at the time inbetween their completion and start and add $1$ as penalty whenever this time is positive, that is, whenever these jobs do not start immediately after each other.

Note: whenever I use the indices $j,k$ simultaneously, I assume $j\neq k$.

Question:

Not sure if I can ask this here, but I was wondering if I could get some feedback on this. I observe that including this in my cost function dramatically slows up the solving time. Maybe someone has an idea for improvement, or a whole different approach to obtain the same?

Edit:

One issue here is that I do not know which jobs directly succeed each other. If I knew, it would be much easier to solve the problem. Then I would only have to iterate over all pairs of directly consecutive jobs, instead of all possible pairs for every machine. However, given the mentioned decision variables, I don't see how I can determine this.

First, does every job use every machine? You seem to be implying that, but do not explicitly state it. Second, do you allow operations to overlap on a machine (same machine doing two things at once)? Assuming the answer is no, the $b$ variables appear to be redundant. $\gamma_{i,j,k}\ge 0$ iff job $k$ follows $j$ (not necessarily directly) iff $x_{i,j,k}=1$. — prubin, Dec 01 '21 at 17:13
@prubin 1) I should've bee more clear about this. No I do not assume this, I'll add some extra details to the post. 2) I do not allow overlapping indeed. I don't see how the $b$ variables are redundant. With them, I would like to capture IF there is a positive idle time between two jobs on the same machine. Now if $k$ does precede $j$, then $\gamma_{i,j,k}<0$ and accordingly, $b_{i,j,k}=1$ in my formulation. I want to exclude this contribution of such pairs, which I do by multiplying with $x_{i,j,k}$. — Rutger, Dec 01 '21 at 17:52
Let $s_{i,k}=c_{i,k} - p_{i,k}$ be the start time for job $k$ on machine $i$, and consider the $(i,j,k)$ term in your penalty sum. $b_{i,j,k}=1\implies \gamma_{i,j,k} \le 0$. $\gamma_{i,j,k} = s_{i,k} - c_{i,j}$, so $b_{i,j,k}=1\implies s_{i,k} \le c_{i,j}$. Since jobs can't overlap, $s_{i,k} \le c_{i,j} \implies s_{i,k} \le s_{i,j}$ (else $k$ starts while $j$ is running). Thus $b_{i,j,k}=1\implies x_{i,k,j} = 1 \implies x_{i,j,k} = 0$. So the penalty term will always be zero. Am I making a mistake here? — prubin, Dec 01 '21 at 18:34
@prubin Your analysis is correct to a certain extent. The mistake lies in the implication $s_{i,k}\leq c_{i,j}\Rightarrow s_{i,k}\leq s_{i,j}$. Yes, it is correct that jobs cannot overlap, but you ignore the case $s_{i,k}=c_{i,j}$. This is exactly the case that I am looking for; for tuples $(i,j,k)$ with $s_{i,k}=c_{i,j}$, we have that $b_{i,k,j} = 1 = x_{i,j,k}$, resulting in a penalty term of $-1$, which is favourable as we minimize. — Rutger, Dec 02 '21 at 07:12

prubin · Accepted Answer · 2021-12-02T20:27:11.137

If I understand the question correctly (not a certainty), you want to reduce the number of times a machine $i$ falls idle. To start, suppose that we redefine $x_{i,j,k}$ to be 1 iff job $j$ immediately precedes job $k$ on machine $i$. That should be a fairly easy change to implement in the overall model, and the number of binary variables remains the same.

Now we add binary variable $z_{i,j}$ for all machines $i$ and all jobs $j\in \mathcal{V}_i$, taking value 1 iff machine $i$ falls idle for a time longer than some threshold $\epsilon > 0$ immediately after processing job $j$. Assuming you are going to penalize $z_{i,j}=1$, we only need to enforce the "if" side; the solver will not set $z_{i,j}=1$ unless it has to. To define $z_{i,j}$, we add the constraints $$s_{i,k} - c_{i,j} \le \epsilon + M(z_{i,j} + 1 - x_{i,j,k})$$ for all machines $i$ and all $j,k \in \mathcal{V_i}$ with $j\neq k$, where $M$ is the usual "large enough" constant and $s_{i,k}$ is the start time for job $k$ on machine $i$. The inequality is potentially binding only if $x_{i,j,k} = 1$ and $z_{i,j}=0$ (meaning $k$ immediately follows $j$ on $i$ with at most $\epsilon$ slack time).

Addendum: To address a comment, here is how to model using immediate precedence. We start with three constraints: $$\sum_k x_{i,j,k} \le 1 \quad \forall i,j$$$$\sum_j x_{i,j,k} \le 1 \quad \forall i,k$$ $$\sum_{j,k} x_{i,j,k} = \vert \mathcal{V_i} \vert - 1.$$These specify that every job has at most one immediate successor and at most one immediate predecessor, and that every job but one has a successor. To that we add the following constraint on timing:$$ s_{i,k} \ge c_{i,j} - M(1-x_{i,j,k}) \quad \forall i,j,k.$$This enforces the rule that the immediate successor of a task cannot start until the predecessor ends. (This is basically the Miller-Tucker-Zemlin TSP formulation with the start and end times taking on the role served by their counting variables (subtour elimination).

This seems like a good solution. However, I don't see how it is straightforward to redefine $x_{i,j,k}$ like you say. If I'm correct, I will then have to add a constraint stating that '$x_{i,j,k}=1$ IFF operation $(i,j)$ and $(i,k)$ directly succeed each other'. But for determining whether two operations succeed each other, don't we need some extra binary variable? As an example, I figured out how to determine the position of a job on a machine, denoted with $l_{i,j}\in\mathbb{Z}$ for operation $(i,j)$. Then I should find out how to model $l_{i,k}-l_{i,j}=1 \Leftrightarrow x_{i,k,j}=1$. — Rutger, Dec 02 '21 at 07:42
No, we don't need extra binaries. I've amended my answer to explain how the formulation would look. — prubin, Dec 02 '21 at 16:32
I assume that in your summations, the job indices cannot be equal? Unfortunately, this does not work. If I am not mistaken, the following counterexample will show ( I drop the machine index, only one machine): consider 3 operations $A\to B\to C$. Then we say $x_{AC}=x_{BC}=1$. — Rutger, Dec 02 '21 at 19:14
Sorry, I omitted one constraint: that every task has at most one predecessor as well as at most one successor. I've updated the answer (again). — prubin, Dec 02 '21 at 20:31

Modeling contiguity of machine processing in a flow shop environment via a MIP

1 Answers1