## Reforking

Reforking is `pitchfork`'s main feature. To understand how it works, you must first understand Copy-on-Write.

### Copy-on-Write

In old UNIX systems of the ’70s or ’80s, forking a process involved copying its entire addressable memory over
to the new process address space, effectively doubling the memory usage. But since the mid ’90s, that’s no longer
true as, most, if not all, fork implementations are now sophisticated enough to trick the processes into thinking
they have their own private memory regions, while in reality they’re sharing it with other processes.

When the child process is forked, its pages tables are initialized to point to the parent’s memory pages. Later on,
if either the parent or the child tries to write in one of these pages, the operating system is notified and will
actually copy the page before it’s modified.

This means that if neither the child nor the parent write in these shared pages after the fork happens,
forked processes are essentially free.

### Shared Memory Invalidation in Ruby

So in theory, preforking servers shouldn't use more memory than threaded servers.

However, in a Ruby process, there is generally a lot of memory regions that are lazily initialized.
This includes the Ruby Virtual Machine inline caches, JITed code if you use YJIT, and also
some common patterns in applications, such as memoization:

```ruby
module MyCode
  def self.some_data
    @some_data ||= File.read("path/to/something")
  end
end
```

However, since workers are forked right after boot, most codepaths have never been executed,
so most of these caches are not yet initialized.

As more code gets executed, more and more memory pages get invalidated. If you were to graph the ratio
of shared memory of a Ruby process over time, you'd likely see a logarithmic curve, with a quick degradation
during the first few processed request as the most common code paths get warmed up, and then a stabilization.

### Reforking

That is where reforking helps. Since most of these invalidations only happen when a code path is executed for the
first time, if you take a warmed up worker out of rotation, and use it to fork new workers, warmed up pages will
be shared again, and most of them won't be invalidated anymore.


When you start `pitchfork` it forks a `mold` process which loads your application:

```
PID   COMMAND
100   \_ pitchfork master
101       \_ pitchfork (gen:0) mold
```

Once the `mold` is done loading, the `master` asks it to spawn the desired number of workers:

```
PID   COMMAND
100   \_ pitchfork master
101       \_ pitchfork (gen:0) mold
102       \_ pitchfork (gen:0) worker[0]
103       \_ pitchfork (gen:0) worker[1]
104       \_ pitchfork (gen:0) worker[2]
105       \_ pitchfork (gen:0) worker[3]
```

As the diagram shows, while workers are forked from the mold, they become children of the master process.
We'll see how does that work [later](#forking-sibling-processes).

When a reforking is triggered, one of the workers is selected to fork a new `mold`:

```
PID   COMMAND
100   \_ pitchfork master
101       \_ pitchfork (gen:0) mold
102       \_ pitchfork (gen:0) worker[0]
103       \_ pitchfork (gen:0) worker[1]
104       \_ pitchfork (gen:0) worker[2]
105       \_ pitchfork (gen:0) worker[3]
105       \_ pitchfork (gen:1) mold
```

Again, while the mold was forked from a worker, it becomes a child of the master process.
We'll see how does that work [later](#forking-sibling-processes).

When that new mold is ready, `pitchfork` terminates the old mold and starts a slow rollout of older workers and replace them with fresh workers
forked from the mold:

```
PID   COMMAND
100   \_ pitchfork master
102       \_ pitchfork (gen:0) worker[0]
103       \_ pitchfork (gen:0) worker[1]
104       \_ pitchfork (gen:0) worker[2]
105       \_ pitchfork (gen:0) worker[3]
105       \_ pitchfork (gen:1) mold
```

```
PID   COMMAND
100   \_ pitchfork master
103       \_ pitchfork (gen:0) worker[1]
104       \_ pitchfork (gen:0) worker[2]
105       \_ pitchfork (gen:0) worker[3]
105       \_ pitchfork (gen:1) mold
106       \_ pitchfork (gen:1) worker[0]
```

etc.

### Forking Sibling Processes

Normally on unix systems, when calling `fork(2)`, the newly created process is a child of the original one, so forking from the mold should create
a process tree such as:

```
PID   COMMAND
100   \_ pitchfork master
101       \_ pitchfork mold (gen:1)
105          \_ pitchfork (gen:1) worker[0]
```

However the `pitchfork` master process registers itself as a "child subreaper" via [`PR_SET_CHILD_SUBREAPER`](https://man7.org/linux/man-pages/man2/prctl.2.html).
This means that any descendant process that is orphaned will be re-parented as a child of the master process rather than a child of the init process (pid 1).

With this in mind, the mold forks twice to create an orphaned process that will get re-attached to the master process,
effectively forking a sibling rather than a child. Similarly, workers do the same when forking new molds.
This technique eases killing previous generations of molds and workers.

The need for `PR_SET_CHILD_SUBREAPER` is the main reason why reforking is only available on Linux.