
4. Task execution

1. Executing tasks in threads

We need to identify a boundary for each task that fosters task independence; coupled with a sensible task execution policy, this lets tasks run without interfering with one another.

For example, server applications often choose a single client request as the task boundary. This keeps one task from being affected by others, and an individual request is easy to digest, requiring only a small fraction of the server's total capacity.

1.1. Executing tasks sequentially

Following is a simple web server that executes tasks sequentially. This server has poor throughput and responsiveness: when a request comes in, it might block on an I/O operation, unnecessarily delaying every other incoming request.

class SingleThreadWebServer {
    public static void main(String[] args) throws IOException {
        ServerSocket socket = new ServerSocket(80);
        while (true) {
            Socket connection = socket.accept();
            handleRequest(connection);
        }
    } 
}

1.2. Explicitly creating threads for tasks

In the following code, we create a new thread for each incoming request. This offers noticeably higher throughput and responsiveness than the sequential version.

class ThreadPerTaskWebServer {
    public static void main(String[] args) throws IOException {
        ServerSocket socket = new ServerSocket(80);
        while (true) {
            final Socket connection = socket.accept();
            Runnable task = new Runnable() {
                public void run() {
                    handleRequest(connection);
                }
            };
            new Thread(task).start();
        } 
    }
}

1.3. Disadvantages of unbounded thread creation.

However, the thread-per-task approach has some practical drawbacks:

  1. Thread lifecycle overhead: Thread creation and teardown are not free; if requests are frequent and lightweight, this overhead introduces significant latency.
  2. Resource consumption: The key idea is to have just enough threads to keep the CPUs busy. Once they are, adding more threads only costs performance through contention between threads. Memory usage also grows with each added thread, which can pressure the garbage collector and make things worse.
  3. Stability: There is a limit on the number of threads an application is allowed to create; hitting it can result in an OutOfMemoryError, which is risky to recover from. It is far easier to structure your program to avoid hitting this limit.
  4. Possibility of server crash: This approach provides no means of limiting how many threads can be created. A malicious user, or simply enough ordinary users, can push traffic beyond your application's maximum capacity. An application should always aim for high availability and graceful degradation under load.

2. The executor framework

To address the problems of the sequential and thread-per-task approaches, we can use an implementation of Executor that limits the number of threads that can be created: a thread pool.

It is worth mentioning what Executor is. Following is its interface:

public interface Executor {
    void execute(Runnable command);
}

Executor represents the execution policy of an application (or part of it), which specifies the "what, where, when, and how" of task execution, including:

  1. In what thread tasks will be executed.
  2. In what order tasks will be executed (FIFO, LIFO, by priority).
  3. How many tasks may execute concurrently.
  4. How many tasks may be queued pending execution.
  5. If a task must be rejected because the system is overloaded, which task should be sacrificed, and how the application should be notified.
  6. What actions should be taken before or after executing a task.

✍️ Excerpt: Separating the specification of execution policy from task submission makes it practical to select an execution policy at deployment time that is matched to the available hardware.

For example, you can benchmark to determine how many threads run well within your memory and CPU limits, and tune the Executor accordingly.

Following is a webserver using Executor:

class TaskExecutionWebServer {
    private static final int NTHREADS = 100;
    private static final Executor exec
            = Executors.newFixedThreadPool(NTHREADS);

    public static void main(String[] args) throws IOException {
        ServerSocket socket = new ServerSocket(80);
        while (true) {
            final Socket connection = socket.accept();
            Runnable task = new Runnable() {
                public void run() {
                    handleRequest(connection);
                }
            };
            exec.execute(task);
        }
    }
}

As you can see, the web server uses an Executor as its policy for executing concurrent requests. This simplifies the server's responsibility: it merely submits each request to the Executor, which executes the tasks according to its internal policy, instead of spreading that implementation throughout the server itself.

You can also implement an execution policy like the thread-per-task web server's, as well as a sequential one:

public class ThreadPerTaskExecutor implements Executor {
    public void execute(Runnable r) {
        new Thread(r).start();
    }
}
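A sequential policy can likewise be expressed as an Executor that runs each task synchronously in the calling thread. A minimal sketch (the class name here is illustrative):

```java
import java.util.concurrent.Executor;

// An Executor with a sequential execution policy: execute() runs the
// task directly in the caller's thread, so tasks never overlap.
class WithinThreadExecutor implements Executor {
    public void execute(Runnable r) {
        r.run();
    }
}
```

Because the policy lives behind the Executor interface, a server written against Executor can switch between sequential, thread-per-task, and pooled execution without changing its own code.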

Executor also opens the door to all sorts of additional opportunities for tuning, management, monitoring, logging, and error reporting that would have been far more difficult to add without a task execution framework.

2.1. Thread pools

A thread pool executor places a limit on the number of worker threads that can be created. Each worker thread repeatedly takes a task from a task queue (which can be bounded or unbounded, though an unbounded queue can exhaust memory) and executes it.

The thread-pool approach addresses the problems mentioned in Section 1.3: it creates a new thread only when the number of requests exceeds the current pool size, so it bounds how often threads are created, and reusing existing threads improves responsiveness.

There are several thread-pool implementations, which you can set up using the static factory methods in Executors:

  1. newFixedThreadPool: a fixed-size pool that creates threads as tasks are submitted, up to the maximum pool size.
  2. newCachedThreadPool: a pool that grows when demand increases and reaps threads that have been idle.
  3. newSingleThreadExecutor: a single worker thread that processes tasks sequentially.
  4. newScheduledThreadPool: a fixed-size pool that supports delayed and periodic task execution.

The newFixedThreadPool and newCachedThreadPool factories return instances of the general-purpose ThreadPoolExecutor, which can also be used directly to construct more specialized executors.
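The factory calls themselves look like this; the pool sizes below are illustrative, not recommendations:

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;

class PoolFactories {
    public static void main(String[] args) {
        // Fixed-size pool: creates up to 4 threads, then reuses them.
        ExecutorService fixed = Executors.newFixedThreadPool(4);
        // Cached pool: grows on demand, reaps threads idle for 60 seconds.
        ExecutorService cached = Executors.newCachedThreadPool();
        // Single worker thread: tasks run sequentially in submission order.
        ExecutorService single = Executors.newSingleThreadExecutor();
        // Pool that supports delayed and periodic execution.
        ScheduledExecutorService scheduled = Executors.newScheduledThreadPool(1);

        fixed.shutdown();
        cached.shutdown();
        single.shutdown();
        scheduled.shutdown();
    }
}
```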

2.2. Executor lifecycle

Because an executor is a service that runs asynchronously to the application, you need a mechanism to shut it down gracefully; otherwise the JVM will not exit while threads are still running.

ExecutorService extends Executor with several methods for managing the lifecycle of the Executor:

public interface ExecutorService extends Executor {
    void shutdown();
    List<Runnable> shutdownNow();
    boolean isShutdown();
    boolean isTerminated();
    boolean awaitTermination(long timeout, TimeUnit unit)
        throws InterruptedException;
    //  ... additional convenience methods for task submission
}
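A typical shutdown sequence, sketched here with an illustrative pool and time budget: shutdown() stops new submissions while letting queued tasks finish, and shutdownNow() is the harsher fallback if they overrun the wait:

```java
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

class GracefulShutdown {
    public static void main(String[] args) throws InterruptedException {
        ExecutorService exec = Executors.newFixedThreadPool(4);
        exec.execute(() -> System.out.println("working"));

        exec.shutdown();  // stop accepting new tasks, finish queued ones
        if (!exec.awaitTermination(5, TimeUnit.SECONDS)) {
            // Attempt to cancel running tasks; returns tasks never started
            List<Runnable> dropped = exec.shutdownNow();
            System.out.println(dropped.size() + " queued tasks never started");
        }
    }
}
```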

2.3. Delayed and periodic tasks.

Delayed tasks are tasks we ask to execute after a certain period has passed, while periodic tasks are tasks executed repeatedly at a fixed rate or interval.

Timer is the legacy facility for this kind of problem. Its drawbacks are:

  1. Timer uses only a single thread to execute all its tasks; if one task runs too long, it throws off the timing accuracy of the others.
  2. If a TimerTask throws an unchecked exception, the Timer thread dies and the Timer behaves as if it had been cancelled: already-scheduled tasks never run and new tasks cannot be scheduled. The OutOfTime class below demonstrates this:

import java.util.Timer;
import java.util.TimerTask;
import static java.util.concurrent.TimeUnit.SECONDS;

public class OutOfTime {
    public static void main(String[] args) throws Exception {
        Timer timer = new Timer();
        timer.schedule(new ThrowTask(), 1);  // kills the Timer thread
        SECONDS.sleep(1);
        timer.schedule(new ThrowTask(), 1);  // throws IllegalStateException
        SECONDS.sleep(5);
    }

    static class ThrowTask extends TimerTask {
        public void run() { throw new RuntimeException(); }
    }
}

ScheduledThreadPoolExecutor addresses these problems: it can use multiple worker threads, and an exception in one task does not prevent others from running. If you need to build a scheduling service of your own, look at DelayQueue, a BlockingQueue implementation that holds each element until its delay has expired.
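A sketch of the same schedule built on ScheduledThreadPoolExecutor instead of Timer; unlike the OutOfTime example, the throwing task does not kill the scheduling thread, so the second task still runs:

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

class InTime {
    public static void main(String[] args) throws InterruptedException {
        ScheduledExecutorService sched = Executors.newScheduledThreadPool(1);

        // The first task throws, but the exception is captured by its
        // Future and the worker thread survives...
        Runnable boom = () -> { throw new RuntimeException("boom"); };
        sched.schedule(boom, 1, TimeUnit.MILLISECONDS);
        TimeUnit.SECONDS.sleep(1);

        // ...so this second task still executes, unlike with Timer.
        sched.schedule(() -> System.out.println("still alive"), 1, TimeUnit.MILLISECONDS);
        TimeUnit.SECONDS.sleep(1);
        sched.shutdown();
    }
}
```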

3. Finding the exploitable parallelism

The more sensibly your work is divided into tasks that can run in parallel, the more you gain from concurrency. This section walks through an example to help you understand the idea of finding the exploitable parallelism.

3.1. Example: Sequential Page Renderer

Our task is to implement an HTML renderer: the input is an HTML document, which can contain images and text, and the output is an image buffer to be used by other components to render the page.

public class SingleThreadRenderer {
    void renderPage(CharSequence source) {
        renderText(source);
        List<ImageData> imageData = new ArrayList<ImageData>();
        for (ImageInfo imageInfo : scanForImageInfo(source))
            imageData.add(imageInfo.downloadImage());
        for (ImageData data : imageData)
            renderImage(data);
    }
}

A simple approach is to render as you go with one thread, like SingleThreadRenderer above. However, while an image is being downloaded from the network, the CPU sits idle, since this is an I/O-bound task; that time could be spent rendering the text instead.

3.2. Example: Page Renderer with Future

The next idea is to delegate downloading the images to an executor while the main thread renders the text. After the text is done, we retrieve the download results and render the images, as in the following FutureRenderer implementation.

public class FutureRenderer {
    private final ExecutorService executor = ...;
    void renderPage(CharSequence source) {
        final List<ImageInfo> imageInfos = scanForImageInfo(source);
        Callable<List<ImageData>> task =
                new Callable<List<ImageData>>() {
                    public List<ImageData> call() {
                        List<ImageData> result
                                = new ArrayList<ImageData>();
                        for (ImageInfo imageInfo : imageInfos)
                            result.add(imageInfo.downloadImage());
                        return result;
                    }
                };
        
        Future<List<ImageData>> future = executor.submit(task);
        renderText(source);
        try {
            List<ImageData> imageData = future.get();
            for (ImageData data : imageData)
                renderImage(data);
        } catch (InterruptedException e) {
            // Re-assert the thread's interrupted status
            Thread.currentThread().interrupt();
            // We don't need the result, so cancel the task too
            future.cancel(true);
        } catch (ExecutionException e) {
            throw launderThrowable(e.getCause());
        }
    }
}

This exploits some parallelism, but we can do considerably better. All images can be downloaded concurrently instead of one waiting for another.

3.3. Example: Page Renderer with CompletionService

CompletionService combines the functionality of an Executor and a BlockingQueue, allowing you to submit a collection of tasks to an Executor and call take to retrieve results as they become available. This saves you from keeping a list of Futures and repeatedly polling each one to check for a completed result, such as whether an image has finished downloading.

public class Renderer {
    private final ExecutorService executor;
    Renderer(ExecutorService executor) { this.executor = executor; }
    void renderPage(CharSequence source) {
        final List<ImageInfo> info = scanForImageInfo(source);
        CompletionService<ImageData> completionService =
                new ExecutorCompletionService<ImageData>(executor);
        for (final ImageInfo imageInfo : info)
            completionService.submit(new Callable<ImageData>() {
                 public ImageData call() {
                     return imageInfo.downloadImage();
                 }
            });
        renderText(source);
        try {
            for (int t = 0, n = info.size(); t < n; t++) {
                Future<ImageData> f = completionService.take();
                ImageData imageData = f.get();
                renderImage(imageData);
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        } catch (ExecutionException e) {
            throw launderThrowable(e.getCause());
        }
    }
}

The implementation above improves the renderer by creating a task for each image to be downloaded and submitting them all to the Executor's pool of threads. We can therefore download multiple images at the same time and render each one as it becomes available.

3.4. Placing time limits on tasks.

Sometimes, you need to limit the time a task may run before it is aborted. For example, when loading ads for your website, you might want to fall back to a default ad if a provider's ad takes too long to return.

Future supports this through the timed version of get, with cancel as the cancellation mechanism for the task. For example:

Page renderPageWithAd() throws InterruptedException {
    long endNanos = System.nanoTime() + TIME_BUDGET;
    Future<Ad> f = exec.submit(new FetchAdTask());
    // Render the page while waiting for the ad
    Page page = renderPageBody();
    Ad ad;
    try {
        // Only wait for the remaining time budget
        long timeLeft = endNanos - System.nanoTime();
        ad = f.get(timeLeft, NANOSECONDS);
    } catch (ExecutionException e) {
        ad = DEFAULT_AD;
    } catch (TimeoutException e) {
        ad = DEFAULT_AD;
        f.cancel(true);
    }
    page.setAd(ad);
    return page;
}

You can also use the timed version of invokeAll in ExecutorService. This method accepts a list of tasks and returns their Futures when all tasks have completed, the calling thread is interrupted, or the timeout expires; any tasks not complete when the timeout expires are cancelled. On return from invokeAll, each task has either completed normally or been cancelled; the client code can call get or isCancelled to find out which.
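A sketch of the timed invokeAll; the tasks and the one-second budget below are illustrative:

```java
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

class InvokeAllDemo {
    public static void main(String[] args) throws InterruptedException {
        ExecutorService exec = Executors.newFixedThreadPool(2);
        List<Callable<String>> tasks = List.of(
                () -> "fast",  // completes well within the budget
                () -> { TimeUnit.SECONDS.sleep(10); return "slow"; });

        // Wait at most one second for the whole batch; the slow task
        // is cancelled when the timeout expires.
        List<Future<String>> futures = exec.invokeAll(tasks, 1, TimeUnit.SECONDS);
        for (Future<String> f : futures)
            System.out.println(f.isCancelled() ? "cancelled" : "done");
        exec.shutdown();
    }
}
```

Futures come back in the same order as the submitted tasks, so you can match each result to its input.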

3.5. Limitations of Parallelizing Heterogeneous Tasks

A note on how you divide up tasks: parallelization pays off far more when tasks are homogeneous rather than heterogeneous, because the code stays much simpler and scales more easily. Imagine one task type receives more workload; instead of allocating more resources to just that task, you end up allocating more to the other task types too, which brings little benefit and may even hurt performance through the extra coordination overhead of more worker threads.
