Prior to Java 5, the Java Memory Model (JMM) suffered from weak memory visibility guarantees that rendered many popular concurrency idioms unsafe. The Double-Checked Locking pattern emerged in the late 1990s as a purported performance optimization for lazy initialization, but it contained a fatal flaw regarding instruction reordering. JSR-133 redefined the semantics of the volatile keyword in 2004 to provide acquire-release memory ordering, specifically to resolve such visibility issues without the overhead of full synchronization.
Without volatile, the JVM and underlying CPU architectures are permitted to reorder instructions such that the assignment of a reference to a variable occurs before the execution of the constructor completes. This creates a window where another thread can observe a non-null reference to an object whose fields contain default or uninitialized values, leading to unpredictable behavior or NullPointerException. The concurrency hazard is particularly insidious because it manifests only under specific timing conditions and hardware memory models, making it difficult to reproduce during testing.
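The hazard above can be sketched with a hypothetical BrokenSingleton class (an illustrative example, not from the original text) showing the classic broken form of double-checked locking without volatile:

```java
// Broken double-checked locking: 'instance' is NOT volatile, so the
// reference store may be reordered before the constructor's field writes.
public class BrokenSingleton {
    private static BrokenSingleton instance; // not volatile: unsafe publication

    private int state;

    private BrokenSingleton() {
        state = 42; // this write may become visible AFTER the reference below
    }

    public int getState() {
        return state;
    }

    public static BrokenSingleton getInstance() {
        if (instance == null) {                       // unsynchronized read
            synchronized (BrokenSingleton.class) {
                if (instance == null) {
                    // Conceptually three steps the JIT/CPU may reorder:
                    //   1. allocate memory
                    //   2. publish reference into 'instance'
                    //   3. run constructor body (write state = 42)
                    // If 2 precedes 3, another thread can see state == 0.
                    instance = new BrokenSingleton();
                }
            }
        }
        return instance;
    }
}
```

Single-threaded use always looks correct, which is exactly why the bug evades testing; only a concurrent reader racing the constructor can observe the default field values.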
Declaring the instance field as volatile inserts a memory barrier that establishes a happens-before relationship between the write in the constructor and any subsequent reads by other threads. This prevents the compiler and processor from reordering the write to the volatile field with the preceding writes in the constructor, ensuring that the object is fully constructed before its reference becomes visible. The pattern allows threads to check the reference without locking after initialization, providing both thread safety and high performance.
public class ConnectionPool {

    private static volatile ConnectionPool instance;

    private ConnectionPool() {
        // Heavy initialization
    }

    public static ConnectionPool getInstance() {
        if (instance == null) {
            synchronized (ConnectionPool.class) {
                if (instance == null) {
                    instance = new ConnectionPool();
                }
            }
        }
        return instance;
    }
}
A high-throughput microservice handling payment processing required a singleton ConnectionPool to manage JDBC connections to a PostgreSQL cluster. During peak traffic, thousands of threads simultaneously invoked getInstance() when the service first started, necessitating a thread-safe initialization strategy that minimized lock contention. The initialization sequence involved establishing TCP sockets, allocating direct byte buffers, and executing schema validation queries, making eager instantiation prohibitively expensive for auto-scaling scenarios.
Eager Initialization involved creating the pool in a static initializer block. This approach guaranteed thread safety through classloading mechanics and eliminated the need for synchronized blocks entirely. However, the connection establishment required three seconds of TCP handshakes and credential exchange, which violated the service level agreement for cold-start times during auto-scaling events.
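A minimal sketch of the eager approach, using a hypothetical EagerConnectionPool class (names are illustrative, not from the original service):

```java
public class EagerConnectionPool {
    // Created during class initialization; the JVM's classloading protocol
    // (JLS 12.4) guarantees this happens-before any thread's use of INSTANCE,
    // so no explicit synchronization is needed.
    private static final EagerConnectionPool INSTANCE = new EagerConnectionPool();

    private EagerConnectionPool() {
        // Heavy initialization runs at class-load time -- the cold-start
        // cost described above is paid up front.
    }

    public static EagerConnectionPool getInstance() {
        return INSTANCE;
    }
}
```

The trade-off is visible in the structure: getInstance() is trivially fast and lock-free, but the expensive constructor runs whether or not the pool is ever used.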
Synchronized Method wrapped the getInstance() method with the synchronized keyword. While this corrected the race condition by serializing all access, it introduced severe performance degradation under load. Profiling revealed that after initialization, threads spent needless cycles acquiring the monitor lock despite the immutable nature of the fully constructed pool, adding approximately 18 milliseconds of latency per call.
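The synchronized-method variant can be sketched as follows (hypothetical SynchronizedPool class, illustrative only):

```java
public class SynchronizedPool {
    private static SynchronizedPool instance;

    private SynchronizedPool() {
        // Heavy initialization
    }

    // Correct but slow: every call, even long after initialization,
    // must acquire and release the class monitor.
    public static synchronized SynchronizedPool getInstance() {
        if (instance == null) {
            instance = new SynchronizedPool();
        }
        return instance;
    }
}
```

Because the monitor serializes all callers, the field needs no volatile modifier here; the cost is that the steady-state read path pays for synchronization it no longer needs.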
Double-Checked Locking with volatile was selected as the optimal approach. This solution used an unsynchronized fast path to check for null, followed by a synchronized block for the critical section, with a second null check inside to prevent multiple instantiations. The volatile modifier ensured that the fully initialized pool state was visible to all CPU cores immediately upon publication, balancing lazy initialization with zero-lock overhead after startup.
The chosen solution resulted in successful lazy initialization without blocking, allowing the service to handle 50,000 requests per second with sub-millisecond response times after the initial pool creation. The implementation eliminated race conditions during startup while maintaining lock-free access during steady-state operations, preventing the observed NullPointerException instances that previously occurred under high-concurrency scenarios. Monitoring confirmed that the JVM correctly handled the memory visibility across all 64 cores without explicit synchronization after the singleton was established.
Why does the double-checked locking pattern require two distinct null checks rather than a single synchronized check?
The first check operates outside the synchronized block to provide a fast, lock-free path for the common case where the instance already exists. The second check inside the synchronized block is essential because multiple threads can simultaneously pass the first null check when the instance is still uninitialized. Without this second verification, each thread would sequentially acquire the lock and create separate instances, violating the singleton property. The inner check ensures that only the first thread to enter the critical section performs construction, while subsequent threads discover the instance already initialized and skip creation.
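To make the failure mode concrete, here is a sketch (hypothetical NaiveSingleton class, with a counter added purely for illustration) of double-checked locking with the inner check omitted:

```java
import java.util.concurrent.atomic.AtomicInteger;

// Missing the second null check: every thread that raced past the outer
// check constructs its own instance once it acquires the lock, and each
// overwrites the previous one.
public class NaiveSingleton {
    public static final AtomicInteger constructed = new AtomicInteger();
    private static volatile NaiveSingleton instance;

    private NaiveSingleton() {
        constructed.incrementAndGet(); // counts constructor invocations
    }

    public static NaiveSingleton getInstance() {
        if (instance == null) {
            synchronized (NaiveSingleton.class) {
                // BUG: no 'if (instance == null)' here. Under contention,
                // 'constructed' can exceed 1, violating the singleton property.
                instance = new NaiveSingleton();
            }
        }
        return instance;
    }
}
```

With many threads passing the outer check before any construction completes, the counter can exceed one; restoring the inner check makes construction happen exactly once.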
How does the Java Memory Model distinguish between the visibility guarantees of a volatile write and a synchronized block exit?
Both constructs establish happens-before relationships, but at different granularities and costs. A monitor release at a synchronized block exit makes every write the thread performed before the release visible to any thread that subsequently acquires the same monitor. A volatile write acts as a release for all writes that precede it in program order: a thread that subsequently reads the same volatile variable performs an acquire and is guaranteed to see those preceding writes. Prior to Java 5, volatile lacked these release-acquire semantics, making it insufficient for safe publication; the modern JMM treats volatile writes like C++ release operations and volatile reads like acquire operations, providing targeted visibility without the full cost of monitor locking.
Can immutable objects eliminate the need for volatile in the double-checked locking pattern?
Only in a narrow and fragile sense. Under the JSR-133 final field semantics (JLS 17.5), if every field of the object is final and the constructor does not let the this reference escape, then any thread that observes a reference to the object, even through a data race, is guaranteed to see its final fields fully initialized. For such a fully immutable object, double-checked locking happens to be safe even without volatile. In practice this exception is dangerous to rely on: a single non-final field reintroduces the partially-constructed-object hazard, and routine maintenance can silently break the invariant. For mutable objects such as a connection pool, instruction reordering can still publish the reference before the constructor's writes are visible, so safe publication requires volatile or synchronized to establish the happens-before relationship between construction and visibility.