Answer to the question.

History of the question

Before C++20, the C++ compilation model relied on textual inclusion through preprocessor directives. When a header file was included, the preprocessor literally copied the text of that header into the including file. This mechanism caused macros defined in headers to leak into the global namespace of every translation unit that included them, leading to subtle bugs and name collisions that were difficult to diagnose.

The problem

Macro leakage created maintenance nightmares in large codebases. A macro defined in a third-party library could silently redefine keywords or common identifiers in consumer code, causing compilation failures or runtime errors that appeared unrelated to the actual cause. Traditional workarounds like #undef guards were manual, error-prone, and did not scale across complex dependency graphs. The fundamental issue was that the preprocessor had no concept of scope or interface boundaries.

The solution

C++20 modules introduce a semantic import mechanism that operates at the language level rather than the preprocessor level. When importing a module with import module_name;, the compiler processes the module's exported interface without executing preprocessor directives from the importing translation unit. Macros defined within the module remain private to that module's implementation unless explicitly exported. This property ensures that macros do not leak across translation unit boundaries, providing true encapsulation and preventing name pollution.

// mathlib.cpp (Module implementation)
module;
#define INTERNAL_CALC_FACTOR 3.14  // Private macro, not leaked

export module mathlib;

export double compute(double x) {
    return x * INTERNAL_CALC_FACTOR;
}

// main.cpp (Consumer)
import mathlib;

// INTERNAL_CALC_FACTOR is NOT visible here
// #ifdef INTERNAL_CALC_FACTOR would be false

int main() {
    double result = compute(10.0);  // Works fine
}

Situation from life

A financial trading firm maintained a large codebase with millions of lines of code across hundreds of modules. They relied on a legacy math library that defined macros like MIN and MAX in its public headers. These macros frequently collided with standard library functions and third-party JSON parsing libraries that used min and max as variable names or function templates.

The first approach considered was wrapping all third-party headers with #pragma once style guards and manually #undefing problematic macros after each include. This required developers to remember which headers defined which macros and to clean up after each inclusion. The approach was fragile because missing a single #undef could cause failures in unrelated parts of the codebase. It also significantly increased compilation times due to the preprocessor processing the same header text repeatedly across translation units.

The second approach considered was converting the math library to use inline functions and templates instead of macros. While this solved the leakage problem, it required modifying the legacy library extensively. The math library was used by multiple teams and changing it risked breaking existing calculations that relied on specific macro evaluation semantics or side effects. The refactoring effort was estimated to take six months and was deemed too risky for the trading platform.

The chosen solution was migrating to C++20 modules. The team converted the math library into a module that exported mathematical functions while keeping macros internal to the module implementation. By using import mathlib; instead of #include <mathlib.h>, consuming translation units no longer saw the MIN and MAX macros. This approach required minimal changes to the library implementation—only adding export statements and converting headers to module interface units. The migration took two weeks instead of six months. The result was the elimination of macro-related name collisions across the codebase and a 15% reduction in compilation times due to the module's compiled interface.

What candidates often miss

How does the module interface unit's compiled binary format prevent macro leakage compared to textual header inclusion?

Candidates often miss that C++20 modules produce compiled module interface units (CMI) that are binary representations of the module's exported interface. Unlike textual headers that are processed by the preprocessor and contain macro definitions as text, CMIs store semantic information about exported functions, types, and templates. The preprocessor does not process the contents of an imported module; it only sees the import declaration. Therefore, macros defined in the module's implementation or even in its interface unit are not visible to the importer. This is fundamentally different from #include, which literally copies text including #define directives. Understanding this requires recognizing that modules shift from a textual inclusion model to a semantic import model.

Why do macros exported from a module using export import behave differently than macros from #include directives?

Candidates frequently confuse export import of macros with regular macro behavior. While C++20 allows exporting macros using export import, these macros only affect code that imports the module and do not leak beyond that import scope. Unlike #include where macros persist in the translation unit until explicitly undefined or the end of the file, exported macros from modules are scoped to the importing translation unit's exposure to that module. Furthermore, if multiple modules export conflicting macros, the conflict is detected at import time rather than causing silent redefinition errors later in compilation. This scoping behavior provides the hygiene that textual inclusion lacks.

How does the module's independence from the preprocessor affect build system integration and dependency scanning?

Candidates often miss that C++20 modules require build systems to understand module dependencies before compilation begins, unlike headers where dependencies are discovered during compilation. Because modules are compiled units rather than text files, the build system must parse module interface units to determine what they export and what they import. This requires a two-phase build process: first, scanning module interface units to build a dependency graph, then compiling in dependency order. The preprocessor independence means that traditional #ifdef guards for header inclusion are irrelevant, and macro-based configuration of module interfaces is limited. Build systems must track compiled module artifacts (BMI - Binary Module Interface) rather than just source files, fundamentally changing how dependency tracking and incremental builds work.

What specific property of C++20 modules eliminates macro leakage across translation unit boundaries?