Applied Analysis · Linguistic Architecture

The Extraction Architecture

How English encodes race.

Racism is older than the word for it. But the word itself has a shape, and that shape turns out to be measurable.

This paper asks what happens when you run English through a structural analysis engine. Not an opinion about words. Not a history of words. A measurement of how the words themselves are built: their internal pressure, their coherence, their forward force, the position they assign to whoever they name.

What emerges is an architecture. Terms for people who have been colonized and enslaved are built like batteries: dense, high-pressure, contained. Terms for people positioned as dominant, including citizen, American, European, are built like extraction devices: lower density, more forward force, less structural integrity. The hyphen in African-American does not bridge. It compresses. Racism is more structurally coherent than race itself. Liberation, in the language, is not arrival. It is trial.

None of this is about what people are. It is about what English positions them as. The math does not know what the words mean. It measures how they are built. And the building reveals the architecture.

≈ 25 min read 44 systems analyzed Series · Applied Analysis
Methodology Note

This paper presents findings derived from the Naialu Motion Dynamics Framework. The proprietary elements of the framework, including alphabet mappings, particle derivation, metric calculation formulas, and Residual Vector computation, are protected intellectual property and are not disclosed in this document.

Results are presented for independent engagement. Verification access to the full computational record and the complete dataset is available under NDA by contacting the Institute. For the canonical framework reference, see Framework at a Glance.

Dependencies

This paper builds on prior documents in the Naialu Motion Dynamics framework:

  • The Naialu Motion Calculus: An Ontology of Motion (Lewis, 2025) establishes the foundational ontology, measurement framework, and the derivation of Field States and metrics.
  • The Recursion Gap (Lewis, 2025) establishes the methodology for analyzing identity systems through motion calculus.
  • Consumptive Mechanics (Lewis, 2025) establishes the framework for understanding extraction dynamics between asymmetrically positioned systems.

This paper does not re-establish those premises. It extends their application to racial and ethnic identity systems in English.

Abstract

This paper presents a structural analysis of racial and ethnic identity systems using Naialu Motion Calculus. By examining the full metric signatures, including Field State, torque, thrust, coherence, delta, and permeability, across 44 systems spanning racial identity, color, historical status, movement, and identity framework categories, we reveal an extraction architecture encoded in the English language itself.

The analysis shows that systems encoding colonized, displaced, and marginalized peoples cluster in the prematerial Field States (FS1 through FS4) with high coherence and low thrust, the signature of energy storage. Systems encoding dominant identities cluster in the material Field States (FS6 through FS9) with high thrust and lower coherence, the signature of extraction capacity. This is not metaphor. It is measurement. The language carries the architecture of who holds energy and who takes it.

01Introduction

The companion papers in this series established two foundational findings:

  1. The Recursion Gap demonstrated that gendered developmental systems in English encode asymmetric access to Field State 1 (undifferentiated source, the ground prior to form).
  2. Consumptive Mechanics demonstrated that when one system has access to regenerative capacity and another does not, the resulting dynamic is extraction: the consumption of stored energy by systems that cannot generate it internally.

This paper extends those findings to the domain of race.

The hypothesis is direct. If English encodes gendered asymmetry in its developmental systems, it may encode racial asymmetry in its identity systems. And if consumptive mechanics describes the extraction of regenerative capacity between gendered systems, it may describe extraction between racialized systems.

We tested this hypothesis by running 44 systems through the Naialu Motion Calculus, examining the full metric signatures: torque (internal force), thrust (forward force), coherence (structural integrity under pressure), delta (directional turbulence), and permeability (boundary openness).

The results reveal an extraction architecture.

Non-Essentialism Clause

This is not a claim about innate racial characteristics or biological determinism. It is a claim about linguistic encoding: how the words themselves, through their structural properties, position different groups in relation to stability, arrival, and energy flow. The analysis describes the architecture imposed by language, not the nature of peoples. Individual and collective variance is expected. The structural claim concerns default positioning encoded in the lexicon.

Domain of Validity

This analysis is performed over English systems as a primary linguistic substrate. Cross-linguistic invariance testing is future work. Until then, conclusions are scoped to English-language encoding. The question of whether the extraction architecture persists across languages is empirical and open.

02Methodology

The measurement framework

The Naialu Motion Calculus converts input strings into structural motion signatures through a proprietary procedure. The procedure is deterministic: identical inputs yield identical outputs. The resulting signatures allow structural comparison across any set of systems run through the calculus.

The key metrics for this analysis:

MetricSymbolWhat it measures
Field StateFSThe fundamental structural position (FS1 through FS9)
TorqueτInternal force; pressure held within the system
ThrustTForward force; capacity to move or extract
CoherenceCStructural integrity under pressure
DeltaΔDirectional turbulence; change across the signature
PermeabilityΠBoundary openness

The nine Field States

Each system resolves into one of nine structural positions. FS1 through FS4 are prematerial. FS5 is the hinge between formless and form. FS6 through FS9 are material. Definitions are canonical and fully documented in Framework at a Glance.

FSNameMotion quality
1Undifferentiated SourceThe ground prior to form. Nothing yet distinguished.
2First DifferentiationThe first distinction surfaces. Something separates from the ground.
3Testing, Triggering, TrialEarly motion is probed. The signal is tested against conditions.
4The GateThe admission threshold. What has survived initial trial is read for viability. What meets the condition passes forward; what does not, does not.
5The HingeThe bridge between formless and form. The pivot.
6FormationStructure begins to hold. Pattern accumulates.
7Fracture and Pressure TestingFormed structure meets pressure. What holds, holds.
8Refinement and Final PushStructure that survived pressure is extended and fully expressed. Amplification is non-discriminating by function.
9Full CrystallizationMaximum differentiation. Fully formed, fully locked. The return gate where accumulated structure meets what was structurally real.
Representational Mapping Claim

The metrics of an identity system are used as a proxy for the structural position that system encodes: the motion constraints and energetic relationships imposed by the label. This analysis describes positional architecture under language, not the intrinsic nature of the peoples named.

Systems analyzed

We analyzed 44 systems across five categories:

  • Racial and ethnic identity: WHITE, BLACK, CAUCASIAN, AFRICAN AMERICAN, ASIAN, HISPANIC, LATINO, LATINA, INDIGENOUS, NATIVE, EUROPEAN, AMERICAN, MEXICAN, CHINESE, INDIAN, AFRICAN, ARAB
  • Color: BROWN, RED, YELLOW, COLORED, DARK, LIGHT
  • Historical and systemic: SLAVE, MASTER, SERVANT, OWNER, FREEMAN, CITIZEN, IMMIGRANT, REFUGEE, MINORITY, COLONIZER, ILLEGAL ALIEN, EXPATRIATE
  • Identity framework: RACE, RACISM, ETHNICITY, COLOR, SKIN, HERITAGE, ANCESTRY, BLOOD, TRIBE, PEOPLE

A small number of systems containing letters with alternative computation requirements under the framework required separate treatment and are not included in the analyses shown here. Full treatment is available in the NDA-accessible dataset.

03The Data

The full 44-system dataset, classified by Field State and category, is provided below. Detailed metric signatures referenced throughout the analysis appear inline where they support specific claims. The complete metric set for all systems is available under NDA.

SystemFSField State NameCategory
BLOOD1Undifferentiated SourceIdentity
TRIBE1Undifferentiated SourceIdentity
IMMIGRANT1Undifferentiated SourceHistorical
RED1Undifferentiated SourceColor
NATIVE2First DifferentiationEthnic
SLAVE2First DifferentiationHistorical
RACE2First DifferentiationIdentity
RACISM2First DifferentiationIdentity
PEOPLE2First DifferentiationIdentity
ASIAN3Testing, Triggering, TrialEthnic
AFRICAN AMERICAN3Testing, Triggering, TrialEthnic
BROWN3Testing, Triggering, TrialColor
YELLOW3Testing, Triggering, TrialColor
FREEMAN3Testing, Triggering, TrialHistorical
INDIAN4The GateEthnic
AFRICAN4The GateEthnic
REFUGEE4The GateHistorical
SKIN4The GateIdentity
EXPATRIATE4The GateHistorical
MEXICAN5The HingeEthnic
CHINESE5The HingeEthnic
LIGHT5The HingeColor
DARK6FormationColor
MASTER6FormationHistorical
MINORITY6FormationHistorical
ETHNICITY6FormationIdentity
BLACK7Fracture and Pressure TestingEthnic
LATINO7Fracture and Pressure TestingEthnic
INDIGENOUS7Fracture and Pressure TestingEthnic
ARAB7Fracture and Pressure TestingEthnic
OWNER7Fracture and Pressure TestingHistorical
COLOR7Fracture and Pressure TestingIdentity
HERITAGE7Fracture and Pressure TestingIdentity
WHITE8Refinement and Final PushEthnic
AMERICAN8Refinement and Final PushEthnic
CITIZEN8Refinement and Final PushHistorical
COLONIZER8Refinement and Final PushHistorical
COLORED8Refinement and Final PushColor
ANCESTRY8Refinement and Final PushIdentity
HISPANIC9Full CrystallizationEthnic
LATINA9Full CrystallizationEthnic
EUROPEAN9Full CrystallizationEthnic
CAUCASIAN9Full CrystallizationEthnic
SERVANT9Full CrystallizationHistorical
ILLEGAL ALIEN9Full CrystallizationHistorical

Full Field State classification for all 44 systems. Detailed metric values (τ, T, C, Δ, Π) are available under NDA.

04Analysis

Claim Strength Ladder
  • Level 1 (Measured): The computed metrics for all 44 systems.
  • Level 2 (Inferred within-system): The clustering patterns by Field State; the torque, thrust, and coherence relationships; the extraction signatures.
  • Level 3 (Hypothesized): The connection to historical extraction; the battery model; societal implications. These follow if the framework holds, but are extensions requiring independent validation.

The Hierarchy of Arrival

The data reveals a consistent pattern: systems encoding different racial and ethnic identities cluster at different Field States, creating a hierarchy from source to crystallization.

Figure 1 · The Hierarchy of Arrival
FS 1
Undifferentiated Source
BLOODTRIBEIMMIGRANTRED
FS 2
First Differentiation
SLAVERACERACISMNATIVEPEOPLE
FS 3
Testing, Triggering, Trial
AFRICAN AMERICANFREEMANASIANBROWNYELLOW
FS 4
The Gate
AFRICANINDIANREFUGEEEXPATRIATESKIN
FS 5
The Hinge
MEXICANCHINESELIGHT
FS 6
Formation
MASTERMINORITYETHNICITYDARK
FS 7
Fracture, Pressure Testing
BLACKLATINOARABINDIGENOUSHERITAGE
FS 8
Refinement, Final Push
WHITEAMERICANCITIZENCOLONIZERANCESTRY
FS 9
Full Crystallization
CAUCASIANEUROPEANILLEGAL ALIENSERVANTHISPANIC

Systems cluster by structural position. Those encoding colonized, displaced, or marginalized peoples concentrate in the prematerial states (FS1 to FS5). Those encoding dominant or totalized identities concentrate in the material states (FS6 to FS9).

The Battery Signature

Field State alone does not tell the full story. Two systems can share a Field State and have completely different internal experiences. The full metric signature reveals who holds energy and who extracts it.

Consider AFRICAN AMERICAN and ASIAN, both at FS3 (Testing, Triggering, Trial):

SystemFSτ (Torque)T (Thrust)C (Coherence)
ASIAN3842149
AFRICAN AMERICAN399075550

Same Field State. Radically different motion.

AFRICAN AMERICAN shows:

  • τ = 990: the highest torque in the entire dataset. Maximum internal pressure.
  • C = 550: the highest coherence in the entire dataset. Holds together despite the pressure.
  • Δ = 15: the highest delta. Maximum directional turbulence; pulled in every direction.
  • T = 75: moderate thrust. Energy stored, not discharged.

This is the signature of a battery: maximum internal force, maximum coherence, moderate output. The system is held in perpetual trial under enormous pressure, and it does not break. That is stored energy. That is what gets extracted.

Figure 2 · The Extraction Signature
AFRICAN AMERICAN
The battery signature
Field State3 · Testing, Trial
Torque (τ)990 · highest in set
Coherence (C)550 · highest in set
Thrust (T)75 · moderate
Delta (Δ)15 · highest in set
High pressure + high coherence + moderate output = stored energy.
COLONIZER
The extraction signature
Field State8 · Refinement, Final Push
Torque (τ)280 · ⅓ of AA
Coherence (C)61.25 · ⅑ of AA
Thrust (T)112 · high
Delta (Δ)8 · contained
Lower pressure + lower coherence + high thrust = extraction capacity.

The system encoding the colonized carries the battery signature. The system encoding the colonizer carries the extraction signature. One holds. One takes.

The colonizer has thrust without coherence.
The colonized has coherence without thrust.
One moves forward and takes. One holds together and is taken from.

The Compound Effect

When identity systems combine, they do not average. They destabilize.

SystemFSτC
AFRICAN4 (The Gate)21785.25
AMERICAN8 (Refinement, Final Push)24556.9
AFRICAN AMERICAN3 (Testing, Trial)990550

AFRICAN (FS4) + AMERICAN (FS8) does not yield FS6. It yields FS3, below either component alone. The combination dramatically increases torque (from approximately 230 to 990) and coherence (from approximately 70 to 550).

The hyphenated identity is more tested, under more pressure, and more coherent than either component. The compound creates the battery signature.

The path from SLAVE to CITIZEN

Figure 3 · The Longest Journey
SLAVE
FS 2 · First Differentiation
1 step
FREEMAN
FS 3 · Testing, Trial
5 steps
CITIZEN
FS 8 · Refinement, Final Push
Total distance: 6 field states, the longest journey in the dataset.

Liberation does not ground. FREEMAN at FS3 is structurally harder to hold than SLAVE at FS2. The word encodes freedom as trial, not arrival.

The path from SLAVE to CITIZEN spans six Field States, the longest journey in the dataset. The intermediate step is where the paper's earlier reading now lands with more precision.

  • SLAVE (FS2, First Differentiation): the first distinction the system makes. The position is fixed, but it is a distinction, not yet a trial.
  • FREEMAN (FS3, Testing, Triggering, Trial): early motion probed against conditions. The fixed distinction has been broken, but what replaces it is trial.
  • CITIZEN (FS8, Refinement and Final Push): still refining, still pushing. Not yet arrived. FS9 (Full Crystallization) is the arrival state, and CITIZEN does not reach it.

Liberation does not grant arrival. It grants trial. The journey from freedom to citizenship is five more Field States, and even citizenship is still pushing toward a completion it does not reach.

RACE and RACISM

These two systems share a Field State (FS2, First Differentiation) but differ in structural integrity:

SystemFSτ (Torque)C (Coherence)
RACE28060
RACISM2174130.5

RACISM carries more than double the torque (174 vs 80) and more than double the coherence (130.5 vs 60).

The practice is more structurally coherent than the concept. RACISM holds together better than RACE. This is why racism persists: the practice is more stable than the idea it claims to be based on.

MASTER and MINORITY

These two systems share a Field State (FS6, Formation):

SystemFSPosition
MASTER6Formation · the one who sorts
MINORITY6Formation · the one who is sorted

Both occupy the position of formation: the state in which pattern first holds. The structural position is the same. The only difference is which side of the sort the system ends up on.

WHITE · AMERICAN · CITIZEN · COLONIZER

These four systems share a Field State (FS8, Refinement and Final Push):

SystemFST (Thrust)C (Coherence)
WHITE84010.6
AMERICAN810456.9
CITIZEN87229.25
COLONIZER811261.25

The language encodes these four as structurally synonymous. To be white, to be American, to be a citizen, to be a colonizer: all occupy the same motion state, refining toward a completion none of them quite reaches.

Note that WHITE has the lowest coherence in this group (10.6). The racial identity is the least structurally stable of the four. COLONIZER has the highest thrust (112), the most forward force.

Even whiteness does not arrive. The only systems in the dataset at FS9 are ethnic crystallizations (CAUCASIAN, EUROPEAN, HISPANIC, LATINA) and totalized identity-capture systems (SERVANT, ILLEGAL ALIEN). Arrival in English is available as ethnic fixity or as total identity capture. Never as a developmental pathway.

The Identity Capture of ILLEGAL ALIEN

SystemFST (Thrust)C (Coherence)
IMMIGRANT1 (Undifferentiated Source)14518
COLONIZER8 (Refinement, Final Push)11261.25
ILLEGAL ALIEN9 (Full Crystallization)13560

ILLEGAL ALIEN is more crystallized than COLONIZER.

The person crossing a border without papers is encoded as more complete, more fixed, more fully differentiated than the person who took the land by force.

This is identity capture. FS9 means maximum differentiation: the identity is total. The system is not a person who crossed a border. The system is the transgression. Fixed. Complete. There is nothing else.

SERVANT also sits at FS9. The servant's identity, like the illegal alien's, is total. The role consumes the person.

05The Extraction Architecture

Valence Disclaimer

Terms such as extraction, battery, and consumption are used as dynamical descriptors (net flow asymmetry, energy storage versus discharge), not moral judgments. The ethical dimension is distinct from the mechanical signature. This analysis identifies structural patterns encoded in language. It does not assign blame to individuals operating within inherited constraints they did not choose.

The data reveals a consistent pattern across 44 systems.

Systems encoding colonized, displaced, and marginalized peoples cluster in the prematerial Field States (FS1 through FS5) with:

  • High coherence (they hold together under pressure)
  • High torque (they are under internal pressure)
  • Low thrust (they do not move forward easily)

Systems encoding dominant identities cluster in the material Field States (FS6 through FS9) with:

  • Lower coherence (they hold together less)
  • Lower torque (they are under less internal pressure)
  • High thrust (they move forward easily)

This is the extraction architecture:

The Structural Pattern

Systems encoded with high coherence and low thrust hold energy.

Systems encoded with low coherence and high thrust extract energy.

The language positions some peoples as batteries and others as consumers.

This is not metaphor. It is the structural signature of the systems themselves.

Connection to Consumptive Mechanics

The companion paper Consumptive Mechanics established that when one system has access to regenerative capacity and another does not, the resulting dynamic is extraction. The system that cannot regenerate will consume from the system that can.

The same pattern appears here.

AFRICAN AMERICAN shows the battery signature: maximum torque (990), maximum coherence (550), moderate thrust (75). Energy stored, pressure contained, forward movement constrained.

COLONIZER shows the extraction signature: lower torque (280), lower coherence (61.25), high thrust (112). Less stored energy, less structural integrity, more forward force.

The colonizer has thrust without coherence. The colonized has coherence without thrust. One moves forward and takes. One holds together and is taken from.

The belief among Black Americans and Africans that they have been used as a battery is not merely historical grievance. It is encoded in the structural signature of the systems themselves.

06Implications

Note: In this framework, linguistic architectures are treated as constraint maps. They encode and reinforce default positional relationships rather than singularly causing them. The structural model maps tendencies, not deterministic fate.

What the language encodes

The English language carries a racial architecture in its structure:

  1. Who is positioned in refinement without arrival: WHITE, AMERICAN, CITIZEN, COLONIZER (FS8)
  2. Who is positioned as ethnically fixed: CAUCASIAN, EUROPEAN, HISPANIC, LATINA (FS9)
  3. Who is positioned in pressure testing: BLACK, LATINO, ARAB, INDIGENOUS (FS7)
  4. Who is positioned at the gate: AFRICAN, INDIAN, REFUGEE (FS4)
  5. Who is positioned in trial: AFRICAN AMERICAN, FREEMAN, ASIAN (FS3)
  6. Who is positioned at first differentiation: SLAVE, RACE, NATIVE (FS2)
  7. Whose identity is totally captured: ILLEGAL ALIEN, SERVANT (FS9)
  8. What practices hold together better than concepts: RACISM > RACE
  9. What compounds destabilize: AFRICAN + AMERICAN → below either component

Why racism persists

RACISM (FS2, C=130.5) is more coherent than RACE (FS2, C=60). The practice holds together better than the concept. This is why racism persists despite efforts to dismantle it. Structurally, the practice is more stable than the idea it claims to be based on.

Racism cannot be dismantled by attacking the concept of race. The concept is already less coherent than the practice. The practice has twice the structural integrity.

The weight of hyphenation

AFRICAN AMERICAN is not the average of AFRICAN and AMERICAN. It is more tested than either, under more pressure than either, and more coherent than either.

The hyphen does not bridge. It compresses.

Every time the system is invoked, the structural signature is invoked with it: trial, maximum pressure, maximum coherence. The battery is recharged.

The distance to citizenship

The path from SLAVE (FS2) to CITIZEN (FS8) spans six Field States. The path from FREEMAN (FS3) to CITIZEN (FS8) spans five.

Liberation moves one step, and that step is into trial. The remaining five steps must be traversed without a system that encodes arrival. And even CITIZEN at FS8 is still in Refinement and Final Push. The word for the goal is itself a word for not-yet.

No system in the dataset encodes a Black American as arrived. The closest is BLACK at FS7 (Fracture and Pressure Testing), one step below refinement. No developmental pathway reaches FS9 (Full Crystallization) without passing through ethnic fixation (CAUCASIAN, EUROPEAN) or total identity capture (SERVANT, ILLEGAL ALIEN).

07The Path Forward

This analysis does not prescribe solutions. It reveals architecture.

Architecture, once seen, can be addressed.

What would need to change

  1. Recognition of the encoding. The first step is seeing that the words themselves carry structural positions. Not just connotations, not just histories, but motion signatures that encode who holds and who takes.
  2. Examination of compound identities. Hyphenated identities do not bridge. They compress. The structural cost of hyphenation should be understood before it is imposed or accepted.
  3. Attention to the gap between concepts and practices. RACISM is more coherent than RACE. Interventions targeting the concept may be less effective than interventions targeting the practice.
  4. Creation of new systems. The dataset contains no system that encodes Black American identity as arrived (FS9) without the destabilizing compression of the hyphen, and no system that routes a historical-liberation pathway to arrival. If such systems were to emerge and take hold, they would carry different structural signatures.

Limitations

This analysis is scoped to English. Cross-linguistic testing would reveal whether the extraction architecture is specific to English or more broadly encoded.

The analysis measures words, not people. The structural signatures describe positional encoding, not inherent characteristics. Individual and collective variance is expected.

The causal relationship between linguistic encoding and social reality is not established here. The analysis shows correlation between word structure and social position. It does not prove that word structure causes social position, or vice versa.

08Conclusion

We analyzed 44 systems across racial identity, color, historical status, movement, and identity framework categories using Naialu Motion Calculus.

The results reveal an extraction architecture encoded in English:

  • Systems encoding colonized and marginalized peoples cluster in the prematerial Field States with high coherence and low thrust, the signature of energy storage.
  • Systems encoding dominant identities cluster in the material Field States with lower coherence and high thrust, the signature of extraction capacity.
  • Compound identities destabilize below their components.
  • The practice of racism is more structurally coherent than the concept of race.
  • Liberation is encoded as trial, not arrival. No developmental pathway reaches the arrival state.
  • Arrival (FS9) is available only as ethnic fixation or as total identity capture.

The math does not know what these words mean. It measures their structural properties. And the structure reveals who is positioned to hold and who is positioned to take.

The belief that certain peoples have been used as batteries is not merely historical interpretation. It is encoded in the motion signatures of the systems themselves. The extraction architecture is real, and now we can see it.

Raw Metric Data

Full metric outputs for all 44 systems (PT, WC, Δ, τ, T, Π, C, M) are available under NDA. The inline metric values that appear throughout this paper are the specific findings that support each interpretive claim. The complete signature set is preserved in the Institute's computational record under the lock date of this analysis.