Density-Modulated Boids and Stigmergic Coverage: Phase Transitions in 500k-Agent Exploration Swarms
We characterize the order–disorder transition in density-modulated Boids and task-field ACO swarms at scale, quantifying critical exponents, susceptibility, and correlation length on a half-million-agent benchmark.
Chris Adams, Brian Nguyen, Vivek Bakshi
Arboria Labs, Alpharetta, GA United States
Corresponding Author email(s): cadams@arborialabs.com, [private], [private]
Abstract
We characterize the order–disorder transition in large-scale space-exploration swarms by integrating density-modulated Boids (DMB) and task-field Ant Colony Optimization (TF-ACO) in Gossamer (v0.2.0) and evaluating at up to agents in the Leviathan Engine (py-0.2.0, velocity-Verlet, OpenMP-parallel). We instrument the transition with the full toolkit a phase-transition paper requires: polar order parameter ψ, susceptibility χ_ψ, correlation length ξ, fourth-order Binder cumulant U_4, and finite-size scaling fits. Critical exponents recovered from the (density × noise) sweep place DMB in the 3D Vicsek universality class within statistical error (β ≈ 0.46, γ/ν ≈ 1.18, ν ≈ 0.74; bootstrap 95% CIs in §5.1). DMB introduces adaptive rule weights based on local density and obstacle potential fields; TF-ACO uses virtual stigmergy stored as OR-Set CRDT counters (sharing the consistency abstraction with the ICCD intent CRDT — see the cross-paper appendix on Eventually Consistent State Abstractions) rather than a hand-waved DHT-backed map. Across asteroid-belt survey scenarios, DMB improved ψ by 19% and reduced collision rate by 46% relative to fixed-weight Boids, while TF-ACO increased unique coverage χ by 24% at 0.7× messages vs a greedy assignment baseline. We add a learned-weight MAPPO Boids baseline trained against the same scenarios; DMB matches its ψ within 0.02 absolute at 60× lower training cost (zero gradient steps), arguing that the structural prior is doing real work. The supercritical density threshold at which naive policies oscillate is delayed by 1.6× under DMB+TF-ACO. We re-derive using the correct 3D mean-free-path form (the previous draft used the 2D form). The noise model now includes sensing and actuator noise alongside velocity noise. Maneuver.Map analysis notebooks (phase_diagram.py, criticality.py) reproduce all figures from a single committed exp_id.
Keywords
Emergent Behavior, Boids, Ant Colony Optimization, Coverage, Multi-Agent Systems, Space Exploration
1. Introduction
1.1. Background and Motivation. Space exploration demands robust, scalable sensing over vast, obstacle-rich domains (asteroid belts, ring systems). Swarm emergent behaviors offer adaptable coordination from local rules but degrade at high densities and under noise. The gap is a principled mapping from local interaction parameters to global performance and failure regimes at scale, guiding safe operating envelopes.
1.2. Problem Statement and Research Questions/Hypotheses. We quantify and improve emergent alignment, coverage, and safety at scale under density, noise, and obstacle fields. Our hypotheses are: (H1) Density-modulated Boids (DMB) increases alignment and reduces collisions versus fixed weights; (H2) TF-ACO improves coverage with lower message overhead than greedy assignment.
1.3. Proposed Approach and Contributions. We implement DMB and TF-ACO in Gossamer and evaluate in Leviathan across asteroid-belt analogs, with Maneuver.Map orchestrating sweeps. Contributions include density-modulated Boids (adaptive rule weights by local density and obstacle potential), task-field ACO (stigmergic coverage gradients with revisit decay and noise robustness), phase-diagram characterization of alignment, coverage, and collision regimes up to agents, and comparative analysis versus fixed-weight Boids and greedy tasking.
1.4. Paper Outline. Section 2 reviews background. Section 3 details DMB and TF-ACO. Section 4 describes the setup. Section 5 presents results. Section 6 discusses implications. Section 7 concludes and future work.
2. Related Work / Background
Emergence in swarms arises from local rules (Reynolds’ Boids) and stigmergy (ACO). At scale, density and noise induce phase changes affecting global order.
2.1. Swarm Intelligence Fundamentals. Boids rely on separation, alignment, and cohesion; ACO uses pheromone deposition and evaporation; PSO couples velocities for optimization. Fixed parameters struggle across densities.
2.2. Distributed Systems Principles. Local communication, gossip, and bounded-degree graphs mitigate broadcast storms; DTN patterns help when contacts are sparse.
2.3. Swarm Robotics in Space. Prior work demonstrates small-team coordination; few quantify phase transitions at >1e5 agents in sparse, obstacle-filled environments.
2.4. Existing Coordination Techniques. Fixed-weight Boids and greedy tasking are simple but brittle. Modern MARL with parameter sharing and centralized-training/decentralized-execution (CTDE) does scale — and we include a MAPPO baseline (§4.4) precisely to test the older claim that “MARL needs strong priors at this scale”. Our finding is more nuanced: MAPPO matches DMB’s ψ once trained but at substantial training cost, and shows higher variance under (density, noise) regimes outside its training distribution. The structural prior in DMB (sigmoid weight modulation) buys robustness that the learned policy doesn’t recover within an 8M-step training budget.
2.5. Positioning of Current Work. We provide scalable, parameter-robust emergent policies with empirical phase maps guiding safe density/noise envelopes.
3. Methodology / Proposed Framework / System Design
- We detail DMB and TF-ACO and their integration in Gossamer.
- 3.1. Conceptual Overview:
- Agents run DMB steering and TF-ACO coverage fields; Leviathan advances physics and obstacles; Maneuver.Map orchestrates sweeps and logging.
- Terms: order parameter ψ, coverage ratio χ, collision rate ρ_c.
- 3.2. Density-Modulated Boids (DMB):
- Rule weights w_sep, w_align, w_coh adapt with local density d via sigmoid schedules, reducing oscillations at high d.
-
Obstacle potential field U guides repulsion; steering v’ normalized and clamped by max acceleration and max turn-rate .
-
Implemented as
gossamer.algorithms.flocking.dmb_step. -
3.3. Task-Field ACO (TF-ACO):
- Agents deposit virtual pheromone on under-sampled cells; evaporation rate λ enforces revisit cadence; selection probability ∝ pheromone × heuristic (distance/uncertainty).
- Pheromone storage as OR-Set CRDT counters. Each cell maintains a grow-only deposit counter (G-Counter) and an evaporation timestamp; merge is componentwise max of deposit counts plus min of evaporation timestamps. Replicas reconcile via the same composite-CRDT machinery used for ICCD intent (Theorem 1, Appendix A of the ICCD paper). This replaces the previous hand-waved “DHT-backed sparse voxel map” with a primitive that has a convergence proof and matches the inter-replica consistency story used elsewhere in the Arboria stack. See the shared appendix “Eventually Consistent State Abstractions” on the research index.
- Replication topology. Each cell has a primary replica owned by the nearest active agent; secondary replicas are held by neighbors within
comm_range. Conflicts are resolved by CRDT merge, not by leader election. - Messages limited to local neighbors; parameters tuned via Maneuver.Map’s Optuna-backed sweep harness.
-
3.4. Mathematical Modeling:
Order parameter
Collision rate
-
3.5. Theoretical Analysis:
- DMB reduces eigenvalues of local linearized dynamics, dampening oscillations; TF-ACO balances exploration/exploitation via λ.
- We define supercritical density as the point at which the 3D mean free path drops below the braking distance . Using the 3D form for hard-sphere encounters with collision radius : Correction note. An earlier draft used , which is the 2D form. The 1,000×1,000×50 km belt analog is geometrically 3D (aspect ratio 20:1 is borderline quasi-2D, but agent motion is fully 3D). All numerical thresholds in §5 use the corrected 3D form. We provide both interpretations in Appendix F so the quasi-2D regime can be revisited if obstacle geometry confines motion to a thin shell.
- Critical exponents. Near we expect on the ordered side, with susceptibility and correlation length . We measure via finite-size scaling on the (density × noise) grid in §5.1 and compare against Vicsek (3D) reference values.
4. Experimental Setup / Simulation Environment
- All experiments were reproducible via Leviathan configs and Maneuver.Map runs.
- 4.1. Simulation Platform:
- Leviathan Engine py-0.2.0 with velocity-Verlet integration and OpenMP-parallel physics; Gossamer v0.2.0 policies via
gossamer.algorithms.coordination.{dmb,tfaco}; Maneuver.Map orchestrating the (density × noise) grid through the Sobol sweep design (design="sobol") plus a fine-resolution Cartesian sweep around for the susceptibility peak. Critical-phenomena instrumentation comes fromgossamer.metrics.criticality(Binder cumulant, susceptibility, correlation length) andgossamer.metrics.info(transfer entropy across the order transition).
- Leviathan Engine py-0.2.0 with velocity-Verlet integration and OpenMP-parallel physics; Gossamer v0.2.0 policies via
- 4.2. Scenario Design:
- Asteroid-belt analog: 1,000×1,000×50 km, agents, obstacles as inverse-square repulsive fields.
- Agent speed 5–15 m/s; neighbor radius 100–500 m.
- Noise model (expanded). Three independent stochastic channels per step: velocity noise (m/s, isotropic Gaussian on commanded velocity), sensing noise m on observed neighbor positions, and actuator noise m/s² on commanded acceleration (clipped to ). The Phase-1 RNG seed tree threads independent generators through each channel so noise contributions are individually ablatable.
- Reported densities use agents/km³ to reflect kilometer-scale separation in open space.
- Kinematic limits: max turn-rate rad/s and max acceleration 0.5 m/s² to model reaction-wheel constraints.
- 4.3. Input Data:
- Synthetic obstacle maps and initial seeds at
/nas/experiments/emergence/inputs/*.
- Synthetic obstacle maps and initial seeds at
- 4.4. Baseline Methods / Comparative Analysis:
- Fixed-weight Boids with Optuna-tuned weights (best of 200 TPE trials on the same scenarios) so the comparison isn’t against a strawman.
- Greedy nearest-task assignment for the coverage axis.
- Levy Flight biological search baseline (now reported in Table 1; previously the row was missing).
- MAPPO learned-weight Boids (new). A 2-layer GraphMLP policy that outputs as a function of local density and neighbor variance, trained with
gossamer.learning.mappoagainst +ψ − λ·collision_rate. 8M env steps on a single L4; ~60 GPU-hr to convergence. Same simulator, same observations as DMB; the only difference is whether the weight-modulation function is hand-coded sigmoids or learned MLPs.
- 4.5. Performance Metrics:
- Alignment , coverage , collision rate , message overhead (KB/agent·h), runtime/step.
- 4.6. Experimental Procedure:
- 15 seeds per condition; sweeps over density, neighbor radius, and noise. MLflow tracked runs at
/nas/experiments/emergence.
- 15 seeds per condition; sweeps over density, neighbor radius, and noise. MLflow tracked runs at
5. Results
All §5 numbers derive from exp_dmb_main_2026q2; see /research/reproducibility/dmb_tfaco.
- 5.1. Phase transition: critical exponents and universality class.
- Susceptibility peak. Across 4 system sizes (), peaks at a density that converges to agents/km³ as .
- Recovered exponents (DMB): , , (bootstrap 95% CIs over 5 seeds × 4 system sizes). These are statistically consistent with 3D Vicsek values (, , ).
- Binder cumulant crossing. curves for the four system sizes intersect within statistical noise at agents/km³, confirming the location of the transition.
- Comparison to fixed Boids. Fixed Boids in the same scenarios produces a sharper transition at agents/km³ — DMB delays by 1.60× in density, consistent with the abstract claim.
- 5.2. Alignment and Safety.
- DMB increased ψ by 19% (relative) and reduced by 46% vs fixed Boids at density 0.3 agents/km³; (Welch’s t-test, 15 seeds per condition).
- 5.3. Scalability.
- Runtime/step scales linearly with on the OpenMP path; message overhead remains per agent via local neighborhoods.
- 5.4. Robustness under the expanded noise model.
- DMB+TF-ACO sustains χ under combined noise up to with drop; fixed Boids degrades by 25% at the same operating point. Sensing noise was previously uninstrumented and turns out to dominate the robustness gap at high density — consistent with the intuition that DMB’s density-dependent cohesion damps observation jitter that fixed weights amplify.
- 5.5. Comparative Analysis.
- TF-ACO achieved χ = 0.82 vs greedy χ = 0.66 at equal steps; messages 0.7× greedy.
- DMB matches MAPPO on ψ within 0.02 absolute (DMB 0.69, MAPPO 0.71) at zero training cost. Outside the training distribution (noise or density 0.6 agents/km³), DMB is more stable: MAPPO’s ψ variance grows by 3.4× while DMB’s grows by 1.4×.
- (Figures and Tables):
- Figure 1: Phase diagram heatmap of over noise and density .
- Figure 2: Sigmoid modulation curves for , , vs local density .
- Figure 3: Coverage efficiency vs message overhead (Greedy vs Random Walk vs TF-ACO).
- Figure 4: Collision rate vs density with noise bands.
- Figure 5: Runtime per step vs agent count (log-log).
- Table 1: and vs policy at density .
- Table 2: Parameter sweep ranges and step sizes.
Table 1. Headline metrics at density 0.3 agents/km³, , 15 seeds, mean ± s.d. All from exp_dmb_main_2026q2.
| Policy | ψ (↑) | ρ_c (×10⁻⁴, ↓) | χ (↑) | Msgs (KB/agent·h) | Train cost |
|---|---|---|---|---|---|
| Fixed Boids (Optuna-tuned) | 0.58 ± 0.02 | 7.2 ± 0.6 | 0.66 ± 0.03 | — | 0 |
| Levy Flight | 0.41 ± 0.03 | 6.9 ± 0.5 | 0.71 ± 0.02 | — | 0 |
| Greedy assignment | 0.55 ± 0.02 | 6.4 ± 0.4 | 0.66 ± 0.02 | 1.0× | 0 |
| MAPPO learned-weight Boids | 0.71 ± 0.04 | 4.1 ± 0.3 | 0.78 ± 0.03 | — | 8M steps |
| DMB (ours) | 0.69 ± 0.02 | 3.9 ± 0.3 | 0.74 ± 0.02 | — | 0 |
| DMB + TF-ACO (ours) | 0.71 ± 0.02 | 3.8 ± 0.3 | 0.82 ± 0.02 | 0.7× | 0 |
6. Discussion
- DMB and TF-ACO tune emergent behavior for safer, more coherent exploration without centralized planning.
- 6.1. Interpretation of Key Findings: H1 and H2 are supported; adaptive weights and stigmergy provide robustness at scale with modest overhead.
- 6.2. Comparison with Related Work: Confirms Boids’ sensitivity to density; extends ACO to 3D coverage with revisit decay.
- 6.3. Implications of the Work: Provides phase maps to set safe densities and sensing radii for belts/rings; guides parameter choices.
- 6.4. Impact of Framework/Tools: Leviathan scaled to agents; Gossamer enabled rapid policy variants; Maneuver.Map revealed regime shifts.
7. Limitations and Future Work
Limitations include simplified sensing and obstacle models; no comms latency modeled here.
- 7.1. Limitations:
- The virtual pheromone map assumes timely local neighborhood synchronization; packet loss and CRDT conflict resolution for the map are not modeled.
- Energy costs include kinematics but exclude radio power for TF-ACO gossip and global map maintenance.
- Results are limited to asteroid-belt analogs and do not include solar radiation pressure or actuator faults beyond turn-rate limits.
- 7.2. Future Work:
- Model packet loss and CRDT-based conflict resolution for virtual stigmergy.
- Introduce torque/propulsion faults and heterogeneity in actuation limits.
- Evaluate Levy Flight and DMB/TF-ACO hybrids under variable gravity wells and dust-plume disturbances.
- Expand phase maps to include communication latency and bandwidth constraints.
8. Conclusion
We showed adaptive local rules (DMB) and stigmergic coverage (TF-ACO) yield robust emergent patterns at scale, improving alignment, safety, and coverage with modest overhead. These insights provide actionable phase maps and parameter choices for space-swarm deployments in belts and rings.
Acknowledgements
We thank the Arboria Visualization Team for Maneuver.Map support.
Data and Code Availability
Input maps and configs at /nas/experiments/emergence/inputs and /nas/experiments/emergence/configs; outputs at /nas/experiments/emergence/outputs. Policies in Gossamer are proprietary; analysis scripts available upon request.
References
[1] Reynolds, C., “Flocks, Herds, and Schools: A Distributed Behavioral Model,” 1987.
[2] Dorigo, M., et al., “Ant Colony Optimization,” 1999.
[3] Bonabeau, E., Dorigo, M., Theraulaz, G., “Swarm Intelligence,” 1999.
[4] Couzin, I., et al., “Collective Memory and Spatial Sorting in Animal Groups,” 2002.
[5] Gerkey, B., Mataric, M., “A Formal Analysis of Multi-Robot Task Allocation,” 2004.
[6] Jadbabaie, A., Lin, J., Morse, A. S., “Coordination of Groups of Mobile Autonomous Agents,” 2003.
Appendix / Supplementary Material
Appendix A: DMB Weight Modulation Functions
Where is local neighbor density, is the critical density inflection point, and control transition steepness.
Appendix B: Task-Field ACO with Revisit Decay
Virtual pheromone update:
Appendix C: Leviathan Simulation Config (Snippet)
{
"scenario": "asteroid_belt_alpha",
"dimensions": [1e5, 1e5, 5e4],
"agent_count": 500000,
"physics": {
"integrator": "verlet",
"dt": 0.1,
"kinematics": {
"max_velocity": 15.0,
"max_turn_rate_rad": 0.08,
"max_linear_accel": 0.5
}
},
"policy": {
"type": "DMB_TF_ACO",
"params": {
"d_crit": 12.0,
"alpha_sigmoid": 0.5,
"pheromone_evap_rate": 0.01,
"neighbor_query_limit": 32
}
}
}Appendix D: Sweep Ranges (Summary)
We sweep density (0.05–0.6 agents/km³), neighbor radius (100–500 m), and noise (0–0.2). Table 2 enumerates the discrete grid used for the phase maps.
Appendix E: Additional Ablations
We isolate (i) DMB without obstacle potentials, (ii) TF-ACO without revisit decay, and (iii) fixed-weight Boids with Optuna-tuned weights to separate tuning effects from adaptivity. The fixed-weight ψ peak under tuning is 0.62 ± 0.02 — still 0.07 below DMB — confirming the gain is not purely a calibration artifact.
Appendix F: 3D vs Quasi-2D Mean-Free-Path
The asteroid-belt analog domain (1,000 × 1,000 × 50 km, aspect ratio 20:1) sits between the 2D and 3D regimes. We use the 3D form throughout the main results because agent motion is fully 3D within the slab; the 2D form is appropriate only when vertical motion is suppressed by the obstacle field. We have re-measured under both interpretations: the 3D form gives agents/km³ and matches the empirical Binder-cumulant crossing; the 2D form gives 0.42 agents/km³ and overshoots the observed transition by ~30%, confirming that the slab geometry does not behave quasi-2D for this collision radius.
Appendix G: Reproducibility
All §5 numbers and figures regenerate from exp_dmb_main_2026q2 via notebooks/phase_diagram.py and notebooks/criticality.py. Provenance, seed tree, wheel SHAs at /research/reproducibility/dmb_tfaco. Symbol conventions (ψ, χ, ξ, β, γ, ν) follow the unified table in the ICCD paper, Appendix F.