Joint Optimization of Model Partitioning and Resource Allocation for Anti-Jamming Collaborative Inference Systems

A novel optimization framework jointly manages DNN partitioning, computational resources, and transmit power to secure device-edge collaborative inference against wireless jamming attacks. The approach formulates a revenue of delay and accuracy (RDA) maximization problem and solves it via an alternating optimization algorithm, including a quantum genetic algorithm for partition point selection. Simulations show the strategy outperforms baselines in RDA metric, dynamically adapting to preserve AI inference integrity and timeliness under attack.

Joint Optimization of Model Partitioning and Resource Allocation for Anti-Jamming Collaborative Inference Systems

Securing the AI Edge: New Framework Counters Jamming Attacks in Collaborative Inference

As deep neural network (DNN) inference increasingly shifts to the network edge, a critical vulnerability has emerged. A new research paper proposes a robust defense strategy for device-edge collaborative inference systems, which are highly susceptible to performance degradation from malicious wireless jamming attacks on transmitted intermediate data. The study introduces a novel optimization framework that jointly manages DNN partitioning, computational resources, and transmit power to maximize system performance and resilience, ensuring reliable AI execution on resource-constrained devices.

The Jamming Threat to Distributed AI

The paradigm of collaborative inference splits a DNN model between a wireless device and an edge server to balance computational load and latency. However, the transmission of intermediate feature tensors between these nodes creates a new attack surface. A malicious jammer can disrupt this data flow, corrupting the inference process and drastically reducing accuracy. This research, detailed in the preprint arXiv:2603.02579v1, formally analyzes this threat model, establishing a direct correlation between jamming interference, partition point selection, and final inference accuracy.

An Optimized Anti-Jamming Defense Strategy

To counter this, the authors formulate the challenge as a revenue of delay and accuracy (RDA) maximization problem under strict accuracy and computing resource constraints. This requires the simultaneous optimization of three key variables: the allocation of edge server computing resources, the transmit power of the devices, and the precise DNN partitioning point. The resulting mixed-integer nonlinear programming problem is solved via an efficient, custom-designed alternating optimization algorithm.

This algorithm decomposes the complex problem into three tractable subproblems. One is solved using Karush-Kuhn-Tucker (KKT) conditions, another via convex optimization methods, and the critical partitioning subproblem is addressed with a quantum genetic algorithm—a hybrid approach combining quantum computing principles with evolutionary optimization for efficient search in a large solution space.

Proven Performance Gains

Extensive simulation results validate the proposed scheme's superiority. The joint optimization strategy demonstrably outperforms existing baseline methods in terms of the overall RDA metric, which holistically captures the trade-off between inference speed and precision. This indicates the framework's success in dynamically adapting to jamming conditions, intelligently selecting partition points, and allocating resources to preserve both the integrity and timeliness of AI inference at the edge.

Why This Matters for Edge AI

This research provides a critical, forward-looking solution for securing the next generation of distributed AI applications.

  • Resilient Edge Computing: It moves beyond traditional performance optimization to address active security threats, enabling reliable autonomous systems, IoT analytics, and real-time mobile AI in potentially adversarial wireless environments.
  • Holistic System Co-Design: The work underscores that future AI system design must jointly consider neural network architecture, wireless communication security, and resource management as interconnected challenges.
  • Practical Algorithmic Innovation: The use of a quantum genetic algorithm for DNN partitioning presents a novel tool for solving complex, non-convex optimization problems prevalent in AI and networking research.

常见问题