Problem Utilizing Deep Reinforcement Learning to Effect Autonomous Results

The growth in space-capable entities has caused a rapid rise in Transfers and Intercepts via Electromagnetic Propulsion ➢ Analysis the number of derelict and space debris in orbit Gabriel Sutherland ([email protected]), Oregon State University, Corvallis, OR, USA ❖ The data suggests highly capable of around Earth, which pose a significant navigation hazard. Frank Soboczenski ([email protected]), King’s College London, London, UK neutralizing/capturing small to intermediate debris Athanasios Vlontzos ([email protected]), Imperial College London, London, UK ❖ In simulations, spacecraft was able to capture and/or Objectives neutralize debris ranging in mass from 1g to 100g ❖ Satellites suffered damage in some Develop a system that is capable of autonomously neutralizing simulations multiple pieces of space debris in various . Software & Simulations ❖ Simulations show that damaged segments of were vaporized, which means Background that no additional debris was added to orbit ➢ Dangerous amounts of space debris in orbit, estimates vary ❖ Neural Network based on Deep Deterministic Policy ➢20,000+ tracked objects larger Gradient (DDPG) effective at low-thrust orbit transfers in than 10 cm diameter 2D Hohmann transfer problem ➢Est 500,000 objects larger ❖ Using ASTOS simulation software, mission-representative than 1 cm in diameter model of spacecraft and experimental propulsion system ➢Est 100 million objects ❖ Interfaced with DRL to train the Neural smaller than 1cm in diameter ➢ Orbital speeds of these objects Network with realistic data vary from 100+ kph to 28,100 kph ➢ Future Studies ➢ Spacecraft collisions due to space ❖ Train DDPG for 3 dimensional orbit transfer problems debris have been sparse so far Tracked spacecraft in Earth orbit. Generated using Celestia Software. Earth’s magnetic lines and Van Allen radiation belts. Generated using ❖ Incorporate European Space Agency’s Space Debris ➢1985 – US Anti-satellite missile destroys aging weather satellite. Celestia Software. Software Suite Decayed orbit resulted in minimal additional debris ❖ Collision and intercept probability analysis ➢1996 - French satellite damaged by debris from French launched in 1980’s with DRAMA software ➢2007 - China destroyed a old weather satellite via a missile ❖ Train Neural Network for safe deorbit of ➢Added 3,000+ trackable debris to orbit debris using Oriundo model ➢2009 - derelict Russian satellite collided with and destroyed ❖ Manufacture CubeSat prototype Iridium communication satellite ❖ Multi-electrodynamic tether ➢Added 2,000+ trackable debris to orbit ❖ Onboard AI via next-gen VPU ➢ Kessler Syndrome remains an increased possibility ➢Cascade of debris collisions in orbit creating more debris in chaotic orbits

➢Would lead to many orbits becoming inaccessible due to debris Force vectors for a Lorentz Force propelled spacecraft in Earth’s geomagnetic field danger ➢ The Earth has a geomagnetic field surrounding it ➢Extends more than Applications 600,000km above the surface ➢ Deep Reinforcement Learning for Orbit Transfer Equivalence graph of the total intensity (unit: nT) ➢Produced by currents ➢ Useful for effecting intercepts between in Earth’s core primary spacecraft and target orbit ➢Interactions between Solar storms and the ➢ Highly efficient for low-thrust maneuvers Geomagnetic field ➢ Magnetic propulsion system cause it to expand or ➢ Propellant-less propulsion system could be Shrink used in future spacecraft design for long

General design for Deep Reinforcement Learning architecture mission life Cross-section of Geomagnetic field. (NASA) Equivalence graph of the declination angle (unit: deg) Equivalence graph of the inclination angle (unit: deg) ➢ Provides potential for satellite ➢ Lorentz Force constellations in close formation Spacecraft Design ➢ Could be used for planetary orbit insertion

➢ Lifetime >6 months for deep space missions ➢ Orbit: 2,000km – 36,000km

Onboard capabilities: ➢ Real-time Operating System Conclusions ➢ Lorentz equation describes the force exerted on a charged ➢ Autonomous target selection filter particle B by electrical field E and magnetic field v ➢ Capable of ignoring active spacecraft ➢ In materials, particles also generate forces described in Lorentz ➢ Carbon-fiber composite net sections Results indicate a system similar in design to that of equation ➢ Net deployment and actuation system spacecraft outlined in this study has the potential to ➢ Powered by Electroactive polymers (EAPs) ➢ Systems of equations involving conductive properties, ➢ Telemetry uplink/downlink drastically reduce amount of space debris and derelicts in specific heat, insolation, and density of a material must be Earth orbit, thus reducing the risk of a Kessler syndrome solved to accurately predict magnetic forces exerted on it Key Capabilities scenario happening. ➢ Electro-dynamic tethers exploit changes in earth’s magnetic ➢ Low-power onboard Neural Network for propulsion system control and target selection field across different orbital areas to generate electricity ➢ Radiation-resistant software ➢ Neural Networks have been shown to be effective for continuous Acknowledgements low-thrust orbit changes, and resilient to missed thrust events Anatomy of an Electroactive Polymer (EAP). These fibers contract 300% when a current is run through them at high voltage. Photo courtesy of Jet Propulsion ➢Oregon NASA Space Grant Consortium Rendering of current debris capture spacecraft design. Lab. ➢Valarie Dr. Nancy Squires, Oregon State University.