Reinforcement learning uses trial and error feedback to teach machines how to achieve a goal. Metron develops AI systems that teach themselves.Join Our Team
Metron is a leader in researching and developing techniques to scale reinforcement learning algorithms to large state and action spaces enabling 21st century strategic and tactical mission planning and battle management.
Action and Reward
Reinforcement learning uses trial and error feedback to teach machines how to achieve a goal. Applications include self-driving cars, playing video games, robotic manipulation, and strategy games. The simplest form of reinforcement learning involves an agent taking an action and observing a reward or penalty based on that action. For example, choosing a restaurant is an action, and the rating based on the experience is the reward. That rating (along with previous scores) affects how likely the agent will choose that restaurant in the future.
More sophisticated reinforcement learning problems can be addressed with Markov decision process models, and solved using stochastic, dynamic programming methods or Q-learning. Metron scientists specialize in selecting and applying the best mathematical methods to solve our clients’ toughest problems.
We teach AI systems to teach themselves.
Metron has successfully used stochastic, dynamic programming and Q-learning approaches to learn effective policies for a variety of applications. Part of this research includes how to scale these algorithms to large state and action spaces. We are also investigating multi-agent learning problems in which multiple players must learn and interact within the same environment. The resulting joint spaces are too large for traditional methods, so new techniques are required.
Metron is researching and developing reinforcement learning advancements that will allow the application of these techniques to large scale strategic and tactical mission planning and battle management. This work is enabled by our rich campaign simulation and modeling capabilities in Cyber Assassin and NSS.