McKinsey used an extremely advanced process called reinforcement learning to allow the bot to learn from its own mistakes. “Fundamentally you want to teach a computer how to sail a given boat design in the simulator as best as possible,” Hohn said. “And as best as possible is really important because if the computer is not as good as the sailors there’s no value in us doing that because we want to rank the designs." #Virtual sailor 7 key how to In reinforcement learning, the bot is rewarded when it sails well. “This reward component becomes really important because reinforcement learning is a way of learning which is very, very generic and that’s very similar in a way to how human’s would learn,” Hohn said. “If you think as a toddler you’re trying to learn to walk and suddenly gravity will pull you back down, that’s kind of a negative reward. And then you stand up again and slowly but surely you learn to balance yourself. You start crawling, you work and ultimately you run. “It’s very similar to the way our program actually learns to sail.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |