This paper deals with the challenge of multi-agent Mastering of a inhabitants of gamers, engaged inside a recurring normalform sport. Assuming boundedly-rational brokers, we propose a product of social Understanding based on demo and mistake, named "social reinforcement Studying". This extension of very well-recognized Q-Mastering algorithm, permits gamers inside a https://prussiab726lew3.blog5star.com/profile