A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers