Combating Stagnation in Reinforcement Learning Through 'Guided Learning' With 'Taught-Response Memory'