Reinforcement Learning from Human Feedback: Progress and Challenges