In a significant stride towards advancing artificial intelligence, Meta, the parent company of Facebook, announced on Friday the release of a suite of new AI models, including the groundbreaking “Self-Taught Evaluator.” This innovative tool promises to reduce human involvement in the AI development process by enabling AI systems to assess and improve upon themselves.
The Self-Taught Evaluator leverages the “chain of thought” technique, a method recently employed by OpenAI’s o1 models. By breaking down complex problems into smaller, logical steps, the model enhances the accuracy of responses in challenging domains such as science, coding, and mathematics. Notably, Meta’s researchers trained this evaluator entirely on AI-generated data, eliminating the need for human input during training.
“The idea of being self-taught and able to self-evaluate is crucial to achieving a superhuman level of AI,” said Jason Weston, one of the researchers behind the project. “We hope that as AI becomes more and more capable, it will get better at checking its work, surpassing average human performance.”
The release of the Self-Taught Evaluator marks a potential pathway towards autonomous AI agents capable of learning from their own mistakes. Such agents could function as intelligent digital assistants, handling a vast array of tasks without human intervention—a vision long held by many in the AI field.
Currently, developing AI models often relies on Reinforcement Learning from Human Feedback (RLHF), a process that can be both expensive and inefficient. It requires specialized human annotators to label data accurately and verify complex outputs. Meta’s approach could streamline this process, reducing reliance on human feedback.
Other industry players, including Google and Anthropic, have explored similar concepts like Reinforcement Learning from AI Feedback (RLAIF). However, unlike Meta, these companies typically do not release their models for public use. Meta’s commitment to open research could accelerate advancements across the AI community.
In addition to the Self-Taught Evaluator, Meta also unveiled updates to its image-identification Segment Anything model, a tool designed to speed up large language model response times, and datasets aimed at aiding the discovery of new inorganic materials.
For businesses, investors, and academics alike, Meta’s latest offerings signal a shift towards more autonomous and self-improving AI systems. As AI continues to evolve, such innovations could dramatically impact industries by enhancing efficiency and enabling new capabilities.
Reference(s):
cgtn.com