According to Meta's research, the LSP method borrows the concept of self-play from game theory, framing the model's capabilities as its performance in a competitive game. By allowing the model ...