LM Evaluation Test Suite by AI21Labs
Evaluate the performance of large-scale language models.
Free
Build and manage data pipelines efficiently with collaborative workflows.
Access millions of trusted experimental protocols for research.