Run Automated Evaluation Scripts Tools

LM Evaluation Test Suite by AI21Labs

Evaluate the performance of large-scale language models.