BIG-bench
Collaborative benchmark for evaluating language model performance.
Free
from $4.00/m