Big-Bench - Subhaditya's Website

# Big-Bench - [Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models](https://arxiv.org/abs/2206.04615) - present and near-future capabilities and limitations of [[language]] models - Beyond the Imitation Game benchmark (BIG-bench) - benchmark that can measure progress well beyond the current state-of-the-art - 204 tasks, contributed by 442 authors across 132 institutions - Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development - tasks that are believed to be beyond the capabilities of current [[language]] models - valuate the behavior of OpenAI’s [[GPT]] models, Google-internal dense [[Transformer]] architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters