Hybrid H3 125M (0-shot, logit scoring) | 59.6 | Hungry Hungry Hippos: Towards Language Modeling with State Space Models | |
Hybrid H3 2.7B (3-shot, logit scoring) | 60.6 | Hungry Hungry Hippos: Towards Language Modeling with State Space Models | |
Bloomberg GPT 50B (1-shot) | 74.6 | BloombergGPT: A Large Language Model for Finance | - |
Hybrid H3 1.3B (0-shot, logit scoring) | 61.7 | Hungry Hungry Hippos: Towards Language Modeling with State Space Models | |