Will any open-source Transformers LLM model that function as a dense mixture of experts be released by end of 2024? | Manifold

Will any open-source Transformers LLM model that function as a dense mixture of experts be released by end of 2024?

Plus

4

Ṁ118

Jan 1

46%

chance

1D

1W

1M

ALL

Will any open source or weights Transformers LLM based model emerge that is functionally a dense version of mixture of experts where the empirical mathematical sparsity resembles dense models like Llama 3.1 405B or Mistral Large Enough. A tool that allows for the creation of this type of model even if no model is released along with it would resolve as yes as long as it is possible to create the model for example Mergekit for various ways of model manipulation. A paper would only resolve as yes if there was an accompanying model, functional code released, or implementation by a third party.

This question is managed and resolved by Manifold.

#️ Technology

#Technical AI Timelines

Get

1,000

and

3.00

Related questions

When will a non-Transformer model become the top open source LLM?

Will Meta release an open source language model that outperforms GPT-4 by the end of 2024

When will OpenAI release a more capable LLM?

Will Mamba be the de-facto paradigm for LLMs over transformers by 2025?

Will the most capable, public multimodal model at the end of 2027 in my judgement use a transformer-like architecture?

When will the first fully open-source advanced LLM (data, code, weights) be released?

Will Transformer based architectures still be SOTA for language modelling by 2026?

Will superposition in transformers be mostly solved by 2026?

Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?

By EOY 2025, will the model with the lowest perplexity on Common Crawl will not be based on transformers?

Related questions

When will a non-Transformer model become the top open source LLM?

When will the first fully open-source advanced LLM (data, code, weights) be released?

Will Meta release an open source language model that outperforms GPT-4 by the end of 2024

Will Transformer based architectures still be SOTA for language modelling by 2026?

When will OpenAI release a more capable LLM?

Will superposition in transformers be mostly solved by 2026?

Will Mamba be the de-facto paradigm for LLMs over transformers by 2025?

Are Mixture of Expert (MoE) transformer models generally more human interpretable than dense transformers?

Will the most capable, public multimodal model at the end of 2027 in my judgement use a transformer-like architecture?

By EOY 2025, will the model with the lowest perplexity on Common Crawl will not be based on transformers?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules