Snowflake has recently launched a new Large Language Model (LLM) called Arctic, which is as effective or even more effective than their latest model, MetaCall 3, in tasks such as SQL instruction and code generation, with a lower training cost. Building tools powered by AI using LLM has traditionally been expensive and resource-intensive, but Snowflake has developed Arctic to reduce the cost of LLM training and make it more accessible for companies.
Arctic, developed by members of the Snowflake AI Research team, is a top-level LLM designed for business use. It excels in tasks like SQL generation and coding, and is truly open, using the Apache 2.0 license. According to Snowflake, Arctic’s capabilities are on par or better than Meta’s latest AI models, Llama 3 8B and Llama 2 70B, while using less computing resources.
One of the key features of Arctic is its high training efficiency, which allows for the training of personalized models in a more affordable way. This efficiency is achieved through a dense Mix of Experts (MoE) hybrid transformer architecture, combining a dense transformer model with a residual MLP MoE. The model’s open-source nature allows for easy sharing of research details and model training methodology.
Arctic weights are released through the Apache 2.0 license, enabling use for both commercial and research purposes. The model is available on various platforms and will soon be accessible on popular cloud platforms like Amazon AWS and Microsoft Azure. Snowflake’s Arctic represents a major step forward in making advanced AI technology more affordable and accessible for businesses.
In summary, Snowflake’s new Large Language Model (LLM) called Arctic is an effective tool that can be used for various tasks such as SQL instruction and code generation at a lower cost compared to traditional models like MetaCall 3. Its high training efficiency is achieved through a dense Mix of Experts (MoE) hybrid transformer architecture that combines two powerful models – dense transformer model with residual MLP MoE – making it easier to train personalized models in an affordable way. Additionally its open-source nature allows easy sharing of research details and model training methodology while its weights are released under the Apache 2.0 license enabling use for both commercial and research purposes on multiple platforms including popular cloud platforms like Amazon AWS
In recent years, DP World has invested heavily in Romania to improve connectivity between its…
Via Cognitive Health is set to open a new facility in Augusta next month aimed…
The 26th Cross-Strait Fair for Economy and Trade opened on Thursday in Fuzhou, the capital…
During his recent visit to Kyiv, Secretary of State Antony Blinken took a break from…
In Michigan, high schools may soon be required to offer computer science courses, although students…
Joe Kippley, a native of Brookings and a Minnehaha County Commissioner for the past year,…