Обложка канала

Spark in me - Internet, data science, math, deep learning, philosophy

2440 @snakers4

Канал про интересные мне темы - интернет - статистика - наука о данных Без рекламы и буллшита.

Spark in me - Internet, data science, math, deep learning, philosophy

3 года назад
Открыть в
​​LLaMA: Open and Efficient Foundation Language Models LLaMA is a set of large language models, ranging from 7B to 65B parameters, that have been trained on publicly available datasets containing trillions of tokens. The LLaMA-13B model performs better than GPT-3 (175B) on most benchmarks, and the LLaMA-65B model is competitive with other state-of-the-art models, such as Chinchilla70B and PaLM-540B. This suggests that it is possible to achieve excellent performance in language modeling without relying on proprietary or inaccessible datasets. Paper: research.facebook.com/publica…e-models Code: https://github.com/facebookresearch/llama A detailed unofficial overview of the paper: https://andlukyane.com/blog/paper-review-llama #deeplearning #nlp #transformer #sota #languagemodel