Every so often, we find the most interesting data science links from around the web and collect them in Data Science Briefings, the DataMiningApps newsletter. Subscribe now for free if you want to be the first to get up to speed on interesting resources.
- OpenAI introduces o1 – the next leap forward?
“We are introducing OpenAI o1, a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding to the user.” - Notes on OpenAI’s new o1 chain-of-thought models
“There’s a lot to understand about these models—they’re not as simple as the next step up from GPT-4o” - g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
“This is an early prototype of using prompting strategies to improve the LLM’s reasoning capabilities through o1-like reasoning chains.” - WordLlama
WordLlama is a fast, lightweight NLP toolkit that handles tasks like fuzzy-deduplication, similarity and ranking with minimal inference-time dependencies and optimized for CPU hardware. - Reader-LM: Small Language Models for Cleaning and Converting HTML to Markdown
Two novel small language models inspired by Jina Reader, designed to convert raw, noisy HTML from the open web into clean markdown. Perfect for scraping. - Meet Yi-Coder: A Small but Mighty LLM for Code
Yi-Coder is a series of open-source code large language models (LLMs) that deliver state-of-the-art coding performance with fewer than 10 billion parameters. - Fine-Tuning for Precision and Privacy
Corgea reports on a fine-tuned LLM “offering complete data isolation and avoiding the need for customers to sign Business Associate Agreements (BAAs) for HIPAA compliance.” - Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
“We introduce Transfusion, a recipe for training a multi-modal model over discrete and continuous data. Transfusion combines the language modeling loss function (next token prediction) with diffusion to train a single transformer over mixed-modality sequences.” - What I’ve learned building MLOps systems for four years
Some interesting insights to be found in this post. - Roblox Builds Open-Source 3D AI Model, Adds Tech for Faster Game Loading
Roblox already uses over 250 different AI models, and players may soon get another one that can create just about anything from scratch. - Reflection is a new AI model that can run on a good laptop and still beat GPT-4o in tests
It is a tuned-up version of Llama 70b - People Are Creating an Average of 34 Million Images Per Day
In the past few years, dozens of communities dedicated to AI art have accelerated across the Internet