Every so often, we find the most interesting data science links from around the web and collect them in Data Science Briefings, the DataMiningApps newsletter. Subscribe now for free if you want to be the first to get up to speed on interesting resources.
- The Complete Resource Of Artificial Intelligence Tools & Services
Great overview to keep track of all the AI tools that are popping up!
- Databases in 2022: A Year in Review
“The massive funding rounds stopped in the second half of 2022”
- MegaFace
“MegaFace is a large-scale public face recognition training dataset that serves as one of the most important benchmarks for commercial face recognition vendors. It includes 4,753,320 faces of 672,057 identities from 3,311,471 photos downloaded from 48,383 Flickr users’ photo albums.”
- The Expanding Dark Forest and Generative AI
Proving you’re a human on a web flooded with generative AI content…
- Playing Games with AIs: The Limits of GPT-3 and Similar Large Language Models (paper)
“We show how this analysis predicts that a widespread adoption of language generators as tools for writing could result in permanent pollution of our informational ecosystem with massive amounts of very plausible but often untrue texts.”
- 2022 in review: neuroAI comes of age
“Developing the tools to study the brain’s reaction to causal manipulations might give modellers precisely the data they need to make progress in other corners of neuroAI.”
- Some remarks on Large Language Models
“A short personal perspective of my thoughts of this (and similar) models, and where we stand with respect to language understanding.”
- Introduction to Graph Machine Learning
“In this blog post, we cover the basics of graph machine learning.”
- MLOps: The Whole Game
“An example of model building, model deployment, and model monitoring with R using palmerpenguins”
- Stop using Airflow for data science
“The issues with Airflow are extensively documented. But beyond the standard complaints, it’s become clear to us that Airflow’s flaws are amplified for data scientists.”
- Extracting, converting, and querying data in local files using clickhouse-local
“Wouldn’t it be nice to have a tool to analyze and transform the data in those files using the power of SQL, and all of the ClickHouse functions, but without having to deploy a whole database server or write custom Python code?”
- Three-eyed forehead in Stable Diffusion
Stable Diffusion insists on two eyes.
- VALL-E
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers