Every two weeks, we find the most interesting data science links from around the web and collect them in Data Science Briefings, the DataMiningApps newsletter. Subscribe now for free if you want to be the first to get up to speed on interesting resources.
- Strangelove redux: US experts propose having AI control nuclear weapons
One of the risks of incorporating more artificial intelligence into the nuclear command, control, and communications system involves the phenomenon known as automation bias.
- Artificial intelligence in medicine raises legal and ethical concerns
AI in medicine also raises significant legal and ethical challenges.
- AI’s white guy problem isn’t going away
A new report says current initiatives to fix the field’s diversity crisis are too narrow and shallow to be effective.
- NLP’s Clever Hans Moment has Arrived
It is now almost a cliché to find out that BERT (Devlin et al., 2019) performs “surprisingly well” on whatever dataset you throw at it.
- Don’t Let Metrics Undermine Your Business
A company can easily lose sight of its strategy and instead focus strictly on the metrics that are meant to represent it.
- Continuous Delivery for Machine Learning
Automating the end-to-end lifecycle of Machine Learning applications. Recommended!
- Scraping A Public Website Doesn’t Violate the CFAA, Ninth Circuit (Mostly) Holds
The Ninth Circuit Court of Appeals has handed down a groundbreaking decision today on the federal computer hacking law, the Computer Fraud and Abuse Act (CFAA). In HiQ Labs v. LinkedIn, the court held that scraping a public website is likely not a CFAA violation.
- The three challenges keeping cars from being fully autonomous
Technical, regulatory, and business obstacles are still in the way of safe, useful, and affordable self-driving vehicles.
- Zao’s deepfake face-swapping app shows uploading your photos is riskier than ever
“It highlights how great the risks have become of making your photos available online where anyone can use or abuse them.”
- McDonald’s uses AI for ordering at drive-throughs
McDonald’s is to replace human servers with voice-based technology in its US drive-throughs.
- Human Rights Implications of IBM Watson’s ‘Personality Insights’ Tool
“Can I ‘game’ the system to make myself a better job candidate?”
- Artificial intelligence is changing every aspect of war
A new type of arms race could be on the cards.
- How China is Cashing in on Group Chats
“In early 2015, ‘conversational commerce’ was hailed as the future of online shopping. Back then, the term was commonly applied to tech like shopping bots and voice assistants. But the subsequent rise of private messaging suggests that group chats may actually be the secret to turning conversations into commerce.”
- The Next Hot Job: Pretending to Be a Robot
As the promise of autonomous machines lags the underlying technology, the growing need for human robot-minders could juice the remote workforce.
- A 2019 Guide for Automatic Speech Recognition
- Human speech may have a universal transmission rate: 39 bits per second
“Indeed, no matter how fast or slowly languages are spoken, they tend to transmit information at about the same rate: 39 bits per second, about twice the speed of Morse code.”
- Taylor Swift threatened to sue Microsoft over its racist chatbot Tay
According to Microsoft’s president, the singer already had trademark issues with the company’s US version of the Chinese chatbot XiaoIce.
- We need to do better with our data – can citizen scientists make a difference?
“The citizen data scientist who has deep knowledge of the company’s culture, market and profitability drivers might be in a better position than the specialist data analyst with only very limited knowledge of the employer’s gears and levers.”
- Using Deep Learning to Inform Differential Diagnoses of Skin Diseases
“We developed a deep learning system (DLS) to address the most common skin conditions seen in primary care. Our results showed that a DLS can achieve an accuracy across 26 skin conditions that is on par with U.S. board-certified dermatologists”
- No Bones about It: People Recognize Objects by Visualizing Their “Skeletons”
This basic ability gives humans a leg up on computers.
- Building Custom Deep Learning Based OCR models
- Natural Language in Python using spaCy: An Introduction
- Salesforce releases CTRL
A Conditional Transformer Language Model for Controllable Generation
- DeepPrivacy is a fully automatic anonymization technique for images.
- Command line utilities for tabular data files
This is a set of command line utilities for manipulating large tabular data files. Files of numeric and text data commonly found in machine learning, data mining, and similar environments. Filtering, sampling, statistics, joins, and more.
- In Aging Singapore, 65-Year-Olds Are Learning How to Code
Government, businesses are investing more to upskill workers
- Our Brains Are Not Multi-Threaded
The human brain is messier.
- The Gambler Who Cracked the Horse-Racing Code
Bill Benter did the impossible: He wrote an algorithm that couldn’t lose at the track. Close to a billion dollars later, he tells his story for the first time.
- Draw The Rest Of The Chart
Imposter Syndrome and Levelling Up in Data Visualization
- DeepFaceLab is a tool that utilizes machine learning to replace faces in videos
- fullstack.ai
End-to-end machine learning project showing key aspects of developing and deploying a real-life machine-learning-driven application.
- Open Forensic Science in R
This book is for anyone looking to do forensic science analysis in a data-driven and open way.