Every two weeks, we find the most interesting data science links from around the web and collect them in Data Science Briefings, the DataMiningApps newsletter. Subscribe now for free if you want to be the first to get up to speed on interesting resources.
- God is in the machine
Carl Miller on the terrifying, hidden reality of Ridiculously Complicated Algorithms
- Artificial Intelligence Is Now a Pentagon Priority. Will Silicon Valley Help?
“The Pentagon appears to be pushing ahead on its own, looking for ways to strengthen its ties with A.I. researchers, particularly in Silicon Valley, where there is considerable wariness about working with the military and intelligence agencies.”
- NLP’s generalization problem, and how researchers are tackling it
Generalization is a subject undergoing intense discussion and study in NLP.
- The Future of Notebooks: Lessons from JupyterCon
“At JupyterCon, I learned three things: reactive notebooks are the future, Jupyter is the new Bash, and data science is a gateway drug.” Also take a look at the slides.
- This AI is bad at drawing but will try anyways
“But when they trained the same algorithm on a huge and highly varied dataset, it had a lot more trouble generating a picture to go with that caption.”
- AI Can Transform Anyone Into a Professional Dancer
Think of it as style transfer for dancing, a deep learning based algorithm that can convincingly show a real person mirroring the moves of their favorite dancers.
- I Used AI To Clone My Voice And Trick My Mom Into Thinking It Was Me
“I ended up creating an AI-rendered digital recreation of my voice that was so convincing it fooled the person who arguably knows my voice better than anyone.”
- Beyond the pixel plane: sensing and learning in 3D
“Imagine you’re building a self-driving car that needs to understand its surroundings. How would you enable your car to perceive pedestrians, bikers, and other vehicles around it in order to move safely?”
- Deep Learning cheatsheet
Good overview from Stanford’s CS229 course.
- Interpretations of Probability
“Well may we say that probability is a guide to life; but the task of understanding exactly how and why it is has still to be completed, and will surely be a guide to future theorizing about it.”
- The World’s Oldest Blockchain Has Been Hiding in the New York Times Since 1995
This really gives a new meaning to the “paper of record.”
- Reinforcement Learning and Generative models using Flux
“I worked towards enriching model zoo of Flux.jl, a machine learning library written in Julia.”
- Apache Flink 1.6.0 Release Announcement
A big release with many new additions.
- BooST (Boosting Smooth Trees) a new Machine Learning Model for Partial Effect Estimation in Nonlinear Regressions
The idea behind the BooST is to replace traditional Classification and Regression Trees (CART), which are not differentiable, by Smooth logistic trees.
- HugeGraph
“HugeGraph is a fast-speed and highly-scalable graph database. Billions of vertices and edges can be easily stored into and queried from HugeGraph due to its excellent OLTP ability. As compliance to Apache TinkerPop 3 framework, various complicated graph queries can be accomplished through Gremlin (a powerful graph traversal language).”
- Integrating ArcGIS and deep learning
“We demonstrated an integration of ArcGIS software with the latest innovations in deep learning to perform detection of swimming pools using aerial imagery.”
- How to Analyze Billions of Records per Second on a Single Desktop PC
This article gives an overview of LocustDB, a new and extremely fast open-source analytics database built in Rust.
- Support Vector Machine (SVM) Explorer
This demo lets you interactively explore Support Vector Machines (SVMs) using Dash/Plotly.