Google: The Language Interpretability Tool (LIT): Interactive Exploration and Analysis of NLP Models

As natural language processing (NLP) models become more powerful and are deployed in more real-world contexts, understanding their behavior is becoming increasingly critical. While advances in modeling have brought unprecedented performance on many NLP tasks, many research questions remain.

Read More

Anyscale raises $40 million to launch a managed service for distributed computing workloads

Anyscale, the startup behind the open source project Ray, today closed a $40 million funding round. A company spokesperson says the capital will be put toward growing the ecosystem around Ray and promoting Anyscale’s first commercial offering, a managed Ray platform.

Read More

Facebook’s open source M2M-100 model can translate between 100 different languages

Facebook today open-sourced M2M-100, an algorithm it claims is the first capable of translating between any pair of 100 languages without relying on English data. The machine learning model, which was trained on 2,200 language pairs, ostensibly outperforms English-centric systems on a metric commonly used to evaluate machine translation performance.

Read More

Spotify open-sources Klio, a framework for AI audio research

This week at the 2020 International Society for Music Information Retrieval Conference, Spotify open-sourced Klio, an ecosystem that allows data scientists to process audio files (or any binary files) easily and at scale. It was built to run Spotify’s large-scale audio intelligence systems and…

Read More

LinkedIn open-sources GDMix, a framework for training AI personalization models

LinkedIn recently open-sourced GDMix, a framework that makes training AI personalization models ostensibly more efficient and less time-consuming. The company says it’s an improvement over LinkedIn’s previous release in the space — Photon ML — because it supports deep learning models.

Read More

AI data tracker encourages Scientific Research into COVID-19 non-pharmaceutical interventions

There are a few questions about the wide range of non-pharmaceutical interventions (NPIs) that have been applied by governments, globally. Since the onset of the pandemic, these NPIs have been implemented in various degrees, with the intention of reducing the transmission of COVID-19.

Read More

WhyLabs raises $4 million to grow AI and data monitoring platform

WhyLabs is launching out of stealth today with $4 million to grow its platform for data scientists who need help monitoring and troubleshooting problems they encounter with datasets or AI models. The goal is to help teams managing machine learning models save time and catch problems before they make trouble for businesses or customers.

Read More

Intellectual Property Rights for Data Scientists

Data Scientists use software they didn’t write and data they don’t own pretty much all the time. It is only thanks to open source that they can use programming languages. This should not at all be taken for granted. In fact, given how important and ubiquitous intellectual property is in the data science world, it is not being discussed enough I believe. This is why I wrote this blog post.

Read More

Why Deep Learning is still too difficult

Deep Learning is still too difficult

While deep learning has great potential, building practical applications powered by deep learning remains to be too expensive and too difficult for many organizations. In this article, we will describe some of the challenges to broader adoption of deep learning.

Read More

Let’s talk about Open Data…

According to the International Open Data Charter, it defines open data as those digital data that are made available with the technical and legal characteristics necessary so that they can be freely used, reused and redistributed by anyone, at anytime and anywhere. But what are the bases that are governed to comply with the definition of open data, the International Open Data Charter gives us the principles.

Read More