Ciencia de Datos | 🇬🇧 Data Science

Auto Added by WPeMatico

Scientific Visualization: Python + Matplotlib

The Python scientific visualisation landscape is huge. It is composed of a myriad of tools, ranging from the most versatile and widely used down to the more specialised and confidential. Some of these tools are community based while others are developed by companies. Some are made specifically for the web, others are for the desktop […]

Scientific Visualization: Python + Matplotlib Read More »

Federated Learning: Issues in Medical Application

In this presentation, the current issues to make federated learning flawlessly useful in the real world will be briefly overviewed. They are related to data/system heterogeneity, client management, traceability, and security. Also, we introduce the modularized federated learning framework, we currently develop, to experiment various techniques and protocols to find solutions for aforementioned issues. The

Federated Learning: Issues in Medical Application Read More »

Introduction to Datascience: Learn Julia Programming, Math & Datascience from Scratch

I was emboldened to write this book after my video series called Data Science With Julia got some traction. That too after a tweet about Decision Tree was liked by Julia Language itself. So I thought why not give it more?

Introduction to Datascience: Learn Julia Programming, Math & Datascience from Scratch Read More »

The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming

We analyze the math fundamentals behind DP and demonstrate the power of it by applying it on two real-world text classification tasks. Furthermore, we compare DP with pointillistic active and semi-supervised learning techniques traditionally applied in data-sparse settings.

The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming Read More »

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

This paper provides a succinct overview of this emerging theory of overparameterized ML (henceforth abbreviated as TOPML) that explains these recent findings through a statistical signal processing perspective. We emphasize the unique aspects that define the TOPML research area as a subfield of modern ML theory and outline interesting open questions that remain.

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning Read More »

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

CARLA (Counterfactual And Recourse LibrAry), a python library for benchmarking counterfactual explanation methods across both different data sets and different machine learning models. In summary, our work provides the following contributions: (i) an extensive benchmark of 11 popular counterfactual explanation methods, (ii) a benchmarking framework for research on future counterfactual explanation methods, and (iii) a

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms Read More »

Data as the main focus of “State of the art of data science in Spanish language and its application in the field of Artificial Intelligence”

According to the results, there is an evidence of cultural bias for data science in Spanish language. The outcome of the consultation, which carried out on 12 April 2021, confirms that only 10 out of 23.771 datasets “speaks” Spanish.”

Data as the main focus of “State of the art of data science in Spanish language and its application in the field of Artificial Intelligence” Read More »