A prolific data scientist and researcher

Claudia Perlich is currently acting as general chair for KDD 2014, the oldest conference in data mining, which draws 1,200 participants from academia, government, and industry. She has won 15 awards in data-science-related fields such as big data, data mining, predictive modeling, and ML.

A leader in the data science field, she has over 50 scientific publications and a few patents in the area of machine learning.

Ethical principles of a scientist

The Belmont Report was published by the National Commission for the Protection of Human Subjects of Biomedical and Behavioral Research. It covers three basic ethical principles that every data scientist must include in their research work.

This article aims to help you question the ethical approach in your research plan, including your motivations and the methods you have selected, before moving into action.

The CRISP-DM Methodology

The Cross Industry Standard Process for Data Mining (CRISP-DM) is a methodology designed for carrying out data mining projects.

Every data scientist must know what CRISP-DM is and what its steps are. Would you be able to explain the methodology in your next interview?

Central Limit Theorem

The Central Limit Theorem (CLT) is used widely in science for hypothesis testing. Predictive modeling can be done using this statistical concept.

This article explains when to use the CLT and why it is so valuable. Additionally, the t-distribution is used to test the null hypothesis.
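
As a rough illustration of the idea, the sketch below simulates the CLT with Python's standard library only. It is a minimal example under stated assumptions, not the method used in the article: the exponential source distribution, the sample size of 50, the 2,000 repetitions, and the helper name `sample_means` are all choices made here for illustration. It draws many samples from a skewed distribution and checks that the sample means cluster around the true mean with a spread close to sigma / sqrt(n).

```python
import random
import statistics


def sample_means(draw, sample_size, n_samples, seed=0):
    """Return the means of n_samples samples, each of size sample_size.

    `draw` is a hypothetical callback that takes a random.Random
    instance and returns one observation.
    """
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    return [
        statistics.fmean(draw(rng) for _ in range(sample_size))
        for _ in range(n_samples)
    ]


# Assumed source distribution: exponential with rate 1.
# It is strongly skewed, with true mean 1 and standard deviation 1.
SAMPLE_SIZE = 50
means = sample_means(
    lambda rng: rng.expovariate(1.0),
    sample_size=SAMPLE_SIZE,
    n_samples=2000,
)

mean_of_means = statistics.fmean(means)
std_of_means = statistics.stdev(means)

# The CLT suggests the sample means are approximately normal with
# standard deviation sigma / sqrt(n), here 1 / sqrt(50).
expected_std = 1.0 / SAMPLE_SIZE ** 0.5

print(f"mean of sample means: {mean_of_means:.3f}")
print(f"std of sample means:  {std_of_means:.3f}")
print(f"sigma / sqrt(n):      {expected_std:.3f}")
```

Even though individual exponential draws are far from normal, the two printed spreads should come out close to each other, which is what makes the CLT useful for hypothesis testing on sample means.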

Ideo: The ideation phase

Whenever a new project begins, it is because we want to solve a new problem. The ideation phase is the first step in that direction.

Ideo is a global design company known for its advanced practice of human-centered design. The Palo Alto company is known for successfully applying the “design thinking” process to promote innovation and create solutions with a positive impact.

The founder of R-Ladies

Gabriela de Queiroz is a Sr. Developer Advocate/Sr. Engineering & Data Science Manager at IBM where she leads the CODAIT Machine Learning Team. She is the founder of R-Ladies, a worldwide organization for promoting diversity in the R community with more than 150 chapters in 45+ countries.

October’s role model engineer

What is Data Science?

Data science is a “concept to unify statistics, data analysis, machine learning and their related methods” in order to “understand and analyze actual phenomena” with data.

Data science is the hottest field to learn in the 21st century. It’s a field in constant evolution and filled with excitement.


© 2019. All rights reserved.