data science

53 items

DOCKDNuggets·20d ago

Anonymizing Production Data for Data Science with Mimesis

This content teaches how to anonymize sensitive production data for data science using Python's Mimesis library. It provides a step-by-step example for readers to try themselves.

learning data privacy security data science

Anonymizing Production Data for Data Science with Mimesis

ARTICLEDEV.to AI·4/10/2026

True to the Model or True to the Data?

O título levanta uma questão fundamental sobre a fidelidade dos sistemas de IA. Ele explora se a prioridade deve ser a consistência interna do modelo ou a representação acurada dos dados subjacentes.

model interpretability machine learning data science AI

DOCAnalytics Vidhya·5/7/2026

Feature Engineering with LLMs: Techniques & Python Examples

Feature engineering is the foundation of strong machine learning systems, but the traditional process is often manual and time-consuming. Large Language Models (LLMs) transform this by helping machines understand language and extract meaning from unstructured data.

LLMs Feature Engineering machine learning data science

DOCOpenAI Blog·26d ago

How data science teams use Codex

This content details how data science teams can leverage Codex to construct root-cause briefs, impact readouts, KPI memos, scoped analyses, and dashboard specifications from real work inputs. It serves as a practical guide for applying Codex in various analytical tasks.

learning data science AI tools analytics

RESEARCHarXiv CS.AI·4/9/2026

Toward Reducing Unproductive Container Moves: Predicting Service Requirements and Dwell Times

Este artigo apresenta um estudo de ciência de dados em um terminal de contêineres com o objetivo de reduzir movimentos improdutivos. Ele desenvolve modelos de machine learning para prever requisitos de serviço e tempos de permanência dos contêineres, superando heurísticas existentes.

logistics machine learning data science Container Terminal

DOCStatQuest (YouTube)·22d ago

The Essence of Linear Regression

This content explains the essence of linear regression, a fundamental statistical method used to model the relationship between a dependent variable and one or more independent variables. It covers the basic principles and significance of this technique in data analysis.

linear regression machine learning data science Statistics

ARTICLEKDNuggets·5/1/2026

The “Robust” Data Scientist: Winning with Messy Data and Pingouin

This article delves into the application of robust statistics within data science processes. It illustrates how to effectively handle messy data that fails to meet standard statistical assumptions.

Pingouin data science statistical methods Data Analysis

The “Robust” Data Scientist: Winning with Messy Data and Pingouin

ARTICLEfast.ai Blog·10/14/2025

How to Solve it With Code course now available

The fast.ai course "How to Solve it With Code" is now available after a year of improvements and updates. It is primarily designed for experienced coders, AI practitioners, and data scientists.

AI practitioners learning data science coding

How to Solve it With Code course now available

DOCTowards Data Science·2/3/2025

How to Find Seasonality Patterns in Time Series

This content explains how to find seasonality patterns in time series using the Fourier Transform.

Fourier Transform data science Time Series Analysis Seasonality

ARTICLEDEV.to AI·4/26/2026

Cross-Modal Knowledge Distillation for deep-sea exploration habitat design under multi-jurisdictional compliance

This article explores applying Cross-Modal Knowledge Distillation (CMKD) to design deep-sea exploration habitats. The author posits that CMKD can integrate chaotic, multi-source data to meet complex environmental, structural, and legal compliance across multiple jurisdictions.

multimodal AI Knowledge Distillation deep learning Deep-sea exploration

ARTICLETowards Data Science·2/3/2025

How to Get Promoted as a Data Scientist

This article provides valuable advice from a Lead Data Scientist who achieved two promotions in under two years. It explores strategies and practical tips on how to advance in a data science career.

Career Development professional growth promotion data science

DOCDEV.to AI·5d ago

Top Data Science Course in Chennai with Certification

The Data Science Course in Chennai addresses the increasing demand for professionals, focusing on practical knowledge in Python, Machine Learning, and AI. It provides training with real-time projects, case studies, and certification to prepare students for the job market.

certification learning machine learning data science

DOCDEV.to AI·4/26/2026

“Using R and Python Together with reticulate: A Practical Guide for Data Workflows”

This practical guide demonstrates how to use the `reticulate` package to integrate R and Python in data workflows, allowing Python objects to be used within R for tasks like machine learning and visualization. It provides steps for setting up environments and combining the strengths of both programming languages.

machine learning data science Python R

DOCDEV.to AI·5/3/2026

Jupyter Notebooks: Where Data Science Actually Happens

Jupyter Notebooks are a widely used open-source web application for creating and sharing documents that contain live code, equations, visualizations, and narrative text. They serve as a crucial tool in the data science workflow, enabling interactive data exploration, analysis, and model development.

Development Tools Jupyter Notebooks data science Programming

DOCKDNuggets·13d ago

5 Scipy.stats Tricks for Simulating ‘What If’ Scenarios

This article explores five essential tricks using scipy.stats, combined with NumPy, to design high-performance and rigorous simulations. It provides a detailed look at how to create effective "what if" scenarios with these libraries.

learning NumPy Scipy data science

5 Scipy.stats Tricks for Simulating ‘What If’ Scenarios

ARTICLEDEV.to AI·11d ago

Why Towards AI (Developers) Is IMPORTANT

This article highlights the importance of "Towards AI" as a vital resource for developers, offering essential insights, tools, and knowledge in artificial intelligence and machine learning. It keeps professionals updated on industry trends and advancements, providing practical tutorials for all learning stages.

learning AI resources machine learning data science

ARTICLEDEV.to AI·4/11/2026

Complete Data Cleaning Guide Using Pandas: A Must-Know Skill for Data Scientists

Data cleaning using Pandas is an essential skill for data scientists, crucial for transforming raw data into a structured and precise format. This fundamental step prevents incorrect results and biased models, consuming most of data scientists' time in projects.

Pandas Data Cleaning data science data preprocessing

ARTICLEDEV.to AI·4/20/2026

Monte Carlo Simulation in 5 Minutes: From Zero to Confidence Intervals in One API Call

This article explains Monte Carlo Simulation as a powerful technique to quantify uncertainty in forecasts, such as revenue targets or portfolio returns. Instead of a single estimate, it simulates thousands of possible futures to reveal the probability of various outcomes.

forecasting data science risk assessment simulation

DOCDEV.to AI·4/26/2026

Filtering Rows and Selecting Columns (The Right Way)

This post explains how to efficiently filter rows and select columns in a Pandas DataFrame using the `iloc` and `loc` selectors. It demonstrates how to perform complex data selection operations in a single expression, which is fundamental for data analysis.

Pandas data science data-manipulation

ARTICLEDEV.to AI·5/8/2026

How R Is Becoming a Powerful Tool for AI and Machine Learning in 2026

R, once considered just a statistics language, has by 2026 become a serious and practical tool for AI and machine learning. It's especially useful for analysts, researchers, and solo developers seeking quick results without significant engineering overhead.

R programming machine learning data science Programming Tools