alecor.net — Data Science

2026-02-08

LLM and RAG Evaluation: Metrics, Best Practices

This article provides a concise reference for evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems. It covers core metrics like accuracy, F1, BLEU, ROUGE, and...

Data Science · Python Data Science AI Products MLOps LLMs RAG Evaluation NLP English

2026-02-08

Multi-Step AI Agent Evaluation: Metrics, Best Practices

This article provides a concise reference for evaluating multi-step AI agents and agentic systems. It covers core metrics for task completion, reasoning, and efficiency, and highlights recent...

Data Science · AI Agents Multi-Step Reasoning MLOps Evaluation RL LLMs AI GenAI English

2026-02-07

Core Concepts Behind Modern AI Systems

Modern AI systems may look diverse on the surface, but under the hood they rely on a small set of recurring architectural and training ideas. This article distills foundational concepts—ranging...

Data Science · Machine Learning data science AI ML LLMs Generative AI MLOps Deep Learning English

2026-02-07

Core Tools in the Modern Python Data Analytics Stack

Modern data and AI products are built on a small set of recurring Python tools for data processing, visualization, interfaces, and APIs. This article provides a concise conceptual overview of...

Data Science · Python Data Science Visualization Dashboards APIs AI Products MLOps English

2025-01-03

How could Spotify's Discover Weekly recommendation system work?

A speculative walkthrough of how Spotify's Discover Weekly might use machine learning and statistical models to generate personalized music recommendations.

Data Science · AI ML Recommendation Systems English