alecor.net.
  • LLMs
  • Home
  • About
  • Contact
  • Projects
  • Tags
  • Categories
  • Archives
🇬🇧 🇦🇷 🇧🇷 🇵🇱 🇷🇺
  • 2026-02-08

    LLM and RAG Evaluation: Metrics, Best Practices

    This article provides a concise reference for evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems. It covers core metrics like accuracy, F1, BLEU, ROUGE, and...

    Data Science · Python Data Science AI Products MLOps LLMs RAG Evaluation NLP English
  • 2026-02-08

    Multi-Step AI Agent Evaluation: Metrics, Best Practices

    This article provides a concise reference for evaluating multi-step AI agents and agentic systems. It covers core metrics for task completion, reasoning, and efficiency, and highlights recent...

    Data Science · AI Agents Multi-Step Reasoning MLOps Evaluation RL LLMs AI GenAI English
  • 2026-02-07

    Core Concepts Behind Modern AI Systems

    Modern AI systems may look diverse on the surface, but under the hood they rely on a small set of recurring architectural and training ideas. This article distills foundational concepts—ranging...

    Data Science · Machine Learning data science AI ML LLMs Generative AI MLOps Deep Learning English
← Previous Page 2 of 2

Nothing you read here should be considered advice or recommendation. Everything is purely for informational purposes.