{"version": "https://jsonfeed.org/version/1", "title": "/dev/posts/ - Tag index - machine-learning", "home_page_url": "https://www.gabriel.urdhr.fr", "feed_url": "/tags/machine-learning/feed.json", "items": [{"id": "http://www.gabriel.urdhr.fr/2025/09/22/reinforcement-learning-formulas/", "title": "Reinforcement Learning formulas cheat sheet", "url": "https://www.gabriel.urdhr.fr/2025/09/22/reinforcement-learning-formulas/", "date_published": "2025-09-22T00:00:00+02:00", "date_modified": "2025-09-22T00:00:00+02:00", "tags": ["computer", "machine-learning", "reinforcement-learning", "neural-networks"], "content_html": "<p>Cheat sheet for (some) reinforcement learning mathematical formulas and algorithms.</p>\n"}, {"id": "http://www.gabriel.urdhr.fr/2025/05/14/llama.cpp-quickstart/", "title": "llama.cpp quickstart", "url": "https://www.gabriel.urdhr.fr/2025/05/14/llama.cpp-quickstart/", "date_published": "2025-05-14T23:12:16+02:00", "date_modified": "2025-05-14T23:12:16+02:00", "tags": ["computer", "machine-learning", "deep-learning", "language-model", "neural-networks", "LLM"], "content_html": "<p>How to quickly use llama.cpp for LLM inference (no GPU needed).</p>\n"}, {"id": "http://www.gabriel.urdhr.fr/2025/05/14/vllm-quickstart/", "title": "vLLM quickstart", "url": "https://www.gabriel.urdhr.fr/2025/05/14/vllm-quickstart/", "date_published": "2025-05-14T23:11:38+02:00", "date_modified": "2025-05-14T23:11:38+02:00", "tags": ["computer", "machine-learning", "deep-learning", "language-model", "neural-networks", "LLM"], "content_html": "<p>How to quickly use <a href=\"https://docs.vllm.ai/en/stable/\">vLLM</a> for LLM inference using CPU.</p>\n"}, {"id": "http://www.gabriel.urdhr.fr/2025/01/30/distillation/", "title": "Neural Network Distillation", "url": "https://www.gabriel.urdhr.fr/2025/01/30/distillation/", "date_published": "2025-01-30T00:00:00+01:00", "date_modified": "2025-01-30T00:00:00+01:00", "tags": ["computer", "machine-learning", "deep-learning", "neural-networks"], "content_html": "<p>Overview of neural network distillation\nas done in\n<a href=\"https://arxiv.org/abs/1503.02531\">\u201cDistilling the Knowledge in a Neural Network\u201d</a>\n(Hinton et al, 2014).</p>\n"}, {"id": "http://www.gabriel.urdhr.fr/2025/01/07/transformer-decoder-language-models/", "title": "Transformer-decoder language models", "url": "https://www.gabriel.urdhr.fr/2025/01/07/transformer-decoder-language-models/", "date_published": "2025-01-07T00:00:00+01:00", "date_modified": "2025-09-23T22:22:00+02:00", "tags": ["computer", "machine-learning", "deep-learning", "language-model", "neural-networks", "reinforcement-learning", "LLM"], "content_html": "<p>Some notes on how <a href=\"https://arxiv.org/abs/1801.10198\">transformer-decoder</a> language models work,\ntaking GPT-2 as an example,\nand with lots references in order to dig deeper.\nThis is intended both as a a roadmap for understanding on how LLMs work\n(especially the ones using a transformer-decoder architecture)\nand a a summary/recap on the topic.</p>\n"}, {"id": "http://www.gabriel.urdhr.fr/2024/12/26/github-copilot-prompt/", "title": "GitHub Copilot instructions", "url": "https://www.gabriel.urdhr.fr/2024/12/26/github-copilot-prompt/", "date_published": "2024-12-26T00:00:00+01:00", "date_modified": "2024-12-26T00:00:00+01:00", "tags": ["computer", "machine-learning", "deep-learning", "language-model", "security", "LLM"], "content_html": "<p>Extracting the system prompt from GitHub CoPilot.</p>\n"}, {"id": "http://www.gabriel.urdhr.fr/2022/08/28/trying-to-run-stable-diffusion-on-amd-ryzen-5-5600g/", "title": "Stable Diffusion on an AMD Ryzen 5 5600G", "url": "https://www.gabriel.urdhr.fr/2022/08/28/trying-to-run-stable-diffusion-on-amd-ryzen-5-5600g/", "date_published": "2022-08-28T00:00:00+02:00", "date_modified": "2022-08-28T00:00:00+02:00", "tags": ["computer", "machine-learning", "deep-learning", "generative-art", "neural-networks"], "content_html": "<p>Executing\nthe <a href=\"https://stability.ai/blog/stable-diffusion-public-release\">Stable Diffusion</a>\ntext-to-image model on an AMD Ryzen 5 5600G integrated GPU (iGPU).</p>\n"}]}