# i can haz fix! ## Session Summary **Date:** June 16, 2026 **Issue:** [ggml-org/llama.cpp#24684](https://github.com/ggml-org/llama.cpp/issues/24684) **Branch:** `ali0une-fix-fit-sleep-wake` --- ## Discovery The bug was discovered while running llama.cpp server with `--sleep-idle-seconds 60` and `--fit on`. The server would loop endlessly through sleep/wake cycles, generating hundreds of chat completion requests. Logs showed: ``` W common_fit_params: failed to fit params to free devic…

Tags: Bugllama.cppLLMGGMLAiLLaMAgitpi.devQwen
Category: LLaMALLM

Never used the cloud models so can't tell about that. My humble experience with llama.cpp + pi agent + Qwen3.6-27B + 3090 24Go VRAM and a codebase of a bit more than 130k is: if you have a workflow where you first draft a PLAN.md then make the model review it, update it with a few iterations adding comments in it like <!-- USER: keep this file untouched --> and implement it Phase by Phase in a git repository it works pretty fine and you can achieve huge amount of work be it refactoring, fixing…

Tags: LLMLLaMApi.devQwen
Category: LLMLLaMA

Backup : # backup list of installed packages dpkg --get-selections | grep -v deinstall > backup-packages.txt Restore : # mark all packages as "deinstall" except the essentials one so you have a very low-level Linux system sudo dpkg --clear-selections # restore your backup sudo aptitude install -y $(cat backup-packages.txt | awk '{print $1}')

Tags: LinuxDebianbackupdeb
Category: LinuxDebian

First you need to clone whisper.cpp repository : iman@Debian:~/whisper.cpp$ git clone https://github.com/ggml-org/whisper.cpp cd whisper.cpp Then save this as build.sh in the whisper.cpp directory and chmod +x build.sh #!/bin/bash export LANG=en_US.UTF-8 ## depends cuda-toolkit cmake curl libcurl4-openssl-dev # Check dependencies DEPENDENCIES=( 'cuda-toolkit' 'cmake' 'curl' 'libcurl4-openssl-dev' ) for i in "${DEPENDENCIES[@]}"; do dpkg -s $i > /dev/null 2>&1; if […

Tags: linuxwhisper.cppLLMGGMLCUDAAicompilegit
Category: LLM

gedit_LLaMA is a Gedit plugin that integrates with openai API compatible local LLM servers (like llama.cpp) to ask questions about selected text. Features Context-Aware Prompts: Automatically includes selected text in your prompt when asking LLaMA questions. Streaming Support: Displays responses as they arrive, providing real-time output from the model. Customizable Configuration: Easily configure API URL, API key, model name and keyboard shortcut. Multi-line Prompt Input: Use a multi-line te…

Tags: LLMGnomeGeditLLaMA
Category: LLMLLaMA