This library is designed for running small language models locally using llama.cpp. If you want to call external LLM APIs, this is not the right fit. Context Engineering - Use callbacks to manipulate ...