Llama Compose is the showcase application for Colombia AI Week, created to highlight the power of on-device AI running natively on Apple Silicon (M1 and later) with Metal acceleration. Through an interactive chat interface, it demonstrates how advanced language models can operate locally on iPhone and Mac with real-time performance. Key features: - Multiple chat modes (Simple & Agent) - On-device AI inference with llama.cpp optimized for Metal - Support for Meta's Llama and Google's Gemma model families - On-device agent with tool calling via Koog.ai - Model download and management - Runs on iOS and Apple Silicon Macs - Real-time, interactive conversation interface Important Disclaimer: This app includes experimental AI functionality. AI-generated responses may contain offensive, inaccurate, or inappropriate content. Users should exercise caution and not rely on this app for critical decisions or sensitive information. The app is provided for educational and demonstration purposes only.