Local Inference API
Create, remove, and manage API keys for users
Load and unload models, and monitor their status