Self-hosted large language models
Running LLMs locally is technically demanding. It requires specialized hardware, deep system integration knowledge, and expertise in model architecture, which most SMEs lack. At the same time, using cloud-based LLMs raises serious privacy and confidentiality concerns—especially when handling sensitive data or operating under strict regulations like GDPR. Off-the-shelf APIs are also expensive and poorly suited for domain-specific use cases, leaving SMEs without practical, compliant, or cost-effective AI solutions.
The project addresses these challenges by hosting an open-source LLM (starting with Llama and DeepSeek) locally on fortiss hardware, giving full control over data and infrastructure. This setup enables experimentation with different models, fine-tuning strategies, and deployment architectures tailored to SME needs.
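One practical benefit of this setup is that applications can talk to the self-hosted model through the same chat-completion interface that cloud providers use, so only the endpoint changes while all data stays on local infrastructure. The sketch below illustrates this; the base URL, port, and model name are assumptions for illustration, not details from the project, and common local servers (e.g. vLLM or Ollama) expose such an OpenAI-compatible endpoint.

```python
import json
import urllib.request

# Hypothetical local endpoint and model name -- adjust to the actual
# self-hosted deployment; no data leaves the host running the server.
BASE_URL = "http://localhost:8000/v1"
MODEL = "llama-3-8b-instruct"

def build_chat_request(prompt: str) -> urllib.request.Request:
    """Build a chat-completion request against the local endpoint."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("Summarize GDPR Article 5 in one sentence.")
print(req.full_url)
# Actually sending the request requires a running local server:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the request shape matches hosted APIs, existing client code can typically be pointed at the local server by changing the base URL alone.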
01.03.2025 - 30.06.2025