
Yes, it is possible to build a framework for running Large Language Models (LLMs) on Windows Server 2022 within an IIS (Internet Information Services) instance, using AI components such as ONNX Runtime and DirectML. Here's a structured approach to making it work:
Framework for Running LLMs on Windows Server 2022 with IIS
1. System Requirements
Before setting up, ensure your server has:
- Windows Server 2022 with IIS installed.
- An AI runtime such as ONNX Runtime, Windows ML, or DirectML for acceleration (a quick verification sketch follows this list).
- A GPU (optional but recommended) if leveraging CUDA, DirectML, or ONNX acceleration.
- .NET 6/7+ or Python for hosting AI inference.
- LLM model files (e.g., GPT-2, Llama, or a smaller model converted to ONNX format).
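Before going further, it is worth confirming that the runtime and any GPU acceleration are actually visible from Python. A minimal sketch, assuming the onnxruntime package (or the onnxruntime-gpu / onnxruntime-directml variant) has been installed via pip:

```python
# Sketch: confirm ONNX Runtime is installed and list its execution
# providers, so you know whether GPU acceleration is available.
import onnxruntime as ort

print("ONNX Runtime version:", ort.__version__)
print("Available providers:", ort.get_available_providers())
# With onnxruntime-gpu you should see 'CUDAExecutionProvider';
# with onnxruntime-directml, 'DmlExecutionProvider'.
# 'CPUExecutionProvider' is always present as a fallback.
```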
2. Setting Up IIS for AI Model Hosting
IIS can serve an AI inference API by hosting a web application (ASP.NET Core, or a Python framework such as FastAPI or Flask run behind IIS via HttpPlatformHandler or FastCGI).
Steps to configure IIS for LLM API hosting:
Enable IIS on Windows Server 2022:
Install-WindowsFeature -Name Web-Server -IncludeManagementTools
- Ensure IIS supports .NET and Python apps (install the ASP.NET Core Hosting Bundle for .NET; for Python, install HttpPlatformHandler or enable CGI/FastCGI).
Deploy an AI Web Service in IIS:
- Deploy an ASP.NET Core API (C#) or a Flask/FastAPI app (Python) for model inference; a minimal Flask sketch is shown below.
- Ensure the AI model is preloaded on the backend so it is not reloaded on every request.
- The AI API should expose endpoints such as:
POST /predict → Takes input text, returns LLM-generated response
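A minimal sketch of such a service in Flask, preloading a Hugging Face text-generation pipeline at startup; the model name (gpt2) and generation settings here are illustrative assumptions, not requirements:

```python
# Sketch: minimal Flask inference API exposing POST /predict.
# Assumes 'flask' and 'transformers' are installed via pip.
from flask import Flask, request, jsonify
from transformers import pipeline

app = Flask(__name__)

# Preload the model once at startup so each request avoids the load cost.
generator = pipeline("text-generation", model="gpt2")

@app.route("/predict", methods=["POST"])
def predict():
    data = request.get_json(force=True)
    prompt = data.get("text", "")
    # Generate a continuation; max_new_tokens caps the response length.
    result = generator(prompt, max_new_tokens=64)
    return jsonify({"response": result[0]["generated_text"]})

if __name__ == "__main__":
    app.run(port=5000)
```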
Set the IIS Application Pool to Use the Right Python or .NET Runtime
- For ASP.NET Core apps, set the application pool's .NET CLR version to "No Managed Code" (ASP.NET Core runs out of process behind IIS).
- For Python apps, leave "Enable 32-Bit Applications" off and point HttpPlatformHandler at the python.exe of the environment that has your dependencies installed.
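Once the site is running under IIS, you can smoke-test the endpoint from any machine. A hypothetical example using the requests package; the URL assumes a local binding on the default port, so substitute your site's actual binding:

```python
# Sketch: smoke-test the /predict endpoint of the deployed service.
# The URL is an assumption; replace it with your IIS site binding.
import requests

resp = requests.post(
    "http://localhost/predict",
    json={"text": "Explain what IIS does in one sentence."},
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["response"])
```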