more complex 2nd method - it can load parallel distribution as well as fallback mechanism - ask ur agent
Install and Run Your AI Model Server (on all three PCs):
You'll need an AI model server that can be accessed over HTTP (or HTTPS). Examples include:
FastAPI: A modern, high-performance Python web framework.
Flask: A simpler Python web framework.
Triton Inference Server: NVIDIA's inference server (more complex setup, but highly optimized).
