SQL Server 2025 introduces native support for vector data types and external AI models. This opens up new scenarios for semantic search and AI-driven experiences directly in the database. But as with any external service integration, performance and scalability are immediate concerns, especially when generating embeddings at scale.
https://github.com/nocentino/ollama-lb-sql
Problem: Bottlenecks in Embedding Generation When you call out to an external embedding service from T-SQL via REST over HTTPS, you’re limited by the throughput of that backend.