IBM helps developers deploy AI and ML models on Kubernetes
Responding to user requests with predictions from an AI model -- 'model serving' -- is a key part of putting the technology to work. But as the number of models grows, serving them all becomes difficult, and many can end up rarely used or abandoned.
That's why IBM is introducing ModelMesh, a model-serving management layer for Watson products designed to cope with high-scale, high-density, and frequently changing model use cases. It intelligently loads and unloads AI models to and from memory, striking an optimized trade-off between responsiveness to users and computational footprint.
ModelMesh already underpins many of Watson's cloud services, including Watson Natural Language Understanding. It is open source and includes ModelMesh Serving, a controller for managing ModelMesh clusters via Kubernetes custom resources.
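To make the custom-resource workflow concrete, here is a rough sketch of what deploying a model through ModelMesh Serving can look like, following the KServe InferenceService conventions. The model name, storage key, and path are illustrative assumptions, not values from the source:

```yaml
# Hypothetical example: an InferenceService deployed in ModelMesh mode.
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: example-sklearn-model        # assumed name for illustration
  annotations:
    serving.kserve.io/deploymentMode: ModelMesh
spec:
  predictor:
    model:
      modelFormat:
        name: sklearn                # format determines which runtime serves it
      storage:
        key: localMinIO              # assumed storage credential key
        path: sklearn/example.joblib # assumed path in object storage
```

Applying a manifest like this registers the model with the cluster; ModelMesh then decides on which pods, and when, the model is actually resident in memory.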
ModelMesh decides when and where to load and unload copies of models based on how recently they've been used and on current request volumes: if a particular model comes under heavy load, it is scaled out across more server pods. The system is designed to minimize the impact on runtime traffic while still giving urgent requests priority.
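The policy described above can be approximated by a least-recently-used cache with a usage counter. The following is a minimal illustrative sketch of that idea, not ModelMesh's actual implementation; the class name, capacity, and threshold are assumptions for the example:

```python
from collections import OrderedDict


class ModelCache:
    """Illustrative sketch of recency-based model loading: keep at most
    `capacity` models in memory, evict the least recently used one when
    full, and flag a model as a scale-out candidate once its request
    count reaches `scale_out_threshold`."""

    def __init__(self, capacity: int, scale_out_threshold: int):
        self.capacity = capacity
        self.scale_out_threshold = scale_out_threshold
        self.loaded = OrderedDict()  # model_id -> request count, in LRU order

    def request(self, model_id: str) -> bool:
        """Serve a request; return True if the model is hot enough that
        copies should be loaded onto additional server pods."""
        if model_id in self.loaded:
            self.loaded.move_to_end(model_id)      # mark most recently used
            self.loaded[model_id] += 1
        else:
            if len(self.loaded) >= self.capacity:
                self.loaded.popitem(last=False)    # unload least recently used
            self.loaded[model_id] = 1              # load the model
        return self.loaded[model_id] >= self.scale_out_threshold


cache = ModelCache(capacity=2, scale_out_threshold=3)
cache.request("sentiment")         # loads "sentiment"
cache.request("translation")       # loads "translation"
cache.request("sentiment")         # cache hit
hot = cache.request("sentiment")   # third hit -> scale-out candidate
cache.request("summarizer")        # evicts "translation", the LRU model
print(hot, list(cache.loaded))     # True ['sentiment', 'summarizer']
```

A real serving mesh would add request-rate windows, load/unload latency costs, and coordination across pods, but the core trade-off, memory footprint versus responsiveness, is the same.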
Model management decisions rely on decentralized logic, so no central controller is involved. ModelMesh also works with KServe, an industry-leading standardized model inference platform for trusted AI that has its origins in Kubeflow.