Google gives enterprises new controls to manage AI inference costs and reliability

InfoWorld — While the cost of training large language models has been a concern in the past, attention is increasingly shifting to inference, the cost of actually using those models.
Google's new tiers, called Flex Inference and Priority Inference, address a problem that has grown more acute as enterprises move beyond simple AI chatbots into complex, multi-step agentic workflows, the company said in a blog post published Thursday.