Optimize GPU Resource Efficiency

GPU Utilization Challenges 

waste-icon

Costly GPU resources go to waste

utilization-icon

Utilization is typically less than 30%

lock-icon

Small jobs lock up entire GPUs

Pepperdata Resource Optimization for GPUs

How it Works

Pepperdata automatically partitions single GPUs into independent slices
GPU usage is monitored in real time so capacity can be dynamically adjusted 
Workloads are intelligently assigned to the most appropriate GPU partitions

Benefits

Achieve more effective GPU usage in the cloud or on prem
Increase throughput automatically by running more workloads to completion
Realize significant cost savings

without pepperdata resource optimization

down-arrowwith pepperdata resource optimization

 

"We consider Pepperdata to be the optimization layer for all our platforms, including our GPU environments. Our end users can rely on Pepperdata to do all the optimization for them, automatically—which frees them to focus on more strategic value-add initiatives for our company."


—Technical Fellow, Fortune 10 Enterprise

Frequently Asked Questions

What is GPU Resource Optimization?

Pepperdata Resource Optimization maximizes utilization and significantly reduces GPU cost by leveraging NVIDIA's Multi-Instance GPU (MIG) feature. Pepperdata continuously monitors GPU usage and demand. Based on this real-time data, Pepperdata dynamically creates the pools of sliced GPUs, adjusting the capacity of each GPU pool so each can scale up or down as needed to prevent underutilization and bottlenecks. Pepperdata then intelligently assigns workloads to the most appropriate GPU slices, learning from historical usage patterns to refine these assignments over time.

What is GPU Paritioning?

Pepperdata automatically partitions single GPUs into secure, independent GPU slices, creating three GPU partition pools in your environment for workload placement:

  1. Full GPUs: Dedicated for demanding workloads that require an entire GPU.
  2. ½ GPUs (2 MIG slices per GPU): For medium workloads that only require half of a GPU.
  3. ⅓ GPUs (3 MIG slices per GPU): Ideal for lighter workloads that fit within a third of a GPU.

How does Pepperdata differ from NVIDIA's Multi-Instance GPU (MIG)?

Managing MIG slices can be incredibly complex, manual, and challenging—but Pepperdata makes this effortless. Instead of investing tedious and time-consuming manual effort into planning GPU slices, tracking demand, coordinating workloads, and constantly resizing resources, Pepperdata does it all automatically for you. As a result, both platform operators and application developers are freed from the manual, error-prone overhead of guessing GPU needs and reconfiguring slices, which frees them to focus on higher-value work.

Start Optimizing Your GPUs

Pepperdata is currently working with a select group of partners with large-scale GPU environments. 

Fill out the form to secure your spot on the waitlist.