Storage Engineer - Hosting Job at Confidential, Miami, FL

akVyMExFcWlCMllxc2ZxWERRR0FINUJMa0E9PQ==
  • Confidential
  • Miami, FL

Job Description

Ready to help build the backbone of next-generation AI?

Join a Founders Fund-backed NVIDIA cloud partner that is creating the high-performance infrastructure powering some of the world’s most ambitious AI research. In the realm of GPU-as-a-Service, the bottleneck isn’t compute, it’s the data.

As a Storage Engineer, design and implement the data layer that enables foundation model training and enterprise-grade production inference. This role requires a deep understanding that AI at scale demands more than capacity: it demands massive throughput, ultra-low latency, and the ability to feed thousands of GPUs seamlessly.

Take the next step in your career and help shape the infrastructure that drives the future of AI.

Responsibilities:

  • Design & Deploy AI Storage: Architect and implement high-performance parallel file systems (Weka, Lustre, or similar) optimised specifically for GPU-heavy workloads and multi-node training.
  • Optimise Data Pipelines: Fine-tune storage performance to ensure maximum GPUDirect Storage (GDS) efficiency, minimising latency between the storage fabric and the GPU memory.
  • Manage Scale & Reliability: Build and maintain petabyte-scale storage clusters across multiple global data centers, ensuring 99.99% uptime for mission-critical AI research labs.
  • Infrastructure Integration: Partner with Network and Data Center engineers to configure high-speed storage networking (InfiniBand/400G Ethernet) and ensure seamless backend connectivity.
  • Automate Storage Ops: Develop Terraform providers, Ansible playbooks, or Python scripts to automate the provisioning, monitoring, and scaling of storage resources.
  • Troubleshoot Complex I/O: Act as the Tier-3 lead for storage-related performance degradation, identifying root causes in the filesystem, network, or Linux kernel.

Skills/Must have:

  • Specialised Storage Expertise: 5+ years of experience with high-performance storage solutions (WekaIO, VAST Data, BeeGFS, or DDN) in a Linux-heavy environment.
  • AI Infrastructure Knowledge: Deep understanding of how storage interacts with NVIDIA GPU stacks (HGX/DGX) and the specific I/O patterns of ML training (checkpoints, small file reads, etc.).
  • Networking Proficiency: Hands-on experience with InfiniBand, RoCEv2, and NVMe-over-Fabrics (NVMe-oF).
  • Systems Automation: Strong scripting skills in Python, Go, or Bash, and experience with IaC tools like Terraform or Pulumi.
  • Linux Internals: Deep knowledge of the Linux storage stack, including XFS/ZFS, LVM, and kernel tuning for high-throughput networking.

Benefits:

  • 10% bonus
  • Stock options

Salary:

  • $200,000 base salary

Job Tags

Permanent employment

Similar Jobs

RSM US LLP

Manager - Financial Due Diligence Job at RSM US LLP

 ...inspires and empowers you to thrive both personally and professionally. Theres no one like you and thats why theres nowhere like RSM. Job Responsibilities Leads execution of diligence engagements. Oversees engagement teams and timelines. Manages client... 

Dwight School

Physical & Health Education Teacher Job at Dwight School

 ...innovators and thought leaders. Known for its low student-teacher ratio, Dwight enrolls 1,100 students with 400 faculty and...  ...maintain a work environment free from discrimination. Physical & Health Education Teacher Full Time; Hours: 7:30am-4pm Start August 20... 

SGS Consulting

Technical Game Developer Job at SGS Consulting

 ...troubleshoot the design of our prototype demo experiences. Skills: ~ Portfolio required to view skills / craft ~ Minimum 8+ years of Game development and/or AR/VR prototype experiences. You are an expert in game mechanics and UI systems, demonstrating high end skillsets... 

EarthDaily Analytics

Marketing Events Manager Job at EarthDaily Analytics

 ...computing to solve the toughest challenges in agriculture, water management, carbon-capture verification and more. EDAs signature Earth...  ...s brand to life on the global stage? Do you thrive on building event programs that generate real pipeline, forging partnerships that... 

Portland Rescue Mission

Women’s Community Life Specialist Job at Portland Rescue Mission

 ...OVERVIEW Portland Rescue Mission is seeking a Women's Community Life Specialist. This gifted servant will personify community building, order, and responsibility as they join a Christ-centered team with an award-winning culture and a commitment to serving others in...