Voltage Park

Senior Infrastructure Software Engineer

Reposted 7 Days Ago
Remote
Hiring Remotely in USA
Senior level
Design and implement automation tools and APIs for managing infrastructure, collaborate with engineering teams, and participate in architectural discussions.
The summary above was generated by AI

Voltage Park is seeking a Senior Infrastructure Software Engineer for our Infrastructure Engineering team. Our team is responsible for building automation, tooling, and API-driven systems to bridge the gap between our physical infrastructure and the systems that our customers depend on for AI/ML training, inference, and HPC workloads at scale.

In this role, you’ll design and implement systems that enable humans and software to interact programmatically with thousands of bare-metal servers, storage clusters, and high-performance networks. You will work closely with teams across Voltage Park to drive new infrastructure rollouts and improve the lifecycle management of existing resources.

This is a fully remote position, although candidates must be based in the continental United States. Unfortunately, we are unable to provide sponsorship for this role.

Responsibilities:
  • Design, build and maintain tools, APIs, and automation frameworks to manage physical infrastructure at scale.

  • Build and extend systems for server lifecycle management.

  • Implement observability, telemetry, and logging systems that enable visibility and insights into the health of our hardware.

  • Collaborate with our Network, Infrastructure Operations, Platform Engineering, and Customer Experience teams to define requirements for and build new tools.

  • Participate in architectural discussions to help define the direction of infrastructure engineering at Voltage Park.

  • Write clear design documents and technical documentation.

Qualifications:
  • 8+ years of professional experience in software engineering, infrastructure engineering, or related fields.

  • Strong experience with Linux in production environments.

  • Proficiency in Python or similar object-oriented programming languages.

  • Familiarity with containerization and orchestration concepts.

  • Understanding of HPC infrastructure fundamentals, bare-metal provisioning and out-of-band management.

  • Experience balancing pragmatic shipping with good long-term architecture.

  • Comfortable with navigating ambiguity.

  • Strong written and verbal communication skills.

Ideal Experiences:
  • Experience with bare-metal hardware troubleshooting and provisioning; extra points for working with Dell hardware.

  • Experience with GPU servers, either in bare-metal form or under virtualization.

  • Deep experience with network switches, routers, and firewalls, particularly SONiC switches, Palo Alto firewalls, and Juniper Networks equipment.

  • Experience with VAST storage systems.

Culture:
  • Enjoy collaborating with a growing, motivated team focused on execution.

  • Comfortable operating with a high degree of autonomy and able to independently prioritize tasks aligning with company objectives.

  • Possess a breadth of knowledge in your domain while also embracing the opportunity to take on diverse responsibilities.

  • Value the importance of clear communication and documentation in driving success.

Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected under federal, state, or local law. If you require an accommodation during the job application process, please notify your recruiter.

Top Skills

Containerization
Dell Hardware
HPC Infrastructure
Juniper Networks
Linux
Networking
Orchestration
Palo Alto Firewalls
Python
SONiC Switches
VAST Storage Systems
