Senior Software Engineer - Model Inferencing
Company: Red Hat
Location: Raleigh
Posted on: April 2, 2026
|
|
|
Job Description:
Red Hat® OpenShift® AI is a flexible, scalable artificial
intelligence (AI) and machine learning (ML) platform that enables
enterprises to create and deliver AI-enabled applications at scale
across hybrid cloud environments. Built using open-source
technologies, OpenShift AI provides trusted, operationally
consistent capabilities for teams to experiment, serve models, and
deliver innovative apps. The OpenShift AI team seeks a Software
Engineer with Kubernetes and Model Inference Runtimes experience to
join our rapidly growing engineering team. Our team focuses on
making machine learning model deployment and monitoring seamless
and scalable across the hybrid cloud and the edge. This is a
fascinating opportunity to build and impact the next generation of
hybrid cloud MLOps platforms. What You Will Do Develop and maintain
a high-quality, high-performing ML inference runtime platform for
multi-modal and distributed model serving. Contribute directly to
upstream inference runtime communities such as vLLM , TGI , PyTorch
, OpenVINO , and others. Maintain CI/CD build pipelines for
container images that allow faster, more secure, reliable, and
frequent releases Coordination and communication with various
stakeholders Applying a growth mindset by staying up to date with
AI and ML advancements What You Will Bring Highly experienced with
programming in Python and PyTorch Familiarity with model
parallelization, quantization, and memory optimization using vLLM,
TGI, and other inference libraries. Experience with Python
packaging, such as PyPI libraries Solid understanding of the
fundamentals of model inference architectures Experience with
Jenkins, Git, shell scripting, and related technologies Experience
with the development of containerized applications in Kubernetes
Experience with Agile development methodologies Experience with
Cloud Computing using at least one of the following Cloud
infrastructures: AWS, GCP, Azure, or IBM Cloud Ability to work
across a large, distributed, hybrid engineering team ? Following is
considered a plus Experience with open-source development is a plus
Development experience with C++, especially with the CUDA APIs, is
a big plus ? AI-HIRING LI-MD2 The salary range for this position is
$116,270.00 - $191,840.00. Actual offer will be based on your
qualifications. Pay Transparency Red Hat determines compensation
based on several factors including but not limited to job location,
experience, applicable skills and training, external market value,
and internal pay equity. Annual salary is one component of Red
Hat’s compensation package. This position may also be eligible for
bonus, commission, and/or equity. For positions with Remote-US
locations, the actual salary range for the position may differ
based on location but will be commensurate with job duties and
relevant work experience. About Red Hat Red Hat is the world’s
leading provider of enterprise open source software solutions,
using a community-powered approach to deliver high-performing
Linux, cloud, container, and Kubernetes technologies. Spread across
40 countries, our associates work flexibly across work
environments, from in-office, to office-flex, to fully remote,
depending on the requirements of their role. Red Hatters are
encouraged to bring their best ideas, no matter their title or
tenure. We're a leader in open source because of our open and
inclusive environment. We hire creative, passionate people ready to
contribute their ideas, help solve complex problems, and make an
impact. Benefits ? Comprehensive medical, dental, and vision
coverage ? Flexible Spending Account - healthcare and dependent
care ? Health Savings Account - high deductible medical plan ?
Retirement 401(k) with employer match ? Paid time off and holidays
? Paid parental leave plans for all new parents ? Leave benefits
including disability, paid family medical leave, and paid military
leave ? Additional benefits including employee stock purchase plan,
family planning reimbursement, tuition reimbursement,
transportation expense account, employee assistance program, and
more! Note: These benefits are only applicable to full time,
permanent associates at Red Hat located in the United States.
Inclusion at Red Hat Red Hat’s culture is built on the open source
principles of transparency, collaboration, and inclusion, where the
best ideas can come from anywhere and anyone. When this is
realized, it empowers people from different backgrounds,
perspectives, and experiences to come together to share ideas,
challenge the status quo, and drive innovation. Our aspiration is
that everyone experiences this culture with equal opportunity and
access, and that all voices are not only heard but also celebrated.
We hope you will join our celebration, and we welcome and encourage
applicants from all the beautiful dimensions that compose our
global village. Equal Opportunity Policy (EEO) Red Hat is proud to
be an equal opportunity workplace and an affirmative action
employer. We review applications for employment without regard to
their race, color, religion, sex, sexual orientation, gender
identity, national origin, ancestry, citizenship, age, veteran
status, genetic information, physical or mental disability, medical
condition, marital status, or any other basis prohibited by law.
Red Hat does not seek or accept unsolicited resumes or CVs from
recruitment agencies. We are not responsible for, and will not pay,
any fees, commissions, or any other payment related to unsolicited
resumes or CVs except as required in a written contract between Red
Hat and the recruitment agency or party requesting payment of a
fee. Red Hat supports individuals with disabilities and provides
reasonable accommodations to job applicants. If you need assistance
completing our online job application, email
application-assistance@redhat.com . General inquiries, such as
those regarding the status of a job application, will not receive a
reply.
Keywords: Red Hat, Jacksonville , Senior Software Engineer - Model Inferencing, IT / Software / Systems , Raleigh, North Carolina