Full Stack Software Engineer (AI Infrastructure)
Bytoa · Baltimore, US
Job description
Laurel, MD
Description:
Bytoa is seeking a Full-Stack Software Engineer to support our AI infrastructure team.
In this role, you’ll help build and maintain the platform that provides the foundation for the customer’s AI capabilities, focusing on inference services while supporting the broader ecosystem of AI-enabled applications.
This role is intended for experienced engineers who can independently design, implement, and operate scalable AI infrastructure components.
Salary range: $200,000-$220,000
Disclaimer:
Salary for this position, along with additional compensation options, will be determined on an individual basis following the interview process, considering various factors such as years of experience, skills, education/certifications, contract specifications, market conditions, etc.
Responsibilities:
- Design, implement, and optimize infrastructure for AI model inference at scale.
- Support the development and maintenance of production AI services and applications, including retrieval-augmented generation (RAG), autonomous agents, and emerging technologies.
- Navigate ambiguity and define solutions for underspecified systems and requirements.
- Drive adoption of new technologies and practices across engineering teams.
- Implement monitoring, logging, and observability solutions for AI services.
- Automate infrastructure provisioning and configuration using IaC principles.
- Ensure high availability, reliability, and performance of AI platform components.
- Contribute to security best practices for AI systems and data.
- Provide technical guidance and informal mentorship to junior engineers.
Skill Requirements:
- Proven experience building and maintaining production systems at scale.
- Experience with high-volume web application architecture and performance optimization.
- Strong background in systems integration across diverse technologies and platforms.
- Hands-on experience with cloud engineering in AWS.
- Proficiency with Kubernetes administration and deployment patterns.
- Strong Python programming skills.
- Experience implementing observability solutions (APM, OpenTelemetry, Grafana, Prometheus).
- Familiarity with CI/CD pipelines and DevOps practices.
- Strong change management and organizational influence skills.
- Ability to thrive in ambiguous environments and create structure where needed.
- Excellent communication and collaboration skills.
Nice to Haves:
- Experience with AI inference serving technologies (vLLM, LiteLLM, etc.).
- Previous experience with agentic frameworks (LangChain).
- Knowledge of vector databases and embedding systems.
- Experience with high-performance computing or distributed systems.
Experience Requirement:
- 8 years of experience with a B.S. in a technical discipline, or 4 additional years of experience in place of a B.S.
Clearance Requirement:
- Active TS/SCI with a polygraph
Referrals & Inquiries
Do you know a cleared professional seeking to advance their career? Interested in earning some extra cash? If so, refer them to us with their name and contact details, and you could be eligible for a referral bonus of up to $10,000 for each successful hire.
Not seeing the right position right now? Reach out to us, and we’ll notify you as new contracts and opportunities become available!