AI Developer / Senior AI Developer
— · Vancouver, CA
Job description
Role Overview
We are seeking a highly skilled AI Developer to design, develop, and deploy advanced AI systems. This role focuses on building scalable, production-grade solutions across LLM applications, system architecture, and inference optimization. You will work closely with cross-functional teams to deliver high-performance AI-driven products.
Key Responsibilities
AI Core Development
- Design and implement Retrieval-Augmented Generation (RAG) pipelines, including embedding model fine-tuning, chunking strategies, and vector database integration (e.g., Qdrant, Milvus, Pinecone).
- Develop and optimize model evaluation frameworks, including reranking algorithms and retrieval performance metrics (e.g., Hit Rate, MRR).
- Build and maintain multimodal AI pipelines using Vision-Language Models (VLMs) and OCR for complex document parsing and structured data extraction.
- Perform LLM fine-tuning using supervised fine-tuning (SFT) and synthetic data generation techniques.
- Design and implement AI agent systems, including tool usage, skill orchestration, and task planning.
Software & System Architecture
- Develop backend services using Python (FastAPI / Flask) or C#, and contribute to frontend development (Vue 3 / HTML) when needed.
- Design and implement scalable, multi-user systems with robust access control (RBAC), logging (ELK / Loki), and database architectures (SQL / NoSQL).
- Build high-performance, distributed systems leveraging protocols and frameworks such as TCP/IP, gRPC, and NATS.
- Implement caching and middleware solutions (e.g., Redis) to support high-concurrency workloads and session management.
Inference Optimization & Operations
- Deploy and optimize AI models using inference engines such as vLLM, llama.cpp, Ollama, or OVMS.
- Analyze and optimize system performance based on hardware characteristics (GPU/NPU), including TOPS, KV cache usage, latency, and throughput.
- Manage containerized environments using Docker / Podman, and support orchestration using Kubernetes (K8s).
Qualifications
- Strong experience in AI/ML system development, particularly with LLM-based applications.
- Proficiency in Python and/or C#, with solid software engineering fundamentals.
- Hands-on experience with distributed systems, APIs, and cloud-native architectures.
- Familiarity with modern AI infrastructure, including vector databases, inference engines, and container orchestration.
- Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.
Nice to Have
- Experience with multimodal AI applications and document intelligence.
- Prior work on AI agents or autonomous systems.
- Experience optimizing large-scale inference systems in production.
- Business fluency in Mandarin (to facilitate communication with our Taiwan HQ).
- Willing to travel among US, Canada, and Taiwan (HQ).
Equal Opportunity & Accommodations
We are an equal opportunity employer committed to diversity and inclusion. We welcome applications from all qualified individuals. Accommodations are available upon request for candidates taking part in all aspects of the selection process.
Pay: $10,000.00-$13,000.00 per month
Work Location: In person
ML/AI Work links you to the employer's original posting — always verify the details there before applying.
More Core AI Engineering roles
View all →Data & AI engineering intern - Paris, France
Green Giraffe · Paris, FR
INTERNSHIP - AI SOFTWARE ENGINEER– (Bac+5, end-of-study) F/M
Trekea SAS · Paris, FR
Senior AI Developer
neeyo Group GmbH · Wiener Neustadt, AT
AI Engineer (gn)
The Quality Group GmbH · Remote · Berlin
AI Engineer (gn)
The Quality Group GmbH · Remote · Hamburg
AI Engineer Trainee (m/w/d)
Estateanfrage · Munich, DE