Role Overview

We are seeking a highly skilled AI Developer to design, develop, and deploy advanced AI systems. This role focuses on building scalable, production-grade solutions across LLM applications, system architecture, and inference optimization. You will work closely with cross-functional teams to deliver high-performance AI-driven products.

Key Responsibilities

AI Core Development

Design and implement Retrieval-Augmented Generation (RAG) pipelines, including embedding model fine-tuning, chunking strategies, and vector database integration (e.g., Qdrant, Milvus, Pinecone).
Develop and optimize model evaluation frameworks, including reranking algorithms and retrieval performance metrics (e.g., Hit Rate, MRR).
Build and maintain multimodal AI pipelines using Vision-Language Models (VLMs) and OCR for complex document parsing and structured data extraction.
Perform LLM fine-tuning using supervised fine-tuning (SFT) and synthetic data generation techniques.
Design and implement AI agent systems, including tool usage, skill orchestration, and task planning.

Software & System Architecture

Develop backend services using Python (FastAPI / Flask) or C#, and contribute to frontend development (Vue 3 / HTML) when needed.
Design and implement scalable, multi-user systems with robust access control (RBAC), logging (ELK / Loki), and database architectures (SQL / NoSQL).
Build high-performance, distributed systems leveraging protocols and frameworks such as TCP/IP, gRPC, and NATS.
Implement caching and middleware solutions (e.g., Redis) to support high-concurrency workloads and session management.

Inference Optimization & Operations

Deploy and optimize AI models using inference engines such as vLLM, llama.cpp, Ollama, or OVMS.
Analyze and optimize system performance based on hardware characteristics (GPU/NPU), including TOPS, KV cache usage, latency, and throughput.
Manage containerized environments using Docker / Podman, and support orchestration using Kubernetes (K8s).

Qualifications

Strong experience in AI/ML system development, particularly with LLM-based applications.
Proficiency in Python and/or C#, with solid software engineering fundamentals.
Hands-on experience with distributed systems, APIs, and cloud-native architectures.
Familiarity with modern AI infrastructure, including vector databases, inference engines, and container orchestration.
Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.

Nice to Have

Experience with multimodal AI applications and document intelligence.
Prior work on AI agents or autonomous systems.
Experience optimizing large-scale inference systems in production.
Business fluency in Mandarin (to facilitate communication with our Taiwan HQ).
Willingness to travel between the US, Canada, and Taiwan (HQ).

Equal Opportunity & Accommodations

We are an equal opportunity employer committed to diversity and inclusion. We welcome applications from all qualified individuals. Accommodations are available upon request at every stage of the selection process.

Pay: $10,000.00-$13,000.00 per month

Work Location: In person

AI Developer / Senior AI Developer
