Data Scientist
Middesk · New York, US
Job description
About Middesk:
Middesk makes it easier for businesses to work together. Since 2018, we’ve been transforming business identity verification, replacing slow, manual processes with seamless access to complete, up-to-date data. Our platform helps companies across industries confidently verify business identities, onboard customers faster, and reduce risk at every stage of the customer lifecycle.
Middesk came out of Y Combinator, is backed by Sequoia Capital and Accel Partners, and was recently named to Forbes Fintech 50 List.
About The Role:
We’re building AI-driven applications that simplify customer workflows, starting with business onboarding. With our proprietary identity data and deep domain expertise, we’re in a strong position to expand into a broader set of intelligent, risk-aware products.
We’re looking for a hands-on engineer to help build the foundation for these systems. This role is less about inventing new ML algorithms and more about applying the right techniques to messy, real-world problems. You’ve worked in fraud, risk, or trust domains, and you understand how bad actors behave, how data breaks, and how to still ship reliable systems anyway.
This is a highly technical, hands-on role with broad influence over how we design, build, and scale data-driven systems at Middesk.
We follow a hybrid work model, and for this role, there is an expectation of 2 days per week in our SF/NYC office. Candidates should be based within a commutable distance, as we believe in the value of in-person collaboration and building strong team connections while also supporting flexibility where possible.
What You’ll Do:
- Build fraud & risk systems
Design and ship production systems that detect and prevent fraud across KYB, trust & safety, and compliance workflows.
- Work with messy, real-world data
Tackle problems with extreme class imbalance, sparse signals, evolving adversarial behavior, and limited ground truth.
- Leverage relationships in data
Apply graph-based approaches and entity resolution techniques to uncover hidden connections and improve risk detection.
- Improve signal & labeling
Use a mix of heuristics, weak supervision, and modern AI tools (including LLMs where appropriate) to generate better features and labels.
- Help scale our infrastructure
Partner with engineering to build and evolve systems for feature generation, model training, and production deployment across multiple use cases.
What We’re Looking For:
- 5+ years of experience in fraud, risk, or trust & safety
You’ve worked on real-world fraud or abuse problems and understand the domain deeply.
- Experience building and shipping production systems
You’ve deployed models or data-driven systems that power external-facing products.
- Strong foundation in applied ML or data systems
Comfortable working on classification problems with real-world constraints like imbalanced data, sparse signals, and changing patterns.
- Experience with graph or relational data approaches
Familiarity with knowledge graphs, network analysis, or entity linking is strongly preferred.
- Hands-on and pragmatic
You focus on impact over perfection and know how to balance speed, accuracy, and maintainability.
Compensation Range: $175K - $210K
ML/AI Work links you to the employer's original posting — always verify the details there before applying.
More Data Science roles
View all →Data Scientist – Seed Robotics & AI
Enza Zaden · Amersfoort, NL
Senior Data Scientist - Government & Public Services
Deloitte · Baltimore, US
DATA SCIENTIST LEAD L1(CONTRACT)
Wipro UK · Milton Keynes, GB
Data Scientist, Behavior Evaluation
Zoox · Boston, US
Data Scientist, Autonomy Behavior Monitoring
Zoox · Boston, US
Data Scientist, Behavior Evaluation
Zoox · Oakland, US