ML/AIWork

AI Data Intern

Toppan Digital Language · Seville, ES

Job description

  • Part-time (35 hours)
  • AI
  • Spain

COMPANY DESCRIPTION & JOB PURPOSE

Hello, we’re TOPPAN Digital Language. We’re a language solutions provider that’s enabled by tech. Our mission is to help global companies with high-risk, business-critical content sell with confidence in any language.

We aim to be the #1 localization partner for companies in life sciences and healthcare, market insights, retail and e-commerce, and financial services. Operating globally, we design language solutions with the best teams and the best tech to meet our customers’ needs at speed, at scale—securely. We are obsessed with our clients’ success, and as importantly, our employees’. We want to build a reputable company that attracts, develops, and retains exceptional talent in our industry.

Our values reflect the thinking of our founders who have decades of trusted industry and localization expertise. They act as a major force in shaping our company culture: an entrepreneurial spirit that embraces a growth mindset and an ethos of inspiring excellence.

This internship is an opportunity to gain hands-on experience at the intersection of data, AI and language technology. The AI Data Intern will support TDL’s AI Solutions team in ensuring that the data powering our AI models and solutions is accurate, well-organised and ready for training and evaluation. Working closely with the AI Data Research Director and the team of AI Engineers, the intern will play a meaningful role in the quality and reliability of TDL’s AI systems and solutions.

Location: Seville

Working model: On-site

ACCOUNTABILITIES AND RESPONSIBILITIES

As an AI Data Intern, you will work hands-on with production-grade AI systems, data quality processes, and industry-standard tools, while deepening your understanding of responsible AI principles in real business contexts. Throughout the internship, your responsibilities will be:

Data Quality & Validation

  • Support the review and validation of datasets for AI training, identifying errors and assisting the team in resolving issues.
  • Assist with developing and refining data quality checklists and validation protocols.

Data Cataloguing & Organisation

  • Assist in cataloguing and organising datasets, ensuring consistent metadata tagging and version control.
  • Support documentation of data collection and preprocessing procedures.

Data Preparation for AI Training & Evaluation

  • Assist in cleaning, formatting, and preprocessing datasets for AI model training.
  • Support preparation and verification of evaluation/test sets.

AI Assets Creation

  • Support the creation of labelled datasets, glossaries, and linguistic assets for AI training and evaluation.
  • Assist with documenting and version-controlling AI assets.

QUALIFICATIONS AND EXPERIENCE

  • Currently enrolled in or recently completed a Bachelor’s or Master’s degree in Linguistics, Computational Linguistics, Translation and Interpreting, AI or a related field.
  • Basic familiarity with Python or another scripting language for data processing or willingness to learn.
  • Comfortable working with structured data (CSV, JSON, XLIFF) and spreadsheet tools (Excel or Google Sheets).
  • Native or proficient in English is mandatory. Knowledge of additional languages is advantageous.

SKILLS

  • Strong organisational skills and a methodical approach to reviewing and organising information.
  • High attention to detail and ability to document findings clearly.
  • Collaborative, proactive, and eager to learn in a multidisciplinary environment.
  • Willingness to learn new tools, technologies, and processes—and to seek feedback to support continuous improvement—is essential for success in this role
  • Interest in AI, data, and language technology, with awareness of data privacy principles and responsibility when handling sensitive data.
  • Curiosity about AI tools and platforms; exposure to annotation tools (e.g. Label Studio, Prodigy) or NLP libraries (e.g. HuggingFace, spaCy) is a plus.

Toppan Digital Language is committed to equality in employment and is able to consider any necessary adjustments during the recruitment process.

ML/AI Work links you to the employer's original posting — always verify the details there before applying.

More MLOps and Platform roles

View all →
AI Data Intern
Toppan Digital Language
Apply →