Where great talent finds impact jobs

Didn’t find the opportunity you were looking for? Join our talent pool and get contacted later.
0
companies
0
jobs

Junior Data Scientist (Intern)

Vanilla Steel

Vanilla Steel

Data Science
Berlin, Germany
Posted on Aug 28, 2025
Your mission

As a Junior Data Scientist during our 6 month internship, you’ll be an active member of our data science team, responsible for designing, building, and deploying components of our data and machine learning stack. The position will be on-site in our Berlin office. Example projects include:

  • Designing and maintaining ETL pipelines to process supplier and customer data from Excel, PDF, ERP, and unstructured sources.

  • Building data wrangling, cleaning, and validation pipelines to ensure high-quality, reliable datasets for downstream models.

  • Contributing to recommendation systems based on live inventory, purchase behavior, and customer preferences.

  • Using Prefect, Airflow and MLflow to manage workflows, track experiments, and deploy models.

  • Evaluating model performance and proposing improvements.

  • Supporting the creation of dashboards and KPIs to monitor algorithm performance and business impact.

  • Collaborating with engineers, product managers, and business stakeholders to align technical solutions with real customer needs.

Your profile

We’re looking for someone who is junior but job-ready - able to contribute independently while still growing their skills. You should have:

  • ~1 year of practical experience in data science, machine learning, or data engineering (internship, research, or industry).

  • Strong skills in Python (Pandas, NumPy, Scikit-learn, etc.).

  • Experience with SQL and working with structured data.

  • Knowledge of ETL pipeline design and data wrangling.

  • Familiarity with MLOps tools such as Airflow or MLflow.

  • Solid understanding of machine learning fundamentals (training, validation, evaluation, deployment).

  • An advanced degree in Computer Science, Mathematics, Statistics, or another quantitative discipline.

  • The ability to communicate clearly, explain your approach, and work well with both technical and non-technical team members.

NICE TO HAVE

  • Experience with graph databases (e.g., Neo4j) and writing Cypher queries.

  • Exposure to large language models (LLMs) and their ecosystem (prompt engineering, RAG, CrewAI, Agno, LangChain, or LangGraph).

  • Hands-on experience with ML frameworks such as PyTorch or TensorFlow.

  • Familiarity with deploying models on cloud platforms (AWS, GCP, or Azure).

Why us?
  • Ownership of projects with a focus on outcomes rather than time-tracking

  • Fast-paced yet collaborative culture fostering individual performance and teamwork

  • Competitive compensation based on experience

  • Subsidized Urban Sports Club membership

  • Subsidized Deutschlandticket

  • Hybrid work format

  • Beautiful office located in the heart of Prenzlauer Berg, Berlin

  • Regular team building events, company breakfasts and Friday drinks

About us

We are a Berlin-based start-up that has successfully established a leading B2B marketplace for industrial metals across Europe.

The multi-billion-euro metal trading industry is operated on Excel, PDF and Email. We are on a mission to transform buying and selling in one of the oldest industries of the modern world with seamless and intuitive digital solutions. Our technologies increase liquidity, accelerate transactions, reduce scrapping rates and enhance buying convenience for hundreds of steel and metal distributors across Europe.

We are a lean team of young, international, and passionate talents gearing up for the next phase of growth. We are looking for an outstanding individual who wants to join an exciting early stage start-up.