Lead Data Scientist

Your role at Dynatrace

Dynatrace makes it easy and simple to monitor and run the most complex, hyper-scale multicloud
systems. Dynatrace is a full stack and completely automated monitoring solution that can trackevery user, every transaction, across every application.

Our team is looking for a Lead Data Scientist specialized in Large Language Models (LLMs) to design, build, and scale generative AI capabilities for real-world, enterprise-grade use cases. In this hands-on technical leadership role, you’ll own the end-to-end LLM stack, from data/knowledge, Ingestion and retrieval to prompt and tool-use architecture, evaluation frameworks,safety/guardrails, and cost/latency optimization.

Your Tasks
• Own the LLM system architecture: Retrieval pipelines, prompt/tool design,
routing/fallbacks, safety layers, and telemetry, optimized for quality, latency, and cost.
• Establish technical standards for RAG: content ingestion, chunking/windowing, hybrid
retrieval, reranking, query understanding, and structured output contracts.
• Define evaluation strategy: Create a rigorous eval suite covering answer correctness,
attribution/grounding, toxicity/safety, privacy leakage, determinism, latency, and cost.
• Formalize LLMOps: Versioning for prompts/datasets/models, experiment governance,
prompt and dataset registries, and promotion criteria from dev - staging - prod.
• Drive tool/agent design: API schema design for function calling, error handling, recovery
strategies, self-correction, and guardrail integration.
• Make build-vs-buy calls: Weigh managed providers vs. open-source/self-hosted,
considering performance, cost, IP, privacy, and compliance.
• Mentoring: Provide deep technical mentorship on prompting, retrieval design, evals, and
safe deployment; lead reviews of prompts, pipelines, and evaluation reports.

Hands-on Data Science
• Implement end-to-end RAG systems: ingestion - chunking - embeddings - hybrid search -
rerank - prompt assembly - tool calls - post-processing.
• Engineer robust prompts/tools: reusable templates, multi-turn strategies, structured
outputs via JSON Schema/Pydantic.

• Select/tune models: foundation models, embeddings, rerankers; apply LoRA/PEFT or
distillation when justified.
• Build eval corpora: golden sets, KPIs for accuracy, groundedness, deflection, tool
success.
• Implement guardrails: PII/PHI detection, policy prompts, jailbreak resistance, filters,
safety scorecards.
• Productionize: ship resilient services with analytics, alerts (drift, quality, cost), SLOs, etc.
• Optimize for scale: token, latency, cost; caching, context packing, batching, speculative
decoding, routing by intent

What will help you succeed

Minimum requirements:
• Advanced CS/AI/ML degree or equivalent, strong ML background.
• 7+ years DS/ML, 3+ years NLP /LLMs, shipped production systems.
• Python and core ML stack: 5+ years of professional Python.
• Data engineering for unstructured data (3+ years): text processing, parsing, embedding-
friendly preprocessing.
• Proven RAG expertise (1+ years): embeddings, retrieval, reranking, chunking.
• Evaluation depth (1+ years): offline/online evals for accuracy, grounding, safety.
• Safety/privacy (1+ years): moderation, PII/PHI redaction, policy enforcement.
• LLMOps (1+ years): prompt/version management, experiment tracking, monitoring.
• Excellent communication: explain trade-offs, drive data decisions.

Desirable experiance:
• Serving/scaling: vLLM/TGI, Ray Serve, Triton; GPU/CPU trade-offs.
• Tuning/distillation: LoRA/PEFT, safety alignment, synthetic data.
• Domain: observability, support systems, multilingual, regulated environments.
• Cloud/security: Snowflake/AWS, managed vs self-hosted.
• Experience with graph-based knowledge bases (e.g., GraphDB, Neo4j) and knowledge
graphs to complement RAG systems with entity modeling and relationship-aware retrieval.

Why you will love being a Dynatracer

• Working models that offer you the flexibility you need, ranging from full remote options to
hybrid ones combining home and in-office work
• A team that thinks outside the box, welcomes unconventional ideas, and pushes
boundaries
• An environment that fosters innovation enables creative collaboration and allows you to
grow
• A globally unique and tailor-made career development program recognizing your
potential, promoting your strengths, and supporting you in achieving your career goals
• A truly international mindset with Dynatracers from different countries and cultures all
over the world, and English as the corporate language that connects us all
• A culture that is being shaped by our global team’s diverse personalities, expertise, and
backgrounds
• A relocation team that is eager to help you start your journey to a new country, always
there to support and by your side. If you need to relocate for a position you’re applying for,
we offer you a relocation allowance and support with your visa, work permit,accommodation .

4855

Barcelona

ES

Data Science and Research

Flex

Full-time