Senior Data Engineer, GenAI Platform
פורסם לפני 19 ימים · 0 מועמדים
התפקיד במילים פשוטות
התפקיד כולל עיצוב, בנייה ותחזוקה של ארכיטקטורות נתונים הניתנות להרחבה, התומכות במחקר בינה מלאכותית גנרטיבית ואב טיפוס מהיר. המהנדס יכין, יבנה, ימטב ויאצור מערכי נתונים מגוונים לאימון מודלי AI ולמידת מכונה, ויפתח צינורות נתונים גמישים ואוטומטיים כדי להאיץ ניסויים ופיתוח של GenAI.
- Design, build, and maintain scalable data architectures that support generative AI research and rapid prototyping
- Prepare, structure, optimize, and curate diverse datasets for AI and machine learning model training
- Develop flexible, automated data pipelines to accelerate GenAI experimentation and development cycles
- Partner closely with AI researchers and engineers to understand evolving data requirements
- Conduct deep data exploration to uncover insights and identify new opportunities for GenAI applications
- Experience integrating with BigQuery
חולץ מתיאור המשרה · מתעדכן אוטומטית
למי זה מתאים
התפקיד מתאים למהנדסי נתונים בעלי ניסיון של 5+ שנים, הבנה עמוקה בדרישות נתונים למערכות למידת מכונה ו-GenAI, ומומחיות בפלטפורמות נתונים מבוססות ענן (AWS, Google Cloud, Azure). נדרשת מיומנות חזקה ב-SQL ו-Python, וניסיון בבנייה ואופטימיזציה של צינורות נתונים עבור עומסי עבודה של AI/ML.
תיאור המשרה המלא
המשרה המקורית · נשמר לעיוןCompany Overview
Cellebrite’s (Nasdaq: CLBT) mission is to enable its global customers to protect and save lives by enhancing forensic investigations and intelligence gathering to accelerate justice in communities around the world. Cellebrite’s AI-powered Digital Investigation Platform enables customers to lawfully access, collect, analyze and share digital evidence in legally sanctioned investigations while preserving data privacy. Thousands of public safety organizations, intelligence agencies and businesses rely on Cellebrite’s digital forensic and investigative solutions—available via cloud, on‑premises and hybrid deployments—to close cases faster and safeguard communities.
To learn more, visit www.cellebrite.com and follow us on social media @Cellebrite.
Position Overview
We are assembling an elite, small-scale team of innovators focused on transforming generative AI from breakthrough concepts into real-world products. As a Senior Data Engineer, you will serve as the data backbone of this GenAI innovation group, enabling rapid experimentation, research, and prototyping. In this role, you will transform complex, raw data into high-quality, AI-ready assets that directly power Cellebrite’s next generation of digital intelligence capabilities.
Key Responsibilities And Requirements
Design, build, and maintain scalable data architectures that support generative AI research and rapid prototyping
Prepare, structure, optimize, and curate diverse datasets for AI and machine learning model training
Develop flexible, automated data pipelines to accelerate GenAI experimentation and development cycles
Partner closely with AI researchers and engineers to understand evolving data requirements
Conduct deep data exploration to uncover insights and identify new opportunities for GenAI applications
Ensure data quality, reliability, performance, and accessibility across multiple data domains
Optimize data processing workflows for large-scale and complex datasets
Apply strong data modeling and transformation practices to support advanced analytics and AI use cases
Technical Capabilities
Deep understanding of data requirements for machine learning and generative AI systems
Strong expertise in cloud-based data platforms (AWS, Google Cloud, or Azure)
Advanced proficiency in SQL and experience with both relational and NoSQL databases
Strong Python skills with a focus on data processing and automation
Experience building and optimizing data pipelines for AI/ML workloads
Hands-on experience with big data technologies and distributed data processing
Knowledge of performance tuning and data infrastructure optimization techniques
Experience integrating with BigQuery – advantage
Research And Innovation Skills
Proven ability to derive meaningful insights from complex, large-scale datasets
Creative and analytical approach to data preparation and feature engineering
Strong experimental mindset with rigorous analytical thinking
Ability to identify unique data-driven opportunities that can inspire new GenAI initiatives
Requirements:
Bachelor’s degree in Computer Science, Data Science, or a related field
5+ years of progressive experience in data engineering or related roles
Demonstrated experience working with cloud platforms, big data technologies, and AI-focused data pipelines
Show more
Show less
שאלות על המשרה
- המשרה לא ציינה שכר. אנחנו מציגים שכר רק כשהמעסיק מפרסם אותו.
- Design, build, and maintain scalable data architectures that support generative AI research and rapid prototyping, Prepare, structure, optimize, and curate diverse datasets for AI and machine learning model training, Develop flexible, automated data pipelines to accelerate GenAI experimentation and development cycles, Partner closely with AI researchers and engineers to understand evolving data requirements, Conduct deep data exploration to uncover insights and identify new opportunities for GenAI applications