Senior AI Engineer (LLM & Multi-Agent Systems)
פורסם 17 במאי · 96 מועמדים
התפקיד במילים פשוטות
המהנדס הבכיר בתחום ה-AI יעסוק בפיתוח מערכת ניתוח פיננסי מבוססת מודלי שפה גדולים (LLM) ומערכות מרובות סוכנים. התפקיד כולל תכנון ארכיטקטורת סוכנים, אופטימיזציה של אינטראקציות עם מודלים, והבטחת אמינות המערכות בסביבת ייצור. העבודה תכלול גם אופטימיזציה של ביצועים, הטמעת כלי ניטור והערכה, ושיפור אסטרטגיות אחזור מידע מתקדמות.
- Strong proficiency in modern Python
- Deep understanding of asynchronous programming (asyncio) patterns
- Experience with FastAPI and Pydantic (v2)
- Production experience with LangChain
- Hands-on experience or deep conceptual understanding of LangGraph (or similar state-machine based agent frameworks)
- Experience with Elasticsearch (DSL queries, analyzers)
- Knowledge of vector databases and embedding models
חולץ מתיאור המשרה · מתעדכן אוטומטית
למי זה מתאים
התפקיד מתאים למומחים בפייתון עם הבנה עמוקה בתכנות אסינכרוני וניסיון עם FastAPI ו-Pydantic. נדרש ניסיון בייצור עם LangChain והבנה מעמיקה ב-LangGraph, כמו גם מומחיות בטיפול באי-דטרמיניזם של LLM, יצירת פלטים מובנים ואופטימיזציה של הקשר. מי שמחפש תפקיד הדורש התמודדות עם אתגרי ביצועים ואמינות במערכות AI מורכבות ימצא את התפקיד מתאים.
תיאור המשרה המלא
המשרה המקורית · נשמר לעיוןJoin a Company That Invests in You
Seeking Alpha is the world’s leading community of engaged investors. We’re the go-to destination for investors looking for actionable stock market opinions, real-time market analysis, and unique financial insights. At the same time, we’re also dedicated to creating a workplace where our team thrives. We’re passionate about fostering a flexible, balanced environment with remote work options and an array of perks that make a real difference.
Here, your growth matters. We prioritize your development through ongoing learning and career advancement opportunities, helping you reach new milestones. Join Seeking Alpha to be part of a company that values your unique journey, supports your success, and champions both your personal well-being and professional goals.
What We're Looking For
Role Overview: We are developing Ask Seeking Alpha — a high-load financial analysis system based on Large Language Models. The architecture is built on complex multi-agent orchestration using LangGraph, FastAPI, and Elasticsearch.
We are looking for a Senior Backend Engineer specialized in Generative AI to design agent workflows, optimize interactions with models (OpenAI, AWS Bedrock), and ensure the reliability of non-deterministic systems in production.
Tech Stack: Python (Asyncio), FastAPI, LangChain, LangGraph, Pydantic, Elasticsearch, AWS Bedrock / OpenAI API, LangSmith.
What You'll Do
Agent Architecture: Design and implement complex agent orchestration logic using LangGraph. You will define state management, conditional routing, and error handling within the agent graph.
Tool Engineering: Build and optimize the tool layer (function calling) that allows LLMs to interact with internal financial APIs and databases accurately.
Performance Optimization:
Reduce end-to-end latency through asynchronous processing and streaming (SSE).
Implement semantic caching strategies to minimize API costs and response time.
Optimize token usage without sacrificing answer quality.
Observability & Evaluation: Implement automated evaluation pipelines using LangSmith. You will be responsible for setting up regression testing for prompts and agents to measure quality (correctness, faithfulness) before deployment.
Advanced RAG: Refine retrieval strategies. Work on hybrid search implementation (Keyword + Vector), re-ranking, and query expansion to feed the most relevant context to the model.
Requirements
Python Expert: Strong proficiency in modern Python. Deep understanding of asynchronous programming (asyncio) patterns is mandatory, as our entire I/O pipeline (Network, DB, LLM) is non-blocking. Experience with FastAPI and Pydantic (v2).
Agentic Frameworks: Production experience with LangChain. Hands-on experience or deep conceptual understanding of LangGraph (or similar state-machine based agent frameworks).
Deep LLM Expertise (What we mean by "Deep"):
Non-determinism Management: Strategies for handling LLM hallucinations and ensuring reliable outputs (e.g., self-correction loops, specific prompting techniques like CoT/ReAct).
Structured Outputs: Experience forcing LLMs to adhere to strict schemas (Pydantic/JSON mode) for reliable downstream processing.
Context Optimization: Advanced strategies for managing limited context windows (summarization chains, sliding windows, selective context injection) beyond simple truncation.
Inference Economics: Understanding the trade-offs between model size, latency, and cost (e.g., when to route to GPT-4 vs. a smaller/faster model).
Nice to Have
Experience with Elasticsearch (DSL queries, analyzers).
Knowledge of vector databases and embedding models.
Background in FinTech or familiarity with financial data structures.
Show more
Show less
שאלות על המשרה
- המשרה לא ציינה שכר. אנחנו מציגים שכר רק כשהמעסיק מפרסם אותו.
- מרחוק
- Strong proficiency in modern Python, Deep understanding of asynchronous programming (asyncio) patterns, Experience with FastAPI and Pydantic (v2), Production experience with LangChain, Hands-on experience or deep conceptual understanding of LangGraph (or similar state-machine based agent frameworks)