For our client, we are looking for Azure AI Foundry Platform Engineer
Start: April 2026
Duration: 6 months (option of extension)
Workload: 100%
Location: Remote
Language: English
The primary mission of this role is to operate, manage, and evolve the Azure AI Foundry platform so that internal development teams can rely on a stable, secure, and well governed environment for building AI solutions.
This is a platform enablement and operations role, not an application development position. The ideal candidate is proactive, autonomous, understands platform level concepts, and can execute tasks without needing constant supervision.
Key Responsibilities:
Azure AI Foundry Platform Operations
• Administer and maintain Azure AI Foundry workspaces, environments, model catalogs, compute, and configurations.
• Manage lifecycle operations for LLMs, embeddings models, and model deployments within the platform (excluding agent deployment).
• Ensure reliability, scalability, and performance of AI Foundry environments across all stages (dev/test/prod).
Observability & Monitoring
• Configure and maintain logging, tracing, telemetry, and usage dashboards.
• Monitor performance, cost, quota consumption, and operational health of AI Foundry components.
• Implement alerting and diagnostics to ensure platform stability.
Governance & Security
• Apply and maintain Azure RBAC, workspace governance, and access policies inside the platform.
• Ensure consistency in workspace structure, permissions, and operational standards across teams.
• Support internal teams in onboarding and safe platform usage.
Documentation & Internal Enablement
• Produce high quality documentation, runbooks, platform guides, and onboarding materials.
• Assist internal teams in using AI Foundry capabilities effectively.
• Provide hands on troubleshooting and support for platform adoption.
Required Qualifications:
• Practical, hands on experience managing Azure platforms in production environments.
• Strong knowledge of Azure AI Foundry, Azure OpenAI, and related Azure AI/ML components.
• Familiarity with LLM lifecycle management, platform operations, and governance.
• Experience working autonomously within engineering or platform teams.
• Strong communication skills and ability to assist internal teams.
• Proven ability to produce clear and structured documentation.
Nice-to-Have Skills:
• Experience with Portkey for model routing, observability, caching, or guardrails.
• Knowledge of Azure API Management (APIM).
• Understanding of other AI ecosystem tools such as Promptflow, MLflow, LangChain, or Semantic Kernel.
• Exposure to cloud security best practices.
Candidate Profile
We are looking for someone who:
• Operates independently while keeping the team aligned.
• Understands platform engineering patterns and how developers consume AI capabilities.
• Is proactive, reliable, and capable of owning tasks end to end.
• Learns quickly and can adapt to an evolving AI platform ecosystem.