AI Ops Engineer - Doha, Qatar
1 day ago

Job Description
Job Summary
The AI Ops Engineer manages the deployment, monitoring, and maintenance of AI
models. This role involves ensuring the reliability, scalability, and
performance of AI systems, collaborating with cross-functional teams to
optimize AI operations, and troubleshooting issues as they arise.

Responsibilities and Duties
- Deploy, monitor, and maintain AI models and systems to ensure optimal performance and reliability.
- Implement and manage CI/CD pipelines for the continuous integration and delivery of AI models.
- Collaborate with data scientists, AI engineers, and other stakeholders to understand model requirements and ensure successful deployment.
- Monitor the performance of AI models and systems, identifying and resolving issues promptly.
- Develop and maintain automated monitoring and alerting systems to ensure the health and performance of AI systems.
- Optimize AI models and infrastructure for scalability and efficiency.
- Ensure compliance with data governance, security, and regulatory standards in AI operations.
- Document deployment procedures, monitoring processes, and maintenance activities.
- Stay updated with the latest advancements in AI operations and infrastructure technologies.
- Provide technical support and guidance to junior team members.
- Participate in project planning and contribute to the development of project timelines and deliverables.
- Perform other duties relevant to the job as assigned by the Sr. AI Ops Engineer or senior management.

Requirements
- Bachelor's degree in Computer Science, Information Technology, or a related field
- Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer) are preferred
- Minimum of 3 years of experience in AI operations, DevOps, or related fields
- Experience in managing the deployment and maintenance of AI models
- Strong programming skills in languages such as Python
- Proficiency in AI and machine learning frameworks (e.g., TensorFlow, PyTorch)
- Experience with CI/CD tools (e.g., Jenkins, GitLab CI)
- Excellent problem-solving and troubleshooting skills
- Strong communication and interpersonal skills
- In-depth knowledge of AI operations and infrastructure management
- Familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud) and their AI services
- Understanding of data governance, security, and regulatory standards
- Ability to manage multiple tasks and prioritize effectively
- Strong attention to detail and commitment to delivering high-quality work
- Ability to work independently and as part of a team
- Programming languages (e.g., Python)
- AI and machine learning frameworks (e.g., TensorFlow, PyTorch)
- CI/CD tools (e.g., Jenkins, GitLab CI)
- Cloud platforms (e.g., AWS, Azure, Google Cloud)
- Monitoring and logging tools (e.g., Prometheus, ELK Stack)
- Collaboration and communication tools (e.g., Slack, Microsoft Teams)