Sr. AI Ops Engineer - Doha, , Qatar
منذ يوم

وصف الوظيفة
<\/h3>
Job Summary<\/span>
<\/h3>
The Sr. AI Ops Engineer manages the deployment, monitoring, and maintenance of AI models and systems. This role involves ensuring the reliability, scalability, and performance of AI systems, collaborating with cross -functional teams to optimize AI operations, and troubleshooting issues as they arise.<\/span>
<\/p>
Responsibilities and Duties<\/span>
<\/h3>- Deploy, monitor, and maintain AI models and
systems to ensure optimal performance and reliability.<\/span><\/span>
<\/span><\/span><\/li> - Implement and manage CI/CD pipelines for
the continuous integration and delivery of AI models.<\/span><\/span>
<\/span><\/span><\/li> - Collaborate with data scientists, AI
engineers, and other stakeholders to understand model requirements and ensure
successful deployment.<\/span><\/span>
<\/span><\/span><\/li> - Monitor the performance of AI models and
systems, identifying and resolving issues promptly.<\/span><\/span>
<\/span><\/span><\/li> - Develop and maintain automated monitoring
and alerting systems to ensure the health and performance of AI systems.<\/span><\/span>
<\/span><\/span><\/li> - Optimize AI models and infrastructure for
scalability and efficiency.<\/span><\/span>
<\/span><\/span><\/li> - Ensure compliance with data governance,
security, and regulatory standards in AI operations.<\/span><\/span>
<\/span><\/span><\/li> - Document deployment procedures, monitoring
processes, and maintenance activities.<\/span><\/span>
<\/span><\/span><\/li> - Stay updated with the latest advancements
in AI operations and infrastructure technologies.<\/span><\/span>
<\/span><\/span><\/li> - Provide technical support and guidance to
AI Ops engineers and other team members.<\/span><\/span>
<\/span><\/span><\/li> - Participate in project planning and
contribute to the development of project timelines and deliverables.<\/span><\/span>
<\/span><\/span><\/li> - Perform other duties relevant to the job as
assigned by the Principal AI Ops Engineer or senior management.<\/span><\/span>
<\/span><\/li><\/ul>
<\/div><\/span>
Requirements<\/h3>- Bachelor's degree in Computer Science,
Information Technology, or a related field<\/span><\/span>
<\/span><\/span><\/li> - Relevant certifications (e.g., AWS
Certified DevOps Engineer, Google Cloud Professional DevOps Engineer) are
preferred<\/span><\/span>
<\/span><\/span><\/li> - Minimum of 5 years of experience in AI
operations, DevOps, or related fields<\/span><\/span>
<\/span><\/span><\/li> - Experience in managing the deployment and
maintenance of AI models<\/span><\/span>
<\/span><\/span><\/li> - Strong programming skills in languages such
as Python ,etc.<\/span><\/span>
<\/span><\/span><\/li> - Proficiency in AI and machine learning
frameworks (e.g., TensorFlow, PyTorch)<\/span><\/span>
<\/span><\/span><\/li> - Experience with CI/CD tools (e.g., Jenkins,
GitLab CI)<\/span><\/span>
<\/span><\/span><\/li> - Excellent problem -solving and
troubleshooting skills<\/span><\/span>
<\/span><\/span><\/li> - Strong communication and interpersonal
skills<\/span><\/span>
<\/span><\/span><\/li> - In -depth knowledge of AI operations and
infrastructure management<\/span><\/span>
<\/span><\/span><\/li> - Familiarity with cloud platforms (e.g.,
AWS, Azure, Google Cloud) and their AI services<\/span><\/span>
<\/span><\/span><\/li> - Understanding of data governance, security,
and regulatory standards<\/span><\/span>
<\/span><\/span><\/li> - Ability to manage multiple tasks and
prioritize effectively<\/span><\/span>
<\/span><\/span><\/li> - Strong attention to detail and commitment
to delivering high -quality work<\/span><\/span>
<\/span><\/span><\/li> - Ability to work independently and as part
of a team<\/span><\/span>
<\/span><\/span><\/li> - Programming languages (e.g., Python, Java)<\/span><\/span>
<\/span><\/span><\/li> - AI and machine learning frameworks (e.g.,
TensorFlow, PyTorch)<\/span><\/span>
<\/span><\/span><\/li> - CI/CD tools (e.g., Jenkins, GitLab CI)<\/span><\/span>
<\/span><\/span><\/li> - Cloud platforms (e.g., AWS, Azure, Google
Cloud)<\/span><\/span>
<\/span><\/span><\/li> - Monitoring and logging tools (e.g.,
Prometheus, ELK Stack)<\/span><\/span>
<\/span><\/span><\/li> - Collaboration and communication tools
(e.g., Slack, Microsoft Teams)<\/span><\/span>
<\/span><\/span><\/li><\/ul>
<\/div><\/span>
systems to ensure optimal performance and reliability.<\/span><\/span>
<\/span><\/span><\/li>
the continuous integration and delivery of AI models.<\/span><\/span>
<\/span><\/span><\/li>
engineers, and other stakeholders to understand model requirements and ensure
successful deployment.<\/span><\/span>
<\/span><\/span><\/li>
systems, identifying and resolving issues promptly.<\/span><\/span>
<\/span><\/span><\/li>
and alerting systems to ensure the health and performance of AI systems.<\/span><\/span>
<\/span><\/span><\/li>
scalability and efficiency.<\/span><\/span>
<\/span><\/span><\/li>
security, and regulatory standards in AI operations.<\/span><\/span>
<\/span><\/span><\/li>
processes, and maintenance activities.<\/span><\/span>
<\/span><\/span><\/li>
in AI operations and infrastructure technologies.<\/span><\/span>
<\/span><\/span><\/li>
AI Ops engineers and other team members.<\/span><\/span>
<\/span><\/span><\/li>
contribute to the development of project timelines and deliverables.<\/span><\/span>
<\/span><\/span><\/li>
assigned by the Principal AI Ops Engineer or senior management.<\/span><\/span>
<\/span><\/li><\/ul>
<\/div><\/span>
Requirements<\/h3>- Bachelor's degree in Computer Science,
Information Technology, or a related field<\/span><\/span>
<\/span><\/span><\/li> - Relevant certifications (e.g., AWS
Certified DevOps Engineer, Google Cloud Professional DevOps Engineer) are
preferred<\/span><\/span>
<\/span><\/span><\/li> - Minimum of 5 years of experience in AI
operations, DevOps, or related fields<\/span><\/span>
<\/span><\/span><\/li> - Experience in managing the deployment and
maintenance of AI models<\/span><\/span>
<\/span><\/span><\/li> - Strong programming skills in languages such
as Python ,etc.<\/span><\/span>
<\/span><\/span><\/li> - Proficiency in AI and machine learning
frameworks (e.g., TensorFlow, PyTorch)<\/span><\/span>
<\/span><\/span><\/li> - Experience with CI/CD tools (e.g., Jenkins,
GitLab CI)<\/span><\/span>
<\/span><\/span><\/li> - Excellent problem -solving and
troubleshooting skills<\/span><\/span>
<\/span><\/span><\/li> - Strong communication and interpersonal
skills<\/span><\/span>
<\/span><\/span><\/li> - In -depth knowledge of AI operations and
infrastructure management<\/span><\/span>
<\/span><\/span><\/li> - Familiarity with cloud platforms (e.g.,
AWS, Azure, Google Cloud) and their AI services<\/span><\/span>
<\/span><\/span><\/li> - Understanding of data governance, security,
and regulatory standards<\/span><\/span>
<\/span><\/span><\/li> - Ability to manage multiple tasks and
prioritize effectively<\/span><\/span>
<\/span><\/span><\/li> - Strong attention to detail and commitment
to delivering high -quality work<\/span><\/span>
<\/span><\/span><\/li> - Ability to work independently and as part
of a team<\/span><\/span>
<\/span><\/span><\/li> - Programming languages (e.g., Python, Java)<\/span><\/span>
<\/span><\/span><\/li> - AI and machine learning frameworks (e.g.,
TensorFlow, PyTorch)<\/span><\/span>
<\/span><\/span><\/li> - CI/CD tools (e.g., Jenkins, GitLab CI)<\/span><\/span>
<\/span><\/span><\/li> - Cloud platforms (e.g., AWS, Azure, Google
Cloud)<\/span><\/span>
<\/span><\/span><\/li> - Monitoring and logging tools (e.g.,
Prometheus, ELK Stack)<\/span><\/span>
<\/span><\/span><\/li> - Collaboration and communication tools
(e.g., Slack, Microsoft Teams)<\/span><\/span>
<\/span><\/span><\/li><\/ul>
<\/div><\/span>
Information Technology, or a related field<\/span><\/span>
<\/span><\/span><\/li>
Certified DevOps Engineer, Google Cloud Professional DevOps Engineer) are
preferred<\/span><\/span>
<\/span><\/span><\/li>
operations, DevOps, or related fields<\/span><\/span>
<\/span><\/span><\/li>
maintenance of AI models<\/span><\/span>
<\/span><\/span><\/li>
as Python ,etc.<\/span><\/span>
<\/span><\/span><\/li>
frameworks (e.g., TensorFlow, PyTorch)<\/span><\/span>
<\/span><\/span><\/li>
GitLab CI)<\/span><\/span>
<\/span><\/span><\/li>
troubleshooting skills<\/span><\/span>
<\/span><\/span><\/li>
skills<\/span><\/span>
<\/span><\/span><\/li>
infrastructure management<\/span><\/span>
<\/span><\/span><\/li>
AWS, Azure, Google Cloud) and their AI services<\/span><\/span>
<\/span><\/span><\/li>
and regulatory standards<\/span><\/span>
<\/span><\/span><\/li>
prioritize effectively<\/span><\/span>
<\/span><\/span><\/li>
to delivering high -quality work<\/span><\/span>
<\/span><\/span><\/li>
of a team<\/span><\/span>
<\/span><\/span><\/li>
<\/span><\/span><\/li>
TensorFlow, PyTorch)<\/span><\/span>
<\/span><\/span><\/li>
<\/span><\/span><\/li>
Cloud)<\/span><\/span>
<\/span><\/span><\/li>
Prometheus, ELK Stack)<\/span><\/span>
<\/span><\/span><\/li>
(e.g., Slack, Microsoft Teams)<\/span><\/span>
<\/span><\/span><\/li><\/ul>
<\/div><\/span>
وظائف مماثلة
· <\/h3> · Job Summary<\/span><\/h3> · The AI Ops Engineer manages the · deployment, monitoring, and maintenance of AI models. This role involves · ensuring the reliability, scalability, and performance of AI systems, · collaborating with cross -functional teams to optimize AI o ...
منذ يوم
· <\/h3> · Job Summary<\/span> · <\/h3>The Principal AI Ops Engineer manages the deployment, monitoring, and maintenance of AI models and systems. This role involves ensuring the reliability, scalability, and performance of AI systems, collaborating with cross -functional teams ...
منذ يوم
· <\/h3> · Job Summary<\/span> · <\/h3>The Principal DevOps Engineer oversees the continuous integration and continuous delivery (CI/CD) pipelines and automation for AI services. This role involves designing, implementing, and maintaining CI/CD pipelines, automating processes, a ...
منذ يوم