Position: DevOps | Cloud Architect
Company: Rogue Waves AI
Location: Florence/Turin/Remote
Company Description
Rogue Waves AI is a rapidly growing startup operating in the field of artificial intelligence, developing innovative solutions in areas such as Generative AI, Predictive Analysis, Data Science, Natural Language Processing (NLP), Computer Vision, Cloud Computing, and Optimization. The company is dedicated to integrating the latest AI technologies into business and industrial processes. We are looking for highly skilled professionals to help scale our projects and expand our AI infrastructure.
Role Description
We are seeking an ML Ops Engineer with 3 years of experience to optimize and automate the lifecycle of machine learning models. You will be responsible for the infrastructure that enables the deployment, monitoring, and scalability of AI models, working closely with data science, software development, and DevOps teams to build efficient and reliable solutions. Additionally, you will develop software to integrate models into automated pipelines or expose them via APIs.
Key Responsibilities
- Manage the lifecycle of machine learning models, from training to deployment, including production monitoring.
- Collaborate with data scientists to ensure models are scalable and operational in production environments.
- Implement APIs, processes, and scripts to create interfaces for ML/AI models, ensuring stability, security, and high performance.
- Develop continuous integration and continuous deployment (CI/CD) pipelines for ML models, ensuring rapid development and continuous release.
- Monitor AI model performance in production, implementing observability tools to detect drift or performance issues.
- Optimize cloud infrastructure (AWS, GCP, Azure) to ensure efficiency in terms of costs and resources.
- Automate data management, training, validation, and model deployment processes to reduce time-to-production.
- Enhance security and scalability practices within the ML infrastructure.
Requirements
- Bachelor’s degree in Computer Science, Engineering, Mathematics, or related fields.
- At least 2-3 years of experience in ML Ops, DevOps, or similar roles.
- Familiarity with cloud services and containerization (AWS, GCP, Docker, Kubernetes).
- Strong proficiency in Python, Git, Bash, and Linux systems.
- Experience with machine learning models and frameworks such as TensorFlow, PyTorch, or scikit-learn.
- Solid knowledge of databases (SQL, NoSQL) and strong skills in building ETL pipelines to retrieve data from external sources.
- Experience with CI/CD tools (e.g., Jenkins or GitHub/Bitbucket/GitLab pipelines).
- Ability to monitor model performance and manage machine learning model drift.
- Strong problem-solving skills and excellent teamwork attitude.
- Good command of the English language.
Nice to Have
- Experience with ML lifecycle automation tools such as MLflow, Kubeflow, or Airflow.
- Experience in large-scale AI applications and performance optimization in production environments.
- Experience designing and developing serverless ML systems.
- Familiarity with Infrastructure-as-Code (IaC) frameworks (Terraform, Ansible).
What We Offer
- Subscription to a coworking space in the heart of Florence or Turin. Full remote work is also possible in specific cases.
- Competitive salary (RAL) based on experience and skills.
- Daily meal vouchers.
- Hybrid work model: remote work options available after the initial in-office period.
- Opportunity to work on cutting-edge AI projects and contribute to significant innovations.
- A professional and personal growth environment with continuous training and career advancement opportunities.
How to Apply
Send your CV and a brief cover letter to recruiting@roguewavesai.com with the subject line “ML Ops Engineer Application – Rogue Waves AI”.