Skip to content

Machine Learning Engineer

  • Full Time
  • Toronto

DeepRec.ai

Job Title: MLOps Engineer

Work Arrangement: Remote

Location: Toronto, Canada


Salary: Up-to $125,000 CAD

MLOps Engineer – Real-Time AI Systems


We’re looking for an experienced MLOps Engineer to help deploy and scale cutting-edge ML models for real-time video and audio applications.

You’ll work alongside data scientists and engineers to build fast, reliable, and automated ML infrastructure.

Key Responsibilities


  • Build and manage ML pipelines for training, validation, and inference.
  • Automate deployment of deep learning and generative AI models.
  • Ensure model versioning, rollback, and reproducibility.
  • Deploy models on AWS, GCP, or Azure using Docker and Kubernetes.
  • Optimize real-time inference using TensorRT, ONNX Runtime, or PyTorch.
  • Use GPUs, distributed systems, and parallel computing for performance.
  • Create CI/CD workflows (GitHub Actions, Jenkins, ArgoCD) for ML.
  • Automate model retraining, validation, and monitoring.
  • Address data drift, latency, and compliance concerns.


What You Bring

  • 3+ years in MLOps, DevOps, or model deployment roles.
  • Strong Python and experience with ML frameworks (PyTorch, TensorFlow, ONNX).
  • Proficiency with cloud platforms, Docker, and Kubernetes.
  • Experience with ML tools like MLflow, Airflow, Kubeflow, or Argo.
  • Knowledge of GPU acceleration (CUDA, TensorRT, DeepStream).
  • Understanding of scalable, low-latency ML infrastructure.


Nice to Have

  • Experience with Ray, Spark, or edge AI tools (Triton, TFLite, CoreML).
  • Basic networking knowledge or CUDA programming skills.

To apply, please visit the following URL:

THISJOB.CA