hero

Find jobs at MIT startups!

Please email orbit-jobs@mit.edu to connect with the company's MIT founder. To post a job, please email: orbit-jobs-add@mit.edu.
MIT
MIT
94
companies
209
Jobs

Platform Engineer

Pison

Pison

Software Engineering
Boston, MA, USA
Posted on Friday, July 19, 2024
Pison is seeking a talented and motivated Site Reliability Engineer (SRE)/DevOps professional to join our dynamic team. The ideal candidate will play a critical role in ensuring the reliability, scalability, and performance of our groundbreaking neural interface platform. You will collaborate with cross-functional teams to implement best practices in system design, automation, and continuous integration and delivery (CI/CD).

Duties/Responsibilities:

  • Design, implement, and maintain scalable and reliable infrastructure solutions.
  • Develop and maintain CI/CD pipelines to automate deployments and streamline development workflows.
  • Monitor system performance, identify bottlenecks, and implement solutions to optimize system performance.
  • Ensure the security and integrity of our systems by implementing best practices in access control, data protection, and incident response.
  • Collaborate with software development teams to ensure systems are designed with reliability and scalability in mind.
  • Perform capacity planning and demand forecasting to meet future infrastructure needs.
  • Troubleshoot and resolve system issues, providing timely and effective support.
  • Develop and maintain documentation related to system architecture, processes, and procedures.
  • Participate in on-call rotations to provide 24/7 support for critical systems.

Skills/Abilities:

  • Strong proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
  • Experience with containerization technologies like Docker and orchestration tools like Kubernetes.
  • Proficiency in scripting and automation using languages such as Python, Bash, or similar.
  • Hands-on experience with CI/CD tools like Jenkins, GitLab CI, or CircleCI.
  • Strong understanding of networking concepts, including TCP/IP, DNS, and load balancing.
  • Knowledge of infrastructure as code (IaC) tools such as Terraform, Ansible, or CloudFormation.
  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or similar.
  • Strong problem-solving skills and the ability to troubleshoot complex issues.
  • Excellent communication and collaboration skills, with the ability to work effectively in a team environment.

Education/Experience:

  • Bachelor's degree in Computer Science, Information Technology, or a related field; or equivalent practical experience.
  • 3+ years of experience in a Site Reliability Engineer, DevOps, or similar role.
  • Proven experience managing and maintaining production systems in a high-availability environment.
  • Experience with neural interfaces, IoT, or similar technologies is a plus.
  • Certifications in cloud platforms (AWS, Azure, Google Cloud) or relevant technologies are a plus.