Position: DevOps Engineer
Expertise: Python, AWS, Linux/Unix, Shell/Bash Scripting, Site Reliability Engineering
Description
ABOUT INDUSTRY:
Lendistry is the country’s largest minority-led and technology-enabled small business and commercial real estate lender with Community Development Financial Institution (CDFI) and Community Development Entity (CDE) certification. We are a national employer whose mission is to provide economic opportunities and progressive growth for small business owners and their underserved communities as a source of financing and financial education.
GENERAL RESPONSIBILITIES:
- Troubleshoot alerts and escalated issues.
- Engage in and improve services across their lifecycle, from deployment through operation and refinement.
- Maintain production environments by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through automation.
- Evolve systems by pushing for changes that improve reliability.
- Practice sustainable incident response and disaster recovery exercises.
- Collaborate and communicate in real time using Slack and MS Teams.
- Follow infrastructure as code best practices.
- Participate in an on-call rotation to troubleshoot production-impacting issues.
- Create and improve documentation and runbooks.
- Participate in blameless postmortems.
- Candidates for this role can be remote but must be based in the USA and must be available on Microsoft Teams/Slack during working hours of 9 AM to 5 PM Pacific Standard Time (PST).
PROFICIENCIES AND SKILLS:
- High sense of urgency and drive to resolve issues quickly.
- Expertise in analyzing and troubleshooting containerized workloads and applications.
- Script-first mentality for automation.
- Ability to debug, optimize code, and automate routine tasks.
- Solid knowledge of Bash, Python, shell scripting, Java, and JavaScript.
- Systematic and creative problem-solving approach, with effective communication.
- Proven track record of supporting multi-AZ, multi-region, N-tier application architectures on public cloud infrastructure.
- Understanding of Unix/Linux operating systems.
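- Understanding of the four golden signals of application monitoring (latency, traffic, errors, and saturation).
- Understanding of dashboarding methods such as USE (utilization, saturation, errors) and RED (rate, errors, duration).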
- Ability to run Docker containers on AWS ECS.
- Managing cloud-based infrastructure on AWS (preferred), Azure, or GCP.
- Advanced knowledge of infrastructure-as-code tools and best practices.
- Code repository best practices: Git, GitHub, and “Git Flow” or other workflows.
- IaaS administration (SDKs and CLI; AWS preferred).
- Building, optimizing, hardening, and troubleshooting new services, tasks, and technology from POC to production.
- Application performance monitoring (APM).
- Keeping current on SRE practices by participating in forums such as SRE Slack groups and CNCF forums.
- Experience using PostgreSQL and/or MySQL.
- Experience with Continuous Integration tools like GitHub Actions.
- Knowledge of web and application server management (Nginx, Tomcat, Node.js).
- Experience with Pulumi, Terraform, Ansible, or CloudFormation.
- Experience with AWS technologies such as EC2, ECS, S3, RDS, and CloudWatch.
EDUCATION AND EXPERIENCE:
- BS degree in Computer Science or related technical field, or equivalent practical experience.
- 5+ years of professional experience in Software Engineering, Cloud Engineering, or DevOps.
- Having read the Google Site Reliability Engineering book is a must.
- AWS and Terraform certifications are a plus.