FresherHire: Site Reliability Engineer Lendistry Payroll | 82 - 106 Lakh/Year INR | Fully Remote

Position :- DevOps Engineer

Expertise: - Python, AWS, Linux,Unix, Shell,Bash Scripting, Site Reliability Engineering

Description

ABOUT INDUSTRY:

Lendistry is the country’s largest minority-led and technology-enabled small business and commercial real estate lender with Community Development Financial Institution (CDFI) and Community Development Entity (CDE) certification. We are a national employer whose mission is to provide economic opportunities and progressive growth for small business owners and their underserved communities as a source of financing and financial education.

GENERAL RESPONSIBILITIES:

Troubleshooting alerts and escalated issues.
Engage in and improve services from deployment, operation, through refinement.
Maintain production environments by measuring and monitoring availability, latency, and overall system health.
Scale systems sustainably through automation.
Evolve systems by pushing for changes that improve reliability.
Practice sustainable incident response and disaster recovery exercises.
Collaborate and communicate in real-time using Slack and MS Teams.
Follow infrastructure as code best practices.
Participate in on-call rotation that will troubleshoot production impacting issues.
Create and improve documentation and runbooks.
Participate in blameless postmortems.
Candidates for this role can be remote but must be based in the USA and must be available on Microsoft Teams/Slack during working hours of 9 AM to 5 PM Pacific Standard Time (PST).

PROFICIENCIES AND SKILLS:

High sense of urgency and drive to resolve issues quickly.
Expertise in analyzing and troubleshooting containerized workloads and applications.
Script first mentality for automation.
Ability to debug, optimize code, and automate routine tasks.
Solid Bash, Python, Shell, Java and JavaScript knowledge.
Systematic and creative problem-solving approach, with effective communication.
Proven track record of supporting multi-az, multi-region, N-tier architecture applications in a public cloud-based infrastructure.
Understanding of Unix/Linux operating systems.
Understanding of application golden signal.
Understanding of dashboarding using techniques like USE and RED.
Ability to run Docker containers on AWS ECS.
Managing cloud-based infrastructure on AWS (preferred), Azure, or GCP.
Advanced knowledge of Infrastructure as code tools and best practices.
Code repository best practices; Git, GitHub, “Git Flow” or other workflows.
IaaS Administration (SDKs and cli - AWS preferred).
Building, optimizing, hardening, and troubleshooting new services, tasks, and technology from POC to production.
Application performance monitoring (APM).
Keeps current on SRE practices by participating in forums like SRE slack groups, CNCF forums, etc.
Experience using PostgreSQL and/or MySQL.
Experience with Continuous Integration tools like GitHub Actions.
Knowledge of web and application server management (Nginx, Tomcat, NodeJS).
Experience with Pulumi, Terraform, Ansible, or Cloud Formation.
Experience with AWS technologies such as EC2, ECS, S3, RDS, and CloudWatch.

EDUCATION AND EXPERIENCE:

BS degree in Computer Science or related technical field, or equivalent practical experience.
5+ years of professional experience in Software Engineering, Cloud Engineering, DevOps.
Completion of the Google Site Reliability Engineering book is a must.
AWS and Terraform Certifications are a plus.