Senior Site Reliability Engineer II
BehavioSec
About the Business:
LexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions focused on helping businesses of all sizes drive higher revenue growth, maximize operational efficiencies, and improve customer experience. Our solutions help our customers solve difficult problems in the areas of Anti-Money Laundering/Counter Terrorist Financing, Identity Authentication & Verification, Fraud and Credit Risk mitigation and Customer Data Management. You can learn more about LexisNexis Risk at the link below, https://risk.lexisnexis.com
About our Team:
The Core Engineering Team is the core driver of our enterprise level standard process creation and delivery. We are a high-impact group focused on building, automating, and maintaining the standards that ensure our products are deployed reliably, securely, and at high velocity.
We are champions of automation, Infrastructure as Code (IaC), and operational excellence. Joining us means working hands-on with modern multi cloud platforms and cutting-edge tools to enhance system reliability, visibility, and security across the entire development lifecycle.
If you are passionate about scalable systems and accelerating engineering teams, you will make a significant impact here.
About the Role:
This position individuals are responsible for challenging reliability and toil reduction projects.
Key Responsibilities:
Monitoring & Observability: Design and implement advanced monitoring queries and dashboards; establish and refine service level baselines.
Incident Response: Lead incident resolution efforts; contribute to post-mortems and root cause analyses.
Disaster Recovery: Plan and execute disaster recovery tests; ensure system resilience and failover capabilities.
Automation & Infrastructure as Code: Develop and maintain automation scripts and infrastructure modules; execute code in production environments.
Documentation & Knowledge Sharing: Maintain and contribute to SRE knowledge bases; promote best practices across teams.
Collaboration: Work closely with Development, QA, IT Operations, Product SRE, and Project Management teams to drive reliability initiatives.
Required Skills & Tools:
Programming & Scripting: Python, scripting language, Java, C#
Cloud Platforms: AWS (EC2, S3, Lambda, Glue), Azure (Functions, Logic Apps, AKS), GCP (GKE, Cloud Functions)
Infrastructure as Code: Terraform, Ansible, Chef, Puppet
Containerization & Orchestration: Docker, Kubernetes
CI/CD & Automation: Jenkins, GitHub Actions, Bitbucket, GitLab
Monitoring & Observability: Prometheus, Grafana, DataDog, Dynatrace, Splunk, SignalFx
Networking & Security:AWS: VPCs, IAM, Transit Gateway, CloudWAN, route53, AWS KMS, RDS Azure: Application Gateway, VNET, Express route, private link, Azure firewall, MS Sentinel, Azure Entra ID, RBAC
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-855-833-5120.
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.
Please read our Candidate Privacy Policy.
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers: