Results-driven leader with 25+ years of experience designing and delivering high-availability monitoring solutions for large-scale enterprises. Known for replacing complex vendor tools with streamlined in-house applications that improve stability, reduce costs, and enhance user experience. Proven track record of building systems with near-100% availability. Strong advocate for simple, effective solutions over buzzword-heavy architectures.
Strategic planning
Operations management
Problem-solving
Python
AWS Services (ECS, Lambda, SNS/SQS)
Docker
TensorFlow
CI/CD
GitLab
Observability Systems