Summary
DevOps Engineer & IT Specialist with extensive experience in Kubernetes cluster management & CI/CD pipeline development. Skilled in managing production environments, implementing containerized solutions, and maintaining high-availability systems. Collaborative problem-solver experienced in incident response, cross-functional team support, and technical documentation.
Areas of Expertise
DevOps & Cloud
Kubernetes
Docker
Helm Charts
Flux
CI/CD
GitLab Pipelines
GitHub Actions
Infrastructure as Code
Ansible
Terraform
OpenTofu
Monitoring & Operations
kube-prometheus-stack
Grafana Dashboards
Incident Response
Monitoring & Alerting
Security Patching
Technical Documentation
Infrastructure & Systems
Database Management
Container Orchestration
Linux
Cloudflare
GitLab Runners
GitHub Actions Runners
Accomplishments
- Managed automated deployments for 7 APIs and applications using GitLab CI/CD pipelines and Helm charts
- Migrated production PostgreSQL databases to a custom Kubernetes Postgres Operator with an automated daily backup system
- Led migration of 7 production repositories from GitLab to GitHub, converting CI/CD pipelines to GitHub Actions
- Deployed custom website across multiple cloud providers (Azure, GCP) using Cloudflare, NGINX, GitLab CI/CD, and Helm
- Developed bash script for automated PostgreSQL database backup and restoration within a Kubernetes environment
Professional Experience
Data Center Services Engineer (DevOps Engineer)
DFINITY | Remote
September 2023 - July 2025
Single point of contact for critical DevOps tasks supporting development and production Kubernetes environments.
- Maintain development and production Kubernetes clusters for enterprise-scale applications
- Maintain bare metal and EKS Amazon Kubernetes clusters
- Configure and develop Kubernetes deployments for applications utilizing Helm Charts & Flux Deployments
- Configure and develop Kubernetes resources such Deployments/Statefulsets/Daemonsets, Services, and Ingresses
- Research and deploy Kubernetes operators that utilize custom resource definitions
- Maintain CI/CD pipelines within GitLab, ensuring all components function as intended including CI/CD Variables, GitLab Runners, Pipelines, and GitLab-ci.yaml configurations
- Maintain CI/CD within GitHub, managing GitHub Actions secrets, variables, and workflows
- Monitor Kubernetes clusters utilizing the kube-prometheus-stack
- Develop Grafana Dashboards for Monitoring
- Develop Bash scripts for automation
- Execute and update Ansible Playbooks
- Perform maintenance and update Terraform files
- Serve as First Incident Responder Team (FIRT) member in on-call rotation
- Act as the first point of contact for alerts and pages during on-call shifts
- Triage and escalate incidents as necessary to ensure minimal downtime
Data Center Technician
DFINITY | Remote
May 2022 - September 2023
Provided hardware troubleshooting, security patch management, and infrastructure maintenance for global data center operations.
- Troubleshoot and repair hardware failures in data center environments including initial problem diagnosis, response, and triage
- Maintained comprehensive infrastructure documentation and diagrams across 10+ data centers
- Interact with smart hands directly or through customers for physical troubleshooting tasks
- Arrange vendor access for hardware issues including vendor support tickets and physical access coordination for repair technicians
- Work with Infrastructure Security to apply security patches for DFINITY servers worldwide
- Deploy firmware updates to servers across global locations
- Travel to data center locations as needed to perform installations and maintenance
- Handle technical writing and documentation for PFOps team
- Participate in on-call rotation for PFOps support
- Execute repetitive tasks to optimize team efficiency including creating Cloudflare tokens, running Ansible playbooks, and managing Cloudflare rules
Lead Systems Administrator
SOFWERX | Tampa, FL
May 2021 - May 2022
Developed strategies to maintain & ensure all information technology needs with $200K budget management.
- Responsible for managing and maintaining four different physical networks
- Responsible for managing and maintaining all IT infrastructure
- Responsible for network performance & security
- Responsible for maintaining windows server hosting mission critical software
- Plans and supervises upgrades, and patches of applications and equipment
- Responsible for budget estimation and monitoring
- Tactical and strategic planning include task creation, assignment, and supervision, backlog management
- Manage and mentor systems administrator & intern
- Selects, and implements new technologies and tools
- Responsible for interviewing and hiring intern & Systems Administrator(s)
- Works extensively with vendors for A/V, Computer Hardware, Network, Tools
Systems Administrator
SOFWERX | Tampa, FL
May 2019 - May 2021
Collaborated with Lead Systems Administrator to execute company information system objectives.
- Responsible for network, audio/video, and IT support for Data Engineering Lab (DEL) hosting 100+ SOCOM personnel and contractors
- Lead implementation of new systems including Planview Suite and Envoy workplace platform
- Managed two separate Office 365 Tenets, Zoom commercial portal, and ZoomGov portal
- Provide timely reports and support requests to leadership regarding status of information system resources
- Monitor systems and infrastructure tools, including regular software and system maintenance, patches, and upgrades
- Created Python program to automate inventory management tasks with SOFWERX's Point of Sale system
- Programmed REST integration between NinjaRMM and Slack for system maintenance notifications
- Mentor interns on basic and advanced system administration tools, techniques, and practices
- Developed Computer Based Training (CBT) sessions in Trainual covering all major processes within the IT department
Service Desk Analyst
HighPoint Solutions | Tampa, FL
February 2019 - May 2019
Resolved 20+ service tickets daily for IT company specializing in healthcare and life science industries.
- Performed troubleshooting across many different devices & operating systems to include iOS, Android, Mac OS, Windows 10
- Documented each step of troubleshooting process within ServiceNow
- Added to the knowledge base when new unique solutions were discovered
- Led remote troubleshooting and supported diverse systems users, providing individualized training
- Oversaw email account maintenance and mobile device management for both iOS and Android
Education
Bachelor of Science in Information Technology
University of South Florida | Tampa, FL | 2018