Joshua Nave
Principal Solutions Engineer
20+ years across infrastructure, automation, and application development. I build what's needed.
Contact Me Download PDFExperience
Principal Solutions Engineer
@ Bell Integration
Dec 2024 - Present
- Technical lead for greenfield HPC datacenter in UAE: 1,250 nodes x 8 AMD MI210 GPUs (10,000+ GPUs). Owned the entire implementation lifecycle from rack-and-stack through production handoff.
- Diagnosed and resolved architectural gaps in IBM CP4AIOps deployment. Rebuilt operator configurations and integrated with Argo CD for GitOps management.
- Architected repeatable HPC deployment framework using Ansible and Terraform, now the standard for international datacenter builds across multiple client engagements.
- Primary technical escalation point for vendor misconfigurations across networking, storage, and virtualization stacks.
- Technical screener for all technical roles company-wide. Evaluate candidates on architecture, problem-solving, and hands-on knowledge.
- Develop AI/ML proof-of-concepts for clients: RAG pipelines, multi-agent frameworks, LLM infrastructure. Handle full stack from model fine-tuning and pre-training through deployment infrastructure.
- Architecting dedicated AI PoC lab: 16x H100 GPUs, 16x L40 GPUs, high-memory compute cluster, Cisco/Arista networking, NetApp storage.
Senior Automation Engineer
@ Soundhound
Oct 2018 - Dec 2024
- Sole owner of automation for 4,000+ production hosts. Built and maintained the tooling that kept the fleet patched and consistent.
- Reverse-engineered undocumented legacy systems to enable new deployment patterns. Read source code when documentation was unavailable.
- Built custom APIs and integration layers for services lacking native interfaces.
- Reduced deployment times from hours to minutes by eliminating manual processes and human error vectors.
- Integrated disparate monitoring and data sources into unified MLops pipelines for predictive analysis.
Senior Consultant
@ CPrime
Dec 2021 - Nov 2024
- Technical consultant for client projects. Handled infrastructure, applications, networking - whatever was broken. Cut deployment times by 50%+ with Terraform.
- Took a monolithic application and broke it into containerized microservices, reducing error rates by 90%.
- Built Ansible toolkits that clients still use for day-to-day operations after engagement ended.
- Principal architect for complex cross-functional infrastructure problems.
Senior Consultant
@ Contegix
Dec 2015 - Oct 2018
- Full-stack technical support for high-profile clients. Infrastructure, applications, networking, firewalls.
- Managed VMware ESXi and AWS environments. Administered large-scale JunOS and Cisco deployments.
- Owned firewall architecture: iptables, pfSense, Shorewall.
Senior Systems Support Engineer
@ AT&T
Jun 2006 - Oct 2018
- Maintained 99.99% uptime for 7,000+ server VoIP platform. On-call for critical incident response.
- Built automation that provisioned systems across four departments, replacing manual processes.
- Owned change management for 1,500+ VoIP application servers. Patched Oracle clusters including Data Guard configs.
- Created database-driven web apps that eliminated 30%+ of manual workload by replacing inadequate internal tooling.
Skills
Languages
Python, Go, Bash, Perl, Ruby, Java
Infrastructure as Code
Terraform, Ansible, Puppet, Salt
Containers & Orchestration
Docker, Kubernetes, LXC, Argo CD, Harbor
Cloud & Virtualization
AWS, GCP, DigitalOcean, Proxmox, VMware ESXi
Networking
Cisco, Juniper, Arista, Nebula, WireGuard, pfSense
Identity & Auth
FreeIPA, Keycloak, Entra ID, OIDC, LDAP, SAML
Monitoring & Logging
Prometheus, Grafana, Loki, Zabbix, ELK Stack, Splunk
Databases
PostgreSQL, MySQL, Oracle, MongoDB, CouchDB, pgvector
HPC & AI/ML
NVIDIA CUDA, AMD ROCm, vLLM, Ollama, RAG pipelines
CI/CD
GitHub Actions, GitLab CI, Jenkins, Argo CD
Contact
(314) 570-7102
Warrenton, MO