Joshua Nave

Principal Solutions Engineer

20+ years across infrastructure, automation, and application development. I build what's needed.


Experience

Principal Solutions Engineer @ Bell Integration
Dec 2024 - Present
  • Technical lead for greenfield HPC datacenter in UAE: 1,250 nodes x 8 AMD MI210 GPUs (10,000 GPUs). Owned the entire implementation lifecycle from rack-and-stack through production handoff.
  • Diagnosed and resolved architectural gaps in an IBM CP4AIOps deployment. Rebuilt operator configurations and integrated with Argo CD for GitOps management.
  • Architected repeatable HPC deployment framework using Ansible and Terraform, now the standard for international datacenter builds across multiple client engagements.
  • Primary technical escalation point for vendor misconfigurations across networking, storage, and virtualization stacks.
  • Technical screener for all technical roles company-wide. Evaluate candidates on architecture, problem-solving, and hands-on knowledge.
  • Develop AI/ML proof-of-concepts for clients: RAG pipelines, multi-agent frameworks, LLM infrastructure. Handle the full stack from model pre-training and fine-tuning through deployment infrastructure.
  • Architecting dedicated AI PoC lab: 16x H100 GPUs, 16x L40 GPUs, high-memory compute cluster, Cisco/Arista networking, NetApp storage.
Senior Automation Engineer @ SoundHound
Oct 2018 - Dec 2024
  • Sole owner of automation for 4,000+ production hosts. Built and maintained the tooling that kept the fleet patched and consistent.
  • Reverse-engineered undocumented legacy systems to enable new deployment patterns. Read source code when documentation was unavailable.
  • Built custom APIs and integration layers for services lacking native interfaces.
  • Reduced deployment times from hours to minutes by eliminating manual processes and human error vectors.
  • Integrated disparate monitoring and data sources into unified MLOps pipelines for predictive analysis.
Senior Consultant @ CPrime
Dec 2021 - Nov 2024
  • Technical consultant for client projects. Handled infrastructure, applications, and networking: whatever was broken. Cut deployment times by 50%+ with Terraform.
  • Took a monolithic application and broke it into containerized microservices, reducing error rates by 90%.
  • Built Ansible toolkits that clients still use for day-to-day operations after the engagement ended.
  • Principal architect for complex cross-functional infrastructure problems.
Senior Consultant @ Contegix
Dec 2015 - Oct 2018
  • Full-stack technical support for high-profile clients. Infrastructure, applications, networking, firewalls.
  • Managed VMware ESXi and AWS environments. Administered large-scale JunOS and Cisco deployments.
  • Owned firewall architecture: iptables, pfSense, Shorewall.
Senior Systems Support Engineer @ AT&T
Jun 2006 - Oct 2018
  • Maintained 99.99% uptime for a VoIP platform of 7,000+ servers. On-call for critical incident response.
  • Built automation that provisioned systems across four departments, replacing manual processes.
  • Owned change management for 1,500+ VoIP application servers. Patched Oracle clusters including Data Guard configs.
  • Created database-driven web apps that eliminated 30%+ of manual workload by replacing inadequate internal tooling.

Skills

Languages

Python, Go, Bash, Perl, Ruby, Java

Infrastructure as Code

Terraform, Ansible, Puppet, Salt

Containers & Orchestration

Docker, Kubernetes, LXC, Argo CD, Harbor

Cloud & Virtualization

AWS, GCP, DigitalOcean, Proxmox, VMware ESXi

Networking

Cisco, Juniper, Arista, Nebula, WireGuard, pfSense

Identity & Auth

FreeIPA, Keycloak, Entra ID, OIDC, LDAP, SAML

Monitoring & Logging

Prometheus, Grafana, Loki, Zabbix, ELK Stack, Splunk

Databases

PostgreSQL, MySQL, Oracle, MongoDB, CouchDB, pgvector

HPC & AI/ML

NVIDIA CUDA, AMD ROCm, vLLM, Ollama, RAG pipelines

CI/CD

GitHub Actions, GitLab CI, Jenkins, Argo CD

Contact

(314) 570-7102

Warrenton, MO