Héctor Hernández Cruz
Senior Full-Stack Engineer · TypeScript · React · Go · Node.js
I build production AI systems and full-stack applications. React/TypeScript on the frontend, Node.js and Go on the backend, AWS for infrastructure, and LangChain for the RAG pipelines I work with in clinical AI. 9+ years shipping in globally distributed, English-first teams.
Professional Journey
Curie Health
Senior Full-Stack Engineer
Building clinical AI features end to end in a HIPAA-compliant production environment. RAG pipelines, structured LLM outputs, and the Go and Node.js services that power them.
- Build RAG pipelines with LangChain and PostgreSQL vector search (pgvector) for clinical AI
- Develop Go microservices for real-time clinical data ingestion via gRPC
- Optimized React/TypeScript/GraphQL frontends and cut average API response time by 42% with query optimization and Redis caching
- Apply prompt engineering and structured output validation in HIPAA-compliant production environments
- Integrate AWS Bedrock and GCP Vertex AI for model serving across clinical AI services
- Deploy and maintain services on AWS with Kubernetes and GitHub Actions CI/CD
Ashby
Software Engineer
Shipped React/TypeScript/GraphQL features and optimized Node.js APIs for a recruiting platform used by high-growth companies.
- Shipped React, TypeScript, and GraphQL frontend features across multiple product areas
- Built and optimized Node.js APIs, improving data-fetching and reducing response time by 42%
- Worked with distributed systems handling large data volumes via PostgreSQL and Redis caching
- Contributed to Go backend services for high-performance concurrent workloads
- Deployed services on AWS with Kubernetes and GitHub Actions CI/CD
Bazaarvoice
Software Engineer
Built and maintained e-commerce review and ratings features, optimized data pipelines, and migrated a large JavaScript frontend to TypeScript.
- Developed React and JavaScript storefront components for e-commerce ratings and review features
- Built Node.js REST APIs integrating third-party review aggregation services, reducing data sync latency by 11%
- Optimized PostgreSQL queries for product analytics and cut report generation from 8s to 1.2s
- Migrated frontend modules from JavaScript to TypeScript, improving reliability and reducing runtime errors
- Contributed to AWS infrastructure (EC2, S3, RDS) and handled production incidents on-call
Self-Employed
Freelance Software Engineer
Delivered full-stack web applications for small businesses and early-stage startups across the US.
- Built e-commerce storefronts, marketing sites, and REST APIs for retail and hospitality clients
- Collaborated directly with non-technical stakeholders to define requirements and iterate on features
- Worked across the full stack with JavaScript, React, and Node.js
Tech Arsenal
Featured Projects

ARIA — AI Interview Assistant
A native Windows SaaS platform that listens to both sides of a live interview and surfaces personalized AI suggestions in under one second. Built on premium WASAPI audio capture, a private desktop overlay invisible to screen share, and state-of-the-art LLMs grounded in the user's resume and job description. Ships STAR-method behavioral coaching, live coding analysis, and per-session transcription. Supports Free, Pro, and Elite tiers with multi-user simultaneous access and crypto payments.

rag-pipeline
A production-oriented RAG pipeline built around patterns from my clinical AI work. Hybrid retrieval combines dense vectors and keyword search fused via Reciprocal Rank Fusion, with cross-encoder reranking and graceful fallback. Answers are grounded with enforced citations. An evaluation harness tracks hit rate, MRR, and faithfulness so changes are measurable. Redis caches embeddings so re-ingestion is nearly free.

graphql-api
A production-shaped GraphQL API focused on the two things that actually move latency: N+1 elimination with DataLoader and Redis response caching with tag-based invalidation. Comes with a reproducible benchmark that counts DB round-trips before and after batching. 101 queries collapse to 2. Per-request loader isolation keeps data from leaking across users.

go-microservice
A gRPC microservice for real-time clinical data ingestion, built around patterns I use in production. Bounded worker pool with backpressure, context propagation that respects client cancellation, and graceful shutdown that drains in-flight work before exiting. Domain logic is isolated from the transport layer for unit-testable business rules. Concurrency verified under the Go race detector.

vitals-dashboard
A real-time clinical vitals monitoring dashboard, the frontend to the Go ingestion service above. Memoized rendering under per-second updates so unchanged patient cards skip re-rendering entirely. SVG sparklines are hand-rolled with no charting dependency. The color palette is calm and dark, with vivid color reserved strictly for alerts. Deployed and live on Vercel.
Let's Build Something Amazing
Ready to bring your ideas to life? I’m always open to exciting projects, collaborations, or fresh challenges that push creative and technical boundaries.