About the role
AspenView Technology Partners is working with a global leader in advanced server, storage, and networking solutions to find an exceptional Sr. AI Systems Engineer. This is a rare opportunity for a high-output engineer who doesn't just write code they ship entire systems.
The team has already built a production AI platform covering 38,000+ documents, Agentic RAG, hybrid search, custom embedding and reranking services, and multi-database integration — accomplished by a single engineer in under a month. They're looking for another engineer with that same mindset and velocity.
What you will do:
- Design and ship AI-powered systems end-to-end: RAG pipelines, agentic workflows, intelligent chatbots, product recommendation engines, automated RFQ assistants, and sales enablement tools
- Build and maintain hybrid search infrastructure combining vector databases (Qdrant, Chroma, Milvus) with keyword search (Elasticsearch/BM25), custom embedding services, and reranking pipelines
- Deploy and optimize LLM inference services using vLLM, SGLang, or equivalent frameworks on GPU clusters (H100, H200, GH200)
- Build document processing pipelines for large-scale ingestion of unstructured data — PDFs, Office docs, web content — covering extraction, chunking, contextual retrieval, and metadata enrichment
- Integrate AI systems with enterprise data sources including PostgreSQL, MSSQL, SharePoint, and SAP; expose capabilities through RESTful APIs (FastAPI)
- Use AI-assisted development tools (Cursor, Claude Code, etc.) as core productivity multipliers — architect the 20% and let AI write the 80%
- Identify the next high-impact problem across sales, engineering, customer support, and operations; prototype fast, ship to production, move on
What you bring:
- 8+ years of software engineering experience; Bachelor's in CS, EE, or related field (Master's is a plus)
- Demonstrated AI-assisted development proficiency — projects where you used Cursor, Claude Code, or similar tools to 10x your output (hard requirement, not a nice-to-have)
- Hands-on experience building RAG systems: vector databases, embedding models, reranking pipelines, and hybrid search architectures
- Strong Python proficiency for backend development, LLM orchestration, and automation; FastAPI experience a plus
- Experience with LLM deployment and inference optimization (vLLM, SGLang, or equivalent); comfortable with Linux, Docker, and GPU infrastructure/CUDA basics
- Self-directed and comfortable with ambiguity — "make our sales team more productive with AI" is enough to get started
Nice if you have:
- Web scraping and data ingestion pipelines (Playwright, Selenium, Crawl4AI)
- Agent frameworks (LangChain, LangGraph) and document processing tools (PDF extraction, OCR)
- Enterprise data sources (SharePoint, MSSQL, SAP) and database design (PostgreSQL, MySQL, MongoDB)
- LLM fine-tuning (LoRA, PEFT, Hugging Face) and frontend basics (React, Vue)
Equal Opportunity Employer:
AspenView is proud to be an equal opportunity employer. We believe in creating an environment where all employees feel welcome, valued, and empowered to succeed. We celebrate diversity and strive to build a culture of inclusion where all individuals, regardless of their race, color, gender, gender identity or expression, sexual orientation, disability, age, or any other characteristic, can thrive. We encourage applicants from all walks of life to join our team and make a lasting impact.