TITLE: AI Software Engineer
POSITION OVERVIEW: AleBex is a Vancouver-based AI startup building its own end-to-end real-time voice engine. We are looking for an AI Software Engineer to help turn speech recognition, speech synthesis, small-model intelligence, and asynchronous backend systems into a reliable product that can run in real business environments. This is not a role for someone who only calls APIs or only looks at training metrics. We are looking for someone who understands both models and engineering systems, and who can connect model training, inference optimization, service deployment, and user experience.
WHY JOIN US:
- Groundbreaking Projects – Work on AI, robotics, and industry-disrupting software.
- Innovation Culture – Be part of a team that values experimentation, creativity, and rapid delivery.
- Career Growth – Opportunities to lead projects and shape the future of our AI-augmented development strategy.
- Central Location – Based in downtown Vancouver at Alexander Innovation Centre (570 Dunsmuir Street).
- Facilities – Brand new campus and office space that emphasizes a culture of environmentally friendly practices. Secure bike lockers and shower.
ABOUT ALEBEX: ALEBEX builds an AI virtual assistant that helps organizations qualify and convert leads. When a lead is generated, the ALEBEX assistant follows up through phone calls, SMS, and email, engages prospects 24/7, qualifies intent, and escalates serious opportunities to a human team member.
EMPLOYMENT TYPE: Full-Time, Permanent
WORK WEEK: The Employee will work during regular business hours, Monday to Friday for 40 hours per week (not including a 30-minute unpaid meal break per day). Flexibility will be required. Occasional evening and weekend shifts may be required.
LOCATION: Downtown Vancouver, BC (Alexander Innovation Centre)
EXPERIENCE REQUIREMENTS:
- A background in Computer Science, Mathematics, Statistics, Artificial Intelligence, or a related field. Both undergraduate and graduate candidates are welcome.
- Strong software engineering fundamentals, with practical understanding of development workflows, computer systems, network services, asynchronous programming, and production deployment.
- Experience with Python backend development. You should be comfortable building reliable service APIs with frameworks such as FastAPI or Django, and understand API design, logging, testing, and production debugging.
- Solid machine learning and deep learning foundations, including linear algebra, calculus, loss design, batch training, efficient fine-tuning, offline training, and online inference.
- Hands-on ML development experience with PyTorch, TensorFlow, and Hugging Face. Experience with inference and deployment tools such as vLLM or ONNX is a strong plus, especially if you have worked on model loading, acceleration, quantization, or model serving.
- Familiarity with cloud and deployment environments such as AWS, Azure, or GCP. Experience with Docker, Linux, CI/CD, or GPU inference services is a plus.
- Basic testing discipline, including the ability to write and maintain unit tests, integration tests, and regression tests for long-term system quality.
- Most importantly, we value engineers who genuinely want to build a good product: people who communicate actively when requirements are unclear, propose practical solutions when metrics are hard to define, and write software with long-term maintainability and scalability in mind.
- Experience communicating complex technical concepts clearly to both technical and non-technical stakeholders.
- Experience working effectively within agile teams, contributing to shared codebases, peer reviews, and collaborative problem-solving initiatives.
SPECIFIC RESPONSIBILITIES:
- Build, optimize, and deploy the AleBex real-time voice engine, with a focus on low latency, stability, and natural conversational experience.
- Work on STT, including model training, inference acceleration, streaming transcription evaluation, and production performance monitoring.
- Work on TTS, including voice naturalness, time to first audio, streaming output quality, and online quality evaluation.
- Improve AleBex self-developed SLM modules, such as Transformer Encoder models similar to BERT and Decoder models similar to GPT, for intent understanding, conversation flow, latency masking, and user experience optimization.
- Contribute to the design and maintenance of a Python-based distributed asynchronous engine, including concurrent task scheduling, audio stream processing, model service calls, state management, and fault recovery.
- Design evaluation methods that fit real product scenarios. We care not only about loss or benchmarks, but also about user experience, reliability, latency, and deployment cost.
- Documented workflows, model behavior, and deployment processes to support team knowledge sharing and operational continuity.
- Developed clear technical documentation, including API specifications, testing procedures, and deployment guides.
- Created concise reports and summaries to communicate model performance, system reliability, and optimization results.
HOW TO APPLY:
To apply, please e-mail your resume and cover letter to [email protected], using the subject line “[Your full-name]_AI Engineer 2026.” If you have relevant event/trade show experience, please include examples such as events attended, outcomes achieved, or pipeline generated.
Only shortlisted applicants will be contacted. No phone calls please.
All qualified candidates are encouraged to apply; however Canadians and permanent residents will be given priority. Thank you!
Job Types: Full-time, Permanent
Pay: $100,000.00 per year
Benefits:
- Company events
- Dental care
- Extended health care
- On-site parking
- Paid time off
- Vision care
Ability to commute/relocate:
- Vancouver, BC: reliably commute or plan to relocate before starting work (required)
Education:
- Bachelor's Degree (preferred)
Experience:
- software engineer: 2 years (required)
Language:
Work Location: In person