Our valued client is seeking a Lead High Performance Computing (HPC) Architect to support the development of new high-performance computing (HPC) and Artificial Intelligence (AI) solutions!
Open to all of Canada. Work is to be performed remotely with semi-regular travel required throughout the year in various cities across the country (at least once a quarter).
As the successful candidate you will provide architectural leadership & best practices for new solutions with a strong focus on HPC, supercomputing, HTC (High through-put computing), and associated technologies such as Quantum and AI computing. This is an exciting opportunity to serve as a thought leader on an ambitious team aiming to implement some of Canada's first quantum computers!
Responsibilities:
Lead working groups and committees in HPC, HTC, AI, and other technology streams. Keep up with HPC/supercomputing emerging trends and market insights, both in academia and in industry. Lead experimental and proof-of-concept projects to test feasibility and value of initiatives. Coach, mentor and guide staff and community members on matters related to HPC architecture and architecture documentation. Work with the vendor and stakeholder community to understand the latest HPC developments, and how they might be incorporated into enterprise services and offerings. Analyze requirements for new advanced research computing solutions from diverse stakeholder groups, and transform those into scalable, flexible, and resilient technical architectures. Perform architecture options and feasibility analysis, proactively debate alternatives with subject matter experts, and build consensus on recommended architecture within the SME community. Communicate technical information to both technical and non-technical staff and stakeholders and participate in enterprise training initiatives. Participate in a range of national and international committees and working groups, and occasional speaking engagements to provide architectural and technical expertise.
Must Have Skills:
10+ years experience working with complex
High Performance Computing (HPC)and
High Thorough-put Computing (HTC), with a strong knowledge of similar infrastructure Experience researching and evaluating new technology and solutions within the realm of high performance computing,
supercomputing,
quantum computing,
Artificial Intelligence (AI) computing, storage systems, high-performance file systems, parallel workflows, networking at scale, and edge computing. Advanced knowledge of HPC middleware stacks including cluster management tools, job schedulers, and resources managers; such as HTCondor, Maui, Onesis, Slurm, PBS (or derivatives), OpenHPC, Rocks, etc. Demonstrated experience working with enterprise architecture frameworks and methodologies, such as TOGAF or ITIL. Demonstrated experience with the research, design, modification, implementation, and deployment of HPC applications and tools
Nice-to-have Skills:
Experience in virtualization, containerization, and public and private cloud technologies and associated management and orchestration tools. Cisco, Cray, Dell, HPE, or IBM training. Exposure to Quantum and AI technologies and workloads. TOGAF, ITIL, or other industry certifications. Bilingualism in English & French