Responsible LLM Deployment in Practice at Brook Health

As healthcare organizations embrace large language models (LLMs), the need for robust governance and systematic evaluation becomes mission critical. This session describes Brook AI’s journey to becoming a world-class AI governance organization – highlighting the processes, architectures, and tools that made it possible.

Brook AI delivers remote patient care, blending clinical teams with AI for continuous, always-on patient support. To evolve from early trials to an enterprise-ready foundation for generative AI deployment, Brook partnered with Pacific AI to implement a comprehensive governance framework aligned with NIST AI RMF, ISO 42001, CHAI, state laws, and federal regulations. This included policy rollouts, stakeholder training, automated governance tooling, and a full architecture and security review of Brook’s multi-agent LLM-based platform.

Brook also embedded LLM-based agent testing directly into its development lifecycle and CI/CD pipelines. Together, the teams created a rigorous benchmark suite covering key aspects of responsible AI in healthcare: fairness, accountability, transparency, explainability, robustness, safety, privacy, and accessibility. The suite was reviewed by healthcare professionals and applied to real-world scenarios mirroring clinician–patient conversations. It also enables comparative evaluations of Brook’s domain-specific agentic AI configurations against general-purpose LLMs such as ChatGPT, and incorporates automated daily evaluations with LLM-as-a-judge plus continuous monitoring for drift and safety incidents in production.

Attendees will leave with a clear blueprint for integrating AI governance into clinical GenAI applications – demonstrating that safety, compliance, and innovation can co-exist at scale.

About the speakers
Tal Amitay
VP of Engineering at Brook Health

Tal Amitay has over 20 years of work experience, starting with their role as a Major in the reserve forces in the IDF from 1996 to 2000. Tal then worked as a Software Engineer at WaveIP from 2004 to 2006, where they led a strategic project integrating WiMAX into an internet and VoIP solution. In 2006, Tal held the position of Software Engineer at Commatch (Closed) and 888.com – Random Logic Ltd.

From 2008 to 2012, they served as an Engineering Team Leader and Software Engineer at Traffix Systems. At F5, Tal worked from 2012 to 2021, holding various roles such as Senior Engineering Manager, Director Of Engineering, and Senior Director Of Engineering. Most recently, they served as an Engineering leader, Security at Stripe from 2021 to 2022. Tal’s current position is the VP of Engineering at Brook Health, a role they assumed in March 2023.

Julio Bonis
Principal Data Scientist at John Snow Labs

Julio Bonis is a data scientist working on NLP & LLM for Healthcare at John Snow Labs. Julio has broad experience in software development and design of complex data products within the scope of Real World Evidence (RWE) and Natural Language Processing (NLP).

He also has substantial clinical and management experience – including entrepreneurship and Medical Affairs. Julio is a medical doctor specialized in Family Medicine (registered GP), has an Executive MBA – IESE, an MSc in Bioinformatics, and an MSc in Epidemiology.

Building Trust in Healthcare AI: CHAI Certification and Pacific AI’s End-to-End Governance Platform

Trust, transparency, and accountability are now essential requirements for deploying AI in healthcare. To meet this need, the Coalition for Health AI (CHAI) has launched a certification program for organizations...