Unlocking LLM Potential: From Federated Learning to Introspection


Mar 07, 12:00 PM PST
  • Virtual SF Big Analytics
  • 147 RSVP

This virtual AI seminar is hosted by SF Big Analytics.

Tech Talk: Empowering Federated Learning for Massive Models with NVIDIA FLARE
Speaker: Holger Roth (NVIDIA, LinkedIn)
Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, the data that model performance depends on cannot always be centralized, due to factors such as privacy, regulation, geopolitics, copyright issues, and the sheer effort required to move vast datasets. In this talk, we explore how federated learning enabled by NVIDIA FLARE can address these challenges with easy and scalable integration capabilities, enabling parameter-efficient and full supervised fine-tuning of LLMs for natural language processing and biopharmaceutical applications to enhance their accuracy and robustness (for details, see our paper).

Tech Talk: Metacognition is all you need? - Using Introspection in Generative Agents to Improve Goal-directed Behavior
Speaker: Jason Toy (CrewSnap, LinkedIn)
Abstract: A review of the paper "Metacognition is all you need?": Recent advances in Large Language Models (LLMs) have shown impressive capabilities in various applications, yet LLMs face challenges such as limited context windows and difficulties in generalization. In this paper, we introduce a metacognition module for generative agents, enabling them to observe their own thought processes and actions. This metacognitive approach, designed to emulate System 1 and System 2 cognitive processes, allows agents to significantly enhance their performance by modifying their strategy. We tested the metacognition module on a variety of scenarios, including a situation where generative agents must survive a zombie apocalypse, and observed that our system outperforms others, while agents adapt and improve their strategies to complete tasks over time.

Holger Roth (NVIDIA), Jason Toy (CrewSnap)

Holger Roth
Holger Roth, a Principal Federated Learning Scientist at NVIDIA, specializes in developing distributed and collaborative software and models for various industries using federated learning and analytics.
Jason Toy
Jason Toy is part of an AI research group building the best open-source agent simulation framework. He has built and run multiple engineering teams.
The event ended.
Watch Recording
*Recordings are hosted on YouTube; clicking the link opens the YouTube page.