Benchmarking Domain Intelligence | Data Brew | Episode 45

Falha ao colocar no Carrinho.

Tente novamente mais tarde

Falha ao adicionar à Lista de Desejos.

Tente novamente mais tarde

Falha ao remover da Lista de Desejos

Tente novamente mais tarde

Falha ao adicionar à Biblioteca

Tente outra vez

Falha ao seguir podcast

Tente outra vez

Falha ao parar de seguir podcast

Tente outra vez

Benchmarking Domain Intelligence | Data Brew | Episode 45

Ouça grátis

Ver detalhes do programa

Sobre este título

In this episode, Pallavi Koppol, Research Scientist at Databricks, explores the importance of domain-specific intelligence in large language models (LLMs). She discusses how enterprises need models tailored to their unique jargon, data, and tasks rather than relying solely on general benchmarks.

Highlights include:
- Why benchmarking LLMs for domain-specific tasks is critical for enterprise AI.
- An introduction to the Databricks Intelligence Benchmarking Suite (DIBS).
- Evaluating models on real-world applications like RAG, text-to-JSON, and function calling.
- The evolving landscape of open-source vs. closed-source LLMs.
- How industry and academia can collaborate to improve AI benchmarking.

Ainda não há avaliações