INDEX

Explanations

focuses on specific areas

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

женные

0.44

 зві

0.44

 журнала

0.43

 тільки

0.43

केवल

0.41

attice

0.41

қан

0.41

 Amitabh

0.41

 Прав

0.40

த்தனர்

0.40

POSITIVE LOGITS

 greater

0.49

 than

0.48

 overall

0.48

 planning

0.44

 prevention

0.43

ser

0.42

 resulting

0.42

 grievance

0.42

 much

0.41

 incredible

0.41

Activations Density 0.010%