INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 caregiver

0.66

 neurological

0.63

 minimalist

0.61

 paperwork

0.61

 therapies

0.61

 superstar

0.60

 stabbing

0.60

조

0.59

 microphone

0.59

 caregivers

0.58

POSITIVE LOGITS

prueba

0.49

Jangan

0.48

this

0.47

example

0.46

0.45

TypeError

0.44

nombre

0.42

theme

0.42

unsafe

0.42

else

0.42

Activations Density 2.992%

No Known Activations