INDEX

Explanations

interests, needs, and welfare

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Schlüssel

0.41

Reception

0.40

 isot

0.38

 luckily

0.38

 reception

0.37

 associated

0.37

 receptions

0.36

 Simply

0.36

[[

0.36

ptus

0.36

POSITIVE LOGITS

 kepentingan

0.97

利益

0.86

 interests

0.85

 हितों

0.83

 интере

0.82

 interessi

0.81

interests

0.80

 intereses

0.78

 welfare

0.76

 interesses

0.75

Activations Density 0.066%