INDEX

Explanations

pronoun "her" or "she"

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

aires

0.52

erà

0.46

роваться

0.46

DEL

0.46

 cargos

0.45

yesters

0.45

 cereals

0.44

笏

0.44

ಾರ್ಥ

0.44

cerer

0.43

POSITIVE LOGITS

0.50

0.48

0.47

0.46

 Squash

0.45

亩

0.44

Double

0.43

 Humanitarian

0.43

0.42

Activations Density 0.041%