Explanations

Manmohan Singh, Arts degree, autocratic

Configuration

Prompts (Dashboard): 238,145 prompts, 512 tokens each

Dataset (Dashboard): lmsys + oasst1

Negative Logits

"kości"  0.41
" Seab"  0.39
"Craig"  0.38
"Skip"  0.38
" ولو"  0.37
"ション"  0.36
"諺"  0.36
"Quantity"  0.36
"Anthony"  0.36
" لها"  0.36

Positive Logits

" riconoscimento"  0.38
"bm"  0.36
" übernimmt"  0.35
" acknowledges"  0.35
"acknowledge"  0.34
"body"  0.34
"璀"  0.33
" primeiros"  0.33
" matcher"  0.33
" kajian"  0.33

Activations Density: 0.002%
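
To give a sense of scale for the figures above, the sketch below works out how many tokens the dashboard prompts contain and roughly how many of them a density of 0.002% would correspond to. It rests on assumptions that the page itself does not state: that every prompt is a full 512 tokens, that the density was measured over this same prompt set, and that "density" means the share of token positions where the feature activates at all. The function names are illustrative only.

```python
# Figures copied from the dashboard listing above.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
ACTIVATION_DENSITY_PERCENT = 0.002


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total token positions, assuming every prompt has the full length."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(token_count: int, density_percent: float) -> float:
    """Approximate number of token positions where the feature fires.

    Assumes density is the percentage of token positions with a
    nonzero activation (an interpretation, not stated on the page).
    """
    return token_count * density_percent / 100.0


if __name__ == "__main__":
    tokens = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
    active = expected_active_tokens(tokens, ACTIVATION_DENSITY_PERCENT)
    print(f"Total tokens: {tokens:,}")
    print(f"Approximate activating tokens: {active:,.0f}")
```

Under these assumptions the feature would fire on only a few thousand of roughly 122 million token positions, which fits a narrow, entity-specific explanation.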