INDEX

Explanations

describing states and actions

New Auto-Interp

Configuration

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

фера

0.51

 жё

0.50

푅

0.47

忔

0.47

처

0.46

ologien

0.45

쏜

0.45

 meget

0.43

fetched

0.43

;|

0.43

POSITIVE LOGITS

 drama

0.47

າວ

0.45

 dialysis

0.44

de

0.44

 komunitas

0.43

די

0.43

veg

0.42

א

0.42

ald

0.42

linux

0.42

Activations Density 0.000%