INDEX
Explanations
references to frontline workers and their experiences
New Auto-Interp
Negative Logits
ilig
-0.17
ndl
-0.15
uluk
-0.15
jac
-0.15
alah
-0.15
ENCHMARK
-0.15
åŀ
-0.15
Gry
-0.14
Hag
-0.14
maz
-0.14
POSITIVE LOGITS
vert
0.17
uth
0.14
elle
0.14
segment
0.14
Re
0.13
xe
0.13
innocent
0.13
-feed
0.13
eld
0.13
_locale
0.13
Activations Density 0.122%