INDEX
Explanations
references to fictional characters or settings
New Auto-Interp
Negative Logits
esi
-0.16
TERM
-0.15
inse
-0.15
Chancellor
-0.15
Mellon
-0.14
ãģ¹ãģį
-0.14
lk
-0.14
иÑĢа
-0.14
jang
-0.14
seo
-0.14
POSITIVE LOGITS
izr
0.16
_traits
0.15
ument
0.14
uards
0.14
ostel
0.14
impse
0.14
umblr
0.14
Leonardo
0.13
nit
0.13
çĽ
0.13
Activations Density 0.003%