INDEX
Explanations
elements related to observational commentary or opinion expressions
Negative Logits
efined      -0.17
ial         -0.16
htt         -0.15
uty         -0.15
ses         -0.14
upertino    -0.14
esan        -0.14
ницы        -0.14
ASS         -0.14
aspers      -0.14
Positive Logits
Binder          0.15
azu             0.14
synchronized    0.14
یز              0.14
Jame            0.14
alles           0.13
vla             0.13
kop             0.13
bla             0.13
place           0.13
Activations Density 0.114%