INDEX
Explanations
instances of quotes attributed to individuals in the text
New Auto-Interp
Negative Logits
oran
-0.18
ickey
-0.16
riger
-0.15
odos
-0.14
angkan
-0.14
ovu
-0.14
Specifier
-0.14
itan
-0.14
ãĤ¿ãĥ³
-0.14
argin
-0.13
POSITIVE LOGITS
skl
0.15
jit
0.14
yếu
0.14
teki
0.14
itm
0.14
vals
0.14
мени
0.14
BD
0.13
Ñĩий
0.13
ÛĮÙĦÛĮ
0.13
Activations Density 0.177%