INDEX
Explanations
references to the word "book."
the word "ok" in various contexts
Negative Logits
lav          -0.80
Legions      -0.74
cort         -0.72
FACE         -0.67
Engineers    -0.67
draped       -0.67
Veil         -0.65
••••         -0.64
duct         -0.63
ヘラ         -0.62
Positive Logits
lahoma       1.17
ok           1.09
arak         1.01
wana         0.95
uten         0.93
oro          0.92
sha          0.90
owski        0.89
obo          0.89
awaru        0.89
Activations Density: 0.012%