INDEX
Explanations
references to the word "ok" with varying levels of emphasis
instances of the word "ok."
New Auto-Interp
Negative Logits
bugs
-0.70
lav
-0.66
oire
-0.65
ONSORED
-0.65
WHERE
-0.64
icone
-0.64
Arcade
-0.63
Agric
-0.63
natureconservancy
-0.62
missionaries
-0.62
POSITIVE LOGITS
lahoma
1.05
unin
1.03
lass
0.98
ettle
0.94
awaru
0.92
nown
0.92
aido
0.86
ernel
0.86
itty
0.86
owski
0.85
Activations Density 0.027%