Explanations
instances of the word "over" and its variations as a significant term
Negative Logits
qui      -0.17
ic       -0.16
sa       -0.16
orph     -0.15
949      -0.15
wards    -0.14
ilar     -0.14
isc      -0.14
owie     -0.14
bh       -0.14
Positive Logits
stock    0.17
nite     0.17
heid     0.17
eview    0.17
tones    0.17
_UNDER   0.17
900      0.16
ắn       0.16
Drive    0.15
konkrét  0.15
Activations Density 0.041%