INDEX
Explanations
instances of the word "mean" and its variations, indicating discussions about meaning or significance
Negative Logits
Token      Logit
azzi       -0.19
gay        -0.16
brtc       -0.15
kova       -0.15
ipa        -0.15
gens       -0.15
oron       -0.15
elts       -0.15
isable     -0.14
lass       -0.14
Positive Logits
Token      Logit
ings       0.26
urement    0.23
INGLE      0.20
while      0.20
ioned      0.19
(mean      0.18
pir        0.18
_squared   0.18
ingles     0.17
-field     0.17
Activations Density: 0.022%