INDEX
Explanations
instances of the word "strong" and its variations, indicating strength or power
New Auto-Interp
Negative Logits
ized
-0.18
bian
-0.17
ë¡ľ
-0.17
jur
-0.16
fulness
-0.15
umas
-0.15
aled
-0.15
ION
-0.15
otros
-0.15
cha
-0.14
POSITIVE LOGITS
holds
0.36
mẽ
0.25
bow
0.24
(er
0.23
hold
0.22
-arm
0.21
-strong
0.21
/we
0.20
bonds
0.19
enough
0.19
Activations Density 0.054%