INDEX
Explanations
instances of the verb "to be" in various forms
Negative Logits
strike                 -0.67
Lind                   -0.66
oire                   -0.64
riz                    -0.63
nder                   -0.62
(token text missing)   -0.62
ć                      -0.62
henko                  -0.61
luence                 -0.61
‑                      -0.61
Positive Logits
adversaries            0.74
���                    0.73
icons                  0.72
attackers              0.72
enemies                0.71
millenn                0.70
indistinguishable      0.69
agles                  0.68
friends                0.67
heroes                 0.66
Activations Density 0.151%