INDEX
Explanations
instances of the word "lead" in various contexts
New Auto-Interp
Negative Logits
leading
-0.23
Leading
-0.21
leadership
-0.20
Leading
-0.20
fik
-0.19
Leadership
-0.17
osate
-0.17
-leading
-0.17
ation
-0.17
leaders
-0.16
POSITIVE LOGITS
better
0.24
gers
0.21
poisoning
0.20
lined
0.19
ÂŃing
0.18
lights
0.18
off
0.18
à¹Ģà¸Ķà¸Ńร
0.18
singer
0.17
ings
0.16
Activations Density 0.021%