INDEX
Explanations
the word "followers" in sentences
mentions of "followers" in various contexts
New Auto-Interp
Negative Logits
ces
-0.73
Genocide
-0.69
circumstance
-0.65
ced
-0.63
Rim
-0.62
OUT
-0.62
Prosecutor
-0.62
posing
-0.61
Ukrain
-0.60
cer
-0.59
POSITIVE LOGITS
hip
1.39
hips
1.03
followers
0.94
wagon
0.83
follower
0.80
lia
0.80
lihood
0.80
leader
0.79
antry
0.78
fulness
0.78
Activations Density 0.018%