INDEX
Explanations
the word "cheer" or its variations
terms related to cheer and enthusiasm
New Auto-Interp
Negative Logits
ngth
-0.67
Danger
-0.60
arin
-0.59
amation
-0.58
consultation
-0.56
sequest
-0.56
orgetown
-0.56
fung
-0.56
ilibrium
-0.55
wedge
-0.55
POSITIVE LOGITS
leaders
1.39
leader
1.39
leading
1.32
cheer
1.13
lead
1.11
fulness
1.02
wart
0.92
cheering
0.92
fully
0.92
jee
0.88
Activations Density 0.024%