INDEX
Explanations
phrases related to publicity stunts
various uses of the word "stunt" in different contexts
New Auto-Interp
Negative Logits
vironment
-0.70
ishops
-0.66
oice
-0.65
icrobial
-0.64
oan
-0.64
Chapters
-0.64
uni
-0.63
soType
-0.62
inda
-0.61
Sett
-0.61
POSITIVE LOGITS
stunt
0.93
stunts
0.92
hered
0.90
strip
0.78
olicy
0.76
monkey
0.75
nel
0.74
crew
0.73
sled
0.73
enegger
0.71
Activations Density 0.020%