INDEX
Explanations
words related to actors or characters in the context of movies or media
recurring mentions of the term "star"
New Auto-Interp
Negative Logits
odcast
-0.83
esville
-0.69
essor
-0.69
apons
-0.68
mble
-0.67
idences
-0.66
Downloadha
-0.64
berra
-0.63
subp
-0.63
heastern
-0.62
POSITIVE LOGITS
star
1.27
stars
1.06
liner
0.85
stri
0.84
burst
0.82
stars
0.82
Stars
0.82
vation
0.81
fare
0.77
bucks
0.77
Activations Density 0.006%