INDEX
Explanations
specifications or references related to popular films and franchises
New Auto-Interp
Negative Logits
elev
-0.66
suc
-0.65
ĪĴ
-0.65
peanuts
-0.62
knees
-0.62
sooner
-0.62
illegitimate
-0.61
_>
-0.60
oler
-0.60
cursing
-0.60
POSITIVE LOGITS
Courtesy
0.96
©
0.95
Courtesy
0.88
Provided
0.85
COURT
0.85
Photograph
0.80
Bett
0.79
Getty
0.79
REUTERS
0.77
Contemporary
0.75
Activations Density 0.015%