INDEX
Explanations
references to movies
references to movies
New Auto-Interp
Negative Logits
distress
-0.73
stressing
-0.69
pains
-0.69
orns
-0.67
rist
-0.66
ashington
-0.66
nesota
-0.65
churn
-0.65
commission
-0.64
dread
-0.64
POSITIVE LOGITS
Movie
3.81
Movie
2.68
Movies
2.46
movie
1.96
Cinema
1.80
Film
1.69
Films
1.51
Cinem
1.51
Animation
1.50
Film
1.48
Activations Density 0.026%