INDEX
Explanations
mentions of memorable scenes from specific movies
references to or descriptions of scenes from movies or TV shows, particularly iconic moments or characters
Negative Logits
METHOD        -0.77
scl           -0.75
osponsors     -0.73
foreseen      -0.72
gression      -0.71
nington       -0.71
Statement     -0.70
erity         -0.70
jri           -0.69
disag         -0.69
Positive Logits
movies        1.33
sitcom        1.26
movie         1.26
films         1.23
film          1.14
Terminator    1.10
flick         1.08
Ghostbusters  1.07
Movie         1.05
movie         1.05
Activations Density 0.377%