INDEX
Explanations
phrases related to memorable dialogue or quotes from movies
New Auto-Interp
Negative Logits
deflate
-0.17
_tile
-0.16
ä»Ļ
-0.15
662
-0.15
erdale
-0.15
ông
-0.14
tile
-0.14
urette
-0.14
ocked
-0.14
еÑĢÑĤи
-0.13
POSITIVE LOGITS
Termin
0.35
TERMIN
0.35
Terminator
0.33
termin
0.32
terminator
0.32
Schwar
0.30
Judgment
0.29
Arnold
0.29
termin
0.28
Sarah
0.27
Activations Density 0.005%