INDEX
Explanations
movie titles enclosed in quotation marks
quotations from various sources
Negative Logits
²¾         -0.79
senal      -0.75
¬¼         -0.73
ĻĤ         -0.72
ife        -0.69
cha        -0.67
comings    -0.66
acas       -0.65
isl        -0.64
nex        -0.63
Positive Logits
/"             0.85
referring      0.82
aka            0.79
meaning        0.78
SPONSORED      0.78
implying       0.74
namely         0.72
according      0.71
referencing    0.71
agy            0.70
Activations Density: 0.061%