INDEX
Explanations
details about specific movies
phrases related to political issues and notable individuals
New Auto-Interp
Negative Logits
":[{"-0.50
++++++++++++++++
-0.43
FANTASY
-0.43
aback
-0.42
iple
-0.42
Picture
-0.41
':
-0.41
:=
-0.41
ERG
-0.40
âĢº
-0.40
POSITIVE LOGITS
)).
0.86
%).
0.81
.).
0.80
).[
0.78
).
0.77
]).
0.76
?).
0.73
)."
0.71
]."
0.71
").
0.70
Activations Density 3.843%