INDEX
Explanations
references to economic commentary and political figures, especially in context to satire and critique
New Auto-Interp
Negative Logits
,
-0.54
start
-0.44
brazos
-0.44
kife
-0.42
but
-0.42
berper
-0.42
蠻
-0.41
go
-0.41
missione
-0.41
&
-0.40
POSITIVE LOGITS
batore
0.91
AnchorStyles
0.87
etheless
0.86
}`).
0.85
!')
0.83
"]).
0.83
’).
0.82
kegaard
0.82
FormTagHelper
0.81
Efq
0.81
Activations Density 0.601%