INDEX
Explanations
references to victories or achievements
topics related to political success and challenges
New Auto-Interp
Negative Logits
.).
-0.66
".
-0.58
'.
-0.58
âĵĺ
-0.57
%.
-0.57
$.
-0.56
instead
-0.56
usercontent
-0.55
}.
-0.54
atever
-0.54
POSITIVE LOGITS
nutshell
0.51
pires
0.49
depends
0.45
)]
0.44
¶
0.44
bulletin
0.42
ciplinary
0.41
otomy
0.40
varies
0.40
urgical
0.39
Activations Density 4.612%