INDEX
Explanations
words related to problems or conflicts
occurrences of the word "trouble" and its variations
New Auto-Interp
Negative Logits
oliberal
-0.71
ivist
-0.69
Revolution
-0.65
itar
-0.65
urse
-0.65
arb
-0.64
ithe
-0.63
rylic
-0.63
hetical
-0.62
ory
-0.62
POSITIVE LOGITS
hooting
1.35
makers
1.18
maker
0.99
some
0.91
engers
0.86
making
0.86
neck
0.83
nel
0.80
troubles
0.79
brewing
0.79
Activations Density 0.063%