INDEX
Explanations
phrases related to setting fires or conflicts
words and phrases related to fire, aggression, and conflict
New Auto-Interp
Negative Logits
Huntington
-0.71
commute
-0.66
cheat
-0.64
imeter
-0.63
quint
-0.61
lifetime
-0.59
peas
-0.56
towed
-0.56
paid
-0.55
hoe
-0.55
POSITIVE LOGITS
EEE
0.75
actionDate
0.73
aciously
0.72
§
0.71
earances
0.70
Dialogue
0.70
Film
0.70
ãĥĢ
0.69
76561
0.68
andise
0.66
Activations Density 0.159%