INDEX
Explanations
mentions of the chemical nerve agent "sarin"
occurrences of the word "sar in" and its variations
New Auto-Interp
Negative Logits
Trend
-0.87
Ĵ
-0.75
bes
-0.73
District
-0.73
¤
-0.72
Offline
-0.71
Univ
-0.71
Happiness
-0.68
stakes
-0.67
Ń·
-0.67
POSITIVE LOGITS
arin
1.55
illac
1.01
anus
0.98
alyst
0.85
hin
0.84
ergic
0.84
thal
0.83
oranges
0.83
xual
0.81
anguage
0.81
Activations Density 0.015%