INDEX
Explanations
words related to fires and emergencies
mentions of fires or fire-related incidents
New Auto-Interp
Negative Logits
xus
-0.83
laus
-0.74
Vide
-0.74
Freed
-0.73
Lans
-0.72
atem
-0.72
Virtue
-0.72
sembly
-0.70
amaru
-0.68
VIDIA
-0.68
POSITIVE LOGITS
flies
1.16
storm
1.14
exting
1.10
proof
1.07
balls
1.02
storms
1.02
fighting
1.00
fly
1.00
trap
0.98
fight
0.96
Activations Density 0.029%