INDEX
Explanations
terms related to fire safety and extinguishing agents
Negative Logits
hardt       -0.17
ole         -0.15
yard        -0.14
epit        -0.14
onym        -0.14
aturated    -0.14
onymous     -0.14
spinner     -0.14
demean      -0.14
osaur       -0.13
Positive Logits
sprink      0.37
fire        0.34
extingu     0.31
smoke       0.30
Fire        0.29
Spr         0.28
firefight   0.28
Smoke       0.28
Smoke       0.27
.fire       0.27
Activations Density 0.019%