INDEX
Explanations
terms related to fire and firefighting
New Auto-Interp
Negative Logits
soever
-0.19
rej
-0.17
ahr
-0.17
sk
-0.17
sett
-0.16
itud
-0.16
orative
-0.16
sst
-0.16
uco
-0.16
ask
-0.15
POSITIVE LOGITS
nze
0.34
places
0.32
brand
0.30
ball
0.27
brands
0.27
work
0.27
starter
0.26
nds
0.26
proof
0.26
walls
0.26
Activations Density 0.037%