INDEX
Explanations
terms related to fire and firefighting
New Auto-Interp
Negative Logits
soever
-0.20
rej
-0.18
sWith
-0.18
sk
-0.17
ahr
-0.17
sst
-0.17
itud
-0.16
ask
-0.16
sett
-0.16
orative
-0.16
POSITIVE LOGITS
places
0.34
nze
0.34
brand
0.32
brands
0.30
ball
0.28
work
0.27
proof
0.27
nds
0.26
starter
0.26
balls
0.26
Activations Density 0.035%