INDEX
Explanations
instances of significant injuries or accidents
New Auto-Interp
Negative Logits
Sadd
-0.16
longleftrightarrow
-0.15
tam
-0.15
hil
-0.14
vak
-0.14
stri
-0.14
.Meta
-0.14
fsp
-0.14
tam
-0.14
_STATS
-0.14
POSITIVE LOGITS
Melbourne
0.23
Sebastian
0.20
Cocoa
0.18
Hutchinson
0.18
Indian
0.18
Treasure
0.18
tit
0.18
Indian
0.17
mel
0.17
indian
0.17
Activations Density 0.027%