INDEX
Explanations
ties to accidents or physical harm
NEGATIVE LOGITS
iety      -0.79
ctuary    -0.76
ieties    -0.72
itone     -0.71
iland     -0.70
istry     -0.70
Reviewed  -0.70
keep      -0.70
ancest    -0.67
keeping   -0.66
POSITIVE LOGITS
gunfire      1.18
lightning    1.08
blows        0.95
bullets      0.94
blow         0.92
projectiles  0.91
iceberg      0.91
debris       0.88
punches      0.85
Hurricane    0.85
Activations Density 0.101%