Explanations
instances of potential danger or threats that are close in time or likely to happen soon
terms indicating an impending threat or danger
Negative Logits
rooms     -0.86
lua       -0.85
girls     -0.84
yards     -0.81
pel       -0.76
verbs     -0.75
yer       -0.74
tek       -0.71
girl      -0.71
strap     -0.71
Positive Logits
doom       1.26
demise     1.01
imminent   0.98
impending  0.98
famine     0.96
arrival    0.93
danger     0.92
threat     0.90
threats    0.89
endanger   0.86
Activations Density: 0.015%