INDEX
Explanations
topics, events, or situations that are considered extreme or out of the ordinary
references to extreme situations or conditions
New Auto-Interp
Negative Logits
yard
-0.84
nance
-0.81
sburgh
-0.80
fman
-0.79
shaw
-0.79
bats
-0.79
ËĪ
-0.78
omo
-0.77
stown
-0.76
nexus
-0.73
POSITIVE LOGITS
extreme
0.90
extremes
0.89
lengths
0.86
temperatures
0.80
amounts
0.77
measures
0.75
poverty
0.75
Extreme
0.75
vetting
0.74
ideologies
0.73
Activations Density 0.011%