INDEX
Explanations
references to threats and dangers
New Auto-Interp
Negative Logits
scolas
-0.75
tituts
-0.69
Absorption
-0.68
hoga
-0.68
☀
-0.65
roxene
-0.63
urator
-0.62
adins
-0.61
oflavin
-0.60
uinal
-0.59
POSITIVE LOGITS
threat
1.82
threats
1.72
threat
1.72
Threat
1.64
Threats
1.60
Threat
1.57
threatens
1.45
threatened
1.43
Threats
1.39
threaten
1.36
Activations Density 0.085%