INDEX
Explanations
statements about intensity or severity in narratives
dangerous magnitude
New Auto-Interp
Negative Logits
-0.55
UserScript
-0.54
CallOverrides
-0.54
цездатний
-0.52
fortawesome
-0.51
NSCoder
-0.50
ädie
-0.49
IFTT
-0.48
Anſ
-0.48
вікі
-0.48
POSITIVE LOGITS
high
0.41
dangerous
0.39
dangers
0.37
caution
0.37
hohen
0.36
huge
0.36
large
0.35
tapete
0.35
severe
0.34
scary
0.34
Activations Density 0.087%