INDEX
Explanations
phrases related to topics of seriousness or importance
instances of the word "serious" in various contexts
New Auto-Interp
Negative Logits
enaries
-0.83
wright
-0.81
av
-0.75
sylv
-0.72
ifully
-0.72
tein
-0.71
AW
-0.70
anus
-0.70
seamlessly
-0.69
anic
-0.69
POSITIVE LOGITS
consideration
1.01
lly
0.91
contender
0.85
threat
0.81
injury
0.79
serious
0.78
allegation
0.77
danger
0.77
trouble
0.77
serious
0.76
Activations Density 0.057%