Explanations
adjectives relating to strictness
instances of the word "strict" in various contexts
Negative Logits
ilater            -0.93
igate             -0.83
assies            -0.81
********          -0.79
ovember           -0.72
FORE              -0.71
****              -0.70
****************  -0.70
ufact             -0.70
xxxxxxxx          -0.70
Positive Logits
adherence         1.13
ures              1.12
ness              1.03
strict            0.95
nesses            0.93
scrutiny          0.86
criteria          0.83
est               0.83
adherent          0.81
restrictive       0.81
Activations Density 0.020%