INDEX
Explanations
the concept of satisfaction or fulfilling requirements
New Auto-Interp
Negative Logits
L
-0.74
Ge
-0.74
ge
-0.73
O
-0.71
C
-0.69
’
-0.66
Lu
-0.66
code
-0.64
Bo
-0.64
code
-0.64
POSITIVE LOGITS
Satis
1.64
satisfied
1.50
satisfaction
1.48
satis
1.46
satisfaction
1.46
Satis
1.46
Satisfied
1.44
satisfied
1.43
satis
1.43
Satisfaction
1.42
Activations Density 0.110%