INDEX
Explanations
verbs related to actions and their execution
Negative Logits
houſe       -0.79
Efq         -0.77
Houſe       -0.77
pleaſure    -0.77
Theſe       -0.75
fometimes   -0.75
fince       -0.73
becauſe     -0.71
ftate       -0.70
perfons     -0.70
Positive Logits
three       0.65
began       0.59
became      0.59
ecore       0.59
Then        0.59
five        0.59
turned      0.56
Then        0.55
then        0.55
four        0.55
Activations Density: 0.476%