INDEX
Explanations
methods related to retrieving and manipulating object attributes or states
New Auto-Interp
Negative Logits
-
-0.69
-0.63
=
-0.62
*
-0.61
t
-0.61
set
-0.60
*
-0.60
p
-0.59
se
-0.59
//
-0.59
POSITIVE LOGITS
myſelf
1.62
himſelf
1.52
ſelf
1.50
itſelf
1.49
themſelves
1.46
ſelves
1.42
pleaſure
1.36
purpoſe
1.32
raiſ
1.28
ſever
1.25
Activations Density 0.036%