INDEX
Explanations
expressions related to default settings or configurations
New Auto-Interp
Negative Logits
a
-0.70
DTO
-0.64
.
-0.58
k
-0.53
::
-0.51
admin
-0.51
(
-0.50
i
-0.49
2
-0.49
dagog
-0.48
POSITIVE LOGITS
default
1.67
defaults
1.46
default
1.42
Default
1.39
defaults
1.37
DEFAULT
1.21
Default
1.20
Defaults
1.19
默认
1.15
standard
1.13
Activations Density 0.147%