INDEX
Explanations
instances of verbs conveying action or change
New Auto-Interp
Negative Logits
šem
-0.15
.Deep
-0.14
dbo
-0.14
Deutsche
-0.14
uzey
-0.13
iore
-0.13
.Dispose
-0.13
Dias
-0.13
Deutsch
-0.13
dims
-0.13
POSITIVE LOGITS
down
1.27
-down
1.04
down
1.00
Down
0.96
DOWN
0.94
Down
0.92
_down
0.84
.down
0.81
DOWN
0.78
_DOWN
0.68
Activations Density 0.330%