INDEX
Explanations
instances of emphasis on current action or state
Negative Logits
VOKE      -0.18
grim      -0.17
Ïģη       -0.16
/story    -0.15
ifix      -0.14
ÑĢай      -0.14
Defaults  -0.14
visible   -0.14
\<^       -0.14
ÙĤدر      -0.14
Positive Logits
ekce      0.16
ieres     0.15
mocker    0.14
Rover     0.14
omer      0.14
Dudley    0.13
عاÙĨ      0.13
UCT       0.13
ätz       0.13
Booth     0.13
Activations Density: 0.000%