INDEX
Explanations
actions related to physical struggle and collapse
New Auto-Interp
Negative Logits
terminal
-0.17
Terminal
-0.17
-terminal
-0.16
agency
-0.16
اص
-0.16
keit
-0.15
borg
-0.15
aes
-0.15
terminal
-0.14
lew
-0.14
POSITIVE LOGITS
spacer
0.16
echan
0.15
ocos
0.15
rak
0.15
HEAP
0.14
OF
0.14
celik
0.14
agrid
0.14
mechan
0.14
851
0.14
Activations Density 0.068%