INDEX
Explanations
words related to stopping or ceasing an action
occurrences of the word "halt" and its variations in various contexts
New Auto-Interp
Negative Logits
Sov
-0.90
rics
-0.79
Dynamics
-0.76
ãĤĮ
-0.72
çīĪ
-0.72
ophers
-0.69
iosyncr
-0.68
ramid
-0.68
aldo
-0.66
RAM
-0.65
POSITIVE LOGITS
ŃĶ
0.86
steen
0.83
halt
0.82
ignt
0.79
seiz
0.74
derail
0.73
halted
0.70
down
0.69
downs
0.67
abruptly
0.66
Activations Density 0.018%