INDEX
Explanations
instructions or commands focusing on physical actions
phrases related to decision-making and taking action
New Auto-Interp
Negative Logits
TION
-0.92
proble
-0.76
requ
-0.76
nown
-0.74
ItemThumbnailImage
-0.73
Pwr
-0.70
etheless
-0.68
yssey
-0.68
ĺħ
-0.67
DragonMagazine
-0.67
POSITIVE LOGITS
chairs
1.21
torches
1.16
desks
1.14
benches
1.11
pipes
0.99
typew
0.99
bricks
0.98
keyboards
0.97
shovel
0.97
knives
0.97
Activations Density 0.730%