INDEX
Explanations
phrases related to actions and instructions
actions and activities in the narrative context
New Auto-Interp
Negative Logits
unity
-0.74
cens
-0.73
ection
-0.72
chev
-0.70
ortium
-0.70
uum
-0.69
avin
-0.66
][
-0.66
ã
-0.64
ás
-0.63
POSITIVE LOGITS
alike
0.80
frantically
0.66
instead
0.65
blindly
0.65
prest
0.65
anew
0.63
cryst
0.61
Doc
0.61
REPL
0.61
immediately
0.60
Activations Density 0.376%