INDEX
Explanations
phrases related to technological activities or tools
verbs and nouns related to action and manipulation
New Auto-Interp
Negative Logits
ãĥĹ
-0.65
º
-0.60
capacity
-0.60
WR
-0.58
arsity
-0.57
immer
-0.57
aida
-0.57
ãĤ´ãĥ³
-0.56
constellation
-0.56
ãĥ¼ãĥĨãĤ£
-0.56
POSITIVE LOGITS
oneself
0.93
ulate
0.90
ify
0.89
them
0.89
ourselves
0.86
everything
0.86
yourself
0.85
edly
0.85
THEM
0.83
him
0.82
Activations Density 0.352%