INDEX
Explanations
phrases related to tasks and their completion
Describes how something is done
New Auto-Interp
Negative Logits
matchCondition
-0.53
鰭
-0.52
ScreenState
-0.51
géographie
-0.50
tainen
-0.49
Availability
-0.48
Immediate
-0.47
Carreira
-0.47
<bos>
-0.46
Amm
-0.46
POSITIVE LOGITS
efficiently
0.94
PROPER
0.88
properly
0.86
correctly
0.86
flawlessly
0.84
efficient
0.82
Proper
0.81
proprement
0.79
proper
0.78
smarter
0.78
Activations Density 0.212%