INDEX
Explanations
phrases related to planning and decision-making
phrases related to teamwork and collaboration
Negative Logits
arist          -0.60
surprisingly   -0.60
*:             -0.59
®              -0.59
%:             -0.55
NOW            -0.54
Alas           -0.53
Amid           -0.52
ãĤ´ãĥ³         -0.51
îĢ             -0.51
Positive Logits
[              0.94
â̦"           0.84
,'"            0.82
),"            0.80
)."            0.80
gonna          0.78
..."           0.78
everybody      0.77
,"             0.77
.'"            0.75
Activations Density 1.515%