INDEX
Explanations
specific phrases indicating actions and cognitive states involving understanding, planning, and future outcomes
Negative Logits
lus               -0.16
Jet               -0.16
.ColumnHeader     -0.15
(no token shown)  -0.15
oton              -0.14
égor              -0.14
цев               -0.14
ensen             -0.14
wal               -0.14
hand              -0.14
Positive Logits
happening         0.22
happens           0.21
happened          0.20
happen            0.20
ToDo              0.18
done              0.17
aconte            0.16
_done             0.16
Happ              0.16
发生              0.16
Activations Density 0.204%