Explanations
references to time spent, particularly in hours
Negative Logits
ione    -0.18
chor    -0.18
éį    -0.15
Pride    -0.14
WXYZ    -0.14
adla    -0.14
ience    -0.14
Bray    -0.14
داد    -0.13
ji    -0.13
Positive Logits
oÅĽci    0.14
neys    0.14
dint    0.13
شتÙĩ    0.13
ouser    0.13
esian    0.13
_sensitive    0.13
edik    0.13
Hess    0.13
esk    0.13
Activations Density: 0.012%