INDEX
Explanations
specific numerical values and counts related to various items and categories
Negative Logits
unas      -0.15
133       -0.15
optera    -0.15
umer      -0.15
iaux      -0.14
raud      -0.14
ory       -0.14
ri        -0.14
bjerg     -0.14
ica       -0.14
Positive Logits
ième      0.19
fold      0.17
-sided    0.17
antry     0.16
-handed   0.15
onta      0.15
/qu       0.15
/if       0.15
orra      0.15
odoxy     0.15
Activations Density 0.205%