INDEX
Explanations
references to the concept of scale, particularly in relation to dimensions or levels of measurement
New Auto-Interp
Negative Logits
zelf
-0.18
ernals
-0.17
veau
-0.15
sell
-0.15
assis
-0.15
uries
-0.15
iates
-0.15
Hakk
-0.15
ession
-0.14
respond
-0.14
POSITIVE LOGITS
-down
0.28
-up
0.27
able
0.23
out
0.22
-out
0.20
way
0.20
ToFit
0.20
tron
0.18
up
0.18
azy
0.18
Activations Density 0.020%