INDEX
Explanations
concepts and terms related to minimalism
New Auto-Interp
Negative Logits
754
-0.16
388
-0.16
pong
-0.15
Barg
-0.15
168
-0.15
amiento
-0.15
INESS
-0.14
anel
-0.14
outh
-0.14
ppo
-0.14
POSITIVE LOGITS
/max
0.31
istic
0.26
ogue
0.23
/min
0.23
isters
0.22
IMUM
0.22
=min
0.21
imize
0.21
imal
0.20
amount
0.20
Activations Density 0.018%