INDEX
Explanations
references to the word "dim" and its variations
New Auto-Interp
Negative Logits
naire
-0.16
zen
-0.16
ificant
-0.15
ame
-0.15
ificate
-0.15
aleza
-0.15
naires
-0.15
athan
-0.15
PFN
-0.15
alg
-0.14
POSITIVE LOGITS
ENSIONS
0.27
Dim
0.27
dim
0.26
Dim
0.25
ENSION
0.25
inished
0.25
dim
0.23
ensions
0.23
ethyl
0.22
,dim
0.21
Activations Density 0.013%