INDEX
Explanations
mentions of numerical values associated with physical characteristics such as density and thresholds
references to density in various contexts
New Auto-Interp
Negative Logits
akia
-0.85
bara
-0.83
uberty
-0.81
pha
-0.77
porary
-0.75
McDonnell
-0.72
utic
-0.69
udos
-0.69
WATCHED
-0.68
cffffcc
-0.67
POSITIVE LOGITS
density
0.92
density
0.76
icity
0.73
clust
0.72
eater
0.71
ikuman
0.71
census
0.70
foam
0.70
ensity
0.69
layer
0.69
Activations Density 0.022%