INDEX
Explanations
references to physical properties like density
mentions of "density"
Negative Logits
opal              -0.82
uberty            -0.80
unes              -0.80
akia              -0.77
porary            -0.75
soDeliveryDate    -0.73
WATCHED           -0.73
bara              -0.72
oğ                -0.70
Ago               -0.69
Positive Logits
density           1.04
ratio             0.88
density           0.84
ratios            0.83
cooker            0.80
dens              0.75
eater             0.74
utilization       0.74
gradient          0.73
clust             0.73
Activations Density 0.009%