INDEX
Explanations
numbers related to quantities or measurements
numerical values, particularly those related to counts or measurements
Negative Logits
bara      -0.92
oru       -0.90
lace      -0.76
iguous    -0.70
folk      -0.69
ndra      -0.66
gart      -0.65
pared     -0.65
ça        -0.65
lay       -0.65
Positive Logits
ILCS      1.25
129       0.75
chars     0.74
"$:/      0.72
dB        0.71
131       0.70
dB        0.68
sshd      0.66
Travels   0.66
EMBER     0.66
Activations Density: 0.021%