INDEX
Explanations
numeric sequences such as dates, numbers, and codes
references to numerical indicators or counts
New Auto-Interp
Negative Logits
gerald
-0.73
manship
-0.61
"$:/
-0.61
awaru
-0.59
uine
-0.59
utan
-0.58
rolet
-0.58
ians
-0.58
urst
-0.56
ured
-0.56
POSITIVE LOGITS
nd
2.06
ND
1.17
133
1.04
147
1.03
160
1.02
thirds
0.97
externalToEVAOnly
0.96
187
0.95
245
0.92
155
0.91
Activations Density 0.135%