INDEX
Explanations
references to the word "new" in various contexts
New Auto-Interp
Negative Logits
sqor
-0.83
ascript
-0.82
ONSORED
-0.81
ickr
-0.75
uca
-0.75
OPA
-0.74
iage
-0.74
actionGroup
-0.72
aminer
-0.72
pless
-0.70
POSITIVE LOGITS
Zealand
1.35
Testament
1.21
York
1.21
Orleans
1.16
bies
1.13
bie
1.06
Years
1.03
Yorker
1.02
Guinea
1.00
Era
0.99
Activations Density 0.039%