Explanations
instances where the number "one" is emphasized or highlighted in the text
instances of the word "one."
Negative Logits
ooks        -0.96
efficients  -0.88
skirts      -0.87
ighters     -0.86
ourses      -0.86
acements    -0.85
ammers      -0.85
apons       -0.84
roots       -0.84
ributes     -0.84
Positive Logits
instance    1.17
person      1.17
hundred     1.09
thing       1.06
aspect      1.03
ounce       0.96
exception   0.94
element     0.93
iteration   0.92
facet       0.91
Activations Density 0.077%