INDEX
Explanations
references to numerical values or quantities
New Auto-Interp
Negative Logits
antan
-0.20
outsider
-0.16
&a
-0.15
ied
-0.15
uyết
-0.15
iams
-0.15
zin
-0.14
-outs
-0.14
OUTPUT
-0.14
sgiving
-0.14
POSITIVE LOGITS
nowhere
0.36
bounds
0.30
reach
0.27
sight
0.25
Bounds
0.22
reach
0.21
0.21
bounds
0.20
necessity
0.20
_bounds
0.20
Activations Density 0.050%