INDEX
Explanations
statements about quantities, statistics, and measurements
New Auto-Interp
Negative Logits
ãģ¾ãģļ
-0.14
ayo
-0.13
however
-0.13
हल
-0.13
(çģ«
-0.12
ipment
-0.12
(åľŁ
-0.12
.unshift
-0.12
svp
-0.12
icy
-0.12
POSITIVE LOGITS
dol
0.16
bote
0.15
other
0.15
ender
0.15
odge
0.14
ãĥīãĥ«
0.14
bond
0.14
orsch
0.14
ìŀĦ
0.14
further
0.14
Activations Density 0.992%