INDEX
Explanations
references to the concept of "base" in various contexts
Negative Logits
sla      -0.19
sov      -0.17
naire    -0.17
feas     -0.16
sp       -0.16
ats      -0.16
sh       -0.16
hare     -0.15
acity    -0.15
ride     -0.15
Positive Logits
/base    0.23
-base    0.20
(base    0.17
base     0.17
cover    0.16
yonel    0.16
alone    0.15
.base    0.15
Unidos   0.15
brook    0.15
Activations Density: 0.031%