INDEX
Explanations
the presence of knowledge or familiarity related to various subjects
New Auto-Interp
Negative Logits
ksi
-0.16
ilor
-0.16
ka
-0.16
ente
-0.15
aja
-0.15
#__
-0.14
with
-0.14
umeric
-0.14
ustr
-0.13
efe
-0.13
POSITIVE LOGITS
well
0.35
better
0.32
well
0.27
intimately
0.27
better
0.27
mieux
0.25
intimate
0.25
backwards
0.25
WELL
0.24
Better
0.24
Activations Density 0.163%