INDEX
Explanations
references to places, roles, or entities involved in agreements or exchanges
New Auto-Interp
Negative Logits
and
-0.54
the
-0.52
which
-0.39
of
-0.38
from
-0.37
with
-0.37
that
-0.36
these
-0.35
this
-0.35
their
-0.33
POSITIVE LOGITS
nôtre
0.77
vôtre
0.76
Económica
0.74
ejército
0.72
étoient
0.71
trône
0.71
préstamo
0.71
pédagogique
0.69
literario
0.69
règne
0.68
Activations Density 0.083%