INDEX
Explanations
references to decision-making processes and personal aspirations
New Auto-Interp
Negative Logits
reference
-0.17
áty
-0.16
reference
-0.15
lac
-0.15
iri
-0.15
referencing
-0.14
usu
-0.14
commune
-0.14
ecycle
-0.14
Dani
-0.14
POSITIVE LOGITS
narr
0.21
crib
0.16
hv
0.16
anyhow
0.15
term
0.15
preferred
0.15
religious
0.15
ptal
0.15
pac
0.15
used
0.14
Activations Density 0.537%