INDEX
Explanations
phrases indicating intention or purpose
phrases indicating intentions or actions related to personal and collective experiences
New Auto-Interp
Negative Logits
acies
-0.74
ventions
-0.70
sequ
-0.69
zie
-0.69
artisan
-0.66
Cas
-0.66
orio
-0.65
ilings
-0.65
ournal
-0.65
cffff
-0.63
POSITIVE LOGITS
liest
0.89
hest
0.71
longest
0.68
proverb
0.63
chin
0.62
iest
0.62
EMENT
0.59
richest
0.59
"$:/
0.58
behav
0.58
Activations Density 0.324%