INDEX
Explanations
phrases related to processes or actions
phrases that refer to the various ways something is done or achieved
New Auto-Interp
Negative Logits
acerb
-0.78
drowning
-0.75
bda
-0.71
bas
-0.68
åij
-0.64
kaya
-0.62
odium
-0.62
topic
-0.62
tty
-0.62
Guard
-0.61
POSITIVE LOGITS
soever
0.88
they
0.78
humans
0.77
individuals
0.77
corporations
0.73
we
0.72
Europeans
0.69
historians
0.68
observers
0.67
workers
0.66
Activations Density 0.114%