INDEX
Explanations
terms related to authority or control roles
New Auto-Interp
Negative Logits
es
-0.83
wyn
-0.79
hynch
-0.73
<blockquote>
-0.70
ES
-0.70
ernalia
-0.69
𝗲
-0.68
sjø
-0.68
czaj
-0.66
̀n
-0.66
POSITIVE LOGITS
ator
1.41
ators
1.16
vator
1.13
ATOR
1.09
urator
1.04
ator
1.02
icator
1.01
strator
0.98
Ziegler
0.97
Locator
0.95
Activations Density 0.066%