INDEX
Explanations
phrases related to the establishment or founding of organizations and their history
New Auto-Interp
Negative Logits
ears
-0.15
wsz
-0.14
ÑĥÑģÑĤановлен
-0.14
lẫn
-0.14
ipa
-0.13
että
-0.13
ãģ©
-0.13
å¾Ģ
-0.12
cluding
-0.12
دث
-0.12
POSITIVE LOGITS
out
0.27
initially
0.26
originally
0.23
with
0.22
under
0.22
aim
0.22
initial
0.21
Initially
0.20
prim
0.20
aiming
0.20
Activations Density 0.160%