INDEX
Explanations
the word "name" in various contexts
New Auto-Interp
Negative Logits
ness
-0.60
.
-0.48
matur
-0.46
ten
-0.46
nes
-0.45
ley
-0.45
our
-0.44
ly
-0.44
щихся
-0.43
NESS
-0.42
POSITIVE LOGITS
name
2.32
navnet
1.29
names
1.27
naam
1.19
namanya
1.11
名前
1.05
ⓧ
1.05
이름
1.03
rungsseite
1.03
名字
1.01
Activations Density 0.163%