INDEX
Explanations
proper nouns related to individuals and places
New Auto-Interp
Negative Logits
åĢ«
-0.16
oir
-0.15
ooke
-0.15
arkin
-0.15
Äijâu
-0.14
yect
-0.14
loth
-0.14
rish
-0.14
abbo
-0.14
rans
-0.14
POSITIVE LOGITS
xa
0.17
خت
0.17
prospects
0.15
кÑĤÑĥ
0.15
von
0.14
ben
0.14
sovere
0.14
ÌĨ
0.13
0.13
Pros
0.13
Activations Density 0.234%