INDEX
Explanations
proper names of individuals or organizations
Names of people and organizations
names followed by surnames or verbs
New Auto-Interp
Negative Logits
出版年
-0.97
autorytatywna
-0.97
Taktlose
-0.86
Хьажоргаш
-0.77
verwijspagina
-0.77
хьтан
-0.76
oa̍t
-0.74
Infórmanos
-0.74
ſind
-0.74
queſta
-0.73
POSITIVE LOGITS
is
0.46
is
0.40
1
0.38
2
0.37
was
0.35
The
0.35
has
0.34
-
0.33
是
0.33
ans
0.33
Activations Density 0.307%