INDEX
Explanations
proper nouns, particularly names of people and places
Negative Logits
kontakte        -0.17
inus            -0.16
ActivityResult  -0.15
atus            -0.14
loquent         -0.14
ture            -0.14
emu             -0.14
oldt            -0.14
tick            -0.13
ента            -0.13
Positive Logits
opoulos         0.33
ou              0.28
akis            0.28
Pap             0.27
outs            0.26
ourg            0.26
oul             0.26
tou             0.25
iou             0.25
atos            0.24
Activations Density 0.048%