INDEX
Explanations
phrases related to chronology and location
New Auto-Interp
Negative Logits
Ø«ÛĮر
-0.15
θή
-0.14
anv
-0.14
_tE
-0.14
yh
-0.14
_mB
-0.14
stÅĻÃŃ
-0.14
chip
-0.14
ittest
-0.14
agedList
-0.14
POSITIVE LOGITS
er
0.27
le
0.21
ANTS
0.18
ants
0.18
les
0.17
pres
0.17
antes
0.16
Roose
0.16
erule
0.16
’
0.15
Activations Density 0.008%