INDEX
Explanations
references to daily or regular activities and human experiences
Negative Logits
>=",      -0.65
lète      -0.61
Espèce    -0.59
unhofer   -0.58
barbie    -0.58
toje      -0.55
chi̍t      -0.53
تقاوى     -0.53
hringer   -0.52
surla     -0.51
Positive Logits
daily     1.35
Daily     1.29
Daily     1.25
daily     1.21
DAILY     1.05
DAILY     1.03
Everyday  0.99
everyday  0.93
Everyday  0.90
每日      0.89
Activations Density 0.133%