INDEX
Explanations
references to awards and recognitions in various contexts
New Auto-Interp
Negative Logits
elta
-0.17
umes
-0.15
born
-0.15
zar
-0.15
fold
-0.14
endar
-0.14
errer
-0.14
øy
-0.14
ulg
-0.14
ader
-0.14
POSITIVE LOGITS
choice
0.23
year
0.22
Month
0.22
Year
0.20
month
0.20
decade
0.19
Month
0.18
YEAR
0.18
choice
0.18
century
0.18
Activations Density 0.018%