INDEX
Explanations
specific cultural identifiers or characteristics associated with various communities or demographics
old people and seniority
Negative Logits
Mith      -0.59
Avril     -0.57
Mith      -0.54
Cym       -0.52
inigung   -0.52
chell     -0.50
pathways  -0.50
chery     -0.50
ISM       -0.50
AMT       -0.50
Positive Logits
老         1.79
老         1.45
的老       1.34
Lao       1.05
Lao       0.94
lão       0.76
old       0.71
lao       0.69
vieilles  0.68
Old       0.68
Activations Density 0.002%