INDEX
Explanations
references to young people and their experiences or characteristics
New Auto-Interp
Negative Logits
young
-0.68
young
-0.66
joven
-0.64
jeune
-0.59
YOUNG
-0.58
Young
-0.58
junge
-0.57
YOUNG
-0.57
jungen
-0.57
younger
-0.57
POSITIVE LOGITS
blood
0.80
sters
0.72
lings
0.69
ster
0.60
ish
0.59
bucks
0.58
STERS
0.57
esters
0.57
guns
0.56
adulthood
0.56
Activations Density 0.098%