INDEX
Explanations
mentions of individualism, collective interests, and society's organization around individual needs and participation
New Auto-Interp
Negative Logits
UV
-0.68
Lann
-0.66
GROUND
-0.66
ł
-0.63
LINE
-0.63
oof
-0.62
¡
-0.62
lands
-0.62
cloth
-0.61
ACTED
-0.61
POSITIVE LOGITS
ities
1.20
ividual
1.15
ized
1.09
istic
1.09
ism
0.98
identifiable
0.98
istically
0.96
istics
0.94
itarian
0.91
isms
0.89
Activations Density 0.507%