INDEX
Explanations
phrases mentioning the concept of modern society or technology
references to modernity
New Auto-Interp
Negative Logits
Bone
-0.85
kick
-0.82
REDACTED
-0.77
vana
-0.73
cius
-0.73
OTH
-0.71
ANY
-0.70
ï¸ı
-0.70
Jar
-0.67
Keys
-0.66
POSITIVE LOGITS
isation
1.16
ity
1.03
ization
0.98
izations
0.96
incarnation
0.94
isers
0.93
izing
0.91
era
0.88
ising
0.86
itized
0.85
Activations Density 0.025%