INDEX
Explanations
phrases related to cultural phenomena, particularly trends and shifts in societal norms
New Auto-Interp
Head Attr Weights
0:0.06
1:0.04
2:0.03
3:0.04
4:0.06
5:0.21
6:0.21
7:0.01
8:0.16
9:0.06
10:0.04
11:0.02
Negative Logits
assetsadobe
-1.56
CAST
-1.56
conom
-1.40
ioxide
-1.37
arial
-1.36
krit
-1.34
20439
-1.32
guiActiveUn
-1.27
vironment
-1.25
itsch
-1.25
POSITIVE LOGITS
especially
1.65
amiya
1.46
edi
1.46
�
1.44
inis
1.42
�
1.31
ods
1.30
iola
1.28
��
1.28
Dante
1.28
Activations Density 0.117%