INDEX
Explanations
content related to clothing restrictions and specifically the hijab in various contexts
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.06
3:0.07
4:0.06
5:0.03
6:0.04
7:0.07
8:0.04
9:0.06
10:0.40
11:0.07
Negative Logits
ventory
-1.96
NAS
-1.81
ⓘ
-1.80
Ur
-1.72
NES
-1.72
Interstellar
-1.72
Marsh
-1.71
channelAvailability
-1.69
Iter
-1.68
Puzzles
-1.66
POSITIVE LOGITS
veil
2.19
hijab
2.04
ceremony
1.99
citizenship
1.97
constitutionally
1.87
ceremon
1.85
ceremonial
1.85
dehuman
1.83
washing
1.81
modesty
1.77
Activations Density 0.005%