INDEX
Explanations
specific terms and references associated with a prominent public figure
New Auto-Interp
Head Attr Weights
0:0.04
1:0.08
2:0.04
3:0.06
4:0.03
5:0.06
6:0.12
7:0.05
8:0.03
9:0.38
10:0.04
11:0.03
Negative Logits
OGR
-3.62
Glen
-3.23
Magnet
-3.19
Glas
-3.18
ghazi
-3.18
Portal
-3.10
Gateway
-3.02
fres
-3.02
Sal
-3.00
0010
-2.98
POSITIVE LOGITS
48
5.28
49
4.65
1948
4.48
48
4.18
Dow
3.96
49
3.81
47
3.72
1949
3.70
Dar
3.48
1947
3.47
Activations Density 0.002%