INDEX
Explanations
phrases related to recommendations or suggestions
New Auto-Interp
Head Attr Weights
0:0.07
1:0.10
2:0.05
3:0.06
4:0.02
5:0.10
6:0.08
7:0.09
8:0.06
9:0.09
10:0.16
11:0.07
Negative Logits
olitics
-1.65
ghazi
-1.32
ylum
-1.24
Hussein
-1.23
mosques
-1.22
sectarian
-1.20
politics
-1.20
Muslims
-1.19
judicial
-1.19
-1.17
POSITIVE LOGITS
usability
1.42
backend
1.34
optimization
1.31
homebrew
1.30
setup
1.29
Setup
1.29
testers
1.28
reusable
1.28
Availability
1.27
plugin
1.27
Activations Density 0.719%