INDEX
Explanations
references to social issues and community challenges
Head Attr Weights
0:0.09
1:0.02
2:0.48
3:0.08
4:0.02
5:0.06
6:0.02
7:0.05
8:0.03
9:0.03
10:0.04
11:0.01
Negative Logits
Oath        -2.80
earned      -2.54
ftime       -2.49
ceremonies  -2.49
celebr      -2.48
emetery     -2.48
symbolism   -2.47
ceremonial  -2.46
ceremony    -2.43
oath        -2.42
Positive Logits
Solution   5.62
solutions  4.99
fix        4.95
Problem    4.87
solved     4.87
solve      4.70
fixes      4.68
Solution   4.59
solution   4.59
Fix        4.58
Activations Density 0.690%