INDEX
Explanations
expressions of frustration and feelings of inadequacy
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.13
3:0.12
4:0.08
5:0.06
6:0.03
7:0.04
8:0.13
9:0.10
10:0.12
11:0.10
Negative Logits
sidx
-1.34
Alphabet
-1.33
Variant
-1.24
subsidiaries
-1.20
appell
-1.19
Amen
-1.15
Mub
-1.15
Schro
-1.13
apiece
-1.08
quartered
-1.08
POSITIVE LOGITS
:(
1.47
wasted
1.44
anyways
1.38
anyway
1.38
wasting
1.37
haha
1.29
didnt
1.29
didn
1.28
bothering
1.19
stupid
1.18
Activations Density 0.512%