INDEX
Explanations
occurrences of the word "happen" and its variations, indicating events or actions that occur
Head Attr Weights
0:0.13
1:0.14
2:0.03
3:0.05
4:0.03
5:0.21
6:0.05
7:0.06
8:0.09
9:0.05
10:0.06
11:0.06
Negative Logits
antam: -1.58
lean: -1.55
stands: -1.52
Kit: -1.49
Luxem: -1.45
Labour: -1.37
�: -1.34
matic: -1.32
urgently: -1.32
HI: -1.31
Positive Logits
azard: 1.79
lda: 1.72
nce: 1.71
Nanto: 1.71
Blaz: 1.71
rul: 1.69
TextColor: 1.68
rolet: 1.65
ガ: 1.63
ジ: 1.62
Activations Density: 0.001%