INDEX
Explanations
content referencing a specific popular board game from various sentences in the text data
words that indicate actions or functionalities related to creating or having something
New Auto-Interp
Head Attr Weights
0:0.06
1:0.01
2:0.32
3:0.05
4:0.20
5:0.05
6:0.02
7:0.02
8:0.06
9:0.09
10:0.04
11:0.02
Negative Logits
Monteneg
-1.33
NK
-1.24
reb
-1.19
Lak
-1.18
SW
-1.16
frames
-1.15
arat
-1.15
dw
-1.13
kaya
-1.13
iannopoulos
-1.10
POSITIVE LOGITS
aminer
1.36
OTAL
1.27
CHAT
1.25
ANE
1.20
estial
1.15
eer
1.14
)]
1.14
sticks
1.14
estern
1.14
uliffe
1.13
Activations Density 0.022%