INDEX
Explanations
references to deck-building mechanics in card games
Head Attr Weights
0:0.02
1:0.02
2:0.03
3:0.07
4:0.04
5:0.04
6:0.02
7:0.04
8:0.02
9:0.09
10:0.40
11:0.15
Negative Logits
answ:-1.42
deaf:-1.40
seism:-1.37
talk:-1.37
CCTV:-1.37
whistlebl:-1.36
whistleblowers:-1.34
Hearing:-1.34
DMCA:-1.33
screenings:-1.32
Positive Logits
ixture:1.38
erker:1.36
atta:1.33
pure:1.32
cruc:1.29
ophy:1.27
hybrid:1.27
repertoire:1.26
variants:1.26
inav:1.25
Activations Density 0.059%