Explanations
numerical identifiers such as case numbers and bill numbers
Head Attribution Weights
Head   Weight
0      0.03
1      0.02
2      0.15
3      0.05
4      0.05
5      0.03
6      0.08
7      0.32
8      0.04
9      0.04
10     0.11
11     0.05
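A minimal sketch for reading the head weights listed above, assuming each value is that head's share of the feature's attribution (the page itself does not define the quantity). The variable names are illustrative and not taken from any tool's API.

```python
# Head attribution weights copied from the listing above (head index -> weight).
# Assumption: each weight is the head's share of attribution for this feature.
head_weights = {
    0: 0.03, 1: 0.02, 2: 0.15, 3: 0.05, 4: 0.05, 5: 0.03,
    6: 0.08, 7: 0.32, 8: 0.04, 9: 0.04, 10: 0.11, 11: 0.05,
}

# Rank heads from largest to smallest weight.
ranked = sorted(head_weights.items(), key=lambda kv: kv[1], reverse=True)
top_head, top_weight = ranked[0]

# The displayed values are rounded, so the total need not be exactly 1.
total = round(sum(head_weights.values()), 2)

print(f"dominant head: {top_head} (weight {top_weight})")
print(f"top three heads: {[h for h, _ in ranked[:3]]}")
print(f"sum of listed weights: {total}")
```

Ranking the heads this way makes it easy to see which one dominates and how much of the total the top few account for.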
Negative Logits
Token         Logit
havoc         -1.82
mist          -1.74
strained      -1.67
impunity      -1.66
intimidated   -1.63
ilities       -1.60
loophole      -1.60
fumes         -1.57
loopholes     -1.57
insin         -1.55
Positive Logits
Token         Logit
MOT           1.72
Kinnikuman    1.72
505           1.70
�醒           1.67
�             1.67
247           1.66
Jaw           1.58
etsu          1.57
ommel         1.54
神            1.53
Activations Density: 0.006%