INDEX
Explanations
phrases related to briefings and updates
Head Attr Weights
0:0.01
1:0.01
2:0.04
3:0.06
4:0.11
5:0.04
6:0.04
7:0.37
8:0.03
9:0.03
10:0.10
11:0.11
Negative Logits
ucket       -1.78
Nguyen      -1.54
reenshots   -1.43
amy         -1.40
vable       -1.37
ipel        -1.36
Abbey       -1.34
overlap     -1.31
uably       -1.30
overboard   -1.30
Positive Logits
Morning         1.54
confidential    1.52
beforehand      1.50
prepare         1.47
briefings       1.46
rehensive       1.45
acquaint        1.42
deliberations   1.40
advising        1.40
elligence       1.38
Activations Density 0.005%