INDEX
Explanations
references to specific political figures and events related to Brexit negotiations
Head Attr Weights
0:0.21
1:0.03
2:0.12
3:0.10
4:0.04
5:0.03
6:0.02
7:0.01
8:0.14
9:0.07
10:0.04
11:0.13
Negative Logits
rep     -1.56
660     -1.49
760     -1.47
(<      -1.41
awaru   -1.39
eton    -1.37
VIS     -1.36
spons   -1.35
Tenn    -1.34
rouse   -1.33
Positive Logits
magnets     1.57
molecules   1.50
TAG         1.49
esters      1.47
yang        1.47
neurons     1.45
ドラゴン    1.45
algorithms  1.43
サ          1.41
-|          1.39
Activations Density 0.000%