INDEX
Explanations
keywords related to political events and statements
instances of the word "for"
Head Attr Weights
0:0.07
1:0.08
2:0.08
3:0.07
4:0.07
5:0.08
6:0.07
7:0.07
8:0.07
9:0.09
10:0.09
11:0.09
Negative Logits
cknowled      -1.41
Relative      -1.34
heet          -1.32
imar          -1.30
levied        -1.29
Fif           -1.29
Rel           -1.28
hailed        -1.24
Amendments    -1.23
Summers       -1.23
Positive Logits
VK            1.61
═             1.59
Kik           1.59
tremend       1.56
Offline       1.54
tsy           1.52
Shard         1.49
sync          1.47
●             1.45
avatar        1.45
Activations Density 0.000%