INDEX
Explanations
verbs indicating movement or change
New Auto-Interp
Head Attr Weights
0:0.11
1:0.16
2:0.09
3:0.04
4:0.05
5:0.12
6:0.07
7:0.02
8:0.13
9:0.06
10:0.04
11:0.06
Negative Logits
idel
-1.73
collections
-1.65
icion
-1.59
clerks
-1.56
libraries
-1.53
reception
-1.51
press
-1.51
clipboard
-1.50
catalog
-1.49
itance
-1.44
POSITIVE LOGITS
ω
2.01
═
1.98
Freak
1.95
dstg
1.81
++)
1.81
GGGG
1.78
α
1.76
ENA
1.69
γ
1.68
Meow
1.64
Activations Density 0.001%