INDEX
Explanations
terms indicating size or magnitude in various contexts
New Auto-Interp
Head Attr Weights
0:0.11
1:0.02
2:0.16
3:0.17
4:0.05
5:0.08
6:0.02
7:0.01
8:0.08
9:0.17
10:0.05
11:0.02
Negative Logits
��
-1.45
��
-1.31
ribly
-1.21
Jenkins
-1.17
matically
-1.17
banter
-1.15
Jord
-1.13
Kaufman
-1.11
enko
-1.11
Jacques
-1.11
POSITIVE LOGITS
imaginable
1.44
occup
1.41
iameter
1.39
Union
1.32
(>
1.31
leground
1.26
agu
1.25
largest
1.23
Ever
1.23
abal
1.20
Activations Density 0.043%