INDEX
Explanations
indicators of emphasis or important points in the text
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.06
3:0.08
4:0.04
5:0.04
6:0.14
7:0.02
8:0.05
9:0.05
10:0.17
11:0.24
Negative Logits
vals
-1.61
adjust
-1.59
accuracy
-1.54
LOD
-1.53
Quadro
-1.48
triangles
-1.47
bounds
-1.43
ternity
-1.40
entity
-1.33
approximation
-1.32
POSITIVE LOGITS
etsk
1.81
oyer
1.60
)]
1.57
ublic
1.51
cong
1.51
.")
1.47
opian
1.46
atmosphere
1.45
Belg
1.44
]."
1.43
Activations Density 0.001%