INDEX
Explanations
terminology related to data processing and analysis
New Auto-Interp
Head Attr Weights
0:0.05
1:0.03
2:0.28
3:0.04
4:0.11
5:0.09
6:0.03
7:0.02
8:0.09
9:0.13
10:0.05
11:0.02
Negative Logits
theless
-1.65
\\\\\\\\
-1.35
liest
-1.15
DeL
-1.14
berto
-1.10
Chao
-1.10
Laos
-1.08
Mecca
-1.08
Rockefeller
-1.07
workers
-1.06
POSITIVE LOGITS
arnaev
1.64
hare
1.59
xual
1.47
merce
1.45
heet
1.45
pload
1.40
orrow
1.39
eanor
1.39
ucker
1.38
arkin
1.31
Activations Density 0.009%