INDEX
Explanations
occurrences of specific nouns and phrases in a document
New Auto-Interp
Head Attr Weights
0:0.05
1:0.09
2:0.03
3:0.10
4:0.08
5:0.05
6:0.04
7:0.34
8:0.03
9:0.04
10:0.05
11:0.04
Negative Logits
itans
-3.02
ega
-2.69
■
-2.28
unes
-2.19
lic
-2.19
Benz
-2.15
[/
-2.15
gs
-2.13
eva
-2.13
olith
-2.13
POSITIVE LOGITS
second
5.08
second
4.40
Second
4.25
Second
3.93
secondly
3.84
Secondly
3.56
subsequent
3.46
Secondly
3.12
third
2.91
fourth
2.79
Activations Density 0.021%