INDEX
Explanations
occurrences of common nouns and pronouns
New Auto-Interp
Head Attr Weights
0:0.12
1:0.37
2:0.03
3:0.04
4:0.03
5:0.13
6:0.03
7:0.02
8:0.04
9:0.05
10:0.04
11:0.04
Negative Logits
agger
-1.68
igan
-1.61
igans
-1.60
weather
-1.59
................
-1.57
iam
-1.53
izers
-1.53
fy
-1.53
dust
-1.52
otions
-1.51
POSITIVE LOGITS
first
1.89
second
1.82
first
1.80
isEnabled
1.79
グ
1.72
commencement
1.70
Redditor
1.64
third
1.63
secondly
1.59
eighteenth
1.58
Activations Density 0.009%