INDEX
Explanations
phrases indicating possession or association of knowledge or topics
New Auto-Interp
Head Attr Weights
0:0.01
1:0.02
2:0.05
3:0.06
4:0.08
5:0.02
6:0.08
7:0.34
8:0.02
9:0.02
10:0.10
11:0.13
Negative Logits
ylum
-1.42
bum
-1.33
elight
-1.31
ello
-1.28
iage
-1.25
ifference
-1.24
isson
-1.24
Piper
-1.21
*/(
-1.21
rate
-1.20
POSITIVE LOGITS
basics
1.70
arcane
1.43
whereabouts
1.43
factual
1.41
fundamentals
1.36
genetics
1.36
STOR
1.33
surroundings
1.31
tymology
1.29
programming
1.28
Activations Density 0.007%