INDEX
Explanations
phrases that denote locations or entries in a specific context
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.06
3:0.04
4:0.10
5:0.02
6:0.07
7:0.43
8:0.02
9:0.03
10:0.08
11:0.06
Negative Logits
arrang
-1.53
ér
-1.53
Delivery
-1.52
assurance
-1.41
delivering
-1.40
CLUD
-1.40
OUGH
-1.38
oulder
-1.36
rang
-1.35
inclination
-1.34
POSITIVE LOGITS
pmwiki
1.88
trivia
1.66
entries
1.62
lists
1.61
charts
1.59
Wonderland
1.50
Lists
1.49
otos
1.45
Dictionary
1.43
Encyclopedia
1.43
Activations Density 0.004%