INDEX
Explanations
references to conflict or opposition
Head Attr Weights
0:0.03
1:0.03
2:0.08
3:0.05
4:0.11
5:0.04
6:0.08
7:0.27
8:0.03
9:0.04
10:0.08
11:0.10
Negative Logits
frog     -1.69
ocent    -1.66
oser     -1.58
essor    -1.58
onym     -1.58
ousse    -1.49
uben     -1.48
hex      -1.45
bye      -1.43
untu     -1.43
Positive Logits
labor       1.38
theor       1.36
Ide         1.35
dial        1.34
labour      1.32
scarce      1.31
partName    1.27
Infinite    1.26
Printed     1.25
pod         1.25
Activations Density 0.000%