INDEX
Explanations
terms associated with counterarguments and related discussions
Negative Logits
Token       Logit
cents       -0.15
ency        -0.15
опаÑģ       -0.14
overrides   -0.14
ubar        -0.14
ENCY        -0.13
kee         -0.13
MBOL        -0.13
raquo       -0.13
_URI        -0.13
Positive Logits
Token       Logit
act         0.26
mand        0.24
productive  0.23
measure     0.22
acted       0.22
balance     0.22
acting      0.21
.Counter    0.21
party       0.20
attack      0.19
Activations Density: 0.013%