INDEX
Explanations
words related to non-conformity and reform
words related to nonconformity and structure
Negative Logits
Token       Logit
bered       -0.72
paras       -0.69
[+          -0.65
dread       -0.65
brow        -0.60
po          -0.60
acqu        -0.60
rises       -0.59
cruising    -0.59
harbour     -0.58
Positive Logits
Token       Logit
idable      1.22
aldehyde    1.08
atted       1.05
ative       0.95
ational     0.95
formed      0.94
ulations    0.92
ifix        0.90
ations      0.86
ulate       0.86
Activations Density: 0.015%