INDEX
Explanations
variations of the word "or" indicating alternatives or choices
New Auto-Interp
Negative Logits
HER
-0.88
ords
-0.78
ETS
-0.77
ocracy
-0.73
plates
-0.72
onomy
-0.72
Tycoon
-0.72
Pony
-0.72
arte
-0.69
ocrats
-0.69
POSITIVE LOGITS
misplaced
1.16
inadequ
1.09
unwanted
1.05
chard
1.04
defective
1.03
malfunction
1.02
inacc
1.02
misunderstood
1.00
worse
1.00
inaccur
0.99
Activations Density 0.070%