INDEX
Explanations
words related to exerting influence or control over certain processes or situations
terms related to risk and regulation in various contexts
New Auto-Interp
Negative Logits
avorite
-0.86
RESULTS
-0.75
Origin
-0.66
BACK
-0.64
REE
-0.64
Mayweather
-0.64
HOME
-0.63
GROUND
-0.63
Grail
-0.62
WD
-0.62
POSITIVE LOGITS
izing
1.98
ization
1.94
ized
1.89
ised
1.85
izers
1.83
izations
1.81
ising
1.81
isation
1.72
izes
1.70
izable
1.67
Activations Density 0.051%