INDEX
Explanations
action words related to consumption and habits
phrases relating to caution and responsible behavior
New Auto-Interp
Negative Logits
Wellington
-0.72
honoured
-0.71
hemisphere
-0.61
apologised
-0.61
realised
-0.61
Cohn
-0.60
recognised
-0.60
Hercules
-0.59
Nasa
-0.59
Jupiter
-0.57
POSITIVE LOGITS
âĢ
1.81
[/
1.64
</
1.61
ãĢ
1.57
»
1.54
âľ
1.52
>>>>
1.50
***
1.48
«
1.47
âĺ
1.46
Activations Density 1.762%