INDEX
Explanations
verbs associated with preventing or avoiding negative events or situations
terms related to prevention or averting negative outcomes
New Auto-Interp
Negative Logits
Soda
-0.80
cores
-0.73
Stone
-0.67
Cola
-0.66
Stone
-0.62
ogy
-0.61
atana
-0.61
ONES
-0.61
Candy
-0.61
essee
-0.61
POSITIVE LOGITS
aver
1.09
tle
1.04
ted
0.94
avert
0.89
gets
0.83
blink
0.80
wings
0.79
ting
0.78
uve
0.76
ties
0.75
Activations Density 0.032%