INDEX
Explanations
phrases related to prevention or avoiding negative outcomes
phrases related to prevention and protective measures
Negative Logits
geist        -0.78
ammy         -0.78
bard         -0.77
enegger      -0.77
framework    -0.77
spirit       -0.75
sonian       -0.74
bold         -0.73
æ©           -0.71
lene         -0.70
Positive Logits
ative            1.00
regress          0.98
duplication      0.97
accidental       0.95
detection        0.94
accidents        0.93
disasters        0.92
future           0.91
deterioration    0.90
misunderstand    0.90
Activations Density: 0.054%