INDEX
Explanations
phrases indicating a strong warning or caution
instances of the word "warn" or its variations
New Auto-Interp
Negative Logits
ota
-0.68
Esports
-0.65
transc
-0.60
participation
-0.60
taking
-0.59
Bou
-0.57
embodied
-0.57
acquired
-0.57
RG
-0.56
Participant
-0.56
POSITIVE LOGITS
warn
3.76
warns
2.19
warning
1.99
warn
1.99
warnings
1.91
warned
1.88
Warn
1.85
warning
1.64
caution
1.60
WARN
1.55
Activations Density 0.014%