INDEX
Explanations
warnings or alerts
instances of the word "warn" and its variations, indicating caution or advisory statements
New Auto-Interp
Negative Logits
empl
-0.80
anuts
-0.79
examination
-0.78
acquisition
-0.73
productive
-0.71
engeance
-0.70
rafted
-0.69
ractions
-0.68
morph
-0.68
urgy
-0.68
POSITIVE LOGITS
warn
1.09
Warn
1.04
warnings
0.93
warning
0.92
Warning
0.92
warns
0.90
warn
0.88
ingly
0.84
warning
0.80
Msg
0.75
Activations Density 0.005%