INDEX
Explanations
phrases related to precautionary measures and readiness
New Auto-Interp
Negative Logits
ALSE
-0.72
edly
-0.72
innoc
-0.70
ugu
-0.67
etic
-0.66
phans
-0.64
advertising
-0.64
ihad
-0.63
sonian
-0.62
iphate
-0.61
POSITIVE LOGITS
lieu
1.27
order
1.15
anticipation
1.14
favour
1.13
accordance
1.08
favor
1.03
effic
1.03
preparation
0.99
case
0.99
regards
0.95
Activations Density 0.326%