Explanations
phrases indicating being under pressure or scrutiny
Negative Logits
Token          Logit
ifold          -0.16
burst          -0.14
abs            -0.14
plitude        -0.14
Underground    -0.14
ka             -0.14
clap           -0.14
outlets        -0.14
toc            -0.13
izia           -0.13
Positive Logits
Token            Logit
attack           0.29
siege            0.26
fire             0.25
scrutiny         0.24
investigation    0.22
wraps            0.22
sie              0.21
review           0.21
examination      0.20
heavy            0.20
Activations Density: 0.022%