INDEX
Explanations
phrases related to emotional states and fluctuations
New Auto-Interp
Negative Logits
escape
-0.15
amage
-0.15
Sweep
-0.14
progressively
-0.14
escape
-0.13
Unlimited
-0.13
ovice
-0.13
Escape
-0.13
pres
-0.13
reeze
-0.13
POSITIVE LOGITS
oscill
0.50
fluct
0.49
fluctuations
0.42
swings
0.41
Osc
0.41
osc
0.38
osc
0.35
cycles
0.33
swing
0.32
altern
0.32
Activations Density 0.358%