INDEX
Explanations
concepts related to emotional states and preparedness
New Auto-Interp
Negative Logits
ozor
-0.20
ially
-0.17
mpar
-0.17
typed
-0.16
urally
-0.16
686
-0.15
istically
-0.15
оÑĢоÑĤ
-0.15
iert
-0.15
atively
-0.15
POSITIVE LOGITS
ness
1.09
NESS
0.74
nes
0.63
ness
0.62
eness
0.60
ess
0.57
iness
0.56
itude
0.50
liness
0.50
nees
0.49
Activations Density 0.087%