INDEX
Explanations
adjectives or verbs related to intense emotions
terms related to deep emotional experiences and crises
New Auto-Interp
Negative Logits
PDATED
-0.70
kees
-0.69
GS
-0.68
sell
-0.66
ggles
-0.65
CLASS
-0.64
NUM
-0.64
saf
-0.64
OWN
-0.64
moderators
-0.63
POSITIVE LOGITS
depths
1.13
boiling
1.04
molten
1.00
fumes
0.97
fluids
0.97
liquids
0.97
saline
0.95
corros
0.94
sewage
0.93
reservoir
0.91
Activations Density 0.644%