INDEX
Explanations
expressions or comments emphasizing the intensity of a feeling or action
intense emphasis or affirmations of feelings or states
Negative Logits
heid          -0.77
enary         -0.71
utions        -0.65
heny          -0.63
oire          -0.63
Watch         -0.61
ean           -0.60
Protection    -0.60
iture         -0.59
pige          -0.59
Positive Logits
bothered          0.90
REALLY            0.89
darn              0.84
bother            0.83
bothering         0.81
differentiated    0.76
really            0.74
olkien            0.74
pissed            0.72
needed            0.71
Activations Density: 0.036%