INDEX
Explanations
phrases related to negative events or situations
references to negative experiences or outcomes
Negative Logits
ebus        -0.82
sonian      -0.82
verning     -0.77
racuse      -0.76
TAIN        -0.75
aukee       -0.72
erey        -0.72
tesy        -0.72
£ı          -0.72
ilant       -0.72
POSITIVE LOGITS
imaginable  0.89
Karma       0.89
karma       0.88
Worse       0.88
Syndrome    0.88
inflicted   0.87
worse       0.84
plague      0.80
syndrome    0.80
havoc       0.80
Activations Density 0.427%