INDEX
Explanations
words related to strong negative emotions or situations
words associated with fear or intimidation
New Auto-Interp
Negative Logits
á
-0.81
elman
-0.80
ublic
-0.80
uties
-0.76
adr
-0.74
eva
-0.73
ritis
-0.73
bers
-0.72
elf
-0.72
glas
-0.71
POSITIVE LOGITS
ly
1.16
unbeliev
0.92
NESS
0.83
ingly
0.79
reptiles
0.79
ively
0.77
LY
0.76
ously
0.74
heights
0.73
mares
0.72
Activations Density 0.016%