Explanations
contexts of conflict or tension
words related to provocation or eliciting reactions
Negative Logits
Token        Logit
inance       -0.72
elsen        -0.67
Canad        -0.65
Centauri     -0.61
ainer        -0.60
essors       -0.60
adelphia     -0.58
agger        -0.56
Agric        -0.56
Butterfly    -0.55
Positive Logits
Token        Logit
oking        1.00
oked         0.89
creen        0.80
rue          0.79
otine        0.78
INESS        0.77
laughter     0.76
osi          0.70
ascus        0.69
neum         0.68
Activations Density: 0.028%