Explanations
mentions of disturbances or things that are disturbing
words and phrases related to disturbance or distress
Negative Logits
uay         -0.82
ardi        -0.79
hler        -0.74
arde        -0.74
tin         -0.73
á           -0.73
veland      -0.72
elsen       -0.71
HER         -0.70
cellence    -0.70
Positive Logits
ingly            0.89
Trend            0.67
signals          0.66
behaviour        0.66
Enlightenment    0.66
behaviours       0.64
Territories      0.62
Geo              0.61
behaviors        0.60
neighbours       0.60
Activations Density: 0.064%