Explanations
phrases expressing skepticism or criticism of societal views and historical narratives
Following personal pronouns
blaming, arrogance, and moralizing
Negative Logits
Token              Logit
ởi                 -0.54
AssemblyCompany    -0.48
lohnt              -0.46
alimentaires       -0.45
chec               -0.45
ända               -0.41
melainkan          -0.41
hôtes              -0.40
nicki              -0.40
lourdes            -0.40
Positive Logits
Token              Logit
disambiguazione    0.84
StructEnd          0.80
rungsseite         0.80
Roskov             0.78
iprot              0.78
whining            0.75
Personendaten      0.75
arrog              0.75
blaming            0.73
whine              0.73
Activations Density: 0.503%