INDEX
Explanations
adjectives with negative connotations
references to biased perspectives and dimensions in discussions or arguments
New Auto-Interp
Negative Logits
osponsors
-0.87
ombs
-0.85
KEN
-0.84
gone
-0.80
eeks
-0.76
iques
-0.73
heon
-0.72
worthy
-0.72
TON
-0.71
KER
-0.71
POSITIVE LOGITS
sided
0.86
imensional
0.81
adoes
0.76
ide
0.67
differe
0.67
ness
0.67
vantage
0.66
Yamato
0.65
plank
0.64
"{0.63
Activations Density 0.030%