INDEX
Explanations
phrases related to prioritizing personal interests above collective interests
punctuation, specifically commas
New Auto-Interp
Negative Logits
ICAN
-0.75
Springer
-0.70
inqu
-0.69
Bey
-0.63
IFA
-0.62
Die
-0.59
Viol
-0.58
REE
-0.58
ALK
-0.57
kinderg
-0.57
POSITIVE LOGITS
poral
0.77
³³³³
0.70
stead
0.70
esthesia
0.69
ombo
0.68
chrome
0.67
quad
0.66
iosyncr
0.65
unda
0.65
nir
0.64
Activations Density 0.000%