INDEX
Explanations
phrases related to legal and political contexts, as well as behaviors emphasizing individual interests over collective interests
periods at the end of sentences
New Auto-Interp
Negative Logits
ikuman
-0.74
tyr
-0.74
uly
-0.73
unstoppable
-0.71
metic
-0.70
deity
-0.69
gobl
-0.69
imperson
-0.69
oshenko
-0.68
purse
-0.68
POSITIVE LOGITS
Flavoring
1.11
Additionally
1.10
Needless
1.10
Nevertheless
1.09
Moreover
1.07
However
1.06
Furthermore
1.06
Accordingly
1.06
Conversely
1.06
Therefore
1.06
Activations Density 1.776%