INDEX
Explanations
phrases indicating a specific individual's perspective or actions
instances of the phrase "for its part" or variations of responsibility claims
New Auto-Interp
Negative Logits
itton
-0.78
olesc
-0.71
CENT
-0.70
millenn
-0.69
anth
-0.67
perspect
-0.62
lishes
-0.62
notor
-0.60
slicing
-0.60
conclud
-0.58
POSITIVE LOGITS
sis
0.69
dad
0.69
urers
0.68
urer
0.67
aniel
0.64
ographer
0.62
reads
0.62
played
0.62
oof
0.61
Benz
0.61
Activations Density 0.017%