INDEX
Explanations
pronouns and verbs indicating possession or action
mentions of "they" and "he", indicating a focus on group dynamics and individual roles
Negative Logits
™:      -0.72
®       -0.68
ienne   -0.66
Occ     -0.63
Amid    -0.63
DAY     -0.63
Sao     -0.60
aii     -0.60
dstg    -0.59
tains   -0.56
Positive Logits
mathemat  0.93
're       0.87
've       0.83
'll       0.83
glim      0.81
ain       0.79
blat      0.77
gotta     0.75
contrace  0.74
incent    0.72
Activations Density 0.321%