INDEX
Explanations
proper names - specifically the first name "Dan."
the presence and repetition of the name "Dan" in various contexts
New Auto-Interp
Negative Logits
margins
-0.72
llor
-0.68
conspicuous
-0.66
CLASSIFIED
-0.66
culminating
-0.65
assurance
-0.65
Reloaded
-0.64
outgoing
-0.64
symmetry
-0.63
mandatory
-0.63
POSITIVE LOGITS
zig
1.18
forth
1.12
ube
1.01
olver
1.01
coni
0.98
vier
0.94
Marino
0.93
alog
0.90
isco
0.89
emark
0.89
Activations Density 0.017%