INDEX
Explanations
proper names, specifically the name "Mark."
mentions of the name "Mark"
New Auto-Interp
Negative Logits
committee
-0.92
cffff
-0.81
milo
-0.81
ibaba
-0.76
pmwiki
-0.76
decomp
-0.75
culus
-0.73
urses
-0.71
bably
-0.70
dayName
-0.69
POSITIVE LOGITS
Twain
1.16
Mark
1.07
Mark
1.00
emark
0.93
mark
0.90
marks
0.88
Zuckerberg
0.84
Marks
0.83
Thom
0.80
Gat
0.77
Activations Density 0.014%