Explanations
mentions of specific names, particularly "Malkin" and "Morgan" in various contexts
the name "Malkin" and its variants, indicating a focus on specific individuals
Negative Logits
Token       Logit
ãĤ§         -0.74
ãĤ·ãĥ£      -0.73
BOOK        -0.71
âĸ¬âĸ¬      -0.70
cess        -0.69
PRES        -0.68
ciples      -0.66
ãĥį         -0.66
ffee        -0.64
BS          -0.63
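Several tokens in the negative-logit list (for example ãĤ§ and âĸ¬âĸ¬) look like raw vocabulary strings from a byte-level BPE tokenizer rather than readable text. The sketch below assumes the GPT-2 style byte-to-character mapping, which the dashboard itself does not confirm, and inverts it to recover the underlying UTF-8 text. The function names are hypothetical and chosen only for illustration.

```python
def byte_decoder():
    """Build the inverse of the GPT-2 style byte-to-character table.

    Assumption: the tokens come from a GPT-2 style byte-level BPE
    vocabulary. The source dashboard does not state which tokenizer is used.
    """
    # Bytes that the scheme keeps as the character with the same code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    mapping = {chr(b): b for b in printable}
    # Every other byte is assigned code point 256, 257, ... in byte order.
    offset = 0
    for b in range(256):
        if b not in printable:
            mapping[chr(256 + offset)] = b
            offset += 1
    return mapping


def decode_token(token):
    """Turn a vocabulary string back into readable text (hypothetical helper)."""
    table = byte_decoder()
    raw = bytes(table[ch] for ch in token)
    # Partial multi-byte sequences are shown as U+FFFD instead of raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ§", "ãĤ·ãĥ£", "âĸ¬âĸ¬", "ãĥį", "BOOK"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If the assumption holds, the four non-ASCII entries decode to the katakana fragments ェ, シャ and ネ and to the box-drawing run ▬▬, while plain ASCII tokens such as BOOK are unchanged.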
Positive Logits
Token       Logit
Malk        1.19
sburgh      0.98
ovich       0.93
atform      0.82
ules        0.81
mus         0.80
hin         0.80
rils        0.80
ers         0.77
ername      0.76
Activations Density 0.017%