INDEX
Explanations
mentions of the word "Mort" with varying activations
references to the name "Mort" and its variations in different contexts
New Auto-Interp
Negative Logits
manship
-0.76
DragonMagazine
-0.72
Founders
-0.71
Ô
-0.69
FFFF
-0.69
âĻ¥
-0.68
lihood
-0.68
KEY
-0.67
Ethiopia
-0.67
Somali
-0.66
POSITIVE LOGITS
mort
1.28
imer
1.06
gage
0.99
ician
0.94
mort
0.93
icia
0.92
heast
0.92
uary
0.92
ipop
0.89
arily
0.88
Activations Density 0.010%