INDEX
Explanations
specific names, especially "Liz."
mentions of the name "Liz."
New Auto-Interp
Negative Logits
asar
-0.69
Ħ¢
-0.69
Archdemon
-0.66
restraining
-0.66
eve
-0.64
quartered
-0.63
night
-0.63
day
-0.63
¥ŀ
-0.63
mble
-0.62
POSITIVE LOGITS
Liz
0.98
iverpool
0.85
otte
0.77
Lemon
0.75
Lenin
0.75
MJ
0.74
wei
0.73
Cheney
0.73
enegger
0.72
ibly
0.72
Activations Density 0.021%