INDEX
Explanations
proper names, specifically the name "Erik"
mentions of specific individuals, particularly those named Erik
Negative Logits
roads        -0.80
Tokens       -0.74
creen        -0.70
matically    -0.68
birth        -0.68
NEY          -0.67
AppData      -0.65
à¨           -0.65
enegger      -0.65
payer        -0.64
Positive Logits
kson         0.98
Erik         0.97
ildo         0.96
Spo          0.89
lund         0.88
Wem          0.87
Hansen       0.86
Bry          0.83
ansson       0.82
odox         0.80
Activations Density: 0.008%