INDEX
Explanations
mentions of the name "Els" with varying activation strengths
mentions of specific names or terms related to individuals
Negative Logits
Occupations    -0.67
isites         -0.65
Seym           -0.64
--+            -0.64
scribed        -0.63
Augusta        -0.62
ーテ           -0.61
Sakuya         -0.60
Skydragon      -0.60
repe           -0.59
Positive Logits
inki           1.10
pace           1.08
bach           0.99
warm           0.97
ength          0.95
hof            0.95
ink            0.91
kamp           0.91
ounge          0.91
kie            0.90
Activations Density 0.016%
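
Activation density is usually read as the fraction of sampled tokens on which the feature fires at all. A minimal sketch under that assumption follows; the function name, the zero threshold, and the sample data are hypothetical, with the data chosen only so the result reproduces the 0.016% shown above.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation exceeds `threshold` (assumed definition)."""
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: the feature fires on 2 of 12,500 tokens.
acts = [0.0] * 12_498 + [0.7, 1.3]
print(f"{activation_density(acts):.3%}")  # 0.016%
```

A density this low means the feature is silent on nearly every token, which fits an explanation tied to one specific name.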