INDEX
Explanations
mentions of a particular person's name with "ith" at the end
the recurring mention of a specific character or subject in historical contexts
New Auto-Interp
Negative Logits
¥µ
-0.69
tails
-0.67
Canaver
-0.65
detail
-0.64
Shelter
-0.64
patch
-0.63
berman
-0.61
lucky
-0.60
pees
-0.60
PDATE
-0.59
POSITIVE LOGITS
otle
0.99
yll
0.96
reading
0.85
rones
0.81
iasis
0.81
iop
0.81
ith
0.78
ium
0.78
ttp
0.75
ofer
0.74
Activations Density 0.013%