INDEX
Explanations
names, specifically the name "Stefan" with varying levels of similarity
mentions of the name "Stefan."
New Auto-Interp
Negative Logits
BACK
-0.75
nces
-0.69
rights
-0.66
LOAD
-0.63
ptive
-0.63
lled
-0.61
REE
-0.61
uably
-0.60
inals
-0.59
ptives
-0.59
POSITIVE LOGITS
Stefan
1.05
stadt
1.01
ovic
0.92
Stef
0.77
etti
0.76
apo
0.75
acci
0.74
Rah
0.74
ovich
0.74
Matte
0.71
Activations Density 0.006%