INDEX
Explanations
references to the name "Lucas" combined with numeric values
mentions of the name "Lucas."
Negative Logits
ãĥĥãĥĪ     -0.73
yer        -0.70
eering     -0.69
req        -0.69
orget      -0.69
lying      -0.69
ifice      -0.67
lain       -0.67
Seym       -0.66
gencies    -0.66
Positive Logits
film        1.30
Film        1.04
Skywalker   0.90
Lucas       0.89
eland       0.80
afort       0.79
ious        0.79
itic        0.76
Hunt        0.76
ifer        0.73
Activations Density: 0.010%