INDEX
Explanations
mentions of the name "Tom" in various contexts
New Auto-Interp
Negative Logits
iaux
-0.18
ABLE
-0.17
able
-0.16
anine
-0.16
vet
-0.15
unning
-0.15
loi
-0.15
uben
-0.15
ifiable
-0.14
AMP
-0.14
POSITIVE LOGITS
Tom
0.25
atoes
0.24
tom
0.22
orrow
0.20
mas
0.20
kins
0.19
asso
0.19
cat
0.19
islav
0.19
asz
0.18
Activations Density 0.018%