INDEX
Explanations
instances of the word "even."
the recurring term "even" in various contexts
New Auto-Interp
Negative Logits
aim
-0.81
plex
-0.76
othy
-0.75
rend
-0.71
ceiver
-0.70
idelines
-0.68
IGHT
-0.68
cent
-0.66
chn
-0.65
ower
-0.64
POSITIVE LOGITS
remotely
1.03
outright
0.78
worse
0.72
romeda
0.67
tho
0.66
sshd
0.65
though
0.63
TAMADRA
0.63
stranger
0.62
pret
0.60
Activations Density 0.040%