INDEX
Explanations
the word "ey" with varying activation values
occurrences of the substring "ey" in text
New Auto-Interp
Negative Logits
mint
-0.77
ACTED
-0.75
istani
-0.73
dayName
-0.71
atoon
-0.70
女
-0.70
ãĥķãĤ©
-0.64
CPC
-0.63
IFA
-0.63
å°Ĩ
-0.62
POSITIVE LOGITS
ey
1.04
ewitness
0.95
ield
0.89
esy
0.88
oshi
0.87
terness
0.86
estinal
0.86
ipel
0.82
er
0.80
outube
0.79
Activations Density 0.009%