INDEX
Explanations
occurrences of the word "ever" and its variations
New Auto-Interp
Negative Logits
spar
-0.18
t
-0.18
bens
-0.17
ulary
-0.17
elic
-0.15
876
-0.15
lights
-0.15
ãĥ³ãĤ¸
-0.15
alic
-0.15
quo
-0.15
POSITIVE LOGITS
theless
0.23
querque
0.21
ton
0.20
ness
0.19
šek
0.18
iges
0.18
izon
0.16
loh
0.16
iggs
0.16
nia
0.16
Activations Density 0.023%