INDEX
Explanations
words ending in 'es' with high activations
occurrences of the suffix "es" at the end of words
New Auto-Interp
Negative Logits
è»
-0.74
iage
-0.73
ItemTracker
-0.70
Reviewer
-0.70
amental
-0.69
GAN
-0.66
allery
-0.66
=-=-=-=-
-0.65
staff
-0.65
ONSORED
-0.64
POSITIVE LOGITS
terday
1.27
ktop
1.22
peed
1.09
andro
1.08
earch
1.08
bians
1.06
pec
0.94
earchers
0.94
ury
0.90
leep
0.89
Activations Density 0.042%