INDEX
Explanations
occurrences of the word "influencer" and its variations
Negative Logits
antee     -0.17
arily     -0.15
cky       -0.15
dz        -0.15
gado      -0.15
lessly    -0.15
ÛĮات      -0.15
ally      -0.14
alan      -0.14
Chew      -0.14
Positive Logits
encer      0.43
ential     0.34
entials    0.32
encers     0.30
enced      0.29
encing     0.28
entially   0.27
encial     0.25
enc        0.24
ENTIAL     0.23
Activations Density 0.004%
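
To give a sense of scale for the density figure, here is a minimal sketch. It assumes "Activations Density" means the percentage of sampled tokens on which the feature has a nonzero activation; the page itself does not define the term. The function name `expected_activations` and the one-million-token sample size are hypothetical, chosen only for illustration.

```python
def expected_activations(density_percent: float, num_tokens: int) -> float:
    """Expected number of tokens on which the feature fires.

    Assumption: density is the percentage of tokens with a nonzero
    activation. The source page does not state this definition.
    """
    return density_percent / 100.0 * num_tokens


# Density value taken from the page; the sample size is illustrative.
density_percent = 0.004
sample_size = 1_000_000

print(round(expected_activations(density_percent, sample_size)))
```

Under that assumption, a density of 0.004% corresponds to roughly 40 activating tokens per million, which is what one would expect of a feature tied to a single word family.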