Explanations
names or words with the letter 'j' with a strong activation value
occurrences of the letter "j"
Negative Logits
Token       Logit
Lauder      -0.72
dispers     -0.71
rooting     -0.71
prolific    -0.68
Scotia      -0.66
milo        -0.66
Contra      -0.66
outgoing    -0.65
CONCLUS     -0.64
arming      -0.64
Positive Logits
Token       Logit
ournals     1.42
itsu        1.29
ordan       1.24
acket       1.19
unction     1.15
utsu        1.13
ealous      1.09
ij          1.08
unal        1.07
oking       1.05
Activations Density: 0.021%
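
The positive-logit tokens can be checked against the "letter j" explanations with a short script. This is only a sketch: the page does not state how the logits were computed, and the idea that these tokens are continuations of a preceding "j" is an assumption made here for illustration. The helper name `with_leading_j` is hypothetical; the tokens and values are copied from the positive logits listed above.

```python
# Positive-logit tokens and values copied from the page.
positive_logits = [
    ("ournals", 1.42),
    ("itsu", 1.29),
    ("ordan", 1.24),
    ("acket", 1.19),
    ("unction", 1.15),
    ("utsu", 1.13),
    ("ealous", 1.09),
    ("ij", 1.08),
    ("unal", 1.07),
    ("oking", 1.05),
]


def with_leading_j(token: str) -> str:
    """Hypothetical helper: the string formed when `token` follows a 'j'."""
    return "j" + token


# Assumption: each promoted token is read as the continuation of a "j".
completions = [(tok, with_leading_j(tok), val) for tok, val in positive_logits]

for tok, joined, val in completions:
    print(f"{tok:<8} -> {joined:<9} {val:.2f}")
```

Under that assumption, several of the joined strings are recognizable ("journals", "jacket", "junction", "jealous", "joking"), which fits the explanations, while others such as "junal" are not obviously words, so the reading should be treated as suggestive rather than confirmed.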