INDEX
Explanations
capital letters followed by certain patterns that spell out a specific word or name
occurrences of the letter 'G'
New Auto-Interp
Negative Logits
Ples
-0.66
lapse
-0.65
ĸļ
-0.65
occupied
-0.65
pale
-0.64
Hyde
-0.63
Mellon
-0.63
Aber
-0.60
Bam
-0.60
retrospect
-0.59
POSITIVE LOGITS
roups
1.46
raphic
1.44
reetings
1.36
reens
1.31
entle
1.30
uild
1.27
rowth
1.26
irlfriend
1.26
erald
1.26
ossip
1.24
Activations Density 0.045%