INDEX
Explanations
the term "word" in various contexts
New Auto-Interp
Negative Logits
myſelf
-1.18
pleaſure
-1.15
photolibrary
-1.12
NUMX
-1.10
himſelf
-1.07
%]
-1.06
chofe
-1.04
Monfieur
-1.03
poffe
-1.03
bibfield
-1.01
POSITIVE LOGITS
Word
2.05
word
2.05
Word
2.01
WORD
1.94
word
1.87
words
1.74
Words
1.64
WORD
1.61
Words
1.49
words
1.49
Activations Density 0.038%