INDEX
Explanations
instances of the term "word" and its variations in various contexts
New Auto-Interp
Negative Logits
myſelf
-1.08
Monfieur
-0.99
himſelf
-0.98
%]
-0.98
pleaſure
-0.98
"]];
-0.97
photolibrary
-0.97
"]]
-0.95
."));
-0.93
يتيمه
-0.93
POSITIVE LOGITS
words
1.66
Words
1.57
Word
1.53
word
1.50
Word
1.46
WORD
1.45
Words
1.39
WORDS
1.38
words
1.31
word
1.30
Activations Density 0.052%