INDEX
Explanations
words related to mentioning or references in a text
instances of the word "mention" and its variations
New Auto-Interp
Negative Logits
¯¯¯¯¯¯¯¯
-0.94
¯¯
-0.73
¯¯¯¯
-0.69
zers
-0.69
sett
-0.68
uilt
-0.68
insula
-0.67
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.67
orneys
-0.67
heric
-0.67
POSITIVE LOGITS
mentions
0.90
mentioning
0.90
Kislyak
0.76
lihood
0.75
enance
0.74
utra
0.70
mention
0.67
aloud
0.67
pages
0.65
indexes
0.64
Activations Density 0.031%