INDEX
Explanations
phrases related to interviews or conversations with individuals
instances of the word "spoke" or its variations indicating communication
New Auto-Interp
Negative Logits
eers
-0.67
ilts
-0.61
ILCS
-0.60
inhab
-0.59
isal
-0.58
obe
-0.57
————
-0.56
herent
-0.56
eer
-0.56
misplaced
-0.55
POSITIVE LOGITS
extensively
0.98
glow
0.74
about
0.74
frankly
0.71
briefly
0.69
anonymously
0.68
bout
0.65
exclusively
0.64
bitcoin
0.63
ABOUT
0.63
Activations Density 0.074%