Explanations
mentions of knowledge or familiarity in a context where information is being shared or referenced
references to prior knowledge or familiarity with specific content
Negative Logits
ickey       -0.67
inals       -0.65
envelope    -0.63
Els         -0.62
jri         -0.62
ella        -0.59
ospel       -0.59
rame        -0.57
ransom      -0.57
subsystem   -0.57
Positive Logits
mentioned        0.82
taboola          0.79
WATCHED          0.74
Ü                0.72
Ago              0.66
ebin             0.66
TPPStreamerBot   0.65
noticed          0.64
kens             0.63
ITNESS           0.62
Activations Density 0.228%