INDEX
Explanations
phrases where someone is being referred to or referenced
references or mentions of individuals or groups in a context
New Auto-Interp
Negative Logits
iem
-0.72
herer
-0.72
pite
-0.71
depended
-0.66
exploited
-0.66
oval
-0.64
ernel
-0.64
relies
-0.63
works
-0.61
olitics
-0.61
POSITIVE LOGITS
gars
0.78
phrases
0.76
ript
0.73
PsyNetMessage
0.70
çīĪ
0.70
phrase
0.68
Ô
0.63
è£ı
0.62
deaf
0.62
Topic
0.61
Activations Density 0.057%