INDEX
Explanations
references to prominent families associated with conspiracy theories
Negative Logits
gap         -0.79
alities     -0.76
opl         -0.72
odic        -0.71
binary      -0.69
ÑĮ          -0.68
anooga      -0.68
ary         -0.67
experien    -0.67
usterity    -0.66
Positive Logits
kefeller       0.89
Brothers       0.86
Rockefeller    0.85
enthal         0.84
Estate         0.81
Institution    0.79
iets           0.77
Plaza          0.75
Syndicate      0.74
dynasty        0.72
Activations Density: 0.002%