INDEX
Explanations
mentions of specific names, possibly related to politics or law
mentions of specific characters and individuals associated with a certain narrative or context
New Auto-Interp
Negative Logits
printed
-0.77
acl
-0.76
geries
-0.70
formed
-0.69
information
-0.68
ocrine
-0.67
ribute
-0.66
cycles
-0.65
hedral
-0.63
ASE
-0.62
POSITIVE LOGITS
Goodman
1.23
Saul
0.93
Berman
0.86
Ö
0.79
Katz
0.77
iflower
0.77
éĹĺ
0.76
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
0.74
Misc
0.74
TING
0.74
Activations Density 0.013%