Explanations
references to the online imageboard website "4chan"
references to the online community 4chan
Negative Logits
Token      Logit
ences      -0.81
Luxem      -0.71
ENC        -0.70
COUR       -0.70
Ferr       -0.68
Mandela    -0.66
Mead       -0.65
Oilers     -0.65
Moons      -0.63
Jarrett    -0.62
Positive Logits
Token      Logit
chan       1.28
nel        0.95
icum       0.90
bara       0.87
esthetic   0.86
crew       0.85
alyst      0.85
thood      0.84
nels       0.84
ters       0.83
Activations Density: 0.012%