INDEX
Explanations
the name "Winston," particularly in the context of discussions surrounding themes of rebellion and totalitarianism
New Auto-Interp
Negative Logits
idge
-0.16
bsd
-0.16
berra
-0.16
ACA
-0.15
ãĥ¼ãĥĢ
-0.15
iles
-0.14
.wall
-0.14
ubbo
-0.14
/pub
-0.14
aiser
-0.14
POSITIVE LOGITS
anse
0.15
ayne
0.14
rops
0.14
Unc
0.14
orld
0.14
arty
0.14
unc
0.14
nom
0.14
vest
0.14
клÑİÑĩ
0.13
Activations Density 0.003%