INDEX
Explanations
references to ideological critiques, particularly in the context of communism and political philosophy
New Auto-Interp
Negative Logits
azzi
-0.16
VC
-0.14
Mong
-0.14
altru
-0.14
æħ
-0.14
аÑĢам
-0.14
Mathematic
-0.14
enville
-0.13
_dyn
-0.13
ÏĦιÏĤ
-0.13
POSITIVE LOGITS
Critical
0.32
critical
0.30
critical
0.27
Gram
0.25
Critical
0.25
Frankfurt
0.25
Lac
0.23
Hab
0.23
Structural
0.21
_critical
0.20
Activations Density 0.050%