INDEX
Explanations
proper nouns or names
key figures and names associated with philosophical arguments or critiques
Negative Logits
»Ĵ            -0.71
Apex          -0.67
ĸļ            -0.65
fetch         -0.65
å§«           -0.64
ngth          -0.64
relocate      -0.64
acion         -0.63
lookout       -0.62
pinch         -0.62
Positive Logits
philosophers  0.87
omsky         0.87
Krugman       0.82
Feminist      0.81
Chomsky       0.78
galitarian    0.77
argues        0.76
theorists     0.74
otle          0.74
proponents    0.73
Activations Density: 0.569%