INDEX
Explanations
references to philosophical concepts and figures
Negative Logits
pesan        -0.07
propri       -0.06
ushman       -0.06
ãĥ³ãĥĶ       -0.06
Grape        -0.06
subur        -0.06
EventData    -0.06
pros         -0.06
Matth        -0.06
rocket       -0.06
Positive Logits
philosopher      0.09
Philosophy       0.08
philosophy       0.08
philosophers     0.08
ÙģÙĦس           0.08
Umb              0.08
philosophical    0.07
ÏĨι              0.07
phil             0.07
osopher          0.07
Activations Density 0.306%
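
Several entries in the logit lists above (for example ãĥ³ãĥĶ, ÙģÙĦس and ÏĨι) look like raw vocabulary strings from a byte-level BPE tokenizer rather than readable text. The sketch below shows one way to decode them, assuming the model uses the GPT-2 style byte-to-character table; the page itself does not name the tokenizer, so that is an assumption, and the names `byte_to_unicode_table` and `decode_token` are hypothetical helpers introduced only for this illustration.

```python
def byte_to_unicode_table():
    """Build the byte -> printable-character table of GPT-2 style byte-level BPE.

    Assumption: the dashboard's tokenizer follows this convention.
    """
    # Bytes that are already printable keep their own code point.
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    table = {b: chr(b) for b in keep}
    # Every remaining byte is assigned the next free code point from U+0100 upward.
    shift = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + shift)
            shift += 1
    return table


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_to_unicode_table().items()}


def decode_token(token: str) -> str:
    """Turn a displayed vocabulary string back into readable text."""
    raw = bytearray()
    for ch in token:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            # Character is outside the table (already decoded text): keep it as-is.
            raw.extend(ch.encode("utf-8"))
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for token in ["ãĥ³ãĥĶ", "ÙģÙĦس", "ÏĨι", "philosopher"]:
        print(repr(token), "->", repr(decode_token(token)))
```

Under that assumption, ÙģÙĦس decodes to the Arabic prefix فلس and ÏĨι to the Greek prefix φι, both of which begin the word for "philosophy" in their language, which would be consistent with the explanation given for this feature. Plain ASCII tokens such as `philosopher` pass through unchanged.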