INDEX
Explanations
mentions of academic disciplines and fields of study, particularly focusing on philosophy
occurrences of the word "philosophy" and related terms in the context of academic discussions
Negative Logits
esty        -0.81
女          -0.79
ilee        -0.78
elight      -0.76
ells        -0.74
ookie       -0.73
ded         -0.73
bring       -0.71
ilant       -0.70
kefeller    -0.70
Positive Logits
ophical         1.34
ophers          1.00
ophy            0.92
philosopher     0.89
philosophers    0.89
opher           0.87
otle            0.85
ophe            0.85
lectic          0.82
philosophy      0.80
Activations Density 0.034%
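
The density figure can be read as the share of sampled tokens on which the feature fires at all. The sketch below is a minimal illustration under that assumption; the function name `activation_density`, the zero threshold, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of tokens with a nonzero
    activation, expressed as a percentage.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 10,000 tokens, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 2.7]
print(f"{activation_density(sample):.3f}%")  # prints "0.030%"
```

A density as low as 0.034% would mean the feature is active on roughly 3 or 4 tokens out of every 10,000, which fits a feature tied to a narrow vocabulary such as "philosophy" and its subword pieces.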