INDEX
Explanations
philosophical discussions and debates
phrases related to philosophical inquiries and argumentation
Negative Logits
neighb          -0.68
Cele            -0.68
fired           -0.68
venge           -0.65
Trophy          -0.63
interstitial    -0.62
festivities     -0.62
escorted        -0.62
emetery         -0.60
commemor        -0.60
Positive Logits
empirical       1.11
Suppose         1.05
philosophers    1.00
analogy         1.00
methodological  0.96
empir           0.96
argues          0.93
Chomsky         0.92
Argument        0.90
causation       0.87
Activations Density: 1.081%