INDEX
Explanations
words related to philosophy and academic discourse
terms related to philosophy and philosophical concepts
Negative Logits
raviolet    -0.87
linger      -0.77
ngth        -0.77
enegger     -0.76
lished      -0.75
orer        -0.74
ierrez      -0.73
kef         -0.72
favorite    -0.71
risome      -0.71
Positive Logits
phia        0.75
Philipp     0.73
士          0.73
ans         0.72
us          0.71
ians        0.71
thous       0.69
Commun      0.68
etus        0.68
Ĺ           0.67
Activations Density: 0.144%