INDEX
Explanations
terms related to scientific disciplines and their intersection with philosophy
New Auto-Interp
Negative Logits
cott
-0.16
inction
-0.15
icers
-0.15
ftar
-0.15
iei
-0.15
ugo
-0.14
owards
-0.14
éĥİ
-0.14
etro
-0.14
ctic
-0.14
POSITIVE LOGITS
lou
0.19
Hin
0.15
exact
0.15
wc
0.15
Nou
0.14
arak
0.14
wholes
0.14
lund
0.14
react
0.13
åŁŁ
0.13
Activations Density 0.133%