INDEX
Explanations
words related to philosophical and theoretical concepts
phrases centered around foundational principles and arguments in reasoning and philosophy
New Auto-Interp
Negative Logits
agues
-0.73
оÐ
-0.70
enery
-0.70
soon
-0.70
а
-0.70
NPR
-0.69
LAN
-0.68
enium
-0.67
Netflix
-0.67
Crew
-0.67
POSITIVE LOGITS
notions
1.28
causation
1.24
distinguishing
1.13
asserting
1.11
assigning
1.10
invoking
1.09
recognizing
1.09
presupp
1.07
methodological
1.05
considerations
1.05
Activations Density 0.307%