INDEX
Explanations
discussions related to beliefs, reasoning, and self-reflection
Negative Logits
ascus         -0.84
abouts        -0.79
earthqu       -0.77
awaited       -0.77
billed        -0.77
officially    -0.75
ported        -0.74
mailed        -0.74
scheduled     -0.73
eteen         -0.73
Positive Logits
Therefore      1.69
Hence          1.61
Whereas        1.58
Thus           1.53
Consequently   1.53
Ideally        1.52
Conversely     1.50
Otherwise      1.46
Often          1.44
Therefore      1.43
Activations Density: 1.827%