INDEX
Explanations
details related to daily life activities and situations
New Auto-Interp
Negative Logits
itionally
-0.76
atical
-0.75
advoc
-0.73
intermediate
-0.71
izoph
-0.71
separation
-0.71
iliated
-0.70
organis
-0.69
intending
-0.69
enrol
-0.68
POSITIVE LOGITS
But
1.18
Meanwhile
1.13
Alas
1.08
Yet
1.07
Whatever
1.05
Likewise
1.04
Worse
1.04
Anyway
1.02
Thankfully
1.02
Fortunately
1.01
Activations Density 0.660%