INDEX
Explanations
phrases that indicate exceptions or conditions
New Auto-Interp
Negative Logits
både
-0.19
iliar
-0.16
BOTH
-0.15
’autres
-0.15
nejen
-0.15
both
-0.15
ãģªãģı
-0.15
lasses
-0.15
uctose
-0.14
866
-0.14
POSITIVE LOGITS
maybe
0.30
occasional
0.29
perhaps
0.28
maybe
0.26
few
0.24
minor
0.24
occasionally
0.23
possibly
0.23
perhaps
0.23
Maybe
0.22
Activations Density 0.110%