INDEX
Explanations
phrases indicating limitation or finality
the word "only" and its variations related to limitations or scarcity
Negative Logits
insula      -0.74
align       -0.68
axis        -0.64
anytime     -0.64
includ      -0.63
essler      -0.62
belie       -0.61
finder      -0.61
ahead       -0.60
stem        -0.60
Positive Logits
marginally   0.91
LIMITED      0.79
superficial  0.78
scratched    0.76
limited      0.74
ONE          0.74
Ń·           0.72
sporadic     0.72
minor        0.72
minimal      0.71
Activations Density: 0.073%