INDEX
Explanations
phrases indicating frequency or common occurrence
phrases that emphasize frequency or habitual actions
New Auto-Interp
Negative Logits
utenberg
-0.72
anth
-0.70
Achievement
-0.70
vich
-0.69
otiation
-0.68
ajor
-0.68
sure
-0.67
atis
-0.67
ocracy
-0.67
atsu
-0.67
POSITIVE LOGITS
entimes
1.62
overlooked
1.32
times
1.23
times
1.22
misunderstood
1.09
mistaken
1.07
referred
1.06
errone
1.03
mistakenly
1.01
underestimated
1.00
Activations Density 0.058%