INDEX
Explanations
words related to change and improvement in various contexts
New Auto-Interp
Negative Logits
ARGER
-0.16
slightly
-0.15
лим
-0.15
zk
-0.15
ấn
-0.15
imore
-0.14
unequal
-0.14
%č↵
-0.13
Larger
-0.13
Goldberg
-0.13
POSITIVE LOGITS
significant
0.61
significant
0.52
dramatic
0.52
Significant
0.49
substantial
0.45
Dram
0.45
signific
0.45
drastic
0.44
знаÑĩ
0.40
considerable
0.40
Activations Density 0.540%