INDEX
Explanations
concepts related to improvements or enhancements
mentions of improvements or advancements in various contexts
New Auto-Interp
Negative Logits
ãĥĥãĥī
-0.73
zh
-0.68
zer
-0.66
azine
-0.64
thur
-0.64
mberg
-0.62
cium
-0.62
zona
-0.62
chi
-0.61
ãĥ£
-0.61
POSITIVE LOGITS
improvement
0.78
xual
0.77
ments
0.75
ment
0.74
Score
0.70
iants
0.68
attainment
0.67
agre
0.66
undown
0.66
>>\
0.65
Activations Density 0.032%