INDEX
Explanations
phrases related to improvement or enhancement
expressions of growth or progress
Negative Logits
cium        -0.63
ãĥĥãĥī      -0.62
Monstrous   -0.61
pper        -0.60
ãĤ©         -0.59
Loaded      -0.59
da          -0.58
cient       -0.58
leton       -0.57
ãĥ£         -0.57
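Several tokens in the list above (ãĥĥãĥī, ãĤ©, ãĥ£) look like the printable stand-ins that GPT-2 style byte-level BPE vocabularies use for raw bytes. The source does not say which tokenizer produced them, so the sketch below is only an assumption: it rebuilds the usual byte-to-character table and inverts it to recover the underlying text. The names `bytes_to_unicode`, `CHAR_TO_BYTE`, and `decode_token` are illustrative and not taken from the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable character table, as used by GPT-2 style byte-level BPE.

    ASSUMPTION: the dashboard's vocabulary follows this convention.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAD))
        + list(range(0xAE, 0x100))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are remapped to code points starting at 256.
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return dict(zip(byte_values, map(chr, code_points)))


# Inverse table: printable stand-in character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map each stand-in character back to its byte, then decode as UTF-8."""
    raw = bytearray()
    for ch in token:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            # Characters outside the table are kept as their own UTF-8 bytes.
            raw.extend(ch.encode("utf-8"))
    # Partial multi-byte sequences are shown as U+FFFD instead of raising.
    return bytes(raw).decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĥĥãĥī", "ãĤ©", "ãĥ£", "cium"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

If that assumption holds, the three unusual tokens decode to the katakana fragments ッド, ォ, and ャ, while ASCII tokens such as `cium` pass through unchanged.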
Positive Logits
improvement      1.08
ments            0.91
enhancement      0.87
improvements     0.87
ment             0.86
Improvement      0.83
responsiveness   0.79
undermin         0.79
Improvements     0.77
deterioration    0.77
Activations Density 0.018%