INDEX
Explanations
superlatives describing things as the best or the greatest
terms indicating comparisons of quality or ranking
New Auto-Interp
Negative Logits
onto
-0.76
gow
-0.76
heit
-0.76
matter
-0.74
otto
-0.71
brance
-0.70
aker
-0.69
livion
-0.67
rium
-0.66
undle
-0.66
POSITIVE LOGITS
pillars
0.94
professions
0.89
distingu
0.86
inventions
0.86
inspir
0.86
ways
0.82
recurring
0.80
examples
0.79
manifestations
0.78
avenues
0.77
Activations Density 0.119%