INDEX
Explanations
phrases related to progress or achievement
Negative Logits
wagen      -0.70
fert       -0.70
gad        -0.68
Afric      -0.67
bearer     -0.64
Beir       -0.63
Belg       -0.63
Springer   -0.63
Mous       -0.62
enegger    -0.62
Positive Logits
į          1.10
ª          1.06
ĸļ         1.04
¡          1.03
¹          1.02
¤          1.02
ij         1.01
£          1.01
Ĵ          1.00
Ķ          1.00
Activations Density: 0.113%