Explanations
phrases indicating the concept of "room for improvement" or "space for change."
Negative Logits
.apple       -0.16
bung         -0.15
åij½         -0.15
Gol          -0.15
Wert         -0.14
Estate       -0.14
azu          -0.14
Morse        -0.14
papers       -0.14
duplicate    -0.14
Positive Logits
unan         0.16
argar        0.15
üt           0.14
tesy         0.14
esti         0.14
spacer       0.13
obi          0.13
pir          0.13
Y            0.13
adian        0.13
Activations Density: 0.084%