INDEX
Explanations
form variations of the verb "work."
New Auto-Interp
Negative Logits
als
-0.16
uir
-0.16
/up
-0.15
xt
-0.15
uet
-0.14
oria
-0.14
Cres
-0.14
ars
-0.14
antha
-0.14
rek
-0.14
POSITIVE LOGITS
bench
0.19
manship
0.18
stations
0.18
åĿĬ
0.17
harder
0.15
aday
0.15
nehmer
0.15
spaces
0.15
loads
0.15
wonders
0.14
Activations Density 0.053%