Explanations
references to fine arts and craftsmanship
Negative Logits
Token     Logit
atee      -0.18
cribe     -0.16
abouts    -0.15
lant      -0.15
aupt      -0.15
ateurs    -0.15
doors     -0.15
tery      -0.14
ting      -0.14
wards     -0.14
Positive Logits
Token     Logit
tuned     0.28
tuning    0.27
fine      0.25
fine      0.24
tune      0.23
Fine      0.23
Tun       0.23
Fine      0.21
Tune      0.21
-gr       0.20
Activations Density: 0.021%