INDEX
Explanations
numerical and identifier data, particularly dates and specific brands or products
New Auto-Interp
Negative Logits
orners
-0.21
lio
-0.17
Knock
-0.15
iÃŃ
-0.15
aleb
-0.15
amework
-0.15
jadx
-0.15
iele
-0.15
synth
-0.15
ghest
-0.15
POSITIVE LOGITS
tone
0.15
lit
0.15
able
0.15
ote
0.15
eca
0.14
lastic
0.14
íĽ
0.14
ROTO
0.14
ecs
0.14
tty
0.13
Activations Density 0.616%