INDEX
Explanations
words related to specific activities or states of being
New Auto-Interp
Negative Logits
ccione
-0.16
vox
-0.16
Punch
-0.15
unner
-0.15
undan
-0.15
Oswald
-0.14
illet
-0.14
Crunch
-0.14
egrator
-0.14
анка
-0.14
POSITIVE LOGITS
inburgh
0.15
-spinner
0.15
aler
0.14
luck
0.14
l
0.14
X
0.13
lá
0.13
luet
0.13
chai
0.13
ennon
0.13
Activations Density 0.093%