INDEX
Explanations
references to artistic expression and creative works
New Auto-Interp
Negative Logits
istrovstvÃŃ
-0.15
dro
-0.15
Protective
-0.15
konkrét
-0.14
rid
-0.14
otch
-0.14
nam
-0.14
atic
-0.14
ixin
-0.14
iced
-0.14
POSITIVE LOGITS
Dy
0.19
aty
0.18
Dys
0.18
elow
0.18
dys
0.17
otyp
0.17
dy
0.17
ograf
0.17
yn
0.17
indy
0.16
Activations Density 0.065%