INDEX
Explanations
words or phrases in Cyrillic script
New Auto-Interp
Negative Logits
éĽħ
-0.15
usra
-0.14
GeneratedValue
-0.14
hay
-0.14
unkt
-0.14
assis
-0.14
amus
-0.14
ple
-0.14
azio
-0.14
dux
-0.13
POSITIVE LOGITS
twice
0.21
Twice
0.19
titles
0.16
seasons
0.15
cup
0.15
won
0.14
199
0.14
201
0.14
Roo
0.14
resents
0.14
Activations Density 0.026%