INDEX
Explanations
references to gymnastics
New Auto-Interp
Negative Logits
ypse
-0.17
paced
-0.17
itez
-0.17
ARING
-0.16
pole
-0.16
itaire
-0.16
poke
-0.16
igner
-0.16
kterého
-0.15
uels
-0.15
POSITIVE LOGITS
astics
0.42
nas
0.37
astic
0.32
ast
0.28
asts
0.27
kh
0.25
asi
0.24
nos
0.23
ternet
0.23
astically
0.22
Activations Density 0.007%