Explanations
references to sporting events and achievements
Negative Logits
ernal         -0.17
antages       -0.16
immel         -0.15
adius         -0.14
afort         -0.14
ghi           -0.14
toi           -0.14
ensen         -0.14
ursed         -0.13
compressed    -0.13
Positive Logits
front         0.26
front         0.26
attack        0.24
-form         0.23
-front        0.23
stop          0.20
style         0.19
/styles       0.18
what          0.17
defence       0.17
Activations Density 0.047%