INDEX
Explanations
expressions of personal opinion or sentiment related to sports
New Auto-Interp
Negative Logits
δί
-0.17
ACHI
-0.15
ueba
-0.14
AIM
-0.14
ittest
-0.14
experience
-0.13
isay
-0.13
defgroup
-0.13
Guess
-0.13
Pap
-0.13
POSITIVE LOGITS
zzo
0.23
root
0.23
Root
0.22
root
0.22
handic
0.22
ROOT
0.21
penc
0.20
grade
0.19
ROOT
0.19
pencil
0.19
Activations Density 0.096%