INDEX
Explanations
plays in a sports context, particularly focused on football
Negative Logits
Beir        -0.77
orpor       -0.66
izoph       -0.65
ink         -0.64
whisk       -0.61
Unic        -0.60
«ĺ          -0.60
filament    -0.59
Apost       -0.58
Reich       -0.57
Positive Logits
ername      1.22
maker       1.17
wright      1.12
makers      1.06
calling     1.04
making      1.03
plays       1.01
call        1.00
writing     0.96
offs        0.95
Activations Density 0.026%
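
The page does not define the density figure. A minimal sketch, assuming it is the percentage of sampled tokens on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and only illustrate that reading.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of active tokens, not a mean magnitude.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 10,000 tokens, of which 3 activate the feature.
acts = [0.0] * 10_000
acts[17] = 3.2
acts[4_200] = 0.9
acts[9_001] = 1.4

print(f"{activation_density(acts):.3f}%")
```

Under this reading, a density of 0.026% would correspond to roughly 26 active tokens per 100,000 sampled.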