INDEX
Explanations
proper nouns, particularly names of athletes and sports teams
phrases related to sports events or activities
Negative Logits
âĵĺ         -0.81
ental       -0.71
"],"        -0.69
Submit      -0.68
Written     -0.65
Allah       -0.63
eros        -0.62
interven    -0.62
Cause       -0.61
anz         -0.60
Positive Logits
recent       0.81
resurg       0.76
sidel        0.75
replaced     0.74
controvers   0.73
inexper      0.73
resurrected  0.72
hindsight    0.69
bolstered    0.68
sandwic      0.67
Activations Density: 0.780%