INDEX
Explanations
text indicating achievements or statistics in a sports context
sentences or complete thoughts related to sports statistics and achievements
New Auto-Interp
Negative Logits
responses
-0.88
izoph
-0.85
briefings
-0.83
compan
-0.79
metab
-0.79
manuals
-0.77
response
-0.77
censorship
-0.76
bindings
-0.76
orno
-0.76
POSITIVE LOGITS
Additionally
1.25
Previously
1.14
However
1.13
Congratulations
1.11
Prior
1.08
Despite
1.07
Played
1.06
His
1.04
With
1.02
Interestingly
1.02
Activations Density 0.242%