INDEX
Explanations
phrases related to sports and competition
phrases indicating duration or recurrent events
New Auto-Interp
Negative Logits
\\\\\\\\
-0.62
:(
-0.57
Mub
-0.57
.�
-0.56
.")
-0.55
looph
-0.55
Adin
-0.54
[];
-0.54
Ö¼
-0.54
eve
-0.54
POSITIVE LOGITS
respectively
0.87
etc
0.82
?,
0.80
etheless
0.69
remains
0.67
odan
0.66
becomes
0.65
?),
0.64
seems
0.63
thing
0.60
Activations Density 0.934%