INDEX
Explanations
the verb "do" occurring in various contexts
phrases indicating actions, decisions, or outcomes in various contexts
New Auto-Interp
Negative Logits
Coach
-0.67
Mages
-0.63
thous
-0.59
coach
-0.58
asso
-0.56
ascript
-0.55
charism
-0.54
puter
-0.54
Arena
-0.53
�
-0.53
POSITIVE LOGITS
pload
0.76
iw
0.75
roads
0.73
ggles
0.71
conom
0.70
rake
0.67
rive
0.66
wolves
0.66
Sov
0.66
lim
0.64
Activations Density 0.348%