Explanations
the verb "gave" followed by a direct object
instances of the verb "gave."
Negative Logits
eer            -0.58
enz            -0.57
BUS            -0.56
domestic       -0.54
multi          -0.54
arak           -0.53
oneself        -0.53
inline         -0.52
international  -0.52
uty            -0.51
Positive Logits
gave           3.03
gives          1.88
threw          1.84
took           1.75
blew           1.66
drew           1.62
froze          1.57
showed         1.56
wore           1.52
drove          1.50
Activations Density: 0.011%