INDEX
Explanations
aspects related to physical interaction or manipulation of objects
Negative Logits
wcsstore     -0.68
lied         -0.60
predic       -0.60
ashington    -0.60
express      -0.58
20439        -0.58
eq           -0.58
predec       -0.57
mask         -0.57
subscribe    -0.57
Positive Logits
them         1.15
him          1.01
THEM         0.95
us           0.91
it           0.90
everyone     0.90
everything   0.90
your         0.88
everybody    0.88
Them         0.87
Activations Density 1.166%