INDEX
Explanations
verbs related to actions and modifications
New Auto-Interp
Negative Logits
listed
-0.69
misled
-0.67
awar
-0.64
enery
-0.64
nor
-0.63
DOWN
-0.62
toured
-0.61
wordpress
-0.61
grain
-0.60
ice
-0.60
POSITIVE LOGITS
livion
0.92
manageable
0.88
accommodate
0.83
simpler
0.81
othy
0.75
mush
0.75
something
0.73
safer
0.72
adulthood
0.69
resemble
0.66
Activations Density 2.419%