INDEX
Explanations
violent actions or emotions
prepositions indicating location or position
New Auto-Interp
Negative Logits
rolet
-0.66
idays
-0.61
WITHOUT
-0.59
importantly
-0.58
PLEASE
-0.57
incumb
-0.57
released
-0.57
someday
-0.56
safely
-0.56
onym
-0.56
POSITIVE LOGITS
unison
1.37
front
1.16
lieu
1.14
animate
1.13
between
1.09
aud
1.05
ordinate
1.03
accordance
1.00
favor
1.00
effect
1.00
Activations Density 0.329%